Archive

Discover and discuss technology tools

Explore the Tiscuss archive by category or keyword, then jump into conversations around what matters most.

Search and filters
Reset
Active: any category / query: OpenAI / page 2 of 2 / 52 total
AI Tools

Arc Sentry: Advanced Prompt Injection Detector for LLMs

Been working on Arc Sentry, a whitebox prompt injection detector for self-hosted LLMs (Mistral, Llama, Qwen). Most detectors pattern-match on known attack phrases. Arc Sentry watches what the prompt does to the model’s internal representation instead, so it catches indirect, hypothetical, and roleplay-framed attacks that get through keyword filters. Benchmark on indirect/roleplay/technical prompts (40 OOD prompts): • Arc Sentry: Recall 0.80, F1 0.84 • OpenAI Moderation API: Recall 0.75, F1 0.86 • LlamaGuard 3 8B: Recall 0.55, F1 0.71 Arc Sentry has the highest recall — it catches more of the hard cases. Blocks before model.generate() is called. The lightweight pre-filter runs on CPU with no model access. pip install arc-sentry GitHub: https://github.com/9hannahnine-jpg/arc-sentry Happy to answer questions about how it works.

Global · Developers · Apr 27, 2026
AI Tools

OpenAI CEO Apologizes to Tumbler Ridge for Mass Shooting Oversight

In a letter to the residents of Tumbler Ridge, Canada, OpenAI CEO Sam Altman said he is “deeply sorry” that his company failed to alert law enforcement about the suspect in a recent mass shooting.

US/CA/AU · General · Apr 26, 2026
PreviousPage 2 / 2Next