Archive

Discover and discuss technology tools

Explore the Tiscuss archive by category or keyword, then jump into conversations around what matters most.

Active: any category / query: Benchmarks / page 1 of 1 / 7 total
AI Infrastructure

Fast Local LLM Inference Benchmarks and Deployment Tips

Community benchmarks and infra recommendations for local models.

Global · Developers · Jun 23, 2026
AI Tools

Rscrypto: Rust's Leading Crypto Library with Benchmarks

Rscrypto stands out as a top-tier cryptography library in the Rust programming ecosystem.…

Global · Developers · Jun 4, 2026
AI Infrastructure

Open-Source Infrastructure for AI Desktop Agents

Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).

Global · Developers · May 13, 2026
AI Tools

AI Coding Agents: Persistent Memory Benchmarks

#1 Persistent memory for AI coding agents based on real-world benchmarks

Global · Developers · May 10, 2026
AI Writing

Google's Deep Research Max: Autonomous Research Agent for Expert Repor

Google quietly dropped something interesting last week. They updated their Deep Research agent (available via the Gemini API) and introduced a "Max" tier built on Gemini 3.1 Pro.

What it actually does: you give it a topic, it autonomously searches the web (and your private data via MCP), reasons over the sources, and produces a fully cited, professional-grade report, including native charts and infographics.

Two modes:

Deep Research: faster, lower latency, good for real-time user-facing apps.
Deep Research Max: uses extended compute, iterates more, and is designed for background/async jobs (think: a nightly cron job that generates due diligence reports for analysts by morning).

The MCP support is the most interesting part to me. You can point it at proprietary data sources (financial feeds, internal databases) and it treats them as just another searchable context. They're already working with FactSet, S&P Global and PitchBook on this.

Benchmarks show a significant jump in retrieval and reasoning vs. the December preview. They also claim it now draws from SEC filings and peer-reviewed journals and handles conflicting evidence better.

So what do you think: is it just another attempt, or a game changer? 😅

Global · Enterprises · Apr 29, 2026
AI Infrastructure

Arc Gate: AI Tool Achieves Perfect Safety Benchmarks

Benchmarked on 40 out-of-distribution prompts: indirect requests, roleplay framings, hypothetical scenarios, technical phrasings. The stuff that slips past everything else.

Arc Gate: P=1.00, R=1.00, F1=1.00
OpenAI Moderation API: P=1.00, R=0.75, F1=0.86
LlamaGuard 3 8B: P=1.00, R=0.55, F1=0.71

Zero false positives. Zero misses. Blocked prompts average 329ms and never reach your model. Detection overhead is ~350ms on top of your normal upstream latency.

Sits in front of any OpenAI-compatible endpoint. No GPU on your side. One env var to configure.

GitHub: https://github.com/9hannahnine-jpg/arc-gate
Live dashboard: https://web-production-6e47f.up.railway.app/dashboard

Happy to answer questions.
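
For readers who want to sanity-check the numbers, the F1 values quoted above can be recomputed from the reported precision (P) and recall (R) as their harmonic mean. This is only a minimal sketch: the helper name `f1_score` and the `reported` dictionary are illustrative and not part of Arc Gate, and the inputs are simply the P and R figures copied from the post.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; returns 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall) pairs copied from the post; nothing here is re-measured.
reported = {
    "Arc Gate": (1.00, 1.00),
    "OpenAI Moderation API": (1.00, 0.75),
    "LlamaGuard 3 8B": (1.00, 0.55),
}

# Round to two decimals to match the precision used in the post.
f1_by_system = {
    name: round(f1_score(p, r), 2) for name, (p, r) in reported.items()
}

for name, f1 in f1_by_system.items():
    print(f"{name}: F1={f1:.2f}")
```

With precision fixed at 1.00, the F1 differences between the three systems come entirely from recall, that is, from how many of the harmful prompts each one missed.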

Global · Developers · Apr 28, 2026
AI Infrastructure

Open-Source AI Infrastructure for Desktop Control

Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).

Global · Developers · Apr 26, 2026