Discover and discuss technology tools

Explore the Tiscuss archive by category or keyword, then jump into conversations around what matters most.

Search and filters

AI Audio AI Design AI Framework AI Infrastructure AI Marketing AI News AI Productivity AI Search AI Security AI Tools AI Video AI Writing

Active: AI Infrastructure / query: LM / page 1 of 1 / 18 total

AI Infrastructure

GLM-5.2-Colibri-INT4: Efficient AI Model for Infrastructure

GLM 5.2 Colibri INT4: Revolutionizing AI in Infrastructure Management The GLM 5.2 Colibri INT4 model stands out as a trailblazer in the realm of AI driven infra…

Global · Developers · Jul 19, 2026

AI Infrastructure

TinyAgents: Rust-Based Recursive LLM Harness for AI Infrastructure

TinyAgents: Revolutionizing AI Infrastructure with Rust Based Recursive LLM Integration TinyAgents, a cutting edge solution in AI infrastructure, leverages Rust…

Global · Developers · Jun 30, 2026

AI Infrastructure

NVIDIA GLM-5.2-NVFP4: Revolutionizing AI Infrastructure

NVIDIA GLM 5.2 NVFP4: Pioneering the Future of AI Infrastructure NVIDIA's recent unveiling of GLM 5.2 NVFP4 signifies a transformative leap in artificial intell…

Global · Developers · Jun 30, 2026

AI Infrastructure

Fast Local LLM Inference Benchmarks and Deployment Tips

Community benchmarks and infra recommendations for local models.

Global · Developers · Jun 23, 2026

AI Infrastructure

Airbnb CEO Brian Chesky to Launch New AI Lab

The Airbnb CEO said last year it hasn't struck an LLM partnership because existing products weren't quite ready.

Global · General · Jun 5, 2026

AI Infrastructure

Mnemo: Local-First AI Memory Layer for LLMs

Mnemo: AI Memory Layer for Local First LLMs Mnemo is an innovative AI memory layer designed to enhance the performance of Local First Language Learning Models (…

Global · Developers · Jun 4, 2026

AI Infrastructure

AirLLM 70B Runs on 4GB GPU: AI Breakthrough

AirLLM 70B inference with single 4GB GPU

Global · Developers · Jun 4, 2026

AI Infrastructure

Tiny-vLLM: High-Performance LLM Inference in C++ and CUDA

Tiny vLLM: Revolutionizing High Performance LLM Inference Tiny vLLM stands at the forefront of high performance inference for large language models (LLMs), desi…

Global · Developers · May 30, 2026

AI Infrastructure

Train Your LLM from Scratch: A Step-by-Step Guide

A straightforward method for training your LLM, from downloading data to generating text.

Global · Developers · May 30, 2026

AI Infrastructure

Llama.cpp: Efficient LLM Inference in C/C++ on GitHub

LLM inference in C/C++

Global · Developers · May 19, 2026

AI Infrastructure

Cerebras Systems: The AI Chip Startup That Almost Failed

Cerebras Systems was 2026's biggest tech IPO so far. But years ago, it burned through hundreds of millions working on a chip many believed impossible.

Global · General · May 17, 2026

AI Infrastructure

Cerebras IPO: Benchmark's Billion-Dollar Bet on AI Hardware

Benchmark almost never backs hardware startups. So Eric Vishria dragged his feet 10 years ago before agreeing to hear Cerebras' pitch.

Global · Founders · May 16, 2026

AI Infrastructure

Agentic AI Infrastructure for Enhancing Human Capabilities

Agentic AI Infrastructure for magnifying HUMAN capabilities.

Global · Developers · May 13, 2026

AI Infrastructure

Rotato: Node.js Proxy Rotates LLM API Keys on 429 Errors

Streamlining API Management with Rotato: A Node.js Proxy for LLM API Key Rotation In the fast paced world of software development, managing API keys efficiently…

Global · Developers · May 3, 2026

AI Infrastructure

Track Real-Time GPU & LLM Pricing Across Cloud Providers

Deploybase is a dashboard for tracking real-time GPU and LLM pricing across cloud and inference providers. You can view performance stats and pricing history, compare side by side, and bookmark to track any changes. https://deploybase.ai

Global · Enterprises · Apr 30, 2026

AI Infrastructure

Arc Gate: AI Tool Achieves Perfect Safety Benchmarks

Benchmarked on 40 out-of-distribution prompts, indirect requests, roleplay framings, hypothetical scenarios, technical phrasings. The stuff that slips past everything else. Arc Gate: P=1.00, R=1.00, F1=1.00 OpenAI Moderation API: P=1.00, R=0.75, F1=0.86 LlamaGuard 3 8B: P=1.00, R=0.55, F1=0.71 Zero false positives. Zero misses. Blocked prompts average 329ms and never reach your model. Detection overhead is \~350ms on top of your normal upstream latency. Sits in front of any OpenAI-compatible endpoint. No GPU on your side. One env var to configure. GitHub: https://github.com/9hannahnine-jpg/arc-gate Live dashboard: https://web-production-6e47f.up.railway.app/dashboard Happy to answer questions.

Global · Developers · Apr 28, 2026

AI Infrastructure

Caliber: Open-Source Proxy for Enforcing LLM Agent Rules

Cross-posting here because this problem affects everyone building with AI agents. Prompt-based guardrails fail. The model follows your system prompt in a demo, then ignores rules when context gets big or the agent chains multiple steps. We built Caliber - an open-source proxy that reads your rules from plain markdown and enforces them at the API layer, not in the prompt. Every call. Provider-agnostic. Just hit 700 GitHub stars ⭐ and nearly 100 forks - the reception from devs building with AI has been amazing. Repo: [https://github.com/caliber-ai-org/ai-setup](https://github.com/caliber-ai-org/ai-setup) Would love: \- Feedback on the approach \- Feature requests from people building AI agents \- Anyone who wants to contribute to the project Building this open-source for the community.

Global · Developers · Apr 27, 2026

AI Infrastructure

Deploying Local LLMs in Production: Best Practices

Discussion thread on infra, latency, and operational best practices.

Global · Developers · Apr 26, 2026

PreviousPage 1 / 1Next