Archive
Discover and discuss technology tools
Explore the Tiscuss archive by category or keyword, then jump into conversations around what matters most.
StructOCR: AI API for Parsing Passports, Invoices, Containers
Unlocking Efficiency: StructOCR for Automated Document Parsing In an era where digital transformation is pivotal, automated document parsing stands out as a gam…
Two Ex-Goldman, Meta Founders Build Voice AI for Africa, Middle East
The startup's own stack for Africa and Middle East is now handling more than 17,000 calls per day.
Hands-Free Voice Interaction with Open-LLM-VTuber
Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D taking face running locally across platforms
Knotch: Hub-and-Spoke Voice Agent for AI Tools
Unlocking AI Synergy: Knotch Hub and Spoke Voice Agent for AI Tools Introduction to Knotch Knotch is revolutionizing how multiple AI tools communicate and colla…
Self-Hosted AI Companion: Moeru-AI/Airi for Gaming and Chat
💖🧸 Self hosted, you-owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minecraft, Factorio playing. Web / macOS / Windows supported.
Google Unveils Audio-Powered Smart Glasses at IO 2026
Google is calling the new devices "audio glasses," in that users will be able to issue verbal commands to them and get things done via its ecosystem of apps and services, including Gemini.
Discord Adds End-to-End Encryption for Voice and Video Calls
Good news! Discord's hundreds of millions of users now have their communications scrambled, so not even Discord can see them.
Google IO 2026: Talk to Your Gmail Inbox with AI Voice Search
Google expands Gmail’s AI Inbox with conversational voice search, letting users ask Gemini to find buried email details.
Apple's Siri Update Focuses on Enhanced Privacy
Privacy will be a major theme when Apple unveils a new version of Siri.
DreamServer: Local AI Inference and Workflows for Everyone
Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.
Dramabox AI Tool: Revolutionizing Voice Cloning with Hugging Face
Dramabox AI Tool: Revolutionizing Voice Cloning with Hugging Face The Dramabox AI tool, integrated with Hugging Face, is transforming the landscape of voice clo…
Amazon Launches AI Shopping Assistant Alexa for Shopping
Alexa for Shopping offers a voice- and touch-enabled shopping experience across mobile, desktop, and Echo Show smart displays. Alexa for Shopping provides more personalized recommendations and automates the shopping experience across Amazon and other online retailers.
Wispr Flow's Hinglish Voice AI Gains Traction in India
Wispr Flow says growth accelerated in India after its Hinglish rollout, even as voice AI products continue to face challenges.
How AI Will Transform Office Communication
How will work setups change if we spend more and more time talking to our computers?
Top AI Dictation Apps Tested and Ranked
AI-powered dictation apps are useful for replying to emails, taking notes, and even coding through your voice
AI Skill Files: Warm Starts for Claude and Gemini Sessions
One thing that frustrates me about most AI workflows is the cold start problem. Every new session you re-explain your business, your voice, your clients. I started solving this with skill files. A skill file is a markdown document you upload to a Claude Project or paste into a Gemini Gem. It holds your context permanently so you never re-explain anything. The three I use most: brand-voice.md: defines tone, writing rules, and platform-specific formatting client-router.md: when you say a client name, Claude loads their full project context automatically seo-aeo-audit-checklist.md: structured audit that scores any website out of 100 across 7 sections including AI search visibility Anyone else using a similar system? Curious what context you keep persistent across sessions.
Voice Agents: 24/7 AI Voice Agents for Client Support
Turn expertise into 24/7 client-facing AI voice agents
VoiceGoat: Practice LLM Attacks with Vulnerable Voice Agent
VoiceGoat: Enhance LLM Security with a Voice Assistant Lab VoiceGoat provides a secure and controlled environment to test and practice Large Language Model (LLL…
Self-Taught Developer from Bahrain Launches Multi-Model AI Platform
https://reddit.com/link/1sxotqx/video/xlaqd9i8guxg1/player I'm a self-taught developer, 39 years old, based in Bahrain. Four months ago I started building AskSary - a multi-model AI platform with a persistent memory layer that sits above all the models. The core idea: the model is not the identity. Most AI tools lose your context the moment you switch models. I built the layer that remembers you across all of them. Here's what's shipped so far: **Models & Routing** Every major model in one place - GPT-5.2, Claude Sonnet 4.6, Grok 4, Gemini 3.1 Pro, DeepSeek R1, O1 Reasoning, Gemini Ultra and more - with smart auto-routing or manual override. **Memory & Context** Persistent cross-model memory. Start with Claude on your phone, switch to GPT on your laptop - it already knows what you discussed. Proactive personalisation that messages you first on login before you've typed a word. **Integrations** Google Drive and Notion - connect once, pull files and pages directly into chat or your RAG Knowledge Base. Unlimited uploads up to 500MB per file via OpenAI Vector Store. **Video Analysis** \- Gemini native video understanding for YouTube URL analysis (no download required, processed natively) and direct file upload up to 500MB. Full breakdown of visuals, audio, dialogue, editing style and key moments. **Generation** Image generation and editing, video studio across Luma, Veo and Kling, music generation via ElevenLabs, video analysis via upload or YouTube URL. **Builder Tools** Vision to Code, Web Architect, Game Engine, Code Lab with SQL Architect, Bug Buster, Git Guru and more. Tavily web search across all models. **Voice & Audio** Real-time 2-way voice chat at near-zero latency, AI podcast mode downloadable as MP3, Voiceover, Voice Notes, Voice Tuner. **Platform** Custom agents, 30+ live interactive themes, smart search, media gallery, folder organisation, full RTL support across 26 languages, iOS and Android apps, Apple Vision Pro. **Where it is now** 129 countries. Currently at 40 new signups a day. 1080 Signup's so far after 4 weeks or so. MRR just started. Zero ad spend. All of it built solo, one feature at a time, on a balcony in Bahrain. **The Stack:** Frontend - Next.js, Capacitor (iOS and Android) and Vanilla JS / React Backend - Vercel serverless functions, Firebase / Firestore (database + auth) and Firebase Admin SDK AI Models - OpenAI (GPT, GPT-Image-1), Anthropic (Claude), Google (Gemini), xAI (Grok), DeepSeek Generation APIs - Luma AI (video), Kling via Replicate (video), Veo via Replicate (video), ElevenLabs (music), Flux via Replicate (image editing), Meshy (3D — coming soon) Integrations - Google Drive (OAuth 2.0), Notion (OAuth 2.0), Tavily (web search), OpenAI Vector Store (RAG), Stripe (payments), CloudConvert (document conversion), Sentry (error tracking), Formidable (file handling) Rendering - Mermaid (flow charts) and MathJax Platforms - Web, iOS, Android, Apple Vision Pro (visionOS) Languages - 26 UI languages with full RTL support [asksary.com](http://asksary.com) Happy to answer questions on any part of the build - stack, architecture, API cost management, anything.
Durable.co: AI Platform for Rapid Website, Brand, and Invoice Creation
AI-driven platform for rapid website, brand, and invoice creation.
Parlor Jarvis: Real-Time Multilingual Voice AI Tool
Parlor Jarvis: Revolutionizing Multilingual Voice Output with Real Time AI In today's interconnected world, effective communication across languages is more cru…
Parlor Jarvis: Real-Time AI with Audio, Screen Input & Voice Output
Realtime AI Translation with Parlor Jarvis: Revolutionizing Multilingual Communication Parlor Jarvis is an innovative AI tool designed to bridge language barrie…
Parlor Jarvis: Real-Time Multilingual Voice AI Tool
Parlor Jarvis: Revolutionizing Multilingual Voice Output with Real Time AI In today's interconnected world, effective communication across languages is more cru…
Parlor Jarvis: Real-Time AI with Audio, Screen Input & Voice Output
Realtime AI Translation with Parlor Jarvis: Revolutionizing Multilingual Communication Parlor Jarvis is an innovative AI tool designed to bridge language barrie…