Discover and discuss technology tools

Explore the Tiscuss archive by category or keyword, then jump into conversations around what matters most.

Active: any category / query: voice / page 1 of 1 / 49 total

AI Audio

Voicebox: Open-Source AI Voice Studio for Cloning and Creation

The open-source AI voice studio. Clone, dictate, create.

Global · Developers · Jun 23, 2026

AI Video

AI Tool Dubs Videos in 40 Languages Using Original Voice

Revolutionizing Video Content with AI Driven Dubbing: 40 Languages, Original Voice In the rapidly evolving world of digital content, the ability to reach a glob…

Global · General · Jun 7, 2026

AI Tools

StructOCR: AI API for Parsing Passports, Invoices, Containers

Unlocking Efficiency: StructOCR for Automated Document Parsing In an era where digital transformation is pivotal, automated document parsing stands out as a gam…

Global · General · Jun 6, 2026

AI Audio

MisoTTS: Revolutionizing AI Audio with Hugging Face

MisoTTS, an innovative text to speech (TTS) model, is making waves in the realm of artificial intelligence. …

Global · Developers · Jun 5, 2026

AI Tools

Two Ex-Goldman, Meta Founders Build Voice AI for Africa, Middle East

The startup's own stack for Africa and Middle East is now handling more than 17,000 calls per day.

Other · General · Jun 4, 2026

AI Tools

Hands-Free Voice Interaction with Open-LLM-VTuber

Talk to any LLM with hands-free voice interaction, voice interruption, and a Live2D talking face, running locally across platforms.

Global · General · Jun 4, 2026

AI Tools

Knotch: Hub-and-Spoke Voice Agent for AI Tools

Unlocking AI Synergy: Knotch Hub and Spoke Voice Agent for AI Tools Introduction to Knotch Knotch is revolutionizing how multiple AI tools communicate and colla…

Global · General · Jun 2, 2026

AI Audio

VoxCPM2: Multilingual Speech Generation and Voice Cloning

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

Global · Developers · May 30, 2026

AI Audio

OpenMOSS/MOSS-TTS: Revolutionizing AI Audio with High-Fidelity Speech

MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.

Global · Developers · May 28, 2026

AI Tools

Self-Hosted AI Companion: Moeru-AI/Airi for Gaming and Chat

💖🧸 Self-hosted, you-owned Grok Companion, a container for the souls of waifus and cyber beings, bringing them into our world and aiming to reach Neuro-sama's level. Capable of real-time voice chat and of playing Minecraft and Factorio. Web / macOS / Windows supported.

Global · General · May 26, 2026

AI Audio

AI Reconstructs Dead Pilots' Voices, NTSB Temporarily Blocks Access

People used AI on a spectrogram image of cockpit recordings to reconstruct them, forcing the NTSB to temporarily block access to its docket system.

Global · General · May 23, 2026

AI Tools

Google Unveils Audio-Powered Smart Glasses at I/O 2026

Google is calling the new devices "audio glasses," in that users will be able to issue verbal commands to them and get things done via its ecosystem of apps and services, including Gemini.

Global · General · May 20, 2026

AI Tools

Discord Adds End-to-End Encryption for Voice and Video Calls

Good news! Discord's hundreds of millions of users now have their communications scrambled, so not even Discord can see them.

Global · General · May 20, 2026

AI Tools

Google I/O 2026: Talk to Your Gmail Inbox with AI Voice Search

Google expands Gmail’s AI Inbox with conversational voice search, letting users ask Gemini to find buried email details.

Global · General · May 20, 2026

AI Productivity

Google Adds Voice Prompts to Docs and Keep in Workspace Update

Google is letting users create drafts, take notes, and search for email by voice in the new Workspace update.

Global · General · May 19, 2026

AI Productivity

Google's AI Enhances Gmail with Voice Search

Google expands Gmail’s AI Inbox with conversational voice search, letting users ask Gemini to find buried email details.

Global · General · May 19, 2026

AI Tools

Apple's Siri Update Focuses on Enhanced Privacy

Privacy will be a major theme when Apple unveils a new version of Siri.

Global · General · May 18, 2026

AI Tools

DreamServer: Local AI Inference and Workflows for Everyone

Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.

Global · General · May 18, 2026

AI Audio

Open Source Voice Agent Platform: Dograh HQ

Open Source Voice Agent Platform

Global · Developers · May 18, 2026

AI Audio

Aratako's Irodori-TTS-500M-v3: Advanced AI Audio Tool

Aratako's Irodori TTS 500M v3 represents a cutting edge advancement in AI driven audio tools. This sophist…

Global · Developers · May 17, 2026

AI Tools

Dramabox AI Tool: Revolutionizing Voice Cloning with Hugging Face

The Dramabox AI tool, integrated with Hugging Face, is transforming the landscape of voice clo…

Global · General · May 16, 2026

AI Tools

Amazon Launches AI Shopping Assistant Alexa for Shopping

Alexa for Shopping offers a voice- and touch-enabled shopping experience across mobile, desktop, and Echo Show smart displays. Alexa for Shopping provides more personalized recommendations and automates the shopping experience across Amazon and other online retailers.

Global · General · May 14, 2026

AI Tools

Wispr Flow's Hinglish Voice AI Gains Traction in India

Wispr Flow says growth accelerated in India after its Hinglish rollout, even as voice AI products continue to face challenges.

Asia · General · May 11, 2026

AI Tools

How AI Will Transform Office Communication

How will work setups change if we spend more and more time talking to our computers?

Global · General · May 11, 2026

AI Audio

Supertone's Supertonic-3: Revolutionizing AI Audio

Supertone's latest innovation, the Supertonic 3, is transforming the landscape of artificial intelligence dri…

Global · General · May 10, 2026

AI Audio

AI Audio: Sarashina 2.2 TTS by Sbintuitions on Hugging Face

AI Audio: Harnessing Sarashina 2.2 TTS by Sbintuitions on Hugging Face Overview Sarashina 2.2 TTS, developed by Sbintuitions and hosted on Hugging Face, stands …

Global · General · May 3, 2026

AI Tools

Top AI Dictation Apps Tested and Ranked

AI-powered dictation apps are useful for replying to emails, taking notes, and even coding by voice.

Global · General · May 3, 2026

AI Audio

Microsoft's VibeVoice-ASR: Revolutionizing AI Audio Recognition

Microsoft's VibeVoice ASR is a pioneering solution in the realm of AI driven audio recognition, …

Global · Developers · May 1, 2026

AI Tools

AI Skill Files: Warm Starts for Claude and Gemini Sessions

One thing that frustrates me about most AI workflows is the cold start problem. Every new session you re-explain your business, your voice, your clients. I started solving this with skill files. A skill file is a markdown document you upload to a Claude Project or paste into a Gemini Gem. It holds your context permanently so you never re-explain anything.

The three I use most:

brand-voice.md: defines tone, writing rules, and platform-specific formatting
client-router.md: when you say a client name, Claude loads their full project context automatically
seo-aeo-audit-checklist.md: structured audit that scores any website out of 100 across 7 sections including AI search visibility

Anyone else using a similar system? Curious what context you keep persistent across sessions.
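The skill-file pattern described in this post can be sketched in a few lines of Python. This is only an illustrative sketch, not the poster's setup: the `build_context` helper, the `SKILLS` dictionary standing in for uploaded markdown files, the `CLIENT_CONTEXT` data, and the name-matching rule are all assumptions. Claude Projects and Gemini Gems manage uploaded files themselves; the code only shows the general idea of prepending persistent context to each new session.

```python
# Hypothetical sketch: persistent "skill files" prepended to every new session.
# File names follow the post; the contents and helper names are invented.

SKILLS = {
    "brand-voice.md": "Tone: plain and direct. No exclamation marks.",
    "seo-aeo-audit-checklist.md": "Score each of 7 sections, total out of 100.",
}

# Stand-in for client-router.md: client name -> project context (made-up data).
CLIENT_CONTEXT = {
    "Acme": "Acme sells industrial fasteners; audience is procurement managers.",
}


def build_context(user_message: str) -> str:
    """Return the text to send at the start of a session."""
    parts = [f"## {name}\n{body}" for name, body in SKILLS.items()]
    # Load a client's context only when their name appears in the message.
    for client, context in CLIENT_CONTEXT.items():
        if client.lower() in user_message.lower():
            parts.append(f"## client: {client}\n{context}")
    parts.append(f"## request\n{user_message}")
    return "\n\n".join(parts)


if __name__ == "__main__":
    print(build_context("Draft a LinkedIn post for Acme"))
```

Keeping the client lookup separate from the always-on skills means a session only carries the client context it actually needs.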

Global · General · Apr 30, 2026

AI Tools

Voice Agents: 24/7 AI Voice Agents for Client Support

Turn expertise into 24/7 client-facing AI voice agents

Global · Enterprises · Apr 29, 2026

AI Tools

VoiceGoat: Practice LLM Attacks with Vulnerable Voice Agent

VoiceGoat: Enhance LLM Security with a Voice Assistant Lab VoiceGoat provides a secure and controlled environment to test and practice Large Language Model (LLM…

Global · General · Apr 28, 2026

AI Tools

Self-Taught Developer from Bahrain Launches Multi-Model AI Platform

https://reddit.com/link/1sxotqx/video/xlaqd9i8guxg1/player

I'm a self-taught developer, 39 years old, based in Bahrain. Four months ago I started building AskSary - a multi-model AI platform with a persistent memory layer that sits above all the models. The core idea: the model is not the identity. Most AI tools lose your context the moment you switch models. I built the layer that remembers you across all of them.

Here's what's shipped so far:

**Models & Routing** Every major model in one place - GPT-5.2, Claude Sonnet 4.6, Grok 4, Gemini 3.1 Pro, DeepSeek R1, O1 Reasoning, Gemini Ultra and more - with smart auto-routing or manual override.

**Memory & Context** Persistent cross-model memory. Start with Claude on your phone, switch to GPT on your laptop - it already knows what you discussed. Proactive personalisation that messages you first on login before you've typed a word.

**Integrations** Google Drive and Notion - connect once, pull files and pages directly into chat or your RAG Knowledge Base. Unlimited uploads up to 500MB per file via OpenAI Vector Store.

**Video Analysis** Gemini native video understanding for YouTube URL analysis (no download required, processed natively) and direct file upload up to 500MB. Full breakdown of visuals, audio, dialogue, editing style and key moments.

**Generation** Image generation and editing, video studio across Luma, Veo and Kling, music generation via ElevenLabs, video analysis via upload or YouTube URL.

**Builder Tools** Vision to Code, Web Architect, Game Engine, Code Lab with SQL Architect, Bug Buster, Git Guru and more. Tavily web search across all models.

**Voice & Audio** Real-time 2-way voice chat at near-zero latency, AI podcast mode downloadable as MP3, Voiceover, Voice Notes, Voice Tuner.

**Platform** Custom agents, 30+ live interactive themes, smart search, media gallery, folder organisation, full RTL support across 26 languages, iOS and Android apps, Apple Vision Pro.

**Where it is now** 129 countries. Currently at 40 new signups a day. 1,080 signups so far after 4 weeks or so. MRR just started. Zero ad spend. All of it built solo, one feature at a time, on a balcony in Bahrain.

**The Stack:**
Frontend - Next.js, Capacitor (iOS and Android) and Vanilla JS / React
Backend - Vercel serverless functions, Firebase / Firestore (database + auth) and Firebase Admin SDK
AI Models - OpenAI (GPT, GPT-Image-1), Anthropic (Claude), Google (Gemini), xAI (Grok), DeepSeek
Generation APIs - Luma AI (video), Kling via Replicate (video), Veo via Replicate (video), ElevenLabs (music), Flux via Replicate (image editing), Meshy (3D, coming soon)
Integrations - Google Drive (OAuth 2.0), Notion (OAuth 2.0), Tavily (web search), OpenAI Vector Store (RAG), Stripe (payments), CloudConvert (document conversion), Sentry (error tracking), Formidable (file handling)
Rendering - Mermaid (flow charts) and MathJax
Platforms - Web, iOS, Android, Apple Vision Pro (visionOS)
Languages - 26 UI languages with full RTL support

[asksary.com](http://asksary.com)

Happy to answer questions on any part of the build - stack, architecture, API cost management, anything.
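The "memory layer above the models" idea from this post can be illustrated with a toy Python sketch. Nothing here reflects AskSary's actual code: the `MemoryLayer` class, the keyword-based routing rule, and the stub model functions are hypothetical, and a real system would call provider APIs and persist history in a database rather than in a dictionary.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional

# A model backend is any function mapping a prompt string to a reply string.
ModelFn = Callable[[str], str]


def chat_model(prompt: str) -> str:
    """Stub backend; reports how many lines of context it received."""
    return f"chat saw {len(prompt.splitlines())} lines"


def coder_model(prompt: str) -> str:
    """Second stub backend, standing in for a different provider."""
    return f"coder saw {len(prompt.splitlines())} lines"


@dataclass
class MemoryLayer:
    """Keeps per-user history outside any single model (hypothetical design)."""
    models: Dict[str, ModelFn]
    history: Dict[str, List[str]] = field(default_factory=dict)

    def route(self, prompt: str) -> str:
        # Toy auto-routing rule (an assumption): prompts mentioning code
        # go to "coder", everything else goes to "chat".
        return "coder" if "code" in prompt.lower() else "chat"

    def ask(self, user: str, prompt: str, model: Optional[str] = None) -> str:
        name = model or self.route(prompt)  # a manual override wins
        past = self.history.setdefault(user, [])
        # Every model sees the same shared history, whichever model wrote it.
        reply = self.models[name]("\n".join(past + [prompt]))
        past.extend([f"user: {prompt}", f"{name}: {reply}"])
        return reply


if __name__ == "__main__":
    layer = MemoryLayer(models={"chat": chat_model, "coder": coder_model})
    print(layer.ask("u1", "Hello"))
    print(layer.ask("u1", "Write code for sorting"))
```

Because the history is keyed by user rather than by model, switching models mid-conversation does not drop context, which is the behaviour the post describes.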

Other · Developers · Apr 28, 2026

AI Infrastructure

AI Comedian's Strategy to Protect Voice from AI Training

Apparently the best defense against AI copying your voice is strawberry mango forklift supersize fries.

Global · General · Apr 27, 2026

AI Audio

Microsoft's Open-Source Voice AI: VibeVoice Unveiled

Open-Source Frontier Voice AI

Global · Developers · Apr 27, 2026

AI Audio

AI Audio Tool OmniVoice: Revolutionizing Voice Synthesis

OmniVoice: Revolutionizing Voice Synthesis with AI In the ever evolving landscape of artificial intelligence, OmniVoice emerges as a game changer in voice synth…

Global · General · Apr 27, 2026

AI Tools

Durable.co: AI Platform for Rapid Website, Brand, and Invoice Creation

AI-driven platform for rapid website, brand, and invoice creation.

Global · Founders · Apr 27, 2026

AI Video

Fliki: AI Text to Video with Voiceovers

Transform text into captivating videos with lifelike AI voiceovers.

Global · Marketers · Apr 27, 2026

AI Audio

MiMo-V2.5 Voice: Bilingual ASR for Dialects, Code-Switching, and Songs

Bilingual ASR for dialects, code-switching, and songs

Global · General · Apr 27, 2026

AI Audio

Grok Voice Think Fast 1.0 API Release: Advanced Voice Agent

Our most capable voice agent is now available via API

Global · General · Apr 27, 2026

AI Video

AI Video Tools for Ads and Content: A Comprehensive Review

Been experimenting with a few AI video tools recently to speed up content + ad creation, figured I’d share what actually stood out. These tools are getting pretty good, especially if you don’t have a full editing setup or team.

Here’s a quick breakdown of what I tried:

Runway
What it does: Text/image to video + editing tools
Cool stuff: Good quality outputs, lots of features
Best for: Creative experiments, short clips
My take: Powerful, but took me a bit to get consistent results

Pika
What it does: Generates short videos from prompts
Cool stuff: Fast and easy to try ideas
Best for: Quick social clips
My take: Fun to use, but hard to control exact outcomes

Synthesia
What it does: AI avatar videos with voice
Cool stuff: Clean talking head style content
Best for: Tutorials, explainers
My take: Solid for info content, less useful for ads

InVideo AI
What it does: Script to full video
Cool stuff: Templates + automation
Best for: Beginners, quick drafts
My take: Easy, but everything started to feel templated

Luma Dream Machine
What it does: Realistic AI generated scenes
Cool stuff: Visually impressive outputs
Best for: Cinematic style clips
My take: Looks great, but hit or miss depending on prompt

Higgsfield
What it does: AI video with more control over shots + motion
Cool stuff: Can guide camera movement, pacing, structure
Best for: Ads or anything that needs to feel intentional
My take: Feels closer to actually building a video vs just generating one

Biggest takeaways:
most tools are great for ideas, not final ads
control > randomness if you’re making anything performance focused
you’ll probably end up combining tools instead of relying on one

A lot of these have free tiers, so worth testing yourself. If I had to pick one I’d keep experimenting with, probably Higgsfield, just because the extra control makes it feel a bit more usable for actual ad work.

Curious what others are sticking with rn 👀

Global · General · Apr 27, 2026

AI Tools

Parlor Jarvis: Real-Time Multilingual Voice AI Tool

Parlor Jarvis: Revolutionizing Multilingual Voice Output with Real Time AI In today's interconnected world, effective communication across languages is more cru…

Global · General · Apr 26, 2026

AI Audio

ElevenLabs Launches AI Voice Generation Platform

AI voice generation and text-to-speech platform.

Global · General · Apr 26, 2026

AI Tools

Parlor Jarvis: Real-Time AI with Audio, Screen Input & Voice Output

Realtime AI Translation with Parlor Jarvis: Revolutionizing Multilingual Communication Parlor Jarvis is an innovative AI tool designed to bridge language barrie…

Global · General · Apr 26, 2026

AI Video

Synthesia AI Video Generator: Create AI Videos with Avatars and Voiceovers

Create AI-generated videos with avatars and voiceovers.

Global · General · Apr 26, 2026

AI Audio

VoxCPM2: OpenBMB's Latest AI Audio Model Revolutionizes

VoxCPM2: Revolutionizing AI Audio with OpenBMB's Latest Model VoxCPM2, the newest addition to the AI audio landscape from OpenBMB, is set to transform the way w…

Global · General · Apr 26, 2026

AI Video

HeyGen: AI Video Generation with Avatars and Voice Cloning

AI video generation with avatars and voice cloning.

Global · General · Apr 26, 2026

AI Audio

OpenBMB/VoxCPM2: Revolutionizing AI Audio on Hugging Face

OpenBMB/VoxCPM2: Revolutionizing Voice Command Automation OpenBMB/VoxCPM2 is a cutting edge tool designed to facilitate seamless integration of voice command fe…

Global · General · Apr 26, 2026
