Archive
Discover and discuss technology tools
Explore the Tiscuss archive by category or keyword, then jump into conversations around what matters most.
Building a ChatGPT-like LLM in PyTorch from Scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Dive into LLMs: Hands-On AI Framework Tutorial
Dive into LLMs (动手学大模型): a hands-on programming tutorial series
Hormone Lab Results Interpreter: AI Tool for Men's Health
Men's health is complex, and hormone levels play a crucial role in overall well-being. Understanding h…
Trading System V2: AI's Role in Deterministic Execution
Thanks to the incredible feedback on my last post, I’m officially moving away from the "distributed veto" system (where 8 LLM agents argue until they agree to trade). For v2, I am implementing a strict State Machine using a deterministic runtime (llm-nano-vm).

The new rule is simple: Python owns the math and the execution contract. The LLM only interprets the context.

I've sketched out a 5-module architecture, but before I start coding the new Python feature extractors, I want to sanity-check the exact roles I’m giving to the AI. Here is the blueprint:

1. The HTF Agent (Higher Timeframe - D1/H4)
   - Python: Extracts structural levels, BOS/CHoCH, and premium/discount zones.
   - LLM Role: Reads this hard data to determine the institutional narrative and select the most relevant Draw on Liquidity (DOL).
2. The Structure Agent (H1)
   - Python: Identifies all valid Order Blocks (OB) and Fair Value Gaps (FVG) with displacement.
   - LLM Role: Selects the highest-probability Point of Interest (POI) based on the HTF Agent's narrative.
3. The Trigger Agent (M15/M5)
   - 100% Python (NO LLM): Purely deterministic. It checks for liquidity sweeps and LTF CHoCH inside the selected POI.
4. The Context Agent
   - LLM Role: Cross-references active killzones, news blackouts, and currency correlations to either greenlight or veto the setup.
5. The Risk Agent
   - 100% Python (NO LLM): Calculates Entry, SL, TP, Expected Value (EV), and position sizing.

The state machine will only transition to EXECUTING if the deterministic Trigger and Risk modules say yes. The LLMs are basically just "context providers" for the state machine.

My questions for the quants/architects here:

- Does this division of labor make sense? Am I giving the LLMs too much or too little responsibility in step 1 and 2?
- By making the Trigger layer (M15/M5) 100% deterministic, am I losing the core advantage of having an AI, or is this the standard way to avoid execution paralysis?
- Would you merge the HTF and Structure agents to reduce token constraints/hallucinations, or is separating them better for debugging?

Would love to hear your thoughts before I dive into the codebase.
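The gating rule described above (LLM outputs can veto, but only the deterministic Trigger and Risk checks can approve a transition to EXECUTING) could be sketched roughly as follows. This is a minimal illustration, not the author's implementation and not the llm-nano-vm API: the `State`, `Setup`, and `next_state` names, the extra WAITING and REJECTED states, and the `min_ev` threshold are all hypothetical.

```python
from dataclasses import dataclass
from enum import Enum, auto


class State(Enum):
    # Hypothetical state names; only EXECUTING appears in the post.
    WAITING = auto()    # setup is valid so far, trigger has not fired yet
    EXECUTING = auto()  # deterministic modules approved the trade
    REJECTED = auto()   # vetoed by context or failed the risk check


@dataclass(frozen=True)
class Setup:
    narrative: str         # LLM: HTF agent's reading of the higher timeframe
    poi: str               # LLM: Structure agent's chosen point of interest
    context_ok: bool       # LLM: Context agent's greenlight (True) or veto (False)
    trigger_ok: bool       # Python: sweep + LTF CHoCH detected inside the POI
    expected_value: float  # Python: EV computed by the Risk agent


def next_state(setup: Setup, min_ev: float = 0.0) -> State:
    """Return the next state for a candidate setup.

    LLM-derived fields can only block a trade; approval requires both
    deterministic checks (trigger and risk) to pass.
    """
    if not setup.context_ok:
        return State.REJECTED
    if not setup.trigger_ok:
        return State.WAITING
    if setup.expected_value <= min_ev:
        return State.REJECTED
    return State.EXECUTING


if __name__ == "__main__":
    candidate = Setup(
        narrative="bullish, drawing on prior daily high",
        poi="H1 order block",
        context_ok=True,
        trigger_ok=True,
        expected_value=0.35,
    )
    print(next_state(candidate))
```

Keeping the LLM outputs as plain data fields on the setup, rather than letting an agent call the transition itself, makes every decision replayable from logged inputs, which is one argument for keeping the HTF and Structure agents separate while debugging.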
AI Tool Comparison: Claude, GPT-4, and Gemini for Article Summarization
I've been building a product around AI-powered reading (more on that later) and wanted to share findings on summarization quality across major LLMs.

Tested with 50 articles across news, research papers, blog posts, and technical docs:

**Claude (Sonnet/Haiku):**
- Best at preserving nuance and avoiding oversimplification
- Strongest at academic content
- Excellent for "explain this without losing the point"

**GPT-4:**
- Fastest summaries, often most concise
- Sometimes drops important context
- Good for news, weaker on academic

**Gemini:**
- Strongest source citations
- Tends to add information not in the original
- Good for factual content, but be careful with creative content

Most surprising finding: **bias detection accuracy**. Claude correctly flagged loaded language and framing in 78% of test articles. GPT 64%. Gemini 51%.

Anyone else doing similar comparisons? Would love to hear what you're seeing.
New Benchmark for Testing LLMs for Deterministic Outputs
In the rapidly evolving landscape of artificial intelligence, the evaluation of lar…
Mastering AEO: How to Get Cited by AI and Boost Your Visibility
SEO or AEO? Why you’re not showing up in AI answers (yet)

This is a consolidation of findings from Neil Patel and Hubspot plus what we have found to work well on our own website.

Most business owners are still playing the old game. Some aren’t playing at all. They’re thinking in rankings, keywords, and “getting to page one.” Meanwhile, the ground is shifting under them.

Google Search is still dominant, but even it has changed. It’s no longer just a list of blue links. It’s summarizing, interpreting, and answering. And tools like ChatGPT and Perplexity AI aren’t ranking pages at all. They’re answering questions.

Which creates a problem most people haven’t fully processed yet: **Users don’t need to click your website anymore to get value.** CTR is dropping. Site visits are declining. Because the answer is already sitting in front of them.

And yet, paradoxically… **Your website has never mattered more.** Because now it’s not just competing for clicks. It’s competing to be **the source that gets cited in the answer.**

# What actually changed

AI search works like this:

User asks a question → system searches multiple sources → pulls the best chunks → builds an answer → cites what it trusts

If your content isn’t structured for that flow, you don’t exist. Not “low ranking.” Invisible.

# What AI actually cares about

AI doesn’t care about your keyword density or your clever SEO hacks. It cares if your content is:

* easy to find
* easy to understand
* easy to quote

That’s AEO (Answer Engine Optimization). Not magic. Not a secret algorithm. Just being usable inside an answer.

# What actually works

If you do nothing else, do this:

# 1. Start with the answer

Don’t spend 800 words “building context.”

Bad: “AI is transforming industries…”

Better: “AEO is how you structure content so AI tools can find, understand, and cite it in answers.”

That’s what gets pulled.

# 2. Structure like a human, not a content farm

Use:

* clear headings
* short sections
* simple tables
* FAQs

AI extracts. It doesn’t patiently read your thought leadership essay. Walls of text = ignored.

# 3. Be consistent about who you are

Your:

* business name
* description
* services
* location

Need to match everywhere. If your site, LinkedIn, Reddit, and directories all say different things, AI doesn’t trust you. No trust = no citation.

# 4. Keep things updated

Outdated content doesn’t get used. Simple:

* update pages
* keep timestamps current
* maintain your sitemap

Not exciting. Still works.

# 5. Let crawlers access your site

If AI crawlers can’t access your content, you won’t get cited. Blocking them and expecting visibility is… optimistic.

# 6. Measure the right things

Stop obsessing over rankings. Track:

* Are you mentioned?
* Are you cited?
* Which pages show up?

If you’re not measuring AI visibility, you’re guessing.

# Why you’re not cited (yet)

Most businesses don’t get cited because:

* their content is vague
* their structure is messy
* their positioning is inconsistent

AI didn’t ignore you. It couldn’t understand you.

# What you actually need (and what you don’t)

You don’t need:

* a massive content team
* expensive tools
* some “AI SEO expert” selling confidence

You need:

* 10–20 clear, structured pages
* direct answers
* consistent messaging
* basic technical setup

That’s enough to start showing up.

# The technical layer (the stuff everyone ignores)

These are the files quietly determining whether you exist to AI at all.

# robots.txt

Controls crawler access. If bots can’t crawl your site, you don’t get indexed.

# sitemap.xml

Tells crawlers what pages exist and what’s been updated. No sitemap = slower discovery = less visibility.

# JSON-LD (structured data)

Explains what your business, pages, and content actually are. Without it, AI guesses. Poorly.

# llms.txt

A machine-readable summary of your site for AI systems. Not widely adopted yet, but useful for shaping how you’re interpreted.

# crawlers.txt

An emerging way to control AI-specific crawlers. Still early. Treat it as a signal, not enforcement.

# Human query-based metadata

Your content should be built around real questions, not keyword fantasies.

Instead of: “AI Solutions for SMB Efficiency Optimization”

Write: “How can a small business use AI without hiring a developer?”

AI systems think in questions. If you match that, you get used. If you don’t, you get skipped.

# How it all fits together

* robots.txt / crawlers.txt → controls access
* sitemap.xml → tells crawlers what exists
* JSON-LD → explains what things are
* llms.txt → suggests how to interpret it
* query-based content → makes it usable in answers

Miss one, you weaken the system. Miss most, you disappear.

# Simple test

Ask: “What companies would you recommend for \[your category\] in \[your region\]?”

If you’re not mentioned or cited, that’s your baseline. No opinions. Just signal.

# Bottom line

SEO was about ranking pages. AEO is about being useful inside an answer. If your content helps AI explain something clearly, you get cited.
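Two pieces of the technical layer described above, crawler access via robots.txt and structured data via JSON-LD, can be checked or generated with Python's standard library alone. The sketch below is illustrative and rests on assumptions: the robots.txt content, the example.com URLs, the business details, and the helper names `can_crawl` and `organization_jsonld` are all made up for the example. `GPTBot` is used as one example of an AI crawler user agent.

```python
import json
from urllib.robotparser import RobotFileParser

# Illustrative robots.txt: blocks one AI crawler from /private/ only.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /private/

User-agent: *
Allow: /
"""


def can_crawl(robots_txt: str, user_agent: str, url: str) -> bool:
    """Check whether a given crawler may fetch a URL under these rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


def organization_jsonld(name: str, url: str, description: str) -> str:
    """Build a minimal schema.org Organization block as a JSON-LD string."""
    data = {
        "@context": "https://schema.org",
        "@type": "Organization",
        "name": name,
        "url": url,
        "description": description,
    }
    return json.dumps(data, indent=2)


if __name__ == "__main__":
    for path in ("/blog/what-is-aeo", "/private/drafts"):
        url = f"https://example.com{path}"
        print(path, "crawlable by GPTBot:", can_crawl(ROBOTS_TXT, "GPTBot", url))

    # The output would typically go inside a
    # <script type="application/ld+json"> tag in the page head.
    print(organization_jsonld(
        name="Example Co",
        url="https://example.com",
        description="Helps small businesses use AI without hiring a developer.",
    ))
```

Running the access check against your real robots.txt for each crawler you care about is a quick way to confirm you are not blocking the same systems you hope will cite you.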
How Do Developers Correct AI LLMs When They Spread Misinformation?
I watched Last Week Tonight's piece on AI chatbots today, and it got me thinking about that old screenshot of a Google search in which Gemini recommends adding "1/8 cup of non-toxic glue" to pizza in order to make the cheese stick better to the slice. When something like this goes viral, I have to assume (though I could be wrong) that an employee at Google specifically goes out of their way to address that topic in particular. The image is a meme, of course, but I imagine Google wouldn't be keen to leave themselves open to liability if their LLM recommends that users consume glue. Does the developer "talk" to the LLM to correct it about that specific case? Do they compile specific information about (e.g.) pizza construction techniques and feed it that data to bring it to the forefront? Do their actions correct only the case in question, or do they make changes to the LLM that affect its accuracy more broadly (e.g. "teaching" the LLM to recognize that some Reddit comments are jokes)? On a heavier note, the LWT piece includes several stories of chatbots encouraging users to self-harm. How does the process differ when developers are trying to prevent an LLM from giving that sort of response?
Waiting for LLMs? Play a Game with This AI Tool
In the fast-paced world of technology, waiting for the latest large language models (LLMs) can be frustrating. F…
Preventing AI Model Collapse: The Need for Human-Generated Data
I'm all for acceleration. I think the faster we hit AGI the better. But there's a bottleneck nobody here talks about enough: training data. Right now we are quietly poisoning the well. More than half of online content is already synthetic. Bots talking to bots, articles written by AI, Reddit threads generated by LLMs. When the next generation of models trains on this, they eat their own tail. Model collapse is real. We saw it with image generators. Outputs get blander, weirder, less useful.

We need a way to label or filter human-generated data. Not because humans are better but because diversity prevents collapse. I know the standard solution sounds like a dystopian meme. Biometric scanners, iris codes, hardware verification. And yeah, maybe it is dystopian. But so is a dead internet where nothing can be trusted.

Reddit CEO Steve Huffman put it simply recently: platforms need to know you're human without knowing your name. Face ID / Touch ID level stuff. I'm not saying that specific device is the answer. But the category of solution, proof of human that doesn't create a surveillance state, seems necessary if we want to keep scaling past the cliff.

What do you think? Is proof-of-personhood just a regulatory speed bump, or is it infrastructure for the next generation of AI?

Curious where this sub lands.
Show HN: AI Prediction Market Analysis App with LLMs and Data APIs
Discover the future of market analysis with our innovative AI Prediction Market Analysis App.…
QuickCompare by Trismik: Compare & Pick Best LLMs
Compare LLMs on your data, measure, and pick the best.
Arc Sentry: Advanced Prompt Injection Detector for LLMs
Been working on Arc Sentry, a whitebox prompt injection detector for self-hosted LLMs (Mistral, Llama, Qwen).

Most detectors pattern-match on known attack phrases. Arc Sentry watches what the prompt does to the model’s internal representation instead, so it catches indirect, hypothetical, and roleplay-framed attacks that get through keyword filters.

Benchmark on indirect/roleplay/technical prompts (40 OOD prompts):

• Arc Sentry: Recall 0.80, F1 0.84
• OpenAI Moderation API: Recall 0.75, F1 0.86
• LlamaGuard 3 8B: Recall 0.55, F1 0.71

Arc Sentry has the highest recall, so it catches more of the hard cases. Blocks before model.generate() is called. The lightweight pre-filter runs on CPU with no model access.

pip install arc-sentry

GitHub: https://github.com/9hannahnine-jpg/arc-sentry

Happy to answer questions about how it works.
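The benchmark above reports recall and F1 but not precision. Since F1 = 2PR / (P + R), precision can be recovered as P = F1 · R / (2R − F1). The sketch below applies that identity to the three reported rows. The `implied_precision` helper is a hypothetical name, and the inputs are the rounded two-decimal figures from the post, so the results are approximate.

```python
def implied_precision(recall: float, f1: float) -> float:
    """Solve F1 = 2PR / (P + R) for P, given recall R and F1."""
    return f1 * recall / (2 * recall - f1)


# (recall, F1) pairs as reported in the post, rounded to two decimals.
reported = {
    "Arc Sentry": (0.80, 0.84),
    "OpenAI Moderation API": (0.75, 0.86),
    "LlamaGuard 3 8B": (0.55, 0.71),
}

if __name__ == "__main__":
    for name, (recall, f1) in reported.items():
        precision = implied_precision(recall, f1)
        print(f"{name}: recall={recall:.2f}, F1={f1:.2f}, "
              f"implied precision ~ {precision:.2f}")
```

Because the inputs are rounded, an implied precision at or marginally above 1.0 should be read as "approximately 1.0" rather than as an exact value. The comparison is still useful for seeing where each detector sits on the recall versus false-positive trade-off.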
Deploying Local LLMs in Production: Best Practices
Discussion thread on infra, latency, and operational best practices.