close

Forem

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
There Are Too Many RAG Optimization Techniques, So I Organized Them — and the Big Picture Finally Made Sense

There Are Too Many RAG Optimization Techniques, So I Organized Them — and the Big Picture Finally Made Sense

Image 1
Comments
8 min read
Building a Voice-Controlled Local AI Agent with Whisper, LLaMA, and LangGraph
Cover image for Building a Voice-Controlled Local AI Agent with Whisper, LLaMA, and LangGraph

Building a Voice-Controlled Local AI Agent with Whisper, LLaMA, and LangGraph

Comments
4 min read
I Stopped Treating AI Spend Like a Monthly Bill

I Stopped Treating AI Spend Like a Monthly Bill

Comments
1 min read
Building a Voice-Controlled Local AI Agent using Speech-to-Text and LLMs

Building a Voice-Controlled Local AI Agent using Speech-to-Text and LLMs

Comments
2 min read
The gap between detecting hallucinations and handling them

The gap between detecting hallucinations and handling them

Image 2
Comments
2 min read
From 66% to 96%: How I Fixed a Drive-Thru Voice Agent Before It Took a Single Real Call
Cover image for From 66% to 96%: How I Fixed a Drive-Thru Voice Agent Before It Took a Single Real Call

From 66% to 96%: How I Fixed a Drive-Thru Voice Agent Before It Took a Single Real Call

Image 1
Comments
4 min read
Sonnet 4.6 vs Haiku 4.5 vs Opus 4.6: I Tested 3 Claude Models on 10 Real Tasks
Cover image for Sonnet 4.6 vs Haiku 4.5 vs Opus 4.6: I Tested 3 Claude Models on 10 Real Tasks

Sonnet 4.6 vs Haiku 4.5 vs Opus 4.6: I Tested 3 Claude Models on 10 Real Tasks

Comments
3 min read
Why I Built TokenBar: AI Spend Should Not Be a Monthly Surprise

Why I Built TokenBar: AI Spend Should Not Be a Monthly Surprise

Comments
1 min read
The 'Freelance Agentics' Era: How AI Agents Are Reshaping Freelance Work
Cover image for The 'Freelance Agentics' Era: How AI Agents Are Reshaping Freelance Work

The 'Freelance Agentics' Era: How AI Agents Are Reshaping Freelance Work

Comments
3 min read
How Top Companies Are Shipping AI Agents Today (Apr 15)
Cover image for How Top Companies Are Shipping AI Agents Today (Apr 15)

How Top Companies Are Shipping AI Agents Today (Apr 15)

Comments
3 min read
The day I realized AI costs need a warning light

The day I realized AI costs need a warning light

Comments
2 min read
Boosting llama.cpp with Auto-Tuning, Qwen Quantization Benchmarks, & Mobile Ollama AI Servers

Boosting llama.cpp with Auto-Tuning, Qwen Quantization Benchmarks, & Mobile Ollama AI Servers

Comments
3 min read
SafePaths: How We Reduced Token Consumption by 85% — The Benchmark Story

SafePaths: How We Reduced Token Consumption by 85% — The Benchmark Story

Comments
2 min read
340% and Climbing: What the CIS Prompt Injection Report Means for Enterprise AI Agents
Cover image for 340% and Climbing: What the CIS Prompt Injection Report Means for Enterprise AI Agents

340% and Climbing: What the CIS Prompt Injection Report Means for Enterprise AI Agents

Comments
10 min read
No GPU? No problem!, running local AI efficiently on my CPU.
Cover image for No GPU? No problem!, running local AI efficiently on my CPU.

No GPU? No problem!, running local AI efficiently on my CPU.

Comments
5 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.