High Bit
High Bit
Initialized Capital
ZeroEntropy: The Hidden Bottleneck in AI. Retrieval, Not Models
34 minutes Posted Jan 30, 2026 at 10:18 pm.
AI models keep getting better, but most AI systems still fail in production. Why?In this episode of High Bit, Brett Gibson sits down with Ghita Houir Alami, cofounder and CEO of ZeroEntropy, to break down the real bottleneck holding AI agents back: retrieval.Ghita explains why embeddings alone can’t reliably surface the right information, why tools like Slack search feel so frustrating, and how rerankers add a critical second pass that dramatically improves accuracy. She walks through ZeroEntropy’s approach to training rerankers using pairwise comparisons and Elo-style scoring, and why this method generalizes across domains like code, finance, and biology.The conversation goes deep into:Why AI agents fail even when the data exists.How reranking fixes poor ordering from vector search.Why “accuracy” now includes helpful context, not just correct answers.What actually changes when retrieval becomes trustworthy enough to remove humans from the loop.If you’re building AI agents, search systems, customer support bots, or internal knowledge tools, this episode explains what’s breaking today, and what has to change for AI to work reliably at scale.
What changes when retrieval works
What ZeroEntropy builds
Why retrieval became the real problem
Why search fails (Slack included)
Why embeddings fall short
Rerankers: the missing layer
Why rerankers matter most
Pairwise ranking vs scoring
Elo scoring for documents
Fast rerankers via distillation
Why old training methods break
Retrieval for AI agents
Recency, memory, personalization
What reliable retrieval unlocks
0:00
34:41
Download MP3
Show notes
AI models keep getting better, but most AI systems still fail in production. Why?In this episode of High Bit, Brett Gibson sits down with Ghita Houir Alami, cofounder and CEO of ZeroEntropy, to break down the real bottleneck holding AI agents back: retrieval.Ghita explains why embeddings alone can’t reliably surface the right information, why tools like Slack search feel so frustrating, and how rerankers add a critical second pass that dramatically improves accuracy. She walks through ZeroEntropy’s approach to training rerankers using pairwise comparisons and Elo-style scoring, and why this method generalizes across domains like code, finance, and biology.The conversation goes deep into:Why AI agents fail even when the data exists.How reranking fixes poor ordering from vector search.Why “accuracy” now includes helpful context, not just correct answers.What actually changes when retrieval becomes trustworthy enough to remove humans from the loop.If you’re building AI agents, search systems, customer support bots, or internal knowledge tools, this episode explains what’s breaking today, and what has to change for AI to work reliably at scale.(00:00) What changes when retrieval works(00:39) What ZeroEntropy builds(01:42) Why retrieval became the real problem(03:12) Why search fails (Slack included)(05:11) Why embeddings fall short(07:11) Rerankers: the missing layer(10:11) Why rerankers matter most(12:44) Pairwise ranking vs scoring(13:52) Elo scoring for documents(16:33) Fast rerankers via distillation(18:07) Why old training methods break(21:29) Retrieval for AI agents(24:20) Recency, memory, personalization(32:06) What reliable retrieval unlocks(33:42) What’s next for ZeroEntropyFollow Ghita and ZeroEntropy for more:X@ghita__ha@ZeroEntropy_AILinkedInhttps://www.linkedin.com/in/ghita-houir-alami/https://www.linkedin.com/company/zeroentropy-inc