Show notes
This week on Sidecar Sync, Amith Nagarajan and Mallory Mejias trace one of the biggest stories in AI: how cutting-edge intelligence keeps getting compressed into smaller, faster, cheaper models. From GPT-4-era pricing to today’s open models like Gemma 4, they unpack the forces driving this shift, including distillation, mixture-of-experts architectures, quantization, and Google Research’s new TurboQuant breakthrough. Along the way, they explore what this means for association...



