Argmax
Argmax
Vahe Hagopian, Taka Hasegawa, Farrukh Rahman
A show where three machine learning enthusiasts talk about recent papers and developments in machine learning. Watch our video on YouTube https://www.youtube.com/@argmaxfm
Mixture of Experts
In this episode we talk about the paper "Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean.
Oct 8, 2024
54 min
LoRA
We talk about Low Rank Approximation for fine tuning Transformers. We are also on YouTube now! Check out the video here: https://youtu.be/lLzHr0VFi3Y
Sep 2, 2023
1 hr 2 min
15: InstructGPT
In this episode we discuss the paper "Training language models to follow instructions with human feedback" by Ouyang et al (2022). We discuss the RLHF paradigm and how important RL is to tuning GPT.
Mar 28, 2023
57 min
14: Whisper
This week we talk about Whisper. It is a weakly supervised speech recognition model.
Mar 17, 2023
49 min
13: AlphaTensor
We talk about AlphaTensor, and how researchers were able to find a new algorithm for matrix multiplication.
Mar 11, 2023
49 min
12: SIRENs
In this episode we talked about "Implicit Neural Representations with Periodic Activation Functions" and the strength of periodic non-linearities.
Oct 25, 2022
54 min
11: CVPR Workshop on Autonomous Driving Keynote by Ashok Elluswamy, a Tesla engineer
In this episode we discuss this video: https://youtu.be/jPCV4GKX9DwHow Tesla approaches collision detection with novel methods.
Sep 30, 2022
48 min
10: Outracing champion Gran Turismo drivers with deep reinforcement learning
We discuss Sony AI's accomplishment of creating a novel AI agent that can beat professional racers in Gran Turismo. Some topics include:- The crafting of rewards to make the agent behave nicely- What is QR-SAC?- How to deal with "rare" experiences in the replay bufferLink to paper: https://www.nature.com/articles/s41586-021-04357-7
Aug 23, 2022
54 min
8: GATO (A Generalist Agent)
Today we talk about GATO, a multi-modal, multi-task, multi-embodiment generalist agent.
Jul 29, 2022
44 min
9: Heads-Up Limit Hold'em Poker Is Solved
Today we talk about recent AI advances in Poker; specifically the use of counterfactual regret minimization to solve the game of 2-player Limit Texas Hold'em.
Jul 29, 2022
47 min
Load more