37:05
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
Yannic Kilcher
Shared 19/10/2024
53:01
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters (Paper)
Yannic Kilcher
Shared 05/10/2024
01:03:55
Privacy Backdoors: Stealing Data with Corrupted Pretrained Models (Paper Explained)
Yannic Kilcher
Shared 04/08/2024
01:11:57
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools (Paper Explained)
Yannic Kilcher
Shared 26/06/2024
29:21
[ML News] OpenAI is in hot waters (GPT-4o, Ilya Leaving, Scarlett Johansson legal action)
Yannic Kilcher
Shared 21/05/2024
33:25
ORPO: Monolithic Preference Optimization without Reference Model (Paper Explained)
Yannic Kilcher
Shared 01/05/2024
17:46
[ML News] Devin exposed | NeurIPS track for high school students
Yannic Kilcher
Shared 27/04/2024
37:16
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
Yannic Kilcher
Shared 24/04/2024
09:54
[ML News] Microsoft to spend 100 BILLION DOLLARS on supercomputer (& more industry news)
Yannic Kilcher
Shared 15/04/2024
27:31
[ML News] Jamba, CMD-R+, and other new models (yes, I know this is like a week behind 🙃)
Yannic Kilcher
Shared 13/04/2024
44:04
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping (Searchformer)
Yannic Kilcher
Shared 06/04/2024
26:59
[ML News] Grok-1 open-sourced | Nvidia GTC | OpenAI leaks model names | AI Act
Yannic Kilcher
Shared 26/03/2024
26:49
[ML News] Devin AI Software Engineer | GPT-4.5-Turbo LEAKED | US Gov't Report: Total Extinction
Yannic Kilcher
Shared 17/03/2024
53:14
[ML News] Elon sues OpenAI | Mistral Large | More Gemini Drama
Yannic Kilcher
Shared 10/03/2024
42:33
[ML News] Groq, Gemma, Sora, Gemini, and Air Canada's chatbot troubles
Yannic Kilcher
Shared 01/03/2024
50:02
V-JEPA: Revisiting Feature Prediction for Learning Visual Representations from Video (Explained)
Yannic Kilcher
Shared 19/02/2024
01:23:58
What a day in AI! (Sora, Gemini 1.5, V-JEPA, and lots of news)
Yannic Kilcher
Shared 18/02/2024
54:23
Lumiere: A Space-Time Diffusion Model for Video Generation (Paper Explained)
Yannic Kilcher
Shared 04/02/2024