31:45
LLaMA Pro: Progressive LLaMA with Block Expansion (Paper Explained)
Yannic Kilcher
Shared 07/01/2024
40:40
Mamba: Linear-Time Sequence Modeling with Selective State Spaces (Paper Explained)
Yannic Kilcher
Shared 24/12/2023
47:37
Scalable Extraction of Training Data from (Production) Language Models (Paper Explained)
Yannic Kilcher
Shared 03/12/2023
21:50
I built the most expensive CPU ever! (Every instruction is a prompt)
Yannic Kilcher
Shared 08/11/2023
32:26
Efficient Streaming Language Models with Attention Sinks (Paper Explained)
Yannic Kilcher
Shared 14/10/2023
46:44
Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution (Paper Explained)
Yannic Kilcher
Shared 07/10/2023
28:25
Retentive Network: A Successor to Transformer for Large Language Models (Paper Explained)
Yannic Kilcher
Shared 13/09/2023
53:06
Reinforced Self-Training (ReST) for Language Modeling (Paper Explained)
Yannic Kilcher
Shared 03/09/2023
44:10
[ML News] LLaMA2 Released | LLMs for Robots | Multimodality on the Rise
Yannic Kilcher
Shared 15/08/2023
53:31
DeepFloyd IF - Pixel-Based Text-to-Image Diffusion (w/ Authors)
Yannic Kilcher
Shared 12/08/2023