Librarium
Settings
NOTICE:
This is a dev instance, don't rely on it working 100% of the time.
1/1
58:22
Sparse Expert Models (Switch Transformers, GLAM, and more... w/ the Authors)
Yannic Kilcher
Shared 21/04/2022
33:46
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
Yannic Kilcher
Shared 22/01/2021