Librarium
Settings

Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention (Paper Explained)

Yannic Kilcher