NISHIO Hirokazu
[Translate]
Linear Attention
Tweet
Related Pages
Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention
Transformerの学習理論: In-context learningにおける汎化と最適化の理論
"
Engineer's way of creating knowledge
" the English version of my book is now available on
[Engineer's way of creating knowledge]
(C)NISHIO Hirokazu / Converted from
[Scrapbox]
at
4/12/2026, 8:41:09 PM
[Edit]