Bibliography
-
Transformer Interpretability Beyond Attention Visualization
-
A Rigourous Study of the Deep Taylor Decomposition
-
Transformer Interpretability Beyond Attention Visualization
-
Escaping the big data paradigm with compact transformers
-
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
subscribe via RSS