LLM
Collection
4 items • Updated
A mathematical framework analyzing Transformers as interacting particle systems reveals the emergence of clusters over time.
Transformers play a central role in the inner workings of large language models. We develop a mathematical framework for analyzing Transformers based on their interpretation as interacting particle systems, which reveals that clusters emerge in long time. Our study explores the underlying theory and offers new perspectives for mathematicians as well as computer scientists.
Get this paper in your agent:
hf papers read 2312.10794 curl -LsSf https://hf.co/cli/install.sh | bash No model linking this paper
No dataset linking this paper
No Space linking this paper