Transformer models are gauge invariant: A mathematical connection between AI and particle physics
Leo van Nierop
arXiv:2412.14543·cs.LG·Published 2024-12-19
In particle physics, the fundamental forces are subject to symmetries called gauge invariance. It is a redundancy in the mathematical description of any physical system. In this article I will demonstrate that the transformer architecture exhibits the same properties, and show that the default representation of transformers has partially, but not fully removed the gauge invariance.
TopicsParticle & High Energy Physics
Tagsparticle-physics
arXiv categoriescs.LG, hep-th
arXiv abstract pagePDF