Definition
The Transformer is the neural network architecture introduced by Google in 2017 that revolutionized AI. It uses a mechanism called 'attention' to process data sequences in parallel rather than sequentially. Virtually all modern language models (GPT, Claude, Gemini, LLaMA) are based on this architecture.
Related terms
EXPLORE