CT No.243: Disambiguation, sliding doors, hallucinations, and madeleines
Transformers take static vector embeddings, which assign single values to every token, and expand their context, nearly simultaneously as they process the context of every other word in the sentence. But who cares, let's listen to a pop song!