Inside LLMs, Visualized
Part 1 · Cause (Mechanism)
Why attention disperses as tokens grow
Transformer attention computes token-to-token relations at N² cost. See how attention dilutes as the token count grows.
Note: this visualization's interface is in Korean.