Inside LLMs, Visualized

Part 1 · Cause (Mechanism)

Why attention disperses as tokens grow

Transformer attention computes token-to-token relations at N² cost. See how attention dilutes as the token count grows.

Note: this visualization's interface is in Korean.