  • Weights:
    • \(W_{Q}\), \(W_{K}\), \(W_{V}\)
    • \(W_{O}\)
    • \(W_{\text{up}}\), \(W_{\text{down}}\)

Layers

-------------
[NORM]
[ATTN]
    [QKV]
    [Self Dot-Product]
    [O Proj]
[Residual Add]
[NORM]
[MLP]
    [UP Proj]
    [NORM]
    [DOWN Proj]
[Residual Add]
[NORM]
-------------
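
The layer stack above can be read as a forward pass. The following is a minimal sketch in dependency-free Python, not a reference implementation. Several things are assumptions not stated in the notes: a single attention head with no mask, a norm without learned scale or shift, residual adds that use the un-normalized input of each sub-block, square attention weights of size d_model x d_model, and the row-vector convention x @ W. The diagram lists a [NORM] between the up and down projections and no activation function, so the sketch follows the diagram literally. The helper names (matmul, norm, block, and so on) are hypothetical.

```python
import math
import random

def matmul(a, b):
    """Multiply a (n x k) by b (k x m); both are lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def add(a, b):
    """Element-wise sum of two equally shaped matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def norm(x, eps=1e-5):
    """[NORM]: give each row zero mean and unit variance (assumed: no learned parameters)."""
    out = []
    for row in x:
        mean = sum(row) / len(row)
        var = sum((v - mean) ** 2 for v in row) / len(row)
        out.append([(v - mean) / math.sqrt(var + eps) for v in row])
    return out

def softmax(row):
    """Numerically stable softmax over one row of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def attention(x, w_q, w_k, w_v, w_o):
    """[ATTN]: single-head, unmasked self-attention (assumed variant)."""
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)   # [QKV]
    scale = math.sqrt(len(q[0]))
    k_t = [list(col) for col in zip(*k)]                        # transpose of K
    scores = [[s / scale for s in row] for row in matmul(q, k_t)]
    probs = [softmax(row) for row in scores]                    # [Self Dot-Product]
    return matmul(matmul(probs, v), w_o)                        # [O Proj]

def mlp(x, w_up, w_down):
    """[MLP]: up projection, norm, down projection, exactly as drawn."""
    h = matmul(x, w_up)        # [UP Proj]
    h = norm(h)                # [NORM]
    return matmul(h, w_down)   # [DOWN Proj]

def block(x, w):
    """One pass through the stack between the dashed lines."""
    x = add(x, attention(norm(x), w["W_Q"], w["W_K"], w["W_V"], w["W_O"]))  # [Residual Add]
    x = add(x, mlp(norm(x), w["W_up"], w["W_down"]))                        # [Residual Add]
    return norm(x)                                                          # final [NORM]

def random_matrix(rows, cols, rng, std=0.02):
    """Hypothetical initializer, used only to make the sketch runnable."""
    return [[rng.gauss(0.0, std) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
seq_len, d_model, d_ff = 4, 8, 32   # illustrative sizes, not from the notes
weights = {
    "W_Q": random_matrix(d_model, d_model, rng),
    "W_K": random_matrix(d_model, d_model, rng),
    "W_V": random_matrix(d_model, d_model, rng),
    "W_O": random_matrix(d_model, d_model, rng),
    "W_up": random_matrix(d_model, d_ff, rng),
    "W_down": random_matrix(d_ff, d_model, rng),
}
x = random_matrix(seq_len, d_model, rng, std=1.0)
y = block(x, weights)
print(len(y), len(y[0]))  # 4 8
```

The output has the same shape as the input (seq_len x d_model), which is what lets blocks of this kind be stacked. In the sketch only \(W_{up}\) and \(W_{down}\) change the feature width, from d_model to d_ff and back.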