Ainiketan ainiketan.in

d_model (model dimension)

Appears in 1 paper

The width of every vector the Transformer passes between layers: token embeddings and all sub-layer inputs and outputs share this dimension.

As used in Paper 08 — Attention Is All You Need →

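In the original Transformer, the embedding layers and every sub-layer (attention and feed-forward) produce outputs of dimension d_model. The base model in the paper uses d_model = 512. Each sub-layer's output is added to its input through a residual connection, so both must have this dimension. A larger d_model allows more expressive representations but costs more parameters and more compute: weight matrices of shape d_model × d_model grow quadratically with it.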
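To make the cost of a larger d_model concrete, here is a minimal sketch that counts the weights in one encoder layer and checks the residual shape constraint. The function names are hypothetical, biases and layer normalization are ignored, and the feed-forward width is assumed to be 4 × d_model (the paper's 2048 / 512 ratio).

```python
def encoder_layer_weights(d_model, d_ff=None):
    """Approximate weight count for one encoder layer (biases and layer norm ignored)."""
    if d_ff is None:
        d_ff = 4 * d_model  # assumption: same ratio as the paper's 2048 / 512
    attention = 4 * d_model * d_model   # W_Q, W_K, W_V, W_O, each d_model x d_model
    feed_forward = 2 * d_model * d_ff   # two linear maps: d_model -> d_ff -> d_model
    return attention + feed_forward


def residual_add(x, sublayer_out):
    """Element-wise x + sublayer(x); both vectors must have length d_model."""
    if len(x) != len(sublayer_out):
        raise ValueError(f"residual shapes differ: {len(x)} vs {len(sublayer_out)}")
    return [a + b for a, b in zip(x, sublayer_out)]


if __name__ == "__main__":
    for d_model in (256, 512, 1024):
        print(d_model, encoder_layer_weights(d_model))
```

Under these assumptions, doubling d_model quadruples the per-layer weight count, and `residual_add` fails as soon as a sub-layer returns a vector of any other length.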

Appears in papers

Paper 08 — Attention Is All You Need →
© 2026 Ainiketan · Built for India, for free, forever

Content license: CC BY 4.0 · Hosted on Vercel · Privacy-friendly analytics (no cookies)

All summaries are original writing by Ainiketan — we link to sources and do not reproduce copyrighted text. Copyright concerns: askainiketan@gmail.com · Terms & Copyright