Model Size (N)
The number of parameters in a neural network.
The number of parameters in a neural network. For transformers, this includes embeddings, attention weights, and feedforward layers. Measured in millions (M), billions (B), or trillions (T).