KV Chunk (Key-Value Chunk)
A segment of the key and value matrices corresponding to a subset of the sequence.
A segment of the key and value matrices corresponding to a subset of the sequence. In Ring Attention with P GPUs, each GPU initially holds one KV chunk. As computation proceeds, chunks circulate around the ring.