Segment embedding

Appears in 1 paper

One of three embeddings summed to form each token's input representation.

As used in Paper 11 — BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding →

One of three embeddings summed to form each token's input representation. Tells the model whether a token belongs to sentence A or sentence B in a two-sentence input. Two learned vectors: E_A and E_B.