Next Sentence Prediction (NSP)
BERT's second pre-training objective.
BERT's second pre-training objective. Given two sentences A and B, classify whether B is the genuine next sentence after A or a randomly sampled sentence. Uses the [CLS] token's hidden state. Later research showed this task contributes little and may be removed (RoBERTa).