BLEU score
Bilingual Evaluation Understudy. The standard automated metric for machine translation quality, which scores a model's output by its n-gram overlap with human-written reference translations. Usually reported on a scale from 0 to 100; higher is better. The attention model improved BLEU by ~2 points on English-to-French, with larger gains on longer sentences.
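
To make the definition concrete, here is a minimal sketch of how a BLEU score can be computed for a single sentence. It assumes pre-tokenized input, n-grams up to length 4 with uniform weights, the brevity penalty from the original BLEU formulation, and no smoothing. The function names `ngrams` and `bleu` are illustrative, not taken from any library. Published results are normally computed over a whole test corpus with a fixed tokenization, so this sentence-level version will not reproduce reported numbers.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(candidate, references, max_n=4):
    """Sentence-level BLEU on a 0-100 scale (illustrative, unsmoothed)."""
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(candidate, n)
        # Clip each candidate n-gram count by its maximum count in any reference.
        max_ref = Counter()
        for ref in references:
            for gram, count in ngrams(ref, n).items():
                max_ref[gram] = max(max_ref[gram], count)
        clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
        total = sum(cand_counts.values())
        # Without smoothing, any zero precision makes the whole score zero.
        if total == 0 or clipped == 0:
            return 0.0
        log_precisions.append(math.log(clipped / total))
    # Brevity penalty: penalize candidates shorter than the closest reference length.
    c = len(candidate)
    r = min((len(ref) for ref in references), key=lambda length: (abs(length - c), length))
    bp = 1.0 if c > r else math.exp(1 - r / c)
    # Geometric mean of the n-gram precisions, scaled to 0-100.
    return 100 * bp * math.exp(sum(log_precisions) / max_n)


reference = "the cat sat on the red mat".split()
print(bleu("the cat sat on the mat".split(), [reference]))
print(bleu(reference, [reference]))  # identical output scores 100.0
```

A candidate identical to its reference scores 100, while a candidate that drops a word loses points both through missed n-grams and through the brevity penalty. Because the score is a geometric mean, a short sentence with no matching 4-gram scores 0 here, which is one reason corpus-level aggregation is preferred in practice.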