Transformer
A neural network architecture based on self-attention, introduced in "Attention Is All You Need" (2017). It processes all positions of a sequence in parallel rather than one step at a time, and has largely replaced RNNs/LSTMs in many sequence-modeling tasks.
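
To make "self-attention" concrete, here is a minimal sketch of scaled dot-product self-attention in plain Python. It is an illustration under simplifying assumptions, not a full Transformer layer: queries, keys, and values are all taken to be the input vectors themselves (no learned projection matrices, no multiple heads, no masking), and the names `softmax`, `self_attention`, and `tokens` are hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x):
    """Scaled dot-product self-attention over a list of n vectors of size d.

    Assumption for illustration: Q = K = V = x (no learned weights).
    """
    d = len(x[0])
    scale = math.sqrt(d)
    out = []
    for q in x:
        # Similarity of this position's query to every position's key.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in x]
        weights = softmax(scores)
        # Output is the attention-weighted average of the value vectors.
        out.append([sum(w * v[j] for w, v in zip(weights, x)) for j in range(d)])
    return out


tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
result = self_attention(tokens)
```

Each output row depends only on the inputs, never on another position's output, so the loop over positions could run in parallel. A recurrent network, by contrast, must finish step t before it can start step t+1.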