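Long short-term memory (LSTM) is a recurrent neural network (RNN) architecture whose feedback connections carry a hidden state and a gated cell state from one time step to the next.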
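
To make the feedback connection concrete, here is a minimal sketch of one LSTM time step in plain Python, using the common formulation with input, forget, and output gates. The names `lstm_step`, `affine`, and `params`, and the keys `W_i`, `b_i`, and so on, are assumptions made for this illustration and are not taken from any library. The weights in the demo are zeros chosen only so the arithmetic is easy to follow.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def affine(W, v, b):
    # Matrix-vector product plus bias, with W given as a list of rows.
    return [sum(w * a for w, a in zip(row, v)) + bias for row, bias in zip(W, b)]


def lstm_step(x, h_prev, c_prev, params):
    """One LSTM time step (hypothetical helper for illustration).

    x, h_prev, c_prev are lists of floats. Each W_* has one row per hidden
    unit and len(x) + len(h_prev) columns.
    """
    v = x + h_prev  # concatenate the input with the fed-back hidden state
    i = [sigmoid(z) for z in affine(params["W_i"], v, params["b_i"])]    # input gate
    f = [sigmoid(z) for z in affine(params["W_f"], v, params["b_f"])]    # forget gate
    o = [sigmoid(z) for z in affine(params["W_o"], v, params["b_o"])]    # output gate
    g = [math.tanh(z) for z in affine(params["W_g"], v, params["b_g"])]  # candidate
    # The cell state keeps a gated share of its old value and adds new content.
    c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c_prev, i, g)]
    h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
    return h, c


def zero_params(input_size, hidden_size):
    # All-zero weights and biases: every gate evaluates to 0.5, candidate to 0.
    cols = input_size + hidden_size
    params = {}
    for gate in ("i", "f", "o", "g"):
        params["W_" + gate] = [[0.0] * cols for _ in range(hidden_size)]
        params["b_" + gate] = [0.0] * hidden_size
    return params


# Feed a short sequence through the cell; h and c are passed back in each step.
params = zero_params(input_size=1, hidden_size=2)
h, c = [0.0, 0.0], [1.0, 1.0]
for x in ([1.0], [2.0], [3.0]):
    h, c = lstm_step(x, h, c, params)
```

The loop at the end is the recurrence itself: the `h` and `c` returned by one step become the `h_prev` and `c_prev` of the next. With the zero weights used here the forget gate is 0.5, so the cell state halves at every step.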
Notes related to Long short-term memory (LSTM)
- Transformer machine learning model
Papers related to Long short-term memory (LSTM)
- Outrageously large neural networks: The sparsely-gated mixture-of-experts layer [shazeer:arxiv:2017]
- Attention is all you need [vaswani:arxiv:2017]