The encoder-decoder architecture is a powerful approach for handling sequence-to-sequence tasks. It uses an encoder to process input data and a decoder to generate output, making it well suited for tasks like translation and summarization.

This architecture shines in its ability to handle variable-length sequences and learn complex mappings. By using recurrent neural networks and techniques like attention, it can capture the essence of input data and generate appropriate outputs.

Encoder-Decoder Architecture

Key Components and Functionality

  • Encoder-decoder architecture consists of two main components: encoder and decoder, which work together to process sequential input data and generate sequential output data
  • Encoder takes input sequence and processes it to capture essential information, while decoder generates output sequence based on encoded representation
  • Encoder and decoder typically implemented using recurrent neural networks (RNNs) or variants, such as Long Short-Term Memory (LSTM) or Gated Recurrent Unit (GRU) networks
  • Enables handling of variable-length input and output sequences, making it suitable for tasks like machine translation (English to French), text summarization (news articles to headlines), and speech recognition (audio to text); a minimal code sketch of the two components follows this list
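
The bullet points above describe the two components at a high level. Below is a minimal sketch, assuming PyTorch and single-layer GRUs; the class names, vocabulary sizes, and dimensions are illustrative choices, not taken from the text.

```python
# A minimal encoder-decoder sketch (assumptions: PyTorch, GRU layers, illustrative sizes).
import torch
import torch.nn as nn

class Encoder(nn.Module):
    def __init__(self, src_vocab_size, emb_dim=256, hidden_dim=512):
        super().__init__()
        self.embedding = nn.Embedding(src_vocab_size, emb_dim)
        self.rnn = nn.GRU(emb_dim, hidden_dim, batch_first=True)

    def forward(self, src_tokens):
        # src_tokens: (batch, src_len) integer token ids
        embedded = self.embedding(src_tokens)        # (batch, src_len, emb_dim)
        outputs, hidden = self.rnn(embedded)         # hidden: (1, batch, hidden_dim)
        return outputs, hidden                       # hidden acts as the context vector

class Decoder(nn.Module):
    def __init__(self, tgt_vocab_size, emb_dim=256, hidden_dim=512):
        super().__init__()
        self.embedding = nn.Embedding(tgt_vocab_size, emb_dim)
        self.rnn = nn.GRU(emb_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, tgt_vocab_size)

    def forward(self, tgt_token, hidden):
        # tgt_token: (batch, 1) the previously generated (or ground-truth) token
        embedded = self.embedding(tgt_token)         # (batch, 1, emb_dim)
        output, hidden = self.rnn(embedded, hidden)  # one decoding step
        logits = self.out(output.squeeze(1))         # (batch, tgt_vocab_size)
        return logits, hidden
```

In this sketch the encoder's final hidden state doubles as the context vector handed to the decoder, which is the wiring the sections below walk through in more detail.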

Training and Optimization

  • Encoder and decoder trained jointly to optimize model's performance on specific task
  • Techniques like teacher forcing (providing ground truth output tokens as input to decoder during training) and backpropagation through time (updating weights based on gradients propagated through time steps) used for training; a training-step sketch follows this list
  • Objective is to minimize the difference between predicted output sequence and ground truth output sequence, typically using loss functions like cross-entropy or mean squared error
  • Regularization techniques (dropout, L1/L2 regularization) and optimization algorithms (Adam, SGD) employed to improve generalization and convergence during training
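
A hedged sketch of one training step with teacher forcing, reusing the `Encoder` and `Decoder` classes sketched above; the `sos_id`/`pad_id` conventions and the per-step loss accumulation are assumptions of this example, not prescribed by the text.

```python
# One training step with teacher forcing and cross-entropy loss (sketch).
import torch
import torch.nn as nn

def train_step(encoder, decoder, optimizer, src_batch, tgt_batch, sos_id=1, pad_id=0):
    optimizer.zero_grad()
    criterion = nn.CrossEntropyLoss(ignore_index=pad_id)

    # Encode the full source sequence; the final hidden state is the context vector.
    _, hidden = encoder(src_batch)

    batch_size, tgt_len = tgt_batch.shape
    decoder_input = torch.full((batch_size, 1), sos_id, dtype=torch.long)
    loss = 0.0

    # Teacher forcing: feed the ground-truth token at each step, not the model's own prediction.
    for t in range(tgt_len):
        logits, hidden = decoder(decoder_input, hidden)
        loss = loss + criterion(logits, tgt_batch[:, t])
        decoder_input = tgt_batch[:, t].unsqueeze(1)   # next input is the true token

    # Backpropagation through time: gradients flow back through every decoding and
    # encoding step before the optimizer updates the weights.
    loss.backward()
    optimizer.step()
    return loss.item() / tgt_len
```

An optimizer such as Adam would be constructed over both modules, e.g. `optimizer = torch.optim.Adam(list(encoder.parameters()) + list(decoder.parameters()))`, and dropout or weight decay could be added for regularization as the last bullet notes.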

Encoder: Input Processing and Context Vector

Sequential Input Processing

  • Encoder takes input sequence (words, characters, or tokens) and processes it sequentially
  • At each time step, encoder reads input token and updates hidden state based on current input and previous hidden state, capturing contextual information up to that point
  • Implemented using RNNs (LSTM or GRU) to handle long-term dependencies and mitigate vanishing gradient problem
  • Example: In machine translation, encoder processes source language sentence word by word, updating hidden state at each step (a step-by-step sketch follows this list)
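
To make the per-token update explicit, here is an illustrative loop using a `GRUCell` rather than a full `nn.GRU`; the sentence length, vocabulary size, and dimensions are made up for the sketch.

```python
# Step-by-step view of the encoder: one hidden-state update per input token (sketch).
import torch
import torch.nn as nn

emb_dim, hidden_dim = 256, 512
embedding = nn.Embedding(10_000, emb_dim)        # assumed source vocabulary size
cell = nn.GRUCell(emb_dim, hidden_dim)

src_tokens = torch.randint(0, 10_000, (1, 7))    # e.g. a 7-word source sentence
hidden = torch.zeros(1, hidden_dim)              # initial hidden state

for t in range(src_tokens.size(1)):
    token_emb = embedding(src_tokens[:, t])      # current input token
    hidden = cell(token_emb, hidden)             # new state from current input + previous state

context_vector = hidden   # after the last step: a summary of the whole input sequence
```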

Context Vector Generation

  • Final hidden state of encoder, referred to as the context vector or thought vector, represents a compressed summary of the entire input sequence
  • Captures essential information from input sequence relevant for generating output sequence
  • In some variations (attention mechanisms), encoder may generate sequence of hidden states instead of single context vector, allowing decoder to selectively focus on different parts of input during decoding
  • Context vector serves as initial hidden state for decoder, providing it with necessary information to generate output sequence
  • Example: In text summarization, context vector encapsulates key information from input article to guide generation of summary (a short continuation of the encoder sketch follows this list)
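
A short continuation of the sketch above, showing the context vector being handed to the decoder as its initial hidden state; the `unsqueeze` only adds the layer dimension that `nn.GRU` expects. With attention, the full sequence of encoder hidden states would be kept as well.

```python
# Assumed continuation of the GRUCell sketch: seed the decoder with the context vector.
decoder_hidden = context_vector.unsqueeze(0)   # (num_layers=1, batch=1, hidden_dim)
```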

Decoder: Output Sequence Generation

Token-by-Token Generation

  • Decoder takes context vector generated by encoder as initial hidden state and generates output sequence token by token
  • At each time step, decoder predicts next token in output sequence based on current hidden state and previously generated tokens
  • Uses softmax layer to produce probability distribution over possible output tokens at each step, allowing generation of most likely token
  • During training, teacher forcing (feeding ground truth output tokens as inputs to decoder) helps learn to generate correct output sequence
  • During inference, decoder generates output sequence step by step, using previously generated tokens as inputs to predict next token until stop condition is met (generating end-of-sequence token or reaching maximum sequence length); a greedy-decoding sketch follows this list
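
A sketch of greedy, token-by-token decoding at inference time, again assuming the `Encoder`/`Decoder` interface from the earlier sketch; `sos_id`, `eos_id`, and `max_len` are assumed conventions.

```python
# Greedy decoding: generate one token at a time until EOS or the length limit (sketch).
import torch

@torch.no_grad()
def greedy_decode(encoder, decoder, src_tokens, sos_id=1, eos_id=2, max_len=50):
    _, hidden = encoder(src_tokens)                       # context vector from the encoder
    token = torch.tensor([[sos_id]], dtype=torch.long)    # start-of-sequence token
    generated = []

    for _ in range(max_len):                              # stop at the length limit...
        logits, hidden = decoder(token, hidden)
        probs = torch.softmax(logits, dim=-1)             # distribution over output vocabulary
        token = probs.argmax(dim=-1, keepdim=True)        # pick the most likely token
        if token.item() == eos_id:                        # ...or at the end-of-sequence token
            break
        generated.append(token.item())
        # The predicted token is fed back in as the next input (no teacher forcing here).

    return generated
```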

Attention Mechanisms

  • Decoder can incorporate attention mechanisms to attend to different parts of input sequence at each decoding step
  • Attention allows decoder to focus on relevant information from input for generating current output token
  • Computes attention weights that indicate importance of each input token for generating current output token
  • Attention weights used to compute weighted sum of encoder hidden states, generating context vector specific to current decoding step
  • Enables decoder to selectively focus on different parts of input sequence as it generates output, improving performance on tasks like machine translation and text summarization
  • Example: In machine translation, attention allows decoder to align each translated word with relevant words in source sentence (a minimal attention sketch follows this list)
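
A minimal sketch of attention at a single decoding step. The bullets above are mechanism-agnostic, so this uses simple dot-product scoring rather than the additive (Bahdanau-style) form; tensor shapes are noted in the comments.

```python
# Dot-product attention for one decoding step (sketch).
import torch

def attention_step(decoder_hidden, encoder_outputs):
    # decoder_hidden:  (batch, hidden_dim)          current decoder state
    # encoder_outputs: (batch, src_len, hidden_dim) one hidden state per input token
    scores = torch.bmm(encoder_outputs, decoder_hidden.unsqueeze(2)).squeeze(2)  # (batch, src_len)
    weights = torch.softmax(scores, dim=-1)        # importance of each input token
    # Weighted sum of encoder states: a context vector specific to this decoding step.
    context = torch.bmm(weights.unsqueeze(1), encoder_outputs).squeeze(1)        # (batch, hidden_dim)
    return context, weights
```

The returned `context` would typically be combined with the decoder state before the output projection, and the `weights` are what alignment visualizations plot.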

Advantages of Encoder-Decoder Architecture

Handling Variable-Length Sequences

  • Well-suited for tasks involving mapping input sequences to output sequences (machine translation, text summarization, speech recognition)
  • By encoding input sequence into fixed-length context vector, architecture can handle variable-length input sequences and capture essential information
  • Decoder's ability to generate variable-length output sequences based on context vector allows for flexible and dynamic output generation
  • Example: In speech recognition, encoder can process audio input of varying lengths and decoder can generate text transcriptions of corresponding lengths (a padding/packing sketch follows this list)
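
In practice, variable-length inputs are usually padded to a common length within a batch; in PyTorch they can also be packed so the RNN skips the padding. A small sketch with made-up token ids:

```python
# Handling two sequences of different lengths in one batch (sketch).
import torch
import torch.nn as nn
from torch.nn.utils.rnn import pad_sequence, pack_padded_sequence

# Two "sentences" of different lengths, already converted to token ids (made up).
seqs = [torch.tensor([5, 8, 13, 2]), torch.tensor([7, 3])]
lengths = torch.tensor([len(s) for s in seqs])

padded = pad_sequence(seqs, batch_first=True, padding_value=0)   # (2, 4); shorter one padded

embedding = nn.Embedding(100, 32, padding_idx=0)   # assumed vocabulary of 100, embedding size 32
rnn = nn.GRU(32, 64, batch_first=True)

packed = pack_padded_sequence(embedding(padded), lengths,
                              batch_first=True, enforce_sorted=False)
_, hidden = rnn(packed)
print(hidden.shape)   # torch.Size([1, 2, 64]): one context vector per variable-length input
```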

Learning Complex Mappings

  • Can learn complex mappings between input and output sequences, capturing dependencies and relationships between elements of sequences
  • Use of RNNs or variants in encoder and decoder enables capturing and exploiting sequential nature of data
  • Architecture can be extended with additional mechanisms (attention) to improve model's ability to focus on relevant parts of input during decoding
  • Has achieved state-of-the-art performance on various sequence-to-sequence tasks, demonstrating effectiveness in handling sequential data
  • Example: In machine translation, encoder-decoder architecture can learn to map sentences from one language to another, capturing linguistic structures and semantic meanings

Key Terms to Review (16)

Attention mechanism: An attention mechanism is a technique in neural networks that allows models to focus on specific parts of input data when producing an output. This is particularly useful for tasks like translation or summarization, where not all input tokens contribute equally to every output token. By dynamically weighting the importance of different inputs, the attention mechanism helps improve the performance and interpretability of models, enhancing their ability to capture context and relationships within data.
Backpropagation: Backpropagation is an algorithm used for training artificial neural networks, allowing them to learn by minimizing the error between predicted and actual outcomes. It works by calculating the gradient of the loss function with respect to each weight by applying the chain rule, effectively updating the weights in the network to improve performance. This process is fundamental in various neural network architectures, enabling efficient learning in models ranging from basic feedforward networks to complex encoder-decoder structures and convolutional networks used for natural language processing tasks.
Bahdanau et al.: Bahdanau et al. refers to a groundbreaking approach in natural language processing that introduced an attention mechanism within the encoder-decoder architecture for neural machine translation. This method allowed models to focus on different parts of the input sequence dynamically while generating output, leading to improved translation quality and more fluent outputs. Their work laid the foundation for modern approaches to machine translation and has influenced various applications beyond translation.
Beam Search: Beam search is an optimization algorithm used in various natural language processing tasks, particularly in sequence generation. It enhances the decoding process by maintaining a fixed number of best candidate sequences, known as the beam width, at each time step, which helps balance between exploring new paths and exploiting known good paths. This method is crucial in the context of generating coherent and contextually relevant outputs from models like encoder-decoder architectures.
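
As a compact illustration (not part of the original text), here is a beam-search sketch over a decoder with the same `(logits, hidden)` step interface assumed in the earlier examples; `beam_width`, `sos_id`, and `eos_id` are assumptions.

```python
# Beam search: keep the beam_width best partial sequences at each step (sketch).
import torch

@torch.no_grad()
def beam_search(decoder, init_hidden, sos_id=1, eos_id=2, beam_width=3, max_len=50):
    # Each beam entry: (cumulative log-probability, token list, decoder hidden state)
    beams = [(0.0, [sos_id], init_hidden)]

    for _ in range(max_len):
        candidates = []
        for score, tokens, hidden in beams:
            if tokens[-1] == eos_id:                  # finished beams are carried forward unchanged
                candidates.append((score, tokens, hidden))
                continue
            inp = torch.tensor([[tokens[-1]]], dtype=torch.long)
            logits, new_hidden = decoder(inp, hidden)
            log_probs = torch.log_softmax(logits, dim=-1).squeeze(0)
            top_lp, top_ids = log_probs.topk(beam_width)
            for lp, idx in zip(top_lp.tolist(), top_ids.tolist()):
                candidates.append((score + lp, tokens + [idx], new_hidden))
        # Keep only the beam_width best partial sequences.
        beams = sorted(candidates, key=lambda c: c[0], reverse=True)[:beam_width]
        if all(t[-1] == eos_id for _, t, _ in beams):
            break
    return beams[0][1]   # highest-scoring sequence
```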
BLEU Score: BLEU (Bilingual Evaluation Understudy) score is a metric used to evaluate the quality of text generated by machine translation systems by comparing it to one or more reference translations. This score measures how closely the generated output aligns with human translations, focusing on n-gram overlap to determine accuracy and fluency, making it a vital tool for assessing various applications in natural language processing.
Context vector: A context vector is a fixed-size representation of the relevant information extracted from input data, typically used in sequence-to-sequence models like those found in natural language processing. It acts as a summary of the input sequence, allowing the decoder to generate an appropriate output sequence based on this condensed information. This is crucial for maintaining coherence and relevance in tasks like translation or summarization.
Decoder: A decoder is a neural network component that converts encoded representations into human-readable outputs, commonly used in tasks like translation, summarization, and text generation. It takes the compressed information from the encoder and generates a sequence of outputs, often relying on attention mechanisms to focus on relevant parts of the input. This process is essential for transforming abstract representations into coherent and contextually accurate results.
Encoder: An encoder is a component in machine learning models that transforms input data into a different representation, typically in a compressed format. This process enables the model to capture important features and patterns within the data, which are essential for subsequent tasks like decoding or classification. Encoders play a critical role in architectures that utilize attention mechanisms, as well as in systems designed for tasks like translation or summarization.
Gradient descent: Gradient descent is an optimization algorithm used to minimize a function by iteratively moving toward the steepest descent direction, which is determined by the negative gradient of the function. This process is crucial for training machine learning models, as it helps in adjusting the weights of the model to reduce the error in predictions. By finding local minima in the loss function landscape, gradient descent enables models to learn from data and improve their performance over time.
Machine translation: Machine translation is the process of using algorithms and computational methods to automatically translate text or speech from one language to another. This technology is crucial for applications that involve real-time communication, information retrieval, and understanding content in multiple languages.
Rouge Score: The Rouge score is a set of metrics used to evaluate the quality of summaries by comparing them to reference summaries. It mainly focuses on recall, precision, and F1 score based on n-grams, which helps measure how much overlap there is between the generated and reference text. This evaluation method is particularly important for tasks like summarization, where assessing the relevance and informativeness of content is crucial.
Seq2seq: Seq2seq, short for sequence-to-sequence, is a neural network architecture designed for transforming one sequence of data into another, making it especially useful for tasks like language translation and text summarization. This model consists of two main components: an encoder that processes the input sequence and a decoder that generates the output sequence, allowing it to effectively handle variable-length inputs and outputs. This architecture leverages recurrent neural networks (RNNs) or other sequence models to capture the dependencies between elements in the sequences.
Teacher forcing: Teacher forcing is a training technique used in sequence-to-sequence models, where the model's previous predictions are replaced by the actual target outputs during training. This method helps the model learn faster and more accurately by providing it with correct information at each step, leading to improved performance in generating sequences. By using teacher forcing, the model can better learn the dependencies and relationships within the data, which is especially important in tasks like language translation.
Text summarization: Text summarization is the process of reducing a text document to its essential elements while preserving its overall meaning. It plays a crucial role in helping users quickly grasp information, especially in an age of information overload, and is often achieved through techniques that leverage sentence and document embeddings, encoder-decoder architectures, language models for text generation, and named entity recognition.
Transformer: A transformer is a deep learning architecture that has become fundamental in natural language processing. It uses self-attention mechanisms to weigh the significance of different words in a sentence, allowing it to capture contextual relationships more effectively than previous models. This structure enables the transformer to generate embeddings for sentences and documents and supports various applications, including translation and summarization.
Vaswani et al.: Vaswani et al. refers to the group of researchers who introduced the Transformer model in their groundbreaking paper, 'Attention is All You Need,' published in 2017. This model revolutionized natural language processing by using self-attention mechanisms, allowing for improved handling of long-range dependencies in text data and eliminating the need for recurrent neural networks. The Transformer architecture laid the foundation for many subsequent advances in machine translation and other NLP tasks.