PyTorch implementation of DeFINE word embeddings with AWD-LSTM for language modeling. The input and output embeddings for AWD-LSTMM are tied