The most basic LSTM tagger model in pytorch; explain relationship between nll loss, cross entropy loss and softmax function.