Building a deep neural network that functions as part of an end-to-end automatic speech recognition pipeline