Vaswani Transformer Architecture Neural Network