Transformer Model Architecture Deep Learning