Explain Self-Attention in Deep Learning