Multihead Self Attention Explained Variance Sklearn