Masked Multi-Head Attention in TensorFlow
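
Since the title names a specific technique, a minimal sketch may help fix ideas. It assumes causal (look-ahead) masking, self-attention over a single input, statically known sequence length and model width, and plain weight matrices passed in by the caller. The function names `causal_mask` and `masked_multihead_attention` and the parameters `wq`, `wk`, `wv`, `wo` are hypothetical, chosen for illustration only; this is one possible way to write it, not a reference implementation.

```python
import tensorflow as tf


def causal_mask(seq_len):
    """Lower-triangular matrix of ones: position i may attend to positions <= i."""
    ones = tf.ones((seq_len, seq_len))
    return tf.linalg.band_part(ones, -1, 0)  # shape (seq_len, seq_len)


def masked_multihead_attention(x, wq, wk, wv, wo, num_heads):
    """Causal multi-head self-attention (illustrative sketch).

    x:  (batch, seq_len, d_model), with static seq_len and d_model (assumption)
    wq, wk, wv, wo: (d_model, d_model) projection matrices (hypothetical names)
    Returns the output (batch, seq_len, d_model) and the attention weights
    (batch, num_heads, seq_len, seq_len).
    """
    seq_len = x.shape[1]
    d_model = x.shape[2]
    depth = d_model // num_heads  # per-head width; assumes d_model % num_heads == 0

    def split_heads(t):
        # (batch, seq_len, d_model) -> (batch, num_heads, seq_len, depth)
        t = tf.reshape(t, (-1, seq_len, num_heads, depth))
        return tf.transpose(t, perm=[0, 2, 1, 3])

    # Linear projections, then split into heads.
    q = split_heads(tf.einsum("bsd,de->bse", x, wq))
    k = split_heads(tf.einsum("bsd,de->bse", x, wk))
    v = split_heads(tf.einsum("bsd,de->bse", x, wv))

    # Scaled dot-product scores: (batch, num_heads, seq_len, seq_len).
    scores = tf.matmul(q, k, transpose_b=True)
    scores = scores / tf.math.sqrt(tf.cast(depth, scores.dtype))

    # Push scores for future positions to a large negative value so that
    # the softmax assigns them (numerically) zero weight.
    mask = causal_mask(seq_len)
    scores = scores + (1.0 - mask) * -1e9
    weights = tf.nn.softmax(scores, axis=-1)

    # Weighted sum of values, then merge the heads back together.
    context = tf.matmul(weights, v)  # (batch, num_heads, seq_len, depth)
    context = tf.transpose(context, perm=[0, 2, 1, 3])
    context = tf.reshape(context, (-1, seq_len, d_model))
    return tf.einsum("bsd,de->bse", context, wo), weights
```

A call might look like `out, weights = masked_multihead_attention(x, wq, wk, wv, wo, num_heads=2)` with `x` of shape `(2, 5, 8)` and each weight matrix of shape `(8, 8)`. In practice the built-in `tf.keras.layers.MultiHeadAttention` layer, which accepts an `attention_mask` argument, is likely a better starting point than hand-written projections.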