Vision Transformer With Sparse Scan Prior