Language Vision Model