Large Vision Language Model Survey