Vision Language Model Github Student