Text-Guided Vision Large Model