Vision Language Models For Vision Tasks