Datasets for Large Language Models Tutorial