Text Preprocessing For Llm