Language Model Evaluation In R