User Testing Large Language Evaluation