How To Test And Evaluate GPT-4 Models