Score Entropy Policy Optimization