Optimizing Large Language Modeling Sla