Hybrid Flow RLHF in AI