Hierarchical Reinforcement Learning Based Inference