Meet ONI: A Distributed Architecture for Simultaneous Reinforcement Learning Policies and Intrinsic Reward Learning with LLM Feedback
Reward functions play a crucial role in reinforcement learning (RL) systems, but their design presents significant challenges in balancing the ...