WebRL: A Self-Evolving Online Curriculum Reinforcement Learning Framework for Training High-Performing Web Agents with Open LLMs
Large language models (LLMs) have demonstrated exceptional capabilities in understanding human language, reasoning, and knowledge acquisition, suggesting their potential to ...