On the benefits of pixel-based hierarchical policies for task generalization
Reinforcement learning practitioners typically avoid hierarchical policies, especially in image-based observation spaces. Typically, the performance improvement on a single task ...