We introduce Hierarchical Equivariant Policy (HEP), a novel framework for efficient and generalizable robotic manipulation. HEP is built on two core ideas: a hierarchical policy structure and a new Frame Transfer interface that enables seamless generalization and robustness.
Hierarchical Policy Structure
- High-level policy: Responsible for global, long-horizon planning by predicting a “keypose” (i.e., a target 3D translation) that serves as a subgoal.
- Low-level policy: Generates fine-grained motion trajectories in a local coordinate frame anchored at the keypose.
This separation allows the high-level policy to focus on strategy while the low-level handles precise control, greatly reducing complexity.
Frame Transfer Interface
Instead of hard constraints, the high-level outputs a reference translation defining a local frame. The low-level operates relative to this frame, providing:
- Soft constraints: Local trajectory optimization.
- Passing Generalization Ability: We prove that high level's generalization ability can be passed to low level.