DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body
DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body
Alberto Chiappa
EPFL
Alessandro Marin Vargas
EPFL
Alexander Mathis
EPFL
Neural Information Processing Systems (NeurIPS), 2022
[Code]

Reinforcement learning typically seeks to learn control policies in stable environments. Yet, real world scenarios require continuous adaptation. In particular, learning to locomote when the length and the thickness of different body parts vary is challenging, as the policy is required to adapt to the current configuration to successfully balance and advance the agent. We study this problem in four classical continuous control environments, augmented with morphological perturbations. We show that a control policy based on the proprioceptive state performs poorly with highly variable body configurations, while an (oracle) agent with access to a learned encoding of the perturbation performs significantly better. We introduce DMAP, a biologically-inspired, attention-based policy network architecture. It combines a distributed policy, with individual controllers for each joint, and an attention mechanism, to dynamically gate sensory information from different body parts. DMAP can be trained end-to-end in all the considered environments and perturbation intensities, overall matching or surpassing the performance of an oracle agent with access to the morphology information. Thus DMAP, implementing principles of control drawn from the biological world, provides a strong inductive bias for learning challenging sensorimotor tasks. Overall, our work corroborates the power of these principles in challenging locomotion tasks.


Paper and Bibtex

Citation
 
Alberto Chiappa, Alessandro Marin Vargas, Alexander Mathis. DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body. NeurIPS, 2022.

[Bibtex]
@misc{dmap2022,
    title={DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body},
    author={Chiappa, Alberto and Marin Vargas, Alessandro and
    Mathis, Alexander},
    year={2022},
    Booktitle={NeurIPS}
}