arXiv:2203.04439 (cs)

[Submitted on 8 Mar 2022]

Title: $\mathrm{SO}(2)$-Equivariant Reinforcement Learning

Authors: Dian Wang, Robin Walters, Robert Platt

Abstract: Equivariant neural networks enforce symmetry within the structure of their convolutional layers, resulting in a substantial improvement in sample efficiency when learning an equivariant or invariant function. Such models are applicable to robotic manipulation learning, which can often be formulated as a rotationally symmetric problem. This paper studies equivariant model architectures in the context of $Q$-learning and actor-critic reinforcement learning. We identify equivariant and invariant characteristics of the optimal $Q$-function and the optimal policy and propose equivariant DQN and SAC algorithms that leverage this structure. We present experiments that demonstrate that our equivariant versions of DQN and SAC can be significantly more sample efficient than competing algorithms on an important class of robotic manipulation problems.
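
The symmetry property described in the abstract can be illustrated with a small numerical sketch. It is not the authors' architecture: the paper builds equivariance into the convolutional layers, whereas this example swaps in plain group averaging over the four 90-degree rotations (the cyclic group $C_4$, used here as a discrete stand-in for $\mathrm{SO}(2)$). The square image state, the four discrete actions, and all function names (`rotate_state`, `rotate_action`, `make_unconstrained_q`, `symmetrize`) are assumptions made for illustration only. The sketch checks that the symmetrized $Q$-function satisfies $Q(gs, ga) = Q(s, a)$ and that its greedy policy satisfies $\pi(gs) = g\,\pi(s)$.

```python
import numpy as np

N_ROT = 4  # C4: rotations by multiples of 90 degrees (assumed stand-in for SO(2))


def rotate_state(state, k):
    """Group action on states: rotate a square image by k * 90 degrees."""
    return np.rot90(state, k)


def rotate_action(action, k):
    """Group action on actions: cyclically shift one of 4 direction indices."""
    return (action + k) % N_ROT


def make_unconstrained_q(rng, size):
    """Hypothetical Q-function with no symmetry: a random linear map."""
    weights = rng.normal(size=(N_ROT, size * size))

    def q(state):
        return weights @ state.ravel()

    return q


def symmetrize(q):
    """Average q over the group so that Q(g s, g a) = Q(s, a) holds exactly."""

    def q_eq(state):
        out = np.zeros(N_ROT)
        for k in range(N_ROT):
            values = q(rotate_state(state, k))
            for a in range(N_ROT):
                out[a] += values[rotate_action(a, k)]
        return out / N_ROT

    return q_eq


rng = np.random.default_rng(0)
q_eq = symmetrize(make_unconstrained_q(rng, size=5))
state = rng.normal(size=(5, 5))

for k in range(N_ROT):
    q_rotated = q_eq(rotate_state(state, k))
    # Rotating the state permutes the Q-values (invariance of Q under g).
    assert np.allclose(q_rotated, np.roll(q_eq(state), k))
    # The greedy policy rotates with the state (equivariance of the policy).
    greedy = int(np.argmax(q_eq(state)))
    assert int(np.argmax(q_rotated)) == rotate_action(greedy, k)
```

Group averaging costs one forward pass per group element, which is one reason to prefer layers that are equivariant by construction, as the paper does.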

Comments:	Published at ICLR 2022
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2203.04439 [cs.RO]
	(or arXiv:2203.04439v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2203.04439

Submission history

From: Dian Wang
[v1] Tue, 8 Mar 2022 23:09:25 UTC (39,547 KB)
