Computer Science > Machine Learning

arXiv:2204.10419 (cs)

[Submitted on 21 Apr 2022 (v1), last revised 20 Jan 2023 (this version, v2)]

Title:Learning Sequential Latent Variable Models from Multimodal Time Series Data

Authors:Oliver Limoyo, Trevor Ablett, Jonathan Kelly

View PDF

Abstract:Sequential modelling of high-dimensional data is an important problem that appears in many domains including model-based reinforcement learning and dynamics identification for control. Latent variable models applied to sequential data (i.e., latent dynamics models) have been shown to be a particularly effective probabilistic approach to solve this problem, especially when dealing with images. However, in many application areas (e.g., robotics), information from multiple sensing modalities is available -- existing latent dynamics methods have not yet been extended to effectively make use of such multimodal sequential data. Multimodal sensor streams can be correlated in a useful manner and often contain complementary information across modalities. In this work, we present a self-supervised generative modelling framework to jointly learn a probabilistic latent state representation of multimodal data and the respective dynamics. Using synthetic and real-world datasets from a multimodal robotic planar pushing task, we demonstrate that our approach leads to significant improvements in prediction and representation quality. Furthermore, we compare to the common learning baseline of concatenating each modality in the latent space and show that our principled probabilistic formulation performs better. Finally, despite being fully self-supervised, we demonstrate that our method is nearly as effective as an existing supervised approach that relies on ground truth labels.

Comments:	In: Petrovic, I., Menegatti, E., Marković, I. (eds) Intelligent Autonomous Systems 17. IAS 2022. Lecture Notes in Networks and Systems, vol 577. Springer, Cham
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2204.10419 [cs.LG]
	(or arXiv:2204.10419v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2204.10419
Related DOI:	https://doi.org/10.1007/978-3-031-22216-0_35

Submission history

From: Oliver Limoyo [view email]
[v1] Thu, 21 Apr 2022 21:59:24 UTC (8,255 KB)
[v2] Fri, 20 Jan 2023 07:11:44 UTC (8,252 KB)

Computer Science > Machine Learning

Title:Learning Sequential Latent Variable Models from Multimodal Time Series Data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning Sequential Latent Variable Models from Multimodal Time Series Data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators