Computer Science > Machine Learning

arXiv:2401.11666 (cs)

[Submitted on 22 Jan 2024]

Title:P2DT: Mitigating Forgetting in task-incremental Learning with progressive prompt Decision Transformer

Authors:Zhiyuan Wang, Xiaoyang Qu, Jing Xiao, Bokui Chen, Jianzong Wang

Abstract:Catastrophic forgetting poses a substantial challenge for managing intelligent agents controlled by a large model, causing performance degradation when these agents face new tasks. In our work, we propose a novel solution - the Progressive Prompt Decision Transformer (P2DT). This method enhances a transformer-based model by dynamically appending decision tokens during new task training, thus fostering task-specific policies. Our approach mitigates forgetting in continual and offline reinforcement learning scenarios. Moreover, P2DT leverages trajectories collected via traditional reinforcement learning from all tasks and generates new task-specific tokens during training, thereby retaining knowledge from previous studies. Preliminary results demonstrate that our model effectively alleviates catastrophic forgetting and scales well with increasing task environments.

Comments:	Accepted by the 49th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2401.11666 [cs.LG]
	(or arXiv:2401.11666v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2401.11666

Submission history

From: Zhiyuan Wang [view email]
[v1] Mon, 22 Jan 2024 02:58:53 UTC (1,469 KB)

Computer Science > Machine Learning

Title:P2DT: Mitigating Forgetting in task-incremental Learning with progressive prompt Decision Transformer

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:P2DT: Mitigating Forgetting in task-incremental Learning with progressive prompt Decision Transformer

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators