Computer Science > Machine Learning

arXiv:2301.13465 (cs)

[Submitted on 31 Jan 2023]

Title: GDOD: Effective Gradient Descent using Orthogonal Decomposition for Multi-Task Learning

Authors: Xin Dong, Ruize Wu, Chao Xiong, Hai Li, Lei Cheng, Yong He, Shiyou Qian, Jian Cao, Linjian Mo

Abstract: Multi-task learning (MTL) aims to solve multiple related tasks simultaneously and has grown rapidly in recent years. However, MTL models often suffer performance degradation from negative transfer, which arises when several tasks are learned simultaneously. Some related work attributes the problem to conflicting gradients. In this case, useful gradient updates must be selected carefully for all tasks. To this end, we propose a novel optimization approach for MTL, named GDOD, which manipulates the gradients of each task using an orthogonal basis decomposed from the span of all task gradients. GDOD explicitly decomposes gradients into task-shared and task-conflict components and adopts a general update rule that avoids interference across all task gradients. This allows the update directions to be guided by the task-shared components. Moreover, we prove the convergence of GDOD theoretically under both convex and non-convex assumptions. Experimental results on several multi-task datasets not only demonstrate that GDOD significantly improves existing MTL models but also show that our algorithm outperforms state-of-the-art optimization methods in terms of AUC and Logloss metrics.
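
The decomposition idea in the abstract can be illustrated with a minimal NumPy sketch. This is not the paper's update rule, which the abstract does not specify. It assumes the orthonormal basis is obtained by a QR factorization of the stacked task gradients, and it assumes a basis direction counts as "task-conflict" when task gradients have coefficients of opposite sign along it. The function name `shared_update` and the tolerance `tol` are hypothetical.

```python
import numpy as np


def shared_update(task_grads, tol=1e-12):
    """Combine task gradients, keeping only non-conflicting basis directions.

    task_grads: array of shape (T, d), one flattened gradient per task.
    Returns (update, conflict), where update has shape (d,) and conflict is
    a boolean mask over the basis directions.
    Assumption: "conflict" means opposite-sign coefficients along a direction.
    """
    G = np.asarray(task_grads, dtype=float)

    # Orthonormal basis for the span of the task gradients.
    # Reduced QR of G^T gives Q with shape (d, min(d, T)).
    Q, _ = np.linalg.qr(G.T)

    # Coefficient of every task gradient along every basis direction: (T, k).
    coeffs = G @ Q

    # A direction is in conflict if some task pushes along it while another
    # task pushes against it.
    has_pos = (coeffs > tol).any(axis=0)
    has_neg = (coeffs < -tol).any(axis=0)
    conflict = has_pos & has_neg

    # Zero the conflicting components, sum the remaining (shared) ones over
    # tasks, and map the result back to parameter space.
    shared = np.where(conflict, 0.0, coeffs)
    update = Q @ shared.sum(axis=0)
    return update, conflict


if __name__ == "__main__":
    g1 = np.array([1.0, 0.0])
    g2 = np.array([-1.0, 1.0])
    update, conflict = shared_update(np.stack([g1, g2]))
    print(update, conflict)
```

In this sketch the resulting update has a non-negative inner product with every task gradient, because each retained direction carries coefficients of a single sign. Note that a QR-based basis depends on the order in which the task gradients are stacked; how GDOD itself constructs its basis is described in the paper, not here.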

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2301.13465 [cs.LG]
	(or arXiv:2301.13465v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2301.13465
Journal reference:	Proceedings of the 31st ACM International Conference on Information & Knowledge Management. 2022: 386-395
Related DOI:	https://doi.org/10.1145/3511808.3557333

Submission history

From: Ruize Wu
[v1] Tue, 31 Jan 2023 08:08:24 UTC (951 KB)