Computer Science > Machine Learning

arXiv:2306.08173 (cs)

[Submitted on 13 Jun 2023 (v1), last revised 1 Mar 2024 (this version, v2)]

Title: Safeguarding Data in Multimodal AI: A Differentially Private Approach to CLIP Training

Authors: Alyssa Huang, Peihan Liu, Ryumei Nakada, Linjun Zhang, Wanrong Zhang

Abstract: The surge in multimodal AI's success has sparked concerns over data privacy in vision-and-language tasks. While CLIP has revolutionized multimodal learning through joint training on images and text, its potential to unintentionally disclose sensitive information necessitates the integration of privacy-preserving mechanisms. We introduce a differentially private adaptation of the Contrastive Language-Image Pretraining (CLIP) model that effectively addresses privacy concerns while retaining accuracy. Our proposed method, Dp-CLIP, is rigorously evaluated on benchmark datasets encompassing diverse vision-and-language tasks such as image classification and visual question answering. We demonstrate that our approach retains performance on par with the standard non-private CLIP model. Furthermore, we analyze our proposed algorithm under linear representation settings. We derive the convergence rate of our algorithm and show a trade-off between utility and privacy when gradients are clipped per-batch and the loss function does not satisfy smoothness conditions assumed in the literature for the analysis of DP-SGD.
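
The abstract refers to clipping gradients per-batch rather than per-example. As a rough illustration of that idea only, and not the authors' Dp-CLIP algorithm, the sketch below clips one whole-batch gradient to a fixed L2 norm, adds Gaussian noise, and takes a descent step. The function names, the parameters `clip_norm` and `noise_multiplier`, and the toy gradient are all hypothetical; calibrating the noise to a formal privacy guarantee is outside the scope of this sketch.

```python
import math
import random


def clip_to_norm(grad, clip_norm):
    """Rescale grad so that its L2 norm is at most clip_norm."""
    norm = math.sqrt(sum(g * g for g in grad))
    scale = min(1.0, clip_norm / norm) if norm > 0 else 1.0
    return [g * scale for g in grad]


def noisy_clipped_step(params, batch_grad, lr, clip_norm, noise_multiplier, rng):
    """One update: clip the whole-batch gradient, add Gaussian noise, descend.

    The noise standard deviation is noise_multiplier * clip_norm, an
    illustrative choice rather than a calibrated privacy setting.
    """
    clipped = clip_to_norm(batch_grad, clip_norm)
    sigma = noise_multiplier * clip_norm
    noisy = [g + rng.gauss(0.0, sigma) for g in clipped]
    return [p - lr * g for p, g in zip(params, noisy)]


rng = random.Random(0)
params = [0.5, -1.0, 2.0]
batch_grad = [3.0, 4.0, 0.0]  # toy batch gradient with L2 norm 5

# Clipping alone rescales the gradient onto the ball of radius clip_norm.
clipped = clip_to_norm(batch_grad, clip_norm=1.0)

# With the noise switched off, the step is plain clipped gradient descent.
new_params = noisy_clipped_step(
    params, batch_grad, lr=0.1, clip_norm=1.0, noise_multiplier=0.0, rng=rng
)
```

Clipping the aggregate gradient suits a contrastive loss, where each example's contribution depends on the other examples in the batch, so a per-example gradient is not naturally defined.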

Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Information Theory (cs.IT); Machine Learning (stat.ML)
Cite as:	arXiv:2306.08173 [cs.LG]
	(or arXiv:2306.08173v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.08173

Submission history

From: Peihan Liu
[v1] Tue, 13 Jun 2023 23:32:09 UTC (756 KB)
[v2] Fri, 1 Mar 2024 04:24:04 UTC (772 KB)
