Computer Science > Information Theory

arXiv:1810.07014 (cs)

[Submitted on 14 Oct 2018 (v1), last revised 2 Jan 2020 (this version, v2)]

Title:Bregman Divergence Bounds and Universality Properties of the Logarithmic Loss

Authors:Amichai Painsky, Gregory W. Wornell

View PDF

Abstract:A loss function measures the discrepancy between the true values and their estimated fits, for a given instance of data. In classification problems, a loss function is said to be proper if a minimizer of the expected loss is the true underlying probability. We show that for binary classification, the divergence associated with smooth, proper, and convex loss functions is upper bounded by the Kullback-Leibler (KL) divergence, to within a normalization constant. This implies that by minimizing the logarithmic loss associated with the KL divergence, we minimize an upper bound to any choice of loss from this set. As such the logarithmic loss is universal in the sense of providing performance guarantees with respect to a broad class of accuracy measures. Importantly, this notion of universality is not problem-specific, enabling its use in diverse applications, including predictive modeling, data clustering and sample complexity analysis. Generalizations to arbitrary finite alphabets are also developed. The derived inequalities extend several well-known $f$-divergence results.

Comments:	arXiv admin note: substantial text overlap with arXiv:1805.03804
Subjects:	Information Theory (cs.IT)
Cite as:	arXiv:1810.07014 [cs.IT]
	(or arXiv:1810.07014v2 [cs.IT] for this version)
	https://doi.org/10.48550/arXiv.1810.07014

Submission history

From: Amichai Painsky [view email]
[v1] Sun, 14 Oct 2018 05:42:10 UTC (354 KB)
[v2] Thu, 2 Jan 2020 06:25:20 UTC (237 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.IT

< prev | next >

new | recent | 2018-10

Change to browse by:

cs
math
math.IT

References & Citations

DBLP - CS Bibliography

listing | bibtex

Amichai Painsky
Gregory W. Wornell

export BibTeX citation

Computer Science > Information Theory

Title:Bregman Divergence Bounds and Universality Properties of the Logarithmic Loss

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Theory

Title:Bregman Divergence Bounds and Universality Properties of the Logarithmic Loss

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators