ENH Add Multiclass Brier Score Loss #22046

ogrisel · 2021-12-21T14:24:36Z

Resolves #16055.
This PR updates #18699 by @aggvarun01 after a merge with main and resolves merge conflicts. I do not have the permissions to push directly to the original branch, and opening a sub-PR pointing to #18699 would lead to an unreadable diff because of the one-year merge sync.

I also added a changelog entry and demonstrated the new function in the multiclass calibration example.

@aggvarun01 if you want, feel free to pull the last commit from this branch into your branch. Alternatively, we can finalize the review here.

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

ogrisel

ogrisel left a comment

+1 for merge once https://github.com/scikit-learn/scikit-learn/pull/22046/files#r1943364780 is addressed (and conflicts are resolved).

@lorentzenchr do you agree with the way this PR evolved, in particular the points I raised in #22046 (comment)?

ogrisel

Still +1 for merge (I cannot approve the PR on GitHub because I am the creator of the PR).

doc/whats_new/upcoming_changes/sklearn.metrics/22046.feature.rst

lorentzenchr

I would have preferred to have all the unrelated input validation and test changes in a separate PR.

doc/modules/model_evaluation.rst

lorentzenchr · 2025-03-06T22:16:34Z

sklearn/metrics/tests/test_common.py

-    if name in METRICS_REQUIRE_POSITIVE_Y:
-        y_true, y_pred = _require_positive_targets(y_true, y_pred)
+    always_symmetric = True
+    for _ in range(5):


A comment would help: why the loop? (It makes lucky test passes very unlikely.)
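To make the point about the loop concrete, here is a minimal sketch, not the actual scikit-learn test. The helper `looks_symmetric` and the two toy metrics are hypothetical names introduced only for this illustration. A single random draw can be misleading (for instance, two binary vectors may happen to contain the same number of ones), so the check is repeated over several draws and stops at the first counterexample.

```python
import random


def looks_symmetric(metric, n_trials=5, n_samples=20, seed=0):
    """Return True if metric(a, b) == metric(b, a) on every random draw."""
    rng = random.Random(seed)
    for _ in range(n_trials):
        a = [rng.randint(0, 1) for _ in range(n_samples)]
        b = [rng.randint(0, 1) for _ in range(n_samples)]
        if metric(a, b) != metric(b, a):
            # One counterexample is enough to rule out symmetry.
            return False
    return True


def zero_one_loss(y_true, y_pred):
    """Fraction of disagreeing positions; symmetric in its arguments."""
    return sum(t != p for t, p in zip(y_true, y_pred)) / len(y_true)


def precision(y_true, y_pred):
    """TP / (TP + FP); swapping the arguments gives recall instead."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    predicted_pos = sum(y_pred)
    return tp / predicted_pos if predicted_pos else 0.0
```

With one trial, `precision` would look symmetric whenever both vectors contain the same number of ones; each extra trial multiplies the chance of such a coincidence by a small factor.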

lorentzenchr · 2025-03-06T22:18:42Z

sklearn/metrics/tests/test_common.py

+    if always_symmetric:  # pragma: no cover
+        raise ValueError(f"{name} seems to be symmetric")


Suggested change

    if always_symmetric:  # pragma: no cover
        raise ValueError(f"{name} seems to be symmetric")

    if not always_symmetric:
        raise ValueError(f"{name} seems to be asymmetric")

There should be a test for this, e.g. applying test_not_symmetric_metric to log_loss and checking that it fails.

The meta test test_symmetry_tests in 6de5e13 checks test_symmetric_metric and test_not_symmetric_metric.
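For readers who have not seen the pattern, a meta test of this kind could look roughly like the sketch below. It is an illustration under assumptions, not the code from 6de5e13: `check_not_symmetric`, `squared_error`, `relative_error`, and `meta_test` are hypothetical names, and only the error message mirrors the diff above.

```python
import random


def check_not_symmetric(name, metric, n_trials=5, seed=0):
    """Raise ValueError if `metric` looks symmetric on every random draw."""
    rng = random.Random(seed)
    always_symmetric = True
    for _ in range(n_trials):
        y_true = [rng.random() for _ in range(20)]
        y_pred = [rng.random() for _ in range(20)]
        if metric(y_true, y_pred) != metric(y_pred, y_true):
            always_symmetric = False
            break
    if always_symmetric:
        raise ValueError(f"{name} seems to be symmetric")


def squared_error(y_true, y_pred):
    # Symmetric: (t - p) ** 2 == (p - t) ** 2 for every pair.
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def relative_error(y_true, y_pred):
    # Not symmetric: the denominator depends only on y_true.
    terms = (abs(t - p) / (abs(t) + 1.0) for t, p in zip(y_true, y_pred))
    return sum(terms) / len(y_true)


def meta_test():
    """Check the checker: accept an asymmetric metric, reject a symmetric one."""
    check_not_symmetric("relative_error", relative_error)  # should not raise
    try:
        check_not_symmetric("squared_error", squared_error)
    except ValueError as exc:
        return str(exc)
    raise AssertionError("check_not_symmetric accepted a symmetric metric")
```

The point of the meta test is that a checker which never raises would otherwise go unnoticed, since every metric would trivially pass it.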

sklearn/metrics/_classification.py

lorentzenchr · 2025-03-06T22:27:53Z

sklearn/metrics/_classification.py

+    For :math:`N` observations labeled from :math:`C` possible classes, the Brier
+    score is defined as:
+
+    .. math::
+        \\frac{1}{N}\\sum_{i=1}^{N}\\sum_{c=1}^{C}(y_{ic} - \\hat{p}_{ic})^{2}


If I remember correctly, we try to avoid LaTeX in docstrings and just link to the user guide. If LaTeX is used, then only in the Notes section (this may be a numpy thing). @glemaitre may know better.

The math is moved to the Notes section in 58e5f18. If you feel that's too redundant with the User Guide, I can remove the Notes section.
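As a reading aid for the formula in the docstring diff above, here is a minimal pure-Python sketch of the same double sum. It is not the scikit-learn implementation; the function name `multiclass_brier_score` and its `labels` argument are assumptions made for this illustration, and input validation and sample weights are left out.

```python
def multiclass_brier_score(y_true, y_proba, labels):
    """Mean over samples of the summed squared differences between the
    one-hot encoded true label and the predicted class probabilities."""
    n_samples = len(y_true)
    total = 0.0
    for label, probs in zip(y_true, y_proba):
        for c, p in zip(labels, probs):
            y_ic = 1.0 if label == c else 0.0  # one-hot indicator
            total += (y_ic - p) ** 2
    return total / n_samples


y_true = ["a", "b", "c"]
y_proba = [
    [1.0, 0.0, 0.0],  # perfect prediction, contributes 0
    [0.5, 0.5, 0.0],  # contributes 0.25 + 0.25 = 0.5
    [0.0, 1.0, 0.0],  # confidently wrong, contributes 1 + 1 = 2
]
score = multiclass_brier_score(y_true, y_proba, labels=["a", "b", "c"])
```

Here `score` is (0 + 0.5 + 2) / 3. Under this definition each sample contributes between 0 and 2, and with two classes the inner sum equals twice the squared error on the positive-class probability.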

sklearn/metrics/_classification.py

Co-authored-by: Christian Lorentzen <lorentzen.ch@gmail.com> Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

antoinebaker · 2025-03-20T09:39:14Z

Thanks for the reviews @thomasjpfan and @lorentzenchr. I think the PR is ready for a final round of reviews.

antoinebaker · 2025-03-20T09:47:44Z

I would have preferred to have all the unrelated input validation and test changes in a separate PR.

Yes, sorry that we mixed the refactoring of the brier_score_loss and log_loss input validation with adding multiclass support for the Brier score. There will be a follow up PR to harmonize further the log_loss and brier_score_loss API and testing.

doc/modules/model_evaluation.rst

Co-authored-by: Christian Lorentzen <lorentzen.ch@gmail.com>

lorentzenchr · 2025-03-20T18:38:38Z

There will be a follow up PR to harmonize further the log_loss and brier_score_loss API and testing.

If you intend to do that, I would very much like to have the classification metrics better structured, i.e. putting log loss and Brier score at the top where they belong. This might also help with not needing to define things twice.

Co-authored-by: Varun Aggarwal <varunaggarwal@Varuns-MBP.fios-router.home> Co-authored-by: Antoine Baker <antoine.baker59@gmail.com>

Varun Aggarwal and others added 30 commits October 25, 2020 20:47

add multi-class support

f630718

fix swapped y_true y_prob

e08d4f4

fix docstring

eff8854

fix docstring

d864395

fix variable name spelling

32ab60a

add tests

6e73c0d

merge upstream

7ce3f85

import re

9cd4247

fix docstring

1369945

fix linting

a183d06

fix linting

08688d3

remove unused import

4f8a5f2

add multiclass_brier_score_loss

7b51433

add tests

d5c90bf

fix docstring

2243828

Merge remote-tracking branch 'upstream/master' into multiclass_brier_…

9893101

…score_loss

use f-strings

3e4465f

fix tests

eafda42

fix error message

038abf7

fix docstring

838f827

fix linting

5ef41c7

Update sklearn/metrics/_classification.py

4fb4c4f

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

Apply suggestions from code review

3260bf3

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

split tests

86d793e

add private function

411ec1a

add warning for labels

f84493c

Merge remote-tracking branch 'origin/main' into multiclass_brier_scor…

79f014d

…e_loss

Fix multiclass_brier_score_loss docstring sections order

50f50ef

Add entry in the changelog

884c434

Update multiclass calibration example

cdc4cc9

antoinebaker added 2 commits January 6, 2025 11:15

remove log_loss mention

1054b83

fix doctest

c163b6a

glemaitre mentioned this pull request Jan 7, 2025

feat: Design of EstimatorReport probabl-ai/skore#997

Merged

19 tasks

update test_common

01fa561

ogrisel commented Feb 5, 2025

antoinebaker and others added 4 commits February 6, 2025 11:56

Merge remote-tracking branch 'upstream/main' into multiclass_brier_sc…

e45a660

…ore_loss

return float

4e27bbf

changelog

a5b448b

Merge branch 'main' into multiclass_brier_score_loss

ef0bbe8

ogrisel commented Mar 6, 2025

doc/whats_new/upcoming_changes/sklearn.metrics/22046.feature.rst (outdated, resolved)

ogrisel added the Waiting for Reviewer label Mar 6, 2025

lorentzenchr reviewed Mar 6, 2025

thomasjpfan reviewed Mar 6, 2025

sklearn/metrics/_classification.py (resolved)

lorentzenchr removed the Waiting for Reviewer label Mar 7, 2025

antoinebaker and others added 6 commits March 17, 2025 11:18

Apply suggestions from code review

3595b8e

Co-authored-by: Christian Lorentzen <lorentzen.ch@gmail.com> Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

doc

58e5f18

symmetry tests

6de5e13

test y_proba with two columns

653d4ae

Merge branch 'main' into multiclass_brier_score_loss

242ee3e

Merge branch 'main' into multiclass_brier_score_loss

6359c7d

lorentzenchr approved these changes Mar 20, 2025

doc/modules/model_evaluation.rst (outdated, resolved)

doc/modules/model_evaluation.rst (outdated, resolved)

Apply suggestions from code review

e3e406c

Co-authored-by: Christian Lorentzen <lorentzen.ch@gmail.com>

lorentzenchr merged commit 318a282 into scikit-learn:main Mar 20, 2025
33 checks passed

ogrisel deleted the multiclass_brier_score_loss branch March 24, 2025 15:34

agriyakhetarpal mentioned this pull request Mar 26, 2025

DOC Use nightly WASM wheels for JupyterLite in the dev documentation #31085

Merged
