TST Extend tests for `scipy.sparse.*array` in `sklearn/ensemble/tests/test_weight_boosting.py` #27148

yuanx749 · 2023-08-24T06:23:12Z

Towards #27090.

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

…st_weight_boosting.py

github-actions · 2023-08-24T06:24:56Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: bc7839b. Link to the linter CI: here}

ogrisel

Here are suggestions to improve variable names to make the intentions of the tests easier to grasp.

Otherwise, LGTM.

ogrisel · 2023-08-24T09:02:05Z

sklearn/ensemble/tests/test_weight_boosting.py

+    sparse_results = sparse_classifier.staged_decision_function(X_test_sparse)
+    dense_results = dense_classifier.staged_decision_function(X_test)
+    for sprase_res, dense_res in zip(sparse_results, dense_results):
+        assert_array_almost_equal(sprase_res, dense_res)


While we are at it, let's fix the typo: sprase => sparse.

Furthermore, the names "sparse_results" and "sparse_res" are confusing. Those are not sparse out datastructures but results of a classifier that fits and predicts on sparse inputs datastructures.

I think we should rename those to dense_clf_results / sparse_clf_results instead (and similarly for the "_res" variables).

ogrisel · 2023-08-24T09:06:00Z

sklearn/ensemble/tests/test_weight_boosting.py

+
+
+@pytest.mark.parametrize(
+    "sparse_container, sparse_type",


Same comment for sparse_type.

ogrisel · 2023-08-24T09:06:32Z

sklearn/ensemble/tests/test_weight_boosting.py

@@ -308,7 +314,20 @@ def test_sample_weights_infinite():
        clf.fit(iris.data, iris.target)


-def test_sparse_classification():
+@pytest.mark.parametrize(
+    "sparse_container, sparse_type",


Please rename sparse_type to expected_internal_type.

yuanx749 · 2023-08-24T10:13:30Z

As per your suggestions, I changed the variable names to be more clear. @ogrisel

ogrisel

Lgtm!

OmarManzoor · 2023-08-24T13:41:01Z

sklearn/ensemble/tests/test_weight_boosting.py

+    # Verify sparsity of data is maintained during training
+    types = [i.data_type_ for i in sparse_classifier.estimators_]
+
+    assert all([t == expected_internal_type for t in types])


Thanks for the PR @yuanx749! I just have a question regarding fixing the expected type for each parametrized case. Previously we were checking whether we have either csc_matrix or csr_matrix, now we only have csc for csc containers and csr matrix otherwise. I haven't checked the code so just want to confirm that do we expect csr array in all the other cases?

Yes, according to the doc
https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.AdaBoostClassifier.html#sklearn.ensemble.AdaBoostClassifier.fit

Sparse matrix can be CSC, CSR, COO, DOK, or LIL. COO, DOK, and LIL are converted to CSR.

Thanks for clarifying

OmarManzoor

LGTM

…/test_weight_boosting.py` (scikit-learn#27148)

TST Extend tests for scipy.sparse.*array in sklearn/ensemble/tests/te…

04cf3b4

…st_weight_boosting.py

github-actions bot added the module:ensemble label Aug 24, 2023

ogrisel mentioned this pull request Aug 24, 2023

TST Extend tests for scipy.sparse.*array #27090

Closed

ogrisel added the No Changelog Needed label Aug 24, 2023

ogrisel reviewed Aug 24, 2023

View reviewed changes

yuanx749 added 2 commits August 24, 2023 17:52

Improve var name

dbc2fff

Merge branch 'main' into sparse-weight-boosting

bc7839b

ogrisel approved these changes Aug 24, 2023

View reviewed changes

OmarManzoor reviewed Aug 24, 2023

View reviewed changes

OmarManzoor approved these changes Aug 24, 2023

View reviewed changes

OmarManzoor merged commit a9611d0 into scikit-learn:main Aug 24, 2023

yuanx749 deleted the sparse-weight-boosting branch August 25, 2023 03:15

akaashpatelmns pushed a commit to akaashp2000/scikit-learn that referenced this pull request Aug 25, 2023

TST Extend tests for scipy.sparse.*array in `sklearn/ensemble/tests…

946b5de

…/test_weight_boosting.py` (scikit-learn#27148)

glemaitre pushed a commit to glemaitre/scikit-learn that referenced this pull request Aug 29, 2023

TST Extend tests for scipy.sparse.*array in `sklearn/ensemble/tests…

3b10f98

…/test_weight_boosting.py` (scikit-learn#27148)

REDVM pushed a commit to REDVM/scikit-learn that referenced this pull request Nov 16, 2023

TST Extend tests for scipy.sparse.*array in `sklearn/ensemble/tests…

548d1eb

…/test_weight_boosting.py` (scikit-learn#27148)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

TST Extend tests for `scipy.sparse.*array` in `sklearn/ensemble/tests/test_weight_boosting.py` #27148

TST Extend tests for `scipy.sparse.*array` in `sklearn/ensemble/tests/test_weight_boosting.py` #27148

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

TST Extend tests for scipy.sparse.*array in sklearn/ensemble/tests/test_weight_boosting.py #27148

TST Extend tests for scipy.sparse.*array in sklearn/ensemble/tests/test_weight_boosting.py #27148

Uh oh!

Conversation

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

Uh oh!

✔️ Linting Passed

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

TST Extend tests for `scipy.sparse.*array` in `sklearn/ensemble/tests/test_weight_boosting.py` #27148

TST Extend tests for `scipy.sparse.*array` in `sklearn/ensemble/tests/test_weight_boosting.py` #27148