Infinite loop when running isotonic regression with some zero-valued weights #4297

ogrisel · 2015-02-26T15:03:36Z

I extracted the following bug from the discussion in #2507 (comment):

import numpy as np
import sklearn.isotonic

regression = sklearn.isotonic.IsotonicRegression()
n_samples = 60

x = np.linspace(-3, 3, n_samples)
y = x + np.random.uniform(size=n_samples)
w = np.random.uniform(size=n_samples)
w[5:8] = 0
regression.fit(x, y, sample_weight=w)

This bug alone should probably be considered release-critical for 0.16.

mjbommar · 2015-02-27T23:08:21Z

@ogrisel, I have a fix in my personal repo, but I want to wait until the work @amueller and I did in #4302 is done, to minimize the mess.

amueller · 2015-02-27T23:27:27Z

I don't understand the fix. How does that work? sample_weight needs to have the same shape as X and y, right? I think after #4302, it is just a matter of dropping the points with zero weight in fit.
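For illustration, the "drop the points with zero weight" idea mentioned above could be sketched as follows. This is a minimal sketch and not the fix that was merged for this issue: the helper name `fit_without_zero_weights` and the seeded `RandomState` are assumptions introduced here, and the sketch filters the inputs before calling `fit` rather than changing anything inside `IsotonicRegression`.

```python
import numpy as np
from sklearn.isotonic import IsotonicRegression


def fit_without_zero_weights(x, y, sample_weight):
    """Hypothetical helper: fit after discarding zero-weight samples."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    w = np.asarray(sample_weight, dtype=float)

    # Samples with zero weight cannot influence the weighted fit,
    # so they are removed before the estimator ever sees them.
    keep = w > 0

    model = IsotonicRegression()
    model.fit(x[keep], y[keep], sample_weight=w[keep])
    return model, keep


# Same data layout as the report, with a seeded generator for repeatability.
rng = np.random.RandomState(42)
n_samples = 60
x = np.linspace(-3, 3, n_samples)
y = x + rng.uniform(size=n_samples)
w = rng.uniform(size=n_samples)
w[5:8] = 0

model, keep = fit_without_zero_weights(x, y, w)
y_pred = model.predict(x)
```

Predictions for the dropped x values are still available here because they lie between retained points, so `predict` interpolates across them.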

amueller · 2015-02-27T23:33:42Z

Never mind, I misread your fix, it is good. It only works after removing fit_transform in #4302, though.

mjbommar · 2015-02-27T23:47:56Z

@amueller , yup, which is why I wanted to wait to pull against master after #4302 is merged :)

ogrisel · 2015-03-05T20:38:10Z

Apart from the rng comment, your fix LGTM (once #4302 is merged ;)

mjbommar · 2015-03-05T20:43:16Z

Thanks, good catch. Sorry for missing. Done now:
mjbommar@6e9d254

[MRG + 2] Adding fix for issue #4297, isotonic infinite loop

mjbommar · 2015-03-06T21:23:29Z

We are good on this thanks to @amueller's work today. I believe it can be closed.

GaelVaroquaux · 2015-03-06T21:46:24Z

Thanks!

* tag '0.16b1': (1589 commits) 0.16.X branching, version 0.16b1 Fix scikit-learn#4351. Rendering of docs in MinMaxScaler. Fix rebase conflict MAINT use canonical PEP-440 dev version consistently Adding fix for issue scikit-learn#4297, isotonic infinite loop DOC deprecate random_state for DBSCAN FIX/TST boundary cases in dbscan (closes scikit-learn#4073) Do not shuffle in DBSCAN (warn if `random_state` is used). Update docstring predict_proba() Update documentation of predict_proba in tree module add scipy2013 tutorial links to presentations on website. TST boundary handling in LSHForest.radius_neighbors ENH improve docstrings and test for radius_neighbors models use a pipeline for pre-processing feature selection, as per best practise DOC remove unnecessary backticks in CONTRIBUTING. ENH no need for tie breaking jitter in calibration Implement "secondary" tie strategy in isotonic. Adding unit test to cover ties/duplicate x values in Isotonic Regression re: issue scikit-learn#4184 MAINT fix typo pyagm -> pygamg in SkipTest STYLE trailing spaces ...

ogrisel added the Bug label Feb 26, 2015

ogrisel added this to the 0.16 milestone Feb 26, 2015

ogrisel mentioned this issue Feb 26, 2015

Adding unit test to cover ties/duplicate x values in Isotonic Regression... #4185

Closed

ogrisel changed the title ~~Infinite loop when running isotonic regression~~ Infinite loop when running isotonic regression with some zero-valued weights Feb 26, 2015

mjbommar added a commit to mjbommar/scikit-learn that referenced this issue Feb 27, 2015

Adding test for issue scikit-learn#4297, isotonic infinite loop

da04cd9

mjbommar added a commit to mjbommar/scikit-learn that referenced this issue Feb 27, 2015

Adding fix for issue scikit-learn#4297, isotonic infinite loop

167b5e0

mjbommar mentioned this issue Feb 27, 2015

Fix for issue #4297, infinite isotonic mjbommar/scikit-learn#1

Closed

mjbommar mentioned this issue Mar 2, 2015

[MRG+1] Isotonic regression duplicate fixes #4302

Merged

amueller pushed a commit to amueller/scikit-learn that referenced this issue Mar 6, 2015

Adding fix for issue scikit-learn#4297, isotonic infinite loop

2415100

amueller added a commit that referenced this issue Mar 6, 2015

Merge pull request #4352 from amueller/issue-4297-infinite-isotonic_bak

4cc0235

[MRG + 2] Adding fix for issue #4297, isotonic infinite loop

GaelVaroquaux closed this as completed Mar 6, 2015

cemoody pushed a commit to cemoody/scikit-learn that referenced this issue Mar 7, 2015

Adding fix for issue scikit-learn#4297, isotonic infinite loop

bc38a04

amueller mentioned this issue Mar 23, 2015

IsotonicRegression gives NANs on normal data #2507

Closed

rasbt pushed a commit to rasbt/scikit-learn that referenced this issue Apr 6, 2015

Adding fix for issue scikit-learn#4297, isotonic infinite loop

6443fcc
