DOC Fix doc of defaults in sklearn.utils.sparsefuncs.py #18025

franslarsson · 2020-07-29T09:22:33Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

This PR changes how default values are documented in sklearn.utils.sparsefuncs.py and updates the docstrings according to the guideline.

Any other comments?

rth · 2020-07-30T11:22:14Z

Thanks @franslarsson, there are some linting issues that need to be resolved.

franslarsson · 2020-07-30T16:57:10Z

Thanks for pointing it out, the linting issue is solved.

glemaitre

A couple of changes to be in line with our documentation guideline.

sklearn/utils/sparsefuncs.py

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

franslarsson · 2020-07-31T16:06:17Z

Thank you @glemaitre for the review! I have committed your changes.

rth · 2020-07-31T17:22:50Z

Valid NumPy dtypes are listed in https://numpy.org/doc/stable/user/basics.types.html and float is not one of them. NumPy does convert the Python float type to float64:

>>> np.dtype(float)
dtype('float64')

but we had better write it explicitly as np.float64 (or np.floating if it can be 32-bit or 64-bit depending on the input or platform).

float is confusing because it's unclear whether it means a C float (i.e. 32-bit) or a Python float (64-bit).

Sorry if this goes in a direction orthogonal to @glemaitre's comments.
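A minimal sketch of the distinction described above, assuming only that NumPy is installed. The variable names are illustrative and not part of the PR; it checks that the builtin float resolves to a 64-bit dtype and that np.floating is the abstract parent of both float widths.

```python
import numpy as np

# The builtin Python float is a C double, so NumPy maps it to float64.
builtin_float = np.dtype(float)
is_float64 = builtin_float == np.dtype(np.float64)

# np.floating is an abstract scalar type covering every float width,
# which is why it suits docstrings that accept 32-bit or 64-bit input.
float32_is_floating = np.issubdtype(np.float32, np.floating)
float64_is_floating = np.issubdtype(np.float64, np.floating)

# Integer dtypes are not covered by np.floating.
int64_is_floating = np.issubdtype(np.int64, np.floating)
```

Writing np.float64 in a docstring states the width outright, whereas a bare float leaves the reader to guess which convention applies.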

glemaitre · 2020-08-01T08:02:12Z

Yep, I got a bit sloppy there. @franslarsson Could you mention the NumPy types?

franslarsson · 2020-08-01T20:52:17Z

Absolutely, I will do that. I noticed that, for example, in inplace_csr_row_scale(), np.int64, np.float64 and np.float32 all seem to be valid types and give the same results. Should the requirement that dtype=np.float64 still be documented in the docstring, or should it be removed in those cases? What do you prefer?
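The observation above can be illustrated with a small NumPy-only sketch. This is not the scikit-learn implementation: scale_csr_rows is a hypothetical helper, and the hand-built data and indptr arrays stand in for the internals of a CSR matrix. It only shows why scaling factors of different dtypes can give the same result when the stored values are float64.

```python
import numpy as np


def scale_csr_rows(data, indptr, scale):
    """Hypothetical helper: multiply each CSR row's stored values by scale[row]."""
    # np.diff(indptr) is the number of stored values in each row, so
    # np.repeat expands one factor per row into one factor per stored value.
    return data * np.repeat(scale, np.diff(indptr))


# CSR pieces for the 2x3 matrix [[1, 0, 2], [0, 3, 0]] (column indices omitted).
data = np.array([1.0, 2.0, 3.0])
indptr = np.array([0, 2, 3])

# Apply the same factors (2 for row 0, 3 for row 1) under three dtypes.
results = {
    dt.__name__: scale_csr_rows(data, indptr, np.array([2, 3], dtype=dt))
    for dt in (np.int64, np.float32, np.float64)
}
```

In this sketch the float64 data promotes each product to float64, so the three results match. Inputs with a narrower data dtype may behave differently.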

glemaitre · 2020-08-03T06:42:55Z

We can stipulate a set of potential types: dtype={np.int64, np.float32, np.float64}
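As a rough illustration of that notation, here is a numpydoc-style docstring on a hypothetical function. Both check_scale and ALLOWED_SCALE_DTYPES are invented for this sketch and are not part of sklearn.utils.sparsefuncs; the set of accepted dtypes is likewise an assumption.

```python
import numpy as np

# Assumption for this sketch: only these two dtypes are accepted.
ALLOWED_SCALE_DTYPES = (np.float32, np.float64)


def check_scale(scale):
    """Return ``scale`` unchanged if its dtype is one of the documented ones.

    Parameters
    ----------
    scale : ndarray of shape (n_samples,), dtype={np.float32, np.float64}
        Row-wise scaling factors.

    Returns
    -------
    scale : ndarray of shape (n_samples,)
        The validated input.
    """
    # A NumPy dtype compares equal to its scalar type, so ``in`` works here.
    if scale.dtype not in ALLOWED_SCALE_DTYPES:
        raise TypeError(f"unsupported dtype: {scale.dtype}")
    return scale
```

Listing the set in braces tells the reader which dtypes are expected without implying that a single one is required.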

glemaitre · 2020-08-03T06:44:46Z

I would say that it could be handy for the low-level functions, while for the high-level functions (usually intended for a user) this might be less interesting, and omitting the dtype would be OK. But this is only a personal opinion :)

glemaitre · 2020-08-21T12:43:53Z

@franslarsson Could you fix the part regarding the valid numpy dtype?

franslarsson · 2020-08-21T20:02:01Z

I have committed one suggestion which includes both 32 and 64. I am a bit unsure how to choose between np.float32 and np.float64 (and likewise between np.int32 and np.int64), so let me know if you have other suggestions. :)

haiatn

LGTM

cmarmo · 2020-11-23T10:43:43Z

Hi @franslarsson, do you mind fixing the conflicts? Then we can push a bit to merge and close #15761. Thanks a lot for your work so far.

franslarsson · 2020-12-06T15:26:19Z

Hi @cmarmo! I have now fixed the conflicts, sorry for taking so long. Let me know if there is something else that needs to be fixed.

glemaitre

Thanks for the PR. LGTM. Could you just remove the blank lines that have been added?

sklearn/utils/sparsefuncs.py

franslarsson · 2020-12-07T20:29:59Z

Thanks for the review @glemaitre, I have removed the extra blank lines.

glemaitre · 2020-12-15T09:37:43Z

Thanks @franslarsson

We should probably backport this PR to 0.24.X for documentation consistency.

franslarsson added 3 commits July 28, 2020 23:11

docstring improvements

3a04f65

update options for axis

7d5437d

update type in docstring

6a605be

github-actions bot added the module:utils label Jul 29, 2020

fix too long line

44fedf8

glemaitre reviewed Jul 31, 2020

franslarsson and others added 2 commits July 31, 2020 17:57

Apply suggestions from code review - added dtype for arrays

153a7a9

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

Apply suggestions from code review - specify format of sparse matrix

4469a3b

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

make doc of CSR and CSC consistent

df0e8a9

franslarsson mentioned this pull request Aug 20, 2020

DOC Fix doc of defaults in grower.py #18206

Merged

Add dtype

4af49f1

franslarsson mentioned this pull request Aug 30, 2020

DOC Fix default values, shape and dtype for preprocessing._discretization.py #18290

Merged

cmarmo mentioned this pull request Sep 24, 2020

Fix documentation of default values in all classes #15761

Closed

haiatn approved these changes Oct 7, 2020

cmarmo added the Waiting for Reviewer label Oct 7, 2020

fix conflicts

893e163

glemaitre approved these changes Dec 7, 2020

cmarmo removed the Waiting for Reviewer label Dec 7, 2020

STL: Remove extra blank lines

7eca347

cmarmo added the Waiting for Reviewer label Dec 8, 2020

glemaitre merged commit 8e9ced2 into scikit-learn:master Dec 15, 2020

cmarmo removed the Waiting for Reviewer label Dec 15, 2020

cmarmo added this to the 0.24 milestone Dec 15, 2020

alfaro96 added the To backport PR merged in master that need a backport to a release branch defined based on the milestone. label Dec 16, 2020

glemaitre added a commit to glemaitre/scikit-learn that referenced this pull request Dec 22, 2020

DOC Fix doc of defaults in sklearn.utils.sparsefuncs.py (scikit-learn…

137684d

…#18025) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

glemaitre mentioned this pull request Dec 22, 2020

Release 0.24.0 #19058

Merged

14 tasks

glemaitre mentioned this pull request Apr 22, 2021

Release 0.24.2 #19954

Merged

12 tasks

glemaitre added a commit to glemaitre/scikit-learn that referenced this pull request Apr 22, 2021

DOC Fix doc of defaults in sklearn.utils.sparsefuncs.py (scikit-learn…

6873365

…#18025) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>
