[WIP] Multiple Imputation: Example with IterativeImputer by RianneSchouten · Pull Request #11370 · scikit-learn/scikit-learn · GitHub

[WIP] Multiple Imputation: Example with IterativeImputer #11370


Closed
164 commits
686d758
add example multiple imputation
RianneSchouten Jun 27, 2018
965ae8e
adjust figure widths and legends
RianneSchouten Jun 27, 2018
fa082de
changed code according pep rules and increased figure size
RianneSchouten Jun 28, 2018
e3e2465
solve two issues from lgtm and improve introduction text
RianneSchouten Jun 28, 2018
15b3b91
remove spaces in arguments and add lines for definitions
RianneSchouten Jun 28, 2018
ed1db8b
put rules in separate functions and include explanation
RianneSchouten Jun 29, 2018
7c3fb97
line from 80 to 79 characters
RianneSchouten Jun 29, 2018
40dd0bf
DOC Format in DBSCAN
qinhanmin2014 Sep 3, 2018
721ebae
MNT Change max_bound -> max_eps in OPTICS (#11984)
adrinjalali Sep 3, 2018
84c4e54
COSMIT remove unnecessary _TreeNode methods (#11983)
jnothman Sep 3, 2018
07051bc
DOC OPTICS: improve docstring and add default values. (#11987)
adrinjalali Sep 3, 2018
ddf37c7
DOC adding scikit-multilearn to related projects list (#11988)
niedakh Sep 3, 2018
a041821
TST FIX use match rather than message in pytest.raises (#12001)
wdevazelhes Sep 4, 2018
efeb23d
DOC note controversy on multiclass balanced accuracy definition (#11994)
jnothman Sep 5, 2018
dff84c8
MNT Use `fmax` when finding the maximum (#12005)
jakirkham Sep 5, 2018
8d5b08d
DOC small changes in outlier detection documentation (#12003)
albertcthomas Sep 5, 2018
c268230
MNT Remove n_clusters_ in OPTICS (#11981)
qinhanmin2014 Sep 5, 2018
5e101a2
Joblib 0.12.4 (#12007)
ogrisel Sep 5, 2018
242410f
MAINT make pytest collection ignore folders with Python scripts (#12011)
ogrisel Sep 5, 2018
a9c6ad9
[MRG+1] break the tie in Meanshift in case cluster intensities are th…
adrinjalali Sep 5, 2018
e726f7a
DOC fix minor spacing issue in the iris dataset description (#12019)
gabrielecalvo Sep 5, 2018
1fafc5c
TST use urlopen monkeypatch for test_decode_* (#12020)
jnothman Sep 6, 2018
3a80162
DOC gradient boosting fit() supports sparse X (#12022)
vivekk0903 Sep 6, 2018
b4bf033
DOC: Add pytest version in documentation (#12002)
wdevazelhes Sep 6, 2018
a056a57
DOC fix for linnerud dataset (#12024)
papaonlegs Sep 6, 2018
d3d09c3
MAINT skip joblib vendor test on debian (#12027)
ogrisel Sep 6, 2018
121dd5a
MNT Fix utils.sparse import in neural_network.rbm (#12032)
jeremiedbb Sep 7, 2018
79f5d14
MNT Revert the deprecation of min_samples_leaf and min_weight_fractio…
jnothman Sep 8, 2018
e5333f5
OPTICS remove redundant recursion (#11985)
adrinjalali Sep 8, 2018
2242f4c
EXA use openml fetcher in plot_gpr_co2.py example (#12004)
maxcopeland Sep 8, 2018
177900b
CI Workaround to test numpy 1.8.2 and scipy 0.13.3 (#12042)
rth Sep 9, 2018
251e58b
FIX ordering_ type and cosmetic changes to structure for OPTICS main …
jnothman Sep 9, 2018
a86709f
[MRG] MNT rename min_cluster_size_ratio to min_cluster_size (#11913)
jnothman Sep 9, 2018
5ec0001
DOC `sample_weight` removed from the docs in `SVR` class. (#12046)
Sep 11, 2018
f71de6f
MNT Unify and refactor strategy error (#12050)
JarnoRFB Sep 12, 2018
2ed18e0
[MRG] DOC Examples added to the rest of linear models (#11975)
adrinjalali Sep 12, 2018
1906c95
DOC Generated author list from github (#11708)
TomDLT Sep 12, 2018
3ee1cfc
ENH Allow scoring of dummies without testsamples (#11957)
JarnoRFB Sep 13, 2018
dad5c36
DOC Fix docstring inconsistency in nmf.py (#12063)
zjpoh Sep 13, 2018
36536c6
MAINT Fix invalid escape sequence (#12064)
adrinjalali Sep 13, 2018
17c6c90
DOC fix typos in documentation. (#12059)
zdgriffith Sep 13, 2018
06b4307
DOC Include fetch_openml doc in user guide (#12065)
qinhanmin2014 Sep 13, 2018
e36254c
MNT: Anonimize IP for Google Analytics (#12038)
rth Sep 13, 2018
ec69171
TST Use pytest.raises instead of legacy constructions (#12057)
rth Sep 13, 2018
dfdf605
BUG always raise on NaN in OneHotEncoder for object dtype data (#12033)
jorisvandenbossche Sep 13, 2018
c2f9206
MAINT joblib 0.12.5 (#12066)
ogrisel Sep 13, 2018
5be05c6
ENH Adds drop in FeatureUnion (#11640)
thomasjpfan Sep 13, 2018
6a57983
ENH Better error message for metrics of neighbors (#11914)
zjpoh Sep 13, 2018
781edc4
MNT Duplicate import in test_pipeline.py
qinhanmin2014 Sep 14, 2018
1049fb1
ENH Better error message for sparse metrics of neighbors (#12073)
zjpoh Sep 14, 2018
55ede24
MAINT: skip doctest for Python 2 (#12074)
glemaitre Sep 14, 2018
aa53493
fix typo
amueller Sep 14, 2018
a79d44e
FIX OPTICS Change quick_scan floating point comparison to isclose (#1…
adrinjalali Sep 16, 2018
43c8563
TST: skip test requiring internet using --skip-network (#12067)
glemaitre Sep 16, 2018
abbdc91
DOC Fix description of SVC intercept_ shape in user guide (#12070)
zdgriffith Sep 16, 2018
6de7957
TST Ignore warnings in common test to avoid collection errors (#12093)
massich Sep 16, 2018
efe7b8c
DOC Fix optics metric issues (DOC and precomputed) (#12028)
adrinjalali Sep 17, 2018
e616ee3
DOC move OPTICS to 0.21
jnothman Sep 17, 2018
daaa2a5
DOC Reword to avoid that people draw wrong conclusions (#12095)
kno10 Sep 17, 2018
8068b97
MNT Only checks warnings on latest depedendencies versions in CI (#12…
rth Sep 17, 2018
b91cbda
[MRG] MNT Re-enable PyPy CI (#12039)
rth Sep 18, 2018
8a5ff27
DOC BaggingRegressor missing default value for oob_score in docstring…
adrinjalali Sep 19, 2018
f0532c5
DOC Removing quotes from variant names. (#12113)
louib Sep 20, 2018
84dd57b
[MRG] Update test_metaestimators to pass y parameter when calling sco…
oliverrausch Sep 20, 2018
e1cf0db
DOC Update fit_transform docstring of OneHotEncoder (#12117)
albertcthomas Sep 20, 2018
0bbd935
DOC Typo in OneHotEncoder
qinhanmin2014 Sep 20, 2018
81601fb
Fix typo (#12126)
Mottl Sep 21, 2018
4035e60
[MRG +1] ColumnTransformer: store evaluated function column specifier…
jorisvandenbossche Sep 21, 2018
5d30b2d
ENH (0.21) Make OPTICS more memory efficient when calling kneighbors …
TomDLT Sep 22, 2018
e81bcd5
Added the changes to remove the documentation support statements for …
Sriharsha-hatwar Sep 22, 2018
0c0a9e8
DOC Replaced the deprecated early_stopping parameter with n_iter_no_c…
Sriharsha-hatwar Sep 22, 2018
e8ca4cd
[MRG] DOC covariance doctest examples (#12124)
adrinjalali Sep 24, 2018
755c7dd
DOC (0.21) OPTICS Note the order of reachability_ and core_distances_…
qinhanmin2014 Sep 24, 2018
2d66034
DOC Removed duplicated doc in tree.rst (#11922)
NicolasHug Sep 24, 2018
fc7d6e6
[MRG] Fix FutureWarnings in logistic regression examples (#12114)
ogrisel Sep 24, 2018
5431a1a
BUG: check equality instead of identity in check_cv (#12155)
glemaitre Sep 25, 2018
f15ebb9
[MRG] Convert ColumnTransformer input list to numpy array (#12104)
vinayak-mehta Sep 25, 2018
661a8b4
add sparse_threshold to make_columns_transformer (#12152)
datajanko Sep 25, 2018
e58f366
ColumnTransformer generalization to work on empty lists (#12084)
janvanrijn Sep 25, 2018
09851ac
TST update make_column_transformer test + add comment (#12156)
jorisvandenbossche Sep 25, 2018
9427c36
coef0 is a float, not an int (#12161)
djsutherland Sep 25, 2018
db5abae
DOC: update link to other versions (#12158)
jorisvandenbossche Sep 26, 2018
c946391
DOC added 0.20 contributors to whats new. (#12160)
amueller Sep 26, 2018
e63feeb
DOC More specific about the limitation of make_column_transformer (#1…
qinhanmin2014 Sep 26, 2018
b886da5
MAINT update comment
jnothman Sep 26, 2018
b915ca6
MNT Avoid using "is" when comparing strings (#12168)
qinhanmin2014 Sep 26, 2018
fe05e79
OPTIM remove useless overhead caused by nested parallelism in mean_sh…
ogrisel Sep 26, 2018
6463406
MNT Unused import in plot_gpr_co2.py
qinhanmin2014 Sep 26, 2018
7d72720
[MRG] Crash when using SGDClassifier with early stopping in a paralle…
ogrisel Sep 26, 2018
86931ad
Dedicate the release to Raghav
ogrisel Sep 26, 2018
4207225
DOC start section for the 0.20.1 bugfix notes (#12170)
ogrisel Sep 26, 2018
dbfd872
DOC add a known issue entry for euclidean_distances precision (#12176)
jnothman Sep 27, 2018
88b49e5
Fix parallel backend neighbors (#12172)
tomMoral Sep 27, 2018
a11154e
[MRG] improve check_non_negative for sparse matrices (#12106)
TomDLT Sep 27, 2018
da0cb32
FIX Use take instead of choose in compute_sample_weight (#12165)
jnothman Sep 27, 2018
d88bef1
DOC Add sections to whats new 0.20.1 (#12183)
qinhanmin2014 Sep 27, 2018
2e2e69d
DOC KDE normalisation clarified (#11275)
haroldfox Sep 28, 2018
819d8ef
[MRG] Fix diagonal in DBSCAN with precomputed sparse neighbors graph …
TomDLT Sep 28, 2018
ea52161
MNT Move what's new entry
qinhanmin2014 Sep 28, 2018
a358d7d
DOC Add Versionadded tag to sklearn/_config.py (#12187)
J0 Sep 28, 2018
239482f
BaseSearchCV._run_search raises NotImplementedError instead of being …
adrinjalali Sep 28, 2018
2cf145d
DOC Add versionadded to set_config (#12196)
qinhanmin2014 Sep 29, 2018
0b58bc3
DOC Improve ColumnTransformer docstrings (#12206)
LilyX2021 Sep 29, 2018
9d58ca5
MNT Remove duplicate import of warnings & unused variables (#12203)
rebekahkim Sep 30, 2018
663e024
DOC Fix typo in neighbors/nearest_centroid.py (#12223)
jdethurens Sep 30, 2018
e1c3c22
DOC Fixing summary table in the linear model documentation. (#12220)
markhannel Sep 30, 2018
a3616f6
MNT Use name instead of float to specify colors (#12199)
qinhanmin2014 Oct 1, 2018
94c70ff
[MRG] More informative error message in OneHotEncoder(categories=None…
ygivenx Oct 1, 2018
59b15c5
add explicit mention of scaing for saga in logisticregression docs. (…
amueller Oct 1, 2018
11612fc
MNT Raise error for duplicate classes when constructing a MultiLabelB…
samwaterbury Oct 2, 2018
5313325
DOC Encourage contributors to use sklearn.show_versions() (#12225)
qinhanmin2014 Oct 2, 2018
acf3bab
MNT Add versionadded to set_config parameters
qinhanmin2014 Oct 2, 2018
dd3b705
MNT Unused imports in examples
qinhanmin2014 Oct 2, 2018
60cf1d6
Fix numpy.int overflow in make_classification (#10811)
VarIr Oct 2, 2018
7166cd5
MNT Remove duplicate entry in whats new
qinhanmin2014 Oct 2, 2018
dfd009d
Remove test_import_sklearn_no_warnings (#12244)
rth Oct 2, 2018
cbbe489
DOC fix cross-entropy typo in tree docs (#12242)
rasbt Oct 3, 2018
bfab306
[MRG] Added Tips in SVM user guide for tuning C parameter in LinearSV…
Oct 3, 2018
da85815
DOC what's new entry for "Fix numpy.int overflow in make_classificati…
rth Oct 3, 2018
3e5777a
[MRG] Fast PolynomialFeatures on dense arrays (#12251)
TomDLT Oct 3, 2018
fb7be87
[MRG + 1] return_train_score deprecation (#12241)
amueller Oct 3, 2018
e0e7387
Remove unused private functions (#12253)
rth Oct 4, 2018
24aa6b8
DOC Remove mentions of removed AUTHORS.rst file (#12262)
jacquerie Oct 4, 2018
f456a40
MNT Change show_versions format to suit markdown (#12255)
qinhanmin2014 Oct 4, 2018
5e24762
DOC add note on discretization creating non-linearity (#12269)
jnothman Oct 4, 2018
e9cdb55
MNT Updated PyPI URLs (#12274)
TakingItCasual Oct 4, 2018
2eca77b
MNT complete VotingClassifier flatten_transform deprecation (#12252)
amueller Oct 4, 2018
8e08028
DOC Move 'for instance' to front
jnothman Oct 4, 2018
58228cb
[MRG+1] ColumnTransformer fix having mixed types in a single passthro…
adrinjalali Oct 4, 2018
b725921
[MRG] Created 'cross-validation estimator' entry in glossary (#11661)
NicolasHug Oct 4, 2018
3f5bf97
DOC removed ambiguity in pipeline gridsearch example (#12272)
sjtrny Oct 4, 2018
74b56db
MNT Converting http to https (#12277)
TakingItCasual Oct 4, 2018
877e3f3
MNT Move what's new entry
qinhanmin2014 Oct 4, 2018
1e052e9
Converting http to https (2)... (#12292)
TakingItCasual Oct 4, 2018
0bbb7d0
DOC Add class example for LedoitWolf (#12214)
andreanr Oct 5, 2018
afa0694
FIX cache of OpenML fetcher (#12246)
janvanrijn Oct 5, 2018
08924c3
EXA Fix bad data visualisation in "Importance of Feature Scaling" (#1…
benjastudio Oct 5, 2018
a1be325
MNT Make check_X_y raise a better error when y is None (#12283)
jacquerie Oct 5, 2018
8afe43e
DOC Added "mars" testimonial to testimonials page (#12298)
GauravAhlawat Oct 5, 2018
9e1d48f
DOC What's new typo
qinhanmin2014 Oct 5, 2018
e6359e2
DOC Fix broken link to joblib documentation (#12301)
kushalchauhan98 Oct 5, 2018
b020f62
MNT Apply pep8 to docs code (#12275)
kataev Oct 5, 2018
f4e7d2b
Converting http to https (3)... (#12302)
TakingItCasual Oct 5, 2018
3804ccd
MNT Refactors doc test into seperate script (#12248)
thomasjpfan Oct 6, 2018
4280308
ENH (0.21) Remember predecessor in OPTICS (#12135)
kno10 Oct 7, 2018
2020867
FIX (0.21) OPTICS correctly handle multiple infs in reachability arra…
adrinjalali Oct 7, 2018
63e5ae6
TST Use same random seeds for both GaussianMixture.fit (#12307)
oleksandr-pavlyk Oct 7, 2018
4e2e1fa
ENH Cache class mapping in MultiLabelBinarizer() (#12116)
kiote Oct 8, 2018
5fd9e03
ENH Raise an error when pos_label is not in binary y_true (#12313)
qinhanmin2014 Oct 8, 2018
c5b020f
TST Use v_measure_score to compare label arrays up to permutation (#1…
oleksandr-pavlyk Oct 9, 2018
205ff38
DOC What's new typo
qinhanmin2014 Oct 9, 2018
5df8cd3
EXA Fix title, overlapping plots and axis labels in plot_ols_ridge_va…
Oct 9, 2018
03c3af5
[MRG] Fix fetch_openml when ignore attributes are numeric (#12330)
janvanrijn Oct 9, 2018
bbb0d93
[MRG] FIX Update power_transform docstring and add FutureWarning (#12…
chang Oct 10, 2018
c8a4132
DOC check_array() and check_X_y() documentation update (#12340)
dvukolov Oct 10, 2018
a80bbd9
ENH add get_n_leaves() and get_max_depth() to DesicionTrees (#12300)
adrinjalali Oct 10, 2018
00c2f41
DOC fix logistic regression.fit docstring on y (#12343)
adrinjalali Oct 10, 2018
39bd736
[MRG] Move RandomTreesEmbedding criterion & max_features to be class …
rohan-varma Oct 10, 2018
831c760
ENH (0.21) Add max_error to the existing set of metrics for regressio…
whiletruelearn Oct 11, 2018
8985a63
DOC Update v0.20.rst with power_transform API change (#12351)
chang Oct 11, 2018
0f94f29
MNT simple deprecations and removals for 0.21 (#12238)
amueller Oct 11, 2018
c13ba26
[MRG] Matplotlib tree plotting (#9251)
amueller Oct 11, 2018
096440e
save latest changes
RianneSchouten Oct 12, 2018
ed9f3c8
Merge branch 'master' into multiple-imputation
RianneSchouten Oct 12, 2018
43 changes: 36 additions & 7 deletions .circleci/config.yml
@@ -41,13 +41,12 @@ jobs:
 # Test examples run with minimal dependencies
 - MINICONDA_PATH: ~/miniconda
 - CONDA_ENV_NAME: testenv
-- PYTHON_VERSION: 2
-- NUMPY_VERSION: 1.8.2
-# XXX: plot_gpc_xor.py fails with scipy 0.13.3
-- SCIPY_VERSION: 0.14
-- MATPLOTLIB_VERSION: 1.3
-- SCIKIT_IMAGE_VERSION: 0.9.3
-- PANDAS_VERSION: 0.13.1
+- PYTHON_VERSION: "2"
+- NUMPY_VERSION: "1.10"
+- SCIPY_VERSION: "0.16"
+- MATPLOTLIB_VERSION: "1.4"
+- SCIKIT_IMAGE_VERSION: "0.11"
+- PANDAS_VERSION: "0.17.1"
 steps:
 - checkout
 - run: ./build_tools/circle/checkout_merge_commit.sh
@@ -65,6 +64,21 @@ jobs:
 path: ~/log.txt
 destination: log.txt

+pypy3:
+docker:
+- image: pypy:3-6.0.0
+steps:
+- restore_cache:
+keys:
+- pypy3-ccache-{{ .Branch }}
+- pypy3-ccache
+- checkout
+- run: ./build_tools/circle/build_test_pypy.sh
+- save_cache:
+key: pypy3-ccache-{{ .Branch }}-{{ .BuildNum }}
+paths:
+- ~/.ccache
+- ~/.cache/pip

 deploy:
 docker:
@@ -89,6 +103,21 @@ workflows:
 jobs:
 - python3
 - python2
+- pypy3:
+filters:
+branches:
+only:
+- 0.20.X
 - deploy:
 requires:
 - python3
+pypy:
+triggers:
+- schedule:
+cron: "0 0 * * *"
+filters:
+branches:
+only:
+- master
+jobs:
+- pypy3
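
The `restore_cache` / `save_cache` pair added for the `pypy3` job relies on key fallback: the branch-specific key is tried first, then the generic `pypy3-ccache` prefix. The sketch below is only an illustration of that lookup order, assuming CircleCI's documented behaviour (keys are tried in order, each key matches saved caches by prefix, and the most recent match wins). The `store` list and the `restore_cache` function are hypothetical stand-ins, not CircleCI code, and build numbers are used as a proxy for recency.

```python
def restore_cache(store, keys):
    """Pick the saved cache key to restore, or None if nothing matches.

    store: list of (saved_key, build_num) pairs; a hypothetical stand-in
           for the cache backend.
    keys:  candidate keys in priority order, each treated as a prefix.
    """
    for key in keys:
        # Collect every saved cache whose key starts with this candidate.
        matches = [(num, saved) for saved, num in store if saved.startswith(key)]
        if matches:
            # Assumption: the highest build number is the most recent cache.
            return max(matches)[1]
    return None


# Saved keys follow the pattern pypy3-ccache-<Branch>-<BuildNum> from the diff.
store = [
    ("pypy3-ccache-master-101", 101),
    ("pypy3-ccache-0.20.X-117", 117),
    ("pypy3-ccache-master-120", 120),
]

# On master, the branch-specific prefix matches and the newest cache is used.
on_master = restore_cache(store, ["pypy3-ccache-master", "pypy3-ccache"])

# On a branch with no cache yet, the lookup falls back to the generic prefix.
on_new_branch = restore_cache(store, ["pypy3-ccache-feature", "pypy3-ccache"])

print(on_master)
print(on_new_branch)
```

Including `{{ .BuildNum }}` only in the `save_cache` key means every build writes a fresh cache, while restores can still match older caches through the shorter prefixes.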
9 changes: 6 additions & 3 deletions .travis.yml
@@ -46,7 +46,7 @@ matrix:
 CYTHON_VERSION="*" PYAMG_VERSION="*" PILLOW_VERSION="*"
 JOBLIB_VERSION="*" COVERAGE=true
 CHECK_PYTEST_SOFT_DEPENDENCY="true" TEST_DOCSTRINGS="true"
-SKLEARN_SITE_JOBLIB=1
+SKLEARN_SITE_JOBLIB=1 CHECK_WARNINGS="true"
 if: type != cron
 # flake8 linting on diff wrt common ancestor with upstream/master
 - env: RUN_FLAKE8="true" SKIP_TESTS="true"
@@ -58,11 +58,14 @@ matrix:
 # installed from their CI wheels in a virtualenv with the Python
 # interpreter provided by travis.
 - python: 3.6
-env: DISTRIB="scipy-dev"
+env: DISTRIB="scipy-dev" CHECK_WARNINGS="true"
 if: type = cron OR commit_message =~ /\[scipy-dev\]/

 install: source build_tools/travis/install.sh
-script: bash build_tools/travis/test_script.sh
+script:
+- bash build_tools/travis/test_script.sh
+- bash build_tools/travis/test_docs.sh
+- bash build_tools/travis/test_pytest_soft_dependency.sh
 after_success: source build_tools/travis/after_success.sh
 notifications:
 webhooks:
75 changes: 0 additions & 75 deletions AUTHORS.rst

This file was deleted.

28 changes: 18 additions & 10 deletions CONTRIBUTING.md
@@ -178,17 +178,25 @@ following rules before submitting:
 as your Python, scikit-learn, numpy, and scipy versions. This information
 can be found by running the following code snippet:

-```python
-import platform; print(platform.platform())
-import sys; print("Python", sys.version)
-import numpy; print("NumPy", numpy.__version__)
-import scipy; print("SciPy", scipy.__version__)
-import sklearn; print("Scikit-Learn", sklearn.__version__)
-```
+For scikit-learn >= 0.20:
+
+```python
+import sklearn; sklearn.show_versions()
+```
+
+For scikit-learn < 0.20:
+
+```python
+import platform; print(platform.platform())
+import sys; print("Python", sys.version)
+import numpy; print("NumPy", numpy.__version__)
+import scipy; print("SciPy", scipy.__version__)
+import sklearn; print("Scikit-Learn", sklearn.__version__)
+```

 - Please be specific about what estimators and/or functions are involved
 and the shape of the data, as appropriate; please include a
-[reproducible](http://stackoverflow.com/help/mcve) code snippet
+[reproducible](https://stackoverflow.com/help/mcve) code snippet
 or link to a [gist](https://gist.github.com). If an exception is raised,
 please provide the traceback.

@@ -222,8 +230,8 @@ be placed in ``_build/html/stable`` and are viewable in a web browser. See the

 For building the documentation, you will need
 [sphinx](http://sphinx.pocoo.org/),
-[matplotlib](http://matplotlib.org/), and
-[pillow](http://pillow.readthedocs.io/en/latest/).
+[matplotlib](https://matplotlib.org/), and
+[pillow](https://pillow.readthedocs.io/en/latest/).

 When you are writing documentation, it is important to keep a good
 compromise between mathematical and algorithmic details, and give
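
The CONTRIBUTING.md change above splits the version-reporting instructions by scikit-learn release. As a rough sketch of how the two branches could be combined into one snippet, the helper below calls `sklearn.show_versions()` when it is available and otherwise falls back to printing each version by hand. The names `collect_versions` and `print_versions` are made up for this illustration and are not part of scikit-learn.

```python
import importlib
import platform
import sys


def collect_versions(modules=("numpy", "scipy", "sklearn")):
    """Return report lines: platform, Python version, then one line per module."""
    lines = [platform.platform(), "Python " + sys.version.split()[0]]
    for name in modules:
        try:
            module = importlib.import_module(name)
        except ImportError:
            # Keep going so the report still lists what is missing.
            lines.append(name + " not installed")
        else:
            lines.append(name + " " + getattr(module, "__version__", "unknown"))
    return lines


def print_versions():
    """Use sklearn.show_versions() when present (>= 0.20), else the manual report."""
    try:
        import sklearn
    except ImportError:
        sklearn = None
    if sklearn is not None and hasattr(sklearn, "show_versions"):
        sklearn.show_versions()
    else:
        print("\n".join(collect_versions()))


if __name__ == "__main__":
    print_versions()
```

Checking for the attribute rather than parsing the version string keeps the fallback working on development builds whose version does not follow the release numbering.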
5 changes: 4 additions & 1 deletion ISSUE_TEMPLATE.md
@@ -1,6 +1,6 @@
 <!--
 If your issue is a usage question, submit it here instead:
-- StackOverflow with the scikit-learn tag: http://stackoverflow.com/questions/tagged/scikit-learn
+- StackOverflow with the scikit-learn tag: https://stackoverflow.com/questions/tagged/scikit-learn
 - Mailing List: https://mail.python.org/mailman/listinfo/scikit-learn
 For more information, see User Questions: http://scikit-learn.org/stable/support.html#user-questions
 -->
@@ -43,6 +43,9 @@ it in the issue: https://gist.github.com
 #### Versions
 <!--
 Please run the following snippet and paste the output below.
+For scikit-learn >= 0.20:
+import sklearn; sklearn.show_versions()
+For scikit-learn < 0.20:
 import platform; print(platform.platform())
 import sys; print("Python", sys.version)
 import numpy; print("NumPy", numpy.__version__)
1 change: 0 additions & 1 deletion MANIFEST.in
@@ -4,6 +4,5 @@ recursive-include examples *
 recursive-include sklearn *.c *.h *.pyx *.pxd *.pxi
 recursive-include sklearn/datasets *.csv *.csv.gz *.rst *.jpg *.txt *.arff.gz *.json.gz
 include COPYING
-include AUTHORS.rst
 include README.rst

16 changes: 9 additions & 7 deletions README.rst
@@ -34,7 +34,8 @@ SciPy and distributed under the 3-Clause BSD license.

 The project was started in 2007 by David Cournapeau as a Google Summer
 of Code project, and since then many volunteers have contributed. See
-the `AUTHORS.rst <AUTHORS.rst>`_ file for a complete list of contributors.
+the `About us <http://scikit-learn.org/dev/about.html#authors>`_ page
+for a list of core contributors.

 It is currently maintained by a team of volunteers.

@@ -56,8 +57,8 @@ scikit-learn requires:
 **Scikit-learn 0.20 is the last version to support Python2.7.**
 Scikit-learn 0.21 and later will require Python 3.5 or newer.

-For running the examples Matplotlib >= 1.3.1 is required. A few examples
-require scikit-image >= 0.9.3 and a few examples require pandas >= 0.13.1.
+For running the examples Matplotlib >= 1.4 is required. A few examples
+require scikit-image >= 0.11.3 and a few examples require pandas >= 0.17.1.

 scikit-learn also uses CBLAS, the C interface to the Basic Linear Algebra
 Subprograms library. scikit-learn comes with a reference implementation, but
@@ -100,7 +101,7 @@ Important links
 ~~~~~~~~~~~~~~~

 - Official source code repo: https://github.com/scikit-learn/scikit-learn
-- Download releases: https://pypi.python.org/pypi/scikit-learn
+- Download releases: https://pypi.org/project/scikit-learn/
 - Issue tracker: https://github.com/scikit-learn/scikit-learn/issues

 Source code
@@ -120,7 +121,7 @@ Testing
 ~~~~~~~

 After installation, you can launch the test suite from outside the
-source directory (you will need to have the ``pytest`` package installed)::
+source directory (you will need to have ``pytest`` >= 3.3.0 installed)::

 pytest sklearn

@@ -143,7 +144,8 @@ Project History

 The project was started in 2007 by David Cournapeau as a Google Summer
 of Code project, and since then many volunteers have contributed. See
-the `AUTHORS.rst <AUTHORS.rst>`_ file for a complete list of contributors.
+the `About us <http://scikit-learn.org/dev/about.html#authors>`_ page
+for a list of core contributors.

 The project is currently maintained by a team of volunteers.

@@ -165,7 +167,7 @@ Communication

 - Mailing list: https://mail.python.org/mailman/listinfo/scikit-learn
 - IRC channel: ``#scikit-learn`` at ``webchat.freenode.net``
-- Stack Overflow: http://stackoverflow.com/questions/tagged/scikit-learn
+- Stack Overflow: https://stackoverflow.com/questions/tagged/scikit-learn
 - Website: http://scikit-learn.org

 Citation
13 changes: 10 additions & 3 deletions appveyor.yml
@@ -6,7 +6,7 @@ environment:
 global:
 # SDK v7.0 MSVC Express 2008's SetEnv.cmd script will fail if the
 # /E:ON and /V:ON options are not enabled in the batch script interpreter
-# See: http://stackoverflow.com/a/13751649/163740
+# See: https://stackoverflow.com/a/13751649/163740
 CMD_IN_ENV: "cmd /E:ON /V:ON /C .\\build_tools\\appveyor\\run_with_env.cmd"
 WHEELHOUSE_UPLOADER_USERNAME: sklearn-appveyor
 WHEELHOUSE_UPLOADER_SECRET:
@@ -20,6 +20,7 @@ environment:
 - PYTHON: "C:\\Python37-x64"
 PYTHON_VERSION: "3.7.0"
 PYTHON_ARCH: "64"
+CHECK_WARNINGS: "true"

 - PYTHON: "C:\\Python27"
 PYTHON_VERSION: "2.7.8"
@@ -45,7 +46,7 @@ install:
 Where-Object pullRequestId -eq $env:APPVEYOR_PULL_REQUEST_NUMBER)[0].buildNumber) { `
 throw "There are newer queued builds for this pull request, failing early." }

-# Install Python (from the official .msi of http://python.org) and pip when
+# Install Python (from the official .msi of https://python.org) and pip when
 # not already installed.
 - "powershell ./build_tools/appveyor/install.ps1"
 - "SET PATH=%PYTHON%;%PYTHON%\\Scripts;%PATH%"
@@ -72,7 +73,13 @@ test_script:
 # installed library.
 - mkdir "../empty_folder"
 - cd "../empty_folder"
-- pytest --showlocals --durations=20 --pyargs sklearn
+- ps: >-
+if (Test-Path variable:global:CHECK_WARNINGS) {
+$env:PYTEST_ARGS = "-Werror::DeprecationWarning -Werror::FutureWarning"
+} else {
+$env:PYTEST_ARGS = ""
+}
+- "pytest --showlocals --durations=20 %PYTEST_ARGS% --pyargs sklearn"
 # Move back to the project folder
 - cd "../scikit-learn"
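
The `-Werror::DeprecationWarning -Werror::FutureWarning` arguments passed to pytest when `CHECK_WARNINGS` is set turn those two warning categories into exceptions, so a test that triggers one fails instead of passing quietly. The sketch below reproduces that effect with the standard `warnings` module only; `legacy_api` and the two wrapper functions are hypothetical names used for illustration, not code from the scikit-learn test suite.

```python
import warnings


def legacy_api():
    """Hypothetical function that still works but emits a DeprecationWarning."""
    warnings.warn("legacy_api is deprecated", DeprecationWarning)
    return 42


def call_with_warnings_as_errors(func):
    """Run func with Deprecation/FutureWarning escalated to exceptions."""
    with warnings.catch_warnings():
        warnings.simplefilter("error", DeprecationWarning)
        warnings.simplefilter("error", FutureWarning)
        try:
            return "ok", func()
        except (DeprecationWarning, FutureWarning) as exc:
            return "failed", str(exc)


def call_with_warnings_ignored(func):
    """Run func with all warnings silenced, for comparison."""
    with warnings.catch_warnings():
        warnings.simplefilter("ignore")
        return "ok", func()


print(call_with_warnings_as_errors(legacy_api))
print(call_with_warnings_ignored(legacy_api))
```

Restricting the escalation to these two categories leaves other warnings, such as `UserWarning`, non-fatal.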
4 changes: 2 additions & 2 deletions benchmarks/bench_covertype.py
@@ -34,10 +34,10 @@
 S. Shalev-Shwartz, Y. Singer, N. Srebro - In Proceedings of ICML '07.

 * `"Training Linear SVMs in Linear Time"
-<www.cs.cornell.edu/People/tj/publications/joachims_06a.pdf>`_
+<https://www.cs.cornell.edu/people/tj/publications/joachims_06a.pdf>`_
 T. Joachims - In SIGKDD '06

-[1] http://archive.ics.uci.edu/ml/datasets/Covertype
+[1] https://archive.ics.uci.edu/ml/datasets/Covertype

 """
 from __future__ import division, print_function
2 changes: 1 addition & 1 deletion benchmarks/bench_plot_nmf.py
@@ -96,7 +96,7 @@ def _nls_subproblem(X, W, H, tol, max_iter, alpha=0., l1_ratio=0.,
 ----------
 C.-J. Lin. Projected gradient methods for non-negative matrix
 factorization. Neural Computation, 19(2007), 2756-2779.
-http://www.csie.ntu.edu.tw/~cjlin/nmf/
+https://www.csie.ntu.edu.tw/~cjlin/nmf/
 """
 WtX = safe_sparse_dot(W.T, X)
 WtW = np.dot(W.T, W)
4 changes: 2 additions & 2 deletions benchmarks/bench_plot_randomized_svd.py
@@ -52,7 +52,7 @@
 ----------
 (1) Finding structure with randomness: Stochastic algorithms for constructing
 approximate matrix decompositions
-Halko, et al., 2009 http://arxiv.org/abs/arXiv:0909.4061
+Halko, et al., 2009 https://arxiv.org/abs/0909.4061

 (2) A randomized algorithm for the decomposition of matrices
 Per-Gunnar Martinsson, Vladimir Rokhlin and Mark Tygert
@@ -106,7 +106,7 @@
 MAX_MEMORY = np.int(2e9)

 # The following datasets can be dowloaded manually from:
-# CIFAR 10: http://www.cs.toronto.edu/~kriz/cifar-10-python.tar.gz
+# CIFAR 10: https://www.cs.toronto.edu/~kriz/cifar-10-python.tar.gz
 # SVHN: http://ufldl.stanford.edu/housenumbers/train_32x32.mat
 CIFAR_FOLDER = "./cifar-10-batches-py/"
 SVHN_FOLDER = "./SVHN/"
4 changes: 4 additions & 0 deletions build_tools/Makefile
@@ -0,0 +1,4 @@
+# Makefile for maintenance tools
+
+authors:
+python generate_authors_table.py > ../doc/authors.rst
4 changes: 2 additions & 2 deletions build_tools/appveyor/install.ps1
@@ -1,8 +1,8 @@
 # Sample script to install Python and pip under Windows
 # Authors: Olivier Grisel, Jonathan Helmus, Kyle Kastner, and Alex Willmer
-# License: CC0 1.0 Universal: http://creativecommons.org/publicdomain/zero/1.0/
+# License: CC0 1.0 Universal: https://creativecommons.org/publicdomain/zero/1.0/

-$MINICONDA_URL = "http://repo.continuum.io/miniconda/"
+$MINICONDA_URL = "https://repo.continuum.io/miniconda/"
 $BASE_URL = "https://www.python.org/ftp/python/"
 $GET_PIP_URL = "https://bootstrap.pypa.io/get-pip.py"
 $GET_PIP_PATH = "C:\get-pip.py"