Update branch by arka204 · Pull Request #5 · arka204/scikit-learn · GitHub
Update branch #5
Merged
merged 251 commits into from
May 22, 2020
251 commits
913da3f
MNT Fix higgs boson benchmark (#16740)
NicolasHug Mar 22, 2020
dcfb3df
MAINT Define custom markers for pytest (#16652)
cmarmo Mar 23, 2020
3a72410
DOC replace boston dataset in example plot_outlier_detection (#16687)
lucyleeow Mar 24, 2020
7733f5e
amend breast cancer (#16751)
lucyleeow Mar 24, 2020
104b736
DOC add permutation importance to GradientBoostingRegressor (#16742)
nilichen Mar 24, 2020
942001a
DOC Update contributing guide for flake8-diff (#16758)
johannfaouzi Mar 24, 2020
36ebf3e
[MRG] Monotonic constraints for GBDT (#15582)
NicolasHug Mar 24, 2020
c5d1d45
DOC Minor update to random state glossary (#16735)
NicolasHug Mar 25, 2020
4bb4856
DOC Change number of classes in lfw_pair documentation (#16761)
lucgiffon Mar 25, 2020
e5a1ba0
MNT minor optimization for subsampling in GBDT (#16766)
NicolasHug Mar 25, 2020
fb2b01f
DOC update links to https in the README (#16774)
timvink Mar 27, 2020
fa1ea2a
ENH Adds support for pandas dataframe with only sparse arrays (#16728)
thomasjpfan Mar 27, 2020
0a866ec
MNT New options for higgs-boson benchmark (#16779)
NicolasHug Mar 27, 2020
88ce8cd
MNT remove coupling between pipeline methods (#16777)
NicolasHug Mar 27, 2020
a84ba0c
DOC simplify pipeline docstring example (#16780)
NicolasHug Mar 28, 2020
ada94ae
DOC Fix broken link in whats_new for 0.23 (#16796)
justmarkham Mar 29, 2020
5a43408
Fix mypy errors (#16726)
rth Mar 30, 2020
a2ccc33
STY PEP8 fix E731 in linear_model/_logistic.py (#16786)
Mar 30, 2020
d6d624d
API make __init__ params in decomposition kwonly (#16722)
adrinjalali Mar 31, 2020
ab01816
MNT avoid thread limit for non nested parts of KMeans (#16499)
jeremiedbb Mar 31, 2020
44c44a5
DOC minimal docstring fix + UG for feature selection (#16810)
NicolasHug Mar 31, 2020
97a6b08
API make __init__ args in ensemble kwonly (#16724)
adrinjalali Mar 31, 2020
360e828
DOC Use default colors (#16808)
NicolasHug Mar 31, 2020
c2f5fe3
MNT remove deprecated externals.joblib (#16814)
NicolasHug Mar 31, 2020
c84ad60
DOC Fix Confidence Interval Again (#16809)
roryhr Apr 1, 2020
a5dfba9
DOC docstring styles updates to dbscan (#16819)
NicolasHug Apr 1, 2020
37776a0
CLN Removes unneeded code in histgradientboosting (#16820)
thomasjpfan Apr 1, 2020
557218c
FIX Truncated SVD should raise a NotFittedError if not fitted (#16821)
alexitkes Apr 2, 2020
2d25ea7
DOC Add link to User Guide from Pipeline section (#16829)
justmarkham Apr 2, 2020
395cfd7
BLD Sets minimal supported numpy version in pyproject.toml (#16823)
BenjaminLiuPenrose Apr 4, 2020
1116182
DOC Remove duplicated docstring in MinMaxScaler (#16835)
NicolasHug Apr 4, 2020
ff67812
DOC Remove warning from ColumnTransformer in User Guide (#16830)
justmarkham Apr 5, 2020
16b3bf7
MNT Raise error when parameter isn't correct in SVC and… (#16736)
NicolasHug Apr 5, 2020
b2bc40c
FIX Improve TruncatedSVD.transform on sparse csc matrices (#16837)
PandaTinker Apr 5, 2020
1735d3d
DOC Improve CountVectorizer and TfidfVectorizer return type (#16790)
bmaingret Apr 5, 2020
bcc24c9
DOC Add missing comma in naive_bayes.rst (#16844)
CirQ Apr 5, 2020
7b5c703
ENH Add fitted check for kde (#16762)
quangngd Apr 5, 2020
dba4514
CI Pins pillow version to fix ci failure (#16843)
thomasjpfan Apr 5, 2020
b568f20
DOC Update random_state descriptions for Kernel Approximation (#16838)
DatenBiene Apr 5, 2020
fc3601e
correct spelling mistake from not effect to no effect (#16854)
londumas Apr 6, 2020
e3e9137
DOC Updates to SVM User Guide (#16769)
NicolasHug Apr 6, 2020
eaf0a04
FIX Libsvm and liblinear rand() fix for convergence on windows target…
smarie Apr 6, 2020
270c673
TST Fixes comment in test to reflect actual missing value (#16872)
thomasjpfan Apr 7, 2020
a655de5
[MRG] MNT Fixes for PCA with n_components='mle' (#16841)
NicolasHug Apr 7, 2020
f45c0c0
DOC phrasing improvements and typos (#16744)
raduspaimoc Apr 9, 2020
fe1ec1c
DOC Update random_state descriptions for ensemble/_base (#16847)
DatenBiene Apr 10, 2020
9a4bd37
DOC replace Boston in gradient_boosting.py (#16891)
lucyleeow Apr 10, 2020
0447dd9
DOC added example to permutation importance (#16460)
magda-zielinska Apr 10, 2020
c381750
API feature_selection's constructor params -> kwonly (#16867)
adrinjalali Apr 10, 2020
f31e25b
ENH Better error message needed when accidentall… (#16861)
MichaelShoemaker Apr 10, 2020
0a93fc9
DOC Clarify what can be passed to the plotting utilities (#16893)
justmarkham Apr 10, 2020
3b79637
Merge pull request #1 from scikit-learn/master
arka204 Apr 11, 2020
cadafb7
DOC Fix grammar and clarify VotingRegressor (#16896)
lucyleeow Apr 11, 2020
8122e77
DOC Fix typos, wording in plot_gradient_boosting_regression.py (#16894)
lucyleeow Apr 11, 2020
c2b31ac
DOC replace Boston in _classes.py (#16892)
lucyleeow Apr 14, 2020
a2d361b
DOC Fix wording, typo in plot_voting_regressor.py (#16895)
lucyleeow Apr 14, 2020
9cc5558
DOC remove boston from tutorial.rst (#16889)
lucyleeow Apr 14, 2020
9901d8d
Fix _deprecate_positional_args for kwonly args w/o default (#16850)
thomasjpfan Apr 14, 2020
bd9fd0f
DOC Minor updates to the Decision Tree User Guide (#16905)
justmarkham Apr 14, 2020
5e2d74b
DOC add versionadded versionchanged v0.19 (#16233)
noatamir Apr 15, 2020
6cd77c2
API make feature_extraction's constructors' params kwonly (#16866)
adrinjalali Apr 15, 2020
cb9ddbb
TST Replace boston in ensemble test_bagging (#16921)
lucyleeow Apr 15, 2020
9d366a4
ENH Add custom loss support for HistGradientBoosting (#16908)
gbolmier Apr 15, 2020
9358a6e
DOC Update random_state description for Multiclass (#16839)
DatenBiene Apr 15, 2020
abfb6fd
[MRG] Add jitter to LassoLars (#15179)
angelaambroz Apr 17, 2020
269afa3
DOC Fixed Plot Mnist Example (#16200)
marimeireles Apr 17, 2020
7a77214
DOC Improve neighbors documentation (#16923)
haochunchang Apr 17, 2020
2d03d78
MNT simplify xfail check marking logic (#16949)
NicolasHug Apr 17, 2020
522ecac
DOC Fix docstring issue in parametrize_with_checks (#16953)
thomasjpfan Apr 17, 2020
5abd22f
FIX Bug in mark_xfail_checks (#16954)
thomasjpfan Apr 18, 2020
cb49ad4
MNT removed _safe_tags utility (#16950)
NicolasHug Apr 19, 2020
5dfca46
DOC DataConversionWarning: Add example to doc of DataCon… (#16704)
danielvangelder Apr 19, 2020
a0e6b95
MNT add pip-wheel-metadata to gitignore
jnothman Apr 19, 2020
670b85c
ENH ColumnTransformer.get_feature_names() handles passthrough (#14048)
lrjball Apr 19, 2020
4d9478f
DOC User Guide and docs for LDA and QDA (#16243)
NicolasHug Apr 19, 2020
dc0cc6e
[MRG] API kwonly args in impute, inspection, kernel_ridge (#16926)
adrinjalali Apr 19, 2020
b4757f7
ENH add vlines_ attribute to PDP Display to hide deciles (#15785)
NicolasHug Apr 19, 2020
6973096
DOC details on the use of xfail_checks (#16968)
NicolasHug Apr 20, 2020
7e15285
API make gaussian_process __init__ params kwarg (#16870)
adrinjalali Apr 20, 2020
c8e0558
DOC Add link, fix wording of KNeighborsRegressor (#16969)
lucyleeow Apr 20, 2020
7736062
DOC Libsvm liblinear rand fix - minor doc and header fixes (#16979)
smarie Apr 21, 2020
c531bd0
API Adds missing keyword only argument to PCA (#16975)
thomasjpfan Apr 21, 2020
f82a2cb
replace boston (#16922)
lucyleeow Apr 21, 2020
089c8a1
[MRG] MNT requires_y tag with y=None validation (#16622)
NicolasHug Apr 22, 2020
5f6dfcb
API Deprecate positional arguments in pipeline (#16997)
thomasjpfan Apr 22, 2020
1523f39
MNT fix memory leak in elkan KMeans (#17000)
jeremiedbb Apr 22, 2020
388999b
API Deprecate positional arguments in tree module (#16966)
thomasjpfan Apr 22, 2020
79df406
API Deprecate positional arguments in random_projection (#16995)
thomasjpfan Apr 22, 2020
6717c6a
API Deprecate positional arguments in semi_supervised module (#16974)
thomasjpfan Apr 22, 2020
8010cad
API Deprecate positional arguments in svm module (#16973)
thomasjpfan Apr 23, 2020
fbae1ed
API kwonly args in base (#17006)
NicolasHug Apr 23, 2020
bbedfa0
use semilogx for a more readable xaxis (#17001)
mathurinm Apr 23, 2020
49d213d
[MRG] API kwonly for neural_network module (#17002)
NicolasHug Apr 23, 2020
946fdde
BUG Fix instability issue of ARDRegression (with speedup) (#16849)
NicolasHug Apr 23, 2020
a93b15f
ENH Poisson loss for HistGradientBoostingRegressor (#16692)
Apr 23, 2020
3693492
DOC Add link and fix typo in nearest neighbours estimators (#17017)
lucyleeow Apr 23, 2020
2592edd
MNT Fixes DataConversionWarning doctest in pypy (#16965)
thomasjpfan Apr 23, 2020
e54cd3c
API Deprecate positional arguments in preprocessing (#16996)
thomasjpfan Apr 23, 2020
923b13c
API Adds defaults to Display Objects (#16933)
thomasjpfan Apr 23, 2020
7844d1c
TST Replace boston in histgradboost test_predictor (#16918)
lucyleeow Apr 23, 2020
88ba943
MNT Remove redundant doctest ELLIPSIS annotations (#16992)
jnothman Apr 23, 2020
e392bfd
MNT Add pre-comit configuration (#16957)
rth Apr 23, 2020
94d8911
DOC Fix typo in ensemble.rst (#16999)
lucyleeow Apr 23, 2020
facd117
TST Replace boston dataset in test_permutation_importance.py (#17020)
lucyleeow Apr 24, 2020
0d04de2
API kwonly for naive_bayes (#17003)
NicolasHug Apr 24, 2020
361bf75
DOC Fix link to user guide (#16989)
stephendavidmarsh Apr 24, 2020
2587a03
[MRG] DOC fix ref for ParameterSampler (#16983)
NicolasHug Apr 24, 2020
89993d2
Update the URL of valgrind-python.supp in the doc (#17029)
ogrisel Apr 24, 2020
501ee56
TST Replace Boston dataset in test_impute (#17025)
lucyleeow Apr 24, 2020
2955d9f
API kwonly for neighbors module (#17004)
NicolasHug Apr 24, 2020
02309ff
API kwonly args in manifold, metrics, mixture, model_selection, multc…
adrinjalali Apr 24, 2020
be134a2
TST Replace Bostond dataset in test_iforest (#17031)
lucyleeow Apr 25, 2020
923d887
DOC Fix a typo in comment (#17037)
H-Yin Apr 25, 2020
fc6ee00
MNT Fix 'clf' variable naming in test_forest (#16929)
lucyleeow Apr 26, 2020
e8a42ae
CI Remove pin now that Pillow issue is fixed (#17043)
hugovk Apr 26, 2020
28c08d0
DOC fix typos in cross validation user guide (#17042)
Apr 26, 2020
4755ae7
DOC Add versionchanged and versionadded for v0.20 (#16199)
borovikova Apr 26, 2020
81c1e9d
API kwonly for utils (#17007)
NicolasHug Apr 26, 2020
f621d8a
Revert "API kwonly for utils (#17007)" (#17045)
rth Apr 26, 2020
a14953a
ENH buffer openml stream rather than reading all at once (#16084)
jnothman Apr 26, 2020
a35b892
Fix Mypy issue in _openml.py (#17047)
NicolasHug Apr 26, 2020
7ede028
DOC add detail about flip_y parameter in make_classification (#17049)
tianchuliang Apr 26, 2020
f624f4e
DOC cleaning up to 0.23/whats new (#17015)
adrinjalali Apr 27, 2020
76ef8b0
API kwonly for utils (#17046)
NicolasHug Apr 27, 2020
839b356
DOC Removes examples from exceptions docstrings (#17040)
thomasjpfan Apr 27, 2020
1b119c4
MNT fix generate_authors_table.py (#17011)
adrinjalali Apr 27, 2020
91e9427
DOC Make release highlights the first gallery section (#16952)
thomasjpfan Apr 27, 2020
a1261a7
DOC kwonly section in whatsnew (#17059)
NicolasHug Apr 27, 2020
1ba0651
API make load_* args in datasets kwarg only (#16719)
adrinjalali Apr 27, 2020
5b2c931
API pairwise_distances will require explicit V/VI param if Y is given…
jnothman Apr 27, 2020
41b18fe
ENH Added n_components_ to SparsePCA and MiniBatchSparsePCA (#16981)
Reksbril Apr 27, 2020
fc00415
Deprecate class support for check_estimator (#17032)
NicolasHug Apr 27, 2020
964c830
MNT Change print_changed_only default to True (#17061)
NicolasHug Apr 27, 2020
dbc3593
DOC Version added and changed labels added for v0.18 (#wimlds) (#16222)
brigitteunger Apr 27, 2020
fb76de7
DOC Exchanging Boston for california dataset in plot missing values (…
maikia Apr 28, 2020
8b4d4f4
DOC Add whats new for missing PRs (#17066)
thomasjpfan Apr 28, 2020
3deacb9
MNT consistently call import_array() after cimport of numpy (#17054)
grlee77 Apr 28, 2020
1d3a553
[MRG] Fix LinearModelsCV for loky backend. (#14264)
jeremiedbb Apr 28, 2020
5435408
DOC markup fixes for change log
jnothman Apr 28, 2020
9f015c8
FIX Mixed bool dtype in pandas (#17008)
thomasjpfan Apr 28, 2020
acbe13c
ENH Adds pandas IntegerArray support to check_array (#16508)
thomasjpfan Apr 28, 2020
1bd7404
DOC Fixes formating in whats new (#17076)
thomasjpfan Apr 28, 2020
2dd12af
DOC Improve claim prediction example (#16648)
ogrisel Apr 28, 2020
1c69a8a
DOC small typos and fixes for poisson example (#17078)
Apr 29, 2020
ad6a9f9
TST Skips derivative check on 32bit platforms (#17073)
thomasjpfan Apr 29, 2020
9f04837
DOC Update funders. (#17079)
cmarmo Apr 29, 2020
b9403f6
DOC Better headers in Poisson regression example (#17080)
ogrisel Apr 29, 2020
ee2508c
ENH Adds HTML visualizations for estimators (#14180)
thomasjpfan Apr 29, 2020
3a6c8c4
DOC Remove unnecessary comment (#17091)
lucyleeow Apr 30, 2020
0c0b834
DOC detail fit_intercept docstring (#17096)
bharatr21 May 1, 2020
863c1d6
DOC Feature highlights for 0.23 (#17062)
NicolasHug May 1, 2020
c71a1c2
MISC fix rst syntax (#17098)
GaelVaroquaux May 1, 2020
04d2e32
Speedup MultiTaskLasso (#17021)
agramfort May 2, 2020
5756205
DOC Adds release highlights to front page (#17071)
thomasjpfan May 3, 2020
8b1b281
EXA Remove stray executable flag from example (#17116)
ckastner May 3, 2020
4fae539
DOC Correcting an attribute's name (#17110)
May 4, 2020
a0c76ce
MNT Remove sklearn logger default StreamHandler (#16451)
cdeil May 4, 2020
a670bb9
add dtreevis to related packages #17105 (#17113)
codeloop May 4, 2020
962849a
DOC Add tslearn to related projects (#17109)
Abo7atm May 4, 2020
2cad437
STY Adjust line height of code blocks (#17094)
thomasjpfan May 4, 2020
bd3fb2a
DOC Remove unmaintained related projects (#17112)
rth May 5, 2020
b2b88d3
MNT bump master version to 0.24.dev0 (#17121)
adrinjalali May 5, 2020
f23b940
FIX Adjusts html_repr based on configuration (#17093)
thomasjpfan May 5, 2020
70f0062
DOC: move missing projects from wiki to related_projects (#17129)
cmarmo May 5, 2020
5ceaad1
DOC add authors and release date to whats_new/0.23 (#17122)
adrinjalali May 5, 2020
b9f7b9d
DOC 0.23/whats_new hotfix (#17131)
adrinjalali May 5, 2020
f72d971
ENH Add random_state parameter to AffinityPropagation (#16801)
cmarmo May 5, 2020
226e5c4
MNT Use Vt instead of V as returned by svd() (#17100)
NicolasHug May 5, 2020
2a5c22a
Delegate choice of final model to sub class in LinearModelCV (#17099)
mathurinm May 6, 2020
8529567
DOC highlights improvements re estimator visualisation (#17136)
jnothman May 7, 2020
89fcef7
MNT Removed duplicate method calls in pickle estimator test (#17152)
mloning May 7, 2020
92ed385
DOC Fix deprecation warning in plot_roc_crossval (#17155)
lucyleeow May 8, 2020
ad6fc80
DOC fix indentation in cv parameter description (#17161)
NicolasHug May 8, 2020
c7a4bc9
TST Replace Boston dataset in test_validation (#17146)
lucyleeow May 8, 2020
0d29d2e
DOC Fix links and clarify plot_random_dataset.py (#17150)
lucyleeow May 8, 2020
e2256c1
TST Replace Boston dataset in test_coordinate_descent.py (#17153)
lucyleeow May 8, 2020
c36c104
DOC Ensure All Attributes are Documented: OAS: location_ added (#17163)
hhnnhh May 9, 2020
464dc37
Merge pull request #2 from scikit-learn/master
arka204 May 10, 2020
77279d6
Common check for sample weight invariance with removed samples (#16507)
rth May 10, 2020
818e16a
Revert "Common check for sample weight invariance with removed sample…
rth May 10, 2020
865069c
DOC Fix documentation of default values in base, birch (#16195)
brigitteunger May 10, 2020
192109a
MNT properly activate the env in the linting CI (#17177)
rth May 10, 2020
534f8ae
DOC add versionadded versionchanged for 0.21 (#16737)
Reksbril May 10, 2020
4f5384b
MNT Remove class support check estimator and parametrize_with_checks …
NicolasHug May 10, 2020
a6a20f2
MNT fix test for binary crossentropy in hist-GBDT (#16691)
May 11, 2020
350b865
DOC Add treelite to related projects (#17166)
NicolasHug May 12, 2020
b4e215c
FEA Additional `TimeSeriesSplit` Functionality (#13204)
kykosic May 12, 2020
a4dcada
DOC add an example in docstring of OneVsOneClassifier (#16700)
j0rd1smit May 12, 2020
d1f8a16
MAINT change default value for min/max of IterativeImputer (#16493)
DarshanGowda0 May 12, 2020
d40d993
CLN Refactors _encode into two functions (#17101)
thomasjpfan May 12, 2020
4b92a77
MNT Cleaner cythn cdef loss function in SGD (#17191)
May 12, 2020
cbe9f08
MNT Put back and properly deprecate MaskedArray (#17199)
NicolasHug May 13, 2020
ccf5c36
FIX don't modify sample weight inplace in KMeans (#17204)
jeremiedbb May 13, 2020
5f9555b
FIX changed_only=True with kwargs parameters (#17205)
NicolasHug May 13, 2020
7316a19
DOC fix typo in 0.22 and 0.23 whats new files (#17211)
ogrisel May 13, 2020
7117a63
MNT Removes voting classifer mention (#17196)
thomasjpfan May 13, 2020
85d24b3
pyproject.toml: Update numpy dependency on PyPy to 1.14.0 (#17215)
jvesely May 14, 2020
78a213b
DOC use Ames housing for transformed_target example (#16741)
lucyleeow May 14, 2020
8abe0a8
MNT Replaced kwargs by named args for train_test_split (#17216)
hugolmn May 14, 2020
bbab854
FIX Fix path to what's new in the navigation bar. (#17223)
alfaro96 May 14, 2020
ca344be
DOC Removes duplicate this in the release history. (#17220)
alfaro96 May 14, 2020
8595fe4
DOC Fix argument ref in Isomap docs (#17217)
AdamSpannbauer May 14, 2020
3510a79
DOC Fix formating in docstrings _regression.py (#17226)
lucyleeow May 14, 2020
90d00da
FIX Number of threads in KMeans should not be bigger than number of c…
jeremiedbb May 15, 2020
350438c
MNT Add threadpoolctl in show_versions (#17241)
jeremiedbb May 15, 2020
05e4c92
DOC update whatsnew_0.23 with removal of _safe_tags() (#17245)
rtavenar May 15, 2020
b0cd595
MNT Ignore xfail_checks tag in check_estimator, with warning (#17222)
NicolasHug May 15, 2020
d03dfa2
MNT Removed name parameter from private check generators (#17224)
NicolasHug May 15, 2020
9d677b5
DOC Fix and extend mixed dtype column transformer example (#17088)
ogrisel May 16, 2020
b70472b
DOC correct estimator name in example (#17239)
gio8tisu May 16, 2020
a4b59c7
DOC fix links in truncated SVD docs (#17194)
NicolasHug May 16, 2020
0db5f36
ENH Move threadpoolctl outside of iteration loop in KMeans (#17235)
jeremiedbb May 16, 2020
0ba19ad
MNT Fix incorrect source code link for wrapped objects (#17247)
alfaro96 May 17, 2020
b6e839c
CLN Remove duplicate line (#17254)
changebo May 17, 2020
f58c2bc
FEA Add staged_predict to HistGradientBoosting (#16985)
haochunchang May 17, 2020
a497523
DOC Update plot_column_transformer to notebook style (#17028)
lucyleeow May 17, 2020
89d9729
DOC Add explanation of why iterative imputer is experimental (#17115)
skeller88 May 17, 2020
60de458
DOC Removes uneeded __init__ in API documentation (#17243)
thomasjpfan May 18, 2020
1986c89
DOC exchange Boston for Diabetes at plot model complexity influence (…
maikia May 18, 2020
30788de
DOC Use california data in compose.rst (#16871)
lucyleeow May 18, 2020
f2e873f
FIX/ENH CheckingClassifier support parameters and sparse matrices (#1…
glemaitre May 18, 2020
29f83f8
DOC Fix parametrize_with_checks link (#17263)
rth May 18, 2020
79b0943
DOC avoid FutureWarnings for deprecations examples (#17264)
May 18, 2020
43c0efa
BUG Fixes cython code for ppc arch (#17201)
thomasjpfan May 18, 2020
4744e28
DOC prepare whats_new for 0.23.1 (#17270)
adrinjalali May 18, 2020
b632762
DOC Add threadpoolctl requirement to documentation (#17273)
alfaro96 May 18, 2020
3ff3981
DOC Replace boston dataset in ensemble.rst (#16876)
lucyleeow May 19, 2020
d46663c
DOC Update/clarify plot_unveil_tree_structure.py (#16942)
lucyleeow May 19, 2020
e770715
MNT Completes position arg deprecation (#17272)
thomasjpfan May 19, 2020
9bc5c1d
TST replace boston by diabetes dataset in weigh_boosting (#17154)
lucyleeow May 19, 2020
2d12bb5
CI Check default doctest directives in linter (#17269)
lucyleeow May 19, 2020
82011bb
DOC Clarify wording in haversine (#17280)
bharatr21 May 19, 2020
f6b8bc0
ENH add importance_getter to RFE* and SelectFromModel
venkyyuvy May 20, 2020
dd51dc6
DOC tweak to what's new
jnothman May 20, 2020
923fdc3
DOC Update year in copyright statement (#17281)
alfaro96 May 20, 2020
5dcdd89
MNT/TST Replace boston by synthetic dataset in ensemble test_forest (…
lucyleeow May 20, 2020
26b2678
DOC Fix RandomizedSearchCV's docstring by changing the cv_results_ da…
barberogaston May 20, 2020
b4db36d
DOC Update plot_mahalanobis_distances to notebook style (#17089)
lucyleeow May 20, 2020
049d71b
TST Replace boston dataset in test_plot_partial_dependence (#17144)
lucyleeow May 20, 2020
0deaa3b
DOC Amend plot_digits_classification.py to use notebook style (#17016)
lucyleeow May 20, 2020
b3dca3d
DOC update PCA random_state description (#17293)
crispinlogan May 20, 2020
1931e14
Remove deprecated utils (#17133)
NicolasHug May 20, 2020
ac8cbb3
[MRG] More deprecation for 0.24 (#17142)
NicolasHug May 20, 2020
7cc815c
[MRG] More deprecation for 0.24 (#17142)
NicolasHug May 20, 2020
2f26540
ENH use colors from style in tree plotting (#17187)
amueller May 21, 2020
e1c44ba
DOC invite underrepresented groups to contribute (#16567)
amueller May 22, 2020
a1cc566
DOC Updated deprecation section in classes.rst (#17296)
NicolasHug May 22, 2020
a0a23d2
DOC Adds GaussianMixture and BayesianGaussianMixture examples (#15193)
darioka May 22, 2020
15a658f
Merge pull request #3 from scikit-learn/master
arka204 May 22, 2020
ec786b3
Merge branch 'alpha-close-or-equal-0-update' into alpha-1
arka204 May 22, 2020
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -39,6 +39,7 @@ doc/samples
*.prof
.tox/
.coverage
pip-wheel-metadata

lfw_preprocessed/
nips2010_pdf/
22 changes: 22 additions & 0 deletions .pre-commit-config.yaml
@@ -0,0 +1,22 @@
repos:
- repo: https://github.com/pre-commit/pre-commit-hooks
rev: v2.3.0
hooks:
- id: check-yaml
- id: end-of-file-fixer
- id: trailing-whitespace
- repo: https://gitlab.com/pycqa/flake8
rev: 3.7.8
hooks:
- id: flake8
types: [file, python]
# only check for unused imports for now, as long as
# the code is not fully PEP8 compatible
args: [--select=F401]
- repo: https://github.com/pre-commit/mirrors-mypy
rev: v0.730
hooks:
- id: mypy
args:
- --ignore-missing-imports
files: sklearn/
2 changes: 1 addition & 1 deletion Makefile
@@ -67,4 +67,4 @@ code-analysis:
pylint -E -i y sklearn/ -d E1103,E0611,E1101

flake8-diff:
./build_tools/circle/linting.sh
git diff upstream/master -u -- "*.py" | flake8 --diff
27 changes: 14 additions & 13 deletions README.rst
@@ -31,12 +31,12 @@ SciPy and is distributed under the 3-Clause BSD license.

The project was started in 2007 by David Cournapeau as a Google Summer
of Code project, and since then many volunteers have contributed. See
the `About us <http://scikit-learn.org/dev/about.html#authors>`__ page
the `About us <https://scikit-learn.org/dev/about.html#authors>`__ page
for a list of core contributors.

It is currently maintained by a team of volunteers.

Website: http://scikit-learn.org
Website: https://scikit-learn.org


Installation
@@ -51,6 +51,7 @@ scikit-learn requires:
- NumPy (>= 1.13.3)
- SciPy (>= 0.19.1)
- joblib (>= 0.11)
- threadpoolctl (>= 2.0.0)

**Scikit-learn 0.20 was the last version to support Python 2.7 and Python 3.4.**
scikit-learn 0.23 and later require Python 3.6 or newer.
@@ -73,21 +74,21 @@ or ``conda``::

conda install scikit-learn

The documentation includes more detailed `installation instructions <http://scikit-learn.org/stable/install.html>`_.
The documentation includes more detailed `installation instructions <https://scikit-learn.org/stable/install.html>`_.


Changelog
---------

See the `changelog <http://scikit-learn.org/dev/whats_new.html>`__
See the `changelog <https://scikit-learn.org/dev/whats_new.html>`__
for a history of notable changes to scikit-learn.

Development
-----------

We welcome new contributors of all experience levels. The scikit-learn
community goals are to be helpful, welcoming, and effective. The
`Development Guide <http://scikit-learn.org/stable/developers/index.html>`_
`Development Guide <https://scikit-learn.org/stable/developers/index.html>`_
has detailed information about contributing code, documentation, tests, and
more. We've included some basic information in this README.

@@ -120,7 +121,7 @@ source directory (you will need to have ``pytest`` >= 3.3.0 installed)::

pytest sklearn

See the web page http://scikit-learn.org/dev/developers/advanced_installation.html#testing
See the web page https://scikit-learn.org/dev/developers/advanced_installation.html#testing
for more information.

Random number generation can be controlled during testing by setting
@@ -131,15 +132,15 @@ Submitting a Pull Request

Before opening a Pull Request, have a look at the
full Contributing page to make sure your code complies
with our guidelines: http://scikit-learn.org/stable/developers/index.html
with our guidelines: https://scikit-learn.org/stable/developers/index.html


Project History
---------------

The project was started in 2007 by David Cournapeau as a Google Summer
of Code project, and since then many volunteers have contributed. See
the `About us <http://scikit-learn.org/dev/about.html#authors>`__ page
the `About us <https://scikit-learn.org/dev/about.html#authors>`__ page
for a list of core contributors.

The project is currently maintained by a team of volunteers.
@@ -153,19 +154,19 @@ Help and Support
Documentation
~~~~~~~~~~~~~

- HTML documentation (stable release): http://scikit-learn.org
- HTML documentation (development version): http://scikit-learn.org/dev/
- FAQ: http://scikit-learn.org/stable/faq.html
- HTML documentation (stable release): https://scikit-learn.org
- HTML documentation (development version): https://scikit-learn.org/dev/
- FAQ: https://scikit-learn.org/stable/faq.html

Communication
~~~~~~~~~~~~~

- Mailing list: https://mail.python.org/mailman/listinfo/scikit-learn
- IRC channel: ``#scikit-learn`` at ``webchat.freenode.net``
- Stack Overflow: https://stackoverflow.com/questions/tagged/scikit-learn
- Website: http://scikit-learn.org
- Website: https://scikit-learn.org

Citation
~~~~~~~~

If you use scikit-learn in a scientific publication, we would appreciate citations: http://scikit-learn.org/stable/about.html#citing-scikit-learn
If you use scikit-learn in a scientific publication, we would appreciate citations: https://scikit-learn.org/stable/about.html#citing-scikit-learn
18 changes: 17 additions & 1 deletion azure-pipelines.yml
@@ -17,9 +17,14 @@ jobs:
displayName: Add conda to PATH
- bash: sudo chown -R $USER $CONDA
displayName: Take ownership of conda installation
- bash: conda create --name flake8_env --yes flake8
- bash: |
set -ex
conda create --name flake8_env --yes python=3.8
source activate flake8_env
pip install flake8 mypy==0.770
displayName: Install flake8
- bash: |
set -ex
if [[ $BUILD_SOURCEVERSIONMESSAGE =~ \[lint\ skip\] ]]; then
# skip linting
echo "Skipping linting"
@@ -29,6 +34,17 @@ jobs:
./build_tools/circle/linting.sh
fi
displayName: Run linting
- bash: |
set -ex
if [[ $BUILD_SOURCEVERSIONMESSAGE =~ \[lint\ skip\] ]]; then
# skip linting
echo "Skipping linting"
exit 0
else
source activate flake8_env
mypy sklearn/ --ignore-missing-imports
fi
displayName: Run mypy
- bash: |
if [[ $BUILD_SOURCEVERSIONMESSAGE =~ \[scipy-dev\] ]] || \
[[ $BUILD_REASON == "Schedule" ]]; then
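Both the linting step and the new mypy step are skipped when the commit message contains `[lint skip]`. As a rough illustration of that check (not part of the pipeline itself), the bash test can be mirrored in Python. The helper name `should_skip_lint` is hypothetical and introduced only for this sketch:

```python
import re

# Mirrors the bash test `[[ $BUILD_SOURCEVERSIONMESSAGE =~ \[lint\ skip\] ]]`:
# the literal marker may appear anywhere in the commit message.
LINT_SKIP_PATTERN = re.compile(r"\[lint skip\]")


def should_skip_lint(commit_message: str) -> bool:
    """Return True when the commit message asks to skip linting.

    Hypothetical helper for illustration; the pipeline does this in bash.
    """
    return LINT_SKIP_PATTERN.search(commit_message) is not None


print(should_skip_lint("DOC fix typo [lint skip]"))
print(should_skip_lint("FIX KMeans thread count"))
```

The match is case-sensitive and requires exactly one space, as in the bash regular expression.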
77 changes: 36 additions & 41 deletions benchmarks/bench_hist_gradient_boosting_higgsboson.py
@@ -25,12 +25,14 @@
parser.add_argument('--learning-rate', type=float, default=1.)
parser.add_argument('--subsample', type=int, default=None)
parser.add_argument('--max-bins', type=int, default=255)
parser.add_argument('--no-predict', action="store_true", default=False)
parser.add_argument('--cache-loc', type=str, default='/tmp')
args = parser.parse_args()

HERE = os.path.dirname(__file__)
URL = ("https://archive.ics.uci.edu/ml/machine-learning-databases/00280/"
"HIGGS.csv.gz")
m = Memory(location='/tmp', mmap_mode='r')
m = Memory(location=args.cache_loc, mmap_mode='r')

n_leaf_nodes = args.n_leaf_nodes
n_trees = args.n_trees
@@ -56,6 +58,27 @@ def load_data():
return df


def fit(est, data_train, target_train, libname):
print(f"Fitting a {libname} model...")
tic = time()
est.fit(data_train, target_train)
toc = time()
print(f"fitted in {toc - tic:.3f}s")


def predict(est, data_test, target_test):
if args.no_predict:
return
tic = time()
predicted_test = est.predict(data_test)
predicted_proba_test = est.predict_proba(data_test)
toc = time()
roc_auc = roc_auc_score(target_test, predicted_proba_test[:, 1])
acc = accuracy_score(target_test, predicted_test)
print(f"predicted in {toc - tic:.3f}s, "
f"ROC AUC: {roc_auc:.4f}, ACC: {acc :.4f}")


df = load_data()
target = df.values[:, 0]
data = np.ascontiguousarray(df.values[:, 1:])
@@ -68,56 +91,28 @@ def load_data():
n_samples, n_features = data_train.shape
print(f"Training set with {n_samples} records with {n_features} features.")

print("Fitting a sklearn model...")
tic = time()
est = HistGradientBoostingClassifier(loss='binary_crossentropy',
learning_rate=lr,
max_iter=n_trees,
max_bins=max_bins,
max_leaf_nodes=n_leaf_nodes,
n_iter_no_change=None,
early_stopping=False,
random_state=0,
verbose=1)
est.fit(data_train, target_train)
toc = time()
predicted_test = est.predict(data_test)
predicted_proba_test = est.predict_proba(data_test)
roc_auc = roc_auc_score(target_test, predicted_proba_test[:, 1])
acc = accuracy_score(target_test, predicted_test)
print(f"done in {toc - tic:.3f}s, ROC AUC: {roc_auc:.4f}, ACC: {acc :.4f}")
fit(est, data_train, target_train, 'sklearn')
predict(est, data_test, target_test)

if args.lightgbm:
print("Fitting a LightGBM model...")
tic = time()
lightgbm_est = get_equivalent_estimator(est, lib='lightgbm')
lightgbm_est.fit(data_train, target_train)
toc = time()
predicted_test = lightgbm_est.predict(data_test)
predicted_proba_test = lightgbm_est.predict_proba(data_test)
roc_auc = roc_auc_score(target_test, predicted_proba_test[:, 1])
acc = accuracy_score(target_test, predicted_test)
print(f"done in {toc - tic:.3f}s, ROC AUC: {roc_auc:.4f}, ACC: {acc :.4f}")
est = get_equivalent_estimator(est, lib='lightgbm')
fit(est, data_train, target_train, 'lightgbm')
predict(est, data_test, target_test)

if args.xgboost:
print("Fitting an XGBoost model...")
tic = time()
xgboost_est = get_equivalent_estimator(est, lib='xgboost')
xgboost_est.fit(data_train, target_train)
toc = time()
predicted_test = xgboost_est.predict(data_test)
predicted_proba_test = xgboost_est.predict_proba(data_test)
roc_auc = roc_auc_score(target_test, predicted_proba_test[:, 1])
acc = accuracy_score(target_test, predicted_test)
print(f"done in {toc - tic:.3f}s, ROC AUC: {roc_auc:.4f}, ACC: {acc :.4f}")
est = get_equivalent_estimator(est, lib='xgboost')
fit(est, data_train, target_train, 'xgboost')
predict(est, data_test, target_test)

if args.catboost:
print("Fitting a Catboost model...")
tic = time()
catboost_est = get_equivalent_estimator(est, lib='catboost')
catboost_est.fit(data_train, target_train)
toc = time()
predicted_test = catboost_est.predict(data_test)
predicted_proba_test = catboost_est.predict_proba(data_test)
roc_auc = roc_auc_score(target_test, predicted_proba_test[:, 1])
acc = accuracy_score(target_test, predicted_test)
print(f"done in {toc - tic:.3f}s, ROC AUC: {roc_auc:.4f}, ACC: {acc :.4f}")
est = get_equivalent_estimator(est, lib='catboost')
fit(est, data_train, target_train, 'catboost')
predict(est, data_test, target_test)
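The benchmark change above replaces four copies of the timing code with shared `fit` and `predict` helpers. The following is a minimal, self-contained sketch of that pattern, assuming a stand-in estimator so that it runs without scikit-learn or the HIGGS data. `ConstantClassifier`, `timed_fit`, and `timed_predict` are hypothetical names, and accuracy is computed by hand instead of with `accuracy_score`:

```python
from time import perf_counter


class ConstantClassifier:
    """Hypothetical stand-in estimator: always predicts the majority class."""

    def fit(self, X, y):
        self.majority_ = max(set(y), key=list(y).count)
        return self

    def predict(self, X):
        return [self.majority_ for _ in X]


def timed_fit(est, X_train, y_train, libname):
    """Fit `est` and report the elapsed wall-clock time."""
    print(f"Fitting a {libname} model...")
    tic = perf_counter()
    est.fit(X_train, y_train)
    elapsed = perf_counter() - tic
    print(f"fitted in {elapsed:.3f}s")
    return elapsed


def timed_predict(est, X_test, y_test, no_predict=False):
    """Predict and report accuracy, unless prediction is disabled."""
    if no_predict:  # plays the role of the --no-predict flag
        return None
    tic = perf_counter()
    predicted = est.predict(X_test)
    elapsed = perf_counter() - tic
    acc = sum(p == t for p, t in zip(predicted, y_test)) / len(y_test)
    print(f"predicted in {elapsed:.3f}s, ACC: {acc:.4f}")
    return acc


X = [[0], [1], [2], [3]]
y = [0, 1, 1, 1]
est = ConstantClassifier()
timed_fit(est, X, y, "stand-in")
accuracy = timed_predict(est, X, y)
```

Passing the skip flag as an argument, rather than reading a module-level `args` as the benchmark does, is a choice made here only to keep the sketch easy to test.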
1 change: 1 addition & 0 deletions build_tools/azure/install.sh
@@ -97,6 +97,7 @@ elif [[ "$DISTRIB" == "conda-pip-latest" ]]; then
make_conda "python=$PYTHON_VERSION"
python -m pip install -U pip
python -m pip install pytest==$PYTEST_VERSION pytest-cov pytest-xdist

python -m pip install pandas matplotlib pyamg scikit-image
# do not install dependencies for lightgbm since it requires scikit-learn
python -m pip install lightgbm --no-deps
11 changes: 11 additions & 0 deletions build_tools/circle/linting.sh
@@ -161,3 +161,14 @@ then
echo $bad_deprecation_property_order
exit 1
fi

# Check for default doctest directives ELLIPSIS and NORMALIZE_WHITESPACE

doctest_directive="$(git grep -nw -E "# doctest\: \+(ELLIPSIS|NORMALIZE_WHITESPACE)")"

if [ ! -z "$doctest_directive" ]
then
echo "ELLIPSIS and NORMALIZE_WHITESPACE doctest directives are enabled by default, but were found in:"
echo "$doctest_directive"
exit 1
fi
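The new linter check fails the build when a doctest spells out `ELLIPSIS` or `NORMALIZE_WHITESPACE`, which the project already enables by default. A rough Python equivalent of the `git grep` pattern, applied to an in-memory string rather than the repository, could look like the sketch below. `find_default_directives` is a hypothetical helper, and the trailing `\b` only approximates the `-w` flag:

```python
import re

# Same alternation as the `git grep -E` call; `\b` approximates the `-w`
# (whole-word) flag by requiring a word boundary after the directive name.
DIRECTIVE = re.compile(r"# doctest: \+(ELLIPSIS|NORMALIZE_WHITESPACE)\b")


def find_default_directives(source: str):
    """Return (line_number, line) pairs that spell out a default directive."""
    return [
        (lineno, line)
        for lineno, line in enumerate(source.splitlines(), start=1)
        if DIRECTIVE.search(line)
    ]


sample = (
    ">>> clf.fit(X, y)  # doctest: +ELLIPSIS\n"
    ">>> clf.score(X, y)  # doctest: +SKIP\n"
    ">>> print(clf)  # doctest: +NORMALIZE_WHITESPACE\n"
)
hits = find_default_directives(sample)
for lineno, line in hits:
    print(f"{lineno}: {line}")
```

Other directives such as `+SKIP` are left alone, matching the intent of the shell check.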
4 changes: 2 additions & 2 deletions build_tools/generate_authors_table.py
@@ -11,14 +11,15 @@
import getpass
import time
from pathlib import Path
from os import path

print("user:", file=sys.stderr)
user = input()
passwd = getpass.getpass("Password or access token:\n")
auth = (user, passwd)

LOGO_URL = 'https://avatars2.githubusercontent.com/u/365630?v=4'
REPO_FOLDER = Path(__file__).parent.parent
REPO_FOLDER = Path(path.abspath(__file__)).parent.parent


def get(url):
@@ -100,7 +101,6 @@ def get_profile(login):
'Duchesnay': 'Edouard Duchesnay',
'Lars': 'Lars Buitinck',
'MechCoder': 'Manoj Kumar',
'jeremiedbb': 'Jérémie Du Boisberranger',
}
if profile["name"] in missing_names:
profile["name"] = missing_names[profile["name"]]
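The `REPO_FOLDER` change wraps `__file__` in `path.abspath` before climbing two directory levels. A plausible motivation, stated here as an assumption rather than taken from the commit, is that `__file__` can be a bare relative filename depending on the Python version and how the script is launched. The sketch below uses a literal string in place of `__file__` to show the difference:

```python
from os import path
from pathlib import Path

# Stand-in for a relative `__file__`, e.g. when the script is started from
# inside its own directory (assumption made for illustration).
relative_script = "generate_authors_table.py"

# Without abspath, climbing two levels from a bare filename stays at ".".
naive = Path(relative_script).parent.parent

# With abspath, the path is first anchored at the current working directory,
# so .parent.parent is the parent of that directory.
anchored = Path(path.abspath(relative_script)).parent.parent

print(naive)
print(anchored)
```

`Path(relative_script).resolve()` would be another way to get an absolute path; the diff uses `path.abspath`, so the sketch does too.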
15 changes: 5 additions & 10 deletions conftest.py
@@ -87,23 +87,18 @@ def pytest_collection_modifyitems(config, items):
def pytest_configure(config):
import sys
sys._is_pytest_session = True
# declare our custom markers to avoid PytestUnknownMarkWarning
config.addinivalue_line(
"markers",
"network: mark a test for execution if network available."
)


def pytest_unconfigure(config):
import sys
del sys._is_pytest_session


def pytest_runtest_setup(item):
if isinstance(item, DoctestItem):
set_config(print_changed_only=True)


def pytest_runtest_teardown(item, nextitem):
if isinstance(item, DoctestItem):
set_config(print_changed_only=False)


# TODO: Remove when modules are deprecated in 0.24
# Configures pytest to ignore deprecated modules.
collect_ignore_glob = [
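The `pytest_configure` hook in this diff registers the `network` marker through `config.addinivalue_line`. To show what that call records without running pytest, the sketch below passes a stub object to the hook. `StubConfig` is a hypothetical stand-in for pytest's real `Config` and implements only the one method used here:

```python
class StubConfig:
    """Hypothetical stand-in for pytest's Config; records ini lines only."""

    def __init__(self):
        self.ini_lines = {}

    def addinivalue_line(self, name, line):
        self.ini_lines.setdefault(name, []).append(line)


def pytest_configure(config):
    # Declaring the marker up front avoids PytestUnknownMarkWarning when a
    # test is decorated with @pytest.mark.network.
    config.addinivalue_line(
        "markers",
        "network: mark a test for execution if network available."
    )


config = StubConfig()
pytest_configure(config)
registered = config.ini_lines["markers"]
print(registered)
```

In a real test session pytest calls the hook itself, and the registered marker then appears in the output of `pytest --markers`.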