Update branch by arka204 · Pull Request #11 · arka204/scikit-learn · GitHub

Update branch #11

Merged
81 commits
fc474d6
ENH Release GIL in DistanceMetric when validating data (#17038)
webber26232 May 23, 2020
83b7e58
MNT Simplifies Azure pipeline linting (#17315)
thomasjpfan May 23, 2020
d5de894
ENH scipy blas for svm kernel function (#16530)
jim0421 May 24, 2020
9368545
ENH Fix averaged RMSE (#17309)
swierh May 24, 2020
612ebc8
DOC Add triage team section in "About us" page (#17305)
NicolasHug May 25, 2020
7494da6
MNT removed deprecated files generation (#17132)
NicolasHug May 25, 2020
0c84afa
DOC Rename features in highlight of feature importance (#17331)
venkyyuvy May 25, 2020
bb4b097
DOC Fix typo in quantile_transformer (#17343)
lucyleeow May 25, 2020
f42cf8a
MNT Replace kwargs by named args for resample (#17324)
alfaro96 May 25, 2020
75c76ae
DOC Add links, update plot_all_scaling (#17340)
lucyleeow May 26, 2020
76df39f
ENH allow to specify which methods should run a check in CheckingClas…
BenjaminBossan May 26, 2020
6b68144
FIX Allow sparse input data for OutputCodeClassifier (#17233)
zoj613 May 26, 2020
6f33c5c
TST Remove Boston dataset in test_mlp (#17337)
lucyleeow May 26, 2020
f93f560
ENH add support for sample weights in MAE (#17225)
lucyleeow May 27, 2020
ffd1873
DOC improve supporting types input and output in preprocessing module…
genziano May 27, 2020
6c3e179
DOC Specify the meaning of y=None in fit_transform (#17366)
alfaro96 May 27, 2020
753da1d
DOC Minor fixes (#17325)
justmarkham May 28, 2020
d42611f
DOC Emphasized determinism of ParameterGrid in docs (#17383)
zaccharieramzi Jun 1, 2020
332aaa1
DOC cleaning parameter docstrings in _dict_learning.py (#17376)
reshamas Jun 1, 2020
5816817
DOC Update preprocessor in CountVectorizer (#17413)
yagi-3 Jun 1, 2020
f27adc5
TST Replace Boston dataset in test_tree (#17290)
lucyleeow Jun 1, 2020
863e58f
ENH Add option to scale output to unit var in RobustScaler (#17193)
lucyleeow Jun 2, 2020
7229c05
DOC Improve API documentation of CalibratedClassifierCV (#15134)
pspachtholz Jun 2, 2020
337a2f0
MRG/GOV add adrin to TC in governance doc (#17344)
amueller Jun 2, 2020
015a8a5
STY Makes ordered list and unordered list look the same. (#17429)
thomasjpfan Jun 3, 2020
690b920
DOC Use non paywall url in outlier_detection.rst (#17434)
lucyleeow Jun 3, 2020
0e04239
DOC Clarify how initial conditions are chosen for NMF. (#17369)
bsmith89 Jun 3, 2020
d9d5fb4
DOC Replace Canopy installers by Enthought Deployment Manager (#17336)
alfaro96 Jun 3, 2020
b364028
MNT Enables subset_invariance tests to run with SparsePCA (#17319)
thomasjpfan Jun 3, 2020
8cce5bf
MNT fix section order of what's new v0.24 (#17437)
lucyleeow Jun 3, 2020
8feb045
Common check for sample weight invariance with removed samples (#17176)
rth Jun 3, 2020
e919022
MNT remove deprecated presort param (#17446)
NicolasHug Jun 4, 2020
9b384c5
DOC Fix typo: R_ij -> R_{ij} for Davies-Bouldin Index (#17452)
SleepyBag Jun 5, 2020
c3fd8ec
DOC Fix formating in plot_roc_curve docstring example (#17457)
lucyleeow Jun 5, 2020
a704934
replaced array_like to array-like (#17464)
muokicaleb Jun 6, 2020
58d74c1
Replaces array_like with array-like (#17463)
icoder18 Jun 6, 2020
ec1ccaf
[Sprint] Docs linear model base.py Replaced `array_like` to `array-l…
muokicaleb Jun 6, 2020
7262918
replaced array_like to array-like (#17461)
muokicaleb Jun 6, 2020
8a34a7a
Replace array_like with array-like in iss `sklearn/feature_selection/…
muokicaleb Jun 6, 2020
2653fa9
DOC shorten estimators links for sklearn/feature_selection/* and skle…
pardeep-singh Jun 6, 2020
30b672e
DOC Use tuple for shape in roc_curve docstring (#17456)
lucyleeow Jun 6, 2020
53b1385
DOC rename array_like to array-like in _dist_metrics.pyx (#17468)
muokicaleb Jun 6, 2020
a54e7e9
DOC Standarize default documentation for cross_decomposition (#17471)
kohakukun Jun 6, 2020
fb8a497
Fixed typos (#17459)
RafeyIqbalRahman Jun 6, 2020
dc01f00
DOC Enabling links to objects in docstrings of Bagging (#17467)
martinoywa Jun 6, 2020
70c45bc
DOC Minor fix to _agglomerative.py (#17454)
jaketae Jun 6, 2020
2457175
repalced array_like to array-like in file sklearn/neighbors/_kde.py (…
muokicaleb Jun 6, 2020
6998f62
DOC Enabling links to objects in docstrings of RANSACRegressor (#17479)
martinoywa Jun 6, 2020
1019f98
ENH: Added keyword arguments to make_pipeline (#17477)
loldja Jun 6, 2020
831a00a
DOC Fix documentation of default values in sklearn.utils.validation (…
neinkeinkaffee Jun 6, 2020
2253807
DOC fixed default values in doc string for plot_tree (#17496)
madelgi Jun 6, 2020
324d3b5
DOC Enable links to objects in docstrings (Sprint Issue #17417) (#17488)
poorna-kumar Jun 6, 2020
715e240
DOC added a warning to maxabs_scale to prevent data leakage (#17487)
krumeto Jun 6, 2020
d6ed0d0
DOC Fixing n_jobs doc param in multiclass.py (#17489)
annejeevan Jun 6, 2020
f110672
DOC remove ref to browser ext (#17501)
emdupre Jun 6, 2020
4eca61b
DOC Standardize formatting for default values (#17509)
amy12xx Jun 6, 2020
baf8bed
DOC standardized formatting for default values in transformer class. …
ezebunandu Jun 6, 2020
e57fab1
DOC Update default doc string values for linear_model (#17506)
genvalen Jun 6, 2020
23c48bf
DOC Standardized formatting for default values in manifold tsne (#17511)
amy12xx Jun 6, 2020
5a85627
DOC adding suggested spaces in multioutput/RegressorChain (#17498)
marenwestermann Jun 6, 2020
cacf6bf
ENH Added keyword arguments to Shuffle (#17474)
JosephTLucas Jun 6, 2020
d8be25f
DOC fixes default values documentation in sklearn/mixture module (#17…
violetr Jun 6, 2020
aeffba3
DOC Made required changes to kernel_approximation.py (#17473)
Mariam-ke Jun 6, 2020
74b445e
DOC Add example to linear_model.PoissonRegressor (#17453)
nuka137 Jun 7, 2020
b4fa62e
DOC Fix documentation of default values in cluster module (#17455)
Jun 7, 2020
7681721
DOC Add example to linear_model.GammaRegressor (#17519)
nuka137 Jun 7, 2020
b898f25
DOC Add examples to linear_model.TweedieRegressor (#17516)
nuka137 Jun 7, 2020
5297365
DOC Standarize default documentation for feature_selection (#17465)
kohakukun Jun 7, 2020
6df781b
ENH Replace *args with named arguments in make_union (#17472)
jaketae Jun 7, 2020
3f10622
DOC Add examples to PrecisionRecall and ConfusionMatrix Display (#17492)
pardeep-singh Jun 7, 2020
53befd8
DOC Update default values CalibratedClassifierCV docstring (#17494)
asubramaniyan Jun 7, 2020
c4691ff
DOC Adding scikit-lego to Related projects (#17503)
krumeto Jun 8, 2020
51df262
MNT remove boston from the common test / estimator checks (#17356)
glemaitre Jun 8, 2020
c115ed7
DOC standardize default values manifold _isomap _locally_linear (#17520)
amy12xx Jun 8, 2020
9ae41ab
changed the colors of plot_compare_cross_decompisition.py to default …
rachel-fischoff Jun 8, 2020
3ff6277
MNT Allow pull request labeler to fail (#17530)
thomasjpfan Jun 8, 2020
5d04910
ENH add score_samples method in base search CV (#17478)
teonbrooks Jun 8, 2020
7536b25
MNT Fix spelling of _check_classifier_response_method (#17529)
lucyleeow Jun 8, 2020
3c0e30a
DOC Update default values in kernel_approximation doc string (#17536)
asubramaniyan Jun 8, 2020
2968400
Merge pull request #10 from scikit-learn/master
arka204 Jun 8, 2020
7e1b649
Merge branch 'Proposition-for-BernoulliNB-and-MultinomialNB-when-alph…
arka204 Jun 8, 2020
5 changes: 1 addition & 4 deletions .codecov.yml
@@ -12,11 +12,8 @@ coverage:
default:
# Be tolerant on slight code coverage diff on PRs to limit
# noisy red coverage status on github PRs.
# Note The coverage stats are still uploaded
# Note: The coverage stats are still uploaded
# to codecov so that PR reviewers can see uncovered lines
# in the github diff if they install the codecov browser
# extension:
# https://github.com/codecov/browser-extension
target: auto
threshold: 1%

1 change: 1 addition & 0 deletions .github/workflows/labeler.yml
@@ -8,6 +8,7 @@ jobs:
runs-on: ubuntu-latest
steps:
- uses: thomasjpfan/labeler@v2.4.6
continue-on-error: true
if: github.repository == 'scikit-learn/scikit-learn'
with:
repo-token: "${{ secrets.GITHUB_TOKEN }}"
154 changes: 0 additions & 154 deletions .gitignore
@@ -77,157 +77,3 @@ _configtest.o.d
sklearn/utils/_seq_dataset.pyx
sklearn/utils/_seq_dataset.pxd
sklearn/linear_model/_sag_fast.pyx

# deprecated paths
# TODO: Remove in 0.24
# All of these files should have a match in _build_utils/deprecated_modules.py
sklearn/utils/mocking.py

sklearn/ensemble/bagging.py
sklearn/ensemble/base.py
sklearn/ensemble/forest.py
sklearn/ensemble/gradient_boosting.py
sklearn/ensemble/iforest.py
sklearn/ensemble/stacking.py
sklearn/ensemble/voting.py
sklearn/ensemble/weight_boosting.py
sklearn/tree/export.py
sklearn/tree/tree.py

sklearn/neural_network/rbm.py
sklearn/neural_network/multilayer_perceptron.py

sklearn/utils/weight_vector.py
sklearn/utils/seq_dataset.py
sklearn/utils/fast_dict.py
sklearn/utils/testing.py

sklearn/cluster/affinity_propagation_.py
sklearn/cluster/bicluster.py
sklearn/cluster/birch.py
sklearn/cluster/dbscan_.py
sklearn/cluster/hierarchical.py
sklearn/cluster/k_means_.py
sklearn/cluster/mean_shift_.py
sklearn/cluster/optics_.py
sklearn/cluster/spectral.py

sklearn/mixture/base.py
sklearn/mixture/gaussian_mixture.py
sklearn/mixture/bayesian_mixture.py

sklearn/covariance/elliptic_envelope.py
sklearn/covariance/empirical_covariance_.py
sklearn/covariance/graph_lasso_.py
sklearn/covariance/robust_covariance.py
sklearn/covariance/shrunk_covariance_.py

sklearn/cross_decomposition/cca_.py
sklearn/cross_decomposition/pls_.py

sklearn/svm/base.py
sklearn/svm/classes.py
sklearn/svm/bounds.py
sklearn/svm/libsvm.py
sklearn/svm/libsvm_sparse.py
sklearn/svm/liblinear.py

sklearn/decomposition/base.py
sklearn/decomposition/dict_learning.py
sklearn/decomposition/cdnmf_fast.py
sklearn/decomposition/factor_analysis.py
sklearn/decomposition/fastica_.py
sklearn/decomposition/incremental_pca.py
sklearn/decomposition/kernel_pca.py
sklearn/decomposition/nmf.py
sklearn/decomposition/online_lda.py
sklearn/decomposition/online_lda_fast.py
sklearn/decomposition/pca.py
sklearn/decomposition/sparse_pca.py
sklearn/decomposition/truncated_svd.py

sklearn/gaussian_process/gpr.py
sklearn/gaussian_process/gpc.py

sklearn/datasets/base.py
sklearn/datasets/california_housing.py
sklearn/datasets/covtype.py
sklearn/datasets/kddcup99.py
sklearn/datasets/lfw.py
sklearn/datasets/olivetti_faces.py
sklearn/datasets/openml.py
sklearn/datasets/rcv1.py
sklearn/datasets/samples_generator.py
sklearn/datasets/species_distributions.py
sklearn/datasets/svmlight_format.py
sklearn/datasets/twenty_newsgroups.py

sklearn/feature_extraction/dict_vectorizer.py
sklearn/feature_extraction/hashing.py
sklearn/feature_extraction/stop_words.py

sklearn/linear_model/base.py
sklearn/linear_model/bayes.py
sklearn/linear_model/cd_fast.py
sklearn/linear_model/coordinate_descent.py
sklearn/linear_model/huber.py
sklearn/linear_model/least_angle.py
sklearn/linear_model/logistic.py
sklearn/linear_model/omp.py
sklearn/linear_model/passive_aggressive.py
sklearn/linear_model/perceptron.py
sklearn/linear_model/ransac.py
sklearn/linear_model/ridge.py
sklearn/linear_model/sag.py
sklearn/linear_model/sag_fast.py
sklearn/linear_model/sgd_fast.py
sklearn/linear_model/stochastic_gradient.py
sklearn/linear_model/theil_sen.py

sklearn/metrics/cluster/bicluster.py
sklearn/metrics/cluster/supervised.py
sklearn/metrics/cluster/unsupervised.py
sklearn/metrics/cluster/expected_mutual_info_fast.py

sklearn/metrics/base.py
sklearn/metrics/classification.py
sklearn/metrics/regression.py
sklearn/metrics/ranking.py
sklearn/metrics/pairwise_fast.py
sklearn/metrics/scorer.py

sklearn/inspection/partial_dependence.py
sklearn/inspection/permutation_importance.py

sklearn/neighbors/ball_tree.py
sklearn/neighbors/base.py
sklearn/neighbors/classification.py
sklearn/neighbors/dist_metrics.py
sklearn/neighbors/graph.py
sklearn/neighbors/kd_tree.py
sklearn/neighbors/kde.py
sklearn/neighbors/lof.py
sklearn/neighbors/nca.py
sklearn/neighbors/nearest_centroid.py
sklearn/neighbors/quad_tree.py
sklearn/neighbors/regression.py
sklearn/neighbors/typedefs.py
sklearn/neighbors/unsupervised.py

sklearn/manifold/isomap.py
sklearn/manifold/locally_linear.py
sklearn/manifold/mds.py
sklearn/manifold/spectral_embedding_.py
sklearn/manifold/t_sne.py

sklearn/semi_supervised/label_propagation.py

sklearn/preprocessing/data.py
sklearn/preprocessing/label.py

sklearn/feature_selection/base.py
sklearn/feature_selection/from_model.py
sklearn/feature_selection/mutual_info.py
sklearn/feature_selection/rfe.py
sklearn/feature_selection/univariate_selection.py
sklearn/feature_selection/variance_threshold.py
2 changes: 0 additions & 2 deletions Makefile
@@ -18,8 +18,6 @@ clean-ctags:
clean: clean-ctags
$(PYTHON) setup.py clean
rm -rf dist
# TODO: Remove in when all modules are removed.
$(PYTHON) sklearn/_build_utils/deprecated_modules.py

in: inplace # just a shortcut
inplace:
18 changes: 6 additions & 12 deletions azure-pipelines.yml
@@ -13,35 +13,29 @@ jobs:
pool:
vmImage: ubuntu-18.04
steps:
- bash: echo "##vso[task.prependpath]$CONDA/bin"
displayName: Add conda to PATH
- bash: sudo chown -R $USER $CONDA
displayName: Take ownership of conda installation
- task: UsePythonVersion@0
inputs:
versionSpec: '3.8'
- bash: |
set -ex
conda create --name flake8_env --yes python=3.8
source activate flake8_env
pip install flake8 mypy==0.770
displayName: Install flake8
displayName: Install linters
- bash: |
set -ex
if [[ $BUILD_SOURCEVERSIONMESSAGE =~ \[lint\ skip\] ]]; then
# skip linting
echo "Skipping linting"
echo "Skipping flake8 linting"
exit 0
else
source activate flake8_env
./build_tools/circle/linting.sh
fi
displayName: Run linting
- bash: |
set -ex
if [[ $BUILD_SOURCEVERSIONMESSAGE =~ \[lint\ skip\] ]]; then
# skip linting
echo "Skipping linting"
echo "Skipping mypy linting"
exit 0
else
source activate flake8_env
mypy sklearn/ --ignore-missing-imports
fi
displayName: Run mypy
46 changes: 25 additions & 21 deletions build_tools/generate_authors_table.py
@@ -15,8 +15,8 @@

print("user:", file=sys.stderr)
user = input()
passwd = getpass.getpass("Password or access token:\n")
auth = (user, passwd)
token = getpass.getpass("access token:\n")
auth = (user, token)

LOGO_URL = 'https://avatars2.githubusercontent.com/u/365630?v=4'
REPO_FOLDER = Path(path.abspath(__file__)).parent.parent
@@ -38,13 +38,15 @@ def get(url):

def get_contributors():
"""Get the list of contributor profiles. Require admin rights."""
# get members of scikit-learn core-dev on GitHub
# get core devs and triage team
core_devs = []
team = 11523
for page in [1, 2]: # 30 per page
reply = get("https://api.github.com/teams/%d/members?page=%d" %
(team, page))
core_devs.extend(reply.json())
triage_team = []
for team_id, lst in zip((11523, 3593183), (core_devs, triage_team)):
for page in [1, 2]: # 30 per page
reply = get(
f"https://api.github.com/teams/{team_id}/members?page={page}"
)
lst.extend(reply.json())

# get members of scikit-learn on GitHub
members = []
@@ -55,32 +57,31 @@ def get_contributors():
members.extend(reply.json())

# keep only the logins
core_devs = [c['login'] for c in core_devs]
members = [c['login'] for c in members]
core_devs = set(c['login'] for c in core_devs)
triage_team = set(c['login'] for c in triage_team)
members = set(c['login'] for c in members)

# add missing contributors with GitHub accounts
members.extend(['dubourg', 'mbrucher', 'thouis', 'jarrodmillman'])
members |= {'dubourg', 'mbrucher', 'thouis', 'jarrodmillman'}
# add missing contributors without GitHub accounts
members.extend(['Angel Soler Gollonet'])
members |= {'Angel Soler Gollonet'}
# remove CI bots
members.remove('sklearn-ci')
members.remove('sklearn-lgtm')
members.remove('sklearn-wheels')
members -= {'sklearn-ci', 'sklearn-lgtm', 'sklearn-wheels'}
triage_team -= core_devs # remove ogrisel from triage_team

# remove duplicate, and get the difference of the two sets
core_devs = set(core_devs)
members = set(members)
emeritus = members.difference(core_devs)
emeritus = members - core_devs - triage_team

# get profiles from GitHub
core_devs = [get_profile(login) for login in core_devs]
emeritus = [get_profile(login) for login in emeritus]
triage_team = [get_profile(login) for login in triage_team]

# sort by last name
core_devs = sorted(core_devs, key=key)
emeritus = sorted(emeritus, key=key)
triage_team = sorted(triage_team, key=key)

return core_devs, emeritus
return core_devs, emeritus, triage_team


def get_profile(login):
@@ -143,10 +144,13 @@ def generate_list(contributors):

if __name__ == "__main__":

core_devs, emeritus = get_contributors()
core_devs, emeritus, triage_team = get_contributors()

with open(REPO_FOLDER / "doc" / "authors.rst", "w+") as rst_file:
rst_file.write(generate_table(core_devs))

with open(REPO_FOLDER / "doc" / "authors_emeritus.rst", "w+") as rst_file:
rst_file.write(generate_list(emeritus))

with open(REPO_FOLDER / "doc" / "triage_team.rst", "w+") as rst_file:
rst_file.write(generate_table(triage_team))
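
The set arithmetic that `get_contributors` switches to in this diff can be tried in isolation. The sketch below is only an illustration: `split_teams` is a hypothetical helper that does not exist in the script, the logins other than the CI bot names are made-up placeholders, and no GitHub API call is made.

```python
def split_teams(core_devs, triage_team, members):
    """Partition logins with set operations, mirroring the updated logic.

    All three arguments are iterables of login strings (placeholders here).
    """
    core_devs = set(core_devs)
    triage_team = set(triage_team)
    members = set(members)

    # Drop CI bot accounts. Set difference silently ignores absent names,
    # whereas list.remove() raises ValueError when the name is missing.
    members -= {'sklearn-ci', 'sklearn-lgtm', 'sklearn-wheels'}
    # A login that is in both teams is listed as a core developer only.
    triage_team -= core_devs
    # Remaining organisation members are treated as emeritus.
    emeritus = members - core_devs - triage_team
    return core_devs, emeritus, triage_team


core, emeritus, triage = split_teams(
    core_devs=['alice', 'bob'],
    triage_team=['bob', 'carol'],
    members=['alice', 'bob', 'carol', 'dave', 'sklearn-ci'],
)
print(sorted(core), sorted(emeritus), sorted(triage))
```

With these placeholder inputs, `bob` stays a core developer, `carol` is the only triage member, and `dave` ends up in the emeritus set.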
9 changes: 0 additions & 9 deletions conftest.py
@@ -13,10 +13,8 @@
import pytest
from _pytest.doctest import DoctestItem

from sklearn import set_config
from sklearn.utils import _IS_32BIT
from sklearn.externals import _pilutil
from sklearn._build_utils.deprecated_modules import _DEPRECATED_MODULES

PYTEST_MIN_VERSION = '3.3.0'

@@ -97,10 +95,3 @@ def pytest_configure(config):
def pytest_unconfigure(config):
import sys
del sys._is_pytest_session


# TODO: Remove when modules are deprecated in 0.24
# Configures pytest to ignore deprecated modules.
collect_ignore_glob = [
os.path.join(*deprecated_path.split(".")) + ".py"
for _, deprecated_path, _, _ in _DEPRECATED_MODULES]
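
For context, the removed `collect_ignore_glob` expression turned dotted module paths into file paths that pytest should skip. A minimal sketch of that conversion follows; `to_ignore_paths` is a hypothetical name, and the tuples are placeholders that only assume what the unpacking above implies (four fields, with the dotted deprecated path second).

```python
import os


def to_ignore_paths(entries):
    """Convert the dotted path in each 4-tuple into an OS-specific .py path."""
    return [
        os.path.join(*dotted_path.split(".")) + ".py"
        for _, dotted_path, _, _ in entries
    ]


# Placeholder entries; only the second field matters for this sketch.
example_entries = [
    ("a", "sklearn.utils.mocking", "b", "c"),
    ("a", "sklearn.ensemble.bagging", "b", "c"),
]
ignore_paths = to_ignore_paths(example_entries)
print(ignore_paths)
```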
7 changes: 7 additions & 0 deletions doc/about.rst
@@ -38,6 +38,13 @@ in the FAQ.

:ref:`How you can contribute to the project <contributing>`

Triage Team
-----------
The following people are active contributors who also help with triaging
issues, PRs, and general maintenance:

.. include:: triage_team.rst

Emeritus Core Developers
------------------------
The following people have been active contributors in the past, but are no
2 changes: 1 addition & 1 deletion doc/authors.rst
@@ -19,7 +19,7 @@
</div>
<div>
<a href='https://github.com/thomasjpfan'><img src='https://avatars2.githubusercontent.com/u/5402633?v=4' class='avatar' /></a> <br />
<p>Thomas J Fan</p>
<p>Thomas J. Fan</p>
</div>
<div>
<a href='https://github.com/agramfort'><img src='https://avatars2.githubusercontent.com/u/161052?v=4' class='avatar' /></a> <br />
9 changes: 5 additions & 4 deletions doc/governance.rst
@@ -71,10 +71,11 @@ subject to a two-third majority of all cast votes as well as a simple majority
approval of all the current TC members. TC members who do not actively engage
with the TC duties are expected to resign.

The initial Technical Committee of scikit-learn consists of :user:`Alexandre Gramfort <agramfort>`,
:user:`Olivier Grisel <ogrisel>`, :user:`Andreas Müller <amueller>`, :user:`Joel Nothman <jnothman>`,
:user:`Hanmin Qin <qinhanmin2014>`, :user:`Gaël Varoquaux <GaelVaroquaux>`, and
:user:`Roman Yurchak <rth>`.
The Technical Committee of scikit-learn consists of :user:`Alexandre
Gramfort <agramfort>`, :user:`Olivier Grisel <ogrisel>`, :user:`Adrin Jalali
<adrinjalali>`, :user:`Andreas Müller <amueller>`, :user:`Joel Nothman
<jnothman>`, :user:`Hanmin Qin <qinhanmin2014>`, :user:`Gaël Varoquaux
<GaelVaroquaux>`, and :user:`Roman Yurchak <rth>`.

Decision Making Process
=======================