seckcoder
diff --git a/‎.gitignore
Lines changed: 2 additions & 0 deletions b/‎.gitignore
Lines changed: 2 additions & 0 deletions
diff --git a/‎.mailmap
Lines changed: 30 additions & 2 deletions b/‎.mailmap
Lines changed: 30 additions & 2 deletions
diff --git a/‎MANIFEST.in
Lines changed: 1 addition & 3 deletions b/‎MANIFEST.in
Lines changed: 1 addition & 3 deletions
diff --git a/‎Makefile
Lines changed: 5 additions & 4 deletions b/‎Makefile
Lines changed: 5 additions & 4 deletions
diff --git a/‎README-py3k.rst
Lines changed: 1 addition & 1 deletion b/‎README-py3k.rst
Lines changed: 1 addition & 1 deletion
diff --git a/‎README.rst
Lines changed: 4 additions & 1 deletion b/‎README.rst
Lines changed: 4 additions & 1 deletion
diff --git a/‎benchmarks/bench_covertype.py
Lines changed: 3 additions & 2 deletions b/‎benchmarks/bench_covertype.py
Lines changed: 3 additions & 2 deletions
diff --git a/‎benchmarks/bench_plot_fastkmeans.py
Lines changed: 2 additions & 2 deletions b/‎benchmarks/bench_plot_fastkmeans.py
Lines changed: 2 additions & 2 deletions
diff --git a/‎doc/Makefile
Lines changed: 1 addition & 1 deletion b/‎doc/Makefile
Lines changed: 1 addition & 1 deletion
diff --git a/‎doc/conf.py
Lines changed: 2 additions & 1 deletion b/‎doc/conf.py
Lines changed: 2 additions & 1 deletion
diff --git a/‎doc/datasets/twenty_newsgroups_fixture.py
Lines changed: 1 addition & 1 deletion b/‎doc/datasets/twenty_newsgroups_fixture.py
Lines changed: 1 addition & 1 deletion
diff --git a/‎doc/developers/index.rst
Lines changed: 82 additions & 23 deletions b/‎doc/developers/index.rst
Lines changed: 82 additions & 23 deletions
@@ -37,3 +37,5 @@ nips2010_pdf/
 *.nt.bz2
 *.tar.gz
 *.tgz
+
+examples/cluster/joblib
@@ -1,8 +1,12 @@
 Gael Varoquaux <gael.varoquaux@normalesup.org> gvaroquaux <gael.varoquaux@normalesup.org>
 Gael Varoquaux <gael.varoquaux@normalesup.org> Gael varoquaux <gael.varoquaux@normalesup.org>
 Gael Varoquaux <gael.varoquaux@normalesup.org> GaelVaroquaux <gael.varoquaux@normalesup.org>
+Gael Varoquaux <gael.varoquaux@normalesup.org> Varoquaux <varoquau@normalesup.org>
 Olivier Grisel <olivier.grisel@ensta.org> ogrisel <olivier.grisel@ensta.org>
+Olivier Grisel <olivier.grisel@ensta.org> Olivier Grisel <ogrisel@turingcarpet.(none)>
 Alexandre Gramfort <alexandre.gramfort@inria.fr> Alexandre Gramfort <alexandre.gramfort@gmail.com>
+Alexandre Gramfort <alexandre.gramfort@inria.fr> Alexandre Gramfort <alexandre.gramfort@m4x.org>
+Alexandre Gramfort <alexandre.gramfort@inria.fr> Alexandre Gramfort <gramfort@localhost.(none)>
 Matthieu Perrot <matthieu.perrot@cea.fr> Matthieu Perrot <revilyo@earth.(none)>
 Matthieu Perrot <matthieu.perrot@cea.fr> revilyo <revilyo@earth.(none)>
 Vincent Michel <vincent.michel@inria.fr> vincent <vincent@vincent.org>
@@ -12,6 +16,7 @@ Vincent Michel <vincent.michel@inria.fr> Vincent M <vm.michel@gmail.com>
 Vincent Michel <vincent.michel@inria.fr> Vincent Michel <vincent.michel@logilab.fr>
 Vincent Michel <vincent.michel@inria.fr> Vincent M <vincent.michel@logilab.fr>
 Vincent Michel <vincent.michel@inria.fr> Vincent michel <vmic@crater2.logilab.fr>
+Vincent Michel <vincent.michel@inria.fr> Vincent Michel <vm.michel@gmail.com>
 Ariel Rokem <arokem@berkeley.edu> arokem <arokem@berkeley.edu>
 Bertrand Thirion <bertrand.thirion@inria.fr> bthirion <bertrand.thirion@inria.fr>
 Peter Prettenhofer <peter.prettenhofer@gmail.com> pprett <peter.prettenhofer@gmail.com>
@@ -23,19 +28,42 @@ James Bergstra <james.bergstra@gmail.com> james.bergstra <james.bergstra@gmail.c
 Xinfan Meng <mxf3306@gmail.com> mxf <mxf@chomsky.localdomain>
 Jan Schlüter <scikit-learn@jan-schlueter.de> f0k <scikit-learn@jan-schlueter.de>
 Vlad Niculae <vlad@vene.ro> vene <vlad@vene.ro>
-Andreas Müller <amueller@ais.uni-bonn.de> amueller <amueller@ais.uni-bonn.de>
 Virgile Fritsch <virgile.fritsch@gmail.com> VirgileFritsch <virgile.fritsch@gmail.com>
 Virgile Fritsch <virgile.fritsch@gmail.com> Virgile <virgile.fritsch@gmail.com>
+Virgile Fritsch <virgile.fritsch@gmail.com> Virgile <virgile@virgile-Precision-M4400.(none)>
 Jean Kossaifi <jean.kossaifi@gmail.com> Jean  KOSSAIFI <jkossaifi@is208616.intra.cea.fr>
 Jean Kossaifi <jean.kossaifi@gmail.com> JeanKossaifi <jean.kossaifi@gmail.com>
-Jake Vanderplas <vanderplas@astro.washington.edu> Jacob Vanderplas <jakevdp@yahoo.com>
+Jean Kossaifi <jean.kossaifi@gmail.com> Jean Kossaifi <kossaifi@is208616.intra.cea.fr>
+Jake VanderPlas <vanderplas@astro.washington.edu> Jacob Vanderplas <jakevdp@yahoo.com>
+Jake VanderPlas <vanderplas@astro.washington.edu> Jake Vanderplas <jakevdp@yahoo.com>
+Jake VanderPlas <vanderplas@astro.washington.edu> Jake Vanderplas <vanderplas@astro.washington.edu>
 Andreas Mueller <amueller@ais.uni-bonn.de> Andy <amueller@ais.uni-bonn.de>
+Andreas Mueller <amueller@ais.uni-bonn.de> unknown <Andreas Mueller@MSRC-3645211.europe.corp.microsoft.com>
 Andreas Mueller <amueller@ais.uni-bonn.de> andy <andy@marvin>
 Andreas Mueller <amueller@ais.uni-bonn.de> Andreas Mueller <amueller@templateimage.ista.local>
+Andreas Mueller <amueller@ais.uni-bonn.de> Andreas Müller <amueller@ais.uni-bonn.de>
 Brian Holt <bh00038@cvplws63.eps.surrey.ac.uk> bdholt1 <bdholt1@gmail.com>
+Brian Holt <bh00038@cvplws63.eps.surrey.ac.uk> Brian Holt <bdholt1@gmail.com>
 Robert Layton <robertlayton@gmail.com> robertlayton <robertlayton@gmail.com>
+Robert Layton <robertlayton@gmail.com> = <robertlayton@gmail.com>
 Fabian Pedregosa <fabian@fseoane.net> Fabian Pedregosa <fabian.pedregosa@inria.fr>
 Lars Buitinck <L.J.Buitinck@uva.nl> Lars Buitinck <larsmans@gmail.com>
 Lars Buitinck <L.J.Buitinck@uva.nl> unknown <Lars@.(none)>
 Lars Buitinck <L.J.Buitinck@uva.nl> Lars Buitinck <l.j.buitinck@uva.nl>
 DraXus <draxus@gmail.com> draxus <draxus@hammer.ugr>
+Edouard DUCHESNAY <ed203246@is206877.intra.cea.fr>  Edouard Duchesnay <duchesnay@is143433.(none)>
+Edouard DUCHESNAY <ed203246@is206877.intra.cea.fr> Edouard Duchesnay <edouard.duchesnay@gmail.com>
+Edouard DUCHESNAY <ed203246@is206877.intra.cea.fr> duchesnay <edouard.duchesnay@gmail.com>
+Edouard DUCHESNAY <ed203246@is206877.intra.cea.fr> duchesnay <edouard@is2206219.(none)> 
+Emmanuelle Gouillart <emmanuelle.gouillart@nsup.org> Emmanuelle Gouillart <emma@aleph.(none)>
+Emmanuelle Gouillart <emmanuelle.gouillart@nsup.org> emmanuelle <emmanuelle.gouillart@nsup.org> 
+Gilles Louppe <g.louppe@gmail.com> Gilles Louppe <g.louppe@ulg.ac.be>
+Nelle Varoquaux <nelle.varoquaux@gmail.com> Nelle Varoquaux <nelle@phgroup.com>
+Nicolas Pinto <pinto@alum.mit.edu> Nicolas Pinto <pinto@mit.edu>
+Olivier Hervieu <olivier.hervieu@gmail.com> Olivier Hervieu <olivier.hervieu@tinyclues.com>
+Satrajit Ghosh <satra@mit.edu> Satrajit Ghosh <satrajit.ghosh@gmail.com>
+Shiqiao Du <lucidfrontier.45@gmail.com> Shiqiao Du <s.du@freebit.net>
+Shiqiao Du <lucidfrontier.45@gmail.com> Shiqiao <lucidfrontier.45@gmail.com>
+Tim Sheerman-Chase <t.sheerman-chase@surrey.ac.uk> Tim Sheerman-Chase <ts00051@ts00051-desktop.(none)>
+Vincent Schut <schut@sarvision.nl> Vincent Schut <vincent@TIMO.(none)>
+iBayer <mane.desk@gmail.com> ibayer <mane.desk@gmail.com>
@@ -1,7 +1,5 @@
 include *.rst
-include test.py
-include scikits/__init__.py
 recursive-include doc *
 recursive-include examples *
 recursive-include sklearn *.c *.h *.pyx
-recursive-include sklearn/datasets *.csv *.csv.gz *.TXT *.rst *.jpg *.txt
+recursive-include sklearn/datasets *.csv *.csv.gz *.rst *.jpg *.txt
@@ -10,11 +10,11 @@ CTAGS ?= ctags
 all: clean inplace test
 
 clean-pyc:
-	find . -name "*.pyc" | xargs rm -f
+	find sklearn -name "*.pyc" | xargs rm -f
 
 clean-so:
-	find . -name "*.so" | xargs rm -f
-	find . -name "*.pyd" | xargs rm -f
+	find sklearn -name "*.so" | xargs rm -f
+	find sklearn -name "*.pyd" | xargs rm -f
 
 clean-build:
 	rm -rf build
@@ -36,13 +36,14 @@ test-doc:
 	doc/developers doc/tutorial/basic doc/tutorial/statistical_inference
 
 test-coverage:
+	rm -rf coverage .coverage
 	$(NOSETESTS) -s --with-coverage --cover-html --cover-html-dir=coverage \
 	--cover-package=sklearn sklearn
 
 test: test-code test-doc
 
 trailing-spaces:
-	find . -name "*.py" | xargs perl -pi -e 's/[ \t]*$$//'
+	find sklearn -name "*.py" | xargs perl -pi -e 's/[ \t]*$$//'
 
 cython:
 	find sklearn -name "*.pyx" | xargs $(CYTHON)
 
@@ -16,7 +16,7 @@ of these is:
 To generate python3 compatible sources for selected modules, run the
 2to3 tool on the module::
 
-    2to3 -wn --no-diffs scikits/learn/$module
+    2to3 -wn --no-diffs sklearn/$module
 
 If you would like to help with porting to python3, please propose
 yourself in the scikit-learn mailing list:
 
@@ -76,5 +76,8 @@ source directory (you will need to have nosetest installed)::
 
     python -c "import sklearn; sklearn.test()"
 
-See web page http://scikit-learn.sourceforge.net/install.html#testing
+See web page http://scikit-learn.org/stable/install.html#testing
 for more information.
+
+    Random number generation can be controled during testing by setting
+    the SKLEARN_SEED environment variable
@@ -52,6 +52,7 @@
 
 from time import time
 import os
+import sys
 import numpy as np
 from optparse import OptionParser
 
@@ -182,7 +183,7 @@ def benchmark(clf):
     'alpha': 0.001,
     'n_iter': 2,
     }
-classifiers['SGD'] = SGDClassifier( **sgd_parameters)
+classifiers['SGD'] = SGDClassifier(**sgd_parameters)
 
 ######################################################################
 ## Train CART model
@@ -207,7 +208,7 @@ def benchmark(clf):
 selected_classifiers = opts.classifiers.split(',')
 for name in selected_classifiers:
     if name not in classifiers:
-        op.error('classifier %r unknwon')
+        op.error('classifier %r unknown' % name)
         sys.exit(1)
 
 print("")
 
@@ -42,7 +42,7 @@ def compute_bench(samples_range, features_range):
             # let's prepare the data in small chunks
             mbkmeans = MiniBatchKMeans(init='k-means++',
                                       k=10,
-                                      chunk_size=chunk)
+                                      batch_size=chunk)
             tstart = time()
             mbkmeans.fit(data)
             delta = time() - tstart
@@ -78,7 +78,7 @@ def compute_bench_2(chunks):
         tstart = time()
         mbkmeans = MiniBatchKMeans(init='k-means++',
                                     k=8,
-                                    chunk_size=chunk)
+                                    batch_size=chunk)
 
         mbkmeans.fit(X)
         delta = time() - tstart
 
@@ -106,4 +106,4 @@ doctest:
 	      "results in $(BUILDDIR)/doctest/output.txt."
 
 download-data:
-	python -c "from scikits.learn.datasets.lfw import check_fetch_lfw; check_fetch_lfw()"
+	python -c "from sklearn.datasets.lfw import check_fetch_lfw; check_fetch_lfw()"
@@ -73,7 +73,7 @@
 # built documents.
 #
 # The short X.Y version.
-version = '0.11'
+version = '0.12'
 # The full version, including alpha/beta/rc tags.
 import sklearn
 release = sklearn.__version__
@@ -220,6 +220,7 @@
 # Additional stuff for the LaTeX preamble.
 latex_preamble = """
 \usepackage{amsmath}\usepackage{amsfonts}\usepackage{bm}\usepackage{morefloats}
+\usepackage{enumitem} \setlistdepth{10}
 """
 
 # Documents to append as an appendix to all manuals.
 
@@ -6,7 +6,7 @@
 from os.path import exists
 from os.path import join
 from nose import SkipTest
-from scikits.learn.datasets import get_data_home
+from sklearn.datasets import get_data_home
 
 
 def setup_module(module):
 
@@ -75,29 +75,24 @@ repository <http://github.com/scikit-learn/scikit-learn/>`__ on GitHub:
 
         $ git clone git@github.com:YourLogin/scikit-learn.git
 
- 4. Work on this copy, on your computer, using Git to do the version
-    control::
+ 4. Create a branch to hold your changes::
 
-        $ git add modified_files
-        $ git commit
-        $ git push origin master
-
-    and so on.
+        $ git checkout -b my-feature
 
-If your changes are not just trivial fixes, it is better to directly
-work in a branch with the name of the feature you are working on. In
-this case, replace step 4 with step 5:
+    and start making changes. Never work in the ``master`` branch!
 
-  5. Create a branch to host your changes and publish it on your public
-     repo::
+ 5. Work on this copy, on your computer, using Git to do the version
+    control. When you're done editing, do::
 
-        $ git checkout -b my-feature
         $ git add modified_files
         $ git commit
-        $ git push origin my-feature
 
-When you are ready, and you have pushed your changes to your GitHub repo, go
-the web page of the repo, and click on 'Pull request' to send us a pull
+    to record your changes in Git, then push them to GitHub with::
+
+        $ git push -u origin my-feature
+
+Finally, go to the web page of the your fork of the scikit-learn repo,
+and click 'Pull request' to send your changes to the maintainers for review.
 request. This will send an email to the committers, but might also send an
 email to the mailing list in order to get more visibility.
 
@@ -109,8 +104,7 @@ email to the mailing list in order to get more visibility.
   to use instead of ``origin``. If we choose the name ``upstream`` for it, the
   command will be::
 
-        $ git remote add upstream git@github.com:scikit-learn/scikit-learn.git
-
+        $ git remote add upstream https://github.com/scikit-learn/scikit-learn.git
 
 (If any of the above seems like magic to you, then look up the
 `Git documentation <http://git-scm.com/documentation>`_ on the web.)
@@ -156,6 +150,8 @@ You can also check for common programming errors with the following tools:
         $ pip install nose coverage
         $ nosetests --with-coverage path/to/tests_for_package
 
+      see also :ref:`testing_coverage`
+
     * No pyflakes warnings, check with::
 
         $ pip install pyflakes
@@ -185,13 +181,13 @@ and Cython optimizations.
   on all new contributions will get the overall code base quality in the
   right direction.
 
-EasyFix Issues
---------------
+Easy Issues
+-----------
 
 A great way to start contributing to scikit-learn is to pick an item from the
-list of `EasyFix issues
-<https://github.com/scikit-learn/scikit-learn/issues?labels=EasyFix>`_
-in the issue tracker.  Resolving these issues allow you to start contributing
+list of `Easy issues
+<https://github.com/scikit-learn/scikit-learn/issues?labels=Easy>`_
+in the issue tracker. Resolving these issues allow you to start contributing
 to the project without much prior knowledge. Your assistance in this area will
 be greatly appreciated by the more experienced developers as it helps free up
 their time to concentrate on other issues.
@@ -230,13 +226,76 @@ it.
    slightly differently. To get the best results, you should use version
    1.0.
 
+.. _testing_coverage:
+
+Testing and improving test coverage
+------------------------------------
+
+High-quality `unit testing <http://en.wikipedia.org/wiki/Unit_testing>`_
+is a corner-stone of the sciki-learn development process. For this
+purpose, we use the `nose <http://nose.readthedocs.org/en/latest/>`_
+package. The tests are functions appropriately names, located in `tests`
+subdirectories, that check the validity of the algorithms and the
+different options of the code.
+
+The full scikit-learn tests can be run using 'make' in the root folder.
+Alternatively, running 'nosetests' in a folder will run all the tests of
+the corresponding subpackages.
+
+We expect code coverage of new features to be at least around 90%.
+
+.. note:: **Workflow to improve test coverage**
+
+   To test code coverage, you need to install the `coverage
+   <http://pypi.python.org/pypi/coverage>`_ package in addition to nose.
+
+   1. Run 'make test-coverage'. The output lists for each file the line
+      numbers that are not tested.
+
+   2. Find a low hanging fruit, looking at which lines are not tested,
+      write or adapt a test specifically for these lines.
+
+   3. Loop.
+
+
+
 Developers web site
 -------------------
 
 More information can be found on the `developer's wiki
 <https://github.com/scikit-learn/scikit-learn/wiki>`_.
 
 
+Issue Tracker Tags
+------------------
+All issues and pull requests on the
+`Github issue tracker <https://github.com/scikit-learn/scikit-learn/issues>`_
+should have (at least) one of the following tags:
+
+:Bug / Crash:
+    Something is happening that clearly shouldn't happen.
+    Wrong results as well as unexpected errors from estimators go here.
+
+:Cleanup / Enhancement:
+    Improving performance, usability, consistency.
+
+:Documentation:
+    Missing, incorrect or sub-standard documentations and examples.
+
+:New Feature:
+    Feature requests and pull requests implementing a new feature.
+
+There are two other tags to help new contributors:
+
+:Easy:
+    This issue can be tackled by anyone, no experience needed.
+    Ask for help if the formulation is unclear.
+
+:Moderate:
+    Might need some knowledge of machine learning or the package,
+    but is still approachable for someone new to the project.
+
+
 Other ways to contribute
 ========================