@@ -48,7 +48,7 @@ The tutorial folder should contain the following sub-folders:
48
48
49
49
50
50
You can already copy the skeletons into a new folder somewhere
51
- on your hard-drive named ``sklearn_tut_workspace `` where you
51
+ on your hard-drive named ``sklearn_tut_workspace ``, where you
52
52
will edit your own files for the exercises while keeping
53
53
the original skeletons intact:
54
54
@@ -92,7 +92,7 @@ manually from the website and use the :func:`sklearn.datasets.load_files`
92
92
function by pointing it to the ``20news-bydate-train `` sub-folder of the
93
93
uncompressed archive folder.
94
94
95
- In order to get faster execution times for this first example we will
95
+ In order to get faster execution times for this first example, we will
96
96
work on a partial dataset with only 4 categories out of the 20 available
97
97
in the dataset::
98
98
@@ -136,7 +136,7 @@ document in the training set. In this case the category is the name of the
136
136
newsgroup which also happens to be the name of the folder holding the
137
137
individual documents.
138
138
139
- For speed and space efficiency reasons ``scikit-learn `` loads the
139
+ For speed and space efficiency reasons, ``scikit-learn `` loads the
140
140
target attribute as an array of integers that corresponds to the
141
141
index of the category name in the ``target_names `` list. The category
142
142
integer id of each sample is stored in the ``target `` attribute::
@@ -292,7 +292,7 @@ Now that we have our features, we can train a classifier to try to predict
292
292
the category of a post. Let's start with a :ref: `naïve Bayes <naive_bayes >`
293
293
classifier, which
294
294
provides a nice baseline for this task. ``scikit-learn `` includes several
295
- variants of this classifier; the one most suitable for word counts is the
295
+ variants of this classifier, and the one most suitable for word counts is the
296
296
multinomial variant::
297
297
298
298
>>> from sklearn.naive_bayes import MultinomialNB
0 commit comments