DOC writing fixes in text_analytics/working_with_text_data.rst (#25184) · npache/scikit-learn@e2dd391

Commit e2dd391

DOC writing fixes in text_analytics/working_with_text_data.rst (scikit-learn#25184)
1 parent 1f3c1be commit e2dd391

1 file changed: +4 -4 lines changed

doc/tutorial/text_analytics/working_with_text_data.rst

Lines changed: 4 additions & 4 deletions
@@ -48,7 +48,7 @@ The tutorial folder should contain the following sub-folders:
 
 
 You can already copy the skeletons into a new folder somewhere
-on your hard-drive named ``sklearn_tut_workspace`` where you
+on your hard-drive named ``sklearn_tut_workspace``, where you
 will edit your own files for the exercises while keeping
 the original skeletons intact:
 
@@ -92,7 +92,7 @@ manually from the website and use the :func:`sklearn.datasets.load_files`
 function by pointing it to the ``20news-bydate-train`` sub-folder of the
 uncompressed archive folder.
 
-In order to get faster execution times for this first example we will
+In order to get faster execution times for this first example, we will
 work on a partial dataset with only 4 categories out of the 20 available
 in the dataset::
 
@@ -136,7 +136,7 @@ document in the training set. In this case the category is the name of the
 newsgroup which also happens to be the name of the folder holding the
 individual documents.
 
-For speed and space efficiency reasons ``scikit-learn`` loads the
+For speed and space efficiency reasons, ``scikit-learn`` loads the
 target attribute as an array of integers that corresponds to the
 index of the category name in the ``target_names`` list. The category
 integer id of each sample is stored in the ``target`` attribute::
@@ -292,7 +292,7 @@ Now that we have our features, we can train a classifier to try to predict
 the category of a post. Let's start with a :ref:`naïve Bayes <naive_bayes>`
 classifier, which
 provides a nice baseline for this task. ``scikit-learn`` includes several
-variants of this classifier; the one most suitable for word counts is the
+variants of this classifier, and the one most suitable for word counts is the
 multinomial variant::
 
   >>> from sklearn.naive_bayes import MultinomialNB
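
The context lines of the third hunk describe how the ``target`` attribute stores each sample's category as an integer index into the ``target_names`` list. A minimal plain-Python sketch of that encoding follows. It does not call scikit-learn, and the category names, the sample labels, and the helper functions ``encode_targets`` and ``decode_targets`` are hypothetical, chosen only for illustration:

```python
# Illustrative sketch only: this mimics the integer encoding described in the
# tutorial text and is not scikit-learn's own implementation.

def encode_targets(labels, target_names):
    """Replace each category name with its index in target_names."""
    index_of = {name: i for i, name in enumerate(target_names)}
    return [index_of[label] for label in labels]


def decode_targets(target, target_names):
    """Recover category names from their integer ids."""
    return [target_names[t] for t in target]


# Hypothetical category names and per-document labels (assumed for the example).
target_names = sorted({"sci.med", "comp.graphics", "alt.atheism"})
labels = ["comp.graphics", "sci.med", "comp.graphics", "alt.atheism"]

target = encode_targets(labels, target_names)
print(target)                                # [1, 2, 1, 0]
print(decode_targets(target, target_names))  # the original label strings
```

Storing small integers instead of repeated strings is what the changed sentence means by speed and space efficiency: the names are kept once in ``target_names`` and each sample carries only an index.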
