DOC Update TfidfVectorizer analyzer documentation to match CountVecto… · xhluca/scikit-learn@aa26538 · GitHub

Commit aa26538

dstine authored and Xing committed
DOC Update TfidfVectorizer analyzer documentation to match CountVectorizer (scikit-learn#12713)
1 parent 6ac8fe3 commit aa26538

1 file changed, +3 -1 lines changed

sklearn/feature_extraction/text.py

Lines changed: 3 additions & 1 deletion
@@ -1357,8 +1357,10 @@ class TfidfVectorizer(CountVectorizer):
         preprocessing and n-grams generation steps.
         Only applies if ``analyzer == 'word'``.
 
-    analyzer : string, {'word', 'char'} or callable
+    analyzer : string, {'word', 'char', 'char_wb'} or callable
         Whether the feature should be made of word or character n-grams.
+        Option 'char_wb' creates character n-grams only from text inside
+        word boundaries; n-grams at the edges of words are padded with space.
 
         If a callable is passed it is used to extract the sequence of features
         out of the raw, unprocessed input.
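
To make the difference between 'char' and 'char_wb' concrete, here is a minimal pure-Python sketch of the behaviour the new docstring lines describe. The helper names `char_ngrams` and `char_wb_ngrams` are hypothetical and are not scikit-learn functions. The sketch assumes a single n-gram size and skips the lowercasing and other preprocessing the vectorizer would normally apply, so treat it as an illustration rather than the library's implementation.

```python
def char_ngrams(text, n):
    """Character n-grams over the whole text (roughly what 'char' describes).

    Hypothetical helper: whitespace runs are collapsed to one space,
    and n-grams are free to span word boundaries.
    """
    text = " ".join(text.split())
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def char_wb_ngrams(text, n):
    """Character n-grams taken inside word boundaries only ('char_wb').

    Hypothetical helper: each word is padded with one space on both
    sides, so n-grams at the edges of a word carry that padding and
    no n-gram ever spans two words.
    """
    grams = []
    for word in text.split():
        padded = " " + word + " "
        # A padded word shorter than n still yields itself once.
        count = max(1, len(padded) - n + 1)
        grams.extend(padded[i:i + n] for i in range(count))
    return grams


if __name__ == "__main__":
    print(char_ngrams("hello world", 3))
    print(char_wb_ngrams("hello world", 3))
```

With the input "hello world" and n = 3, the 'char' variant produces the cross-word trigram "o w", while the 'char_wb' variant does not; instead it yields edge trigrams such as " he" and "ld " that include the space padding.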
