10000 news_classifier/README.md at develop · josejuanmartinez/news_classifier · GitHub
[go: up one dir, main page]

Skip to content

Latest commit

 

History

History
27 lines (20 loc) · 1.34 KB

README.md

File metadata and controls

27 lines (20 loc) · 1.34 KB

KAGGLE NEWS CLASSIFIER

A Huffpost (from KAGGLE) News Classifier using different statistical and neural algorithms

Kaggle Huffpost News Classifier Example

Note: Doc2Vec models ARE NOT IN THIS REPOSITORY, download them from https://github.com/jhlau/doc2vec and install them in /embeddings folder

Example:

python Classifier.py --test_size 0.25 --algo 'Naive Bayes' 'Decision Tree' 'Adaboost' 'Support Vector Machine' 'Random Forest' 'Gradient Descent' --min_feat_size 1 --cleanse --embeddings 'enwiki_dbow'

Usage: Classif
2E14
ier.py [-h] [--cleanse] [--features FEATURES [FEATURES ...]]
                     [--algo ALGO [ALGO ...]] [--min_feat_size MIN_COMB_SIZE]
                     [--embeddings EMBEDDINGS] [--test_size TEST_SIZE]

Runs Classifier

optional arguments:
  -h, --help            show this help message and exit
  --cleanse
  --features FEATURES (Example: --features 'd2v' 'link' 'authors' 'headline' 'short_description')
  --algo ALGO (Example: --algo 'Naive Bayes' 'Decision Tree' 'Adaboost' 'Support Vector Machine' 'Random Forest' 'Gradient Descent')
  --min_feat_size MIN_COMB_SIZE (Example: 1 for all combinations)
  --embeddings EMBEDDINGS (Example: 'apnews_dbow' OR 'enwiki_dbow' OR ...)
  --test_size TEST_SIZE (Example: 0.25)
0