Skip to content

Predict a body text's classification tag using PySpark with Word2Vec and all vs one approach.

Notifications You must be signed in to change notification settings

bastidas/PySpark_Classify_Text

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

Predict a classification tag for a body of text in a all vs one strategy. The final output is a file, classification.pkl, that contains a row tuple for each of the top 100 tags in the training data set: ("some tag name", [prediction_values]*len(number of test cases))

  • Uses PySpark and Word2Vec

About

Predict a body text's classification tag using PySpark with Word2Vec and all vs one approach.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published