bastidas/PySpark_Classify_Text: Predict the classification tag of a body of text using PySpark with Word2Vec and a one-vs-all approach.

Predict a classification tag for a body of text using a one-vs-all strategy. The final output is a file, classification.pkl, that contains one row tuple for each of the top 100 tags in the training data set: ("some tag name", [prediction_values]), where the list holds one prediction value per test case.
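To make the output layout concrete, here is a minimal sketch of writing and reading a file shaped like classification.pkl. It assumes the file is a pickled list of (tag, predictions) tuples and that each prediction value is 0.0 or 1.0; the tag names, the values, and the temporary path are hypothetical, and the notebook may store the rows differently.

```python
import os
import pickle
import tempfile

# Hypothetical results: one (tag, predictions) tuple per tag,
# with one prediction value per test case (3 test cases here).
results = [
    ("python", [0.0, 1.0, 0.0]),
    ("spark", [1.0, 0.0, 0.0]),
]

# Write the rows to a pickle file (a temp directory is used for illustration).
path = os.path.join(tempfile.mkdtemp(), "classification.pkl")
with open(path, "wb") as f:
    pickle.dump(results, f)

# Read the file back and pivot it: which tags were predicted for each test case?
with open(path, "rb") as f:
    loaded = pickle.load(f)

n_cases = len(loaded[0][1])
tags_per_case = [
    [tag for tag, preds in loaded if preds[i] == 1.0]
    for i in range(n_cases)
]
```

Because every tag is predicted independently, a test case can end up with several tags or with none, as the third case does in this toy data.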

Uses PySpark and Word2Vec

Repository files (2 commits):
PySpark_Classify_Text.ipynb
README.md
