Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Approach state of art #158

Open
arjunmenon opened this issue Mar 6, 2017 · 1 comment
Open

Approach state of art #158

arjunmenon opened this issue Mar 6, 2017 · 1 comment

Comments

@arjunmenon
Copy link

arjunmenon commented Mar 6, 2017

How do you include the TF-IDF weights in this method?
Compared to simple MNB having count of plain bag of words, MNB with TF-IDF gets more accuracy.
How do you implement this?
www.cs.waikato.ac.nz/~eibe/pubs/kibriya_et_al_cr.pdf

Also, as a bonus a complement naive bayes further improves the problem of inconsistent dataset size, which with plain MNB favours the larger one.
This scenario is fairly common.
Some inputs on that as well would be appreciated,.

@Ch4s3
Copy link
Member

Ch4s3 commented Mar 6, 2017

We've discussed TF-IDF, but I haven't had time to dig into it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants