You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi @srivatsan88 ,
I am working on a problem of text classification where the labels are quite similar, like
Bad reputation
customer issues
delays
good reputation
The thing is there is a major overlap between the first 3 labels, as many have common words and could fall into multiple categories.
Example - Delay could have words overlapping with bad reputation, same way with customer issues and bad reputation.
Is there any good approach to be taken that can ensure good metrics?
And what would be an ideal number of data points required. Currently there is only about 6000 data points.
Cheers.
The text was updated successfully, but these errors were encountered:
adithyaan-creator
changed the title
What would be the best way to approach a classification probelm with similar labels?
What would be the best way to approach a classification problem with similar labels?
Jan 16, 2021
Hi @srivatsan88 ,
I am working on a problem of text classification where the labels are quite similar, like
The thing is there is a major overlap between the first 3 labels, as many have common words and could fall into multiple categories.
Example - Delay could have words overlapping with bad reputation, same way with customer issues and bad reputation.
Is there any good approach to be taken that can ensure good metrics?
And what would be an ideal number of data points required. Currently there is only about 6000 data points.
Cheers.
The text was updated successfully, but these errors were encountered: