Not so good by estimating high confidence for random text/gibberish #113
Unanswered
GabrielKesler
asked this question in
Q&A
Replies: 1 comment 2 replies
-
Hi @GabrielKesler, thanks for your question. Have you read the documentation about the confidence metric? It is a relative metric, i.e. the most likely language always gets the value What is the point of feeding the language detector with gibberish text anyway? This is a very contrived example. I don't think that the texts you want to classify are of this sort. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
I have this code snippet:
Resulting in this:
Seems that this library is giving very high confidence values for gibberish/random words, which is unacceptable.
Any suggestions ?
Beta Was this translation helpful? Give feedback.
All reactions