Why am I forced to specify 2 languages for the detection to work with? #161
-
You need to choose at least two languages because the confidence score for each language is calculated relative to the other languages. It's difficult to come up with a confidence metric for a single language that works independently of any other language. I'm working on it, but it may take a while. Please note that you should build the detector only once and reuse that instance throughout your program. Rebuilding the detector for each text is costly and should be avoided.
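To illustrate why a relative score says nothing when only one language is configured, here is a minimal sketch. It does not reproduce lingua's actual formula: the function `relative_confidences` and the log-scores fed into it are hypothetical, and the softmax-style normalization is only an assumed stand-in for "relative to the other languages".

```python
import math


def relative_confidences(log_scores):
    """Normalize hypothetical per-language log-scores so they sum to 1.

    This is an assumed softmax-style normalization for illustration,
    not the formula lingua uses internally.
    """
    if not log_scores:
        return {}
    # Subtract the best score before exponentiating for numerical stability.
    peak = max(log_scores.values())
    weights = {lang: math.exp(score - peak) for lang, score in log_scores.items()}
    total = sum(weights.values())
    return {lang: weight / total for lang, weight in weights.items()}


# Two candidates: the scores are compared against each other.
two = relative_confidences({"ENGLISH": -12.0, "SPANISH": -15.0})

# One candidate: a good fit and a terrible fit both normalize to 1.0.
one_good = relative_confidences({"ENGLISH": -12.0})
one_bad = relative_confidences({"ENGLISH": -90.0})

print(two)
print(one_good, one_bad)
```

With a single candidate the normalized value is always 1.0 regardless of how well the text fits, so a meaningful single-language mode would need some absolute measure instead.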
-
I would also like a single-language mode. I have a (large) corpus in which polyglot has identified 165 different languages. Right now I want better accuracy for the ENGLISH entries, so I am trying lingua. I've set up a two-language model (ENGLISH and SPANISH), as they are the two most frequent languages, but:
That said, I'm very impressed by the accuracy so far.
-
I have a project where players can choose their language, and only messages in that language are then sent to their chat.
This is impossible (or at least unreliable) to implement, because this code is illegal: