Multi-OCR alignment: add method=custom with user-defined prioritization #14

Open
bertsky opened this issue May 31, 2024 · 0 comments

bertsky commented May 31, 2024

Unfortunately, OCR engines and models vary widely in the quality of their confidence values, which are uncalibrated probability estimates and so are not comparable across engines. The aligner therefore has a hard time making good decisions. But sometimes users can still formulate a priority rule of their own, such as "if >80% use OCR1, else if >90% use OCR2, else use OCR3".

We should support that by adding `method="custom"` and a new parameter `priority`: a list (JSON array) of floats giving the confidence threshold for each corresponding input fileGrp, in order.
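The rule above could be sketched roughly as follows. This is only an illustration of the proposed semantics, not existing code: the function name `select_by_priority`, the `(text, confidence)` candidate shape, and the choice to fall back to the last available candidate when no threshold is exceeded are all assumptions made for this sketch.

```python
from typing import Optional, Sequence, Tuple

# Hypothetical shape of one OCR result for a segment: (text, confidence in [0, 1]).
Candidate = Tuple[str, float]


def select_by_priority(
    candidates: Sequence[Optional[Candidate]],
    priority: Sequence[float],
) -> str:
    """Pick the first candidate whose confidence exceeds its threshold.

    `candidates` and `priority` are both ordered like the input fileGrps.
    A candidate may be None if that fileGrp has no result for the segment.
    If no candidate exceeds its threshold, the last available candidate
    is used (assumed fallback, mirroring "else use OCR3").
    """
    if len(candidates) != len(priority):
        raise ValueError("need exactly one threshold per input fileGrp")
    fallback = None
    for candidate, threshold in zip(candidates, priority):
        if candidate is None:
            continue
        text, confidence = candidate
        fallback = text  # remember the last available candidate
        if confidence > threshold:
            return text
    if fallback is None:
        raise ValueError("no candidate available for this segment")
    return fallback
```

With three input fileGrps, the rule from the example would then correspond to something like `"method": "custom", "priority": [0.8, 0.9, 0.0]` in the parameter JSON, where the trailing `0.0` is a placeholder for the unconditional last choice. Whether the last threshold should be given explicitly or omitted is an open design question.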
