You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A recent small-scale in-house experiments show that Tesseract 5 (T5)'s outperforms Tesseract 4 in terms of accuracy measure (CER). Hence we would like to update this app to T5 instead of retiring it in favor of apps based on larger models (such as doctr or llava). This will keep an entry in the app directory for a "good-enough" text recognition under restricted HW environment.
A recent small-scale in-house experiments show that Tesseract 5 (T5)'s outperforms Tesseract 4 in terms of accuracy measure (CER). Hence we would like to update this app to T5 instead of retiring it in favor of apps based on larger models (such as doctr or llava). This will keep an entry in the app directory for a "good-enough" text recognition under restricted HW environment.
I'd also like to take this update work as an opportunity to partially address clamsproject/app-role-filler-binder#4 issue, in that the updated T5 wrapper app should use the current docTR app as a reference implementation to organize input/output MMIF structure. This means, we would like to wrap tesseract's internal structure understanding scheme in our vocabulary terms. Concretely, in docTR-wrapper, we translated
Page
: no translation as we only deal with single-page scenario (one image at a time)Block
-->Paragraph
from LAPPS vocabLine
-->Sentence
from LAPPS vocabWord
-->Token
from LAPPS vocabSimilarly with T5, we translate
Page
: no translation as we only deal with single-page scenario (one image at a time)Block
-->Paragraph
from LAPPS vocabPar
: no translation as we already used LAPPSParagraph
, and there's no middle level betweenParagraph
andSentence
in LAPPS vocab.Line
-->Sentence
from LAPPS vocabWord
-->Token
from LAPPS vocabThis will naturally address #18 once implemented.
The text was updated successfully, but these errors were encountered: