Kecilin-Team/Donut-Docs-Implementation

Donut-Docs-Implementation

This repository uses Donut (Document Understanding Transformer) to extract information from scanned images in the TNI-AU dataset. The document types are:

• Decree_On_Implementation.

• Military_Education_Certificate.

• Rank_Promotion_Degree.


[UPDATE]

Model accuracy:

  • Rank_Promotion_Degree: 4 samples, Tree Edit Distance (TED) based accuracy 0.7174657534246576, F1 score 0.8181818181818182

  • Decree_On_Implementation: 9 samples, Tree Edit Distance (TED) based accuracy 0.7151542241291141, F1 score 0.4727272727272727

  • Military_Education_Certificate: 3 samples, Tree Edit Distance (TED) based accuracy 0.23286052009456262, F1 score 0.24
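To make the F1 figures above easier to interpret, here is a minimal sketch of a field-level F1 score over predicted and ground-truth JSON outputs. It is an illustration only, not necessarily the evaluation code used in this repository: the function names `flatten` and `field_f1`, the dotted-key convention, and the sample field names (`name`, `rank`, `number`) are all assumptions. It counts a predicted field as correct only when both its key and its value match the ground truth exactly.

```python
from collections import Counter


def flatten(node, prefix=""):
    """Flatten nested dicts/lists into (dotted_key, value_string) pairs."""
    if isinstance(node, dict):
        pairs = []
        for key, value in node.items():
            child = f"{prefix}.{key}" if prefix else key
            pairs.extend(flatten(value, child))
        return pairs
    if isinstance(node, list):
        pairs = []
        for item in node:
            pairs.extend(flatten(item, prefix))
        return pairs
    # Leaf value: compare as a stripped string.
    return [(prefix, str(node).strip())]


def field_f1(predictions, references):
    """Micro-averaged F1 over exact (key, value) matches across all samples."""
    tp = fp = fn = 0
    for pred, ref in zip(predictions, references):
        pred_fields = Counter(flatten(pred))
        ref_fields = Counter(flatten(ref))
        matched = sum((pred_fields & ref_fields).values())
        tp += matched
        fp += sum(pred_fields.values()) - matched  # predicted but wrong
        fn += sum(ref_fields.values()) - matched   # expected but missing
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Hypothetical sample: one of three fields is predicted incorrectly.
pred = {"name": "Budi", "rank": "Kapten", "number": "123"}
ref = {"name": "Budi", "rank": "Mayor", "number": "123"}
score = field_f1([pred], [ref])
```

Because this score requires exact matches, a single wrong character in a field counts as both a false positive and a false negative. A TED-based score instead gives partial credit for near-matching structure, which is one reason the two metrics can diverge for the same document type.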


[UPDATE]

All Data Files:
