Idefics2-OCR

This repository fine-tunes the HuggingFaceM4/idefics2-8b model on the nielsr/docvqa_1200_examples_donut dataset of document VQA pairs. Check out Idefics2-OCR on Hugging Face.

Find the rest of the training details here.

Finetune

Set your wandb (Weights & Biases) token in the .env file as WANDB_API.
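How the training script reads this value is not shown in this README; a package such as python-dotenv is a common choice. As a minimal standard-library sketch, assuming the .env file contains a line of the form WANDB_API=<your token>, the helper below copies KEY=VALUE pairs into the process environment. The name load_env_file is hypothetical and is not part of this repository.

```python
import os
from pathlib import Path


def load_env_file(path=".env"):
    """Read KEY=VALUE lines from a .env file into os.environ.

    Hypothetical helper for illustration; not part of this repository.
    Returns a dict of the pairs that were loaded.
    """
    loaded = {}
    env_path = Path(path)
    if not env_path.exists():
        return loaded
    for raw_line in env_path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        # Drop surrounding whitespace and optional quotes around the value.
        value = value.strip().strip('"').strip("'")
        os.environ[key] = value
        loaded[key] = value
    return loaded


values = load_env_file()
# WANDB_API is the variable name this README asks for.
print("WANDB_API found in .env:", "WANDB_API" in values)
```

Keep the .env file out of version control so the token is not committed by accident.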

Install the requirements

pip install -r requirements.txt

Finetune Idefics

python3 idefics2.py --wandb True
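The command above passes True as an explicit value to --wandb. How idefics2.py parses that flag is not shown in this README. The following is a minimal sketch, assuming an argparse-based script, of one way to accept such a value; str_to_bool and build_parser are hypothetical names. A converter is used because argparse's type=bool would treat any non-empty string, including "False", as True.

```python
import argparse


def str_to_bool(value):
    """Convert strings such as 'True' or 'false' to a bool."""
    lowered = value.strip().lower()
    if lowered in {"true", "1", "yes"}:
        return True
    if lowered in {"false", "0", "no"}:
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser():
    """Build an illustrative parser with a single --wandb option."""
    parser = argparse.ArgumentParser(description="Fine-tuning options (illustrative)")
    parser.add_argument(
        "--wandb",
        type=str_to_bool,
        default=False,
        help="Enable Weights & Biases logging",
    )
    return parser


# Parse the same arguments as the command shown above.
args = build_parser().parse_args(["--wandb", "True"])
print("wandb logging enabled:", args.wandb)
```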

Run the app

You don't have to fine-tune the model to run the app; the model is loaded from Hugging Face.

python3 app.py

