Named Entity Recognition (NER) from Images

Overview

This project is an experimental implementation of Named Entity Recognition (NER) from images using the Phi 3.5 Vision Instruct model. The system takes an image and extracts meaningful entities directly from it. It was developed as part of a hackathon challenge, and as such, it remains a work in progress. Several features and optimizations are planned but have not yet been completed.

Project Status

⚠️ Important:
This project was built for a hackathon and is not a complete, production-ready solution. The current implementation works in basic scenarios but lacks extensive error handling, performance optimizations, and support for all edge cases.

Features

Uses Phi 3.5 Vision Instruct model for extracting entities from images.
Input: An image containing text (e.g., receipts, documents, posters).
Output: Named entities such as person names, organizations, and dates.
Limited preprocessing for handling noise in image data.
Hackathon build — expect incomplete features and limitations.

Setup

Prerequisites

Python 3.8+
PyTorch (CUDA support recommended for faster processing)
Required libraries:
- transformers
- torchvision
- Pillow

🎊🎊 top 10% 🎊🎊

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
better_images		better_images
images		images
multi_process		multi_process
phi3		phi3
processed_images		processed_images
student_resource		student_resource
README.md		README.md
bert.ipynb		bert.ipynb
cleaned_predictions_model.csv		cleaned_predictions_model.csv
csv.rar		csv.rar
inference.ipynb		inference.ipynb
inference0tolast.csv		inference0tolast.csv
peft.ipynb		peft.ipynb
phi_inference.ipynb		phi_inference.ipynb
preprocess.ipynb		preprocess.ipynb
report.pdf		report.pdf
simple.ipynb		simple.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Named Entity Recognition (NER) from Images

Overview

Project Status

Features

Setup

Prerequisites

About

Releases

Packages

Contributors 3

Languages

ariyha/NER-Amazon-ML-Hackathon-2k24

Folders and files

Latest commit

History

Repository files navigation

Named Entity Recognition (NER) from Images

Overview

Project Status

Features

Setup

Prerequisites

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages