This project extracts the picture area at the specified position in the video, recognizes the characters in the image and prints to the specified position in the pdf file.
I provided a simulator developed by Csharp on windows form environments that can help to test this application, in additional a Real-time result present client named as Monitor_receiver
included.
Before running this program, the local OS need to install several plugins to support this software.
-
pytesseract is OCR LSTM engine to be installed for recognition.
-
Imagemagic to convert pdf file to image and additional image processing function.
-
ghostscript only for solving the bugs during the image converter.
-
Check the requirement file to install the package needed.
Run Main GUI:
python OCR_main.py
-
The default video stream is the local camera.
-
For remote video stream, this app currently tested only on the RSTP protocol of M1054 IP camera. First, click the remote camera of the menu bar to active entry of input of IP address and port. Second, enter the RTSP IP address and port of your IP camera, then click connect button.
-
Press left button of the mouse and draw an area that needs to be recognized. Then right button of the mouse to confirm your selection, pop-up windows will be show to collect the name of this area.
-
In addition to managing the boxes of area. Simply click the menu button, select one item to operate (button or keyboard shortcut: "F2" for Rename and "Delete" for delete )
-
Select the PDF file to receive the recognition result.
-
Drop down the box name and select one item, then click the button to set up the PDF position be to print the recognition result of this item.
-
Set all items to relevant PDF position.
-
Run a single operation show this result on your default PDF viewer.
-
Set a timer to automatically generate the result and present on the remote application in Monitor_receiver. Be sure to run
Monitor_receiver
before timer of Main GUI running.