Skip to content

Latest commit

 

History

History
84 lines (67 loc) · 1.65 KB

README.md

File metadata and controls

84 lines (67 loc) · 1.65 KB

CIPRS Reader

Build Status

Setup and Run:

Add pdf file to parse in /ignore folder then run:

docker build -t ciprs-reader .
docker run --rm -v "${PWD}:/usr/src/app" ciprs-reader python ciprs-reader.py ignore/cypress-example.pdf

Example output:

[
    {
        "General": {
            "County": "DURHAM",
            "File No": "00GR000000"
        },
        "Case Information": {
            "Case Status": "DISPOSED",
            "Offense Date": "2018-01-01T20:00:00"
        },
        "Defendant": {
            "Date of Birth/Estimated Age": "1990-01-01",
            "Name": "DOE,JON,BOJACK",
            "Race": "WHITE",
            "Sex": "MALE"
        },
        "District Court Offense Information": [
            {
                "Records": [
                    {
                        "Action": "CHARGED",
                        "Description": "SPEEDING(70 mph in a 50 mph zone)",
                        "Severity": "TRAFFIC",
                        "Law": "20-141(J1)"
                    }
                ],
                "Disposed On": "2010-01-01",
                "Disposition Method": "DISMISSAL WITHOUT LEAVE BY DA"
            }
        ],
        "Superior Court Offense Information": [],
    }
]

Local Setup

Pre-requisites:

brew cask install pdftotext

Setup:

pip install -r requirements.txt
pip install -e .

Read CIPRS PDF:

python ciprs_-eader.py ./cypress-example.pdf

Run Jupyter:

jupyter-lab

Run tests:

pytest --pylint

Code for Durham