About

Named entity recognition (NER) is a sub-task of information extraction (IE) that seeks out and categorizes specified entities in a body or bodies of texts. NER is also known simply as entity identification, entity chunking and entity extraction. NER is used in many fields in artificial intelligence (AI) including natural language processing (NLP) and machine learning.

In this task the objective is to recognize the store number within the store description. For this, a semi-automatic model is created using Python and SpaCy (free and open- source library for NLP) for the collection of entities.

For run this project with other databases ensure that it has the same structure as 'work-shops.csv' or send directly to Dataset class the files for train, validate and test.

Structure

In the data folder will be the necessary files to create the NER.
Inside db are the databases.
In the info folder all the analyses carried out will be saved.
Output corresponds to the created models.
In src are the Python classes.

Steps to Run

Download SpaCy library “pip install -U spacy”
Download the pipeline for English medium size “python -m spacy download en_core_web_md”
In init.py run extractor_model.save_info() to obtain the test .csv with token and entities.
In init.py run extractor_model.model() to create the train and dev.spacy files
With the terminal open in the project folder run “python -m spacy train ./data/ config.cfg --output ./output” for train the data and create the models.
In init.py run extractor_model.save_info(nlp=spacy.load(“../output/model- best”)) to obtain the test .csv with token and entities with the new model.
In init.py run extractor_model.evaluate_model() for evaluate the test and create metrics with the best model.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Structure

Steps to Run

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.idea		.idea
data		data
db		db
info		info
output		output
src		src
.DS_Store		.DS_Store
README.md		README.md

es-roly99/NER-example-with-SpaCy

Folders and files

Latest commit

History

Repository files navigation

About

Structure

Steps to Run

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages