namespacehandler

Repository for parsing namespaces for GlyGen metadata (species, cell lines, diseases, etc.) so they can be used for typeahead and validation in GlyTableMaker or other applications.

Currently supported dictionaries:

NCBI Taxonomy: download new_taxdump.tar.gz and extract names.dmp
Cellosaurus Ontology: download cellosaurus.txt
UBERON: download uberon-base.json
Human Disease Ontology: download doid-base.json
Human Phenotype Ontology: download hp.json

After downloading the files, place them in a single folder and pass that folder's name as the command-line argument when running the Java applications listed below, which generate the formatted dictionaries. To run them without an argument, create a folder named "original" at the top level of the repository and place the downloaded files there.
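The folder convention described above can be sketched as follows. This is only an illustration of the argument-or-default behavior; the class and method names (`InputFolderSketch`, `resolveInputFolder`) are hypothetical and are not taken from the repository's source.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

public class InputFolderSketch {
    // Default folder name as stated in this README; the rest of this class is hypothetical.
    static final String DEFAULT_FOLDER = "original";

    /** Returns the folder named by the first argument, or "original" when no argument is given. */
    static Path resolveInputFolder(String[] args) {
        String name = (args != null && args.length > 0) ? args[0] : DEFAULT_FOLDER;
        return Paths.get(name);
    }

    public static void main(String[] args) {
        Path folder = resolveInputFolder(args);
        if (!Files.isDirectory(folder)) {
            // Fail early with a readable message instead of a stack trace from a parser.
            System.err.println("Input folder not found: " + folder.toAbsolutePath());
            return;
        }
        System.out.println("Reading downloaded files from " + folder.toAbsolutePath());
    }
}
```

A relative folder name is resolved against the working directory, so the sketch assumes it is started from the top level of the repository.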

There are three Java applications to run:

NCBITaxonomyParser.java - parses the "names.dmp" file in the given folder (command line argument) and generates the species.txt file in the namespaces folder of the repository.
CelllineParser.java - parses the "cellosaurus.txt" file in the given folder and generates the cellline.txt file in the namespaces folder of the repository.
OBOParser.java - parses the files with the .json extension in the given folder and generates the corresponding .txt versions in the namespaces folder of the repository.
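As an illustration of the first parsing step, the sketch below splits one row of names.dmp into its fields. It assumes the usual NCBI taxdump layout, in which fields are separated by a tab, a pipe, and a tab, and each row ends with a tab and a pipe; the class `NamesDmpLineSketch` is hypothetical and does not reproduce the logic of NCBITaxonomyParser.java.

```java
import java.util.Arrays;
import java.util.List;

public class NamesDmpLineSketch {
    /**
     * Splits one names.dmp row into its fields.
     * Assumed field order: tax id, name, unique name, name class.
     */
    static List<String> splitRow(String line) {
        // Drop the row terminator "\t|" if present, then split on the field separator "\t|\t".
        String row = line.endsWith("\t|") ? line.substring(0, line.length() - 2) : line;
        // The -1 limit keeps empty fields such as a missing unique name.
        return Arrays.asList(row.split("\\t\\|\\t", -1));
    }

    public static void main(String[] args) {
        String sample = "9606\t|\tHomo sapiens\t|\t\t|\tscientific name\t|";
        List<String> fields = splitRow(sample);
        System.out.println(fields.get(0) + " -> " + fields.get(1) + " (" + fields.get(3) + ")");
    }
}
```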

The generated files are tab-separated text files with three columns: Synonym, Name and URI. Synonym is the synonym or equivalent name, Name is the name to be stored, and URI is either the ontology URI or the web link to the entry. The last part of the URI contains the ontology identifier or the original identifier used (e.g. taxonomy ID for NCBI Taxonomy, accession number for Cellosaurus).
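A consumer of these files might load the three columns and match user input against the Synonym column, roughly as sketched below. The class `DictionaryLookupSketch`, its methods, and the sample rows (including the example.org URIs) are invented for illustration and are not real output of the parsers. The `identifier` helper also assumes the identifier follows the last "/" of the URI, which may not hold for every dictionary.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

public class DictionaryLookupSketch {
    /** One row of a generated dictionary file: Synonym, Name, URI. */
    static final class Entry {
        final String synonym;
        final String name;
        final String uri;

        Entry(String synonym, String name, String uri) {
            this.synonym = synonym;
            this.name = name;
            this.uri = uri;
        }
    }

    /** Parses tab-separated rows, skipping rows with fewer than three columns. */
    static List<Entry> parse(List<String> lines) {
        List<Entry> entries = new ArrayList<>();
        for (String line : lines) {
            String[] cols = line.split("\t", -1);
            if (cols.length < 3) continue;
            entries.add(new Entry(cols[0], cols[1], cols[2]));
        }
        return entries;
    }

    /** Case-insensitive prefix match on the Synonym column, as a typeahead might do. */
    static List<Entry> suggest(List<Entry> entries, String prefix) {
        String p = prefix.toLowerCase(Locale.ROOT);
        List<Entry> hits = new ArrayList<>();
        for (Entry e : entries) {
            if (e.synonym.toLowerCase(Locale.ROOT).startsWith(p)) hits.add(e);
        }
        return hits;
    }

    /** Assumption: the identifier is whatever follows the last '/' in the URI. */
    static String identifier(String uri) {
        return uri.substring(uri.lastIndexOf('/') + 1);
    }

    public static void main(String[] args) throws IOException {
        // With an argument, read a generated file; otherwise use invented sample rows.
        List<String> lines = args.length > 0
                ? Files.readAllLines(Paths.get(args[0]))
                : Arrays.asList(
                        "human\tHomo sapiens\thttp://example.org/taxonomy/9606",
                        "Homo sapiens\tHomo sapiens\thttp://example.org/taxonomy/9606");
        for (Entry e : suggest(parse(lines), "hum")) {
            System.out.println(e.synonym + " => " + e.name + " [" + identifier(e.uri) + "]");
        }
    }
}
```

A linear scan keeps the sketch short; for a dictionary the size of the NCBI Taxonomy, a sorted structure or a search index would likely be a better fit for typeahead.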
