Skip to content

Repository for parsing namespaces for Glygen metadata such as species, cell lines, diseases etc. to be used as typeahead

License

Notifications You must be signed in to change notification settings

glygener/namespacehandler

Repository files navigation

namespacehandler

Repository for parsing namespaces for Glygen metadata such as species, cell lines, diseases etc. to be used as part of typeahead functionality and validation in GlyTableMaker or other applications.

Currently supported dictionaries:

After downloading the files, place them in a folder and use the folder name as the argument while running the following java applications to generate the formatted dictionaries. If you would like to run the following without providing an argument, create a folder named "original" in the top level folder of the repository and place the downloaded files there.

There are 3 Java applications to execute.

  • NCBITaxonomyParser.java - parses the "names.dmp" file from the folder (command line argument) and generates species.txt file in namespaces folder of the repository.
  • CelllineParser.java - parses the "cellosaurus.txt" from the given folder and generates cellline.txt file in namespaces folder of the repository.
  • OBOParser.java - parses the files with .json extension in the given folder and generates the corresponding .txt versions in namespaces folder of the repository.

The generated files are tab separated text files in the following format Synonym Name URI where Synonym is the synonym or equivalent name, Name is the name to be stored and URI is either the ontology URI or the web link to the entry. Last part of the URI contains the ontology identifier or the original identifer used (eg. taxonomy id for NCBI Taxonomy, accession number for Cellosaurus).

About

Repository for parsing namespaces for Glygen metadata such as species, cell lines, diseases etc. to be used as typeahead

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages