-
Notifications
You must be signed in to change notification settings - Fork 108
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update build procedure of imports #1052
Conversation
Obsolete terms still excluded.
Groups: .obo, .owl, or .owl.gz NCBItaxon shifted to .owl.gz because of large size (like CHEBI).
versionIRI for each import is saved to file at build/<import name>.version
No imports to merge & no need to specify annotations to retain, all are retained.
Currently affects only SYMP & TRANS, as other ontologies are built as modules.
Refresh means download the latest version and rebuild the corresponding _import.owl file. Execute with `make refresh_<import name>`. This differs from the shorter `make <import name>` which reuses previously downloaded import sources if they exist.
To clarify what it does.
The .version files will now automatically be made in the same dir as the source files (currently all in src/ontology/imports/build) and make will always search for these files in the imports/build dir.
Announces whether update is happening from a source file with the same version or a different version and shows the version(s).
Also expliclity declare .version files as .SECONDARY. Presence of global declaration causes .version intermediate files not to be regenerated when their inputs are and makes predicting make behavior challenging.
To keep main _import.owl file rules together (organizational only).
We should test the following to make sure these updates work as intended before merging this into
|
|
From list in issue #946
|
Reordering these commands does not change output in any way.
Discarding non-FOODON terms from PO & BFO along with the additional branches those terms created in the foodon import. Note: Explicit removal of BFO is subsumed and superseded by this change.
Discarding non-CL terms from CARO and BFO along with the additional branches they cause to form in the CL import.
Definitions are no longer included, fixing error from multiple definition for IAO:0000115 in import.
Fixes multiple labels issue on CL:0000000 & CL:0000540.
The only import previously using this query is FOODON. This addition does not change the FOODON import in any way.
Ensures HP import does not retain duplicate labels from source, which would cause errors in obo files.
Modifications: - Terms in 'onset' branch now retain definitions - Duplicate labels from source will be removed
Fixed errors in FOODON and CL by updating. Had to modify build commands to remove extra branches in FOODON and to avoid extra terms from outside both ontologies namespaces (BFO + CARO for CL and BFO + PO for FOODON). We should consider switching sources to ontology |
No longer contains definitions.
No longer contains definitions.
No longer contains definitions.
Retaining all class & axiom annotations.
agreed, let's look into the switch for the August build.
…On Wed, Jul 27, 2022 at 10:13 AM J. Allen Baron ***@***.***> wrote:
Fixed errors in FOODON and CL by updating. Had to modify build commands to
remove extra branches in FOODON and to avoid extra terms from outside both
ontologies namespaces (BFO + CARO for CL and BFO + PO for FOODON).
We should consider switching sources to ontology -base.owl files where
they are produced. This could simplify some of our import build commands.
—
Reply to this email directly, view it on GitHub
<#1052 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABBB4DM3T7K6IS374MHRXNTVWE7XVANCNFSM53H645VQ>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
--
Lynn M. Schriml, Ph.D.
Associate Professor
Institute for Genome Sciences
University of Maryland School of Medicine
Department of Epidemiology and Public Health
670 W. Baltimore St., HSFIII, Room 3061
Baltimore, MD 21201
P: 410-706-6776 | F: 410-706-6756
***@***.***
|
No longer contains definitions.
I went ahead and refreshed all the imports, excluding ECO, DISDRIV, and OMIM_SUSC which don't have automated build rules yet, to ensure they are all up-to-date and formatted as desired (with or without definitions). For each updated import, I compared the previous version with the updated version 1) with Protege to verify the trees look as expected and their are no unexpected branches and 2) with All of these updates appear correct to me. |
Finally, I did a test merge with the I then reviewed ext.owl and all of the imports (again) and they appeared to have correctly updated versionIRIs and otherwise looked as expected. Finally, I reviewed all the reports. There are a few warnings that may be worth addressing (I did fix 1 whitespace warning) but nothing breaking. This PR is now ready to be merged. |
This PR accomplishes the following:
<import name>_import.owl
files as anrdfs:comment
at the time a<import name>_import.owl
file is built.<import_name>_import.owl
files are accessible online by both their ontology IRI and version IRIs.make refresh_<import_name>
commands to allow re-downloading a specific import source when re-building that import (as opposed to redownloading all imports OR rebuilding the import from a previously downloaded source file, when it exists).Note that
make clean_imports
has also been renamed tomake refresh_imports
to better reflect its use in re-downloading AND rebuilding imports.