Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

develop --> main #181

Merged
merged 44 commits into from
Dec 19, 2024
Merged

develop --> main #181

merged 44 commits into from
Dec 19, 2024

Conversation

joeflack4
Copy link
Contributor

No description provided.

joeflack4 and others added 30 commits September 8, 2024 16:53
- Update: Now adding synonyms for alt & included symbols, w/ type of mondo#abbreviation.
- Misc updates: Todo comments. Renamed variables/methods for consistency and accuracy. Refactored some things.
- In previous commits, stated that this would also include 'included' symbols, but for now we have decided not to proceed with that.
- Update: Now adding synonyms for alternative symbols, w/ type of mondo#abbreviation.
- Update: Now stripping the text ', INCLUDED' from symbols. Previously was only doing for titles.
- Update: For included & alternative titles/symbols ending in ', FORMERLY', these are being added as relatedSynonyms, and also being marked as owl:deprecated.
- Misc updates: Updated some comments and codestyle.
- Bug fix: mondo#ABBREVIATION URI
- Update: Typo and codestyle of cleanup_label()
- Update: Docstring to make clearer
- Update: Typo fix
- Update: Now "former" included titles/symbols are also being processed
- Update: Refactor: Cleaned some redundancy in how this was being called. Simplified code. Modularized other parts of code.
- Update: Now considering all abbreviations when checking a title and uppercasing them. Previously was only looking at the first preferred symbol.
- Update: Refactored, added comments and todos, simplified code.
- Bug fix: Was not actually uppercasing previously. The .replace() usage was incorrect.
- Upgrade to v1.5.3. Has a bugfix for SSSOM and includes dotenv package.
- Bug fix: Need to add '--user --break-system-packages' due to new nature of ODK security.
- Analysis: Self-referential “phenotype in the gene position + Phenotype field without a MIM” + "morbidmap.txt entry not in Phenotype-Gene Relationships table" case
- Update: No longer just analysis. Now filtering these out.
- Update: Revert filtering out. Now log as a review.tsv case.
- Update: README: general documentation for d2g pipeline
- Updates to README.md docs: (i) changed verbiage 'related' to 'pattern', (ii) added an example from morbidmap.txt.
- Updates: Code comments: Added verbiage 'disease-defining'
Update morbidmap analysis notebook
- Update: Docs: Fixed a few places where the phrase Gene->Disease was used but should have been Disease->Gene, and vice versa.
- Update: Some additional rephrasing.
Disease-Gene: Self-referential cases
- Delete: some files somehow accidentally added
- Renamed function: Now more accurate and shorter
- Codestyle: Line length
- Code quality: Redundant imports
- Update comments
- Add: New cases: '3. D2G: somatic', '4. D2G: Phenotype is gene', and '5. D2G: Phenotype type error'.
- Renamed the classLabel for the 2 existing review cases to be shorter / simpler.
- Refactored code for DRYness, adding _add_to_review_tsv() and REVIEW_CASE_NAME_MAP. Also streamlined the values / error messages for these to be more consistent and easier to read.
- Bug fix: Fixed a sorting issue for the 'self-referential' review case.
- Codestyle: Added a space after comma
- Docs: Added clarifying text about how to easily identify individual cases when a case contains multiple rows.
- Docs: Added text about the relationship between review cases 2 and 3
- Refactor: Moved review.tsv code into a separate function in order to make the disease2gene section more readable.
- Docs: Updated docs for 'D2G: self-referential' representation within review.tsv itself to be more clear.
- Updated some comments to be clearer
- Update: Verbiage for case 3 for additional clarity
Abbreviation recasing: use all abbreviations
Added a way for us to manually make exclusions, such that entries in morbidmap.txt will not get populated as disease-genes associations.
- Add: data/exclusions-disease-gene.tsv: Manually curated file.
- Update: Logic to utilize the above TSV.
- Update: Added Sabrina's ORCID to exclusions file.
- Bug fix: Was still not filtering exlcusions correctly; was missing a logical condition.
- Bug fix?: Added RO:0003302 entries to disease-gene-relationships (.sparql / .tsv). I think these were previously left out by mistake.
- Update: A comment to be more clear
@joeflack4 joeflack4 self-assigned this Dec 15, 2024
@joeflack4 joeflack4 requested a review from twhetzel December 15, 2024 00:08
@joeflack4
Copy link
Contributor Author

@twhetzel I guess there is no super hurry to merge review / merge this, as I'm making a release tomorrow off of the branch I just worked on.

I don't mind reviewing this by myself but if you want to give it a look-over, feel free!

Copy link
Contributor Author

@joeflack4 joeflack4 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just got done taking a second look through all these recent changes. LGTM!

Copy link
Contributor

@twhetzel twhetzel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@joeflack4 joeflack4 merged commit e8717a1 into main Dec 19, 2024
1 check passed
@joeflack4 joeflack4 deleted the develop branch December 19, 2024 00:33
@joeflack4 joeflack4 restored the develop branch December 19, 2024 00:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants