-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
b2c4429
commit 28f1c42
Showing
10 changed files
with
116 additions
and
98 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,34 @@ | ||
# batch | ||
|
||
This command creates prompts from all phenopackets in the input directory. | ||
|
||
## Getting the input data | ||
|
||
Go to the [Releases](https://github.com/monarch-initiative/phenopacket-store/releases) section of | ||
[phenopacket-store](https://github.com/monarch-initiative/phenopacket-store){:target="_blank"}, and download the | ||
latest release (currently 0.1.5 on April 29, 2024, but evolving rapidly). Currently, this resource contains over 4300 phenopackets. | ||
|
||
|
||
Download one of the archives (e.g., ``all_phenopackets.zip``) and unpack in a location of your choice. | ||
|
||
|
||
Then run the following command. | ||
|
||
```shell title="batch" | ||
java -jar phenopacket2prompt.jar batch -d <all_phenopackets> | ||
``` | ||
where ``<all_phenopackets>`` is the complete relative or absolute path to the unpacked directory. | ||
|
||
phenopacket2prompt will create a new subdirectory called ``prompts``in the current directory. It will contain | ||
one folder for each language (currently, English-en and Spanish-es), as well as a file called ``correct_results.tsv`` | ||
with the following structure | ||
|
||
|
||
| Disease name | OMIM identifier | Prompt file name | | ||
|--------------------------------------------|:---------------:|-------------------------------------------------:| | ||
| Birt-Hogg-Dube syndrome 2 | OMIM:620459 | PMID_36440963_IIIPMID_36440963_III-33-prompt.txt | | ||
| Immunodeficiency 115 with autoinflammation | OMIM:620632 | PMID_26008899_patient-prompt.txt | | ||
| Jacobsen syndrome | OMIM:147791 | PMID_15266616_148-prompt.txt | | ||
|
||
|
||
Note that the prompt file name is the same for every language. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
# English | ||
|
||
Todo -- let's write a summary of the translations in each language. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
6 changes: 6 additions & 0 deletions
6
src/main/java/org/monarchinitiative/phenopacket2prompt/output/CorrectResult.java
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,6 @@ | ||
package org.monarchinitiative.phenopacket2prompt.output; | ||
|
||
import org.monarchinitiative.phenol.ontology.data.TermId; | ||
|
||
public record CorrectResult(String promptFileName, TermId diseaseId, String diseaseLabel) { | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters