Releases: PGScatalog/pgscatalog_utils
Releases · PGScatalog/pgscatalog_utils
v0.5.3
v0.5.2
v0.5.1
v0.5.0
v0.4.3
Changelog
- Fix retrying downloads when the EBI servers are sleepy on a Monday morning
- Give a more helpful error message when there's no valid matches in
match_combine
- Fix numeric sample identifiers breaking ancestry analysis
- Check chr prefix in samplesheets
v0.4.2
v0.4.1
v0.4.0
Changelog
New features
ancestry_analysis
: use genetic PCA loadings to compare samples to population reference panels, and report PGS according to where they sit in empirical distributions (percentile/Z) and/or PGS adjusted for continuous axes of genetic ancestry (regressing PGS~PCs).relabel_ids
: a program to make a variant information file IDs match another variant information file (e.g. a reference panel)check_samplesheet
: a program for validating pgsc_calc samplesheets
Improvements
download_scorefiles
refactored to prefer HTTPS and attempt FTP fallback (to minimise FTP busy errors)
v0.3.1
Changelog
Bug fixes
- Fix reading effect weight column of some PGS Catalog scoring files as a string instead of a float (e.g. PGS002619, PGS000908, PGS002704, PGS002678) PGScatalog/pgsc_calc#79
v0.3.0
Changelog
- Handle large traits in the PGS Catalog more efficiently (e.g. cancer, autoimmune diseases)
- Split - apply - combine approach: added
combine_matches
for use with (match_variants --only_match
) - More lazy operations
- Split - apply - combine approach: added
- Improve correctness of labelling: now all matches are labelled simultaneously
- Get scoring file information (e.g. publication, trait) from header data and store in a separate log file