Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Properly handle CSV-like TSVs #23

Closed
Tracked by #1566
joverlee521 opened this issue Dec 6, 2024 · 0 comments · Fixed by #25
Closed
Tracked by #1566

Properly handle CSV-like TSVs #23

joverlee521 opened this issue Dec 6, 2024 · 0 comments · Fixed by #25
Assignees

Comments

@joverlee521
Copy link
Contributor

Parent issue: nextstrain/augur#1566

Audit handling of TSVs in the repo to ensure that we are properly manipulating CSV-like TSVs.
See Nextstrain Data formats docs for guidelines.

@genehack genehack self-assigned this Dec 18, 2024
genehack added a commit that referenced this issue Dec 18, 2024
This facilitates making sure you're starting with a clean slate when
updating builds.
genehack added a commit that referenced this issue Dec 18, 2024
* Wrap tsv-utils usage in `csv2tk --csv-delim $'\t'` /
  `csvtk fix-quotes --tabs`
* Remove `csvtk fix-quotes` at start of pipeline in
  "format_ncbi_dataset_report" rule
* Remove '-l' flag in "format_ncbi_dataset_report" rule
@genehack genehack mentioned this issue Dec 18, 2024
1 task
genehack added a commit that referenced this issue Dec 19, 2024
This facilitates making sure you're starting with a clean slate when
updating builds.
genehack added a commit that referenced this issue Dec 19, 2024
* Wrap tsv-utils usage in `csv2tk --csv-delim $'\t'` /
  `csvtk fix-quotes --tabs`
* Remove `csvtk fix-quotes` at start of pipeline in
  "format_ncbi_dataset_report" rule
* Remove '-l' flag in "format_ncbi_dataset_report" rule
genehack added a commit that referenced this issue Dec 20, 2024
* Update "subset_curated_metadata_columns" rule:
  * Swap `tsv-select` for `csvtk cut`

* Update "format_ncbi_dataset_report" rule:
  * Remove '-l' flag from `csvtk add-header`
  * Remove `csvtk del-quotes | tsv-select` at end of pipeline; instead
    extend `csvtk mutate` to handle field re-arrangement
genehack added a commit that referenced this issue Dec 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants