Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add a note about where our S3 buckets live. Also, tweak docker docs. #15

Open
wants to merge 1 commit into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion cloud/docker/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -52,7 +52,7 @@ If there's an image that you want to run, like one of the ones from [BioCurious'
Because the organization is `biocurious`, the repo is `sra-tools`, and the tag is `2.9.0`, the image would be named `biocurious/sra-tools:2.9.0` and you would use it like so:

```
docker run <other docker args> biocurious/sra-tools:2.9.0 /usr/local/ncbi/sra-tools/bin/fastq-dump <fastq-dump args>
docker run <other docker args> biocurious/sra-tools:2.9.0 fastq-dump <fastq-dump args>
```

## Building images yourself
Expand Down
2 changes: 1 addition & 1 deletion docs/bioinformatics/pichia_pipeline.md
Original file line number Diff line number Diff line change
Expand Up @@ -58,7 +58,7 @@ default location, your home directory, to a partition were there is ample space.
the quality of raw NGS sequencing data.

```
docker run -i --rm -t -v $BIO_DATA:/data <fastqc image id>\
docker run -i --rm -t -v $BIO_DATA:/data biocurious/fastqc:0.11.7 \
/data/pichia/ERR1294016_1.fastq.gz \
/data/pichia/ERR1294016_2.fastq.gz \
-o /data/pichia/fastqc
Expand Down
13 changes: 13 additions & 0 deletions docs/data-management.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
# S3 buckets

The S3 buckets use the prefix `bioc-cf` for "BioCurious Cuttlefish". If you need access to them, ask at the meetups or in Slack and someone will help you out.

- `bioc-cf-reference`: Reference data used in the cuttlefish project (e.g. sequences for comparable organisms).
- `bioc-cf-supplemental`: Stuff that would be in the supplemental section of a paper. Our eventual cuttlefish sequences, etc.
- `bioc-temp`: Temporary data. Contents are deleted automatically after 60 days.
- More can be created if further organization is helpful.

There are different IAM roles you can be in, depending on your needs.

- `researcher`: If you just need somewhere to store files temporarily that's not your hard drive. This will let you write to `bioc-temp` and read other buckets.
- `curator`: Everything in `researcher`, plus write access to other buckets.