-
Hi pgsc_calc team! Thanks for making this great tool. I have been struggling to get the pgsc_calc test profile up and running on our cloud computing setup. We work on Terra, which spins up Google Cloud VMs. Unfortunately, we do not have sudo permissions when interacting with this terminal (see Using the terminal and interactive analysis shell in Terra – Terra Support)) and so I have not been able to get Docker or Singularity to play nicely when running pgsc_calc. We are able to run a startup script which allows you to preconfigure an environment with packages that would otherwise require sudo permissions to install (Preconfigure a Cloud Environment with a startup script – Terra Support), but then once opening the terminal I am unable to run docker with “docker run hello-world”. We also have the ability to start up the VM with docker images from DockerHub or Google Artifact Registry (https://support.terra.bio/hc/en-us/articles/360037143432-Docker-tutorial-Custom-cloud-environments-for-Jupyter-Notebooks) Attempt 1: DockerWith this, I tried installing docker using the startup script, and then running it in the background. When trying to run docker run hello-world, the specific error message I get is below. Startup script
Error message
Docker info
I am on Ubuntu 20.04.6 (jammy). I have not been able to get the rootless Docker to work either unfortunately (Run the Docker daemon as a non-root user (Rootless mode) | Docker Docs). Attempt 2: Offline modeI tried following the tutorial at https://pgsc-calc.readthedocs.io/en/latest/how-to/offline.html . Unfortunately, I get similar error messages about the unshare operation not being permitted
Attempt 3: CondaWhen trying to run the pgsc_calc test profile using conda, I get an error about not being able to find a pgscatalog Module. When trying to install pgscatalog-core, it mentions that I need Python >=3.11. Unfortunately, the python version in Terra is 3.10.12
After installing a newer python version using user version of miniconda as mentioned in #293 , I get the same error
QuestionsMy questions for you are 1) have you been able to successful set up pgsc_calc on any cloud computing setup (Terra, All of Us, UK Biobank Research Analysis Platform) and 2) do you have any specific guidance for troubleshooting on my end? Thanks so much for your time! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 5 replies
-
You're right, I don't think singularity or docker is a good fit for this kind of situation where you're loading software into a VM without admin rights. On HPCs the admins install singularity for users, and when you have admin rights you can install docker with Conda is normally the best backup choice in this situation. I have two suggestions:
Then run:
This isn't the best way to work with the calculator (containers are best), but it should work OK to get alpha 5 working. I was able to run the test profile in an interactive session on a HPC using this approach. For cloud execution we normally use Nextflow's support for batch executors like Cloud Batch instead of a dedicated VM. I think some users have worked with the UK Biobank's RAP before but I have no experience with it. Good luck! Please let me know if you have any problems. |
Beta Was this translation helpful? Give feedback.
Yes, if you have downloaded every dependency on your system you don't need to use a -profile, nextflow will just call system utilities. Installing these dependencies manually is a pain, which is why I suggested making the big conda environment.
The test profile is useful to make sure each process in the workflow can run and you're able to generate results, before you try working with your target genomes.
Before installing and using miniconda3, perhaps try
unset PYTHONPATH
. I have a feeling it may be interfering with the conda environment.