Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

vpt HPC execution #18

Open
josemq opened this issue Aug 3, 2023 · 1 comment
Open

vpt HPC execution #18

josemq opened this issue Aug 3, 2023 · 1 comment

Comments

@josemq
Copy link

josemq commented Aug 3, 2023

Hello,

We've converted a Docker image to a Singularity image for execution in an HPC environment. After running for more than 3 hours and 120GB plus memory consumed, we are now being overloaded with the messages below.

2023-08-03 17:53:19,360 - ./task-20 - INFO - prepare_watershed_images
2023-08-03 17:54:43,495 - ./task-5 - INFO - watershed
2023-08-03 17:54:46,306 - distributed.worker_memory - WARNING - Unmanaged memory use is high. This may indicate a memory leak or the memory may not be released to the OS; see https://distributed.dask.org/en/latest/worker-memory.html#memory-not-released-back-to-the-os for more information. -- Unmanaged memory: 11.09 GiB -- Worker memory limit: 15.72 GiB
2023-08-03 17:54:51,999 - distributed.worker_memory - WARNING - Unmanaged memory use is high. This may indicate a memory leak or the memory may not be released to the OS; see https://distributed.dask.org/en/latest/worker-memory.html#memory-not-released-back-to-the-os for more information. -- Unmanaged memory: 11.00 GiB -- Worker memory limit: 15.72 GiB
2023-08-03 17:54:52,099 - distributed.worker_memory - WARNING - Unmanaged memory use is high. This may indicate a memory leak or the memory may not be released to the OS; see https://distributed.dask.org/en/latest/worker-memory.html#memory-not-released-back-to-the-os for more information. -- Unmanaged memory: 11.00 GiB -- Worker memory limit: 15.72 GiB
2023-08-03 17:54:52,199 - distributed.worker_memory - WARNING - Unmanaged memory use is high. This may indicate a memory leak or the memory may not be released to the OS; see https://distributed.dask.org/en/latest/worker-memory.html#memory-not-released-back-to-the-os for more information. -- Unmanaged memory: 11.00 GiB -- Worker memory limit: 15.72 GiB

I terminated the process but would like to know if this is expected and what are the typical hardware resource requirements for this type of execution. Please let me know if you need further details and how can we further troubleshoot. Below are the parameters we are executing with:

singularity exec /home/USER/VPT_Run/vpt/vpt_latest.sif vpt
--verbose
--processes 16
--log-file $OUTDIR'/run.log'
--log-level 1
run-segmentation
--segmentation-algorithm $OUTDIR'/watershed.json'
--input-images $OUTDIR'/'$RUN'/mosaic_(?P[A-Za-z]+)_z(?P[0-9]+).tif'
--input-micron-to-mosaic $OUTDIR'/'$RUN'/micron_to_mosaic_pixel_transform.mat'
--overwrite
--output-path $OUTDIR'/'$RUN

@lindsey-ching
Copy link

Hello,

I'm also having this same issue. Are there any updates on this problem?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants