Estimated run-time for FRASER2 #476
Comments
Hi @chrisodhams,
Hi @ischeller, I've rerun now with 120GB memory on 5 cores (using a single node with a maximum of 700GB memory) and set a maximum run time of 2 weeks. If this fails, I will run with 20 cores at 120GB each, spanning multiple nodes. Thanks,
Hi @ischeller, Thanks for getting back. I've rerun FRASER2 on 314 samples using 700GB total memory split over 10 cores (70GB per core), with the same config as above. It is still running after 2 weeks; the last log line, from June 22, is: 'Thu Jun 22 02:28:21 2023: Run hyper optimization with 12 options.' It has been stuck on this for 13 days and no temporary outputs have been generated. I've also limited the sample set to 76 and rerun with the same compute (700GB total memory split over 10 cores) and the same config. It's still stuck on the hyper optimization step. Any ideas what is happening here? Thanks,
Hi Chris, I'm not sure; there's no reason why it would stall during hyperparameter optimization. I recently tried a cohort of ~200 samples and it ran to completion in the usual 3-4 hours on our server. Can you try the following:
What are the values of: Number of junctions and Number of splice sites?
Then maybe check the total counts per sample by executing
Hi @vyepez88,
So there are no zero counts in the columns (samples) for split and non-split, but there are 0 counts in 151 rows (junctions) for the non-split counts (theta). Would this be a problem? How best to remove these rows of 0 counts? Just attempting now with the ten-sample group; will let you know. Many thanks.
Hi Chris, so all looks good: the number of junctions, splice sites and reads. It is fine if there are rows with all 0 non-split reads in splice sites; that means that for that splice site, all reads are spliced.
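To illustrate the two checks discussed here (per-sample totals versus rows that are zero in every sample), a minimal Python sketch on a made-up count matrix follows. It does not use the FRASER or DROP API; the function name `summarize_counts` and the toy numbers are hypothetical and only show the logic.

```python
def summarize_counts(counts):
    """Summarize a count matrix given as a list of rows.

    Each row is one junction or splice site; each column is one sample.
    Returns (per-sample totals, indices of all-zero samples,
    indices of all-zero rows).
    """
    n_samples = len(counts[0]) if counts else 0
    # Column sums: total reads per sample.
    sample_totals = [sum(row[j] for row in counts) for j in range(n_samples)]
    # A sample with a total of zero would be a real problem.
    zero_samples = [j for j, total in enumerate(sample_totals) if total == 0]
    # A row that is zero in every sample is expected for non-split counts
    # when all reads at that splice site are spliced.
    zero_rows = [i for i, row in enumerate(counts) if sum(row) == 0]
    return sample_totals, zero_samples, zero_rows


# Toy non-split count matrix (hypothetical): 3 splice sites x 3 samples.
non_split = [
    [12, 7, 0],
    [0, 0, 0],
    [3, 0, 5],
]

sample_totals, zero_samples, zero_rows = summarize_counts(non_split)
print("total counts per sample:", sample_totals)
print("samples with zero total:", zero_samples)
print("rows with all-zero counts:", zero_rows)
```

In this toy case no sample has a zero total, while one row is zero across all samples, which mirrors the situation described above.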
Hi Chris, how did it go with the 10 samples?
Hi @vyepez88, Sorry, I was still waiting for confirmation that the jobs had completed. I ran a subset of 39 samples as a test and it still did not complete the hyper optimization step within ~48 hours (I can try with 10 samples but I think it will be the same story). Using the code within the DROP pipeline to set the value of q, with a sample set of 39, the values of q are:
I manually set q to 4 and continued using the FRASER R package directly, and all the subsequent steps ran successfully (fit, calculateZscore, calculatePvalues, etc.). These all completed in the expected time frame. It's very hard to debug what is going on here, as there are no temporary outputs or messages, but whatever sample size and memory/core allocation I try, it does not get past the hyper optimization step.
Good that at least it worked after setting the q.
Just setting that up now. I'm sure this ran successfully when we initially installed DROP but I'll run again just to confirm!
Hi @vyepez88,
Hi @chrisodhams, do you have any updates on this? I think you were able to run it successfully, right?
Hi,
I was wondering how long a typical FRASER2 run is expected to be?
If I peek at an ongoing job, it seems to hang or not update for at least a few days (5 days in this example), the tail is shown below:
I'm running 314 samples through FRASER2 using DROP v1.3.3 with the following config:
My compute set-up is as follows (10 cores, 60GB each, run-time of 1 week):
Do you have any run-time estimates for a similar number of samples? Should I trust that this is still running and wait longer?
Many thanks,
Chris