The problem of multitasking parallel processing #68
Comments
Could you help me? The header for #include <torch/extension.h> could not be found.
@Pter61 Which config did you run, and what was the full command? Please also provide your PyTorch and CUDA versions.
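One way to gather the version details requested above is a short script like the one below. This is only a sketch: `describe_environment` is a hypothetical helper name and not part of VL-BERT, and it assumes PyTorch, if installed, is importable as `torch`.

```python
import importlib
import platform


def describe_environment():
    """Collect the Python, PyTorch, and CUDA versions for a bug report."""
    info = {"python": platform.python_version()}
    try:
        torch = importlib.import_module("torch")
    except ImportError:
        # PyTorch is not installed in this environment.
        info["torch"] = None
        return info
    info["torch"] = torch.__version__
    # CUDA version PyTorch was built against (None for CPU-only builds).
    info["cuda"] = torch.version.cuda
    info["cuda_available"] = torch.cuda.is_available()
    return info


if __name__ == "__main__":
    for key, value in describe_environment().items():
        print(f"{key}: {value}")
```

Pasting the printed lines into the issue, together with the config file name and the full command, should cover what was asked for.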
I just followed the "Environment" section of the README:
conda create -n...
conda install...
git clone (apex), but it throws an exception
pip install Cython
pip install -r requirements.txt
./scripts/init.sh (this fails with errors: it could not find #include <torch/extension.h>)
Thanks for your attention.
In reply to Weijie Su (Wed, Dec 30, 2020, 11:53 AM): "@Pter61 What config do you run? And what is the full command you ran?"
I have solved the problem. Thanks.
While I was training on 2 GPUs (V100 16G), this error occurred in the middle of the second epoch:
...
File ".../python3.6/site-packages/torch/autograd/__init__.py", line 93, in backward
allow_unreachable=True) # allow_unreachable flag
RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn
However, training runs without this error on a single GPU.
Could you please help me resolve this problem?
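For anyone trying to narrow down which batch triggers this error, a guard like the following could be called just before `loss.backward()`. It is a minimal sketch, not a fix and not part of VL-BERT: `assert_backward_ready` is a hypothetical helper, and it only assumes the loss object exposes the usual `requires_grad` and `grad_fn` attributes of a PyTorch tensor.

```python
def assert_backward_ready(loss, step=None):
    """Raise a descriptive error if `loss` is detached from the autograd graph.

    A tensor that does not require grad and has no grad_fn cannot be
    backpropagated; checking this explicitly lets the step number be reported.
    """
    requires_grad = getattr(loss, "requires_grad", False)
    grad_fn = getattr(loss, "grad_fn", None)
    if not requires_grad and grad_fn is None:
        where = "" if step is None else f" at step {step}"
        raise RuntimeError(
            f"loss{where} is detached from the autograd graph "
            f"(requires_grad={requires_grad}, grad_fn={grad_fn})"
        )
    return loss


# Hypothetical usage inside a training loop:
#     assert_backward_ready(loss, step=global_step).backward()
```

Logging the step number this way may help show whether the failure is tied to a particular batch on one of the two GPUs.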