Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

test data processing issue #9

Open
zheshiyige opened this issue Jul 14, 2019 · 7 comments
Open

test data processing issue #9

zheshiyige opened this issue Jul 14, 2019 · 7 comments
Assignees
Labels
bug Something isn't working

Comments

@zheshiyige
Copy link

When I run the script on the test data, it looks like slow and sometimes seems no change. Could you tell me how to solve this issue? Thank you!

@AmitMY
Copy link
Owner

AmitMY commented Jul 14, 2019

First thing I would try is:

  1. In planner/naive_planner.py change is_parallel from True to False. That will make sure the issue is not the parallelization.
  2. Under cache directory, delete the train-planner directory (which will cause the system to instantiate it, without needing to re-train things)

If it is still stuck, I believe it is under tqdm, can you post a screenshot of the number it is stuck on (and the time?)

@AmitMY AmitMY self-assigned this Jul 14, 2019
@AmitMY AmitMY added the bug Something isn't working label Jul 14, 2019
@zheshiyige
Copy link
Author

WeChat Image_20190714094514

Here is the the number and time it is stuck on after I following the above instructions, it looks like still stuck.

@AmitMY
Copy link
Owner

AmitMY commented Jul 14, 2019

Wow, 8 hours seems excessive. Can you also say how much time has passed? this sometimes happens to me, and then after 20 minutes or so I see it passed a few more and that just 1 was very large to process.

@zheshiyige
Copy link
Author

Actually, it only takes about 10 minutes to process 892/1862, but then there are no changes in the following, there is no change in most time of 8 hours.

@zheshiyige
Copy link
Author

Do I need to skip this test case? Is it because this case has too many possible plans so that it takes a long time to generate the plan?

@AmitMY
Copy link
Owner

AmitMY commented Jul 18, 2019

Sorry for the late response.
You can skip this part of the pipeline, but then you will not get test-set evaluation results.

The test set is built of 2 parts, and the largest, hardest graphs to process are numbers 860-960.
I'm unsure how I can further help here, except to say - try again - and make sure your machine has enough memory. If I would know exactly what part of the code is so hard for it, I would definitely try to help here.

There is one immediate possible solution, which is to create a sub-pipeline for every test datum, such that you would run the code until it is stuck, then rerun, and it would continue where it left off, hopefully with much clearer memory (I'm suspecting a memory leak)

@jeffersonHsieh
Copy link

Hi, I also encountered this problem when the code runs on the test set. I tried this changing the " is_parallel" from True to False as you suggested but the problem still persists. I'm wondering if there is an update on this?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants