-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error when I run "python3 scripts/extract_repos.py --cwd ../mathlib4 --config configs/config_mathlib_v4.13.json --training_data " #6
Comments
Thank you for the interest in our project! I think the problem is that the argument for |
Thanks for your prompt reply. You are right! But I have other problems. When I run 'python3 scripts/extract_repos.py --cwd ../ntp-toolkit --config configs/config_mathlib_v4.13.json --training_data --full_proof_training_data --premises --state_comments --full_proof_training_data_states --training_data_with_premises --declarations --imports' All parameters are ok except 'training_data_with_premises' error: ././././scripts/training_data_with_premises.lean:263:91: application type mismatch
When I remove 'training_data_with_premises' and run 'python3 scripts/extract_repos.py --cwd ../ntp-toolkit --config configs/config_mathlib_v4.13.json --training_data --full_proof_training_data --premises --state_comments --full_proof_training_data_states --declarations --imports'
concurrent.futures.process._RemoteTraceback: The above exception was the direct cause of the following exception: Traceback (most recent call last):
|
Since there was another error before, could it be that the act of me correcting this error has led to other mistakes? error: ././././scripts/state_comments.lean:60:38: unknown constant 'Lean.HashSet.ofList'
I changed 'Lean.HashSet.ofList' to 'Std.HashSet.ofList'. Then the error was solved. |
Hi @yuchen814, To obtain miniCTX-style data, you can specify |
Thank you very much! After I run 'python3 scripts/extract_repos.py --cwd ../ntp-toolkit --config configs/config_mathlib_v4.13.json --training_data --full_proof_training_data --premises --state_comments --full_proof_training_data_states --declarations --imports'. Is it right(I ran it for about a few hours, but there were only three files.)? |
The output looks very weird and it isn't what I would expect. Can you first |
Oh right, I'm sorry I forgot about this logic in the code. I pushed a new commit to try to fix this. Yes, it is okay to skip if there is a mismatch. This happened because you tried to convert some theorem/definition from the Lean core, which this logic didn't cover because of my mistake. Can you try to |
Thanks for your help! |
@yuchen814 I think potentially the reason is that the internal memory on your computer ran out. Can you check the memory usage? If the I personally did not try running the convert script on all of mathlib, because miniCTX is supposed to be used on other smaller projects (see the miniCTX paper). If you are extracting data for training a model, then probably the output files in |
Thank you for your help! |
Happy to help :) Let me know if you have any other questions or feedback! |
Great Work!
when I run "python3 scripts/extract_repos.py --cwd ../mathlib4 --config configs/config_mathlib_v4.13.json --training_data ", I have error like this:
[?/?] Computing build jobs⣿ [?/?] Computing build jobs⣿ [?/?] Computing build jobs⣿ [?/?] Computing build jobs⣿ [?/?] Computing build jobs⣿ [?/?] Computing bu✖ [335/339] Building scripts.training_data
trace: .> LEAN_PATH=././.lake/packages/batteries/.lake/build/lib:././.lake/packages/Qq/.lake/build/lib:././.lake/packages/aesop/.lake/build/lib:././.lake/packages/proofwidgets/.lake/build/lib:././.lake/packages/Cli/.lake/build/lib:././.lake/packages/importGraph/.lake/build/lib:././.lake/packages/LeanSearchClient/.lake/build/lib:././.lake/packages/mathlib/.lake/build/lib:././.lake/build/lib DYLD_LIBRARY_PATH= /Users/yuchen/.elan/toolchains/leanprover--lean4---v4.13.0-rc3/bin/lean -Dlinter.unusedVariables=false -Dlinter.deprecated=false ././././scripts/training_data.lean -R ./././. -o ././.lake/build/lib/scripts/training_data.olean -i ././.lake/build/lib/scripts/training_data.ilean -c ././.lake/build/ir/scripts/training_data.c --json
info: stderr:
failed to write '././.lake/build/lib/scripts/training_data.olean': 2 No such file or directory
error: Lean exited with code 1
✖ [335/339] Building scripts.training_data
trace: .> LEAN_PATH=././.lake/packages/batteries/.lake/build/lib:././.lake/packages/Qq/.lake/build/lib:././.lake/packages/aesop/.lake/build/lib:././.lake/packages/proofwidgets/.lake/build/lib:././.lake/packages/Cli/.lake/build/lib:././.lake/packages/importGraph/.lake/build/lib:././.lake/packages/LeanSearchClient/.lake/build/lib:././.lake/packages/mathlib/.lake/build/lib:././.lake/build/lib DYLD_LIBRARY_PATH= /Users/yuchen/.elan/toolchains/leanprover--lean4---v4.13.0-rc3/bin/lean -Dlinter.unusedVariables=false -Dlinter.deprecated=false ././././scripts/training_data.lean -R ./././. -o ././.lake/build/lib/scripts/training_data.olean -i ././.lake/build/lib/scripts/training_data.ilean -c ././.lake/build/ir/scripts/training_data.c --json
info: stderr:
failed to write '././.lake/build/lib/scripts/training_data.olean': 2 No such file or directory
error: Lean exited with code 1
✖ [335/339] Building scripts.training_data
trace: .> LEAN_PATH=././.lake/packages/batteries/.lake/build/lib:././.lake/packages/Qq/.lake/build/lib:././.lake/packages/aesop/.lake/build/lib:././.lake/packages/proofwidgets/.lake/build/lib:././.lake/packages/Cli/.lake/build/lib:././.lake/packages/importGraph/.lake/build/lib:././.lake/packages/LeanSearchClient/.lake/build/lib:././.lake/packages/mathlib/.lake/build/lib:././.lake/build/lib DYLD_LIBRARY_PATH= /Users/yuchen/.elan/toolchains/leanprover--lean4---v4.13.0-rc3/bin/lean -Dlinter.unusedVariables=false -Dlinter.deprecated=false ././././scripts/training_data.lean -R ./././. -o ././.lake/build/lib/scripts/training_data.olean -i ././.lake/build/lib/scripts/training_data.ilean -c ././.lake/build/ir/scripts/training_data.c --json
info: stderr:
failed to write '././.lake/build/lib/scripts/training_data.olean': 2 No such file or directory
error: Lean exited with code 1
Some required builds logged failures:
error: build failed
0%| | 0/5397 [00:07<?, ?it/s]
Some required builds logged failures:
error: build failed
Some required builds logged failures:
error: build failed
How to solve this? Thanks for your help.
The text was updated successfully, but these errors were encountered: