Commit

add
weekendfish committed Jan 22, 2025
1 parent 7dbe8b7 commit e194b4f
Showing 4 changed files with 6 additions and 1 deletion.
3 changes: 3 additions & 0 deletions .gitmodules
@@ -1,3 +1,6 @@
[submodule "webpage_archive"]
path = webpage_archive
url = https://github.com/transTerminus/webpage_archive
[submodule "content_archive"]
path = content_archive
url = https://github.com/project-polymorph/content_archive
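
The resulting .gitmodules can be read programmatically to confirm which submodules are now registered. The sketch below is illustrative only: it uses Python's configparser, which handles this simple file but is not a full parser for git's config format, and the helper name list_submodules is hypothetical rather than part of the repository.

```python
import configparser

# Contents of .gitmodules after this commit, copied from the hunk above.
GITMODULES = """\
[submodule "webpage_archive"]
path = webpage_archive
url = https://github.com/transTerminus/webpage_archive
[submodule "content_archive"]
path = content_archive
url = https://github.com/project-polymorph/content_archive
"""


def list_submodules(text):
    """Map each submodule name to its (path, url) pair (hypothetical helper)."""
    # Only "=" is a delimiter, so the ":" inside URLs is left alone;
    # interpolation is disabled so values are returned verbatim.
    parser = configparser.ConfigParser(delimiters=("=",), interpolation=None)
    parser.read_string(text)
    result = {}
    for section in parser.sections():
        # Section names look like: submodule "content_archive"
        name = section.split('"')[1]
        result[name] = (parser[section]["path"], parser[section]["url"])
    return result


submodules = list_submodules(GITMODULES)
for name, (path, url) in submodules.items():
    print(name, path, url)
```

In a real checkout, `git submodule status` reports the same information along with the pinned commit of each submodule.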
1 change: 1 addition & 0 deletions content_archive
Submodule content_archive added at c06c57
1 change: 1 addition & 0 deletions download/Makefile
@@ -36,6 +36,7 @@ random:
random_new:
python .github/downloader/download/download.py --output-dir .github/downloader/webpage_archive/new_all_results/20250121 --download-type webpage --pattern ".*" --order random
python .github/downloader/download/download.py --output-dir .github/downloader/webpage_archive/new_all_results/20250122 --download-type webpage --pattern ".*sohu.*|.*sina.*|.*ifeng.*|.*163.*|.*chinanews.*|.*thepaper.*|.*qq.*" --order random
python .github/downloader/download/download.py --output-dir .github/downloader/webpage_archive/new_all_results/20250123_res/downloads --download-type webpage --pattern ".*" --order random


jina:
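
The --pattern arguments in random_new are regular expressions. The diff does not show how download.py applies them, so the following is only a sketch under the assumption that the pattern is tested against each URL with Python's re module. The helper select_urls and the sample URLs are hypothetical. Because every alternative starts and ends with ".*", re.search, re.match, and re.fullmatch give the same result for single-line URLs.

```python
import re

# Pattern copied from the 20250122 line of the random_new target.
NEWS_PATTERN = r".*sohu.*|.*sina.*|.*ifeng.*|.*163.*|.*chinanews.*|.*thepaper.*|.*qq.*"


def select_urls(urls, pattern):
    """Return the URLs the pattern matches (hypothetical helper, assumes re.search)."""
    compiled = re.compile(pattern)
    return [u for u in urls if compiled.search(u)]


# Made-up sample URLs, for illustration only.
sample = [
    "https://news.sina.com.cn/a/1.html",
    "https://www.thepaper.cn/newsDetail_1",
    "https://example.org/post/163",
    "https://example.org/about",
]

selected = select_urls(sample, NEWS_PATTERN)
print(selected)
```

Under this assumption the pattern matches the substring anywhere in the URL, not only in the host name, so the third sample URL is selected because its path contains "163". The ".*" pattern used on the other two lines selects every URL.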
2 changes: 1 addition & 1 deletion webpage_archive
