This JavaScript application collects information about books from various websites. It automatically extracts details such as titles, authors, and descriptions from diverse online sources.
- Have access to the "my-scrapers" package.
- Copy "config_template.json", rename it to "config.json", and fill it in.
node book-scraper.js usage
- When acquiring book links, every catalog where an error occurred is logged to a file named "errorsWithGet.xlsx". These catalogs can be retried with the "get-errors" command. Wp works similarly (see usage).
- During scraping, books that have already been found are not searched for again unless they have missing information. Discovered books are saved in a temporary file, and scraping resumes automatically from that file when the program restarts after a crash.
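
The skip-and-resume behavior described above can be sketched roughly as follows. This is an illustration only, not the tool's actual code: the JSON progress file, the REQUIRED_FIELDS list, and the function names (isComplete, loadProgress, saveProgress, scrapeAll) are all assumptions made for the example.

```javascript
const fs = require("fs");

// Hypothetical field list: the real tool may check different fields.
const REQUIRED_FIELDS = ["title", "author", "description"];

// A book is complete when every required field holds a non-empty value.
function isComplete(book) {
  return Boolean(book) && REQUIRED_FIELDS.every((field) => Boolean(book[field]));
}

// Load books saved by a previous run; start empty if no file exists.
function loadProgress(file) {
  if (!fs.existsSync(file)) return {};
  return JSON.parse(fs.readFileSync(file, "utf8"));
}

// Write to a sibling file first, then rename it into place, so a crash
// in the middle of a write does not leave a truncated progress file.
function saveProgress(file, books) {
  const tmp = `${file}.tmp`;
  fs.writeFileSync(tmp, JSON.stringify(books));
  fs.renameSync(tmp, file);
}

// scrapeOne is a caller-supplied async function: (url) => book object.
async function scrapeAll(urls, scrapeOne, file) {
  const books = loadProgress(file);
  for (const url of urls) {
    if (isComplete(books[url])) continue; // already found, nothing missing
    books[url] = await scrapeOne(url);
    saveProgress(file, books); // persist after every book
  }
  return books;
}
```

Saving after each book keeps the work lost in a crash to at most one book, at the cost of rewriting the file on every iteration.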
npm run lint
npm run pre-release