Collection of common functions and classes for Rensetsu's service scraper
Install the package directly from its Git repository using pip:
pip install git+https://github.com/rensetsu/librensetsu.git
Installing this library also downloads all of the required dependencies for you, so no additional setup is needed and this is the only package you have to install.
Why? Because the library acts like an SDK for developing scrapers/web crawlers for the individual services used in the Rensetsu unified database.
alive-progress: Progress bar for long-running tasks
beautifulsoup4: HTML parser
dacite: Utility to convert dict to dataclass recursively
cloudscraper: Cloudflare bypassing library
cutlet: Handles Japanese text transliteration to Latin
fake-useragent: Random user agent generator
fugashi[unidic]: Japanese tokenizer, required by cutlet
fuzzywuzzy: Fuzzy string matching library
pluralizer: English pluralization library
python-dotenv: Loads .env file as environment variables
python-Levenshtein: Levenshtein distance calculation library, required by fuzzywuzzy
requests: HTTP client library
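To confirm that the dependencies listed above were actually pulled in after installation, a small check along these lines may help. This is only a sketch using the standard library's importlib.metadata; the names DEPENDENCIES and installed_versions are illustrative and are not part of librensetsu, and the distribution names are assumed to match the list above.

```python
from importlib import metadata

# Distribution names copied from the dependency list above (assumed to
# match the names published on PyPI; the [unidic] extra is omitted).
DEPENDENCIES = [
    "alive-progress",
    "beautifulsoup4",
    "dacite",
    "cloudscraper",
    "cutlet",
    "fake-useragent",
    "fugashi",
    "fuzzywuzzy",
    "pluralizer",
    "python-dotenv",
    "python-Levenshtein",
    "requests",
]


def installed_versions(names):
    """Map each distribution name to its installed version, or None if missing."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # The distribution is not installed in this environment.
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions(DEPENDENCIES).items():
        print(f"{name}: {version or 'not installed'}")
```

Any entry reported as not installed suggests the pip install step did not complete in the active environment.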
This project is licensed under the MIT License - see the LICENSE file for details.