librensetsu

A collection of common functions and classes for Rensetsu's service scrapers.

Usage

Install the package directly from the Git repository with pip:

pip install git+https://github.com/rensetsu/librensetsu.git

Regarding Dependencies

This library installs all of its dependencies for you, so it can be used without any additional setup. Only this package needs to be installed.

Why? Because the library acts like an SDK for developing scrapers/web crawlers for individual services, whose output is used in the Rensetsu unified database.

Installed dependencies

alive-progress: Progress bar for long-running tasks
beautifulsoup4: HTML parser
dacite: Utility to convert dict to dataclass recursively
cloudscraper: Cloudflare bypassing library
cutlet: Transliterates Japanese text to Latin script
fake-useragent: Random user agent generator
fugashi[unidic]: Japanese tokenizer, required by cutlet
fuzzywuzzy: Fuzzy string matching library
pluralizer: English pluralization library
python-dotenv: Loads .env file as environment variables
python-Levenshtein: Levenshtein distance calculation library, required by fuzzywuzzy
requests: HTTP client library
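
To confirm that the dependencies listed above were actually pulled in after installation, a small check along these lines may help. This is a minimal sketch using only the Python standard library; `installed_versions` and `DEPENDENCIES` are hypothetical names introduced here for illustration and are not part of librensetsu.

```python
from importlib.metadata import PackageNotFoundError, version

# Distribution names copied from the list above.
# The "[unidic]" extra is dropped because extras are not distributions.
DEPENDENCIES = [
    "alive-progress",
    "beautifulsoup4",
    "dacite",
    "cloudscraper",
    "cutlet",
    "fake-useragent",
    "fugashi",
    "fuzzywuzzy",
    "pluralizer",
    "python-dotenv",
    "python-Levenshtein",
    "requests",
]


def installed_versions(names):
    """Map each distribution name to its installed version, or None if missing.

    Hypothetical helper for illustration; not a librensetsu function.
    """
    found = {}
    for name in names:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


if __name__ == "__main__":
    # Print one line per dependency so missing packages are easy to spot.
    for name, ver in installed_versions(DEPENDENCIES).items():
        print(f"{name}: {ver or 'NOT INSTALLED'}")
```

Any entry reported as missing suggests the install did not complete; re-running the pip command from the Usage section should resolve it.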

License

This project is licensed under the MIT License - see the LICENSE file for details.
