Skip to content

slayerpart/distributed-crawler-boilerplate

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Scraper Boilerplate

A crawler boilerplate implemented in scrapy framework that is using scrapy-fake-useragent in order to pretend a randomized user agent per request and scrapy-proxies to distribute requests over a pool of specified proxies declared in proxy_list.txt

Furthermore, a local MongoDB is used to store extracted items.

About

A crawler boilerplate implemented in scrapy framework

Topics

Resources

Stars

Watchers

Forks

Languages