

wosiu/Crawler


Crawler

A universal crawler for the web or the local file system. The module lets you extend any location-related search in a really easy way: override the appropriate default functions in a subclass. For example, you can change the default web filters or add some statistics. The "thread-version" branch contains a multi-threaded version. The "icm" branch contains an extended module for collecting statistics from the logs of Apache Hadoop jobs' tasks. In master you can find Demo1 and Demo2, which are extensions of the crawler. Let me know if you use my code, have suggestions, or just find it helpful :)
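
The subclass-and-override idea described above can be sketched roughly as follows. This is only an illustration and not the code from this repository: the class names (`BaseCrawler`, `FileSystemCrawler`, `ExtensionStatsCrawler`) and the hook names (`neighbours`, `accept`, `visit`) are all hypothetical, and the real module may name and split its default functions differently. The sketch assumes a breadth-first walk over the local file system, with one subclass that changes the default filter and adds a simple statistic.

```python
import os
from collections import Counter, deque


class BaseCrawler:
    """Hypothetical base class: a breadth-first walk with overridable hooks."""

    def neighbours(self, location):
        # Default: a location links to nothing. Subclasses decide what "next" means.
        return []

    def accept(self, location):
        # Default filter: every discovered location is visited.
        return True

    def visit(self, location):
        # Default action: do nothing. Subclasses can collect data here.
        pass

    def crawl(self, start):
        # Breadth-first traversal; `seen` prevents visiting a location twice.
        seen = {start}
        queue = deque([start])
        while queue:
            current = queue.popleft()
            self.visit(current)
            for nxt in self.neighbours(current):
                if nxt not in seen and self.accept(nxt):
                    seen.add(nxt)
                    queue.append(nxt)
        return seen


class FileSystemCrawler(BaseCrawler):
    """Hypothetical extension: locations are paths, neighbours are directory entries."""

    def neighbours(self, location):
        # Symbolic links are not followed, to avoid looping through link cycles.
        if os.path.islink(location) or not os.path.isdir(location):
            return []
        return sorted(os.path.join(location, name) for name in os.listdir(location))


class ExtensionStatsCrawler(FileSystemCrawler):
    """Hypothetical extension: changes the filter and adds a statistic."""

    def __init__(self):
        self.extensions = Counter()

    def accept(self, location):
        # Overridden filter: skip hidden files and directories.
        return not os.path.basename(location).startswith(".")

    def visit(self, location):
        # Added statistic: count regular files per extension.
        if os.path.isfile(location):
            self.extensions[os.path.splitext(location)[1]] += 1
```

Under these assumptions, usage would be `crawler = ExtensionStatsCrawler()`, then `crawler.crawl(".")`, after which `crawler.extensions` holds the per-extension counts. Keeping the traversal in the base class means a subclass only touches the hook it wants to change.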
