Skip to content

shz117/WebSearchEngineProjects

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This is a repo for course projects @ Professor Torsten Suel's course Web Search Engines.

  1. Jcrawler : a primitive multi-threaded focused web crawler to collect web pages from www, with concentration on given key words. Language : python

  2. indexer : a c++ program to parse web pages, do reverse index, and generate final index for later query processing. involving massive data processing, file compression(var-byte).

  3. query processor, ask former built inverted index to answer user's search queries.

  4. Foursquare crawler and recommendation system : including a crawler to collect user, venue, rating, check in information from Foursquare, Twitter and Facebook, then apply machine learning algorithms (collaborative-filtering, SVD, etc) to recommend friends and venues to users.

About

Course projects in Web Search Engines Course @ NYU-POLY

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published