Skip to content

"Scrappers" for aruodas.lt and skelbiu.lt, plus scraped data

Notifications You must be signed in to change notification settings

ihah/aruodas-skelbiu

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Info

  • Scripts are made only for learning purpose.
  • Scripts originally made only for collecting sample data.
  • Scripts requires manually change the total of list pages exists in the website.

Aruodas.lt filters

  • Only apartments
  • Only full established apartments

Skelbiu.lt filters

  • Only apartments
  • Do not show Aruodas.lt ads

How it works

  1. Script starts in list page (1);
  2. It will collect all links to the ads. (in aruodas.py it will collect url from the title plus city)
  3. Next links are passed to collect data about the AD, get_data(item).
  4. get_data() - collect the data and returns it, if no error occurred the data will be recorded into csv file, if error occurred script will skip the ad.
  5. After script runs out of the links it will change the page to next one and repeat the process.

CSV rows

  • 'city' - city where apartment is
  • 'years' - year of construction
  • 'area' - area of the apartment m^2
  • 'price' - price of the apartment in euros
  • 'avg_heat_per_m' - average cost of heating per month in the apartment in euros (only in aruodas.lt csv, has nulls)
  • 'url' - url to the AD

About

"Scrappers" for aruodas.lt and skelbiu.lt, plus scraped data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages