Skip to content

A web crawler that scrapes smartphone listings data from OLX.bg (a popular buy and sell marketplace website in Bulgaria) and saves a list of items posted with an unusually low price.

Notifications You must be signed in to change notification settings

victor-yanev/OLX-Web-Crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 

Repository files navigation

OLX Web Crawler

A web crawler that scrapes smartphone listings data from OLX.bg (a popular buy and sell marketplace website in Bulgaria), keeps track of the mean price for certain models and saves a csv file of those which are posted with an unusually low price.

Instructions

Go to your Terminal, in your project directory and type:

scrapy crawl electronics

The third parameter is the name of the spider that you want to run. It is set in the name property of the created Spider class.
You can disable the debugger if you don’t want to see debugging information:

scrapy crawl --nolog electronics

Output

It returns a .csv file that contains objects for item listings in the following format:

{
  "title": "SAMSUNG J4 Perfektno Sastoyanie", 
  "price": "149 lv.", 
  "location": "Kamenitsa 1, gr. Plovdiv, Oblast Plovdiv", 
  "condition": "izpolzvano", 
  "url": "https://www.olx.bg/ad/samsung-j4-perfektno-sastoyanie-CID632-ID8q8fz.html"
}

The extracted data can then be used to perform statistics and extract useful information

About

A web crawler that scrapes smartphone listings data from OLX.bg (a popular buy and sell marketplace website in Bulgaria) and saves a list of items posted with an unusually low price.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages