This project is aimed to scrape restaurants information from websites. Currently, fehd (Licensed Restaurants registered with Food and Environmental Hygiene Department in Hong Kong)is the only spider available in this project.
$ git clone [email protected]:sammyfung/hk_restaurants.git
$ cd hk_restaurants
$ python3 -m venv venv
$ source venv/bin/activate
$ pip install -r requirements.txt
Run the web scraper and export scraped data to CSV file.
$ cd hk_restaurants/hk_restaurants
$ scrapy crawl fehd -o fehd.csv