This project is to retrieve data from Zillow and analyze trends in the housing market.
- Get data for all remaining zip codes [Marc]
- Get zip code data (maybe from the Census?) and append to Firestore db [Matt & Sehee]
- Make visualizations from data [Vishal & Selena]
- Heatmaps
- (Maybe) transfer firebase data across accounts [Matt]
- Get zip code from Firestore that has not yet been scraped (see 'firestore.py')
- Get random user-agent for heading (headers.py)
- Get first page of search results
- If http request was not blocked by captcha:
a. Get range of page results (for the for-loop below)
(Number of pages is the number of results/40, with a max of 25. parse_results.py, lines 32-35)
b. For each page in search results (starting with page 2):
i. If http request was not blocked: (ip_rotations.py, line 86)
1. Parse and clean response (clean_results.py)
2. For each listing in search results:
a. Add to Firestore 'listings' collection (see 'firestore.py') - Repeat from step 3 with rentals = True (append '/rentals' to URL)
- Update zip code in Firestore with 'scraped' = True
- Repeat from step 1