This project scrapes content from a Shopee merchant page and individual product pages. The scraped data is then posted to our server via an API.
- Automates login to Shopee
- Handles captcha via a Python script
- Scrapes data from a Shopee merchant page
- Scrapes details from individual product pages
- Updates scraped data to the server using an API
- Node.js
- Playwright
- Python (for captcha handling)
- account to scrap
-
Clone this repository:
git clone https://github.com/nsanzimfura-eric/web-scraping.git
-
Navigate into the project directory:
cd web-scraping
-
Install dependencies:
npm install
-
Add your
.env
variables:cp .env.sample .env
-
Update
.env
with your API endpoints and Shopee account details.
- To start the scraper:
npm start
I, Nsanzimfura Eric contributed to this web-scraping app, and an author.
This project is licensed under the MIT License - see the LICENSE.md file for details.