nutella-scrape

  1. Run sudo npm install nutella-scrape -g
  2. Run nutella-scrape
  3. ???
  4. LEARN!!

In this tutorial, we will work through how to scrape websites using Node.js, with the primary goal of using the scraped data in other programs: in servers, in frontends (yes, Node-style code can run in the browser once it is bundled with a tool such as Browserify!), or just by writing a table to disk for analysis elsewhere.
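As a rough sketch of the "writing a table to disk" case, the snippet below turns an array of rows into CSV text and saves it with Node's built-in `fs` module. The helper names (`csvField`, `toCsv`), the sample rows, and the output file name are all made up for illustration; a real scraper would fill `rows` from a page.

```javascript
const fs = require('node:fs');
const os = require('node:os');
const path = require('node:path');

// Quote a single value for CSV: wrap it in double quotes when it contains a
// comma, a quote, or a line break, and double any embedded quotes.
function csvField(value) {
  const text = String(value);
  return /[",\r\n]/.test(text) ? '"' + text.replace(/"/g, '""') + '"' : text;
}

// Turn an array of row arrays into CSV text, one line per row.
function toCsv(rows) {
  return rows.map((row) => row.map(csvField).join(',')).join('\n') + '\n';
}

// Hypothetical rows standing in for scraped data.
const rows = [
  ['name', 'price'],
  ['Example item, large', '5.99'],
];

// Write to the system temp directory so the sketch does not clutter the
// working directory; the file name is an arbitrary choice.
const outFile = path.join(os.tmpdir(), 'scrape-example.csv');
fs.writeFileSync(outFile, toCsv(rows));
```

The resulting file can be opened in a spreadsheet or loaded by another program. For anything beyond a small table, a dedicated CSV library is probably a safer choice than hand-rolled quoting.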

The DOM (Document Object Model) is a tree-shaped representation of an HTML document, together with an API for reading and changing it. JavaScript is GREAT for traversing HTML (i.e., the DOM) because it was designed to work with web pages in the first place.
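To make the idea of "traversing" concrete, here is a minimal, dependency-free sketch. It does not use a real DOM: `page` is a hand-built tree of plain objects standing in for a parsed document, and `findAll` is a hypothetical helper, not part of any library. In an actual scraper, an HTML parser would produce the tree for you.

```javascript
// A hand-built stand-in for a parsed page: each node has a tag name,
// optional text, and a list of children, loosely mirroring DOM elements.
const page = {
  tag: 'table',
  children: [
    {
      tag: 'tr',
      children: [
        { tag: 'td', text: 'a', children: [] },
        { tag: 'td', text: '1', children: [] },
      ],
    },
    {
      tag: 'tr',
      children: [
        { tag: 'td', text: 'b', children: [] },
        { tag: 'td', text: '2', children: [] },
      ],
    },
  ],
};

// Depth-first walk that collects every node whose tag matches,
// in document order.
function findAll(node, tag, found = []) {
  if (node.tag === tag) found.push(node);
  for (const child of node.children) findAll(child, tag, found);
  return found;
}

// Build one array of cell text per table row.
const table = findAll(page, 'tr').map((row) =>
  findAll(row, 'td').map((cell) => cell.text)
);

console.log(table);
```

The same pattern (find the rows, then find the cells inside each row) carries over to real DOM APIs, where selector-based lookups replace the hand-written walk.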

TODO

  • parallel
  • spoofing
  • cookies/login walls
  • electron-microscope