ML1_Task2

Medium Article Scraper

This Python script scrapes the contents of a Medium article given its URL and saves the extracted text in a .txt file. The script extracts all the paragraphs from the HTML page of the article and stores the content in a directory called scraped_articles. It uses beautiful soup library that scapes text from the webpage with the help of an html parser.

Requirements

Python 3.9
Libraries:
- requests
- BeautifulSoup (from the bs4 package)

To install the necessary libraries, run:

pip install requests
pip install beautifulsoup4

When prompted to input enter the following medium article url:

https://medium.com/@subashgandyer/papa-what-is-a-neural-network-c5e5cc427c7

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
scraped_articles		scraped_articles
README.md		README.md
Task1_Python_script.py		Task1_Python_script.py
task2.ipynb		task2.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ML1_Task2

Medium Article Scraper

Requirements

About

Releases

Packages

Languages

VaishnaviBarot/WebScrapper

Folders and files

Latest commit

History

Repository files navigation

ML1_Task2

Medium Article Scraper

Requirements

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages