
This is a small Python project that collects data from a website and saves it to a JSON file.


enivill/web-scraping


Web scraping in Python

This is a small project that collects data from a website and saves it to a JSON file.

Description

Parsed URL: https://www.hyperia.sk/kariera/

The program finds all job offers on the page at this URL and collects further details by following each offer's "viac info" ("more info") hyperlink.

For each offer it extracts the title, salary, location, contract type and contact, and saves them to the data.json file.
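The link-collection step can be sketched roughly as follows. This is a minimal sketch, assuming each offer links to its detail page through an anchor whose visible text contains "viac info". The function name find_detail_links and the sample markup are hypothetical, so the real page structure and the project's own code may differ.

```python
from urllib.parse import urljoin

from bs4 import BeautifulSoup

BASE_URL = "https://www.hyperia.sk/kariera/"


def find_detail_links(html, base_url=BASE_URL):
    """Return absolute URLs of all anchors labelled "viac info" (hypothetical helper)."""
    soup = BeautifulSoup(html, "html.parser")
    links = []
    for anchor in soup.find_all("a", href=True):
        # Assumption: the detail link is recognisable by its visible text.
        if "viac info" in anchor.get_text(strip=True).lower():
            url = urljoin(base_url, anchor["href"])
            if url not in links:  # keep order, skip duplicates
                links.append(url)
    return links


# Hypothetical markup for illustration only, not copied from the real page.
SAMPLE_HTML = """
<div><h3>Example role A</h3><a href="/kariera/example-a/">Viac info</a></div>
<div><h3>Example role B</h3><a href="example-b/">viac info</a></div>
<a href="/kontakt/">Kontakt</a>
"""

detail_links = find_detail_links(SAMPLE_HTML)
```

Relative links are resolved against the listing URL with urljoin, so the returned list can be passed straight to whatever code opens the detail pages.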
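Writing the collected fields to data.json might look like the sketch below. The key names (title, salary, location, contract_type, contact), the helper name save_offers and the placeholder values are assumptions made for illustration; the project's actual JSON layout is not documented here.

```python
import json


def save_offers(offers, path="data.json"):
    """Write a list of offer dictionaries to a JSON file (hypothetical helper)."""
    with open(path, "w", encoding="utf-8") as fh:
        # ensure_ascii=False keeps Slovak diacritics readable in the output file.
        json.dump(offers, fh, ensure_ascii=False, indent=2)


# Placeholder record; the values are illustrative, not scraped data.
example_offer = {
    "title": "Example role",
    "salary": "n/a",
    "location": "n/a",
    "contract_type": "n/a",
    "contact": "n/a",
}
```

Calling save_offers([example_offer]) would create data.json in the current working directory.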

Libraries

Libraries used for this project: BeautifulSoup4, urllib.request

Beautiful Soup is a Python library for parsing HTML and XML documents.

urllib is a package in the Python 3 standard library; its urllib.request module opens and reads URLs.
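As a rough illustration of how urllib.request can supply HTML for Beautiful Soup to parse, here is a minimal sketch. The function name fetch_html and the 10-second timeout are assumptions, and the project's own code may fetch pages differently.

```python
import urllib.request


def fetch_html(url, timeout=10.0):
    """Download a URL and return its body as text (hypothetical helper)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        # Use the charset declared by the server, falling back to UTF-8.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")
```

The returned string can then be handed to BeautifulSoup(html, "html.parser"), for example after html = fetch_html("https://www.hyperia.sk/kariera/").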

Install required packages

The required packages are listed in the requirements.txt file. You can install them with the following command:

pip install -r requirements.txt

Execute program

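You can run the script from the command line by typing py main.py (py is the Python launcher on Windows; on other systems, python main.py or python3 main.py is the usual equivalent).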
