Skip to content

Latest commit

 

History

History
46 lines (33 loc) · 1.86 KB

File metadata and controls

46 lines (33 loc) · 1.86 KB

Summer Olympics Data Analysis

This was the 2nd task done under the Elite Techno Groups as Python for ML/AI Intern.

ABOUT

  • The "Summer Olympics Dataset" is from the year 1896-2012.
  • It contains data about Athlete like their Name, Gender, Country they represent, the Medal (Gold/Silver/Bronze) they won and in which Sport, Discipline and Event.
  • It also contains the City names and Year in which the Olympics was held.

DATA ANALYSIS

  • The dataset was analysed and 8 Major Questions, as asked by the instructor, is answered with the help of two major analysis libraries of Python i.e. Numpy and Pandas and also for those questions in which Top 5 was asked, is answered with the help of bar plots using two major visualization libraries of Python i.e. Matplotlib and Seaborn.
  • Eight Major Questions answered by data analysis are:-
    • In how many cities Summer Olympics is held so far?
    • Which sport is having most number of Gold Medals so far? (Top 5)
    • Which sport is having most number of medals so far? (Top 5)
    • Which player has won most number of medals? (Top 5)
    • Which player has won most number Gold Medals of medals? (Top 5)
    • In which year India won first Gold Medal in Summer Olympics?
    • Which event is most popular in terms on number of players? (Top 5)
    • Which sport is having most female Gold Medalists? (Top 5)

PROGRAMMING LANGUAGE

  • Python
  • Libraries Used
    • Numpy
    • Pandas
    • Matplotlib
    • Seaborn

TOOLS USED

  • Jupyter Notebook:- For Data Analysis
  • GIT:- To push our project file to GitHub

AUTHOR

Subham Surana

CONNECT WITH ME