Skip to content

29akeshvimt/British_Airways_

 
 

Repository files navigation

Evoastra Ventures Pvt Ltd

image

Data Science Internship Project [Team M]

British Airways Customer Feedback Analysis

Project Overview

This project involves scraping customer reviews from the Skytrax website, cleaning and analyzing the data to gain insights, and presenting the findings to influence key business decisions at British Airways (BA). The project focuses on understanding customer feedback and sentiments to improve service quality, reduce costs, and increase revenue.

Tasks Performed

1. Data Collection

  • Objective: Collect customer review data specifically related to British Airways from the Skytrax website.
  • Method: Utilized web scraping techniques in Python to extract review data, focusing on comments related to the airline’s services.
  • Output: Gathered a comprehensive dataset containing customer reviews, including comments, ratings, and review dates.

2. Data Cleaning

  • Objective: Prepare the raw data for analysis by cleaning and organizing it.
  • Process:
    • Removed duplicates and irrelevant entries.
    • Standardized text formatting (e.g., lowercasing, removing special characters).
    • Handled missing values and corrected any data inconsistencies.
  • Output: A clean dataset ready for analysis, with relevant and structured information.
git

3. Data Analysis

  • Objective: Analyze the cleaned data to uncover insights into customer sentiments and common topics.
  • Approaches:
    • Sentiment Analysis: Assessed the overall sentiment (positive, neutral, negative) of customer reviews using natural language processing (NLP) techniques.
    • Topic Modelling: Identified common themes and topics discussed in the reviews using methods like Latent Dirichlet Allocation (LDA).
    • Word Clouds: Created visual representations of the most frequently mentioned words and phrases.
  • Output: Generated insights into customer satisfaction, common pain points, and areas for improvement.
git

4. Visualization and Reporting

  • Objective: Present the findings in a clear, concise, and visually appealing format.
  • Process:
    • Created visualizations such as sentiment distribution graphs, topic frequency charts, and word clouds.
    • Summarized key insights and metrics into a single PowerPoint slide.
  • Output: A presentation-ready slide with visualizations and explanatory notes for board-level discussions.

Tools and Technologies

  • Programming Language: Python
  • Libraries: BeautifulSoup, Pandas, NLTK, Scikit-learn, Matplotlib, WordCloud, Seaborn
  • IDE: Jupyter Notebook
  • Visualization Tool: PowerPoint

How to Run the Project

  1. Clone this repository.
  2. Install the necessary Python libraries from requirements.txt.
  3. Run the Jupyter Notebook to scrape and analyze the data.
  4. Generate visualizations and compile the final PowerPoint slide.

About

Customer Feedback Analysis Project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%