About | Features | Technologies | Requirements | Starting | Photos | Results | Author
This project focuses on analyzing customer churn data to identify patterns and factors contributing to service cancellations. The goal is to uncover insights that can help improve customer retention by analyzing historical data and visualizing key trends.
✔️ Customer churn analysis;
✔️ Data visualization to uncover patterns;
✔️ Identification of factors contributing to churn.
The following tools were used in this project:
-
Before starting, you need to have Python installed.
-
Install Jupyter extension for VS Code or use Anaconda for managing Python environments and running Jupyter Notebooks.
# Clone this project
$ git clone https://github.com/tamirespatrocinio/customer-churn-analysis
# Navigate into the project directory
$ cd customer-churn-analysis
# Install dependencies
$ pip install pandas numpy openpyxl nbformat ipykernel plotly
The data set used in the analysis contains the following columns:
Column Name | Type | Description |
---|---|---|
age | float64 | Customer's age. |
gender | object | Customer's gender (Male/Female). |
customer_time | float64 | Length of time the customer has been with the company (in months). |
frequency_use | float64 | Frequency of service use by the customer. |
callcenter_calls | float64 | Number of calls made to the call center. |
delay_days | float64 | Number of days of late payments. |
signature | object | Type of customer subscription (Basic, Standard, Premium). |
contract_duration | object | Contract duration (Monthly, Quarterly, Annual). |
total_expense | float64 | Total spent by the client to date. |
last_interaction_months | float64 | Months since last customer interaction. |
canceled | float64 | Indicates whether the customer has canceled (0.0 = Not Canceled, 1.0 = Canceled). |
Chart 1: Number of calls to the call center by customers. Chart 2: Number of days in delay for customer payments. Chart 3: Duration of contracts (Monthly, Quarterly, Annual) by customers.
The initial analysis revealed that the proportion of cancellations is as follows:
- Canceled: 56.71%
- Active: 43.29%
After applying retention measures, the cancellation rate fell from 56.71% to 36.27%, which corresponds to a reduction of approximately 20.44 percentage points. This reflects a significant improvement.