This project predicts whether a bank customer will churn (leave) or stay using cutting-edge machine learning techniques. It includes comprehensive data preprocessing, advanced model training, and a sleek Streamlit web application for real-time predictions.
- Overview
- Project Structure
- Setup and Installation
- How to Run
- Features
- Results
- Technologies Used
- Contributing
Customer churn prediction is crucial for financial institutions to enhance customer retention and profitability. This project leverages a robust machine learning pipeline featuring XGBoost, LightGBM, and Neural Networks for high-accuracy predictions. It also provides an intuitive Streamlit web app for live customer churn predictions.

Prerequisites:
Python 3.8 or higher
Jupyter Notebook
pip or conda for package management
Install Dependencies: pip install -r requirements.txt
✅ Run the Notebook for Analysis & Model Training
Navigate to Bank_Churn_Prediction_Analysis.ipynb and run all cells for EDA, feature engineering, and model training.
Ensure the trained model (final_churn_model.pkl) is in the models/ directory.
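The `final_churn_model.pkl` file referenced above is produced with joblib. A minimal sketch of the save/load round trip, using a RandomForestClassifier on synthetic data as a stand-in for the actual trained model (the real file lives at `models/final_churn_model.pkl`):

```python
import os
import tempfile

import joblib
import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Synthetic stand-in for the real training data
rng = np.random.default_rng(42)
X = rng.normal(size=(200, 10))
y = (X[:, 0] + X[:, 1] > 0).astype(int)

model = RandomForestClassifier(n_estimators=100, random_state=42).fit(X, y)

# Serialize the fitted model (in the project: models/final_churn_model.pkl)
path = os.path.join(tempfile.mkdtemp(), "final_churn_model.pkl")
joblib.dump(model, path)

# Later, inside the app, reload and predict
loaded = joblib.load(path)
pred = loaded.predict(X[:1])
print(pred)
```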
Launch the app:
streamlit run app/streamlit_app.py
Enter customer details in the web form and click Predict for instant feedback:
🚨 Exit: Customer likely to churn.
✅ No Exit: Customer likely to stay.
Data preprocessing: Handling missing values, outlier removal, and feature scaling.
Feature engineering with PCA for dimensionality reduction.
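The preprocessing steps above can be sketched as a scikit-learn pipeline. Column counts and parameters here are illustrative, not taken from the notebook:

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.impute import SimpleImputer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

preprocess = Pipeline([
    ("impute", SimpleImputer(strategy="median")),  # handle missing values
    ("scale", StandardScaler()),                   # feature scaling
    ("pca", PCA(n_components=0.95)),               # keep 95% of the variance
])

# Synthetic numeric features with ~5% missing values
# (outlier removal, e.g. an IQR filter, would typically run before this)
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 12))
X[rng.random(X.shape) < 0.05] = np.nan

X_reduced = preprocess.fit_transform(X)
print(X_reduced.shape)
```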
Multiple ML models tested:
XGBoost
LightGBM
Random Forest
Neural Networks (Keras)
Random Forest selected as the best model with 87% accuracy.
Developed using Streamlit for real-time, web-based predictions.
Clean UI with dynamic feedback:
🟥 Exit (Red): High churn risk
🟩 No Exit (Green): Low churn risk
📈 Results
F1 Score: 0.86
AUC-ROC Score: 0.91
Real-time prediction capability via Streamlit.
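The F1 and AUC-ROC scores above come from the notebook; the metrics themselves can be computed with scikit-learn's scoring functions. A sketch on synthetic data (numbers will differ from the reported results):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import f1_score, roc_auc_score
from sklearn.model_selection import train_test_split

# Imbalanced synthetic data, mirroring the churn class ratio
X, y = make_classification(n_samples=1000, weights=[0.8, 0.2], random_state=42)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=42)

clf = RandomForestClassifier(random_state=42).fit(X_tr, y_tr)

f1 = f1_score(y_te, clf.predict(X_te))
auc = roc_auc_score(y_te, clf.predict_proba(X_te)[:, 1])  # AUC uses probabilities
print(f"F1: {f1:.2f}, AUC-ROC: {auc:.2f}")
```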
Languages & Frameworks: Python (Pandas, NumPy, Scikit-learn, XGBoost, LightGBM, Keras)
Visualization: Matplotlib, Seaborn, Plotly
Deployment: Streamlit
Model Serialization: joblib
Other Tools: Git, Jupyter Notebook
Contributions are welcome! Feel free to fork the repository and submit pull requests.
I'm excited to present my latest addition to the Machine Learning Series: Bank Customer Churn Prediction 🏦📊. This project was a remarkable experience, exploring data preprocessing, feature engineering, and advanced machine learning methods to address a crucial business challenge—predicting whether a customer will stay with the bank or churn.
📌 Dataset:
The dataset comprised customer demographics, account information, and behavioral attributes.
Dropped identifier columns (RowNumber, CustomerId, Surname) that carry no predictive signal.
Performed one-hot encoding on categorical features (Geography and Gender), ensuring the dummy variable trap was avoided.
Scaled numerical features such as CreditScore, Balance, and EstimatedSalary for consistency across the dataset.
Identified data imbalance in churn distribution, with fewer customers leaving compared to those staying (SMOTE was used to balance the dataset).
Key predictors included Age, Tenure, and Balance, as identified through a correlation matrix analysis.
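The data-preparation steps above can be sketched with pandas and scikit-learn. The toy frame below mirrors the dataset's column names; for class balancing it uses simple minority upsampling as a stand-in, since the project itself used SMOTE (from imbalanced-learn):

```python
import pandas as pd
from sklearn.preprocessing import StandardScaler
from sklearn.utils import resample

# Tiny synthetic frame mirroring the dataset's schema
df = pd.DataFrame({
    "RowNumber": range(6),
    "CustomerId": [101, 102, 103, 104, 105, 106],
    "Surname": list("ABCDEF"),
    "CreditScore": [600, 700, 650, 580, 720, 690],
    "Geography": ["France", "Spain", "Germany", "France", "Spain", "Germany"],
    "Gender": ["Male", "Female", "Male", "Female", "Male", "Female"],
    "Balance": [0.0, 50000.0, 120000.0, 0.0, 80000.0, 60000.0],
    "EstimatedSalary": [40000.0, 90000.0, 30000.0, 55000.0, 75000.0, 62000.0],
    "Exited": [0, 0, 1, 0, 0, 1],
})

# Drop identifier columns with no predictive signal
df = df.drop(columns=["RowNumber", "CustomerId", "Surname"])

# One-hot encode categoricals; drop_first avoids the dummy variable trap
df = pd.get_dummies(df, columns=["Geography", "Gender"], drop_first=True)

# Scale numeric features
num_cols = ["CreditScore", "Balance", "EstimatedSalary"]
df[num_cols] = StandardScaler().fit_transform(df[num_cols])

# Balance classes by upsampling the minority class (stand-in for SMOTE)
minority = df[df["Exited"] == 1]
majority = df[df["Exited"] == 0]
upsampled = resample(minority, replace=True,
                     n_samples=len(majority), random_state=42)
balanced = pd.concat([majority, upsampled])
print(balanced["Exited"].value_counts())
```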
Trained and evaluated multiple classification models:
Logistic Regression
Random Forest
Gradient Boosting
🏆 Random Forest delivered the best performance with 87% accuracy, balancing precision and recall effectively.
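All three models listed above are available in scikit-learn, so the comparison can be sketched directly with cross-validation. Data and hyperparameters here are illustrative, not the project's actual settings:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Synthetic binary-classification data as a stand-in for the churn dataset
X, y = make_classification(n_samples=800, n_informative=6, random_state=0)

models = {
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Random Forest": RandomForestClassifier(random_state=0),
    "Gradient Boosting": GradientBoostingClassifier(random_state=0),
}

# 5-fold cross-validated accuracy for each candidate
scores = {}
for name, model in models.items():
    scores[name] = cross_val_score(model, X, y, cv=5, scoring="accuracy").mean()
    print(f"{name}: {scores[name]:.3f}")
```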
Built an interactive GUI using Tkinter for user-friendly predictions.
Users can input customer details and receive instant predictions on churn risk.
Visual feedback integrated with clear labels:
🔴 Exit: Customer likely to churn.
🟢 Stay: Customer likely to remain.
Proactively identifies customers at risk of leaving.
Supports targeted retention strategies; retaining an existing customer is typically far cheaper than acquiring a new one.
📊 Older customers and those with high balances but low engagement are more likely to churn.
📊 Geography and gender significantly influence churn probability, emphasizing the need for localized strategies.
This project sharpened my technical skills and enhanced my understanding of customer behavior in real-world scenarios. I gained practical experience in:
Managing imbalanced datasets
Applying feature scaling techniques
Developing user-friendly ML applications


