Customer Inactivity Detection in Banking

Description

This was one of the first complete data science projects I developed independently—from data cleaning to model deployment. Using a Kaggle dataset on bank customer inactivity, I explored behavioral patterns and implemented a predictive model to classify customers at risk of becoming inactive. Beyond its academic value, the project inspired the development of similar analytics strategies within my current role at PBZ Bank, demonstrating how open-source exploration can translate into concrete business innovation.

Key Contributions:

  • Cleaned and processed raw banking data using Pandas

  • Performed exploratory data analysis and visualizations (churn patterns, inactivity rates)

  • Engineered features and tested multiple classification models (Logistic Regression, Random Forest, XGBoost)

  • Evaluated models using accuracy, precision, recall, and ROC-AUC metrics

  • Reflected insights back into real-world context within PBZ internal analytics

Tools

  • Python – Core development language

  • Pandas – Data cleaning & manipulation

  • Matplotlib / Seaborn – Exploratory data visualization

  • Scikit-learn – ML models & evaluation

  • XGBoost – Advanced classification performance

  • Jupyter Notebook – Interactive analysis & development