This repository contains a Python script that analyzes and models the bank.csv
dataset. The script uses pandas
, seaborn
, matplotlib
, scikit-learn
libraries to perform data preprocessing, feature engineering, and model selection. Finally it will predict giving loan or not to a customer.
- pandas
- seaborn
- matplotlib
- scikit-learn
-
Clone the repository:
git clone https://github.com/Nishef/bank-marketing.git
-
Load the dataset by running the
main.py
script:python main.py
-
The script will output the summary statistics of numerical variables, create a correlation matrix using seaborn's heatmap function, check for duplicated values, encode categorical columns, split the data into train and test sets, and create two machine learning models: a logistic regression model and a random forest classifier model.
-
The script will output the classification report and confusion matrix for each model.
This project is licensed under the MIT License - see the LICENSE file for details.