Skip to content

SairoTech/customer_purchasing_behaviours

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

56 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Customer Purchasing Behaviours - Team Project

Team Members :

Anshu Dwivedi
Henry Giorgi
Maral Barkhordari
Sonu Abraham
Keyuan Huang

Business case :

Use predictive analytics to recommend the most effective marketing strategies for different customer segments based on their purchasing frequency, loyalty score, and annual income.

Our team has chosen the Customer Purchasing Behavior Dataset for in-depth analysis, aiming to derive valuable insights into customer purchasing behaviors.

The objective of this project is to understand how various features correlate to determine the effective marketing strategies. By utilizing regression model, we aim to predict marketing strategies that adddress the diverse interests of various customer segments.This analysis will leverage details from the dataset including customer age, annual income, region,loyalty score and purchashing frequency.

About Dataset:

customer_id: Unique ID of the customer.
age: The age of the customer.
annual_income: The customer's annual income (in USD).
purchase_amount: The total amount of purchases made by the customer (in USD).
purchase_frequency: Frequency of customer purchases (number of times per year).
region: The region where the customer lives (North, South, East, West).
loyalty_score: Customer's loyalty score (a value between 0-100).

Our Approach

Step 1: Data Understanding and Preprocessing

We loaded, cleaned, and preprocessed the dataset, addressing missing values and outliers, and scaled numerical features and encoded categorical variables.

Step 2: Exploratory Data Analysis (EDA):

We conducted segmentation analysis based on customer income and loyalty scores, visualizing purchasing behaviors through histograms, scatter plots, and box plots to identify patterns.

Step 3: Model Selection and Training

We compared different models, including Linear Regression, Decision Tree, and Random Forest. Random Forest emerged as the most effective model due to its high performance and feature importance.

Step 4: Model Evaluation

We evaluated model performance using R², MAE, and RMSE, and fine-tuned the Random Forest model with GridSearchCV to optimize hyperparameters.

Step 5: Interpretation and Insights

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Jupyter Notebook 100.0%