GitHub - SairoTech/customer_purchasing_behaviours

Customer Purchasing Behaviours - Team Project

Team Members :

Anshu Dwivedi
Henry Giorgi
Maral Barkhordari
Sonu Abraham
Keyuan Huang

Business case :

Use predictive analytics to recommend the most effective marketing strategies for different customer segments based on their purchasing frequency, loyalty score, and annual income.

Our team has chosen the Customer Purchasing Behavior Dataset for in-depth analysis, aiming to derive valuable insights into customer purchasing behaviors.

The objective of this project is to understand how various features correlate to determine the effective marketing strategies. By utilizing regression model, we aim to predict marketing strategies that adddress the diverse interests of various customer segments.This analysis will leverage details from the dataset including customer age, annual income, region,loyalty score and purchashing frequency.

About Dataset:

customer_id: Unique ID of the customer.
age: The age of the customer.
annual_income: The customer's annual income (in USD).
purchase_amount: The total amount of purchases made by the customer (in USD).
purchase_frequency: Frequency of customer purchases (number of times per year).
region: The region where the customer lives (North, South, East, West).
loyalty_score: Customer's loyalty score (a value between 0-100).

Our Approach

Step 1: Data Understanding and Preprocessing

We loaded, cleaned, and preprocessed the dataset, addressing missing values and outliers, and scaled numerical features and encoded categorical variables.

Step 2: Exploratory Data Analysis (EDA):

We conducted segmentation analysis based on customer income and loyalty scores, visualizing purchasing behaviors through histograms, scatter plots, and box plots to identify patterns.

Step 3: Model Selection and Training

We compared different models, including Linear Regression, Decision Tree, and Random Forest. Random Forest emerged as the most effective model due to its high performance and feature importance.

Step 4: Model Evaluation

We evaluated model performance using R², MAE, and RMSE, and fine-tuned the Random Forest model with GridSearchCV to optimize hyperparameters.

Step 5: Interpretation and Insights

The most infleuntial features that significantly affect customer purchasing behavior are annual_income, loyalty_score and purchase_to_income_ratio.
Customer Segments with :
- High Income, Low loyalty score : Offer premium rewards and exclusive services to build loyalty that can help to increase retention.
- Low Income, High Loyalty: Provide small, frequent rewards and discounts to retain engagement.
- Medium Income, Moderate Loyalty: These are average earners with moderate engagement. Use value-driven promotions like bundles and seasonal offers to boost frequency.
Key Visual Insights :
- Income Vs Loyalty(Box Plot): High-income customers often have untapped engagement potential.
- Loyalty Score Vs Purchasing Frequency : Loyalty increases align with higher purchasing frequency, highlighting opportunities for tiered programs.
- Segmented Bar Plot: Loyalty and income directly influence purchasing frequency, suggesting tailored strategies for each combination.
- Violin Plot (Loyalty Score): Loyalty score distributions vary with purchasing frequency, offering clues about engagement levels across different groups.
- 3D Scatter Plot : Visualizes interactions between income, loyalty, and purchasing frequency, revealing distinct customer clusters for strategic targeting.
Videos:

Sonu Abraham - https://drive.google.com/file/d/1F4KytKKsRCrrS-Qu_vtHZOqePTLyBkyt/view?usp=drive_link

Maral Barkhordari - https://drive.google.com/file/d/1oF-2R6wlIWy9tA8quGYx16s20FzoK4Ol/view?usp=drive_link

Keyuan Huang - https://cloudmails-my.sharepoint.com/:p:/g/personal/tp079250_mail_apu_edu_my/EWp3JD4GINpKrwNtH924B9ABsZ8iCYSJBbkkI2zgJKmFdg?e=5Z9v2z

Anshu Dwivedi - https://vimeo.com/1037096694 https://drive.google.com/file/d/1jxMFAqXdZZTdOLpWa_dd792mO-hi2NiB/view?usp=drivesdk

Henry Giorgi - https://drive.google.com/file/d/1lUT5FODGzrZc69pCjFlx7CKI9M0FJ4x-/view?usp=drive_link

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
Plots		Plots
code		code
git ignore		git ignore
processed		processed
raw dataset		raw dataset
.DS_Store		.DS_Store
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Customer Purchasing Behaviours - Team Project

Team Members :

Business case :

About Dataset:

Our Approach

Videos:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Customer Purchasing Behaviours - Team Project

Team Members :

Business case :

About Dataset:

Our Approach

Videos:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages