Anshu Dwivedi
Henry Giorgi
Maral Barkhordari
Sonu Abraham
Keyuan Huang
Use predictive analytics to recommend the most effective marketing strategies for different customer segments based on their purchasing frequency, loyalty score, and annual income.
Our team has chosen the Customer Purchasing Behavior Dataset for in-depth analysis, aiming to derive valuable insights into customer purchasing behaviors.
The objective of this project is to understand how various features correlate to determine the effective marketing strategies. By utilizing regression model, we aim to predict marketing strategies that adddress the diverse interests of various customer segments.This analysis will leverage details from the dataset including customer age, annual income, region,loyalty score and purchashing frequency.
customer_id: Unique ID of the customer.
age: The age of the customer.
annual_income: The customer's annual income (in USD).
purchase_amount: The total amount of purchases made by the customer (in USD).
purchase_frequency: Frequency of customer purchases (number of times per year).
region: The region where the customer lives (North, South, East, West).
loyalty_score: Customer's loyalty score (a value between 0-100).
Step 1: Data Understanding and Preprocessing
We loaded, cleaned, and preprocessed the dataset, addressing missing values and outliers, and scaled numerical features and encoded categorical variables.
Step 2: Exploratory Data Analysis (EDA):
We conducted segmentation analysis based on customer income and loyalty scores, visualizing purchasing behaviors through histograms, scatter plots, and box plots to identify patterns.
Step 3: Model Selection and Training
We compared different models, including Linear Regression, Decision Tree, and Random Forest. Random Forest emerged as the most effective model due to its high performance and feature importance.
Step 4: Model Evaluation
We evaluated model performance using R², MAE, and RMSE, and fine-tuned the Random Forest model with GridSearchCV to optimize hyperparameters.
Step 5: Interpretation and Insights
-
The most infleuntial features that significantly affect customer purchasing behavior are annual_income, loyalty_score and purchase_to_income_ratio.
-
Customer Segments with :
-
High Income, Low loyalty score : Offer premium rewards and exclusive services to build loyalty that can help to increase retention.
-
Low Income, High Loyalty: Provide small, frequent rewards and discounts to retain engagement.
-
Medium Income, Moderate Loyalty: These are average earners with moderate engagement. Use value-driven promotions like bundles and seasonal offers to boost frequency.
-
-
Key Visual Insights :
-
Income Vs Loyalty(Box Plot): High-income customers often have untapped engagement potential.
-
Loyalty Score Vs Purchasing Frequency : Loyalty increases align with higher purchasing frequency, highlighting opportunities for tiered programs.
-
Segmented Bar Plot: Loyalty and income directly influence purchasing frequency, suggesting tailored strategies for each combination.
-
Violin Plot (Loyalty Score): Loyalty score distributions vary with purchasing frequency, offering clues about engagement levels across different groups.
-
3D Scatter Plot : Visualizes interactions between income, loyalty, and purchasing frequency, revealing distinct customer clusters for strategic targeting.
Sonu Abraham - https://drive.google.com/file/d/1F4KytKKsRCrrS-Qu_vtHZOqePTLyBkyt/view?usp=drive_link
Maral Barkhordari - https://drive.google.com/file/d/1oF-2R6wlIWy9tA8quGYx16s20FzoK4Ol/view?usp=drive_link
Keyuan Huang - https://cloudmails-my.sharepoint.com/:p:/g/personal/tp079250_mail_apu_edu_my/EWp3JD4GINpKrwNtH924B9ABsZ8iCYSJBbkkI2zgJKmFdg?e=5Z9v2z
Anshu Dwivedi - https://vimeo.com/1037096694 https://drive.google.com/file/d/1jxMFAqXdZZTdOLpWa_dd792mO-hi2NiB/view?usp=drivesdk
Henry Giorgi - https://drive.google.com/file/d/1lUT5FODGzrZc69pCjFlx7CKI9M0FJ4x-/view?usp=drive_link
-



