Skip to content

mominurr/Amazon-Best-Sellers-Data-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Amazon Best Sellers Data Analysis

📊 Project Overview

This project analyzes product trends from Amazon's Best Sellers (amazon.com/Best-Sellers) to identify sales patterns, top-performing categories/brands, and customer preferences. The analysis reveals actionable insights for sellers, marketers, and e-commerce professionals.

Total Data Scraped: 600 rows × 14 columns
After Handling Missing Values: 597 rows × 14 columns


🖼️ Interactive Dashboard Preview


Amazon Best Sellers Dashboard

Explore Full Dashboard on Tableau Public


🔑 Key Findings

1. Impact of Reviews on Sales

  • As the number of reviews increases, the sales of a product tend to increase significantly.

2. Relationship Between Reviews and Seller Rank

  • Products with the highest number of reviews tend to rank closer to #1 in the seller rankings.

3. Top Categories

  • Parent Category: Clothing, Shoes & Jewelry
  • Subcategory: Men's Mules & Clogs
    (Products in these categories with more reviews are purchased more frequently.)

4. Top 10 Brands

  • The top brand is Ring, followed by others identified in the analysis.

5. Top 10 Products

  • The leading product is Crocs Unisex Adult Classic Clog.

6. Trends Over Time

  • Products released earlier (e.g., 2004) show lower sales in recent months, while newer products (e.g., 2024 releases) demonstrate higher sales growth.
  • Based on this trend, products released in 2025 are expected to achieve the highest sales.

7. Geographical Insights

  • A geographical analysis highlights the number of distinct brands across countries.
  • For instance, China has 126 distinct brands, the highest among all countries.

⚙️ Technical Implementation

Data Collection

  • Scraped 600 products using Python (BeautifulSoup/Selenium)
  • Raw dataset: 600 rows × 14 columns
  • Cleaned dataset: 597 rows × 14 columns (Data Preparation Notebook)

Tools Used

  • Python > pandas, selenium, BeautifulSoup
  • Tableau
  • Jupyter Notebook

🚀 Project Usage Guide

To replicate or extend this analysis, follow the steps below:

Prerequisites

Ensure Python is installed on your machine.

Steps to Run the Project

  1. Clone the Repository

    git clone https://github.com/mominurr/Amazon-Best-Sellers-Data-Analysis.git
  2. Create a Virtual Environment

    python -m venv myvenv
  3. Install Dependencies

    pip install -r requirements.txt
    
  4. Run the Scraper Script

    Execute the script to scrape data from Amazon.

    python scraper.py
  • The scraped data will be saved as data/raw_data.csv.

Process the Data

Open and run the data_preparation.ipynb notebook to handle missing and duplicate values.

  • The cleaned data will be saved as data/cleaned_data.csv.

📜 License

This project is licensed under the MIT License – see the LICENSE file for details.

🛠️ Contributions

Contributions are welcome! Feel free to fork the repo and submit a pull request.

📩 Contact

For any inquiries or collaborations:

🚀 Star this repo ⭐ if you find it useful!

Releases

No releases published

Packages