What it does? EDA Patrol_Gawkers is a crime data analysis project that explores district-wise crime patterns across India from 2001 to 2014. It performs in-depth Exploratory Data Analysis (EDA), visualizes key crime trends, identifies high-risk areas, and uses statistical and machine learning techniques to reveal insights and predict future crime trends.

How we built it We used Jupyter Notebook and Python libraries like Pandas, Matplotlib, Seaborn, and Scikit-learn. Starting from data cleaning, we performed step-by-step EDA, visualizations (including geospatial and interactive filtering), and implemented clustering and regression models to classify and forecast crime rates.

Challenges we ran into Understanding and cleaning a large, complex dataset with many crime categories. Figuring out the best way to visualize such a wide range of variables. Interpreting crime data across states and districts with varying population sizes and crime reporting accuracy. Learning and applying ML techniques as a beginner without prior data science experience.

Accomplishments that we're proud of Successfully analyzed over 10,000+ records of crime data. Identified the top crime-affected states and districts. Built meaningful plots and dashboards to visualize crime density and trends. Learned how to apply clustering and regression for real-world insights. Completed an end-to-end project independently, despite being a beginner in data science.

What we learned The power of Python for data science. How to approach a project step-by-step: from cleaning to modeling. Exploratory Data Analysis techniques and visualization best practices. Basics of clustering (K-Means) and regression. Real-world use cases of crime data analytics and reporting.

What's next for Gawkers Add interactive dashboards using Streamlit or Plotly Dash. Integrate population data to normalize crime rates per capita. Build a more accurate time-series forecasting model. Explore seasonal crime patterns and gender-based crime analysis. Possibly deploy as a web app to allow users to explore data interactively.

Built With

Share this project:

Updates