Skip to content

main5equence/global-cyber-threat-analysis

Repository files navigation

Global Cyber Threat Analysis

Project Overview

This project analyzes global cybersecurity incidents to identify patterns in cyber attacks, financial losses, targeted industries, and threat actors. The analysis combines SQL, Python, and Power BI to explore the dataset, perform data analysis, and create an interactive cyber threat dashboard. The goal of this project is to demonstrate how data analytics can be applied to cybersecurity data to uncover insights about global cyber threats.

Cybersecurity Threat Intelligence Security Security Dependabot License


Dataset

The dataset contains global cybersecurity incidents between 2015 and 2024.

Source: https://www.kaggle.com/datasets/atharvasoundankar/global-cybersecurity-threats-2015-2024

Each record includes:

  • Country
  • Year
  • Attack Type
  • Target Industry
  • Financial Loss (in Million $)
  • Number of Affected Users
  • Attack Source
  • Security Vulnerability Type
  • Defense Mechanism Used
  • Incident Resolution Time

The dataset contains 3000 cybersecurity incidents across multiple industries and countries.


Tools & Technologies

This project uses multiple tools to simulate a real-world data analysis workflow.

Data Analysis

  • SQL (SQLite)

Data Processing & Visualization

  • Python
  • Pandas
  • Matplotlib
  • Seaborn

Dashboard

  • Power BI

Project Workflow

CSV Dataset → SQLite Database → SQL Analysis → Python Visualization → Power BI Dashboard


Project Structure

global-cyber-threat-analysis
│
├── data
│   └── Global_Cybersecurity_Threats_2015-2024.csv
│
├── sql
│   └── analysis.sql
│
├── python
│   ├── import_data.py
│   ├── visualization.py
│   └── visualization_attacksourcedist.py
│
├── dashboard
│   └── cyber-threats.pbix
│
├── requirements.txt
├── README.md
└── LICENSE

SQL Analysis

SQL was used to explore and analyze the cybersecurity dataset stored in SQLite.


Preview Data

SELECT *
FROM threats
LIMIT 10;
Zrzut ekranu 2026-03-11 203328 This query previews the first rows of the dataset.

Most Common Cyber Attack Types

SELECT "Attack Type", COUNT(*) as attacks
FROM threats
GROUP BY "Attack Type"
ORDER BY attacks DESC;
Zrzut ekranu 2026-03-11 203624 This query identifies the most frequent types of cyber attacks. image

Top Countries by Cybercrime Financial Loss

SELECT Country,
SUM("Financial Loss (in Million $)") as losses
FROM threats
GROUP BY Country
ORDER BY losses DESC;
Zrzut ekranu 2026-03-11 203835 Shows which countries experience the highest financial losses caused by cyber attacks. image

Financial Loss by Attack Type

SELECT "Attack Type",
SUM("Financial Loss (in Million $)") as total_loss
FROM threats
GROUP BY "Attack Type"
ORDER BY total_loss DESC;
Zrzut ekranu 2026-03-11 204502 Analyzes which attack types generate the largest financial damage. image

Most Common Cyber Attack Sources

SELECT "Attack Source",
COUNT(*) as incidents
FROM threats
GROUP BY "Attack Source"
ORDER BY incidents DESC;
Zrzut ekranu 2026-03-11 204747 Identifies the main threat actors responsible for cyber incidents. image

Industries Most Targeted by Cyber Attacks

SELECT "Target Industry",
COUNT(*) as attacks
FROM threats
GROUP BY "Target Industry"
ORDER BY attacks DESC;
Zrzut ekranu 2026-03-11 204933

Shows which industries are most frequently targeted.

image

Cyber Attacks Over Time

SELECT Year,
COUNT(*) as incidents
FROM threats
GROUP BY Year
ORDER BY Year;
Zrzut ekranu 2026-03-11 205108

Analyzes how the number of cyber attacks changes over time.

image

Largest Individual Cyber Incidents

SELECT Country,
Year,
"Attack Type",
"Target Industry",
"Financial Loss (in Million $)"
FROM threats
ORDER BY "Financial Loss (in Million $)" DESC
LIMIT 10;
Zrzut ekranu 2026-03-11 205206 Identifies the cyber incidents with the highest financial losses.

Dashboard Preview

cyber1powerbi cyber2powerbi

How to Run the Project

  1. Clone the repository
git clone https://github.com/yourusername/global-cyber-threat-analysis.git
  1. Import the dataset into SQLite
python import_data.py

This script loads the CSV dataset into a SQLite database (cyber_threats.db).

  1. Run SQL analysis queries

Open the database using the SQLite extension in VS Code and run queries from:

analysis.sql
  1. Run Python visualizations
python visualization.py
  1. Open the Power BI dashboard

Open the file:

cyber-threats.pbix

to explore the interactive cybersecurity dashboard.

Key Insights

  • DDoS and phishing attacks are the most frequent cyber threats.
  • DDoS attacks generate the highest financial losses among the analyzed attack types.
  • Nation-state actors and unknown attackers are responsible for the largest share of cyber incidents.
  • IT, banking, and healthcare sectors are among the most frequently targeted industries.
  • Cyber incidents remain consistently high across the years analyzed (2015–2024).
  • Large-scale cyber attacks can result in financial losses approaching 100 million USD per incident.

Future Improvements

• Add machine learning models to predict cyber attack trends
• Integrate threat intelligence APIs

About

This project analyzes global cybersecurity incidents to identify patterns in cyber attacks, financial losses, affected industries, and threat actors. The analysis combines SQL data exploration, Python visualizations, and an interactive Power BI dashboard to better understand the global cyber threat landscape.

Topics

Resources

License

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages