Skip to content

waleedgeo/GFSM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

5 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

GFSM Logo

Global Flood Susceptibility Map (GFSM v1)

A High-Resolution (30m) Global Dataset Derived from Multi-Source Geospatial Data

Launch GEE App Β»

Overview Β· Explorer App Β· Dataset Access Β· Citation Β· Contact

Status Resolution GEE DOI License


Overview

GFSM v1 is the first globally harmonized flood susceptibility dataset produced at a native 30-meter spatial resolution. Unlike traditional hazard models that rely on hydrodynamic simulations for specific return periods (e.g., 1-in-100 years), GFSM identifies the landscape's inherent propensity for flooding based on physical geographic and environmental controls.

Developed using a Gradient-Boosted Tree (XGBoost) machine learning framework, the model integrates topographic, hydrological, meteorological, and anthropogenic factors trained on over 30 million samples across 192 distinct climate zones and processed over 10,000 GB of high-resolution satellite data.

🎯 Why GFSM?

  • First-of-its-kind: Global harmonized susceptibility at 30m resolution
  • Data-rich foundation: Trained on extensive multi-source geospatial datasets
  • Climate-aware: Accounts for diverse climatic and geographic contexts
  • Actionable insights: Suitable for regional planning, exposure assessment, and risk screening
  • Open science: Data and methodology made available to the research community

πŸ“’ Project Status

The manuscript describing this dataset is currently under review at Nature Scientific Data. The dataset has been published on Zenodo (currently restricted) and will be made publicly available immediately following manuscript acceptance.

Dataset DOI: 10.5281/zenodo.18137662

πŸ“§ For early access to specific tiles or training samples for validation purposes, please contact the author directly.


GFSM Explorer App

Visualize and inspect the global dataset interactively using our Google Earth Engine web application. The app features smart-resolution switching and a click-to-inspect tool for detailed regional analytics.

Click the image above to launch the app.

Key App Capabilities:

  • Global Screening: Rapidly visualize susceptibility patterns from global to local scales.
  • Resolution Toggling: Switch between 1km overview layers and native 30m analysis layers.
  • Unit Inspector: Click any location to view specific model performance metrics (AUC, F1-Score) for that region (coming soon).

✨ Key Features

Coverage & Resolution

  • 🌍 Global Coverage: Spans all inhabited landmasses from ~80Β°N to ~60Β°S
  • 🎯 High Precision: Native 30m resolution for localized exposure assessment and infrastructure planning
  • πŸ“Š Comprehensive Training: Built on ~17,000 grid tiles across diverse geographic regions

Technical Excellence

  • πŸ€– Advanced ML Framework: Gradient-Boosted Trees (XGBoost) optimized for geospatial prediction
  • πŸ§ͺ Robust Validation: Global median AUC of ~0.95 across diverse terrains
  • 🌐 Climate-Aware: Model trained across 192 distinct KΓΆppen-Geiger climate zones

Input Variables (9 Flood Conditioning Factors)

Category Variables Data Source
Topographic Elevation, Slope, Aspect FABDEM (30m)
Hydrological HAND, TWI, Distance to Water Derived from MERIT-Hydro
Anthropogenic Distance to Roads, NDVI OpenStreetMap, Sentinel-2
Climate Rainfall Frequency GPM IMERG

Methodology

The generation of GFSM v1 followed a rigorous, high-performance computing workflow (involving distributed computing and advanced machine learning techniques on SHAHEEN-III and Google Earth Engine). Details will be updated upon manuscript acceptance, but feel free to reach out for preliminary insights.

Susceptibility Classes

Class Level Description
1 Very Low Areas with negligible flood propensity (e.g., high ridges, steep slopes).
2 Low Areas with minimal risk factors.
3 Moderate Transitional zones with some flood-prone characteristics.
4 High Areas with significant flood conditioning factors.
5 Very High Critical zones (floodplains, depressions) with maximum propensity.

πŸ“‚ Repository Structure

This repository serves as the official code and documentation hub for the GFSM project.

GFSM/
β”œβ”€β”€ scripts/              # Processing and analysis scripts (Coming soon)
β”‚   β”œβ”€β”€ preprocessing/    # Data preparation workflows
β”‚   β”œβ”€β”€ modeling/         # XGBoost training pipelines
β”‚   └── inference/        # GEE-based prediction scripts
β”œβ”€β”€ src/
β”‚   β”œβ”€β”€ img/              # Visual assets (Logo, screenshots, diagrams)
β”‚   └── docs/             # Additional documentation (Coming soon)
β”œβ”€β”€ other/
β”‚   └── sorting_zenodo/   # Utilities for dataset management
β”œβ”€β”€ LICENSE               # CC BY-NC-SA 4.0 License
└── README.md             # Project documentation (this file)

Note: Full processing scripts, training pipelines, and documentation will be released upon manuscript acceptance to ensure reproducibility.


πŸ“¦ Dataset Access

The complete GFSM v1 dataset has been archived on Zenodo with a permanent DOI. The dataset is currently restricted and will be made publicly available immediately upon manuscript acceptance.

Zenodo Repository

Citation: Waleed, M. (2026). Global Flood Susceptibility Map (GFSM v1): A high resolution (30m) flood susceptibility dataset derived from multi-source geospatial data (Version V1) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.18137662

DOI: 10.5281/zenodo.18137662

What's Included

The Zenodo archive contains:

  • Global GeoTIFF tiles at 30m resolution
  • Metadata and documentation for all layers
  • Global Tiles Index Grid Allowing users to identify and download specific regions

Access Methods

Method Status Description
Zenodo Direct Download πŸ”’ Restricted Full dataset archive (Available post-publication)
GEE Explorer App βœ… Live Interactive visualization and inspection
GEE Asset πŸ”œ Coming Soon Programmatic access via Earth Engine API
Early Access Request πŸ“§ Available Contact author for research purposes

πŸ—ΊοΈ Roadmap & Development Status

Feature Status Timeline
GFSM Explorer (v1) βœ… Live Available Now
Zenodo Dataset Publication πŸ”’ Restricted Published, opens upon acceptance
Manuscript Publication πŸ“ Under Review Submitted to Nature Scientific Data
Public Dataset Download πŸ”œ Pending Release upon acceptance
GEE Asset (Public) πŸ”œ Pending Release upon acceptance
Processing Scripts πŸ”œ Pending Release upon acceptance
GFSM Toolbox (v2) 🚧 In Development Q2 2026 (post-publication)
API Integration πŸ’‘ Planned Future Release

Research Access: Researchers requiring early access to specific tiles or training samples for validation purposes may contact the author directly with reasonable request.


πŸ“– Citation

If you use GFSM v1 in your research, please cite both the manuscript (upon publication) and the dataset.

Primary Citation (Manuscript)

Details to be updated upon publication

@article{waleed2026gfsm,
  title={Global Flood Susceptibility Map (GFSM v1): A high resolution (30m) flood susceptibility dataset derived from multi-source geospatial data},
  author={Waleed, Mirza and Sajjad, Muhammad and Sami, Ghamdi and Meng, Gao},
  journal={Under Review at Nature Scientific Data},
  year={2026}
}

Dataset Citation

@dataset{waleed2026gfsm_data,
  author       = {Waleed, Mirza},
  title        = {{Global Flood Susceptibility Map (GFSM v1): A high 
                   resolution (30m) flood susceptibility dataset derived 
                   from multi-source geospatial data}},
  month        = jan,
  year         = 2026,
  publisher    = {Zenodo},
  version      = {V1},
  doi          = {10.5281/zenodo.18137662},
  url          = {https://doi.org/10.5281/zenodo.18137662}
}

Related Publications

The methodology builds upon these foundational works:

  • Waleed, M., & Sajjad, M. (2025). High-resolution flood susceptibility mapping and exposure assessment in Pakistan: An integrated artificial intelligence, machine learning and geospatial framework. International Journal of Disaster Risk Reduction, 121, 105442. https://doi.org/10.1016/j.ijdrr.2025.105442

  • Waleed, M., & Sajjad, M. (2025). Advancing flood susceptibility prediction: A comparative assessment and scalability analysis of machine learning algorithms via artificial intelligence in high-risk regions of Pakistan. Journal of Flood Risk Management, 18(1), e13047. https://doi.org/10.1111/jfr3.13047


Author & Contact

Mirza Waleed
GeoAI & Environmental Risk Researcher

Co-Authors: Sajjad Muhammad | Sami G. Al-Ghamdi | Meng Gao


πŸ™ Acknowledgments

This research was supported by:

  • Google Earth Engine platform for massive-scale geospatial processing
  • SHAHEEN-III supercomputer at KAUST for distributed computing resources
  • Global open-access datasets: FABDEM, GPM IMERG, Sentinel-2, and others

We thank the reviewers and the scientific community for their valuable feedback.


πŸ“„ License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Under this license, you are free to share and adapt the material, provided you give appropriate credit, do not use it for commercial purposes, and distribute any derivative works under the same license. For full license details, please visit the Creative Commons website.

About

Official Repository for the paper "Global Flood Susceptibility"

Topics

Resources

License

Stars

Watchers

Forks

Contributors