A High-Resolution (30m) Global Dataset Derived from Multi-Source Geospatial Data
Launch GEE App Β»
Overview
Β·
Explorer App
Β·
Dataset Access
Β·
Citation
Β·
Contact
GFSM v1 is the first globally harmonized flood susceptibility dataset produced at a native 30-meter spatial resolution. Unlike traditional hazard models that rely on hydrodynamic simulations for specific return periods (e.g., 1-in-100 years), GFSM identifies the landscape's inherent propensity for flooding based on physical geographic and environmental controls.
Developed using a Gradient-Boosted Tree (XGBoost) machine learning framework, the model integrates topographic, hydrological, meteorological, and anthropogenic factors trained on over 30 million samples across 192 distinct climate zones and processed over 10,000 GB of high-resolution satellite data.
- First-of-its-kind: Global harmonized susceptibility at 30m resolution
- Data-rich foundation: Trained on extensive multi-source geospatial datasets
- Climate-aware: Accounts for diverse climatic and geographic contexts
- Actionable insights: Suitable for regional planning, exposure assessment, and risk screening
- Open science: Data and methodology made available to the research community
π’ Project Status
The manuscript describing this dataset is currently under review at Nature Scientific Data. The dataset has been published on Zenodo (currently restricted) and will be made publicly available immediately following manuscript acceptance.
Dataset DOI: 10.5281/zenodo.18137662
π§ For early access to specific tiles or training samples for validation purposes, please contact the author directly.
Visualize and inspect the global dataset interactively using our Google Earth Engine web application. The app features smart-resolution switching and a click-to-inspect tool for detailed regional analytics.
Click the image above to launch the app.
- Global Screening: Rapidly visualize susceptibility patterns from global to local scales.
- Resolution Toggling: Switch between 1km overview layers and native 30m analysis layers.
- Unit Inspector: Click any location to view specific model performance metrics (AUC, F1-Score) for that region (coming soon).
- π Global Coverage: Spans all inhabited landmasses from ~80Β°N to ~60Β°S
- π― High Precision: Native 30m resolution for localized exposure assessment and infrastructure planning
- π Comprehensive Training: Built on ~17,000 grid tiles across diverse geographic regions
- π€ Advanced ML Framework: Gradient-Boosted Trees (XGBoost) optimized for geospatial prediction
- π§ͺ Robust Validation: Global median AUC of ~0.95 across diverse terrains
- π Climate-Aware: Model trained across 192 distinct KΓΆppen-Geiger climate zones
| Category | Variables | Data Source |
|---|---|---|
| Topographic | Elevation, Slope, Aspect | FABDEM (30m) |
| Hydrological | HAND, TWI, Distance to Water | Derived from MERIT-Hydro |
| Anthropogenic | Distance to Roads, NDVI | OpenStreetMap, Sentinel-2 |
| Climate | Rainfall Frequency | GPM IMERG |
The generation of GFSM v1 followed a rigorous, high-performance computing workflow (involving distributed computing and advanced machine learning techniques on SHAHEEN-III and Google Earth Engine). Details will be updated upon manuscript acceptance, but feel free to reach out for preliminary insights.
| Class | Level | Description |
|---|---|---|
| 1 | Very Low | Areas with negligible flood propensity (e.g., high ridges, steep slopes). |
| 2 | Low | Areas with minimal risk factors. |
| 3 | Moderate | Transitional zones with some flood-prone characteristics. |
| 4 | High | Areas with significant flood conditioning factors. |
| 5 | Very High | Critical zones (floodplains, depressions) with maximum propensity. |
This repository serves as the official code and documentation hub for the GFSM project.
GFSM/
βββ scripts/ # Processing and analysis scripts (Coming soon)
β βββ preprocessing/ # Data preparation workflows
β βββ modeling/ # XGBoost training pipelines
β βββ inference/ # GEE-based prediction scripts
βββ src/
β βββ img/ # Visual assets (Logo, screenshots, diagrams)
β βββ docs/ # Additional documentation (Coming soon)
βββ other/
β βββ sorting_zenodo/ # Utilities for dataset management
βββ LICENSE # CC BY-NC-SA 4.0 License
βββ README.md # Project documentation (this file)
Note: Full processing scripts, training pipelines, and documentation will be released upon manuscript acceptance to ensure reproducibility.
The complete GFSM v1 dataset has been archived on Zenodo with a permanent DOI. The dataset is currently restricted and will be made publicly available immediately upon manuscript acceptance.
Citation: Waleed, M. (2026). Global Flood Susceptibility Map (GFSM v1): A high resolution (30m) flood susceptibility dataset derived from multi-source geospatial data (Version V1) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.18137662
The Zenodo archive contains:
- Global GeoTIFF tiles at 30m resolution
- Metadata and documentation for all layers
- Global Tiles Index Grid Allowing users to identify and download specific regions
| Method | Status | Description |
|---|---|---|
| Zenodo Direct Download | π Restricted | Full dataset archive (Available post-publication) |
| GEE Explorer App | β Live | Interactive visualization and inspection |
| GEE Asset | π Coming Soon | Programmatic access via Earth Engine API |
| Early Access Request | π§ Available | Contact author for research purposes |
| Feature | Status | Timeline |
|---|---|---|
| GFSM Explorer (v1) | β Live | Available Now |
| Zenodo Dataset Publication | π Restricted | Published, opens upon acceptance |
| Manuscript Publication | π Under Review | Submitted to Nature Scientific Data |
| Public Dataset Download | π Pending | Release upon acceptance |
| GEE Asset (Public) | π Pending | Release upon acceptance |
| Processing Scripts | π Pending | Release upon acceptance |
| GFSM Toolbox (v2) | π§ In Development | Q2 2026 (post-publication) |
| API Integration | π‘ Planned | Future Release |
Research Access: Researchers requiring early access to specific tiles or training samples for validation purposes may contact the author directly with reasonable request.
If you use GFSM v1 in your research, please cite both the manuscript (upon publication) and the dataset.
Details to be updated upon publication
@article{waleed2026gfsm,
title={Global Flood Susceptibility Map (GFSM v1): A high resolution (30m) flood susceptibility dataset derived from multi-source geospatial data},
author={Waleed, Mirza and Sajjad, Muhammad and Sami, Ghamdi and Meng, Gao},
journal={Under Review at Nature Scientific Data},
year={2026}
}@dataset{waleed2026gfsm_data,
author = {Waleed, Mirza},
title = {{Global Flood Susceptibility Map (GFSM v1): A high
resolution (30m) flood susceptibility dataset derived
from multi-source geospatial data}},
month = jan,
year = 2026,
publisher = {Zenodo},
version = {V1},
doi = {10.5281/zenodo.18137662},
url = {https://doi.org/10.5281/zenodo.18137662}
}The methodology builds upon these foundational works:
-
Waleed, M., & Sajjad, M. (2025). High-resolution flood susceptibility mapping and exposure assessment in Pakistan: An integrated artificial intelligence, machine learning and geospatial framework. International Journal of Disaster Risk Reduction, 121, 105442. https://doi.org/10.1016/j.ijdrr.2025.105442
-
Waleed, M., & Sajjad, M. (2025). Advancing flood susceptibility prediction: A comparative assessment and scalability analysis of machine learning algorithms via artificial intelligence in high-risk regions of Pakistan. Journal of Flood Risk Management, 18(1), e13047. https://doi.org/10.1111/jfr3.13047
Mirza Waleed
GeoAI & Environmental Risk Researcher
- Website: waleedgeo.com
- Email: [email protected]
- LinkedIn: Mirza Waleed
- GitHub: @waleedgeo
Co-Authors: Sajjad Muhammad | Sami G. Al-Ghamdi | Meng Gao
This research was supported by:
- Google Earth Engine platform for massive-scale geospatial processing
- SHAHEEN-III supercomputer at KAUST for distributed computing resources
- Global open-access datasets: FABDEM, GPM IMERG, Sentinel-2, and others
We thank the reviewers and the scientific community for their valuable feedback.
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Under this license, you are free to share and adapt the material, provided you give appropriate credit, do not use it for commercial purposes, and distribute any derivative works under the same license. For full license details, please visit the Creative Commons website.