SWAT+ Calibrator

Single-phase sensitivity analysis and calibration workflow for SWAT+ models using SWATrunR.

Workflow: GSA (LH-OAT Morris screening) → Calibration (Monte Carlo sampling) → narrowed parameter ranges → optional re-iteration.

Workflow

Requirements

Software	Version	Notes
R	>= 4.3.3	Required for SWATrunR compatibility
SWAT+	>= swatplus-61.0.2.61-ifx-win_amd64-Rel	Model executable must be inside project_path
RTools	>= 4.3	Windows only — needed to compile R packages

Package	Purpose
`SWATrunR`	Run SWAT+ from R
`yaml`	Read `config.yaml`
`dplyr`	Data manipulation
`tibble`	Tidy data frames
`tidyr`	Pivot operations
`purrr`	Functional iteration
`lhs`	Latin Hypercube Sampling
`hydroGOF`	NSE, KGE, PBIAS
`lubridate`	Date handling
`ggplot2`	Plots
`ggrepel`	Non-overlapping labels in plots

Install all at once:

install.packages(c(
  "yaml", "dplyr", "tibble", "tidyr", "purrr", "lhs",
  "hydroGOF", "lubridate", "ggplot2", "ggrepel"
))

# SWATrunR (from GitHub)
remotes::install_github("chrisschuerz/SWATrunR")

Project structure

swatplus-calibrator/
├── config.yaml          # Single configuration file (edit this)
├── main.R               # Orchestrator: runs GSA then calibration
├── run_gsa.R            # Step 1: LH-OAT Morris sensitivity analysis
├── run_cal.R            # Step 2: Monte Carlo calibration
└── R/
    ├── lhoat_engine.R   # LH-OAT trajectory generation + Elementary Effects
    ├── metrics.R        # NSE, KGE, PBIAS + behavioural band (P/R-factor)
    ├── morris_classify.R# Morris classification + quantile range narrowing
    ├── plots.R          # Publication-ready TIFF plots
    ├── run_filter.R     # Invalid run detection and filtering
    └── versioning.R     # Auto-versioned output folders

Quick start

1. Prepare your SWAT+ project

Make sure your TxtInOut folder runs correctly with the SWAT+ executable. Place observed streamflow files inside the TxtInOut folder:

File	Format
`obs_flow_mon.csv`	Columns: `Date` (YYYY-MM-DD), `Flow` (m3/s)
`obs_flow_daily.csv`	Columns: `Date` (YYYY-MM-DD), `Flow` (m3/s)

You only need the file matching your chosen temporal_scale (monthly or daily).

2. Edit `config.yaml`

Update at minimum:

project_path: "C:/path/to/your/TxtInOut"
threads: 8                        # Number of parallel SWAT+ runs

simulation:
  start_date:   "1985-01-01"
  end_date:     "2010-12-31"
  warmup_years: 3

temporal_scale: "monthly"         # or "daily"

observed:
  monthly_file: "obs_flow_mon.csv"
  start_date:   "1988-01-01"      # First obs date after warm-up

Set the outlet channel unit in gsa_outputs and cal_outputs:

gsa_outputs:
  streamflow:
    file:     "channel_sdmorph_mon"   # _day for daily
    variable: "flo_out"
    unit:     33                      # Your outlet channel ID

If using temporal_scale: "daily", also update gsa_metrics suffixes:

gsa_metrics:
  - "NSE_day"
  - "KGE_day"
  - "PBIAS_day"
  - "obj_total"

3. Run

From RStudio or VSCode (set working directory to swatplus-calibrator/):

source("main.R")

From terminal:

Rscript main.R
# or
Rscript main.R path/to/config.yaml

You can also run steps individually:

CONFIG_PATH <- "config.yaml"
source("run_gsa.R")   # Step 1 only
source("run_cal.R")   # Step 2 only (requires GSA_v* to exist)

Workflow details

Step 1: Global Sensitivity Analysis (`run_gsa.R`)

Generates LH-OAT (Latin Hypercube One-At-a-Time) trajectories in normalized [0,1] space
Scales to physical parameter bounds and runs SWAT+ via SWATrunR
Computes NSE, KGE, PBIAS, and obj_total for each run
Calculates Elementary Effects (EE) per trajectory step
Derives Morris sensitivity indices: μ* (mean absolute EE) and σ (standard deviation of EE)
Classifies parameters into three groups:

Group	Condition	Quantile range
High importance + interaction	μ* > mean and σ > mean	Q5 – Q95
High importance	μ* > mean and σ ≤ mean	Q10 – Q90
Low importance	μ* ≤ mean	Q25 – Q75

Output folder: GSA_v1/ (auto-increments on re-runs)

File	Description
`ranking_global_gsa.csv`	Morris μ* and σ per parameter
`param_info_gsa.csv`	Parameter bounds used in this run
`metricas_gsa.csv`	All metrics for all runs
`ee_all_gsa.csv`	Elementary Effects values
`Morris_sensitivity_screening.tif`	Publication-ready scatter plot
`resultado_gsa.rds`	Full R object with all results

Step 2: Calibration (`run_cal.R`)

Reads param_info_gsa.csv and ranking_global_gsa.csv from the latest GSA_v*/ folder
Excludes insensitive parameters listed in calibration.exclude_parameters (fixed at their GSA median value)
Samples n_simulations random parameter sets (uniform within bounds)
Runs SWAT+ and computes NSE, KGE, PBIAS
Classifies runs as satisfactory if all thresholds are met simultaneously
Computes water balance diagnostic (ET/P and WYLD/P vs targets, if defined)
Builds the behavioural band (min–max envelope of satisfactory simulations at each timestep) and computes P-factor and R-factor (see below)
Computes new narrowed parameter ranges using the Morris group quantile rules applied to the satisfactory posterior distribution
Saves new_ranges.csv for the next iteration

Output folder: CAL_v1/ (auto-increments on re-runs)

File	Description
`results_cal.csv`	All runs with metrics + parameter values
`satisfactory_results.csv`	Only satisfactory runs
`new_ranges.csv`	Narrowed parameter ranges for next iteration
`behavioural_band.csv`	Band envelope per timestep (Date, lower, upper)
`behavioural_band_factors.csv`	P-factor and R-factor values
`balance_diagnostic.csv`	Water balance diagnostic (ET/P, WYLD/P) per run
`fdc_band.csv`	FDC envelope data (exceedance, obs_flow, lower, upper)
`scatter_performance.tif`	NSE vs KGE scatter plot with thresholds
`hydrograph.tif`	Observed vs simulated streamflow with behavioural band
`uncertainty_envelope.tif`	Behavioural uncertainty band + observed (SWAT-CUP style)
`fdc_envelope.tif`	Flow Duration Curve with behavioural uncertainty band
`boxplot_params.tif`	Normalized satisfactory parameter distributions
`resultado_cal.rds`	Full R object with all results

Iteration

After the first run, the console prints instructions for the next iteration. Update config.yaml:

iteration:
  enabled:     true
  ranges_file: "C:/path/to/your/TxtInOut/CAL_vX/new_ranges.csv"

Then run again:

source("main.R")

This creates GSA_v2/ and CAL_v2/ using the narrowed ranges from the previous calibration. The process can be repeated as many times as needed:

Iteration 1:  config.yaml (original bounds)  → GSA_v1/ → CAL_v1/new_ranges.csv
Iteration 2:  config.yaml (ranges from v1)   → GSA_v2/ → CAL_v2/new_ranges.csv
Iteration 3:  config.yaml (ranges from v2)   → GSA_v3/ → CAL_v3/new_ranges.csv
...

Parameters not found in new_ranges.csv keep their original YAML bounds. This lets you add new parameters between iterations.

Excluding insensitive parameters

Insensitive parameters identified by the Morris screening can be excluded from calibration to reduce the search space. Excluded parameters are fixed at their median value from the GSA posterior distribution — they are still passed to SWAT+ (so the model runs correctly) but are not sampled.

Automatic exclusion (recommended)

Enable automatic exclusion based on the Morris μ* index:

calibration:
  auto_exclude: true              # Enable automatic exclusion
  auto_exclude_threshold: 0       # mu_star threshold

Parameters with μ* ≤ auto_exclude_threshold are automatically excluded. The default threshold 0 removes only completely insensitive parameters (those with zero effect on model output). Increase the threshold to be more aggressive:

Threshold	Effect
`0`	Excludes only parameters with μ* = 0 (no effect at all)
`0.05`	Also excludes parameters with very low sensitivity
`0.10`	More aggressive — keeps only clearly influential parameters

Set auto_exclude: false to disable automatic exclusion entirely.

Manual exclusion

You can also manually list parameters to exclude. This works independently of (and is additive with) automatic exclusion:

calibration:
  exclude_parameters: ["canmx", "epco", "cn3_swf"]

Set exclude_parameters: [] (empty list) if automatic exclusion is sufficient.

Water balance diagnostic

If you define observed water balance targets in config.yaml, the calibration will compute ET/P and WYLD/P ratios for each run and report how satisfactory simulations compare to the targets:

calibration:
  targets:
    et_rto:   0.48147    # observed ET / P ratio
    wyld_rto: 0.51853    # observed WYLD / P ratio

This requires adding the water balance outputs to cal_outputs:

cal_outputs:
  streamflow:
    file: "channel_sdmorph_day"
    variable: "flo_out"
    unit: 33
  precip:
    file: "basin_wb_aa"
    variable: "precip"
    unit: 1
  et:
    file: "basin_wb_aa"
    variable: "et"
    unit: 1
  wateryld:
    file: "basin_wb_aa"
    variable: "wateryld"
    unit: 1

The diagnostic results are saved to balance_diagnostic.csv and included in resultado_cal.rds. This is a diagnostic check only — it does not filter runs, but helps verify that satisfactory streamflow simulations also maintain a realistic water balance.

Performance metrics

Metric	Formula	Reference
NSE	1 − SS_res / SS_tot	Nash & Sutcliffe (1970)
KGE	1 − √((r−1)² + (β−1)² + (γ−1)²)	Gupta et al. (2009)
PBIAS	100 × Σ(sim − obs) / Σ(obs)	Moriasi et al. (2007)
obj_total	(1 − min(NSE,1)) + (1 − min(KGE,1)) + \|PBIAS\|/100	Combined error

Default thresholds follow Moriasi et al. (2015) "satisfactory" criteria. Adjust in config.yaml to match your study requirements.

Behavioural band and uncertainty indices

After calibration, the tool builds a behavioural band from all satisfactory simulations and computes two uncertainty indices inspired by SUFI-2 but applied to the behavioural (satisfactory) ensemble rather than a percentile-based prediction interval.

Band construction

At each timestep t, the band boundaries are:

Lower = min of all satisfactory simulations at t
Upper = max of all satisfactory simulations at t

This is the full envelope of satisfactory runs, not a percentile subset.

P-factor

Fraction of observed data points that fall inside the behavioural band:

P-factor = n / N

where n = number of observed points within [lower, upper] and N = total observed points.

Range	Interpretation
> 0.70	Satisfactory
> 0.80	Good
→ 1.0	All observations bracketed

R-factor

Relative width of the behavioural band normalized by the variability of the observed data:

R-factor = mean(upper − lower) / σ_obs

Range	Interpretation
< 1.50	Satisfactory
< 1.00	Good (narrow band)
→ 0	Perfect (but unlikely with P-factor → 1)

P-factor and R-factor are inversely related: a wider band brackets more observations (higher P) but at the cost of a larger R. The goal is to maximize P-factor while keeping R-factor as low as possible.

Hydrograph plot

The hydrograph (hydrograph.tif) shows all simulation runs with the behavioural band:

Light blue shaded ribbon — behavioural band (min–max envelope)
Gray lines — non-satisfactory simulations
Blue lines — satisfactory simulations
Black line — observed streamflow
Annotation (top-left) — P-factor and R-factor values

Uncertainty envelope plot

The uncertainty envelope (uncertainty_envelope.tif) is a clean, SWAT-CUP–style plot that shows only the essential uncertainty information:

Green shaded ribbon — behavioural band (min–max envelope of all satisfactory simulations)
Blue line — observed streamflow
Annotation (top-left) — P-factor and R-factor values

Unlike the hydrograph, individual simulation lines are not shown. This provides a clear visualization of the prediction uncertainty range and how well it brackets observed data.

Flow Duration Curve (FDC) with uncertainty band

The FDC envelope (fdc_envelope.tif) extends the behavioural band concept to the flow duration domain:

Green shaded ribbon — FDC envelope (min–max of sorted satisfactory simulations at each exceedance level)
Blue line — observed FDC

The x-axis shows exceedance probability (%) and the y-axis uses a logarithmic scale for streamflow. This diagnostic reveals whether the model reproduces the full range of flow regimes:

FDC region	Exceedance	Flow regime
Left tail	0–10%	High flows (flood peaks)
Middle	10–70%	Medium flows
Right tail	70–100%	Low flows (baseflow)

If the observed FDC falls within the green band across all regions, the model captures both peak events and baseflow recession. Gaps indicate flow regimes where the model structure or parameterization needs improvement.

The FDC data is also saved as fdc_band.csv (columns: exceedance, obs_flow, lower, upper).

Parameter notation (SWATrunR)

Parameters use SWATrunR notation: variable::file | change = type

Change type	Meaning	Example
`pctchg`	Percentage change from default	`cn2::cn2.hru \| change = pctchg` → CN2 × (1 + value/100)
`absval`	Replace with absolute value	`esco::esco.hru \| change = absval` → ESCO = value

Tips

Start small: Use m_traj: 3 and n_simulations: 30 for a test run to verify everything works before scaling up.
Check your outlet: The unit field in gsa_outputs and cal_outputs must match the channel ID of your basin outlet in SWAT+.
Observed data alignment: Make sure observed.start_date falls after the warm-up period (simulation.start_date + warmup_years).
Daily vs monthly: Daily calibration is more demanding. Consider starting with monthly (temporal_scale: "monthly") and relaxed thresholds, then switching to daily in a later iteration.
No satisfactory runs? Increase n_simulations, relax thresholds, or check that the model structure is appropriate for your basin.
Output separation: Set output_path to keep versioned folders outside TxtInOut if you prefer a cleaner project directory.

License

This tool is provided for research and educational use. See the SWAT+ and SWATrunR licenses for their respective terms.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
R		R
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.yaml		config.yaml
main.R		main.R
run_cal.R		run_cal.R
run_gsa.R		run_gsa.R
workflow.svg		workflow.svg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SWAT+ Calibrator

Workflow

Requirements

Project structure

Quick start

1. Prepare your SWAT+ project

2. Edit `config.yaml`

3. Run

Workflow details

Step 1: Global Sensitivity Analysis (`run_gsa.R`)

Step 2: Calibration (`run_cal.R`)

Iteration

Excluding insensitive parameters

Automatic exclusion (recommended)

Manual exclusion

Water balance diagnostic

Performance metrics

Behavioural band and uncertainty indices

Band construction

P-factor

R-factor

Hydrograph plot

Uncertainty envelope plot

Flow Duration Curve (FDC) with uncertainty band

Parameter notation (SWATrunR)

Tips

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SWAT+ Calibrator

Workflow

Requirements

Project structure

Quick start

1. Prepare your SWAT+ project

2. Edit config.yaml

3. Run

Workflow details

Step 1: Global Sensitivity Analysis (run_gsa.R)

Step 2: Calibration (run_cal.R)

Iteration

Excluding insensitive parameters

Automatic exclusion (recommended)

Manual exclusion

Water balance diagnostic

Performance metrics

Behavioural band and uncertainty indices

Band construction

P-factor

R-factor

Hydrograph plot

Uncertainty envelope plot

Flow Duration Curve (FDC) with uncertainty band

Parameter notation (SWATrunR)

Tips

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

2. Edit `config.yaml`

Step 1: Global Sensitivity Analysis (`run_gsa.R`)

Step 2: Calibration (`run_cal.R`)

Packages