Zhitong Gao · Bingnan Li · Mathieu Salzmann · Xuming He
[arXiv] [Poster] [Slides] [Video]
Figure 1: An overview of the proposed method.
In open-world scenarios, where both novel classes and domains may exist, an ideal segmentation model should detect anomaly classes for safety and generalize to new domains. However, existing methods often struggle to distinguish between domain-level and semantic-level distribution shifts, leading to poor OOD detection or domain generalization performance. In this work, we aim to equip the model to generalize effectively to covariate-shift regions while precisely identifying semantic-shift regions. To achieve this, we design a novel generative augmentation method to produce coherent images that incorporate both anomaly (or novel) objects and various covariate shifts at both image and object levels. Furthermore, we introduce a training strategy that recalibrates uncertainty specifically for semantic shifts and enhances the feature extractor to align features associated with domain shifts. We validate the effectiveness of our method across benchmarks featuring both semantic and domain shifts. Our method achieves state-of-the-art performance across all benchmarks for both OOD detection and domain generalization.
```bash
conda env create -f environment.yml
conda activate MultiShiftSeg
git clone https://github.com/facebookresearch/detectron2.git
pip install -e detectron2
pip install git+https://github.com/cocodataset/panopticapi.git
cd lib/network/mask2former/modeling/pixel_decoder/ops
sh make.sh
```

We use Cityscapes for training, which you can download from the official website. We augment the original dataset with multiple distribution shifts using ControlNet. To reproduce the data generation process, please refer to the Generation Instruction.
You can also directly download our already generated data from Google Drive or Hugging Face.
Below are the datasets used in our evaluations, along with links for downloading:
- RoadAnomaly
- SMIYC - RoadAnomaly21
- SMIYC - RoadObstacle21
- ACDC-POC (you may need to convert the provided `.npz` files into `.png` manually)
- MUAD (we use the challenge subset)
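For the ACDC-POC conversion, a minimal sketch is shown below. It assumes each `.npz` file stores a single 2D label array (it reads whatever the first stored array is); the function name and the key handling are illustrative, so adjust them to match the actual contents of the downloaded files.

```python
from pathlib import Path

import numpy as np
from PIL import Image


def npz_to_png(src_dir: str, dst_dir: str) -> None:
    """Convert every .npz label file in src_dir to a .png in dst_dir."""
    dst = Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    for npz_path in sorted(Path(src_dir).glob("*.npz")):
        data = np.load(npz_path)
        arr = data[data.files[0]]  # assume the first stored array is the label map
        Image.fromarray(arr.astype(np.uint8)).save(dst / (npz_path.stem + ".png"))
```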
Alternatively, for ease of use and standardization, you may prefer to download a preprocessed version of the datasets from Synboost, which includes RoadAnomaly, FS Static, and FS Lost&Found.
After downloading the dataset(s), you will need to configure the data directory in lib/dataset/ to align with the structure of the downloaded files. The expected data structure for these datasets in our code is as follows:
```
datasets
├── cityscapes
│   ├── leftImg8bit
│   └── gtFine
├── multishift_cityscapes (our augmented dataset)
│   ├── leftImg8bit
│   └── gtFine
├── road_anomaly
│   ├── original
│   └── labels
├── dataset_AnomalyTrack
│   ├── images
│   └── labels_masks
├── dataset_ObstacleTrack
│   ├── images
│   └── labels_masks
├── MUAD_challenge
│   └── test_sets
│       └── test_OOD
│           ├── leftImg8bit
│           └── leftLabel
├── acdc_poc
│   ├── gt_trainval
│   └── rgb_anon_trainvaltest
```
To train the models, use the following commands:
```bash
# DeepLab v3+
python train_deeplab.py --cfg 'exp/DeepLab.yaml' --id <YOUR_EXP_ID> --weight_path <YOUR_PATH_TO>/DeepLabV3+_WideResNet38_baseline.pth
```

Following previous works such as RPL, we initialize DeepLab v3+ with a pretrained closed-world model checkpoint from NVIDIA's semantic segmentation repository. You can download the pretrained checkpoint from this Google Drive link. After downloading, specify the model weight path using the `--weight_path` argument.
```bash
# Mask2Former
python train_m2f.py --cfg 'exp/M2F.yaml' --id <YOUR_EXP_ID> --weight_path <YOUR_PATH_TO>/bt-f-xl.pth
```

For the Mask2Former model, we use the first-stage pretrained checkpoint provided by the [Mask2Anomaly](https://github.com/shyam671/Mask2Anomaly-Unmasking-Anomalies-in-Road-Scene-Segmentation?tab=readme-ov-file) project. You can download this checkpoint from this Google Drive link and then provide its path via the `--weight_path` argument.
After training the model, you can perform inference and evaluate out-of-distribution detection performance using the following commands:
```bash
python test_deeplab.py --cfg 'exp/DeepLab.yaml' --id <YOUR_EXP_ID> --weight_path <MODEL_WEIGHT_PATH>
```

or

```bash
python test_m2f.py --cfg 'exp/M2F.yaml' --id <YOUR_EXP_ID> --weight_path <MODEL_WEIGHT_PATH>
```

If you wish to evaluate using the provided pretrained models, download the corresponding checkpoints below, and replace `<MODEL_WEIGHT_PATH>` with the path to the downloaded weights.
| Method | RoadAnomaly AUC | RoadAnomaly AP | RoadAnomaly FPR | SMIYC-RA21 AP | SMIYC-RA21 FPR | SMIYC-RO21 AP | SMIYC-RO21 FPR | Weights |
|---|---|---|---|---|---|---|---|---|
| DeepLab v3+ | 96.40 | 74.60 | 16.08 | 88.06 | 8.21 | 90.71 | 0.26 | Google Drive or Hugging Face |
| Mask2Former | 97.94 | 90.17 | 7.54 | 91.92 | 7.94 | 95.29 | 0.07 | Google Drive or Hugging Face |
Note: The SMIYC results reported are evaluated on the official online benchmarks. To verify the correctness of the pretrained models locally, you can run inference on the RoadAnomaly dataset. The evaluation scores should closely match those shown in the table.
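The AP and FPR columns are the standard pixel-level OOD metrics (average precision and false-positive rate at 95% true-positive rate). If you want a quick local check of such metrics, the sketch below computes them with scikit-learn from flattened per-pixel anomaly scores and binary OOD labels; it is an illustrative helper, not the repository's evaluation code.

```python
import numpy as np
from sklearn.metrics import average_precision_score, roc_auc_score, roc_curve


def ood_metrics(scores: np.ndarray, labels: np.ndarray):
    """Compute AUROC, AP, and FPR@95TPR.

    scores: per-pixel anomaly scores (higher = more anomalous)
    labels: 1 for OOD pixels, 0 for in-distribution pixels
    """
    scores, labels = scores.ravel(), labels.ravel()
    auroc = roc_auc_score(labels, scores)
    ap = average_precision_score(labels, scores)
    fpr, tpr, _ = roc_curve(labels, scores)
    # first threshold at which TPR reaches 95% (tpr is non-decreasing)
    fpr95 = fpr[np.searchsorted(tpr, 0.95)]
    return auroc, ap, fpr95
```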
```bibtex
@inproceedings{
gao2024generalize,
title={Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts},
author={Zhitong Gao and Bingnan Li and Mathieu Salzmann and Xuming He},
booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
year={2024},
url={https://openreview.net/forum?id=h0rbjHyWoa}
}
```