Skip to content

BianChang/HEMIT-DATASET

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Example Image

HEMIT Dataset Overview

This dataset is one of the contributions of our paper: HEMIT: H&E to Multiplex-immunohistochemistry Image Translation with Dual-Branch Pix2pix Generator.

The HEMIT dataset is tailored for image-to-image stain translation. It contains cellular-wise registered H&E and mIHC image pairs, derived from the same sectioning approach to ensure high alignment quality. The raw data is sourced from the ImmunoAIzer work [1], which includes 8 whole slide images (WSIs) from colon cancer patients.

Dataset Download

The HEMIT Dataset can be downloaded at: HEMIT Download.

Dataset Details

  • Number of Samples:
    • Train: 3717
    • Validation: 630
    • Test: 945
  • Image Patch Size: 1024x1024 pixels
  • mIHC Image Channels: 3-channel: DAPI, panCK, CD3
  • Image Format: TIF

File Structure

HEMIT
├── test
│   ├── input
│   │   ├── [7941,48408]_patch_0_0.tif
│   │   ├── [7941,48408]_patch_0_1.tif
│   │   ├── [7941,48408]_patch_0_2.tif
│   │   └── ...
│   ├── label
│   │   ├── [7941,48408]_patch_0_0.tif
│   │   ├── [7941,48408]_patch_0_1.tif
│   │   ├── [7941,48408]_patch_0_2.tif
│   │   └── ...
├── val
│   ├── input
│   │   ├── [12146,53552]_patch_0_0.tif
│   │   ├── [12146,53552]_patch_0_1.tif
│   │   ├── [12146,53552]_patch_0_2.tif
│   │   └── ...
│   ├── label
│   │   ├── [12146,53552]_patch_0_0.tif
│   │   ├── [12146,53552]_patch_0_1.tif
│   │   ├── [12146,53552]_patch_0_2.tif
│   │   └── ...
├── train
│   ├── input
│   │   ├── [6407,49798]_patch_0_0.tif
│   │   ├── [6407,49798]_patch_0_1.tif
│   │   ├── [6407,49798]_patch_0_2.tif
│   │   └── ...
│   ├── label
│   │   ├── [6407,49798]_patch_0_0.tif
│   │   ├── [6407,49798]_patch_0_1.tif
│   │   ├── [6407,49798]_patch_0_2.tif
│   │   └── ...


Corresponding images in a pair are of the same size and have the same filename, e.g., /HEMIT/train/input/[6407,49798]_patch_0_0.tif is considered to correspond to /HEMIT/train/label/[6407,49798]_patch_0_0.tif.

Evaluation Metrics

We use several evaluation metrics to assess the performance of our model:

  • SSIM (Structural Similarity Index): Measures the similarity between two images, focusing on changes in structural information, luminance, and contrast.
  • Pearson Correlation: Evaluates the linear relationship between the generated and real datasets, providing a measure of how closely the generated data matches the real data in terms of linear correlation.
  • PSNR (Peak Signal-to-Noise Ratio): Quantifies the quality of the generated images compared to the real images, with higher values indicating better quality and less noise.

The overall training and testing scheme follows the general structure of pix2pix [2]

Code Availability

The code for implementation of the dual-branch method can be accessed at: DualBranch_Pix2pix.

References

[1] Bian, Chang, et al. "ImmunoAIzer: a deep learning-based computational framework to characterize cell distribution and gene mutation in tumour microenvironment." Cancers 13.7 (2021): 1659.

[2] Isola, Phillip, et al. "Image-to-image translation with conditional adversarial networks." Proceedings of the IEEE conference on computer vision and pattern recognition. 2017.

Citation

If you use this code or dataset in your research, please cite the following work:

[1] Bian C, Philips B, Cootes T, et al. HEMIT: H&E to Multiplex-immunohistochemistry Image Translation with Dual-Branch Pix2pix Generator[J]. arXiv preprint arXiv:2403.18501, 2024.

[2] Bian C, Wang Y, Lu Z, et al. Immunoaizer: A deep learning-based computational framework to characterize cell distribution and gene mutation in tumor microenvironment[J]. Cancers, 2021, 13(7): 1659.

About

This repository provides the dataset HEMIT for H&E to mIHC image-to-image translation task

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages