CryoSENSE: Compressive Sensing Enables High-throughput Microscopy with Sparse and Generative Priors on the Protein Cryo-EM Image Manifold

Cryo-electron microscopy (cryo-EM) enables the atomic-resolution visualization of biomolecules; however, modern direct detectors generate data volumes that far exceed the available storage and transfer bandwidth, thereby constraining practical throughput. We introduce cryoSENSE, the computational realization of a hardware-software co-designed framework for compressive cryo-EM sensing and acquisition. We show that cryo-EM images of proteins lie on low-dimensional manifolds that can be independently represented using sparse priors in predefined bases and generative priors captured by a denoising diffusion model. cryoSENSE leverages these low-dimensional manifolds to enable faithful image reconstruction from spatial and Fourier-domain undersampled measurements while preserving downstream structural resolution. In experiments, cryoSENSE increases acquisition throughput by up to 2.5× while retaining the original 3D resolution, offering controllable trade-offs between the number of masked measurements and the level of downsampling.

Overview
Comparative Methods
Experimental Results
CryoSENSE
- 4.1. Installation
  - 4.1.1. From PyPI (recommended)
  - 4.1.2. From Source
- 4.2. Quick Start
  - 4.2.1. Command-line Interface
  - 4.2.2. Python API
- 4.3. Demonstration
  - 4.3.1. Low Compression (Block Size 2, 4 Masks)
  - 4.3.2. High Compression (Block Size 32, 1024 Masks)
- 4.4. Example Scripts
  - 4.4.1. Python Example
  - 4.4.2. Bash Example
- 4.5. Experiment Scripts
  - 4.5.1. Running Experiments
  - 4.5.2. Batch Reconstruction
- 4.6. Pretrained Models
- 4.7. Command-line Options
- 4.8. Configuration Files
  - 4.8.1. Recommended Parameters
- 4.9. Verbose Mode Output
- 4.10. Supported Data Formats
- 4.11. Requirements
Datasets

Comparative Methods

You can find the code for running the comparative method reconstruction experiments in the comparative_methods/ directory. For detailed instructions, please refer to the comparative_methods/README.md file.

Experimental Results

The scripts and data for analyzing and visualizing the experimental results are located in the experimental_results/ directory. For more details, refer to the experimental_results/README.md file.

CryoSENSE

Installation

From PyPI (recommended)

We will enable this option in camera-ready version.

pip install cryosense

From Source

git clone X
cd CryoSENSE
pip install -e .

Quick Start

Command-line Interface

# Basic usage
cryosense --model /path/to/ddpm/model --cryoem_path /path/to/cryoem/data.pt --start_id 0 --end_id 10

# Advanced options
cryosense --model /path/to/ddpm/model \
        --cryoem_path /path/to/cryoem/data.pt \
        --block_size 16 \
        --num_masks 50 \
        --mask_type random_binary \
        --zeta_scale 0.1 \
        --zeta_min 1e-10 \
        --num_timesteps 1000 \
        --beta 0.9 \
        --beta_min 0.1 \
        --start_id 0 \
        --end_id 10 \
        --batch_size 4 \
        --noise_level 0.05 \
        --result_dir ./reconstruction_results

# With verbose output and visualizations
cryosense --model /path/to/ddpm/model \
        --cryoem_path /path/to/cryoem/data.pt \
        --start_id 0 \
        --end_id 0 \
        --result_dir ./verbose_results \
        --verbose

# Using configuration files for optimal parameters
cryosense --model /path/to/ddpm/model \
        --cryoem_path /path/to/cryoem/data.pt \
        --block_size 32 \
        --use_config \
        --start_id 0 \
        --end_id 10 \
        --result_dir ./config_results

Python API

from CryoSENSE.main import CryoSENSE

# Initialize CryoSENSE with a pretrained DDPM model
cryosense = CryoSENSE(
    model_path="/path/to/ddpm/model",
    block_size=16,
    result_dir="./results",
    verbose=True,  # Enable detailed visualizations
    use_config=True  # Use recommended parameters from configuration files
)

# Reconstruct images from a CryoEM dataset
reconstructed_images, original_images, metrics = cryosense.reconstruct_from_cryoem(
    file_path="/path/to/cryoem/data.pt",
    image_ids=[0, 1, 2],  # Process images with these IDs
    num_masks=30,
    mask_type="random_binary",
    num_timesteps=1000
    # Parameters like zeta_scale and beta will be loaded from the configuration file
)

# Access reconstruction metrics
for metric in metrics:
    print(f"Image ID: {metric['image_id']}, PSNR: {metric['PSNR']}, SSIM: {metric['SSIM']}")

Demonstration

CryoSENSE enables high-quality reconstruction across different downsampling levels. Below we demonstrate two configurations:

Low Compression (Block Size 2, 4 Masks)

With minimal compression (block size 2), CryoSENSE can reconstruct high-quality images using just 4 masks:

Reconstruction comparison with minimal compression

The reconstruction is guided by compressed measurements:

Compressed measurement visualization

The diffusion process gradually builds the image from random noise:

High Compression (Block Size 32, 1024 Masks)

Even with extreme compression (block size 32), CryoSENSE reconstructs detailed protein structures using 1024 masks:

Reconstruction comparison with extreme compression

The reconstruction is guided by highly compressed measurements:

Highly compressed measurement visualization

The diffusion process gradually builds the image from random noise:

Example Scripts

CryoSENSE includes ready-to-use example scripts for quick testing and demonstration:

Python Example

python examples/simple_example.py

Bash Example

bash examples/simple_example.sh

These example scripts demonstrate CryoSENSE's capabilities with the following features:

No manual downloads required: A sample image is included in the data/ directory
Pre-trained model: Uses the anonymously uploaded DDPM model (anon202628/empiar10076-ddpm-ema-cryoem-128x128) from HuggingFace
Configurable block size: The block size parameter can be changed to any of {2, 4, 8, 16, 32}, adjust the number of masks accordingly.
- Example scripts demonstrate both block sizes 32 and 2
- Different block sizes automatically use appropriate configuration parameters
Results visualization: Both examples include verbose output with detailed visualizations

The scripts demonstrate reconstruction with different parameter combinations:

Block size 32 with 1024 masks
Block size 2 with 4 masks

Experiment Scripts

Performs experiments across block sizes, number of masks, mask types, and noise levels and measures the performance (Fig. 2a):

Running Experiments

python scripts/run_experiments.py

The run_experiments.py script reproduces the results from the paper, specifically the LPIPS and SSIM scores for CryoSENSE reconstructions across five downsampling levels (2x-32x). Features include:

Performs experiments over multiple parameters with 16 randomly selected images:
- Block sizes: 2, 4, 8, 16, 32 (corresponding to 2x-32x downsampling)
- Number of masks: Varies based on block size
- Mask types: random_binary, random_gaussian, etc.
- Noise levels: 0.0, 0.1, etc.
Analyzes results and identifies optimal configurations
Generates comprehensive reports with metrics (PSNR, SSIM, LPIPS)
Supports running on multiple GPUs in parallel

Batch Reconstruction

python scripts/reconstruct_all_images.py

The reconstruct_all_images.py script can be used for reconstructing all images of a dataset using one specific configuration (block size, number of masks) distributed across multiple GPUs. Features include:

Processes entire datasets efficiently
Distributes work automatically across available GPUs
Uses a fixed optimal configuration for reconstruction
Generates comprehensive metrics for all processed images
Supports various input formats including PyTorch tensors and MRC files

Pretrained Models

The following DDPM models are available on Huggingface and can be used directly with CryoSENSE:

Model	Resolution	Description
anon202628/empiar10076-ddpm-ema-cryoem-128x128	128×128	EMPIAR10076
anon202628/empiar11526-ddpm-ema-cryoem-128x128	128×128	EMPIAR11526
anon202628/empiar10166-ddpm-ema-cryoem-128x128	128×128	EMPIAR10166
anon202628/empiar10786-ddpm-ema-cryoem-128x128	128×128	EMPIAR10786
anon202628/empiar10648-ddpm-cryoem-256x256	256×256	EMPIAR10648

You can specify these models directly in the command line or API calls without downloading them:

cryosense --model anon202628/empiar10076-ddpm-ema-cryoem-128x128 --cryoem_path /path/to/data

Command-line Options

Option	Description	Default
`--model`	Path to the pretrained DDPM model	(required)
`--cryoem_path`	Path to the CryoEM dataset file	(required)
`--block_size`	Block size for downsampling	4
`--num_masks`	Number of binary masks to use	30
`--mask_prob`	Probability for binary mask generation	0.5
`--mask_type`	Type of mask to use (random_binary, random_gaussian, checkerboard)	random_binary
`--zeta_scale`	Scale factor for the gradient step size	(from config)
`--zeta_min`	Initial scale factor for the gradient step size	(from config)
`--num_timesteps`	Number of diffusion timesteps	1000
`--beta`	Final momentum factor for updates	(from config)
`--beta_min`	Initial momentum factor for updates	(from config)
`--start_id`	Starting index of images to process	0
`--end_id`	Ending index of images to process	0
`--batch_size`	Number of images to process in each batch	1
`--noise_level`	Gaussian noise standard deviation for measurements	0.0
`--result_dir`	Directory to save results	results
`--device`	Device to use (cuda or cpu)	cuda
`--verbose`	Enable verbose mode with detailed visualizations	False
`--use_config`	Use recommended configuration parameters based on block size	False

Configuration Files

CryoSENSE includes configuration files with recommended parameters based on the block size. The system automatically selects the appropriate configuration based on your specified block size.

Recommended Parameters

Block Size	zeta_scale	zeta_min	beta	beta_min
2, 4, 8, 16	1.0	1e-10	0.9	0.1
32, 64	10.0	1e-10	0.9	0.1

To use these recommended configurations, either:

Pass --use_config on the command line, or
Set use_config=True when creating a CryoSENSE instance in code

You can override any specific parameter by explicitly providing it, and the system will use the configuration value for any unspecified parameters.

Verbose Mode Output

When the --verbose flag is enabled, CryoSENSE will generate and save:

All binary masks used for measurements
All measurement images for each mask
Reconstructed images in PNG format
Comparison images showing original, reconstructed, and error maps
A GIF animation showing the diffusion process from t=1000 to t=1

Regardless of whether verbose mode is enabled, CryoSENSE always saves:

Raw reconstructed image tensors (.pt files)
Reconstruction metrics in CSV format

Supported Data Formats

PyTorch tensor files (.pt) containing CryoEM images
MRC files (.mrcs) containing CryoEM images

Requirements

Python 3.13.2
PyTorch 2.7.0
diffusers 0.33.1
accelerate 1.7.0
numpy 2.2.6
scikit-image 0.25.2
lpips 0.1.4
mrcfile 1.5.4
matplotlib 3.10.3
imageio 2.37.0
CUDA-capable GPU (recommended)

Datasets

The following datasets are available on Huggingface:

Dataset	Description
anon202628/3D_Volumes_EMPIAR10076	3D volumes of EMPIAR10076
anon202628/3D_Volumes_EMPIAR10648	3D volumes of EMPIAR10648
anon202628/EMPIAR10076_128x128	EMPIAR10076 128x128 Images
anon202628/EMPIAR11526_128x128	EMPIAR11526 128x128 Images
anon202628/EMPIAR10166_128x128	EMPIAR10166 128x128 Images
anon202628/EMPIAR10786_128x128	EMPIAR10786 128x128 Images
anon202628/EMPIAR10648_256x256	EMPIAR10648 256x256 Images

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
CryoSENSE		CryoSENSE
configs		configs
data		data
examples		examples
experimental_results		experimental_results
results		results
scripts		scripts
sparse_priors		sparse_priors
training		training
.DS_Store		.DS_Store
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
logo.png		logo.png
requirements.txt		requirements.txt
setup.py		setup.py

Folders and files

Latest commit

History

Repository files navigation

CryoSENSE: Compressive Sensing Enables High-throughput Microscopy with Sparse and Generative Priors on the Protein Cryo-EM Image Manifold

Table of Contents

Comparative Methods

Experimental Results

CryoSENSE

Installation

From PyPI (recommended)

From Source

Quick Start

Command-line Interface

Python API

Demonstration

Low Compression (Block Size 2, 4 Masks)

High Compression (Block Size 32, 1024 Masks)

Example Scripts

Python Example

Bash Example

Experiment Scripts

Running Experiments

Batch Reconstruction

Pretrained Models

Command-line Options

Configuration Files

Recommended Parameters

Verbose Mode Output

Supported Data Formats

Requirements

Datasets

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages