Skip to content

balaji1312/CHSER

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

2 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASR

This repository contains the dataset, code, and models for our paper:

CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASR, accepted at Interspeech 2025.

πŸ—‚οΈ Repository Structure

CHSER/
β”œβ”€β”€ code/                      # Source code for dataset generation, model training, and evaluation
β”‚   β”œβ”€β”€ analysis/              # Scripts for baseline wer generation 
β”‚   β”œβ”€β”€ dataset_gen/           # Scripts for creating the CHSER dataset from raw hypotheses
β”‚   └── gensec/                # Core modules for generative speech error correction (GenSEC) (T5 and Llama based)
β”œβ”€β”€ dataset/                   # CHSER dataset splits
β”‚   β”œβ”€β”€ dev/                   
β”‚   β”œβ”€β”€ test/                  
β”‚   └── train/                 
β”œβ”€β”€ models/                    # Pretrained and fine-tuned model checkpoints
β”‚   β”œβ”€β”€ 3gram/                 # n-gram baseline (for comparison or decoding)
β”‚   β”œβ”€β”€ llama2/                # Adapter weights for Llama2 model fine-tuned on CHSER
β”‚   β”œβ”€β”€ t5/                    # Adapter weights for T5 model fine-tuned on CHSER
β”‚   β”œβ”€β”€ t5_myst/               # Adapter weights for T5 model fine-tuned on MyST data
β”‚   └── transformer/           # Transformer LM baseline model (non-pretrained)

πŸ“Š Dataset

The CHSER dataset consists of child ASR hypotheses paired with human-verified reference transcripts. Hypotheses were generated using Whisper-base.en in a zero-shot beam search setting.

🧠 Models

We provide checkpoints of GenSEC models trained on adult speech (HyPoradise) and fine-tuned on CHSER. Models include:

  • Llama-based correction model
  • T5-based correction models

πŸ“œ Citation

If you found this work useful in your research, please cite:

@misc{shankar2025chser,
      title={CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASR}, 
      author={Natarajan Balaji Shankar and Zilai Wang and Kaiyuan Zhang and Mohan Shi and Abeer Alwan},
      year={2025},
      eprint={2505.18463},
      archivePrefix={arXiv},
      primaryClass={eess.AS},
      url={https://arxiv.org/abs/2505.18463}, 
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors