Skip to content

Latest commit

 

History

History
 
 

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Overview of example information

Directory name Corpus name Task Language URL Note
aishell AISHELL-ASR0009-OS1 Open Source Mandarin Speech Corpus ASR ZH http://www.aishelltech.com/kysjcp
ami The AMI Meeting Corpus ASR EN http://groups.inf.ed.ac.uk/ami/corpus/
an4 CMU AN4 database ASR/TTS EN http://www.speech.cs.cmu.edu/databases/an4/
aurora4 Aurora-4 database ASR EN http://aurora.hsnr.de/aurora-4.html
babel IARPA Babel corups ASR ~20 Languages https://www.iarpa.gov/index.php/research-programs/babel
chime4 The 4th CHiME Speech Separation and Recognition Challenge ASR/Multichannel ASR EN http://spandh.dcs.shef.ac.uk/chime_challenge/chime2016/
chime5 The 5th CHiME Speech Separation and Recognition Challenge ASR EN http://spandh.dcs.shef.ac.uk/chime_challenge/
cmu_wilderness CMU Wilderness Multilingual Speech Dataset Multilingual ASR ~100 Languages https://github.com/festvox/datasets-CMU_Wilderness
csj Corpus of Spontaneous Japanese ASR JP https://pj.ninjal.ac.jp/corpus_center/csj/en/
fisher_callhome_spanish Fisher and CALLHOME Spanish--English Speech Translation ASR/Machine Translation/Speech Translation ES->EN https://catalog.ldc.upenn.edu/LDC2014T23
fisher_swbd Fisher English Training Speech, Switchboard-1 Release 2 ASR EN https://catalog.ldc.upenn.edu/LDC2004S13, https://catalog.ldc.upenn.edu/LDC2005S13, https://catalog.ldc.upenn.edu/LDC97S62
hkust HKUST Mandarin Telephone Speech ASR ZH https://catalog.ldc.upenn.edu/LDC2005S15, https://catalog.ldc.upenn.edu/LDC2005T32
hub4_spanish 1997 Spanish Broadcast News Speech (HUB4-NE) ASR ES https://catalog.ldc.upenn.edu/LDC98S74, https://catalog.ldc.upenn.edu/LDC98T29
iwslt18 International Workshop on Spoken Language Translation 2018 ASR/Machine Translation/Speech Translation EN->DE https://sites.google.com/site/iwsltevaluation2018/Lectures-task
jnas ASJ Japanese Newspaper Article Sentences Read Speech Corpus (JNAS) ASR JP http://research.nii.ac.jp/src/JNAS.html
jsalt18e2e Multilingual End-to-end ASR for Incomplete Data Benchmark Multilingual ASR ~20 Languages https://www.clsp.jhu.edu/workshops/18-workshop/multilingual-end-end-asr-incomplete-data/ babel+
jsut Japanese speech corpus of Saruwatari-lab., University of Tokyo ASR/TTS JP https://sites.google.com/site/shinnosuketakamichi/publication/jsut
li10 Lanugage-Independent ASR task (10 languages) Multilingual ASR ~10 Languages https://www.merl.com/publications/docs/TR2017-182.pdf csj+hkust+voxforge(7lang)+wsj
libri_trans Translation Augmented LibriSpeech Corpus ASR/Machine Translation/Speech Translation https://persyval-platform.univ-grenoble-alpes.fr/DS91/detaildataset
librispeech LibriSpeech ASR corpus ASR EN http://www.openslr.org/12
libritts LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech TTS EN http://www.openslr.org/60/
ljspeech The LJ Speech Dataset TTS EN https://keithito.com/LJ-Speech-Dataset/
m_ailabs The M-AILABS Speech Dataset TTS ~5 languages https://www.caito.de/2019/01/the-m-ailabs-speech-dataset/
reverb REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge ASR EN https://reverb2014.dereverberation.com/
ru_open_stt Russian Open Speech To Text (STT/ASR) Dataset ASR RU https://github.com/snakers4/open_stt
swbd The Switchboard corpus ASR EN https://catalog.ldc.upenn.edu/LDC97S62
tedlium2 TED-LIUM corpus release 2 ASR EN https://www.openslr.org/19/, http://www.lrec-conf.org/proceedings/lrec2014/pdf/1104_Paper.pdf
tedlium3 TED-LIUM corpus release 3 ASR EN http://www.openslr.org/51/, https://arxiv.org/pdf/1805.04699
timit TIMIT Acoustic-Phonetic Continuous Speech Corpus ASR EN https://catalog.ldc.upenn.edu/LDC93S1
voxforge VoxForge ASR 7 languages http://www.voxforge.org/
wsj CSR-I (WSJ0) Complete, CSR-II (WSJ1) Complete ASR EN https://catalog.ldc.upenn.edu/LDC93S6A,https://catalog.ldc.upenn.edu/LDC94S13A
wsj_mix MERL WSJ0-mix multi-speaker dataset Multispeaker ASR EN http://www.merl.com/demos/deep-clustering
yesno The "yesno" corpus ASR HE http://www.openslr.org/1