Deep Speaker: speaker recognition system

Dataset: LibriSpeech
Reference paper: "Deep Speaker: an End-to-End Neural Speaker Embedding System" https://arxiv.org/pdf/1705.02304.pdf
Reference code: https://github.com/philipperemy/deep-speaker (Thanks to Philippe Rémy. I modified the code heavily during the experiment, but the overall approach is still similar.)

The model was trained on the librispeech-train-clean dataset and tested on the librispeech-test-clean dataset. With the CNN model, this code reaches about 5% EER (equal error rate) on LibriSpeech.

About Code

train.py
The main script. Running it starts training; every fixed number of steps it saves the model and runs a test.
models.py
Builds the models. There are three: the CNN model (consistent with the paper), the GRU model (consistent with the paper), and my own simplified simple_cnn model.
select_batch.py
Selects the optimal batch to feed to the network. This is one of the core parts of this experiment.
triplet_loss.py
Computes the triplet loss used for network training.
test_model.py
Tests the model and reports metrics such as EER.
eval_metrics.py
Given predictions and labels, computes the equal error rate, F-measure, accuracy, and other metrics.
pretraining.py
Pre-trains the network with softmax classification.
pre_process.py
Reads the audio, filters out silence, extracts fbank features, and saves them in .npy format.
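
To make the roles of triplet_loss.py and select_batch.py more concrete, here is a minimal pure-Python sketch of a cosine-similarity triplet loss (the form used in the reference paper) and of picking the hardest negative for an anchor. It is not the code from this repository: the function names, the margin value of 0.1, and the "most similar negative" selection rule are assumptions for illustration only, and the actual modules may work differently.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two embedding vectors given as lists of floats."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def triplet_loss(anchor, positive, negative, margin=0.1):
    """Hinge-style triplet loss on cosine similarities.

    The loss is zero once the anchor is closer to the positive (same speaker)
    than to the negative (different speaker) by at least `margin`.
    The default margin is an illustrative assumption.
    """
    s_ap = cosine_similarity(anchor, positive)
    s_an = cosine_similarity(anchor, negative)
    return max(0.0, s_an - s_ap + margin)


def hardest_negative(anchor, candidates):
    """Index of the candidate embedding most similar to the anchor.

    Hypothetical selection rule: among utterances from other speakers,
    the most similar one gives the largest loss for this anchor.
    """
    return max(
        range(len(candidates)),
        key=lambda i: cosine_similarity(anchor, candidates[i]),
    )


if __name__ == "__main__":
    anchor = [1.0, 0.0]
    positive = [0.9, 0.1]
    negatives = [[0.0, 1.0], [1.0, 1.0], [-1.0, 0.0]]
    idx = hardest_negative(anchor, negatives)
    print("hardest negative index:", idx)
    print("loss:", triplet_loss(anchor, positive, negatives[idx]))
```

Selecting negatives that are already close to the anchor keeps the loss from collapsing to zero on easy triplets, which is why batch selection matters so much for this kind of training.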

Results

Trained on librispeech-train-clean and tested on librispeech-test-clean, the CNN model reaches an equal error rate (EER) of about 5%.
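
For readers unfamiliar with the metric, the sketch below shows one simple way to estimate an EER from verification scores: sweep a threshold and take the point where the false acceptance rate and the false rejection rate are closest. This is a pure-Python illustration under assumptions, not the implementation in eval_metrics.py; the function name and the input convention (label 1 for same-speaker pairs, 0 for different-speaker pairs, higher score meaning more similar) are hypothetical.

```python
def equal_error_rate(scores, labels):
    """Estimate the EER by sweeping every observed score as a threshold.

    scores: similarity scores, higher means "more likely the same speaker".
    labels: 1 for same-speaker pairs, 0 for different-speaker pairs.
    Returns the mean of FAR and FRR at the threshold where they are closest.
    """
    positives = sum(1 for label in labels if label == 1)
    negatives = len(labels) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one positive and one negative pair")

    best_gap = float("inf")
    eer = None
    for threshold in sorted(set(scores)):
        # A pair is accepted when its score reaches the threshold.
        false_accepts = sum(
            1 for s, label in zip(scores, labels) if label == 0 and s >= threshold
        )
        false_rejects = sum(
            1 for s, label in zip(scores, labels) if label == 1 and s < threshold
        )
        far = false_accepts / negatives
        frr = false_rejects / positives
        gap = abs(far - frr)
        if gap < best_gap:
            best_gap = gap
            eer = (far + frr) / 2
    return eer


if __name__ == "__main__":
    scores = [0.9, 0.4, 0.6, 0.1]
    labels = [1, 1, 0, 0]
    print("EER:", equal_error_rate(scores, labels))
```

With only a handful of trials the estimate is coarse; on a full test set the threshold sweep becomes fine-grained enough that FAR and FRR nearly coincide.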

For more details, see 'deep_speaker实验报告.pdf' (the experiment report, written in Chinese). If you would like the details in English, please contact me.
