the_speech

This project aims to provide, in an all-in-one fashion, the functionality offered by various libraries used for speech/audio processing. It uses libraries such as bob.kaldi (a Python wrapper for the Kaldi project), librosa, python_speech_features, cyvlfeat, and kaldi.io, among others.

The main purpose of this code is feature extraction, which can be done easily in a Kaldi-like "style"; see the "recipes" folder. Features that can currently be extracted are MFCCs, PLP, i-vectors, and Fisher vectors. To be added in the future: filter banks and x-vectors.
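For readers unfamiliar with what MFCC extraction involves, here is a minimal sketch of the usual pipeline (pre-emphasis, framing, Hamming window, power spectrum, mel filterbank, log, DCT) using only the Python standard library. It is an illustration, not this project's code: the function names (`mfcc`, `mel_filterbank`) and all parameter values are assumptions chosen for the example, and in practice this step would be delegated to one of the libraries listed above.

```python
import cmath
import math


def hz_to_mel(hz):
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale (hypothetical helper)."""
    n_bins = n_fft // 2 + 1
    top = hz_to_mel(sample_rate / 2.0)
    mel_points = [i * top / (n_filters + 1) for i in range(n_filters + 2)]
    bins = [int((n_fft + 1) * mel_to_hz(m) / sample_rate) for m in mel_points]
    bank = [[0.0] * n_bins for _ in range(n_filters)]
    for f in range(n_filters):
        left, center, right = bins[f], bins[f + 1], bins[f + 2]
        for k in range(left, center):      # rising edge of the triangle
            bank[f][k] = (k - left) / (center - left)
        for k in range(center, right):     # falling edge of the triangle
            bank[f][k] = (right - k) / (right - center)
    return bank


def mfcc(signal, sample_rate, frame_len=256, hop=128, n_filters=20, n_ceps=13):
    """Return one list of n_ceps coefficients per frame (illustrative defaults)."""
    # Pre-emphasis: y[n] = x[n] - 0.97 * x[n-1]
    emphasized = [signal[0]] + [
        signal[i] - 0.97 * signal[i - 1] for i in range(1, len(signal))
    ]
    window = [
        0.54 - 0.46 * math.cos(2.0 * math.pi * n / (frame_len - 1))
        for n in range(frame_len)
    ]
    bank = mel_filterbank(n_filters, frame_len, sample_rate)
    n_bins = frame_len // 2 + 1
    # Naive DFT twiddle factors; a real implementation would use an FFT.
    twiddle = [cmath.exp(-2j * math.pi * t / frame_len) for t in range(frame_len)]
    features = []
    for start in range(0, len(emphasized) - frame_len + 1, hop):
        frame = [emphasized[start + n] * window[n] for n in range(frame_len)]
        power = []
        for k in range(n_bins):
            acc = sum(frame[n] * twiddle[(k * n) % frame_len] for n in range(frame_len))
            power.append(abs(acc) ** 2 / frame_len)
        # Log mel energies, floored to avoid log(0)
        log_energies = [
            math.log(max(sum(w * p for w, p in zip(filt, power)), 1e-10))
            for filt in bank
        ]
        # DCT-II decorrelates the log energies; keep the first n_ceps terms
        ceps = [
            sum(e * math.cos(math.pi * c * (m + 0.5) / n_filters)
                for m, e in enumerate(log_energies))
            for c in range(n_ceps)
        ]
        features.append(ceps)
    return features


# Usage: a synthetic 440 Hz tone stands in for real audio.
sample_rate = 8000
tone = [math.sin(2.0 * math.pi * 440.0 * n / sample_rate) for n in range(1024)]
feats = mfcc(tone, sample_rate)
print(len(feats), len(feats[0]))  # number of frames, coefficients per frame
```

The naive DFT keeps the sketch dependency-free but is slow on real recordings; it is meant only to show the shape of the computation.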

This README will be updated with more details in the future.

Note: this project is in its early stages and is used for personal purposes only. It is under active development, and there are no plans yet to offer any kind of support. However, any suggestions or pull requests are welcome.

