Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement the eSpeak voice synthesizer #36

Open
rhdunn opened this issue Mar 15, 2013 · 1 comment
Open

Implement the eSpeak voice synthesizer #36

rhdunn opened this issue Mar 15, 2013 · 1 comment
Milestone

Comments

@rhdunn
Copy link
Owner

rhdunn commented Mar 15, 2013

The eSpeak text-to-speech program uses a combination of klatt (see issue #35) parameters, recorded wave audio and spectral parameters. This is all coordinated by generating a sequence of wave commands from the voice data.

A system like the wave command processing would be good as it will allow Cainteoir Engine to support multiple synthesis techniques.

The wave file processing here looks different to diphone wave concatenation, but I am not savvy on the details to say for certain.

The spectral synthesis algorithm is similar to the way that klatt works, but uses a different mathematical model to produce the wave forms.

The klatt synthesis algorithm is just an interface from the wave commands to the Klatt synthesizer (issue #35) which has been modified to better integrate with eSpeak.

There is also support for MBROLA voices in the wave commands that passes the pho file phonemes and prosody data to an mbrola process that does the synthesis. This does not make sense for the design of the Cainteoir Engine as it should be done at a higher level -- that is, it should be a separate synthesizer.

--- Want to back this issue? **[Post a bounty on it!](https://www.bountysource.com/issues/1026786-implement-the-espeak-voice-synthesizer?utm_campaign=plugin&utm_content=tracker%2F254961&utm_medium=issues&utm_source=github)** We accept bounties via [Bountysource](https://www.bountysource.com/?utm_campaign=plugin&utm_content=tracker%2F254961&utm_medium=issues&utm_source=github).
@rhdunn
Copy link
Owner Author

rhdunn commented Apr 10, 2013

Depends on #38 (Phoneme Model)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant