DNN-TTS-ContVoc

Deep Neural Network-based Text-To-Speech Synthesis using a Continuous Vocoder

Jupyter Notebook

DNN-TTS-ContVoc.ipynb_.zip

Developed by

Budapest University of Technology and Economics (BME)

License

Apache License 2.0 (Apache-2.0)

Main Characteristic

This repository contains a TTS system based on Continuous vocoder developed at the Speech Technology and Smart Interactions Laboratory (SmartLab), Budapest University of Technology and Economics. As a difference with other traditional statistical parametric vocoders, continuous model focuses on extracting continuous parameters:

Fundamental Frequency (F0)
Maximum Voiced Frequency (MVF)
Mel-Generalized Cepstrum (MGC).

Research areas

Collaborative AI

Technical Categories

Audio processing Machine learning Natural language processing

Business Categories

Telecommunications

Keywords

Last updated

07.06.2021 - 15:33

Detailed Description

Install & Run: Please ensure you have installed python dependencies (pip install -r requirements.txt) and compiles (bash tools/compile_tools.sh).

Additional information: In a given English text sentence, users can select one of two voice patterns (either male or female) from the current set to build their custom voice model. Users can also specify the neural network topology (LSTM, BLSTM, GRU) to be trained as well as the number of hidden layers. The speech synthesis model inside this system has few parameters, and it is computationally feasible; therefore, it is suitable for real-time operation.

External links:

Github repository: https://github.com/malradhi/conTTS

Documents

Readme

Trustworthy AI

n/a

GDPR Requirements

No information is stored about the user