Wave Net Vocoder - Search News

NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation

Abstract: The traditional vocoders have the advantages of high synthesis efficiency, strong interpretability, and speech editability, while the neural vocoders have the advantage of high synthesis ...

GitHub

AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec

A good audio codec for live applications such as telecommunication is characterized by three key properties: (1) compression, i.e. the bitrate that is required to transmit the signal should be as low ...

Frontiers

A review of differentiable digital signal processing for music and speech synthesis

The term “differentiable digital signal processing” describes a family of techniques in which loss function gradients are backpropagated through digital signal processors, facilitating their ...

IEEE

Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion

Abstract: This paper presents a refinement framework of WaveNet vocoders for variational autoencoder (VAE) based voice conversion (VC), which reduces the quality distortion caused by the mismatch ...

Frontiers

Audio deepfakes: A survey

A deepfake is content or material that is synthetically generated or manipulated using artificial intelligence (AI) methods, to be passed off as real and can include audio, video, image, and text ...

Microsoft

FastSpeech: Fast, Robust and Controllable Text to Speech

Neural network based end-to-end text to speech (TTS) has significantly improved the quality of synthesized speech. Prominent methods (e.g., Tacotron 2) usually first generate mel-spectrogram from text ...

GitHub

hccho2/Tacotron2-Wavenet-Korean-TTS

Tacotron2 모델과 Wavenet Vocoder를 결합하여 한국어 TTS구현하는 project입니다. Tacotron2 모델을 Multi-Speaker모델로 확장했습니다. Tacotron2 모델로 한국어 TTS를 만드는 것이 목표입니다. Rayhane-mamah의 구현은 ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results