À propos de cet article
Publié en ligne: 30 déc. 2021
Pages: 1 - 10
DOI: https://doi.org/10.2478/aucts-2021-0001
Mots clés
© 2021 Filip Cristian George et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.
The aim of this paper is to present a system capable of modifying the human voice: volume by direct amplification/attenuation, duration of the voice signal and pitch using the Phase Vocoder, and timbre with the help of cepstral analysis. The system is also able to dynamically modify the aforementioned parameters in real-time. The proposed system was evaluated using a set of “clean” speech samples from the LibriSpeech ASR corpus of English speech with the Perceptual Evaluation of Speech Quality (PESQ) standard.