Sintese e reconhecimento da fala humana / Synthesis and recognition of human speech
AUTOR(ES)
Rumiko Oishi Stolfi
DATA DE PUBLICAÇÃO
2006
RESUMO
The goal of this dissertation is to review the main concepts relating to the synthesis, processing, and recognition of human speech by computer. These technologies have many applications, which have increased substantially in recent years after the spread of portable communication equipment (mobile phones, laptops, palmtops) and the universal access to the Internet. The first part of this work is a revision of fundamental concepts of signal processing, including the Fourier transform, power spectrum and spectrogram, filters, signal digitalization, and Nyquist s theorem. The second part describes the main characteristics of human speech, the mechanisms involved in its production and perception, and the concept of phone (linguistic unit of sound). In this part we also briefly describe the main techniques used for orthographic-phonetic transcription, for speech synthesis from a phonetic description, and for the recognition of natural speech. The third part describes a practical project we developed to consolidate the knowledge acquired in our Masters studies: a program that generates Japanese popular songs from a textual description of the lyrics and music, using the concatenative synthesis method. At the end of this dissertation, we list some available software products (free and commercial) for speech synthesis and speech recognition
ASSUNTO(S)
signal processing reconhecimento automatico da voz voice systhesis speech processing systems sintese da voz processamento de sinais automatic speech recognition sistemas de processamento da fala
ACESSO AO ARTIGO
http://libdigi.unicamp.br/document/?code=vtls000401099Documentos Relacionados
- RECONHECIMENTO DE SENTENÇAS COM DIFERENTES VELOCIDADES DE FALA EM IDOSOS
- Speech recognition in quiet and noise background situation in children with cochlear implants using two different speech processors
- Reconhecimento de fala em idosos: elaboração e aplicação de um teste considerando a previsibilidade da palavra.
- Speech recognition in classroom noise of children from 4th grade of elementary school
- Telemetria de resposta neural: repercussões dos fatores etiológicos e no reconhecimento de fala após o implante coclear