Book contents
- Frontmatter
- Contents
- Foreword
- Preface
- 1 Introduction
- 2 Communication and language
- 3 The text-to-speech problem
- 4 Text segmentation and organisation
- 5 Text decoding: finding the words from the text
- 6 Prosody prediction from text
- 7 Phonetics and phonology
- 8 Pronunciation
- 9 Synthesis of prosody
- 10 Signals and filters
- 11 Acoustic models of speech production
- 12 Analysis of speech signals
- 13 Synthesis techniques based on vocal-tract models
- 14 Synthesis by concatenation and signal-processing modification
- 15 Hidden-Markov-model synthesis
- 16 Unit-selection synthesis
- 17 Further issues
- 18 Conclusion
- Appendix A Probability
- Appendix B Phone definitions
- References
- Index
11 - Acoustic models of speech production
Published online by Cambridge University Press: 25 January 2011
- Frontmatter
- Contents
- Foreword
- Preface
- 1 Introduction
- 2 Communication and language
- 3 The text-to-speech problem
- 4 Text segmentation and organisation
- 5 Text decoding: finding the words from the text
- 6 Prosody prediction from text
- 7 Phonetics and phonology
- 8 Pronunciation
- 9 Synthesis of prosody
- 10 Signals and filters
- 11 Acoustic models of speech production
- 12 Analysis of speech signals
- 13 Synthesis techniques based on vocal-tract models
- 14 Synthesis by concatenation and signal-processing modification
- 15 Hidden-Markov-model synthesis
- 16 Unit-selection synthesis
- 17 Further issues
- 18 Conclusion
- Appendix A Probability
- Appendix B Phone definitions
- References
- Index
Summary
The speech-production process was qualitatively described in Chapter 7. There we showed that speech is produced by a source, such as the glottis, which is subsequently modified by the vocal tract acting as a filter. In this chapter, we turn our attention to developing a more-formal quantitative model of speech production, using the techniques of signals and filters described in Chapter 10.
The acoustic theory of speech production
Such models often come under the heading of the acoustic theory of speech production, which refers both to the general field of research in mathematical speech-production models and to the book of that title by Fant. Although considerable previous work in this field had been done prior to its publication, this book was the first to bring together various strands of work and describe the whole process in a unified manner. Furthermore, Fant backed his study up with extensive empirical studies with X-rays and mechanical models to test and verify the speech-production models being proposed. Since then, many refinements to the model have been made, as researchers have investigated trying to improve the accuracy and practicalities of these models. Here we focus on the single most widely accepted model, but conclude the chapter with a discussion on variations on this.
As with any modelling process, we have to reach a compromise between a model that accurately describes the phenomena in question and one that is simple, effective and suited to practical needs.
- Type
- Chapter
- Information
- Text-to-Speech Synthesis , pp. 309 - 340Publisher: Cambridge University PressPrint publication year: 2009