12 - Analysis of speech signals

Published online by Cambridge University Press:  25 January 2011

Paul Taylor
Affiliation:
University of Cambridge

Summary

In this chapter we turn to the topic of speech analysis, which tackles the problem of deriving representations from recordings of real speech signals. This book is of course concerned with speech synthesis, and at first sight it may seem that the techniques for generating speech “bottom-up”, as described in Chapters 10 and 11, are sufficient for our purpose. As we shall see, however, many techniques in speech synthesis actually rely on an analysis phase, which captures key properties of real speech and then uses these to generate new speech signals. In addition, the techniques described here enable useful characterisation of real speech phenomena for purposes of visualisation or statistical analysis. Speech analysis, then, is the process of converting a speech signal into an alternative representation that in some way better represents the information in which we are interested. We need to perform analysis because waveforms do not usually give us that information directly.

Nearly all speech analysis is concerned with three key problems. First, we wish to remove the influence of phase. Second, we wish to perform source/filter separation, so that we can study the spectral envelope of sounds independently of the source with which they are spoken. Finally, we often wish to transform these spectral envelopes and source signals into other representations that are coded more efficiently, have certain robustness properties, or more clearly show the linguistic information we require.
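
The first two problems can be illustrated with a minimal sketch. It is an assumption-laden illustration, not the chapter's own method: the summary does not name any particular algorithm, so the sketch uses the magnitude spectrum to discard phase and cepstral liftering, one common technique, to approximate a spectral envelope. The function names (`dft`, `magnitude_spectrum`, `log_magnitude`, `real_cepstrum`, `cepstral_envelope`), the parameter `n_keep` and the `floor` value are all hypothetical, and a naive DFT is used only to keep the example free of external libraries.

```python
import cmath
import math


def dft(x):
    """Naive O(n^2) discrete Fourier transform (illustration only)."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def magnitude_spectrum(x):
    """Remove phase: keep only |X[k]| for each frequency bin."""
    return [abs(c) for c in dft(x)]


def log_magnitude(x, floor=1e-12):
    """Log magnitude spectrum; `floor` (an assumed value) avoids log(0)."""
    return [math.log(max(m, floor)) for m in magnitude_spectrum(x)]


def real_cepstrum(x, floor=1e-12):
    """Inverse DFT of the log magnitude spectrum."""
    log_mag = log_magnitude(x, floor)
    n = len(log_mag)
    return [
        sum(log_mag[k] * cmath.exp(2j * math.pi * k * t / n)
            for k in range(n)).real / n
        for t in range(n)
    ]


def cepstral_envelope(x, n_keep, floor=1e-12):
    """Approximate log spectral envelope by low-quefrency liftering.

    Keeps cepstral coefficients 0..n_keep-1 and their mirror images,
    zeroes the rest, and transforms back to the log spectral domain.
    """
    c = real_cepstrum(x, floor)
    n = len(c)
    liftered = [c[t] if (t < n_keep or t > n - n_keep) else 0.0
                for t in range(n)]
    return [v.real for v in dft(liftered)]


# A toy frame (two sinusoids) and a circularly shifted copy of it.
frame = [math.sin(2 * math.pi * 3 * t / 32)
         + 0.5 * math.cos(2 * math.pi * 5 * t / 32) for t in range(32)]
shifted = frame[5:] + frame[:5]

# The shift changes only the phase, so the magnitude spectra should agree.
max_diff = max(abs(a - b) for a, b in
               zip(magnitude_spectrum(frame), magnitude_spectrum(shifted)))
print("largest magnitude difference after shift:", max_diff)

# A smoothed log spectrum that retains only 4 low-quefrency coefficients.
envelope = cepstral_envelope(frame, n_keep=4)
```

In this sketch a smaller `n_keep` gives a smoother envelope, while a larger one retains more of the fine spectral detail that is usually attributed to the source. A practical system would window the frame and use a fast Fourier transform rather than the naive sum shown here.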

Type: Chapter
Publisher: Cambridge University Press
Print publication year: 2009

  • Analysis of speech signals
  • Paul Taylor, University of Cambridge
  • Book: Text-to-Speech Synthesis
  • Online publication: 25 January 2011
  • Chapter DOI: https://doi.org/10.1017/CBO9780511816338.014