Hostname: page-component-848d4c4894-wzw2p Total loading time: 0 Render date: 2024-06-03T15:16:38.985Z Has data issue: false hasContentIssue false

The importance of the time–frequency representation for sound/music analysis–resynthesis

Published online by Cambridge University Press:  04 April 2001

PAUL MASRI
Affiliation:
Digital Music Research Group, University of Bristol, 5.01 Merchant Venturers Building, Woodland Road, Bristol BS8 1UB, UK E-mail: Paul.Masri@bristol.ac.uk www.fen.bris.ac.uk/elec/dmr/
ANDREW BATEMAN
Affiliation:
Digital Music Research Group, University of Bristol, 5.01 Merchant Venturers Building, Woodland Road, Bristol BS8 1UB, UK E-mail: Paul.Masri@bristol.ac.uk www.fen.bris.ac.uk/elec/dmr/
NISHAN CANAGARAJAH
Affiliation:
Digital Music Research Group, University of Bristol, 5.01 Merchant Venturers Building, Woodland Road, Bristol BS8 1UB, UK E-mail: Paul.Masri@bristol.ac.uk www.fen.bris.ac.uk/elec/dmr/

Abstract

The time–frequency representation (TFR) is the initial stage of analysis in sound/music analysis–resynthesis (A–R) systems. Given a time-domain waveform, the TFR makes temporal and spectral detail available to the remainder of the analysis, so that the component features may be extracted. The resulting ‘feature set’ must represent the sound as completely as the original time-domain signal, if the A–R system is to be capable of effective transformation and good synthesis sound quality. Therefore the system as a whole is reliant upon the TFR to make the sound components detectable, separable and measurable. Yet the standard TFR to-date is the short-time Fourier transform (STFT), of which the shortcomings, in terms of resolution, are well recognised. The purpose of this paper is to demonstrate the importance of the TFR to system function and system design. Poor feature extraction is shown to result from the use of inappropriate TFRs, whose underlying assumptions and expectations do not match those of the system. Existing models are used as case studies, with examples of performance for different sound types. A philosophy for A–R system design that includes TFR design is presented and a methodology for implementing it is proposed.

Type
Research Article
Copyright
© 1997 Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)