Speech analysis and synthesis based on dynamic modes


Séminaire Géométrie-Images: MGMI

14/06/2011 - 16:00 - Mr Steve McLaughlin (School of Engineering, Edinburgh) - Salle 1 - Tour IRMA

In this talk, the source-filter model commonly adopted in modelling speech production is adapted to represent the speech signal as the superposition and convolution of a dynamic source and resonant modes. The aim is to increase the resolution of the time-instantaneous-frequency representation of the individual contributions of different sections of the human phonatory system. We present a framework based on dynamic mode predictors and filters, adapted using gradient-based techniques, to track the modal dynamics of speech. This yields a representation that is free from quasi-stationary assumptions and thus allows flexible manipulation of the speech signal. Several examples, including intonation modifications, are offered to illustrate the potential of the proposed approach.
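
As a rough illustration of what a gradient-adapted mode predictor can look like, the sketch below tracks the instantaneous frequency of a single sinusoidal mode sample by sample, with no quasi-stationary analysis frames. It is not the framework presented in the talk. It assumes one noise-free mode of unit amplitude, a one-coefficient predictor x_hat[n] = a*x[n-1] - x[n-2] (exact for a steady sinusoid when a = 2*cos(w)), and a plain LMS update. The function names, step size and test signal are all hypothetical choices made for the example.

```python
import math


def track_mode_frequency(x, mu=0.1, a0=0.0):
    """Track the instantaneous frequency (rad/sample) of one sinusoidal mode.

    Assumption for this sketch: the mode is predicted by
    x_hat[n] = a * x[n-1] - x[n-2], and the single coefficient a is
    adapted by LMS (stochastic gradient descent on the squared error).
    """
    a = a0
    freqs = []
    for n in range(2, len(x)):
        pred = a * x[n - 1] - x[n - 2]   # one-step prediction
        err = x[n] - pred                # prediction error
        a += mu * err * x[n - 1]         # LMS gradient step on a
        a_clipped = max(-2.0, min(2.0, a))
        freqs.append(math.acos(a_clipped / 2.0))  # a = 2*cos(w)
    return freqs


def make_chirp(n_samples, w_start, w_end):
    """Unit-amplitude cosine whose frequency moves linearly (hypothetical test signal)."""
    phase = 0.0
    out = []
    for n in range(n_samples):
        w = w_start + (w_end - w_start) * n / (n_samples - 1)
        out.append(math.cos(phase))
        phase += w
    return out


# A mode gliding from 0.2 to 0.6 rad/sample, loosely analogous to a pitch glide.
chirp = make_chirp(4000, 0.2, 0.6)
estimates = track_mode_frequency(chirp)
```

The step size mu trades tracking speed against sensitivity to noise. A multi-mode speech signal would need one such predictor per mode plus some way of separating the modes, which this sketch does not attempt.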