Analysis of 1/f characteristics of male/female voiced speech residual


Baykut S., Akgul T.

IEEE 12th Signal Processing and Communications Applications Conference, Kusadasi, Turkey, 28 - 30 April 2004, pp.216-219 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/siu.2004.1338297
  • City: Kusadasi
  • Country: Turkey
  • Page Numbers: pp.216-219

Abstract

In this paper, we examine the I/f nature of voiced speech residual signal. Speech signals are generally classified as voiced or unvoiced. Voiced speech signals are considered to be generated by the vocal cords vibration signal exciting the vocal tract. In our model, the vocal tract is considered as a linear system. The excitation signal is the I/f noise which is called the speech residual signal. Since the vowels are the largest and the most evident voiced phoneme group, we study some of these vowels, i.e., /IY/, /IH/, /EI/, /EH/, /AE/, /ER/, /AH/, /AW/, /OA/, /OO/, /UW/ and /UH/ which are generated by several men and women. To extract the speech residual, first, we force to whiten the power spectrum of the speech signal by using a pre-emphasize filter and then perform the linear predictive analysis on the whitened speech to obtain the vocal tract parameters. The speech residual signal is obtained by the inverse filter. A wavelet decomposition technique is applied to the residual signal to obtain the wavelet