bernsee.com > The DSP Dimension > Formant Tutorial

On The Importance of Formants In Pitch Shifting

Download as PDF

by Stephan M. Bernsee, http://www.dspdimension.com, © 1995-99 all rights reserved

 

1.

Introduction

1.1

What are formants?

1.2

Audio Example - original

1.3

Why formants change with transposition

1.4

Audio Example - pitch shifted, formants change

1.5

Why singer formants do not change

1.6

Audio Example - pitch shifted, formants do not change

.
.

1. Introduction

This page is dedicated exclusively to the topic of formant movement occuring when pitch shifting sampled sounds. It will detail the effects involved and show pictures of the effects that cause unnatural sounding pitch shifts.

1.1 What are formants? The following graphics shows a short time fourier spectrum of a sampled sound of a female voice singing the vowel 'ah'. One can clearly see the fundamental frequency as a prominent peak to the left side of the display. The individual harmonics can be seen as small peaks of varying amplitude forming a regular pattern with equal distances to the fundamental frequency. To the right of the fundamental frequency one could see the harmonics forming some small peaks connected with a dotted line beneath a larger section marked with a solid line and the letter F. The small peaks and the large peak are all formants, we have marked the widest formant with F for utmost clarity and visibility.

 

Click the picture to view more details.
Formant Diagram 1
1.2 Audio Example:

original sound (WAVE, 132k)

(Read my Audio Example Notes page for more information on how to use the above example on your computer)

 

1.3 Why formants change with transposition. In the process of changing the pitch of a signal to sharp, for example, either with or without keeping the original file length, the sample rate of the audio signal is altered thus expanding the fundamental frequency along with all harmonics and the spectral envelope to the right, i.e. to higher frequencies. One can also see the distances between the harmonics determined by N times the fundamental frequency (with N being an integer > 1) becoming larger as is typical for a higher pitch. As the spectral envelope (and thus the marked position F) is also stretched, the position of the formants is altered. This has the same effect as altering the proportions of the vocal tract of the singer, shrinking or stretching him or her in size from a dwarf to a monster. Clearly, this is not happening when the singer sings at a higher pitch, therefore resulting in an artificial sound bearing little resemblance to the original.

 

Click the picture to view more details.
1.4 Audio Example:

pitch shifted, formants move (WAVE, 132k)

(Read my Audio Example Notes page for more information on how to use the above example on your computer)

 

1.5 Why singer formants do not change. When keeping the spectral envelope, i.e. the position of the formants either by pitch transposing with Timbre Correction or by having the singer sing at a higher pitch, one yields a natural sounding transpose. As shown below, the position of the formants (and the marked region F) is not altered during the process of singing at different pitches although the relative distances between the harmonics are now different. When singing at a higher pitch, the formants are not changed since the vocal tract responsible for resonances and cancellations forming the formants also remains unchanged.

 

Click the picture to view more details.
1.6 Audio Example:

pitch shifted, formants do not move (WAVE, 132k)

(Read my Audio Example Notes page for more information on how to use the above example on your computer)

 
 

The following newsgroups can be acessed for more information and help on formants.

Table 7: News Groups

comp.dsp
comp.music.research
comp.speech.research

If you're seeking general information on DSP, browse to the DSPguru homepage.


Last change: 04.10.2006, ©1999-2006 S. M. Bernsee, all rights reserved. Content subject to change without notice. Content provided 'as is', see disclaimer.