Upload
aditi-kanjolia
View
229
Download
0
Embed Size (px)
Citation preview
8/10/2019 classification of voiced and unvoiced fourier transform
1/16
Indian Instit
Instructor: Dr. R. B. P
Group No.
Group Members :
Name, Roll No.
Email ID :
te of Technology, I
EE202 : SIGNAL AND SYSTEMS
achori.
19
1. Aditi Kanjolia , 1200202
2. Keerthana Sravanthi, 120
1
DORE
313
8/10/2019 classification of voiced and unvoiced fourier transform
2/16
2
Contents
Problem Statement and Objective____________________________________ 3
Introduction____________________________________________________________4
MATLAB Code__________________________________________________________8
Implementation________________________________________________________ 9
Bibliography___________________________________________________________16
8/10/2019 classification of voiced and unvoiced fourier transform
3/16
3
PROBLEM STATEMENT AND
OBJECTIVE
CLASSIFICATION OF VOICED
And
UNVOICED SPEECH SIGNAL
Using
FOURIER TRANSFORM
8/10/2019 classification of voiced and unvoiced fourier transform
4/16
4
Introduction
Speech is an acoustic signal produced from a speech production system. From our
understanding of signals and systems, the system characteristics depend on the design ofthe system. For the case of linear time invariant system, this is completely characterized in
terms its impulse response. However, the nature of response depends on the type of input
excitation to the system. A similar phenomenon happens in the production of speech also.
Based on the input excitation phenomenon, the speech production can be broadly
categorized into three activities. The first case where the input excitation is nearly periodic
in nature, the second case where the input excitation is random noise-like in nature and
third case where there is no excitation to the system. Accordingly, the speech signal can
be broadly categorized into three regions- voiced, unvoiced and silence speech.
Our aim is to classify between voiced and unvoiced speech.
Voiced sounds consist of fundamental frequency and its harmonic components produced by
vocal cords (vocal folds). The vocal tract modifies this excitation signal causing formant
(pole) and sometimes anti-formant (zero) frequencies. With purely unvoiced sounds, there
is no fundamental frequency in excitation signal and therefore no harmonic structure. The
airflow is forced through a vocal tract constriction which can occur in several places
between glottis and mouth. Some sounds are produced with complete stoppage of airflow
followed by a sudden release, producing an impulsive turbulent excitation often followed by
a more protracted turbulent excitation. Unvoiced sounds are also usually more silent and
less steady than voiced ones.
Voiced sounds, e.g., a, b, are essentially due to vibrations of the vocal cords, and are
oscillatory. Therefore, over short periods of time, they are well modelled by sums of
sinusoids. This makes short-time Fourier transform, a useful tool for speech processing.
Unvoiced sounds such as s, sh, are more noise-like, as shown in figure below. They have
wide band spectrum.
8/10/2019 classification of voiced and unvoiced fourier transform
5/16
Figure- Disti
For many speech applications
speech. There are many ways
and it is based on the concep
Formants-
Wikipedia defines Formants speech science and phonetics
human vocal tract. It is often
the sound, though in vowels
child voice, the frequency of
and hence no peak is visible.
nction between voiced and unvoiced speec
, it is important to distinguish between voic
of doing it. We will use a basic method to d
of formants and the use of Fourier Transf
s the spectral peaks of the sound spectrum, formant is also used to mean an acoustic r
measured as an amplitude peak in the frequ
poken with a high fundamental frequency,
he resonance may lie between the widely-s
5
h.
d and unvoiced
this classification
rm.
of the voice". In esonance of the
ency spectrum of
s in a female or
read harmonics
8/10/2019 classification of voiced and unvoiced fourier transform
6/16
6
Fourier Transform-
The Fourier transform, named after Joseph Fourier, is a mathematical transformation
employed to transform signals between time domain and frequency domain, which has
many applications in physics and engineering.
The Fourier Transform decomposes any function into a sum of sinusoidal basis functions.
Each of these basis functions is a complex exponential of a different frequency. The Fourier
Transform therefore gives us a unique way of viewing any function - as the sum of simple
sinusoids.
The Fourier Series showed us how to rewrite any periodic function into a sum of sinusoids.
The Fourier Transform is the extension of this idea to non-periodic functions.
The Fourier Transform of a function g(t) is defined by:
[Equation 1]
The result is a function off, or, frequency. As a result, G(f) gives how much power g(t)
contains at the frequencyf. G(f) is often called the spectrum of g. In addition, g can beobtained from G via the inverse Fourier Transform:
[Equation 2]
Equation [2] states that we can obtain the original function g(t) from the function G(f) via
the inverse Fourier transform. As a result, g(t) and G(f) form a Fourier Pair: they are distinct
representations of the same underlying identity. We can write this equivalence via the
following symbol:
[Equation 3]
8/10/2019 classification of voiced and unvoiced fourier transform
7/16
7
Given below is a table of few examples of some alphabets with their classification. And in
parentheses are their phonetic transcriptions.
voiced unvoiced
b book
(b k)
p please
(pliz)
v vanilla
(v nIl )
f five
(faIv)
they
( eI)
thirty
( ti)
d dish
(dI )
t ten
(t n)
z zero
(z )
s sir
(s )
genre
( nr )
she
( i)
8/10/2019 classification of voiced and unvoiced fourier transform
8/16
8
MATLAB CODE
We will use a MATLAB code to do our required experimentation. We record some sounds
using wavrecord command. Then we get the Fast Fourier Transform of each of them, using
fft command and then we classify them as voiced and unvoiced speech signal.
The MATLAB code is as follows
>> Fs= 11025; % Setting frequency
>>y=wavrecord(Fs,Fs,'int16'); %Recording sound
>> figure, plot(y)% Plotting the magnitude of the signal in time
domain
>> figure, plot(abs(fft(double(y)))) % Plotting the
frequency domain spectrum
8/10/2019 classification of voiced and unvoiced fourier transform
9/16
9
IMPLEMENTATION
The above code was implemented on some vowels and consonants (A,P,B,S,Z,T and D).
Here are the results of the same:
A
8/10/2019 classification of voiced and unvoiced fourier transform
10/16
10
P
Figure P speech signal in time domain.
8/10/2019 classification of voiced and unvoiced fourier transform
11/16
11
Figure P speech signal in frequency domain.
B
Figure B speech signal in time domain.
8/10/2019 classification of voiced and unvoiced fourier transform
12/16
12
Figure B speech signal in frequency domain.
S
Figure S speech signal in time domain.
8/10/2019 classification of voiced and unvoiced fourier transform
13/16
13
Figure S speech signal in frequency domain.
Z
Figure Z speech signal in time domain.
8/10/2019 classification of voiced and unvoiced fourier transform
14/16
14
Figure Z speech signal in frequency domain.
T
Figure T speech signal in time domain.
8/10/2019 classification of voiced and unvoiced fourier transform
15/16
15
Figure T speech signal in frequency domain.
D
Figure D speech signal in time domain.
8/10/2019 classification of voiced and unvoiced fourier transform
16/16
16
Figure D speech signal in frequency domain.
BIBLIOGRAPHY-
Signals and Systems, Oppenheim and Willsky
Signals and Systems Using MATLAB, Luis F. Chaparro
Separation of Voiced and Unvoiced using Zero crossing rate and
Energy of the Speech Signal -Bachu R.G., Kopparthi S., Adapa B., Barkana B.D.
Web sources- Wikipedia, Saakshat Lab, IITG.