29
1 Speech Generation and Perception

Speech Generation and Perception

  • Upload
    dalila

  • View
    92

  • Download
    3

Embed Size (px)

DESCRIPTION

Speech Generation and Perception. Speech Generation and Perception :. The study of the anatomy of the organs of speech is required as a background for articulatory and acoustic phonetics. - PowerPoint PPT Presentation

Citation preview

Page 1: Speech Generation and Perception

1

Speech Generation and Perception

Page 2: Speech Generation and Perception

2

Speech Generation and Perception : The study of the anatomy of the organs of

speech is required as a background for articulatory and acoustic phonetics.

An understanding of hearing and perception is needed in the field of both speech synthesis and speech enhancement and is useful in the field of automatic speech recognition.

Page 3: Speech Generation and Perception

3

Schematic diagram of the human speech production :

Page 4: Speech Generation and Perception

4

Organs of Speech : Lungs and trachea :

source of air during speech.

The vocal organs work by using compressed air; this is supplied by the lungs and delivered to the system by way of the trachea.

These organs also control the loudness of the resulting speech.

The trachea and lungs together constitute the pulmonary tract.

Page 5: Speech Generation and Perception

5

Organs of Speech : The Larynx :

This is a complicated system of cartilages and muscle containing and controlling the vocal cords. Principle parts are :

Cricoid cartilage Thyroid cartilage Arytenoid cartilage Vocal cords

The place where the vocal folds come together is called the glottis.glottis.

Page 6: Speech Generation and Perception

6

Organs of Speech : The Vocal Tract :

Laryngeal pharynx beneath epiglottis

Oral pharynx behind tongue, between epiglottis and velum

Nasal pharynx Above velum, rear end of nasal cavity

Oral cavity Forward of the velum and bounded by lips, tongue and palate

Nasal cavity Above the palate and extending from the pharynx to the

nostrils

Page 7: Speech Generation and Perception

7

Vocal Tract

Page 8: Speech Generation and Perception

8

Vocal Tract Model

Page 9: Speech Generation and Perception

9

A General Discrete-Time Model For Speech Production

Page 10: Speech Generation and Perception

10

Time Waveform Of Volume Velocity Of The Glottal Source Excitation

Page 11: Speech Generation and Perception

11

Magnitude Spectrum Of One Pulse Of The Volume Velocity At The

Glottis

Page 12: Speech Generation and Perception

12

Position Of The Vocal Cords And Cartilages (a) For Phonation (b)

For Whispering

Page 13: Speech Generation and Perception

13

Page 14: Speech Generation and Perception

14

Speech Production : The operation of the system is divided into

two functions :ExcitationModulation

Excitation(glottis)

Modulation(vocal tract)

Radiate

speech

Page 15: Speech Generation and Perception

15

Speech Production : Excitation :is done in several ways

Phonation (making of a voiced sound) This is the oscillation of the vocal cords

The arytenoid cartilages close and stretch the vocal cords

When air forced through the vocal, they vibrate

The opening and closing of the cords breaks the airstream up into pulses

Page 16: Speech Generation and Perception

16

Speech Production : The repetition rate of the pulses is termed pitch.pitch.

At low levels of air pressure oscillation may become irregular, this irregularities are known as “vocal fry”.

Speech sounds accompanied by phonation are called voiced; others, unvoiced or mute.

Whispering (speak softly) The vocal cord are drown together, but with small

triangular opening between arytenoid cartilages

Page 17: Speech Generation and Perception

17

Speech Production :Frication

Frication can occur with or without phonation

Compression If the release is abrupt and clean, the sound is a

stopstop or plosive plosive

If gradual and turbulent, the sound can pass into the related fricative and is termed an affricative

Page 18: Speech Generation and Perception

18

Speech Production : Vibration

If air is forced through a closure other than the vocal cords, vibrations may be set up

Modulation This is what we do to impose information on the

glottal output Articulatory phonetics: how the organs of speech are

positioned to produce any given speech sound

Acoustic phonetics: what the measurable acoustical correlates of any given speech sound are and how acoustical features in general correspond to phonetic and articulatory ones

Page 19: Speech Generation and Perception

19

Hearing and perception : HearingHearing is a process which sound is

received and convert into nerve impulse

PerceptionPerception is the post-processing within the brain by which the sounds heard are interpreted and given meaning

Page 20: Speech Generation and Perception

20

The structure of peripheral auditory system :

Page 21: Speech Generation and Perception

21

Sectional View Of The Human Ear

Page 22: Speech Generation and Perception

22

Hearing : The ear is divided into three parts:

The outer ear: Consist of the pinnaConsist of the pinna (visible, convolved cartilage)

Its convolved shape is provide some directional cues

The external canalThe external canal (external auditory meatus) Uniform tube, 2.7 cm long by 0.7 cm across through It has a number of resonant frequencies at 3 kHz

The eardrumThe eardrum (tympanic membrane) Is a stiff, conical structure at the end of the meatus It vibrate in response to the sound

Page 23: Speech Generation and Perception

23

Hearing :The middle ear

Is an air-filled cavity

Separated from the outer ear by the tympanic tympanic membranemembrane

Connected to the inner ear by the ovaloval and round round windowwindow

Connected to the outside world by way of the eustachian tubeeustachian tube

Page 24: Speech Generation and Perception

24

Hearing : eustachian tube eustachian tube permit equalization of air pressure

between the middle air and the surrounding atmosphere

the middle ear contain three tiny bone (ossicles)(ossicles) Malleus (hammer)

Incus (anvil)

Stapes (stirrup)

The function of the ossicles Impedance transformation

Amplitude limiting

Page 25: Speech Generation and Perception

25

Hearing :The inner ear

vestibular apparatusvestibular apparatus Used for balance and sensing orientation

The round and oval windowThe round and oval window

CochleaCochlea Is a snail-shape passage communication with the middle ear via the round and

oval window It consist the transducers which convert acoustical

vibration to verve impulses

Page 26: Speech Generation and Perception

26

The Cochlea as It Would Appear If Unwound

Page 27: Speech Generation and Perception

27

Cross Section Of One Turn Of The Cochlea

Page 28: Speech Generation and Perception

28

Position Of Maximum Amplitude Along Basilar Membrance As A Function Of

Applied Frequency

Page 29: Speech Generation and Perception

29

Frequency Response Of a Point On The Basilar

Membrance