1/20 A Novel Fuzzy Approach to Speech Recognition Ramin Halavati, Saeed B. Shouraki, Pujan Ziaie...

A Novel Fuzzy Approach to Speech Recognition

Ramin Halavati, Saeed B. Shouraki, Pujan ZiaieSharif University of Technology

Tehran, Iran

Presented by: Pujan Ziaie (pujan_nacon@Yahoo.com)

Presented at Hybrid Intelligent Systems International Conference, 2004, Kitakyushu, Japan.

Summery

Introduction: Speech Recognition

Proposed Model Recognition Approach Training process Results

Speech Recognition Several Methods

HMM （ Hidden Markof Models), TDNN (Time Delay NN), …

Common Problems: Effect of Noise Recognition Speed

Fuzzy approach: To Ignore details such as noise. similarity with human recognition process.

Human Voice Recognition

Imprecise processing Deciding upon a rough measurement

of amplitude No counting on speech frames

(relative lengths) Sensitive to lower frequencies

Proposed Model Base Data:

Speech Spectrogram Phonemes Specification (developed by using GA)

Data manipulation: Stretching Using MEL Filter Banks. (Human’s ear is

more sensitive to low frequencies and less to high ones.)

Fuzzification to reduce amount of data. (Human do not use that much precise data.)

Calculating the belongness to each phoneme

Proposed Model

Spectrogram:

Proposed Model

After MEL-Stretching

Proposed Model

Data Reduction (Fuzzification)

Sorting

Reduction In the first step, the original signal frames are divided into 25 vertical ranges and then, the values inside each range are sorted so that the more powerful ones are moved to top.

In the second step, the top 10% values of each range are chosen and averaged and the result is replaced with the all the value of that range, making all values in each vertical range similar.

Proposed Model

Fuzzification (Contd.)

Proposed Model

Phoneme definition necessities: Colors Lengths (5 MFs)

1 Degree

elief 0

0 Range of Amplitudes 100

Black Blue Magenta Cyan White

Proposed Model

Sample Phoneme Definition:Range 25: Black or Blue

Range 24: Black or Blue

Range 4: Red or Yellow

Range 3: Blue or Magenta

Range 2: Black or Blue or Magenta

Length: Average

Recognition Method

The existence of appropriate phoneme definitions is assumed

Recognition Compare the given sample with all

phoneme definitions Choose the one with highest

compatibility value

Recognition Method

Single Phoneme Comparison: Comparing the color pattern of the

phoneme with all frames of the given sample.

Finding the matching sequences. Comparing the length of a matching

sequence with the required length.

Recognition Method

Sample, Step One:

Range 4: Green or Yellow

Range 3: Blue or Magenta

Input:( A column of the colors of the signal which is to be recognized)

Pattern:(The color pattern of the phoneme which is to be evaluated.)

Range 25: 100% or 10%

Range 24: 100% or 10%

Range 4: 0% or 20%

Range 3: 10% or 100%

Range 2: 10% or 90% or 0%

Range 1: 10% or 90% or 0%

Compatibility:(The compatibility measure between the signal colors and the phoneme’s pattern.)

Range 25: 100%

Range 24: 100%

Range 4: 20%

Range 3: 100%

Range 2: 90%

Range 1: 90%

After applying MAX:

Final Result after applying MIN:

Recognition Method

Sample, Step Two:

85 79 75 65 55 45 55 98 78 78 77 76 54 82 83 88 99 98 78 77

1.Output of Step 1:

2. Assuming the 75% as a threshold, the lengths are:

3. Selecting the max Length:

4. Computing Best Match Value:

( 82 + 83 + 88 + 99 + 98 + 78 + 77 ) / 7 = 86

82 83 88 99 98 78 77

5. Assuming requested Average Length for the Pattern:

Compatibility = 86 * IsAverage( 7 )

Training

To get the proper phoneme’s specification (colors and length)

Using GA for data improvement

Training Method Genetic Algorithm

Each Genome: Color Definitions Length Definitions Phoneme Descriptions

Cross Over: Combination of two genomes phoneme

Description part Mutation:

Randomly change a color or length definition. Randomly change a phoneme description part

Training Approach: flowchartStart

Sort Genomes Based on their Fitnesses.

Throw out the last 50% Genomes.

Randomly choose some genomes and add their cross-overs to the gene pool.

Add a mutated copy of all available genomes to the gene pool.

Is Best Genome’s Fitness acceptable?

Terminate.

Create 100 Random Genomes and add them to the gene pool.

Experimental Results

Comparison with HMMFuzzy Approach HMM Approach

1st correct answers: 85% 62.28

3rd correct answers (out of 62)[1]: 95% 79.60

6th correct answers (out of 62): 98% 86.98

[1] One of the top three guesses has been correct.

Future Works To encounter color transitions in the model.

To enhance horizontal segmentations.

To test noise immunities.

To alter model to represent and recognize words.

Acknowledgment

Special thanks to professor Hirota (TIT) for his useful advices and also giving me the opportunity to participate in the conference

Thank youAny questions?

1/20 A Novel Fuzzy Approach to Speech Recognition Ramin Halavati, Saeed B. Shouraki, Pujan Ziaie...

Documents

CanalAVIST Site Manual July 2008 By Prof. Kanchana Kanchanasut kk@cs.ait.ac.th Mr. Pujan Srivastava pujan@ait.ac.th Ms. Nisarat Tunsakul nisarat@ait.ac.th

Bjp new office bhoomi pujan program

An Introduction to Artificial Intelligence Lecture 3: Solving Problems by Sorting Ramin Halavati (halavati@ce.sharif.edu) In which we look at how an agent

College of Engineering - BABAK ZIAIE ZIAIE... · 2020. 9. 23. · Thesis: A single channel microstimulator for functional neuromuscular stimulation (Advisor: Prof. Khalil Najafi)

An Introduction to Artificial Intelligence Lecture 4a: Informed Search and Exploration Ramin Halavati (halavati@ce.sharif.edu) In which we see how information

Shew pujan-thakur-son-s

An Introduction to Artificial Intelligence Chapter 13 &14.1-14.2: Uncertainty & Bayesian Networks Ramin Halavati (halavati@ce.sharif.edu)

Desingning FC Rule-Based Systems Designing Forward-Chaining Rule-Based Systems Instructor: Mr. Halavati By: Shahin Jabbari Arfaee Pooya Esfandiar 7/2/20151

Shiv Rudrabhishak Pujan Vidhi Part 1

A Hermetic Glass-silicon Micropackage With High-Density on-chip Feedthroughs for Sensors and Actuators - Ziaie, Arx - … Systems, Journal

An Introduction to Artificial Intelligence CE 40417 Chapter 11 – Planning Ramin Halavati (halavati@ce.sharif.edu) In which we see how an agent can take

Dev Ellora Crest - Bhumi Pujan (Part - 2)

ORGAN TRANSPLANTATION Ben Durham, Kathryn Goodridge, Pujan Patel, Chelsea Perry, and Sagar Shah

Hirota lab. 1 Mentality Expression by the eyes of a Robot Presented by: Pujan Ziaie Supervisor: Prof. K. Hirota Dept. of Computational Intelligence and

An Introduction to Artificial Intelligence – CE 40417 Chapter 9 - Inference in first-order logic Ramin Halavati (halavati@ce.sharif.edu) In which we define

Dev Ellora Crest Bhumi Pujan (Part - 2)

Jain Center of Greater Boston · 2015-10-20 · Jain Center of Greater Boston June 2014 ी सध च पूजन Shree Siddhachakra Pujan The tradition of Siddhachakra Pujan is

BHOOMI PUJAN CELEBRATION NIRAV VORA - DINESH

Guru Mandaladi Pujan Vidhi 1071 Gha Alm 5 Shlf 4 Sharada - Tantra

An Introduction to Artificial Intelligence CE 40417 Chapter 12 – Planning and Acting in Real World Ramin Halavati (halavati@ce.sharif.edu) In which we