GUELMA UNIVERSITY International Conference on

GUELMA UNIVERSITY

PI: MIS

International Conference on Signal, Image, Vision and their Applications SIVA'11

November 21-24, 2011, Guelma

Inverse Problems Modeling, Information and Systems Laboratory (PI: MIS)

Faculty of Sciences and Technology, Guelma University, Guelma 24000, Algeria

Laboratory

UNIVERSIT 8 MAI 45 GUELMAInternational Conference on Signal, Image,

Vision and their Applications SIVA'11 SIVA'11, November 21-24, 2011, Guelma Algeria

OrganizerInverse ProblemsModeling, Information and Systems Laboratory (PI: MIS)

Organizing CommitteePresident: Abdelhani BOUKROUCHEMembers:Smain SAHOUR, Abdelaziz YOUNSI, Houcine BOUROUBA, Moussa SEDRAOUI, M. Cherif AMARA KORBA,Hakim DOGHMANE, Nadjib BOUSSETILA, Salim BENDJOUDI, Mohamed NEMISSI, Houria BOUDOUDA,Djalel DRICI, El Hadi MEHALLEL, Lotfi HOUAM, Laatra YOUSFI, El Hadi MEHALLEL, Djalel DRICI, SorayaZENATI.

Scientific CommitteePresident: R. JENNANE (France)Members:H. A. ABBASSI (Algeria), A. ALMHDIE (France), M. C. AMARA KORBA (Algeria), A. BABOURI (Algeria), M. BEDDA(Saudi Arabia), M. Hedi BEDOUI (Tunisia), A. BENIA (Algeria), S. BOUAKEZ (France), M. S. BOUDELLIOUA (Oman), H.BOUDOUDA (Algeria), A. BOUKROUCHE (Algeria), H. BOUROUBA (Algeria), N. BOUSSETILA (Algeria), A. F. CHAREF(Algeria), N.E. DEBBACHE (Algeria), M. DERICHE (Saudi Arabia), R. DJIMILI (Algeria), N. DOGHMANE (Algeria), M. ELHASSOUNI (Morocco), M. F. HARKET (Algeria), A. HAFIANE (France), S. KECHIDA (Algeria), M. KHAMADJA (Algeria),Y. LAFIFI (Algeria), R. LAKEL (Algeria), R. LEDEE (France), Dj. MESSADEK (Algeria), A. MOUSSAOUI (Algeria), M. NE-MISSI (Algeria), Ph. NEVEUX (France), J. RAGOT (France), P. RAVIER (France), S. SAHOUR (Algeria), D. SBIBIH (Morocco),M. SEDRAOUI (Algeria), H. SISSAOUI (Algeria), Y. SMARA (Algeria), H. SERIDI (Algeria), A. TALEB-AHMED (France), H.TEBBIKH (Algeria), A. YOUNSI (Algeria), O. UCAN (Turkey).



OrganizerInverse ProblemsModeling, Information and Systems Laboratory(PI: MIS)

TopicsBiomedical Signal and Image ProcessingSignal Processing for Communications SystemsImage and Video ProcessingDesign and Implementation of Signal Processing SystemsBiometricsInverse ProblemTexture AnalysisSystem IdentificationImage Segmentation and CharacterizationModeling and SimulationPattern recognitionControl and Systems

Prefacepqpqpqpq

The first International Conference on Signal, Image, Vision and their applications was held duringNovember 21-24 at Guelma, Algeria. SIVA'11 is organized by Inverse Problems Modeling, Infor-mation and Systems Laboratory (PI: MIS) of Guelma and May 8th, 1945 University.

The main aim of the Conference is to bring together people concerned by the multiple aspects ofsignal, image and vision. A wide range of topics is addressed, including Biomedical Signal and ImageProcessing, Image and Video Processing, Biometrics, Texture Analysis, Image Segmentation andCharacterization, Pattern recognition, Signal Processing for Communications Systems, Design andImplementation of Signal Processing Systems, Inverse Problem, System Identification, Modelingand Simulation, Control and Systems.

The scientific committee ensured the high quality and diversity of the scientific programme togetherwith the accessibility of the meeting to PhD students. We hope that the abstract will be useful forthose who are interested in the various conference fields. We also hope you will find your timerewarding and enjoyable in Guelma.

BProf. Abdelhani BOUKROUCHEPI: MIS Laboratory

CONFERENCES

Conferences



OrganizerInverse ProblemsModeling, Information and Systems Laboratory(PI: MIS)

CONFERENCES

C Rachid JENNANE (Universit d'Orlans, France)

C Mohamed Salah BOUDELLIOUA (Sultan Qaboos University, Sultanate of Oman)

C Mohamed DERICHE (King Fahd University of Petroleum and Minerals, Saudi Arabia)

International Conference on Signal, Image, Vision and their Applications SIVA11, November 21-24, 2011, Guelma Algeria

FRACTIONAL BROWNIAN MOTION MODELS

FOR BIOMEDICAL APPLICATIONS

Rachid Jennane

PRISME Laboratory, Universit dOrlans, 12 rue de Blois BP 6744, 45067 Orlans, France.

[email protected]

Abstract Fractal analysis for the characterization of complex structures is widely used in many fields. In the case of biomedical applications, analysis includes geometric characterization of biologic forms and investigation of dynamic processes. Among the fractal models, fractional Brownian motion (fBm) is well suited for the evaluation of images involving a varying range of gray levels. This talk will focus on the use of the fBm for image analysis. Starting from fractional Brownian motion of unique parameter H, different stochastic models are defined and discussed. The fractal anisotropy is considered and different methods emphasising this feature will be presented. An application of these models to the field of biomedical imaging will be demonstrated through trabecular bone data in order to access the bone microarchitecture quality.
mailto:[email protected]


Reduction of Linear Multidimensional

Systems

Mohamed S. Boudellioua

Dept. of Mathematics and Statistics, Sultan Qaboos University, Muscat, Oman.

Member of the Inverse Problems Modelling, Information and Systems Laboratory PI: MIS, University of Guelma, Algeria

[email protected]

Abstract A multidimensional system is a system in which information propagates in more than one independent direction. Examples of multidimensional systems are delay-differential systems, partial-differential systems and multidimensional recursive systems. Multidimensional systems have diverse applications in areas such as image and video processing, geophysical exploration, linear multipass processes, iterative learning control systems, lumped and distributed networks etc. A multidimensional system may be represented by a multivariate polynomial matrix. Using a polynomial matrix approach we give a necessary and sufficient condition under which a multivariate polynomial matrix can be reduced by unimodular equivalence to a simpler form which corresponds to the case when the reduced system consists of only one single equation with fewer unknowns. The proposed constructive method is illustrated by an example appearing in the literature using symbolic computation.
mailto:[email protected]


Feature Selection for Classification:

Challenges and Opportunities

Mohamed Deriche

Electrical Engineering Department, King Fahd University of Petroleum & Minerals,

PO Box 1427, KFUPM, Dhahran, 31261 Saudi Arabia

[email protected]

Abstract The presentation will discuss the problem of feature selection in classification problems. We will discuss the different techniques used to select a limited number of features from a pool of features. Then, we will discuss filter and wrapper techniques, the cost of feature selection algorithms and give a number of examples from signal and image processing applications.

TALKS

Talks

Contents SIVA'11, November 21-24, 2011, Guelma Algeria

CONTENTS

Mansouri S., Mansouri A., Latreche S., P. 1-7Speech Signal Compression using Discrete Wavelets Transforms.

Hamadache F., Diffellah N., Benmahammed K., P. 8-13Identification des Systmes Linaires Constantes de Temps Proches et Diffrentes par les Mthodes de Strejc et Broida.

Hamadache F., Diffellah N., et Benmahammed K., P. 14-18Arithmtique Base d'Intervalles Sur les Profiles.

Meziane Tani S., Bessaid A., P. 19-21Techniques d'indexation d'images Mdicales par contenu.

Aouat S., Larabi S., P. 22-27Mise en correspondance des contours d'objets bruits.

Diffellah N., Hamadache F., Benmahammed K., P. 28-33Interval arithmetic and its application to electrical circuits.

Diffellah N., Hamadache F., Benmahammed K., Abbada Y., P. 34-39Similitude entre deux nombres flous triangulaires.

Dali A., Bouchoucha M., Tadjine M., P. 40-45On Quadrotor Modeling: Hybrid Dynamical System Approach.

Djemili R., Bourouba H., Amara Korba M.C., P. 46-49Speaker Identification Using GMMs and Gender Dependent Multi-Class Support Vector Machines.

FIZAZI H., TEKKOUK A., P. 50-55Segmentation des images satellitaires par le k means floue et le seuillage adaptatif.

Lamiche C., Moussaoui A., P. 56-61Data Fusion Approach for the Segmentation of MR Images: A Study and an Evaluation.

Boujraf A., Sbibih D., et Leger C., P. 62-65Approximation des surfaces fermes. Kazem A., Hamza A., Salut G., P. 66-69Optimization of Particle Algorithm to Multiuser Detection in CDMA.

Sid Z., Aoufi L., Semchedine M., P. 70-72Nouvelle Approche de filtrage Non-linaire d'Images Numriques.

Menezia F., Meliani R., Mahdjoub Z. Moulay Lakhdar A., P. 73-78L'augmentation des performances d'un turbo-code dans son approche pragmatique.

Nemissi M., Seridi H., Boudouda H., Akdag H., P. 79-84Improving classification performances of the systems of multiple neural networks.

Amara Korba M.C., Bourouba H., Djemili R., Messadeg D., P. 85-88Robust Feature Extraction for Speech Recognition.

i

Contents SIVA'11, November 21-24, 2011, Guelma Algeria Behih M., Bekka R. E., Annou A., Djebari M., P. 89-93Stimulation Magntique d'un Axone d'un Nerf Priphrique l'aide d'une Nouvelle Bobine.

Assabaa M., Charef A., Ladaci S., Santouh Z., P. 94-99Tuning of PID Controllers Based on Bode's Ideal Transfer Function Using Fractional Calculus.

Tahi W., M.C. Mokrani K., P. 100-104La Sparation Aveugle de Sources.

Amrous A. I., Debyeche M., P. 105-109Robust Arabic speech recognition with auxiliary information in noisy environments.

Bourdjam O., Tighidet, Mehenni M ., Adggar D., P. 110-115Conception et Ralisation d'un Systme HOLTER d'enregistrement et d'analyse d'ECG.

Houam L., Hafiane A., Jennane R., Boukrouche A., Lespessailles E., Bands P. 116-121Trabecular Bone Texture Classification Using 1D LBP and Wavelet Coefficients in High-pass.

Gheraibia Y., Moussaoui A., P. 122-126Nouvelle Mthode pour la Prdiction des Structures 2D des Protines.

Belkhodja L., Hamdadou D., P. 127-132Vers l'aide au diagnostic mdical Ajustement des paires bilatrales de mammographies par algorithme Gntique.

Alim-Ferhat F., Kerdjidj O., Seddiki S., Messaoudi K., P. 133-138Implmentation de la transforme en ondelettes pour les images mdicales 3D sur FPGA.

Aggoune A., P. 139-144Approximate Flexible Queries Using Hausdorff Distance.

Halimouche R., Teffahi H., P. 145-148Vers une Mthode de Classification Automatique de Phrases Interrogatives et Affirmatives.

Bensafia Y., Ladaci S., P. 149-151Elimination du bruit dans une conduite d'aration en utilisant les filtres adaptatifs.

Meftah E.H., Anou A., Bensebti M., P. 152-157Etude de la technique OFDM-IDMA sur un canal surcharg en utilisateur.

Boukeffa S., Bouguelala A., P. 158-160Sidelobe Mitigation Using NLFM Waveforms For SAR Imaging.

Ouahioune M., Akrour L., Lahdir M., Ameur S., P. 161-166Compression d'images hyperspectrales par transforme en ondelettes 3D et SPIHT3D.

Bekaddour A., Bessaid A., Bendimerad F. T., P. 167-170SVM Regularization of Satellite Images K-MEANS Clustering Results.

Belgherbi A., Hadjidj I., Bessaid A., P. 171-175Segmentation automatique du Foie par la morphologie mathmatique, application l'imagerie mdicale TDM del'abdomen.

Bennamane A., Boutkedjirt T., P. 176-181Simulation et analyse d'images chographiques d'un tissue biologique mou.

ii

Contents SIVA'11, November 21-24, 2011, Guelma Algeria Lacheb Cheddadi A., Smara Y., P. 182-186Simulation d'images radar SAR.

Ajgou R., Sbaa S., Ghendir S., Taleb-Ahmed A., P. 187-192Dtection du pitch par un seuil adaptatif et en temps rel par les ondelettes discrtes.

Benyamina A., Fizazi H., P. 193-198Efficacit de l'algorithme hybride ACOClust pour le partitionnement des images satellitaires.

Elloumi W., Leconge R., Treuillet S., P. 199-204Pedestrian Localization: Literature Review and Illustration of a Monocular Vision Based Approach.

Harrar K., Hamami L., Jennane R., P. 205-208Fractal Analysis of Bone Radiographs Correlated with Histomorphometry.

Almhdie-Imjabber A., Rozenbaum O., Lespessailles E., Jennane R., P. 209-213A New local shape classification method for trabecular bone characterization.

Khader M., Seridi H., P. 214-219Face Detection Using Practical Swarm Optimization and Template Matching.

Benmazou S.,Layachi S., Merouani Hayet F., P. 220-223Automate cellulaire SMA pour la segmentation des images IRM. Mihoubi B., Souag N., Amrane Y., P. 224-229Estimation par la courbure du mouvement cardiaque sur des images Cin-IRM.

Akkoul S., Leconge R., Hafiane A., Lespessailles E., Jennane R., P. 230-233Stereo X-ray Reconstruction of the Proximal Femur.

Bachir R., Bouyadja, Khelfi M. F., P. 234-239Commande robuste par approche optimale d'un robot manipulateur 3 d.d.l.

Larbi Omar, P. 240-245Le Clustering Appliqu aux Traces des Utilisateurs dans une Plateforme E-learning.

Mehallel E. H., Boukrouche A., P. 246-250Spectral Analysis of UWB Signal based on Modulation and Multiple Access Scheme.

Mehallel E. H Drici Dj., Boukrouche A., P. 251-256Image Restoration with Total Variation Regularization.

Bourouba H., Amara Korba M. C., Djemili R., P. 257-261L'utilisation des machines vecteurs de support SVM pour l'identification du locuteur.

Meziani F., Debbal S. M., P. 262-266Application de La transforme en Ondelettes Discrte (TOD) dans L'analyse des Signaux Phonocardiogrammes.

Almhdie-Imjabber A., Deriche M., Aburrhman A., B., Lger C., P. 267-271Local comparison of multimodal medical surfaces using a compound registration-reconstruction algorithm.

Farou B., Seridi H., P. 272-276Systme de dtection et de suivi des objets en mouvement dans une vido de surveillance.

Chakour C., Ayech N., Harkat M. F., P. 277-283Diagnostic des systmes dynamiques en temps rel par MSPCA.

iii

Contents SIVA'11, November 21-24, 2011, Guelma Algeria Lakhfif A., Laski M. T., P. 284-289Un outil visuel bas sur la ralit virtuelle pour la gnration d'noncs en langue des signes.

Doghmane H., Boukrouche A., P. 290-294Rgularisation d'un problme mal pos des images omnidirectionnelles.

Atbi A., Debbal S.M., P. 295-299Segmentation des Signaux Phonocardiogrammes par l'Enveloppe de la Transforme de Hilbert.

Sedraoui M.,, Gherbi S.,, Abdelmalek S., P. 300-305Synthse d'un Contrleur Fractionnaire par l'Algorithme d'Essaim Particulaire.

Ayech N., Chakour C., Harkat M-F., P. 306-312New Adaptive Moving Window PCA for Process Monitoring.

Messaoudi F., Benyettou M., P. 313-318Identification Biomtrique par Fusion Multimodale du Visage et de la Dmarche.

Baadeche M., Smara Y., Dtection des Structures Linaires sur des Images RADAR SAR . P. 319-322 Fizazi H., Benmostefa S., P. 323-327Conception de la technique des chauves-souris pour la classification des images.

Kellalib B., Bouchoucha M., P. 328-333Vol en formation d'un groupe de Quadrotors base sur la technique de commande cooprative L et les saturationsimbriques.

Nasri N., Mokrani K., P. 334-339Classification floue par considration du contexte Spatial. Application aux images IRM du cerveau.

Gherbi S., Sedraoui M., P. 340-343Rglage optimal d'un contrleur bas sur le principe de Smith Predictor.

Bouagar S., Larabi S., P. 344-348Une Approche Globale pour la Reprsentation et la Mise en Correspondance d'Objets de Forme Libre.

Atoui I., Omeiri A., Bouraiou A., P. 349-354Application de l'Analyse en Ondelettes au Diagnostic de Dfauts d'une Machine Asynchrone.

Hameg S., Ameur Z., Lazri M., Ouallouche F., P. 355-357Estimation des prcipitations par une approche base sur la dure de vie des nuages sommets froids.

Fizazi Izabatene H., Ghardaoui S., P. 358-363Les mouches systme multi agents volutionnaire classification non supervise des images.

Boulemden A., Tlili Y., P. 364-367Pachinko Allocation Model with Local Features for Image Indexing and Retrieval.

Boudiaf A., Moussaoui A. KSeddiki. S. E. A., P. 368-371Condition Monitoring of Rotating Machines Using Artificial Neural Networks and Wavelet Transform.

Berradja K., Boughanmi N., P. 372-375Blind Separation of Cardiac Structures from Dynamic PET images using Markov Chain Monte Carlo sampling.

iv


Speech Signal Compression using Discrete

Wavelets Transforms.

MANSOURI Sihem, MANSOURI Abdelhamid, LATRECHE Sadjia

Departement of electronics, Ferhat Abbas University, Setif, Algeria.

Elmaabouda city. BP 19000. SETIF.

E-mail : [email protected]

ABSTRACSpeech compression is the technology of converting human speech into an efficiently encoded representation that can

later be decoded to produce a close approximation of the original

signal. This thesis presents a new algorithm to compress speech

signals using Discrete Wavelet Transform (DWT) Techniques.

Wavelet analysis is the breaking up of a signal into a set of scaled

and translated versions of an original (or mother) wavelet.

Taking the wavelet transform of a signal decomposes the original

signal into wavelets coefficients at different scales and positions.

These coefficients represent the signal in the wavelet domain and

all data operations can be performed using just the

corresponding wavelet coefficients.

In this paper a Wavelet based speech coder is implemented in

software using Matlab 7.4 Wavelet Toolbox. The major issues

concerning the design of this Wavelet based speech coder are

choosing optimal wavelets for speech signals, decomposition level

in the

DWT, thresholding criteria for coefficient truncation and

efficient encoding of truncated coefficients.

KEY WORDS Compression, Filter-Bank , Speech, Wavelet.

I. INTRODUCTION

PEECH is a very basic way for humans to convey

information to one another. With a bandwidth of only 4

kHz, speech can convey information with the emotion of a

human voice. People want to be able to hear someones voice

from anywhere in the world-as if the person would be in the

same room. Speech can be defined such as the response of the

vocal tract to one or more excitation signals. Compression of signals is based on removing the redundancy

between neighboring samples and/or between the adjacent

cycles. In data compression, it is desired to represent data by

as small as possible number of coefficients within an

acceptable loss of visual quality. Compression techniques can

be classified into one of two main categories: lossless and

lossy.

Compression methods can be classified into three functional

categories:

Direct Methods: The samples of the signal are directly handled to provide compression.

Transformation Methods: such as Fourier Transform (FT), Wavelet Transform (WT), and Discrete Cosine

Transform (DCT) .

Parameter Extraction Methods: A preprocessor is employed to extract some features that are later used to

reconstruct the signal.

Fast Fourier transforms (FFT) have been among major tools

for compression and efficient coding of signals.

Particularly, the discrete cosine transform (DCT) based

compression algorithms have become industry standard still

and video image compression systems (JPEG, MPEG). The

ability of wavelets to outperform traditional orthogonal bases

in the mentioned systems is being extensively explored nowadays.

Wavelet compression is a form of predictive compression

where the amount of noise in the data set can be estimated

relative to the predictive function. Most modern compression

techniques use a two step process: First, a predictive

compression function (such as wavelet transform) is applied.

If the choice of the predictive compression function is good,

the result will be a new set of data with smaller values and

more repletion. Second, a coding compression step that will

represent the data set in its minimal form (Huffman coding,

run-length). The compression of speech signals has many practical

applications. One example is in digital cellular technology

where many users share the same frequency bandwidth.

Compression allows more users to the system than otherwise

possible.

This paper presents a speech coder model that is based on

wavelet transform. The paper uses the discrete wavelet

transform to model speech signals. This model is implemented

using MATLAB.

II. THE DISCRETE WAVELET TRANSFORM

The main idea is the same as is in the continuous wavelet

transform (CWT). A time-scale representation of a digital

signal is obtained using digital filtering techniques.

The CWT was computed by changing the scale of the analysis

window, shifting the window in time, multiplying by the

signal, and integrating over all times. The Discrete Wavelet

Transform (DWT) involves choosing scales and positions

based on powers of two. so called dyadic scales and positions.

The mother wavelet is rescaled ordilated by powers of two and translated by integers.

S

1

Specifically, a function f(t) L2(R) (defines space of square integral functions) can be represented as

= , 2 + , (2

=

=

=1

)

The function (t) is known as the mother wavelet, while (t)

is known as the scaling function. The set of functions { 2

( 2L t k ), 2( 2 j t k )| j L, j,k ,LZ}, where Z is the set of integers, is an orthonormal basis for L2(R).

The numbers a(L, k) are known as the approximation coefficients at scale L, while d(j,k) are known as the detail

coefficients at scale j. )The approximation and detail

coefficients can be expressed

The approximation and detail coefficients can be expressed as:

, =1

2 2

, =1

2 2

To provide some understanding of the above coefficients consider a projection fl(t) of the function f(t) that provides the

best approximation (in the sense of minimum error energy) to

f(t) at a scale l. This projection can be constructed from the

coefficients a(L, k), using the equation:

= , (2 )

=

As the scale l decreases, the approximation becomes finer,

converging to f(t) as l 0. The difference between the approximation at scale l + 1 and

that at l, fl+1(t) - fl(t), is completely described by the

coefficients d(j, k) using the equation:

+1 = , (2 )

=

Using these relations, given a(L, k) and {d(j, k) | j L}, it is

clear that we can build the approximation at any scale. Hence,

the wavelet transform breaks the signal up into a coarse

approximation fL(t) (given a(L, k)) and a

number of layers of detail { f j+1(t)-fj(t)| j < L} (given by {d(j,

k) | j L}). As each layer of detail is added, the approximation

at the next finer scale is achieved.

2.1 Vanishing Moments

The number of vanishing moments of a wavelet indicates

the smoothness of the wavelet function as well as the flatness

of the frequency response of the wavelet filters (filters used to

compute the DWT) [7].

Typically a wavelet with p vanishing moments satisfies the

following equation [6]:

t dt

for m=0,., p-1,

Or equivalently,

1 = 0 = 0, , 1.

For the representation of smooth signals, a higher number of

vanishing moments leads to a faster decay rate of wavelet

coefficients. Thus, wavelets with a high number of vanishing

moments lead to a more compact signal representation and are

hence useful in coding applications.

However, in general, the length of the filters increases with the

number of vanishing moments and the complexity of

computing the DWT coefficients increases with the size of the

wavelet filters.

2.2 The Fast Wavelet Transform Algorithm

The Discrete Wavelet Transform (DWT) coefficients can be

computed by using Mallat.s Fast Wavelet Transform

algorithm. This algorithm is sometimes referred to as the two-

channel sub-band coder and involves filtering the input signal

based on the wavelet function used.

2.2.1 Implementation Using Filters

To explain the implementation of the Fast Wavelet Transform

algorithm consider the following equations:

() = (2 )

= 1

1 (2 )

2 = 20,

The first equation is known as the twin-scale relation (or the

dilation equation) and defines the scaling function . The next

equation expresses the wavelet in terms of the scaling

function . The third equation is the condition required for the

wavelet to be orthogonal to the scaling function and its

translates.

The coefficients c(k) or {c0, .., c2N-1} in the above equations

represent the impulse response coefficients for a low pass

filter of length 2N, with a sum of 1 and a norm of 1 2 . The high pass filter is obtained from the low pass filter using

the relationship gk =(1)k c(1k) , where k varies over the

range (1 - (2N-1)) to 1.

Equation 2.7 shows that the scaling function is essentially a

low pass filter and is used to define the approximations. The

wavelet function defined by equation 2.8 is a high pass filter

and defines the details.

Starting with a discrete input signal vector s, the first stage of

the FWT algorithm decomposes the signal into two sets of

coefficients. These are the approximation coefficients cA1

2

(low frequency information) and the detail coefficients cD1

(high frequency information), as shown in the figure below.

Fig.1: Filtering operation of the DWT [15].

The coefficient vectors are obtained by convolving s with the

low-pass filter Lo_D for approximation and with the high-pass

filter Hi_D for details. This filtering operation is then followed

by dyadic decimation or down sampling by a factor of 2.

Mathematically the two-channel filtering of the discrete signal s is represented by the expressions:

1 = 2 , 1 = 2

These equations implement a convolution plus down sampling

by a factor 2 and give the forward fast wavelet transform.

If the length of each filter is equal to 2N and the length of the

original signal s is equal to n, then the corresponding lengths

of the coefficients cA1 and cD1 are given by the formula:

1

2 +

This shows that the total length of the wavelet coefficients is

always slightly greater than the length of the original signal

due to the filtering process used.

2.2.2 Multilevel Decomposition

The decomposition process can be iterated, with successive

approximations being decomposed in turn, so that one signal

is broken down into many lower resolution components. This

is called the wavelet decomposition tree.

Fig.2: Decomposition of DWT coefficients

The wavelet decomposition of the signal s analysed at level j

has the following structure [cAj, cDj, ..., cD1].

Looking at a signals wavelet decomposition tree can reveal

valuable information. The diagram below shows the wavelet

decomposition to level 3 of a sample signal S.

Fig.3: Level 3 Decomposition of Sample Signal S [5]

Since the analysis process is iterative, in theory it can be

continued indefinitely. In reality, the decomposition can only

proceed until the vector consists of a single sample.

Normally, however there is little or no advantage gained in

decomposing a signal beyond a certain level. The selection of

the optimal decomposition level in the hierarchy depends on

the nature of the signal being analysed or some other suitable

criterion, such as the low-pass filter cut-off.

III. WAVELET SPEECH COMPRESSION TECHNIQUES

The idea behind signal compression using wavelets is

primarily linked to the relative scarceness of the wavelet

domain representation for the signal. Wavelets concentrate

speech information (energy and perception) into a few

neighboring coefficients [7]. Therefore as a result of taking

the wavelet transform of a signal, many coefficients will

either be zeros or have negligible magnitudes. Data

compression is then achieved by treating small valued

coefficients as insignificant data and thus discarding them. The process of compressing a speech signal using wavelets

involves a number of different stages, each of which are

discussed below.

A. Choice of Wavelet

The choice of the mother-wavelet function used in designing

high quality speech coders is of prime importance. Choosing a

wavelet that has compact support in both time and frequency in addition to a significant number

of vanishing moments is essential for an optimum wavelet

speech compressor.

Several different criteria can be used in selecting an optimal

wavelet function. The objective is to minimize reconstructed

error variance and maximize signal to noise ratio (SNR). In

general optimum wavelets can be selected based on the energy

conservation properties in the approximation part of the

wavelet coefficients.

In [1] it was shown that the Battle-Lemarie wavelet

concentrates more than 97.5% of the signal energy in the

approximation part of the coefficients. This is followed very closely by the Daubechies D20, D12, D10 or D8 wavelets, all

concentrating more than 96% of the signal energy in the Level

1 approximation coefficients.

Wavelets with more vanishing moments provide better

reconstruction quality, as they introduce less distortion into the

processed speech and concentrate more signal energy in a few

3

neighboring coefficients. However the computational

complexity of the DWT increases with the number of

vanishing moments and hence, for real time applications it is

not practical to use wavelets with an arbitrarily high number

of vanishing moments [8].

B. Wavelet decomposition

Wavelets work by decomposing a signal into different

resolutions or frequency bands, and choosing the wavelet

function and computing the Discrete Wavelet Transform

(DWT) carries out this task [2]. Signal compression is based

on the concept that selecting a small number of approximation

coefficients (at a suitably chosen level) and some of the detail

coefficients can accurately represent regular signal

components. Choosing a decomposition level for the DWT usually depends

on the type of signal being analyzed or some other suitable

criterion such as entropy. For the processing of speech signals

decomposition up to scale 5 is adequate [1], with no further

advantage gained in processing beyond scale 5.

C. Truncation of coefficients

After calculating the wavelet transform of the speech signal, compression involves truncating wavelet coefficients below a

threshold. An experiment conducted on a male spoken

sentence [4], shows that most of the coefficients have small

magnitudes. More than 90% of the wavelet coefficients have

less than 5% of the maximum value.

This means that most of the speech energy is in the high-

valued coefficients, which are few [4]. Thus the small valued

coefficients can be truncated or zeroed and then be used to

reconstruct the signal. This compression scheme provided a

segmental signal-to-noise ratio (SEGSNR) of 20 dB, with only

10% of the coefficients.

Two different approaches are available for calculating thresholds. The first, known as Global Thresholding involves

taking the wavelet expansion of the signal and keeping the

largest absolute value coefficients. In this case you can

manually set a global threshold, a compression performance or

a relative square norm recovery performance. Thus only a

single parameter needs to be selected.

The second approach known as By Level Thresholding

consists of applying visually determined level dependent

thresholds to each decomposition level in the wavelet

transform.

D. Encoding coefficients

Signal compression is achieved by first truncating small-

valued coefficients and then efficiently encoding them. One

way of representing the high-magnitude coefficients is to store

the coefficients along with their respective positions in the

wavelet transform vector [2].

Another approach to compression is to encode consecutive

zero valued coefficient [4], with two bytes. One byte to

indicate a sequence of zeros in the wavelet transforms vector

and the second byte representing the number of consecutive

zeros.

For further data compactness a suitable bit-encoding format,

can be used to quantize and transmit the data at low bit rates.

A low bit rate representation can be achieved by using an

entropy coder like Huffman coding or arithmetic coding.

IV. IMPLEMENTATION AND RESULTS

The design of the wavelet transform speech coder is based on

the concepts covered in the Wavelet Speech Compression

Techniques. Figure (4) below illustrates the different

processes involved in coding speech signals using wavelets.

The MATLABs Wavelet Toolbox incorporates many

different wavelet families, from the Wavelet Speech

Compression Techniques. It was decided to use the Haar and

Daubechies wavelets for coding speech signals.

We introduce the wavelet-based speech coder, as some functions (i.e. Compress, Decompress, Encode, Decode,

Pefcal, Playingsound, Results and Plotresult ) which are

mainly called from the function Main.

A. Calculating Thresholds

For the truncation of small-valued transform coefficients, two

different thresholding techniques are used, Global

Thresholding and By-Level Thresholding.

The aim of Global Thresholding is to retain the largest

absolute value coefficients, regardless of the scale in the

wavelet decomposition tree. Global thresholds are calculated

by setting the % of coefficients to be truncated. Level dependent thresholds are calculated using the Birge-Massart

strategy [3]. This thresholding scheme is based on an

approximation result from Birge and Massart and is well

suited for signal compression.

Fig.4: Design Flow of Wavelet Based Speech Coder.

B. Encoding Zero-Valued coefficients

After zeroing wavelet coefficients with negligible values

based on either calculating threshold values or simply

selecting a truncation percentage, the transform vector needs

to be compressed. In this implementation, consecutive zero valued coefficients are encoded with two bytes. One byte is

4

used to specify a starting string of zeros and the second byte

keeps track of the number of successive zeros.

Due to the scarcity of the wavelet representation of the speech

signal, this encoding method leads to a higher compression

ratio than storing the non-zero coefficients along with their

respective positions in the wavelet transform vector, as suggested in the Wavelet Speech Compression Techniques .

This encoding scheme is the primary means of achieving

signal compression.

C. Performance Measurement Tools

A number of quantitative parameters can be used to evaluate

the performance of the coder, in terms of reconstructed signal

quality after compression scores. The following parameters

are compared:

Signal to Noise Ratio (SNR), Peak Signal to Noise Ratio (PSNR), Normalized Root Mean Square Error (NRMSE), Retained Signal Energy(RSE), Compression Ratios (CR).

The results obtained for the above quantities are calculated

using the following formulas:

SIGNAL TO NOISE RATIO (SNR)

= 10 10

2

2

2

is the mean square of the speech signal and 2

is the

mean square difference between the original and reconstructed

signals.

PEAK SIGNAL TO NOISE RATIO (PSNR)

PNSR = 10 log10 NX 2

xr 2

N is the length of the reconstructed signal, X is the maximum

absolute square value of the signal x and || x-r||2 is the energy

of the difference between the original and reconstructed

signals.

NORMALIZED ROOT MEAN SQUARE ERROR (NRMSE)

NRMSE = ( )2

( )2

x(n) is the speech signal, r(n) is the reconstructed signal, and

x(n) is the mean of the speech signal.

RETAINED SIGNAL ENERGY (RSE)

% = () 2

() 2 100

||x(n)|| is the norm of the original signal and ||r(n)|| is the norm

of the reconstructed signal.

COMPRESSION RATIO (CR)

=( )

( )

x(n) is the original signal and r(n) is the reconstructed signal.

V. RESULTS AND DISCUSSION

A. Performance of Recorded Speech Coding

A male and female spoken speech signals were

decomposed at scale 3 and level dependent thresholds were

applied using the Birge-Massart strategy.

Since the speech files were of short duration, the entire signal

was decomposed at once without framing. A summary of the

performance is given below for the different wavelets used.

a) A Male Speech Signal Decomposed at Different type of wavelets

Wavelet Zeros(%) RSE(%) SNR PSNR NRMSE

Haar 79.2184 93.3054 11.7428 92.7803 0.2587

Db4 79.0749 97.8056 16.5870 97.6245 0.1481

Db6 79.0609 97.9450 16.8722 97.9098 0.1433

Db8 79.0381 98.0706 17.1463 98.1838 0.1389

Db10 79.0118 98.2008 17.4502 98.4877 0.1341

b) Female Speech Signal Decomposed at Different type of wavelets


Haar 79.1756 90.2214 10.0972 91.9450 0.3127

Db4 79.0771 95.4705 13.5870 95.2864 0.2128

Db6 79.0560 95.9561 13.8722 95.7948 0.2007

Db8 79.0386 96.2727 14.2463 96.1318 0.1931

Db10 79.0111 96.3477 14.3566 96.2324 0.1909

c) Compression scores

Wavelet Male Female

Haar 3.5438 3.4658

Db4 3.7742 3.5657

Db6 3.8220 3.5988

Db8 3.8250 3.6352

Db10 3.8753 3.6446

Table 1: a), b), c): Performance of Recorded Speech

Compression

B. Performance of Real Time Speech Coding

In this section the same speech files were used, except the

signals were divided into frames of length 20ms to simulate

real time coding. Each frame was then decomposed, truncated

and compressed. A level 5, wavelet decomposition was

applied along with global thresholds of close to 78-79% to

enable comparisons with the previous results. A summary of

the performance is given below for the different wavelets

used. a) Male Speech Signal Decomposed at Different type of wavelets


Haar 78.8175 83.6181 9.1507 90.1882 0.3487

Db4 78.9216 89.4932 10.7339 91.8503 0.2852

Db6 78.7505 88.6178 11.0929 91.9448 0.2852

Db8 78.3683 88.4611 9.7060 90.8218 0.3242

Db10 77.9373 88.4156 9.6944 90.9158 0.3207

5

b) Female Speech Signal Decomposed at Different type of wavelets


Haar 78.8531 85.7896 8.2749 90.1227 0.3857

Db4 79.0593 92.2694 10.5569 92.3711 0.2977

Db6 79.0560 92.2768 10.7596 92.5368 0.2921

Db8 78.9169 89.8339 8.5702 90.3742 0.3747

Db10 78.4119 88.8729 8.1457 90.1153 0.3860

c) Compressions scores

Wavelet Male Female

Haar 2.3152 2.2919

Db4 2.1076 2.1765

Db6 1.9757 1.9544

Db8 1.7834 1.8182

Db10 1.6393 1.6588

Table 2: a), b), c): Performance of Real Time Speech

Compression

Thus using a wavelet with a large no of vanishing moments

will start to deteriorate the performance of a real time speech

coder after a certain filter length. From table 2, the

Db6 wavelet gives better performance than all the other

wavelets tested. Thus for real time speech coding using

wavelets it is important to choose a wavelet function with less

vanishing moments and also a reasonable decomposition

level.

C. Varying Truncation % vs. Performance

To see the effects of varying the truncation percentage on the

reconstructed signal quality, a male spoken speech signal was

decomposed at level 5, using the Db10 wavelet. Figure 5

shows the percentage of samples with a certain threshold value

in he wavelet decomposition. From this figure it can be seen

that most of the coefficients have small magnitudes. Figures 6-

8 show trends in the quality of a signal as a function of the truncation percentage. Hence a signal of a certain quality can

be reconstructed, by choosing the appropriate threshold.

Fig.5 : Truncation % vs. Threshold

Fig.6: SNR vs. % Truncation

Fig.7: PSNR vs. % Truncation

Fig. 8: NRMSE vs. % Truncation

VI. CONCLUSIONS

Speech coding is currently an active topic for research in

the areas of Very Large Scale Integrated (VLSI) circuit

technologies and Digital Signal Processing (DSP). The

Discrete Wavelet Transform performs very well in the

compression of recorded speech signals. For real time speech

processing however, its performance is not as good.

Therefore for real time speech coding it is recommended to use a wavelet with a small number of vanishing moments at

level 5 decomposition or less. The wavelet based compression

software designed reaches a signal to noise ratio of 17.45 db

at a compression ratio of 3.88 using the Daubechies 10

wavelet.

The performance of the wavelet scheme in terms of

compression scores and signal quality is comparable with

other good techniques such as code excited linear predictive

6

coding (CELP) for speech, with much less computational

burden. In addition, using wavelets the compression ratio can

be easily varied, while most other compression techniques

have fixed compression ratios.

VII. REFERENCES

[1] Agbinya, J.I. Discrete Wavelet Transform Techniques in Speech

Processing, IEEE Tencon Digital Signal Processing Applications

Proceedings, IEEE, New York, NY, 1996, pp 514-519.

[2] Fgee, E.B., Phillips, W.J. and Robertson, W. Comparing Audio Compression using Wavelets with other Audio Compression

Schemes, IEEE Canadian Conference on Electrical and Computer

Engineering, IEEE, Edmonton, Canada, 1999, pp. 698-701.

[3] Misiti, M., Misiti, Y., Oppenheim G. and Poggi, J. Wavelet Toolbox Users Guide, Mathworks, 1997.

[4] Kinsner, W. and Langi, A. Speech and Image Signal Compression with Wavelets, IEEE Wescanex Conference Proceedings, IEEE, New York, NY,

1993, pp. 368-375.

[5] M. Misiti, Y. Misiti, G. Oppenheim and J. Poggi, Matlab Wavelet Tool Box, The Math Works Inc., 2000.

[6] J. Ooi and V. Viswanathan, Applications of Wavelets to Speech

Processing,. Modern Methods of Speech Processing, R.P. Ramachandran and

R. Mammone, ed., Kluwer Academic Publishers, Boston, 1995, pp. 449-464.

[7] V. Viswanathan, W. Anderson, J. Rowlands, M. Ali and A. Tewfik, .Real-Time Implementation of a Wavelet-Based Audio Coder on the T1

TMS320C31 DSP Chip,. 5th International Conference on Signal Processing

Applications & Technology (ICSPAT), Dallas, TX, Oct. 1994.

[8] A. Gersho, .Speech Coding,. Digital Speech Processing, A.N. Ince, ed.,

Kluwer Academic Publishers, Boston, 1992, pp. 73-100.

7


AbstractLobjectif de ce travail est danalyser lidentification

paramtrique de systmes dynamiques linaires par les deux

mthodes didentification connus mthode de Strejc et mthode

de Broida pour les systmes avec des constantes de temps assez

proches et des systmes avec des constantes de temps trs

diffrentes pour une ventuelle comparaison.

Index Terms Broida, Identification , Strejc, , Systmes

dynamiques.

I. INTRODUCTION

N systme linaire a une fonction de transfert qui peut se

calculer en tablissant les quations diffrentielles qui

relient entre et sortie. Ces quations thoriques sont parfois

difficiles crire car on na pas forcement toute la

connaissance du systme ncessaire.

Souvent, un modle dont le comportement ressemble a celui

du systme tudier est suffisant pour laborer une loi de

commande adapte. On prsente deux mthodes celle de Strejc

et Broida pour obtenir un modle sous forme de fonction de

transfert quivalente en terme de rponse un systme dont on

ne sait pas modliser le comportement [1] , [2], [3].

II. IDENTIFICATION

Identifier un systme, cest rechercher partir

dexpriences sur celui-ci, un modle mathmatique dont le

comportement dynamique soit le plus voisin possible de celui

du systme.

Lidentification dun processus ncessite lenregistrement

pralable de sa rponse. Celle-ci peut tre une rponse

indicielle (modlisation temporelle) ou la rponse signal

contenu spectral plus dense (bruit analogique ou binaire)

permettant aprs transformation de Fourier de modliser la

rponse frquentielle du processus.

A. Identification dans le domaine temporel

Cette technique consiste attaquer le systme par un

chelon damplitude donne et sintresser par la suite

lvolution dans le temps de sa sortie. . On compare ensuite la

rponse obtenue avec une fonction de transfert normalise. Il

faut donc utiliser une mthode didentification ne demandant

aucune connaissance pralable si ce nest lenregistrement de

la rponse indicielle. Le modle que lon obtiendra alors sera

un modle de reprsentation qui peut navoir aucune

correspondance physique avec le systme considr. Mais, si

lidentification est bonne, les rponses indicielles et/ou

frquentielle seront aussi proches que possible de celles du

systme tudi.

a. Mthode de Strejc

Pour les systmes apriodiques dordre deux et plus, il est

difficile destimer lordre du systme avant de lavoir

identifi. On peut choisir de le modliser par une fonction de

transfert de la forme :

nTp

prTKpE

pSpT

1

)exp(

)(

)()( (1)

Les paramtres identifier sont donc :

le gain statique K,

le retard , la constante de temps et lordre n.

La rponse indicielle dun tel modle est reprsente

comme suit :

Partant de la rponse indicielle thorique dun systme

dordre sans retard pur, Strejc a calcul les temps caractristiques Tu et Ta. Ces rsultats ont t reports dans le

tableau suivant :

Identification des Systmes Linaires

Constantes de Temps Proches et Diffrentes par

les Mthodes de Strejc et Broida

F. HAMADACHE, N. DIFFELLAH, et K. BENMAHAMMED

U

8


Table.1 Tableau des paramtres de Strejc

()

=

=

1 0 0 5.00

2 0.26 0.104 0.368

3 0.32 0.218 0.271

4 0.35 0.319 0.224

5 0.37 0.410 0.195

6 0.38 0.493 0.175

7 0.39 0.570 0.161

8 0.40 0.642 0.149

9 0.41 0.709 0.140

10 0.41 0.773 0.132

La premire colonne de ce tableau donne la position du point

dinflexion Q. . La deuxime colonne permet de calculer le

temps et den tirer le temps de retard = 1 . De la troisime colonne, on tire la constante de temps du systme.

Pour dterminer les paramtres dun modle de Strejc, il

faut :

Mesurer le gain statique du systme qui est le rapport dela valeur finale du signal de sortie et

dentre;

Mesurer les temps 1 ; Estimer le retard maximum ; Tirer du tableau de Strejc la valeur de =

. pour diffrentes valeurs de ; Calculer le retard = 1 qui doit tre

positif et infrieur ; Calculer la constante de temps = . ; Dterminer les ordres n possibles en ne conservant

que les temps de retard admissibles.

b. Mthode de Broida

La mthode de Broda consiste a assimiler la fonction de

transfert d'un systme d'ordre celle du premier ordre affecte d'un retard pur .

Le modle propos pour approcher le comportement du

systme est un premier ordre avec un retard pur. Sa fonction

de transfert est :

=

1 +

(2)

Les paramtres identifier sont donc :

le gain statique , le retard , la constante de temps

Afin de dterminer des valeurs de ces paramtres, Broda fait

correspondre la rponse indicielle identifier et la fonction de

transfert du premier ordre affecte d'un retard en deux points

1 2 d'ordonnes correspondant 28% 40% de la valeur finale de la sortie du systme.

Les paramtres du modle de Broida sont :

= 5.5 2 1 = 2.81 1.82 (3)

III. RELATION ENTRE LA METHODE DE STREJC ET BROIDA

Afin de mieux comprendre les relations existant entre les

modles de Strejc et Broida, appliquons cette mthode des

systmes dont les transmittances sont connues exactement.

A. Systmes avec des constants de temps proches

Considrons un systme dcrit par 3 constantes de temps

valant 5, 6 7 secondes.

=1

1 + 5 1 + 6 1 + 7

(4)

a. Mthode de Strejc

La reprsentation de la rponse indicielle du systme et la

tangente au point dinflexion est reprsente par la figure

suivante :

9


La mesure des temps caractristiques sur la rponse indicielle

(Fig.3) a conduit aux valeurs suivantes :

1 = 5.1924[],2 = 27.4453 [], = 2 1 =22.2529[] qui nous permettent de construire le tableau didentification suivant :

Table.2Tableau des paramtres de Strejc

= . = 1 = . 2 2.3143 2.8780 8.1891 3 4.8511 0.3412 6.0305 4 7.0987 1.9063 4.9847 5 9.1237 3.9314 4.3393 6 10.9707 5.7783 3.8943 7 12.6842 7.4918 3.5827 8 14.2864 9.0940 3.3157 9 15.7773 10.5850 3.1154

10 17.2015

12.0092

2.9374

Les retards ngatifs nayant aucune signification physique, on

liminera sans autre les ordres suprieurs n = 3. Il reste donc

le choix entre les deux modles suivants :

2 =2.8780 p

1 + 8.1891 2

(5)

3 =0.3412 p

1 + 6.0305 3

(6)

Le trac de la rponse indicielle relle et les rponses

indicielles de Strejc dordre 2 et 3 est comme suit :

Le calcul derreur entre les rponses indicielles du systme

rel et les modles de Strejc dordre 2 et 3 est reprsent par

la figure suivante :

Interprtation : Le modle de Strejc dordre 3 est en trs

bonne concordance avec le systme rel et les diffrences

entre les rponses indicielles sont difficilement perceptibles

b. Mthode de Broida

La reprsentation de la rponse indicielle du systme et points

correspondant 28% et 40% de la valeur finale est reprsente

par la figure suivante :

La mesure des temps caractristiques sur la rponse

indicielle (Fig.3) a conduit aux valeurs suivantes : 1 = 10.9700[], 2 = 13.6300 [], qui nous permettent de calculer les paramtres de Broida.

= 14.6300 = 6.1820 (7)

Le modle de Broida est alors dfini comme suit :

=6.1820 p

1 + 14.6300

(8)

Le trac des rponses indicielles relles et du modle de

Broida est comme suit :

10


Le trac de lerreur entre le systme rel et le modle de

Broida est reprsent par la figure suivante :

Interprtation : Le modle de Broida est loin dtre en

concordance avec le systme rel en le comparant avec le

modle de Strejc ; mme le modle de Strejc dordre 2

quon a rejet lors de ltape de modlisation est meilleur

que ce dernier.

B. Systmes avec des constants de temps diffrentes

Considrons un systme dcrit par 3 constantes de temps

valant 0.5, 5 10 secondes.

=1

1 + 0.5 1 + 5 1 + 10

(9)

a. Mthode de Strejc

La reprsentation de la rponse indicielle du systme et la

tangente au point dinflexion est reprsente par la figure

suivante :

La mesure des temps caractristiques sur la rponse indicielle

a conduit aux valeurs suivantes : 1 = 3.1790 [], 2 =

23.3126 [], = 2 1 = 20.1336 [] qui nous permettent de construire le tableau didentification suivant :

Tab.2 Tableau didentification de Strejc

= . = 1 = . 2 2.0939 1.0851 7.4092 3 4.3891 1.2101 5.4562 4 6.4226 3.2436 4.5099 5 8.2548 5.0758 3.9261 6 9.9259 6.7469 3.5234 7 11.4762 8.2972 3.2415 8 12.9258 9.7468 2.9999 9 14.2747 11.0957 2.8187

10 15.5633

-12.3843

2.6576

Les retards ngatifs nayant aucune signification physique, on

liminera sans autre les ordres suprieurs n = 2. Il reste donc

un seul modle qui est le suivant :

2 =1.0851

1 + 7.4092 2

(10)

Le trac de la rponse indicielle relle et la rponse indicielle

de Strejc dordre 2 est comme suit :

11


Le calcul derreur entre les rponses indicielles du systme

rel et le modle de Strejc dordre 2 est reprsent par la

figure suivante :

Commentaire : Le modle de Strejc dordre 2 est en trs

bonne concordance avec le systme rel et les diffrences

entre les rponses indicielles sont difficilement perceptibles.

b. Mthode de Broida

La reprsentation de la rponse indicielle du systme et

points correspondant 28% et 40% de la valeur finale est

reprsente par la figure suivante :


indicielle (Fig.3) a conduit aux valeurs suivantes : 1 = 8.0400[], 2 = 10.5200 [], qui nous permettent de calculer les paramtres de Broida.


indicielle (Fig.3) a conduit aux valeurs suivantes : 1 =

8.0400[], 2 = 10.5200 [], qui nous permettent de calculer les paramtres de Broida.

= 13.6400 = 3.5760 (11)

Le modle de Broida est alors dfinie comme suit :

=3.5760 p

1 + 13.6400

(12)

Le trac des rponses indicielles relle et du modle de

Broida est comme suit :

Le trac de lerreur entre le systme rel et le modle de

Broida est reprsent par la figure suivante :

Commentaire : Le modle de Broida est loin dtre en

concordance avec le systme rel en le comparant avec le

modle de Strejc.

IV. CONCLUSION

On voit que la modlisation de Strejc pour les systmes sans

retard pur que ce soit constantes de temps proches ou

12


diffrentes conduit une erreur minimale donc une

meilleure concordance avec le systme rel, alors que la

modlisation de Broida lerreur est assez importante.

Comme perspective, on dsire voir la meilleure

modlisation pour les systmes avec retard pur.

REFERENCES

[1] L. Ljung, System identification theory for the use, Prentice Hall information and system science series, 1999.

[2] Y. Zhu, Multivariable System Identification for Process Control. Elsevier Science & Technology Books, , October 2001.

[3] L. LjungT ,T.Glad, Modeling of dynamic systems, Prentice Hall, 1994.

13


Rsum Cet article est une introduction larithmtique

base dintervalles sur les profiles, avec une telle procdure, il est

possible destimer les bornes infrieure et suprieure dun

intervalle.

Mots cls Arithmtique par intervalles, profiles, informations

imprcise, estimation, oprations.

I. INTRODUCTION

A reprsentation par intervalles peut tenir compte des

incertitudes, elle peut tre parfois insuffisante,

particulirement quand un degr de confiance est prfr. En

effet, dans une approche par intervalles, il est seulement

possible destimer les bornes infrieure et suprieure des

paramtres du systme. En effet, cette reprsentation nous

autorise supposer que la valeur dun paramtre se situe dune

manire galement probable en tout point de lintervalle. Cette

faon de voir les choses peut tre enrichie en exploitant une

reprsentation par intervalles flous..

Lutilisation des intervalles flous pour reprsenter les

incertitudes dans les systmes de commande nest pas une

thorie nouvelle mais intressante [1],[2],[3].

II. ARITHMETIQUE PAR INTERVALLE

Larithmtique par intervalle introduite par Moore ([1],

[4], [5], [6]) fait intervenir des intervalles. Lide est dune

part de garantir les rsultats en calculant un intervalle dans

lequel se trouve le rsultat effectif, et dautre part on cherche

fournir un encadrement de largeur raisonnable de la solution et

avoir un rsultat suffisamment prcis.

Manuscrit reu Septembre 27, 2011.

Arithmtique base dintervalle sur les profiles.

F HAMADACHE, Centre universitaire de Bordj Bou Arrridj, Algerie, LSI

Laboratoire des Systmes Intelligents, [email protected] , N DIFFELLAH, Centre universitaire de Bordj Bou Arrridj, Algerie, LSI

Laboratoire des Systmes Intelligents, [email protected]

K BENMAHAMMED, Universit el Baha, collge des ingnieurs, Arabie

Saoudite, LSI Laboratoire des Systmes Intelligents, [email protected].

A. Dfinition dun intervalle

Un intervalle xxX , est lensemble des rels x tel que

x et

x sont les extrmits de X.

xxxxX / (1) B. Opration sur les intervalles

Soit aaa , et bbb , deux intervalles. Les oprations arithmtique standard sont dfinis comme suit :

a. Addition

Laddition de deux intervalles est comme suit :

bababbaa ,,, (2)

b. Soustraction

La soustraction de deux intervalles est comme suit :

bababbaa ,,, (3)

c. Multiplication

La multiplication de deux intervalles est comme suit :

)max(),min(,, zzbbaa (4)

Avec :

babababaz .,.,.,. (5)

d. Division

La division de deux intervalles est comme suit :

bbaabbaa

1,

1,,/,

(6)

Avec :

bb ,0

Arithmtique Base dIntervalles Sur les

Profiles

F. HAMADACHE, N. DIFFELLAH, et K. BENMAHAMMED

L

14


III. REPRSENTATION DE LINFORMATION IMPRCISE

Un intervalle standard a peut tre considr comme

ensemble dlments auxquels une fonction dappartenance

)(xa

rectangulaire est associe. En effet, un intervalle peut

tre vu comme un intervalle flou (nombre flou) spcial dont la

fonction dappartenance prend la valeur 1 sur tout

lintervalle et le 0 ailleurs. Dans ce cas, toute lanalyse et

larithmtique par intervalles peut tre considre comme

sous-ensemble de la thorie des ensembles flous.

Un intervalle flou A [7] contrairement une reprsentation

par intervalles conventionnels, toutes les valeurs dans un

intervalle flou ne sont pas galement possibles.

La distribution de possibilit est reprsente par la fonction

dappartenance de lintervalle flou o le support fait rfrence

lintervalle de toutes les valeurs possibles et le noyau

reprsente lensemble des meilleures valeurs dun paramtre

Dans ce cas, on peut dire que les intervalles flous

encapsulent dans un seul formalisme le cas optimiste (la

connaissance prcise) et le cas pessimiste (maximum de

l'incertitude).

Les diffrents types d'intervalles flous sont reprsents

comme suit :

A. Intervalle standard

Un intervalle standard est une reprsentation classique dun

intervalle.

B. Intervalle flou quelquonque

C. Intervalle flou trapzoidale

Dune manire conventionnelle, cet intervalle flou ([2], [3])

est souvent reprsent par la forme canonique de sa fonction

dappartenance donne par :

ailleurs

AS

ANxpourx

Ag

AN

ANxpour

AN

ASxpourx

Af

xA

0

,)(

,1

,)(

)(

(7)

Par exemple pour un intervalle flou trapzodale A, lquation

prcdente peut scrire sous la forme suivante :

ailleurs

AS

ANxpour

AN

AS

xA

S

AN

ANxpour

AN

ASxpour

AS

AN

ASx

xA

0

,

,1

,

)(

(8)

15


D. Intervalle flou triangulaire

Dans le cas dun intervalle flou triangulaire (cas particulier

dun intervalle flou trapzodal) on obtient :

ailleurs

AS

ANxpour

NAA

S

xA

S

AN

ASxpour

AS

AN

ASx

xA

0

,

,

)(

(9)

IV. PRINCIPE DE REPRSENTATION PAR PROFILES

Pour des raisons de simplicit mais sans perte de gnralit,

dans toute la suite de ce document, les intervalles flous

exploits sont considrs sous forme triangulaire.

AN

AS

AS

AS

ANAf

AS

AS

ANxxAf

AS

AS

ANyx

ASx

AS

ANy

AS

AN

ASx

y

AS

AN

ASx

xAf

)1()()(1

)()(1

)(

)()(

(10)

AN

AS

AN

AS

ASAg

AN

ASy

ASxAgA

NA

SyA

Sx

xA

SA

NA

Sy

AN

AS

xA

S

y

AN

AS

xA

S

xAg

)1()()(1

)()(1

)(

)()(

(11)

Un intervalle Flou A triangulaire [3] est donn par ses

profiles gauche A- et droite A

+ :

AKASASxxAxA

AKASASxxAxA

.).1(;)(/inf)(

.).1(;)(/inf)(

(12)

Avec:

)0()0(.4

)1()1(.3

1,0)(.2

1,0)(.1

AAKA

AKAA

avectedcroissanestA

aveccroissanteestA

V. OPERATIONS ARITHMETIQUES A BASE DINTERVALLE SUR

LES PROFILES

Prenons deux ensembles flous )(),( AAA et

)(),( BBB avec :

droitesprofilesBA

gauchesprofilesBA

:)(),(

:)(),(

Les quatre oprations standards sont dfinies par:

A. Addition sur les profiles

)()(),()())(( BABABA

(13)

Exemple:

Considrons deux nombres flous A1 et A2 tels que :

16


Premier nombre flou

A1 = [A1-, A1

+] : 24,1

1A

avec

A1- : 1

1A : profile gauche

A1+ : 24

1

A : profile droite

Deuxime nombre flou

A2 = [A2-, A2

+] : 22,26

2A :

avec

A2- : 26

2

A : profile gauche

A2+ : 222

A : profile droite

2224,261))(21

( AA

42,35))(21

( AA

Lutilisation de INTLAB ( Interval laboratory)pour la

simulation nous a donn les enveloppes dvolution

suivantes :

B. Soustraction sur les profiles

)()(),()())(( BABABA

(14)

21

,2121

AAAAAA

2624,221))(21

( AA

410,33))(21

( AA

Lutilisation de INTLAB pour la simulation nous a donn les

enveloppes dvolution suivantes :

C. Multiplication sur les profiles

BABABABAZ

avecZZBA

,.,.,.

:)max(),min())((

(15)

,2224,2624

,221,261

:)max(),min())(21(

Z

avecZZAA

221,2624))(21( AA Lutilisation de INTLAB pour la simulation nous a donn

les enveloppes dvolution suivantes :

D. Division sur les profiles

BSavecBB

AABA

0

)(

1,

)(

1)(),())((

(16)

26

1,

22

124,,1))(

2/

1( AA

261,22/24))(2/1( AA

17


Lutilisation de INTLAB pour la simulation nous a donn les

enveloppes dvolution suivantes :

VI. CONCLUSION

Larithmtique par intervalles sur les profiles constitue une

bonne approche, elle repose sur le fait de chercher fournir un

encadrement de largeur raisonnable de la solution et avoir un

rsultat suffisamment prcis

REFERENCES

[1] R.E.Boche, " An operational interval arithmetic ", Lockheed Missiles

and Space Company, Palo Alto, California, Paper N.CP 63-1431,

IEEE, August 1963.

[2] R.E.Moore , Error in Digital Computation, Volume 1, John Willy and

sons, Inc. New York .London .Sidney; 1965.

[3] G.J. klir, Uncertainty and information,Foundation of Generalized

Information Theory, Binghamton University SUNY, 2006 by John

Wiley & Sons,

[4] R.E.Moore, Automatic error analysis in digital computation, Missiles

and space division.28 January

[5] R.E.Moore, Interval analysis, Lockheed Missiles and space division

Sunnyvale California. September 1959

18


Rsum De nos jours, les systmes mdicaux produisent

une grande quantit de donnes images qui sont stockes dans

des bases de donnes, laccs rapide ces bases normes

ncessite des algorithmes dindexation efficaces. Lindexation

des images mdicales est devenu, pour les applications

cliniques, un outil essentiel parce quelle apporte une aide

efficace aussi bien en diagnostic quau suivi thrapeutique. Les

systmes de recherche dimages par contenu (en anglais

Content Based Image Retrievel) sont l'une des solutions

possibles pour grer efficacement ces bases. Dans ce papier, les

principales techniques dindexation dimage par contenu sont

analyses, ensuite nous discutons les problmes de ses

techniques, et nous proposons des dfis.

Mots cls Indexation dimage par contenu, recherche

dimage, CBIR, descripteur couleur, texture, forme.

I. INTRODUCTION

Lvolution de la technologie touch plusieurs secteurs dont le secteur mdical, do lapparition dappareils

dacquisition dimages qui produisent un nombre important

dimages chaque anne. Cependant laccs rapide ces bases

dimages normes ncessite des algorithmes dindexation

efficaces.

Plusieurs systme dindexation dimages ont t prsents

dans la littrature [6, 7,8].les Premiers systmes sont bass sur

le texte (en anglais :(Text-Based Image Retrivel), ces derniers

adoptent une approche qui consiste dcrire le contenu visuel

sous forme textuelle (utilisation de mots-cls). Ces mots-cls

servent comme index pour accder aux donnes visuelles

associes. Lavantage de cette approche est quelle permet de

consulter les bases de donnes en utilisant les langages

dinterrogation standard, par exemple SQL. Cependant, ceci

ncessite une grande quantit de traitement manuel. De plus la

fiabilit des donnes descriptives nest pas assure: elles sont

Manuscrit reu 16 Octobre2011. Techniques dindexation dimages

Mdicales par contenu. 1Souad. Meziane Tani, Laboratoire gnie biomdicales , Universit

Tlemcen 13000, Algrie 2Abdelhafid. Bessaid , Laboratoire gnie biomdicales , Universit

Tlemcen 13000, Algrie.

subjectives et elles pourraient ne pas dcrire correctement le

contenu de limage. Afin de rsoudre ces problmes, les

systmes dindexation dimage par contenu CBIR sont

introduits. Lobjectif est de reprsenter les images en utilisant

seulement le contenu visuel numrique de limage. Les principaux systmes de recherche dimages par contenu

existants sont : dune part ,les systmes commerciaux

comme : QBIC (IBM, 1995) Image Finder (Attrasoft)

,Excalibur (Excalibur Technologies, 1996) .Et dautre part ,les

systmes exprimentaux comme Blobworld (Universit de

Californie - Berkeley), Photobook (Massachussetts Institute of

Technology) ,Viper (Universit de Genve Computer Vision

Group) ,SIMPLIcity (Stanford University) ,Ikona (INRIA

Rocquencourt IMEDIA).

Ce papier est organis en trois principales sections. Dans la

(section-II) nous prsentons un aperu sur larchitecture

globale dun systme dindexation, puis nous discutons dans la

(section-III) des principales techniques dindexation dimage

par contenu. la fin (section-IV) les problmes et les dfis

sont proposs.

II. SYSTEME DINDEXATION PAR CONTENU

Un systme dindexation se compose de deux tapes

principales : la premire, en mode autonome (hors ligne ) pour

les prtraitements, lorganisation et la gestion de la base

dimages de rfrence (sous-systme dindexation de la base),

et la deuxime ,cest la phase de consultation en ligne pour les

traitements propres a chaque nouvelle requte de lutilisateur

(sous-systme de recherche). Les deux sous-systmes ont en

commun les deux traitements suivants :

Lextraction de descripteurs caractristiques des images, que ce soit pour une image de la base

durant la phase en mode autonome ou une image

requte propose par lutilisateur lors de la phase

en ligne.

La construction des index partir des descripteurs.

La figure II- 1 illustre larchitecture dun systme dindexation

par contenu.

Techniques dindexation dimages

Mdicales par contenu

S. Meziane Tani, A. Bessaid

19


Figure II-1: Architecture dun systme dindexation.

La phase dindexation contient les oprations ncessaires

pour organiser les descripteurs de manire accder

rapidement aux donnes. Indexer une image consistera

calculer une signature depuis ses descripteurs par le biais

dune fonction dindexation. Idalement, plus deux signatures

sont proches au sens de la mtrique utilise, plus les images

associes sont proches au sens de lutilisateur. La complexit

de la comparaison des images est alors rduite la

comparaison de leur signature dindexation.

Pendant la consultation de la base, lutilisateur slectionne

une image par le biais dune interface graphique. Les index

ou signatures de la requte sont confrontes aux index des

images de rfrences. Finalement, le systme slectionne et

prsente lutilisateur les images les plus similaires la

requte.

III TECHNIQUE DINDEXATION DIMAGE PAR

CONTENU

Plusieurs approches ont t proposes dans la littrature

pour indexer le contenu dune image, nous prsentons dans la

section suivante un aperu sur les principales techniques

utilises.

A. Indexation dimage par caractristiques bas niveau

Une caractristique bas niveau est un ensemble de valeurs et

qui sont extraites directement de limage permettant de la

caractriser. Lextraction des caractristiques de bas niveau

reprsente une premire abstraction par rapport limage

brute, le but est que cette caractristique soit discriminante par

rapport aux entits visuelles que lon cherche caractriser.ces

descripteurs bas niveau peuvent tre utilis pour limage

entire comme ils peuvent tre utilis pour une partie de

limage [2].

1) Couleur

La couleur est lune des plus importantes caractristiques

et la plus attirante ds la premire vue de limage, cest pour

cela quelle est la plus utilise en recherche par le contenu. Il

existe plusieurs espaces colorimtriques pour reprsenter la

couleur tels que RGB, HSL, LAB [3].quelque soit lespace de

couleur utilise, il existe plusieurs faons de caractriser la

couleur tels que : lhistogramme couleurs, les moments

statiques, les angles de couleur etc.

Lhistogramme couleur a t propos pour la premire

fois par SWAIN [4], cette mthode consiste calculer

lhistogramme couleur de chaque image c..d. le nombre de

pixels pour chaque intensit lumineuse (couleur), ensuite la

similarit entre les images est calcul partir dintersection

entre histogrammes, le grand inconvnient de cette mthode

est que les histogrammes occupent une place importante en

espace mmoire ce qui alourdit lalgorithme de recherche.

pour rsoudre ce problme, au lieu de calculer la

distribution complte, dans les systmes de dindexation

dimages, on calcule seulement les caractristiques de

couleur des dominantes tels que lesprance, la variance et

dautres moments.La figure (Figure III-) prsente

lhistogramme couleur dune image.

Figure III- 2:Histogramme couleur d'une image(RGB).

2) Texture

Plusieurs chercheurs ont essay de donner une dfinition de la texture mais aucune nest gnrale et formelle ; la texture

se dfinit comme un ensemble de primitives (de pixels peu

prs semblables) arranges selon des rgles particulires de

placement.

20


Figure III-A-3:Exemple d'une texture

Les mthodes proposes pour caractriser la

texture peuvent tre classifies en deux catgories[5] :La

premire inclut les mthodes statistiques qui caractrisent les

relations statistiques entres les diffrents niveaux de gris des

pixels utilisant une fonction de corrlation, la mthode la plus

connue cest la matrice de co-occurrence .La deuxime

catgorie comporte les mthodes spectrales bases sur le

traitement de signal ,tels que le filtre de Gabor , les moments

de hrmites .On peut galement trouver dautre mthodes de

caractrisation de la texture tels que : modlisation

markovienne qui peut modliser les interaction entre pixels .

1) Forme

La forme est une autre primitive qui sert caractriser le contenu dune image; deux mthodes de

description de la forme peuvent tre distingues [2], dune part

les mthodes bases sur le contour tels que: descripteur EOH

(Edge Orientation Histogram) qui consiste extraire les

contours dune image et de calculer lhistogramme de langle

des gradients sur les contours. Et dautre part la mthode base

sur les rgions: tels que les moments gomtriques.

B. Indexation dimage par caractristique haut

niveau

Une caractristique haut niveau, gnralement appele

caractristique smantique, reprsente le contenu

smantique qui se dfini par les attributs abstraits de

limage tels que : les scnes smantique (exemple : bord de

mer, rue, champ doiseaux, etc.), comportement

smantique (par exemple la performance, agression,) et

l'motion smantique (calme, l'harmonie, l'inspiration,

etc.). Cependant la smantique de limage est la

caractristique la plus complexe, floue et abstraites est

difficile a extraire de limage ;

Gnralement, les caractristiques hauts niveaux

dpendent des caractristiques bas niveaux.

III. CONCLUSION

Selon la prsente analyse et l'tude des techniques dindexation dimages bases sur le contenu, quelques

problmes ont t dcouverts savoir :

1) lindexation des images par la couleur prsente le problme du manque de considration de la scne

visuelle. De plus les images mdicales ne sont pas

toujours en couleur.

2) Il est difficile de caractriser tous types de textures avec une mthode particulire parce que la texture choisie

pour chaque mthode dpend de la texture de limage.

3) Lindexation des images par la forme prsente plusieurs problmes parmi lesquels ont peut citez le cas des

formes caches (Comment reconnatre les objets

partiellement cachs ?).

En plus des problmes cits en haut, ces descripteurs

(couleur, forme, texture) sont bass sur la prise en compte

directe du contenu visuel mais aucune considration pour la

smantique de l'image, ce qui cre un foss smantique qui a

t dfini par [5] comme le manque de concordance entre les

informations que l'on peut extraire des donnes visuelles et

linterprtation que les mmes donnes ont pour un utilisateur

dans une situation donne.

Le dfi actuel consiste amlior les descripteurs visuels

afin de rduire ce foss smantique et de passer lutilisation

des caractristiques hauts niveaux.

REFERENCES

[1] S. Jai-andaloussi ,M lamard guy gazuguel ,h tairi ,m meknassi , b cochenrer , Content based image retrievel :used of generalized

gaussien density , springer journal ,april 2009.

[2] Y .Xiaohong ,X jinhua , The related techniques of content based image retrieval,international sympusiumom of computer science and

computatiana technology ,IEEE computer society,2008.

[3] V.chtkara, Color based image retrievel using compact binary signatures , technical rapport, department of computer science , university of

Alberta Admonton, May 2001.

[4] M swain ,D ballard , Color indexing ,international journal of vision ,1991.

[5] Image Retrieval :ideas, influences ,and trend of the new age, the Pennsylvania state university, ACM transactions on computing surveys , April 2008.

[6] P. Aigrain et al. Content-based representation and retrieval of visual media : a-stateof- the-art review. Multimedia Tools and Applications,

3(3) :179202, septembre 1996

[7] New J.J. Fan and K.Y. Su. An efficient algorithm for matching multiple patterns. IEEE Trans. Knowl. Data Eng., 5(2) :339351, avril 1993.

[8]

21


Mise en correspondance des contours dobjets

bruits

S.Aouat, S. Larabi

RSUM Nous prsentons dans ce papier une nouvelle approche qui permet de comparer les silhouettes dobjets. Les silhouettes sont extraites partir de deux images prises sous des points de vue lgrement diffrents, (dans le cas de deux images stroscopiques) ou partir dune squence dobjets en mouvement. La comparaison des silhouettes se fait en se basant sur leur description XLWDOS en prenant en considration la prsence du bruit dans les deux images. Le bruit modifie malheureusement dune manire assez considrable une telle description do la ncessit de rsoudre ce problme afin effectuer une meilleure comparaison des images. La mthode est valide par une exprimentation effectue sur des images relles. MOTS CLS Comparaison, Descripteur, Silhouette, Bruit, LWDOS, XLWDOS.

I. INTRODUCTION

lusieurs reprsentations des objets 3D partir dimages en

se basant sur diffrents lments ont t proposes dans la

littrature, nous citons : Des reprsentations bases sur les

parties composant un objet (primitives volumiques), [3]. Des

reprsentations bases sur les frontires 3D reconstruites

partir de la strovision [13]. Des reprsentations bases sur

lapparence [11,4]. Des reprsentations bases sur les

courbures des frontires de la silhouette qui est reprsente

moyennant le CSS (Curvature Scale Space) propose par

Mokhtarian [10,9,14], ainsi que beaucoup dautres mthodes

de reprsentation et reconnaissance utilisant des

caractristiques extraites des images que se soit gomtriques

ou photmomtriques [15-32].

Manuscript reu le 16 Octobre, 2011.

Saliha AOUAT Laboratoire LRIA, Dpartement

dInformatique USTHB, Alger, Algrie

([email protected], [email protected])

Slimane LARABI Laboratoire LRIA USTHB

Parmi les diffrentes mthodes de description des formes

utilises dans la reconnaissance d'objets, lquipe de vision par

ordinateur du Laboratoire de Recherche en Intelligence

Artificielle (LRIA) de lUSTHB a labor une technique de

description dobjets, la description propose est textuelle,

invariante au changement dchelle et la rotation mais

malheureusement elle est sensible au bruit (interfrences et

parasites). Cette description est base sur le langage LWDOS

(Language for Writing Descriptors of Outline Shapes)[8]. Et

XLWDOS (XML Language for Writing Descriptors of

Silhouettes) [7].

Dans ce papier, nous dveloppons une mthode qui permet

de comparer deux silhouettes dobjets extraites partir

dimages prises sous des points de vues lgrement diffrents

et cela en tenant compte de la prsence du bruit dans les deux

images. Pour effectuer le travail dcrit dans ce papier, nous

nous sommes inspirs de lide prsente dans [6,1],

cependant la particularit de notre approche rside dans

lexprimentation totalement automatique depuis la

description textuelle jusqu' la comparaison des silhouettes

reprsentant des images relles, ainsi que son application un

ensemble dobjets en mouvement. Cette approche est

diffrente du lissage textuel [2] appliqu sur les descripteurs et

aboutissant lindexation et la reconnaissance de silhouettes

dobjets.

Ce papier est organis comme suit : le paragraphe 2 est

consacr la description de silhouettes selon les langages

XLWDOS [7]. Nous aborderons ensuite dans le paragraphe 3

la notion du bruit dans une image ainsi que la mthode utilise

pour le lissage de celle ci. Dans le chapitre 4, nous expliquons

le processus de comparaison que nous avons conu pour

confirmer ou infirmer la ressemblance de deux silhouettes. Le

paragraphe 5 explicitera notre tude par une partie

exprimentation qui montrera le droulement du processus de

comparaison implment et sa validation sur des images

relles.

P

22


II. DESCRIPTION XLWDOS DUNE SILHOUETTE

Le langage XLWDOS [7] est un langage XML dvelopp

pour la description des silhouettes, il est en fait issu du

langage LWDOS [8] Lintrt majeur du descripteur

XLWDOS (comme son prdcesseur) est son criture

textuelle, permettant une facilit dindexation, de traitement et

de stockage. Lexemple de la figure 1 illustre brivement la

description dune silhouette moyennant le langage XLWDOS

exemple de silhouette

P1 D1 P2 P3 (DS signifie description

of shape.)

Cette description signifie que la silhouette est forme dune

partition P1 suivie dune ligne de disjonction (D1) suivi son

tour de deux autres partitions P2 et P3. Il est noter que

lorsquon passe de plusieurs partitions une seule la ligne sera

appele ligne de jonction (J). Comme la mme description

globale peut dcrire plusieurs silhouettes dobjets, il serait

ncessaire de passer la description dtaille des lments

constituant la description prcdente. Le descripteur dtaill

est donn par :

Figure 1. Exemple de description dune silhouette

exemple de silhouette

cv 3 130 25 r 90 35 cc 25 60 40 r

90 20

h P1 5 s P1 P2 15 h P1 9 s P1 P3 17 h P1 6

r 90 20 r 180 15 r 90 20

r 90 20 r 180 15 r 90 20

.

A titre dexemple Le cot gauche de la partition P1 est une

succession dune courbe convexe (cv) dont le degr de

convexit est de 3%, langle dinclinaison est de 130 et de

longueur 25 pixels. Cette courbe est suivie dun segment de

droite (r) inclin de 90 et de longueur gale 35 pixels.

Le cot droit de P1 est une courbe concave (cc) suivi aussi

dun segment vertical. La ligne de disjonction (D) est

constitue dune succession de cinq petits segments :

h P1 5 : de longueur 5 et appartenant la partition P1 den haut do le nom free high symbolis par (h)

s P1 P2 15 : de longueur 15 appartenant deux partitions P1 et P2 do le nom shared segment symbolis par (s)

h P1 9 : de longueur 9 appartenant la partition P1 s P1 P3 17 : de longueur 17 appartenant deux partitions

P1 et P3.

h P1 6 : de longueur 6 et appartenant la partition P1.

Il est important dindiquer que toutes ces valeurs et

mesures sont calculs relativement lunique rectangle daire

minimum englobant la silhouette [7,8]. La dsignation CP

signifie partition compose (composed part).

Documents

GUELMA UNIVERSITY International Conference on