Upload
others
View
2
Download
0
Embed Size (px)
Citation preview
EUROPEAN SPEECH COMMUNICATION ASSOCIATION (ESCA)
5th EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY
UNDER THE AUSPICES OF
THE MINISTRY OF CULTURETHE MINISTRY OF THE AEGEAN
THE GENERAL SECRETARIAT OF SCIENCE AND RESEARCH
PROCEEDINGSVOLUME 3
ORGANIZER:UNIVERSITY OF PATRASWIRE COMMUNICATIONS LABORATORY261 10 Rion - Patras - Greece
SESSION: TABRobustness in Recognition and Signal Processing ItChair: Alex Wialbet, Carnegie Mellon Univ., USA
TAB.l Adaptation of Time Differentiated Cepstrum forNoisy Speech Recognition 1075Tai-Hwei Hwang, *Lee-Min Lee, Hsiao-Chuan WangNational Tsing-Hua Univ., ROChina*Mingchi Institute of Technology, ROChina
TAB.2 On The Importance of Various ModulationFrequencies for Speech Recognition 1079f Noboru Kanedera, Takayuki Arai, Hynek Hermansky, MishaPavelOregon Graduate Institute of Science and Technology, USAinternational Computer Science Institute, USAflshikawa National College of Technology,Japan
TAB3 A Robust RNN-Based Pre-CIassification for NoisyMandarin Speech Recognition 1083Wei-Tyng Hong, Sin-Horng ChenNational Chiao Tung Univ., ROChina
TAB.4 A Parallel Environment Model (PEM) for SpeechRecognition and Adaptation 1087Mazin RahimAT&T Labs, USA
TAB.5 Adaptive Model Combination for Robust SpeechRecognition in Car Environments 1091VolkerSchless, Fritz ClassDaimlerBenzAG, Germany
TAB.6 A Comparative Study of Speech Detection MethodsStefaan Gerven Van, Fei Xie 1095KULEUVEN-ESAT, Belgium
TAB.7 Voice Activity Detection Using Source SeparationTechniques 1099Nikos Doukas, Patrick Naylor, Tania StathakiImperial College, UK
TAB.8 Applying Blind Signal Separation to the Recognitionof Overlapped Speech 1103Tomohiko Taniguchi, Shoji Kajita, Kazuya Takeda, FumitadaItakuraNagoya Univ., Japan
TAB.9 Multiresolution Channel Normalization for ASR inReverberant Environments 1107Carlos Avendano, Sangita Tibrewala, Hynek HermanskyOregon Graduate Institute of Science and Technology, USA
TAB.10 A Speech Pre-Processing Technique for End-PointDetection in Highly Non-Stationary EnvironmentsRafael Martinez, Agustin Alvarez, Vilda Pedro Gomez,Mercedes Perez, Victor Nieto, Victoria RodellarUniversidad Politecnica de Madrid, Spain 1111
TAB.ll Application of Several Channel and NoiseCompensation Techiques for Robust Speaker RecognitionLaura Docio-Fernandez, Carmen Garcia-MateoUniv. of Vigo, Spain 1115
TAB.12 Knowing the Wheat from the Weeds in NoisySpeech l U 9Hany Agaiby, Thomas J. MoirUniv. of Paisley, UK
TAB.13 Model-Based Approach for Robust SpeechRecognition in Noisy Environements with Multiple Noise
Sources 1123Do Yeong Kim, *Nam Soo Kim, Chong Kwan UnKAIST, Korea*SAIT, Korea
TAB.14 Normalization of Speaker Variability byWarping for Robust Speech Recognition 1127Y.C. Chu, Charlie Jie, Vincent Tung, Ben Iin, Richard LeePhilips Taiwan, ROChina
TAB.1S LPC Poles Tracker for Music/Speech/NoiseSegmentation and Music Cancellation 1131Stephane H. MaesIBM, USA
TAB.16 Comparative Evaluations of Several Front-Endsfor Robust Speech Recognition 1135Doh-Suk Kim, Jae-Hoon Jeong, Soo-Young Lee, Rhee tA- KilKorean Advanced Instiute of Science and Technology, Korea
TAB.17 Speaker Normalization Through Formant-BasedWarping of the Frequency Scale 1139Evandro B. Gouvea, Richard M. SternCarnegie Mellon Univ., USA
TAB.18 The Use of Cepstral Means in ConversationalSpeech Recognition 1143Martin WestphalUniv. Karlsruhe, Germany
TAB.19 Compensation for Environmental and SpeakerVariability by Normalization of Pole Locations 1147Juan M. Huerta, Richard M. StemCarnegie Mellon Univ., USA
TAB.20 Cellular Phone Speech Recognition: NoiseCompensation vs. Robust Architectures 1151Jean-Baptiste Puel, Regine Andre-ObrechtIRIT - Universitaire Paul Sabatier, France
TAB.21 Speech Recognition in Noise Using On-Adaptation U55TungHui ChiangAdvanced Technology Center (ATC), Computer andCommunication Labs (CCL), Industrial Technology ResearchInstiitute (ITRI), ROChina
SESSION: TACAcoustic ModellingChair Vassilios Digalakis, Technical Vniv, of Crete,Greece
TAC.l Incorporating Linguistic Knowledge and AutomaticBaseform Generation in Acoustic Subword Unit BasedSpeech Recognition 1159Trym Holter, TorBjom SvendsenThe Norwegian Univ. of Science and Technology (NTNU),Norway
TAC.2 Modeling and Decoding of Crossword ContextDependent Phones in the Philips Large VocabularyContinuous Speech Recognition System 1163Peter Beyerlein, Meinhard Ullrich, Patricia WilcoxPhilips GmbH, Germany
TAC3 Modelling Inter-Frame Dependence withPreceeding and Succeeding Frames 1167Philip Hanna, Ji Ming, Peter CTBoyle, F.Jack SmithQueen's Univ. of Belfast, N. Ireland
TAC.4 Continuous Speech Recognition Using SyllablesRhys James Jones, *Simon Downey, John S. MasonUniv. of Wales Swansea, UK 1171*BT Laboratories, UK
TAC.5 A New Approach to Generalized Mixture Tying forContinuous HMM-Based Speech Recognition 1175Daniel Willett, Gerhard RigollGerhard-Mercator-Univ. Duisburg, Germany
TAC.6 State Tying for Context Dependent PhonemeModels 1179Klaus Beulen, Elmar Bransch, Hermann NeyRWTH Aachen Univ. of Technology, Germany
TAC7 A Novel Node Splitting Criterion in Decision TreeConstruction for Semi-Continuous HMMS 1183Jacques Duchateau, Kris Demuynck, Dirk Van CompernolleKatholieke Univ. Leuven-ES.A.T, Belgium
TAC.8 Creating Unseen Triphones by PhoneConcatenation in the Spectral, Cepstral and FormantDomains 1187Mats BlombergKTH, Sweden
TAC.9 Creating Large Subword Units for SpeechRecognition 1191Thilo Pfau, Manfred Beham, *W. Reichl, Giinther RuskeMuenchen Univ. of Technology, Germany*Bell Laboratories, USA
TAC.10 Segmental Modeling Using a Continuous Mixtureof Non-Parametric Models 1195Jacob Goldberger, David Burshtein, *Horacio FrancoTel Aviv Univ., Israel*SRI International, USA
TAC.11 Segmentation and Modeling in Segment-BasedRecognition 1199Jane W. Chang, James R. GlassMIT, USA
TAC.12 Using Syllables in a Hybrid HMM-ANNRecognition System 1203Alfred HauensteinSiemens AG, Germany
TAC.13 Noise Robust Segment-Based Word RecognitionUsing Vector Quantisation 1207Ramalingam Hariharan, Juha Hakkinen, Kari Laurila, JanneSuontaustaNokia Research Center, Finland
TAC.14 Viterbi Based Splitting of Phoneme IIMM's... 1211Luis Javier Rodriguez, *Ines M. TorresUniversidad del Pais Vasco., Spain*UPV/EHU, Spain
TAC.1S The Demiphone: An efficient Subword Unit forContinuous Speech Recognition 1215Jose B. Marino, A. Nogueiras, Antonio BonafonteUniversitiit Poliitecnica de Catalunya, Spain
TAC.16 Organizing Phone Models Based on PiecewiseLinear Segment Lattices of Speech Samples 1219Hiroaki Kojima, Kazuyo TanakaElectrotechnical Lab, Japan
TAC.17 Automatic Architecture Design by Likelihood-Based Context Clustering with Crossvalidation 1223Ivica RoginaUniv. Karlsruhe, Germany
TAC.18 Towards Articulatory Speech Recognition:Learning Smooth Maps to Recover Articulator Information
1227Sam Roweis, *Abeer AlwanCalifornia Institute ofTechnologv, USA*UCLA, USA
TAC.19 Selection of the Most Effective Set of SubwordUnits for an HMM-Based Speech Recognition SystemAnastasiosTsopanoglou, *Nikos FakotakisKNOWLEDGESA, Greece 1231*Univ. ofPatras, Greece
TAC.20 Multi-Ban Continuous Speech RecognitionChristophe Cerisara, Jean-Paul Haton, Jean Francois Man,Dominique FohrCRIN-CNRS & INR1A Lorraine, France 1235
TAC.21 The Design of Acoustic Parameters for Speaker-Independent Speech Recognition 1239Nabil N. Bitar, Carol Y. Espy-WilsonBoston Univ., USA
SESSION: TADSpeech Coding IIOwir.-John hlaurjopoulos, Univ. ofPaJras. Greece
TAD.l High Quality Split-Band LPC Vocoder and its FixedPoint Real Time Implementation 1243Stephane Villette, Milos Stefanovic, Ian Atkinson, AhmetKondozUniv. of Surrey, UK
TAD.2 Missing Packet Recovery Techniques for DM CodedSpeech 1247Wen-Whei Chang, *Hwai-Tsu Chang, *Wan-Yu MengNational Chiao-Tung Univ., ROChinaindustrial Technology Research Institute, ROChina
TAD.3 Spectral Sensitivity of LSP Parameters and TheirTransformed Coefficients 1251Vu Hai Le, Laszlo LoisTechnical Univ. of Budapest, Hungary
TAD.4 Reducing the Complexity of the LPC VectorQuantizer Using the K-D Tree Search Algorithm 1255V. Ramasubramanian, K.K. PaliwalATR Interpreting Telecommunications Res. Labs., Japan
TAD.5 LPC Quantization Using Wavelet Based TemporalDecomposition of the LSF 1259Aweke N. Lemma, *W.Bastiaan Kleijn, Ed F. DeprettereDelft Univ. of Technology, The Netherlands*KTH, Sweden
TAD.6 A Novel 1.7/2.4 KB/S DCT Based PrototypeInterpolation Speech Coding System 1263Costas S. Xydeas, Gokhan H. IlkUniv. of Manchester, UK
TAD.7 Improved Regular Pulse VSELP Coding of Speechat Low Bit-Rates 1267Yong-Soo Choi, *Hong-Goo Kang, Sang-Wook Park, tJae-HaYoo, Dae-Hee YounYonsei University, Korea*AT&T Labs, USAfLG Electronic Inc., Korea
TAD.8 Joint Estimation of Pitch JB and Magnitudes, andV\UV Decisions for MBE Vocoder 1271Yong Duk Cho, Hong Kook Kim, Moo Young Kim, SangRyong KimSamsung Advanced Institute of Technology, South Korea
TAD.9 A New Distance Measure in LPC Coding:Application for Real Time Situations 1275Balazs Kovesi, Samir Saoudi, Jean Marc Boucher, *GaborHorvathENST-Br, Francetechnological Univ. of Budapest, Hungary
TAD.10 Consideration of Processing Strategies for Very-Low-Rate Compression of Wideband Speech Signals withknown Text Transcription 1279Peter Vepyek, Alan B. BradleyRMTT, Australia
TAD.ll Zero-Redundancy Error Protection for CELPSpeech Codecs 1283Norbert GortzUniv. of Kiel, Germany
TAD.12 Low Bit Rate Speech Coding Using an ImprovedHSX Model 1287Ridha Matmti, Milan Jelinek, Jean-Pierre AdoulUniv. of Sherbrooke, Canada
TAD.13 Phonetic Vocoding with Speaker AdaptationCarlos M. Ribeiro, Isabel TrancosoINESC, Portugal 1291
TAD.14 Quantization of Spectral Sequences Using VariableLength Spectral Segments for Speech Coding at Very LowBit Rate 1295Genevieve Baudoin, *Jan Cernocky, tGerard CholletESIEE, France*FEIVUT, FrancefENST, France
TAD.15 On Modeling Event Functions in TemporalDecomposition Based Speech Coding 1299Shahrokh Ghaemmaghami, Mohamed Deriche, BoualemBoashashQueensland Univ. of Technology, Australia
TAD.16 Phase Quantization by Pitch-Cycle WaveformCoding in Low Bit Rate Sinusoidal Coders 1303Soledad Torres, * Javier F Casajus-QuirosUniversidad de Valladolid, Spain*Universidad Politecnica de Madrid, Spain
TAD.17 A Perceptual Study of the Greek Vowel SpaceUsing Synthetic Stimuli 1307Antonis Botinis, *Marios Fourakis, t-fohn W. HawksAthens Univ., Greece*The Ohio State Univ., USAtKent State Univ., USA
TAD.18 Mixed Multi-Band Excitation Coder UsingFrequency Domain Mixture Function (FDMF) for a Low-Bit Rate Speech Coding 1311Woo-Jin Han, Sung-Joo Kim, Yung-Hwan OhKAIST, Korea
TAD.19 Robust GSM Speech Decoding Using the ChannelDecoder's Soft Output 1315Tim Fingscheidt, Olaf ScheufenAachen Univ. of Technology, Germany
TAD.20 A Low-Bit-Rate Speech Coder Using AdaptiveLine Spectral Frequency Prediction 1319Carl W. Seymour, Tony A. RobinsonCambridge Univ., UK
SESSION; WtADialogue Systenxs;AppJjcationsChair: Norman Fraser, Univ. of Surrey, UK
W1A.1 Experiments in Spoken Queries for DocumentRetrieval 1323James Barnett, Steve Anderson, *John Broglio, Mona Singh,tR. Hudson, fS.W. KuoDragon Systems, USA*Univ. of Massachusetts, USAflntermetrics Inc., USA
W1A.2 Towards an Automated Directory InformationSystem. 1327Frank Seide, *Andreas KellnerPhilips Research Laboratories Taipei, Taiwan*Philips GmbH Aachen, Germany
W1A.3 A Strategy for Mixed-Initiative Dialogue ControlLars Bo LarsenAalborg Univ., Denmark 1331
W1A.4 On the Design of Effective Speech-Based Interfacesfor Desktop Applications 1335Jim Hugunin, Victor ZueMIT, USA
W1A.5 Dialogue Strategies Guiding Users to theirCommunicative Goals 1339Matthias Denecke, Alex WaibelCarnegie Mellon Univ., USA
W1A.6 A Speech Interface for Forms on WWW 1343Sunil IssarCarnegie Mellon Univ., USA
SESSION: W1BSpeech Production ModellingChair: Michael D. Rttey,AT&T {jibs, USA
W1B.1 Voice Conversion by Codebook Mapping of LineSpectral Frequencies and Excitation Spectrum 1347Levent M Arslan, David TalkinEntropic Research Laboratory, USA
W1B.2 Optimal State Dependent Spectral Represetation forHMM Modeling: A New Theoretical FrameworkChafic Mokbel, *Guillaume Gravier, *Gerard CholletFrance Telecom, France*ENST, France 1351
W1B3 Speech Analysis and Systems Using an AM-FMMolulation Model 1355Alexandras Potamianos, *Petros MaragosAT&T Labs-Research, USA*ILSP & Georgia Tech, Greece & USA
W1B.4 Synthesis of Fricative Consonants by Audiovisual-to-Articulatory Inversion 1359Khaled Mawass, Pierre Badin, Gerard BaillyICP, INPG, France
W1B.5 New Transformations of Cepstral Parameters forAutomatic Vocal Tract Length Normalization in SpeechRecognition 1363Tom Claes, *Ioannis Dologlou, Louis ten Bosch, Dirk VanCompernolleLernout & Hauspie Speech Products, Belgium*K.ULeuven-E.S.A.T, Belgium
W1B.6 A Multiresolutionally Oriented Approach forDetermination of Cepstral Features in Speech RecognitionSimon Dobrisek, France Mihelic, Nikola PavesicUniv. of Ljubljana, Slovenia 1367
SESSION: W1CSpeaker Recognition IIChair: Aaron Rosenberg, AT & TLabs, USA
W1C.1 Speaker Identification with User-Selected PasswordPhrases 1371Aaron E. Rosenberg, S. ParthasarathyAT&T Labs, USA
W1C.2 Speaker Verification Based on Phonetic DecisionMaking 1375Jesper OlsenAalborg Univ., Denmark
W1C.3 Analysis and Comparison of Score NormalisationMethods for Text-Dependent Speaker Verification 1379A.M. Ariyaeeinia, P. SivakumaranUniv. of Hertfordshire, UK
W1C.4 Automatic Speaker Recognition on a VocoderLink 1383Frederic Jauquet, Patrick Vcrlindc, Claude VloeberghsRoyal Military Academy, Belgium
W1C.5 Likelihood Ratio Adjustment for the Compensa-tionof Model Mismatch in Speaker Verification 1387Frederic Bimbot, *Dominique GenoudENST/CNRS, France*IDIAP, Switzerland
W1C.6 A Lognormal Tied Mixture Model of Pitch forProsody-Based Speaker Recognition 1391Kemal M. Sonmez, Larry Heck, Mitchcl Wcinlxaub, F.lizabcthShribergSRI International, USA
SF,SSION: W1DSpeech Enhancement IChair: Hynek Hermansky, Oregon Graduate Inst. ofScience and Tech., USA
W1D.1 Residual Noise Suppression Using PsychoacousticCriteria 1395Tim Haulick, Klaus Linhard, Peter SchrogmeierDaimlerBenzAG, Germany
W1D.2 Processing Linear Prediction Residual for SpeechEnhancement 1399*B. Yegnanarayana, Carlos Avendano, Hynek Hermansky,*P.Satyanarayana MurthyOregon Graduate Institute of Science and Technology, USA*ITT MADRAS, India
W1DJ Combined Acoustic Echo Control and NoiseReduction for Mobile Communications 1403Stefan Gustafsson, Rainer MartinAachen Univ. of Technology, Germany
W1D.4 A Nonstationary Autoregressive IIMM and itsApplication to Speech Enhancement 1407Ki Yong Lee, Yeol RhcemChangwon National Univ., Korea
W1D.5 Spectral Subtraction and Mean Normalization inthe Context of Weighted Matching Algorithms 1411Nestor Becerra Yoma, Fergus R. Mclnnes, Mervyn A. JackUniv. of Edinburgh, UK
W1D.6 Improving the Intelligibility of Noisy Speech Usingan Audible Noise Suppression Technique 1415Dionysios Tsoukalas, John Mourjopoulos, George KokkinakisUniv. ofPutras, Greece
SESSION: W2ASpoken Language UnderstandingChair: toannis Dologlou, ESAT-M12,KV-Uuven,Belgium
W2A.1 Automatic Acquisition of Salient GrammarFragments for Call-Type Classification 1419Jerry H. Wright, Allen L. Gorin, Giuseppe RiccardiAT&T Labs-Research, USA
W2A.2 Stochastically-Based Natural LanguageUnderstanding Across Tasks and Languages 1423Minker WolfgangUMSI, France
W2A3 Transducer Composition for Context-DependentNetwork Expansion 1427Michael Riley, Fernando Pereira, Mehryar MohriAT&T Labs, USA
W2A.4 Giving Prosody a Meaning 1431Christian Lieske, *Johan Bos, tMartin Emele, $Bj6rnGamback, *C.J. RuppSwiss Federal Institute of Technology Lausanne, Switzerland*Univ. of Saarland, GermanyfUniv. of Stuttgart, GermanyiRoyal Institute of Technology, Sweden
W2A.5 Feature-Based Language Understanding 1435Kishore A. Papineni, Salim Roukos, Todd R. WardIBM, USA
W2A.6 Speech Translation Based on AutomaticallyTrainable Finite-State Models 1439Juan Carlos Amengual, *Jose Miguel Benedi, fKlaus Beulen,*Francisco Casacuberta, Asuncion Castano, AntonioCastellanos, *VictorM. Jimenez, *David Llorens, AndresMarzal, fHermann Ney, Federico Prat, *Enrique Vidal, JuanMiguel VilarUniversitdt Jaume I, Spain*Universidad Politecnica de Valencia, SpaintRWTH, Germany
SESSION: W2BLanguage Model AdaptationChair: Herman Ney. RWTH, Germany
W2B.1 Document Space Models Using Latent SemanticAnalysis 1443Yoshihiko Gotoh, Steve RenalsUniv. of Sheffield, UK
W2B.2 Adaptive Topic-Dependent Language ModellingUsing Word-Based Varigrams 1447Sven C. Martin, Jorg Iiermann, Hermann NeyRWTH Aachen, Germany
W2B.3 A Latent Semantic Analysis Framework for Large-Span Language Modeling 1451Jerome R. BellegardaApple Computer Inc. USA
W2B.4 A Maximum Likelihood Model for TopicClassification of Broadcast News 1455Richard Schwartz, *Toru Imai, Francis Kubala, Long Nguyen,John MakhoulBBN Systems and Technologies, USA*NHK, Japan
W2B.S Language Modelling for Task-Oriented DomainsCosmin Popovici, *PaoIo BaggiaICI-Instituto de Cercetari in Informatica, Romania*Centro Studi e Luboratori Telecomunicazioni(CSELT), Italy 1459
W2B.6 Chinese Language Model Adaptation Based onDocument Classification and Multiple Domain-SpecificLanguage Models 1463Sung-Chien Lin, Chi-Lung Tsai, *Lee-Feng Chien, *Ker-JiannChen, *Iin-Shan LeeNational Taiwan Univerisity, ROChina*Academia Sinica, ROChina
SESSION: W2CProsody and Speech Recognition/ UnderstandingChair: Jan van Santen, Bell Labs-Lucent Technologies,USA
W2C.1 Estimating Prosodic Weights in a Syntactic-Rhythmical Prediction System 1467Langlais PhilippeCERI, France
W2C.2 Syntactic Information Contained in ProsodicFeatures of Japanese Utterances 1471Kazuhiko Ozeki, Kazuyuki Kousaka, Yujie ZhangThe Univ. of Electro-Communications, Japan
W2C.3 Hierarchical Duration Modelling for SpeechRecognition Using the ANGIE Framework 1475Grace Chung, Stephanie SeneffMIT Laboratory for Computer Science, USA
W2C.4 On the Use of Prosody in a Speech-to-SpeechTranslator 1479Volker Strom, Anja Eisner, Wolfgang Hess, *Walter Kasper,tAlexandra Klein, *Hans Ulrich Krieger, $J6rg Spilker, fHansWeber, JGiinther GorzUniv. of Bonn, Germany*German Research Center for AI,GermanyfUniv. of Wein,AustriafUniv. of Erlangen-Numberg, Germany
W2C.5 Automatic Recognition of Sentence Type fromProsody in Dutch 1483Vincent J. van Heuven, *Judith Haan, Jos J.A. PacillyLeiden University, The Netherlands*Nijmegen University, The Netherlands
W2C.6 Automatic Word Demarcation Based on ProsodyPaul Munteanu, Bertrand Caillaud, Jean-Francois Serignat,Genevieve Caelen-HaumontCUPS/IMAG.France 1487
SESSION: W2DWideband Speech CodingChair: Jean Pierre Martens, Univ. of Gent, Belgium
W2D.1 A 16-KBIT/S Wideband Speech Codec ScalablewithG.729 1491Akitoshi Kataoka, Sachiko Kurihara, Shigeaki Sasaki, ShinjiHayashiNTT, Japan
W2D.2 Comparison of Auditory Masking Models forSpeech Coding 1495Michelle E. Lynch, Eliathamby Ambikairajah, *Andy DavisRTC, Ireland*BTLabs, UK
W2D.3 Wideband Speech Coding Based on the MBEStructure 1499Anne Amodio, Gang FengUniv. Stendhal/INPG, France
W2D.4 Perceptual Filter Comparisons for Wideband andFM Bandwidth Audio Coders 1503Marcos Perreau-Guimaraes, *Nicolas Moreau, MadeleineBonnetUniv. Rene Descartes - Paris 5, France*ENST, France
W2D.5 Wideband Coding of Speech Using Neural NetworkGain Adaptation 1507Cheung-Fat Chan, Man-Tak ChuCity Univ. of Hong Kong, Hong Kong
W2D.6 Wideband-Speech APVQ Coding From 16 To 32KBPS 1511Josep M. SalavedraUniversitat Politecnica de Catalunya, Spain
SESSION: WMASpeech Recognition, to Adverse lavirotaaeBts, CSRartel Error Analysis'Chair, Lerilamel, UMSt-CNRS, France
WMA.1 A Comparative Analysis of Blind ChannelEqualization Methods for Telephone Speech RecognitionWei-Wen Hung, Hsiao-Chuan WangNational Tsing Hua Univ., ROChina 1515
WMA2. HMM Retraining Based on State DurationAlignment for Noisy Speech Recognition 1519Wei-Wen Hung, Hsiao-Chuan WangNational Tsing Hua Univ., ROChina
WMA3 Fast Parallel Model Combination NoiseAdaptation Processing 1523Yasuhiro Komori, Tetsuo Kosaka, Hiroki Yamamoto,Masayuki YamadaCanon Inc., Japan
WMA.4 Speech Recognition Module for CSCW Using aMicrophone Array 1527Takashi Endo, Shigeki Nagaya, Masayuki Nakazawa, KiyoshiFumkawa, Ryuuichi OkaReal World Computing Partnership, Japan
WMA.S Relative Mel-Frequency Cepstral CoefficientsCompensation for Robust Telephone Speech Recognition"Jiqing Han, Munsung Han, Gyu-Bong Park, Jeongue Park,•Wen GaoSystems Engineering Research Institute, £77?/, Korea*Harbin Institute of Technology, ROChina 1531
WMA.6 Robust Speech Detection Method for SpeechRecognition System for Telecommunications Networks andITS Field Trial 1535Seiichi Yamamoto, Naito Masaki, Shingo KuroiwaKDD R&D Labs, Japan
WMA.7 The Tuning of Speech Detection in the Context of aGlobal Evaluation of a Voice Response System 1539Laurent Mauuary, Lamia KarrayFrance Telecom, France
WMA.8 New Methods in Continuous Mandarin SpeechRecognition 1543C. Julian Chen, Ramesh A. Gopinath, Michael D. Monkowski,Michael Picheny, Katherine ShenIBM, USA
WMA.9 Automanic Transcription of General Audio Data:Effect of Environment Segmentation on PhoneticRecognition 1547Michelle S. Spina, Victor ZueMIT, USA
WMA.10 Automatic Recognition of Continuous CantoneseSpeech with Very Large Vocabulary 1551Alfred Ying Pang NG, LW. Chan, P.C. ChingChinese Univ. of Hong Kong, Hong Kong
WMA.ll Source Normalization Training for IIMMApplied to Noisy Telephone Speech Recognition 1555Yifan GongTexas Instruments, USA
WMA.12 The Development of a Speaker IndependentContinuous Speech Recognizer for Portuguese 1559Joao P. Neto, *Ciro A. Martins, *Luis B. Almeida1ST, Portugal*INESC, Portugal
WMA.13 Blame Assignment for Errors Made by LargeVocabulary Speech Recognizers 1563Lin ChaseCarnegie Mellon Univ.,USA
WMA.14 Predicting Speech Recognition PerformanceAtsushi NakamuraATRITL, Japan 1567
WMA.15 A Voice Activity Detector for the ITU-T 8kbit/sSpeech Coding Standard G.729 1571Scott D. Watson, Barry M.G. Cheetham, *P.A. Barret, *W.T.K.Wong, *A.V. LewisThe Univ. of Liverpool, UK*BT Laboratories, UK
WMA.16 Vocabulary-Independent Recognition ofAmerican Spanish Phrases and Digit Strings 1575Yeshwant K. Muthusamy, John J. GodfreyTexas Instruments, USA
WMA.17 Recognition of Spoken and Spelled ProperNames 1579Michael Meyer, Hermann HildUniv. Karlsruhe, Germany
WMA.18 HMM Compensation for Noisy SpeechRecognition Based on Cepstral Parameter GenerationTakao Kobayashi, Takashi Masuko, *Keiichi TokudaTokyo Institute of Technology, Japan*Nagoya Institute of Technology, Japan 1583
WMA.19 On the Robustness of the Critical-Band AdaptiveFiltering Method for Multi-Source Noisy SpeechRecognition 1587George Nokas, Evangelos Dermatas, George KokkinakisUniv. ofPatras, Greece
WMA.20 A Space Transformation Approach for RobustSpeech Recognition in Noisy Environments 1591Cun-tai Guan, Shu-hung Leung, Wing-hong LauCity Univ. of Hong Kong, Hong Kong
WMA.21 Robust Isolated Word Recognition Using theWSP-PMC Combination 1595Tzur Vaich, Arnon CohenBen Gurion Univ., Israel
SESSION: WM BMultimodal Speech Processing, Emerging Techniquesand ApplicationsCliair: Giorgio Micca, Centra Sludi e iMbaraiotiTetecomunicazioni (CSBLT), Italy
WMB.l Fuzzy Logic for Rule-Based Formant SpeechSynthesis 1599Spyros Raptis, George CarayannisILSP, Greece
WMB.2 Integrating Acoustic and Labial Information forSpeaker Identification and Verification 1603Pierre Jourlin, *Juergen Luettin, *Dominique Genoud, *HubertWassnerUA/ID1AP, France*IDIAP, Switzerland
WMB J Subword Unit Representations for SpokenDocument Retrieval 1607Kenney Ng, Victor ZueMIT, USA
WMB.4 Non-Linear Representations, Sensor ReliabilityEstimation and Context-Dependent Fusion in theAudiovisual Recognition of Speech in Noise 1611Pascal Teissier, Jean-Luc Schwartz, *Anne Guerin-DugueICP, INPG, France*Laboratoire de Traitement D'Images et de Reconnaissancedes Formes, France
WMB.5 Securized Flexible Vocabulary Voice MessagingSystem on Unix Workstation with ISDN ConnectionPhilippe Renevey, Andrzej Drygajlo 1615Swiss Federal Institute of Technology Lausanne, Switzerland
WMB.6 Automatic Deriving of Multiple Variants ofPhonetic Trancriptions from Acoustic Signals 1619Houda Mokbel, Denis JouvetFrance Telecom, France
WMB.7 Improved Bimodal Speech Recognition Using Tied-Mixture HMMs and 5000 Word Audio - VisualSynchronous Database 1623Satoshi Nakamura, Ron Nagai, Kiyohiro ShikanoNara Institute of Science and Technology, Japan
WMB.8 On the Use of Phone Duration and SegmentalProcessing to Label Speech Signal 1627Philippe Depambour, Regine Andre-Obrecht, *Bernard DelyonIRIT - Equipe IHMPT, France*IRISA, France
WMB.9 Automatic Detection of Disturbing Robot Voice-and Ping Pong-Effects in GSM Transmitted SpeechMartin Paping, Thomas FahnleAscom Systec AG, Switzerland 1631
WMB.10 Speech Synthesis Using Phase VocoderTechniques 1635Joseph Di MartinoUniv. Henri Poincare Nancy I, France
WMB.ll Integration of Eye Fixation Information withSpeech Recognition Systems 1639Ramesh R. Sarukkai, *Craig HunterUniv. of Rochester, USA*Univ. of Rochester,
WMB.12 Generation of Broadband Speech fromNarrowband Speech Using Piecewise Linear MappingYoshihisa Nakatoh, M. Tsushima, T. NorimatsuMatsushita Electric Industrial Co, Ltd, Japan 1643
WMB.13 An Assessment of the Benefits Active NoiseReduction Systems Provide to Speech Intelligibility inAircraft Noise Environments 1647Ian E.C. RogersDefence Evaluation and Research Agency, UK
WMB.14 OLGA - A Dialogue System with an AnimatedTalking Agent 1651Jonas Beskow, Kjell Elenius, *Scott McGlashanKTH, Sweden*Swedish Institute for Computer Science, Sweden
WMB. 15 Towards Usable Multi modal CommandLanguages: Definition and Ergonomic Assessment ofConstraints on Users' Spontaneous Speech and GesturesSandrine Robbe, Noelle Carbonell, *Claude ValotCRIN, France*IMASSA-CERMA, France 1655
WMB.16 Exploiting Repair Context in Interactive ErrorRecovery 1659Bernhard Suhm, Alex WaibelCarnegie Mellon Univ., USA
WMB.17 An Hybrid Image Processing Approach toLipTracking Independent of Head Orientation 1663Lionel Reveret, *Frederique Garcia, tChristian Benoit, *EricVatikiotis-BatesonINPG/ENSERG, France*ATR, JapantINPG, France
WMB.18 Automatic Modeling of Coartriculation in Text-To Visual Speech Synthesis 1667BertrandLeGoffUniv. of Stendhal, France
WMB.19 A Multimedia Platform for Audio-Visual SpeechProcessing 1671Ali Adjoudani, Thierry Guiard-Marigny, Bertrand Le Goff,Lionel Reveret, Christian BenoitUniv. of Stendhal, France
WMB.20 An Intelligent System for Information RetrievalOver the Internet Through Spoken Dialogue 1675Hiroya Fujisaki, *Hiroyuki Kameda, Sumio Ohno, Takuya Ito,Ken Tajima, Kenji AbeScience Univ. of Tokyo, Japan*Tokyo Engineering Univ., Japan
WMB.21 Data Hiding in Speech Using Phase CodingYasemin Yardimci, *Enis A Cetin, *Rashid AnsariBilkent University, Turkey*Univ. of Illinois, USA.... 1679
WMB.22 CAVE: An On-Line Procedure for Creating andRunning Auditory-Visual Speech Perception Experiments-Hardware, Software, and Advantages 1683Denis Burnham, John Fowler, Michelle NicolUniv. of NSW, Australia