4
A MULTIMEDIA APPROACH TO LEARNING PHONETIC TRANSCRIPTION Jocelyn Clayards, Martin Holmes, John Esling University of Victoria, Canada ABSTRACT The Linguistic Interactive Phonetics Software (LIPS) program uses digital video clips to provide audio-visual examples of the speech sound inventory of Canadian English and a phonetically balanced set of words. This Windows-based CD-ROM program teaches students to identify phonetic symbols and labels and to progress to word-level phonemic and phonetic transcription. Transcription is performed via keyboard strokes with a widely used font. Not only do students learn the phonetic symbol, they also gain the use of a tool for keyboard entry of transcription outside the realm of the program. Each level of the program has a test module providing feedback on student progress. A final test examines students on recognition and identification of symbols, labels and sounds and finally to perform transcription from the audio-visual database of words. A database-driven algorithm automatically generates test questions ensuring that no two tests are identical. 1. OVERVIEW The program described in this paper is designed as a tool for students to learn phonetic transcription. We found a need in our phonetics teaching to provide an environment where students could practice phonetic theory and transcription at their own pace and receive feedback as to their progress. The LIPS (Linguistic Interactive Phonetic Software) program accomplishes this by using a series of audio-visual clips representing the speech sounds of Canadian English as well as a phonetically balanced set of over 200 words also presented via audio-visual clips. As well as providing an opportunity to develop analytic listening skills and ear training, the use of the audio-visual clips affords the opportunity for close observation of articulator interaction. The software allows manipulation of the video image so that the student can step through the video frame by frame. The view shown in the videos is a close-up of the lips affording the student a view similar to what a lip-reader would see. All program modules are accessible to students at anytime. So although the modules are laid out in a linear fashion in the program work area, students can move freely between and amongst the various program modules. Navigation is easily accomplished through mouse-clicking on the specific module topic. We have found that students often head straight for the test modules. Because the test modules report percentage of correct responses and track the incorrect responses, students can easily identify the areas where they need further work. 1.1. Multi-Media and Phonetics Teaching The teaching of phonetic transcription often starts with the introduction of the IPA symbols and labels. Ear training has traditionally been developed with the instructor producing the sounds of the IPA symbols and perhaps playing tape recordings of the IPA sounds [1] and of different languages. In more recent years, technology in education has used computers to deliver digitized samples of speech via CD-ROM as a vehicle for ear training [2, 3]. An assessment of materials that have been used to teach phonetics and transcription has revealed a group of multimedia programs with differing capabilities. Some of the programs which perform these tasks are the IPA Labels [4], the IPA Tutorials [5], Higgins [6], and Video Phonetics [7]. All of these programs offer digitized audio samples of the phonetic symbols, and some offer video clip close-ups of speakers pronouncing the IPA Symbols (Higgins and Video Phonetics). The IPA Tutorials provide transcription practice by allowing students to listen to a sentence, view its digitized waveform and assign IPA characters to the appropriate locations in the waveform by clicking on the appropriate IPA symbol. The selected symbol then appears at the designated spot in the waveform. However, with the exception of the LIPS program, none of the programs mentioned above combine video clips and reading exercises, or allow natural keyboard entry for transcription of the IPA symbols. This last point is a valuable asset of LIPS. By implementing the readily available SIL IPA font as the vehicle for transcription, students learn to use the keystrokes that activate this font. Transcription may also be performed by mouse-clicking on a select symbol to have the symbol appear in the transcription area. Student can take advantage of the Windows operating system feature which displays a tool tip when the mouse is placed over a particular icon. In this case when the student places the mouse over a particular symbol, the keystroke combination for that symbol appears in the tool tip window. See Figure 3 for illustration. The advantage of teaching the keystroke combinations is that students may then use the SIL IPA font for other transcription applications or to include phonetic symbols in their academic work outside the realm of the program. The SIL IPA font is shareware available at no charge through the SIL website. 2.0 Program Content The LIPS program uses the speech sounds of Canadian English to introduce a subset of IPA symbols and labels from within the programs Phoneme module. Students have the opportunity within the Phoneme module to learn the IPA label and symbol for each of the phones. They may select a phone group by either its place or manner of articulation, thus choosing to work with some of the inventory or the whole inventory. Once the phone workset is chosen, the student then advances to a view screen which displays the audio-video clips. In this view screen, students can examine all the sounds identifying each one in turn or comparing between the sounds. Transcription performed within this module is broad phonemic transcription. Allophones are introduced in the Phones module, thus initiating students to a more detailed phonetic transcription. Further ear training and transcription practice is available by accessing the digitized sound files in the programs Other Languages module. Students then have the opportunity to practice their transcription skills of another language by page 1133 ICPhS99 San Francisco

a multimedia approach to learning phonetic transcription

Embed Size (px)

Citation preview

Page 1: a multimedia approach to learning phonetic transcription

A MULTIMEDIA APPROACH TO LEARNING PHONETICTRANSCRIPTION

Jocelyn Clayards, Martin Holmes, John EslingUniversity of Victoria, Canada

ABSTRACTThe Linguistic Interactive Phonetics Software (LIPS) programuses digital video clips to provide audio-visual examples of thespeech sound inventory of Canadian English and a phoneticallybalanced set of words. This Windows-based CD-ROM programteaches students to identify phonetic symbols and labels and toprogress to word-level phonemic and phonetic transcription.Transcription is performed via keyboard strokes with a widelyused font. Not only do students learn the phonetic symbol, theyalso gain the use of a tool for keyboard entry of transcriptionoutside the realm of the program. Each level of the program hasa test module providing feedback on student progress. A finaltest examines students on recognition and identification ofsymbols, labels and sounds and finally to perform transcriptionfrom the audio-visual database of words. A database-drivenalgorithm automatically generates test questions ensuring that notwo tests are identical.

1. OVERVIEWThe program described in this paper is designed as a tool forstudents to learn phonetic transcription. We found a need in ourphonetics teaching to provide an environment where studentscould practice phonetic theory and transcription at their own paceand receive feedback as to their progress. The LIPS (LinguisticInteractive Phonetic Software) program accomplishes this byusing a series of audio-visual clips representing the speechsounds of Canadian English as well as a phonetically balanced setof over 200 words also presented via audio-visual clips. As wellas providing an opportunity to develop analytic listening skillsand ear training, the use of the audio-visual clips affords theopportunity for close observation of articulator interaction. Thesoftware allows manipulation of the video image so that thestudent can step through the video frame by frame. The viewshown in the videos is a close-up of the lips affording the studenta view similar to what a lip-reader would see.

All program modules are accessible to students at anytime.So although the modules are laid out in a linear fashion in theprogram work area, students can move freely between andamongst the various program modules. Navigation is easilyaccomplished through mouse-clicking on the specific moduletopic. We have found that students often head straight for the testmodules. Because the test modules report percentage of correctresponses and track the incorrect responses, students can easilyidentify the areas where they need further work.

1.1. Multi-Media and Phonetics TeachingThe teaching of phonetic transcription often starts with theintroduction of the IPA symbols and labels. Ear training hastraditionally been developed with the instructor producing thesounds of the IPA symbols and perhaps playing tape recordingsof the IPA sounds [1] and of different languages. In more recentyears, technology in education has used computers to deliverdigitized samples of speech via CD-ROM as a vehicle for ear

training [2, 3]. An assessment of materials that have been used toteach phonetics and transcription has revealed a group ofmultimedia programs with differing capabilities. Some of theprograms which perform these tasks are the IPA Labels [4], theIPA Tutorials [5], Higgins [6], and Video Phonetics [7].

All of these programs offer digitized audio samples of thephonetic symbols, and some offer video clip close-ups ofspeakers pronouncing the IPA Symbols (Higgins and VideoPhonetics). The IPA Tutorials provide transcription practice byallowing students to listen to a sentence, view its digitizedwaveform and assign IPA characters to the appropriate locationsin the waveform by clicking on the appropriate IPA symbol. Theselected symbol then appears at the designated spot in thewaveform. However, with the exception of the LIPS program,none of the programs mentioned above combine video clips andreading exercises, or allow natural keyboard entry fortranscription of the IPA symbols.

This last point is a valuable asset of LIPS. By implementingthe readily available SIL IPA font as the vehicle for transcription,students learn to use the keystrokes that activate this font.Transcription may also be performed by mouse-clicking on aselect symbol to have the symbol appear in the transcription area.Student can take advantage of the Windows operating systemfeature which displays a Òtool tipÓ when the mouse is placed overa particular icon. In this case when the student places the mouseover a particular symbol, the keystroke combination for thatsymbol appears in the tool tip window. See Figure 3 forillustration. The advantage of teaching the keystrokecombinations is that students may then use the SIL IPA font forother transcription applications or to include phonetic symbols intheir academic work outside the realm of the program. The SILIPA font is shareware available at no charge through the SILwebsite.

2.0 Program ContentThe LIPS program uses the speech sounds of Canadian English tointroduce a subset of IPA symbols and labels from within theprogramÕs Phoneme module. Students have the opportunitywithin the Phoneme module to learn the IPA label and symbol foreach of the phones. They may select a phone group by either itsplace or manner of articulation, thus choosing to work with someof the inventory or the whole inventory. Once the phone worksetis chosen, the student then advances to a view screen whichdisplays the audio-video clips. In this view screen, students canexamine all the sounds identifying each one in turn or comparingbetween the sounds. Transcription performed within this moduleis broad phonemic transcription.

Allophones are introduced in the Phones module, thusinitiating students to a more detailed phonetic transcription.Further ear training and transcription practice is available byaccessing the digitized sound files in the programÕs OtherLanguages module. Students then have the opportunity topractice their transcription skills of another language by

page 1133 ICPhS99 San Francisco

Page 2: a multimedia approach to learning phonetic transcription

Figure 1. This figure shows the matching IPA label to IPA symbol task for the Canadian English consonant inventory. The LIPSprogram displays the IPA label along the banner at the bottom of the work area. Students must then click on the symbol whichcorresponds to the label. An algorithm within the program ensures that the test questions are presented in a random order each time thetest module is initiated. Correct responses are reported at the bottom left corner of the work area

Figure 2: This figure shows the module which test ability to match the transcription to the audio-visual clip. Note the tool tip invitingusers to adjust the slide to Òstep-throughÓ the video frame by frame, enabling close observation of articulator interaction.

page 1134 ICPhS99 San Francisco

Page 3: a multimedia approach to learning phonetic transcription

Figure 3. This figure shows transcription from within the phonemic module. Students may click on the IPA symbol at the left or use thekeyboard to enter their transcription. Note the tool tip in the IPA symbols column which shows the keystroke for the IPA symbol. Also

note the ÒHint:Ó banner which cues students to the next correct symbol.

Figure 4:.This figure illustrates one of the Tutorial Modules. Note the tutorial promotes analytic listening by playing three differentsamples which focus the student to listen to the phonetic feature under discussion.

page 1135 ICPhS99 San Francisco

Page 4: a multimedia approach to learning phonetic transcription

transcribing the digitized speech samples of Miriam. TheAustronesian language Miriam was chosen as it uses elementsfrom the same phone set as Canadian English, but exhibiting amuch simpler vowel inventory. This affords students the practiceto listen and transcribe at a purely phonemic level rather thandrawing upon the phonology of their own language.

Each level of the program has a test module providingfeedback on student progress. Students receive immediatefeedback as to whether their answers are correct. Correctresponses are reported as an overall percentage. The programtracks incorrect responses. The questions are repeated within thetest module giving students a second opportunity to answer themcorrectly. This feature alerts students to the areas where theyneed more practice. Test questions are automatically generatedby the program. This is accomplished through a database-drivenalgorithm which creates a question with the correct answer butalso simultaneously generates plausible distractor answers.Question and answer types are generated in a random order sothat no two tests are alike. This serves to keep the test modulestruly interactive in that a student cannot memorize the order ofthe questions or their answers.

The Tutorials module in the program covers such topics asAspiration, Voicing, Vowel Length, the Glottal Stop and theFlap. The concept under discussion is unfolded as students worktheir way through a series of screens. Audio visual clips serve tofocus the student on the concept being discussed. This techniqueof focusing the student on a particular phonetic feature develops astudentÕs analytic listening skills as proposed by Ashby et al. [8]at University College London. See Figure 4 for illustration of atutorial view screen. Tutorial information is supplemented by anon-line Glossary that is accessed through its own module.

3.0 Pedagogy: Phonetic and OtherwiseLIPS embraces the philosophy that students learn by doing. Inkeeping with learning theory, the instructional design of theprogram provides the venue to allow students to learn at theirown pace, presenting a variety of learning scenarios in discretemodules that build upon one another and develop differentphonetic skills. Students must distinguish between and amongstthe phones both auditorially and visually. They are asked toextend this knowledge to the word level, first learning totranscribe at the phonemic level and then at the phonetic level.They are tested on their reading skills, their listening skills andtheir transcription skills. The program provides the environmentfor repeated practice of these skills and feedback is providedthrough the test modules at each level.

Finally, LIPS has been tested by students of IntroductoryLinguistics and Introductory Phonetics at the University ofVictoriaÕs Computer Assisted Language Learning (CALL)Facility. The program was accessed via network throughWindows 95 based IBM Pentium systems. Many of the studentswere also learners of English as a Foreign Language and foundthe program very useful particularly to practice and analyse theCanadian vowel system. Of particular interest was the ability tofocus on vowel length and tense/lax distinctions. One studentwas even observed using a mirror to practice lip positioningmodeling the example provided by the program.

ACKNOWLEDGMENTSWe wish to thank Ewa Czaykowska-Higgins for her comments andsuggestions concerning program content. We thank Tom Hukari and theLinguistics 100 students as well as Suzanne Urbanczyk and theLinguistics 250 students for their valuable feedback in testing and

identifying program bugs. We thank Lynn Marie Heap for herarticulatory contribution to the project. We are most grateful to JudsonTunnell and the staff of the University of Victoria CALL Facility formaking the program available to students.

REFERENCES[1] Wells, J. C. and House, J. (1995). The Sounds of the IPA on cassettetape. London, University College London.[2] Esling, J. H. (1994). University of Victoria Phonetic Database,Version 3.0, on CD-ROM. Victoria, Speech Technology Research Ltd.[3] Sounds of the WorldÕs Languages (1991). Hypercard stack, LosAngeles, UCLA.[4] Esling, J. H. (1992). IPA Labels: A Hypercard Approach to Learningthe International Phonetic Alphabet. Victoria, University of Victoria.[5] Dickson, B.C. (1994). An Instruction Manual for the IPA Tutorials.Victoria: Speech Technology Research Ltd.[6] Smith, Ian (1997). Higgins: A Multimedia Program for theInstruction of the International Phonetic Alphabet. Toronto, YorkUniversity.[7] Kay Elemetrics (1999). Video Phonetics on CD-ROM. Pine Brook,New Jersey.[8] Ashby, M., Maidment, J., and Abberton, E., (1996). AnalyticListening: A New Approach to Ear-Training . University College London,retrieved 02/11/99; http://www.phon.ucl.ac.uk/home/sh19/ashby/ma.htmBody of the Acknowledgment: 8 point, normal, left & right justified.

page 1136 ICPhS99 San Francisco