34
Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009 1:30 – 4:30 pm

Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Embed Size (px)

Citation preview

Page 1: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Introduction to Speech Recognition Software

Scott A. Dougherty, M.Ed.Cricket Rizzo, MS, OTR/L, ATP

2009 AT-OT-PT Summer InstituteTuesday, June 16, 2009

1:30 – 4:30 pm

Page 2: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

About the Presentation

• Links are programmed into the globes at the lower right corner of the slide

• Links are noted in the slide’s notes• The presentation is available

electronically for download• Information on pricing and features was

current as of May 2009 (i.e., use at your own peril)

Page 3: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

About the Presenters

• Cricket Rizzo, MS, OTR/L, ATP• Scott A. Dougherty, M.Ed.

• Questions are welcome at any time• We like to pick on the people in the

back row• We plan to enjoy these three hours –

we hope you do, too

Page 4: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Session Description

• Explore the use of speech recognition software for writing

• Explore the use of speech recognition software for computer access

• Discuss common software and hardware

• Demonstrate microphone initialization, dictation, and computer control

• Share typical scenarios and obstacles

Page 5: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

DefinitionsAT SERVICES“Any services that directly assist in the selection, acquisition, or use of an assistive technology device.”

AT DEVICE“Any item, piece of equipment, or product system, whether acquired commercially off the shelf, modified or customized, that is used to increase, maintain, or improve the functional capabilities of individuals with disabilities.”

(PL 100-407, Section 3, 1988)

Page 6: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Assistive Technology in Legislation• IDEIA 2004 (34 CFR Parts 300 and 301 )• Early Intervention Act (PL-99-336)• Technology-Related Assistance for Individuals with

Disabilities Act of 1988 or “The Tech Act” (PL-100-407)

• Americans with Disabilities Act (PL-101-336)• Entitlement Legislation:

– Rehabilitation Act of 1973 (PL-93-112, as amended)– Rehabilitation Act Amendments of 1998

• Section 508 compliance

Page 7: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Why explore speech recognition in schools?• My student has difficulty accessing the

computer.

• My student has difficulty using a pencil and/or keyboard.

• My student has anxiety about writing with a pencil and/or keyboard.

• I use it in my law/medical practice. It works beautifully and I believe my Kindergartener needs it, too.

Page 8: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

How does it work?

• Speech converted from analog to digital

• Signal broken into phonemes

• Phoneme pattern compared to dictionary

• Matching term is displayed (text) or used (command)

HowStuffWorks, Inc.

Page 9: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Microsoft Speech Recognition

• Included as part of Microsoft Windows XP and Vista

• Start > Control Panel > Speech

• Utilizes the Dragon NS engine

Page 10: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Dragon NaturallySpeaking Professional, Medical, & Legal

• Accurate recognition• Create and edit documents• Email, IM, and web by voice• Voice shortcuts• Audio recorder support• Network compatible for multiple users• Robust commands• Third-party correction (L&M)• Supportable on Electronic Medical

Records systems (M)

Page 11: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Dragon NaturallySpeaking Preferred• Accurate recognition• Create and edit

documents• Email, IM, and web by

voice• Voice shortcuts• Audio recorder support

Page 12: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Dragon NaturallySpeaking Standard• Accurate recognition• Create and edit

documents• Email, IM, and web by

voice

Page 13: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

IBM ViaVoice

• Editions– Mac OS X– Pro USB– Advanced– Standard– Personal

• May be a good option for “legacy” computers

Page 14: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

SpeakQ

• Continuous recognition• Discrete recognition• Requires installation of WordQ

version 2• Simultaneous use of keyboard

is encouraged

Page 15: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

MacSpeech Dictate

• Replaced iListen software• Support for audio recorders is

pending• Dictation and command

capability• Read and edit documents that

were not initially created with MacSpeech Dictate

Page 16: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Hardware Considerations

• Analog microphone• Digital microphone• Desktop array• Analog voice recorder• Digital voice recorder• Jouse• Head pointer

– Tracker Pro– HeadMouse

Page 17: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Hardware – Analog Microphone

• 3.5 mm (1/8”) connection

• One connection for headphone, second connection for microphone

• Connects to sound card directly or via noise-cancelling adapter

• Boom maintains distance from mouth

Page 18: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Hardware – Digital Microphone

• Male USB connection

• One connection for headphone and microphone

• Connects via USB port

• Boom maintains distance from mouth

Page 19: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Hardware – Desktop Array

• May be placed on monitor, desk, or shelf

• Ideal for use at a dedicated station

• Eliminates issues with head microphones

• Adjustable spectrum of reception

Page 20: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Hardware – Analog Voice Recorder

• Tape-based• Incapable of allowing revision

of recordings• Generally not compatible with

speech recognition software – verify before you buy

Page 21: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Hardware – Digital Voice Recorder

• Chip-based• Allows for inserts

and deletions• Compatibility with

speech recognition varies

Page 22: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Hardware - Jouse

• Alternative means of input

• Useful for mouse control and keyboarding

• Another hands-free interface for the computer

Page 23: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Hardware – Head Pointer

• Hands-free option• Keyboard access is

necessary or desired for use with some speech recognition software

Page 24: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Hardware – Head Pointer

• Tracker Pro and HeadMouse– uses reflective dot on forehead to reflect

input (head movement) back to camera mounted on computer; camera translates head movement into mouse movement

– need another means to perform clicks, double clicks, right clicks, and drag

• Software i.e. Magic Cursor, Dragger 32– Can dwell or click with a switch

Page 25: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Hardware – Head Pointer

• HeadMouse Extreme

• Tracker Pro

Page 26: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Dictation…What’s the process?

• Initialization– Audio detection– Speech recognition

• Voice profile creation• Dictation

– Continuous– Discrete

• EditingBrian Basset and Microsoft

Corporation

Page 27: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Initialization

• Provides guidance on microphone placement

• Tests the microphone connection

• Analyzes ambient noise levels

Page 28: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Voice Profile Creation

• Provides sample text for dictation

• Builds a profile that matches speech sounds to expected letter patterns

• May be skipped in some software

• Available for subsequent accuracy trainings

Page 29: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Dictation

• Continuous dictation – Speak and phrases,

sentences, or words

• Discreet dictation– Input text with

microphone

– Select dictation with keyboard

• Punctuation may be inserted manually or automatically

Page 30: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Editing• Scratch That• Select/Correct/Delete

That• Train that• Bold/Italicize/Underline

that

• Click [menu item]• Move [direction]

[number] [character/word]

• Move mouse [direction]

Page 31: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Training for Computer Access• Complete tutorials provided with software• Dictate from book or magazine for practice including

punctuations i.e. , . ? “ and formatting• Dictate from “off the top of your head”

– Usually takes more time when dictating from thoughts rather than hard copy

– Watch for pauses, umms, uhhs, mumbling

• Dictation• Formatting• Use in Applications i.e. Word, E-mail, Web Access• Timeframe varies with cognitive abilities and

computer expertise; typical user – 20 hours to cover basics

Page 32: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Talking Points

• Initializing struggling readers

• Use as a note taking tool

• Editing errors – keyboard or the mouse?

• Multiple profiles for a single user

• Computer access strategies

• Dragon Naturally Speaking on a Mac?

• Use at school and home

Page 33: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

More Talking Points

• “I want all of the teachers to dictate their lectures”

• Dictating “under the weather”

• Matching microphones to users and environments

• Test-taking

Page 34: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009

Contact informationScott A. DoughertyIDEA Training and Consultation Coordinator, AT

Allegheny Intermediate Unit #3475 East Waterfront DriveHomestead, PA 15120-1144

[email protected]

Cricket RizzoMS, OTR/L, ATPOccupational Therapist

Westmoreland Intermediate Unit #7102 Equity Drive Greensburg, PA 15601

[email protected](724) 836-2460 , ext. 2193

These presentation materials can be downloaded at http://www.aiu3.net/Level3.aspx?id=3822