16

Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM

  • Upload
    others

  • View
    6

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM
Page 2: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM

Copyright © 2018, Oracle and/or its affiliates. All rights reserved. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |

Oracle Digital Assistant The Complete Training

2

Voice

Image courtesy of pixabay.com

Page 3: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM

Copyright © 2018, Oracle and/or its affiliates. All rights reserved.

Safe Harbor StatementThe following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle.

3

Page 4: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM

Copyright © 2018, Oracle and/or its affiliates. All rights reserved.

Topic agenda

4

Introduction to voice

Designing for voice channels

1

2

Page 5: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM

Copyright © 2018, Oracle and/or its affiliates. All rights reserved.

Topic agenda

5

Introduction to voice

Designing for voice channels

1

2

Page 6: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM

Copyright © 2018, Oracle and/or its affiliates. All rights reserved.

Introduction to voice• Voice – we’ve been doing it for about 100,000 years• Ability to be more natural, expressive, ease of use, wearables, hands free• Rise of voice assistants in the home• Specific set of challenges for Enterprise– Data security, privacy, compliance– Domain specific vocabulary– Voice should be “out of the box”

6

Page 7: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM

Copyright © 2018, Oracle and/or its affiliates. All rights reserved.

Introduction to voice• Oracle provides a voice SDK and server with Oracle Digital Assistant• Voice SDK provides speech widget– Push to talk (PTT)– Streams voice to speech server– Converts speech to text– A skill or digital assistant processes text as it if had been typed– DO WE PROVICE THE VOICE FOR TTS?–OPTIONS FOR SETTING UP ALTERNATE TTS

7

Page 8: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM

Copyright © 2018, Oracle and/or its affiliates. All rights reserved.

Voice in Oracle Digital AssistantMobile App

Web App

ODA SDK

ODA SDK

Oracle Digital Assistant

Speech Service

Audio request

Speech results as text

Skill

Speech results as text

Skill/digital assistant response

Page 9: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM

Copyright © 2018, Oracle and/or its affiliates. All rights reserved.

Mobile/Web App

Voice in Oracle Digital AssistantOracle Digital Assistant

Speech Service

Skill

Acoustic Model Pronunciation dictionary

Pronunciation guesser

LanguageGrammar

ODA SDK

Encode/Resample

Record

Stream

Denoise

Endpointing/VAD

State management

Error handling

Page 10: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM

Copyright © 2018, Oracle and/or its affiliates. All rights reserved.

Topic agenda

10

Introduction to voice

Designing for voice channels

1

2

Page 11: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM

Copyright © 2018, Oracle and/or its affiliates. All rights reserved.

Designing for voice channels – speech as input• Speech is just a separate way of providing input; not a whole new

processing path• ODA will convert speech to text then process the text as if it was typed• But there are some differences– Users may say more than they would typically type– Users less likely to normalize entity values– Speech can mask differences between words• Through vs threw (might correctly resolve based on context)• Andi vs Andy (almost impossible to know which)

11

Page 12: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM

Copyright © 2018, Oracle and/or its affiliates. All rights reserved.

Designing for voice channels – design considerations• Try to elicit longer, more conversational, responses from users– Automatic speech recognition (ASR) does better with more context

• Users remember bad past experiences with poor voice systems– Give guidance on the kind of language and to use full sentences

• Users will go off-script so be ready for anything at any time– Already a feature of using a digital assistant

• Text-to-speech (TTS) is slow so keep spoken responses short

12

Page 13: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM

Copyright © 2018, Oracle and/or its affiliates. All rights reserved.

Designing for voice channels – design considerations• Domain-specific vocabulary can be hard for to recognize–Work around with aliases, synonyms or list selection

• Names are hard to recognize– ASR is good with common names, but can’t recognize names it has never seen before– Bots that use contact lists can work around with list selection/disambiguation instead

• Acronyms and abbreviations are known-hard for ASR– Discourage them in your UI and encourage users to speak fully

13

Page 14: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM

Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 14

Page 15: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM
Page 16: Oracle Digital Assistant - GitHub Pages · Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16. Title: 08_04_Voice 1019 Created Date: 10/18/2019 11:35:42 AM

Copyright © 2018, Oracle and/or its affiliates. All rights reserved. 16

Oracle Digital Assistant Hands-On