State of the Art/Best Practices in
Speech Technology
Dan Burnett, Director of Speech Technologies
Why speech?


                    Ma Ma




                    Vok Say Oh




© Voxeo Corporation
Speech is the natural human interface

      15% of the world's population has a personal
       computer

      More than 60% of the world's population has a
       mobile phone




What is communication?


      [Diagram labels: "Your Customer" and "You (Your speech-enabled IVR)"]




Communication is natural?




                      249694




But for IVRs . . .


      [Diagram labels: "Your Customer" and "You (Your untuned speech-enabled IVR)"]




So why do we tune?

      For better communication, which leads to



                       More satisfied customers
                       Shorter call durations




What can we tune?




                      Your untuned speech-enabled IVR



What we say – prompts

      Goal: naturally reduce variability in callers'
       responses

      Because: predictability simplifies grammars
       and increases recognition accuracy




Prompt tuning

        Vocabulary
          •  Use the words your customers use
          •  For sales, say “sales”; For billing, say “billing”; ...
          •  Are you calling to learn more about our products, to
                fix a problem with your bill, or …

        Keep in mind
          •  Speech allows your customer to describe things
             THEIR way rather than to use your internal company
             description
          •  Make it easier for them to do that!


Prompt tuning

        Prompt specificity
          •  General: “What would you like?”
          •  More specific: “Which department would you like?”
          •  Precise: “Would you like A, B, C, or something else?”

        Keep in mind
          •  The caller will often use the exact words YOU use




Ever heard this before?

        For Sales, press 1
        For Billing, press 2
        For option I can't remember, press 3
        For another option I can't remember, press 4
        For yet another option I can't remember, press 5
        For more of the same, press 6
        Blah blah, press 7
        For help with this menu, press 8
        To hear these options again, press 9

Prompt tuning

        Prompt length
          •  Keep it short: no more than a few sentences in total, only
             one of which asks for input
          •  Or: provide pauses (at least one second long) so the caller
             can interrupt

        Keep in mind
          •  Speech communication is only natural if it's not
             drawn out
          •  Primacy and recency effects: callers best remember the
             first and last options they hear




What we listen for – grammars

      Goal: Cover everything they are likely to say,
       and nothing more

      Because: Accuracy in grammar coverage
       directly affects recognition accuracy




Grammar tuning

        Cover everything they say
          •  Pre- and post-phrases such as “please”, “I would like”,
             and “thank you”
          •  Synonyms such as (for yes/no) “yeah”, “sure”, “absolutely
             not”

        Keep in mind
          •  Recognizers can only hear it if it's in the grammar
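
The coverage idea above can be sketched in a few lines of Python. This is only an illustration, not a real recognizer grammar format: the phrase lists, the department names, and the helper names `expand` and `interpret` are all hypothetical, and real lists should come from what your callers actually say.

```python
from itertools import product

# Hypothetical phrase lists for a department menu (assumptions for illustration).
PRE_PHRASES = ["", "i would like", "i'd like"]
POST_PHRASES = ["", "please", "thank you"]
CORE = {
    "sales": ["sales", "the sales department"],
    "billing": ["billing", "my bill"],
}

def expand(core_synonyms, pre=PRE_PHRASES, post=POST_PHRASES):
    """Map every accepted utterance (pre + synonym + post) to its meaning."""
    utterances = {}
    for meaning, synonyms in core_synonyms.items():
        for p, s, q in product(pre, synonyms, post):
            # Skip empty parts so optional phrases leave no stray spaces.
            phrase = " ".join(part for part in (p, s, q) if part)
            utterances[phrase] = meaning
    return utterances

GRAMMAR = expand(CORE)

def interpret(utterance):
    """Return the meaning, or None when the phrase is not in the grammar."""
    return GRAMMAR.get(utterance.lower().strip())
```

With these lists, "I would like billing please" maps to billing, while a word that was never listed, such as "accounts", is simply not heard.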




Grammar tuning

        Include only what they say
          •  Write grammars that don't overgenerate
          •  If matching numbers/digits, only include valid strings
                if at all possible

        Keep in mind
          •  Every unnecessary grammar phrase is a potential
                misrecognition
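
As a rough sketch of "only include valid strings", suppose (purely as an assumption for this example) that the digit strings being collected end in a Luhn check digit. The helper names `luhn_valid` and `digit_grammar` are hypothetical. Filtering by the check shrinks a 4-digit grammar from 10,000 strings to 1,000.

```python
def luhn_valid(digits: str) -> bool:
    """Return True if the digit string passes the Luhn checksum."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:      # double every second digit, counting from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0

def digit_grammar(length, is_valid=None):
    """List the digit strings of the given length that the grammar accepts."""
    strings = (f"{n:0{length}d}" for n in range(10 ** length))
    return [s for s in strings if is_valid is None or is_valid(s)]

unconstrained = digit_grammar(4)            # every 4-digit string
constrained = digit_grammar(4, luhn_valid)  # only strings that can be valid
```

Every string dropped from the grammar is one fewer phrase the recognizer can confuse with what the caller actually said.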




How we listen – parameter optimization

      Goal: Optimize recognizer parameter settings
      Because: Better accuracy, of course!




Parameter optimization – which parameters?

      Rejection threshold
      Endpointer settings (sensitivity)

      Large grammar parameters




Rejection threshold – what is it?

      Cutoff value for the recognizer confidence below which the
       speaker's utterance will be rejected

      [Figure: "Misrecognitions" and "False Rejections" curves; x-axis: Rejection Threshold, 0 to 100]

Rejection threshold – total error

      [Figure: "Misrecognitions" and "False Rejections" curves; x-axis: Rejection Threshold, 0 to 100]

Rejection threshold – comparison

      [Figure: curves for "ASR Engine A" and "ASR Engine B" with "Optimal thresholds" marked; x-axis: Rejection Threshold, 0 to 100]

Rejection threshold – another comparison

      [Figure: curves for "ASR Engine A" and "ASR Engine B" with "Optimal thresholds" marked; x-axis: Rejection Threshold, 0 to 100]

Parameter optimization

        Rejection threshold
          •  Generally largest impact on accuracy
          •  Optimum varies across recognition engines
          •  Optimum varies by set of active grammars

        Keep in mind
          •  Optimizing the rejection threshold is the SINGLE
                MOST IMPORTANT parameter tuning you can do
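
One way to find the optimum is to sweep the threshold over logged, hand-labeled utterances and pick the value with the lowest total error. The sketch below assumes a 0 to 100 confidence scale and uses made-up sample data; the function names are hypothetical, and for simplicity it ignores out-of-grammar speech, which a real tuning pass would also score.

```python
def error_counts(results, threshold):
    """Count (misrecognitions, false rejections) at a given threshold.

    results: list of (confidence, was_correct) pairs, one per utterance.
    """
    # Accepted but wrong: a misrecognition.
    misrecognitions = sum(1 for conf, ok in results if conf >= threshold and not ok)
    # Rejected although the recognizer had it right: a false rejection.
    false_rejections = sum(1 for conf, ok in results if conf < threshold and ok)
    return misrecognitions, false_rejections

def best_threshold(results, candidates=range(0, 101)):
    """Return the lowest candidate threshold that minimizes total error."""
    return min(candidates, key=lambda t: sum(error_counts(results, t)))

# Made-up sample: (recognizer confidence, was the result correct?)
SAMPLE = [(95, True), (88, True), (80, True), (72, True), (65, True), (40, True),
          (55, False), (45, False), (30, False), (20, False)]
```

Because the optimum varies by engine and by the set of active grammars, the sweep would need to be repeated on data from each combination.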




Endpointer sensitivity


      [Diagram labels: "Your Customer" and "You (Your hard-of-hearing speech-enabled IVR)"]




Parameter optimization

        Endpointer sensitivity
          •  Second-largest impact on accuracy
          •  Unnecessarily high and low sensitivity are both bad
          •  Optimum should be set once, checked annually

        Keep in mind
          •  If the recognizer can't hear you, it can't understand
                what you say
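
To see why both extremes are bad, here is a toy energy-based endpointer. It is a simplified sketch, not how any particular ASR engine works: the 0 to 100 energy scale, the mapping from sensitivity to an energy threshold, and the name `find_speech` are all assumptions made for this example.

```python
def find_speech(energies, sensitivity, min_run=3):
    """Return (start, end) frame indices of the first speech segment, or None.

    energies: per-frame energy on an assumed 0..100 scale.
    sensitivity: 0..1; higher sensitivity means a lower energy threshold.
    """
    threshold = 100 * (1.0 - sensitivity)
    start = None
    for i, energy in enumerate(energies):
        if energy >= threshold:
            if start is None:
                start = i               # possible start of speech
        else:
            if start is not None and i - start >= min_run:
                return (start, i)       # run was long enough to count as speech
            start = None                # too short: treat it as a click or noise
    if start is not None and len(energies) - start >= min_run:
        return (start, len(energies))
    return None

# Made-up frames: background noise around 5-8, quiet speech around 30-40.
FRAMES = [5, 8, 6, 30, 35, 40, 38, 7, 5, 6]
```

With this data, a sensitivity of 0.5 misses the quiet caller entirely, 0.8 isolates the speech frames, and 0.96 treats the background noise as speech and never finds the end of the utterance.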




Parameter optimization

        Large grammar parameters
          •  Typically need to be adjusted if the grammar has more
             than 5000 entries
          •  Adjusted settings typically consume more memory and/or CPU
          •  These parameters vary by ASR engine, so ask

        Keep in mind
          •  If your grammar has many options, your recognizer
                needs to “think” more than the default settings
                usually allow



Summary – Keep in mind

        Speech allows your customer to describe things THEIR way rather than to
         use your internal company description – make it easy for them!

        The caller will often use the exact words YOU use

        Speech communication is only natural if it's not drawn out

        Recognizers can only hear it if it's in the grammar

        Every unnecessary grammar phrase is a potential misrecognition

        Optimizing the rejection threshold is the SINGLE MOST IMPORTANT
         parameter tuning you can do

        If the recognizer can't hear you, it can't understand what you say

        If your grammar has many options, your recognizer needs to “think” more
         than the default settings usually allow

For help
Dan Burnett, Director of Speech Technologies

dburnett@voxeo.com
Voxeo Summit 2010: Best Practices in Speech Technology

Voxeo Summit 2010: Best Practices in Speech Technology

  • 1. State of the Art/Best Practices in Speech Technology Dan Burnett, Director of Speech Technologies
  • 2. Why speech? Ma Ma Vok Say Oh © Voxeo Corporation
  • 3. Speech is the natural human interface • 15% of world population has a personal computer • Greater than 60% of world population has a mobile phone
  • 4. What is communication? [Figure: Your Customer; You (Your speech-enabled IVR)]
  • 5. Communication is natural? 249694
  • 6. But for IVRs . . . [Figure: Your Customer; You (Your untuned speech-enabled IVR)]
  • 7. So why do we tune? For better communication, which leads to: • More satisfied customers • Shorter call durations
  • 8. What can we tune? Your untuned speech-enabled IVR
  • 9. What can we tune? Your untuned speech-enabled IVR
  • 10. What we say – prompts • Goal: naturally reduce variability in caller's responses • Because: predictability simplifies grammars and increases recognition accuracy
  • 11. Prompt tuning. Vocabulary: • Use the words your customers use • For sales, say “sales”; For billing, say “billing”; ... • Are you calling to learn more about our products, to fix a problem with your bill, or … Keep in mind: • Speech allows your customer to describe things THEIR way rather than to use your internal company description • Make it easier for them to do that!
  • 12. Prompt tuning. Prompt specificity: • General: “What would you like?” • More specific: “Which department would you like?” • Precise: “Would you like A, B, C, or something else?” Keep in mind: • The caller will often use the exact words YOU use
  • 13. Ever heard this before? • For Sales, press 1 • For Billing, press 2 • For option I can't remember, press 3 • For another option I can't remember, press 4 • For yet another option I can't remember, press 5 • For more of the same, press 6 • Blah blah, press 7 • For help with this menu, press 8 • To hear these options again, press 9
  • 14. Prompt tuning. Prompt length: • Keep it short: less than a few sentences total, only one of which asks for input • Or: provide pauses (at least one second long) for interruption Keep in mind: • Speech communication is only natural if it's not drawn out • Primacy and recency effects
  • 15. What can we tune? Your untuned speech-enabled IVR
  • 16. What we listen for – grammars • Goal: Cover everything they are likely to say, and nothing more • Because: Accuracy in grammar coverage directly affects recognition accuracy
  • 17. Grammar tuning. Cover everything they say: • Pre- and post- phrases such as please, I would like, and thank you • Synonyms such as (for yes/no) yeah, sure, absolutely not Keep in mind: • Recognizers can only hear it if it's in the grammar
  • 18. Grammar tuning. Include only what they say: • Write grammars that don't overgenerate • If matching numbers/digits, only include valid strings if at all possible Keep in mind: • Every unnecessary grammar phrase is a potential misrecognition
  • 19. What can we tune? Your untuned speech-enabled IVR
  • 20. How we listen – parameter optimization • Goal: Optimize recognizer parameter settings • Because: Better accuracy, of course!
  • 21. Parameter optimization – which parameters? • Rejection threshold • Endpointer settings (sensitivity) • Large grammar parameters
  • 22. Rejection threshold – what is it? [Figure: False Rejections, Misrecognitions; x-axis: Rejection Threshold, 0 to 100]
  • 23. Rejection threshold – what is it? Cutoff value for the recognizer confidence below which the speaker's utterance will be rejected. [Figure: False Rejections, Misrecognitions; x-axis: Rejection Threshold, 0 to 100]
  • 24. Rejection threshold – total error [Figure: False Rejections, Misrecognitions; x-axis: Rejection Threshold, 0 to 100]
  • 25. Rejection threshold – comparison [Figure: ASR Engine A, ASR Engine B; x-axis: Rejection Threshold, 0 to 100]
  • 26. Rejection threshold – comparison [Figure: ASR Engine A, ASR Engine B, with optimal thresholds marked; x-axis: Rejection Threshold, 0 to 100]
  • 27. Rejection threshold – another comparison [Figure: ASR Engine A, ASR Engine B, with optimal thresholds marked; x-axis: Rejection Threshold, 0 to 100]
  • 28. Parameter optimization. Rejection threshold: • Generally largest impact on accuracy • Optimum varies across recognition engines • Optimum varies by set of active grammars Keep in mind: • Optimizing the rejection threshold is the SINGLE MOST IMPORTANT parameter tuning you can do
  • 29. Endpointer sensitivity [Figure: Your Customer; You (Your hard-of-hearing speech-enabled IVR)]
  • 30. Parameter optimization. Endpointer sensitivity: • Second-largest impact on accuracy • Unnecessarily high and low sensitivity are both bad • Optimum should be set once, checked annually Keep in mind: • If the recognizer can't hear you, it can't understand what you say
  • 31. Parameter optimization. Large grammar parameters: • Typically need to be adjusted if grammar has more than 5000 entries • Typically consumes more memory and/or CPU • Vary by ASR engine, so ask Keep in mind: • If your grammar has many options, your recognizer needs to “think” more than the default settings usually allow
  • 32. What can we tune? Your untuned speech-enabled IVR
  • 33. Summary – Keep in mind • Speech allows your customer to describe things THEIR way rather than to use your internal company description – make it easy for them! • The caller will often use the exact words YOU use • Speech communication is only natural if it's not drawn out • Recognizers can only hear it if it's in the grammar • Every unnecessary grammar phrase is a potential misrecognition • Optimizing the rejection threshold is the SINGLE MOST IMPORTANT parameter tuning you can do • If the recognizer can't hear you, it can't understand what you say • If your grammar has many options, your recognizer needs to “think” more than the default settings usually allow
  • 34. For help
  • 35. State of the Art/Best Practices in Speech Technology Dan Burnett, Director of Speech Technologies
 dburnett@voxeo.com
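
The rejection-threshold slides (22 to 28) describe a cutoff on recognizer confidence, with false rejections on one side and misrecognitions on the other, and an optimum that differs by engine and grammar set. The following is a minimal sketch of how such an optimum might be found from logged calls. It is not taken from the deck: it assumes you have transcribed utterances with a 0 to 100 confidence score and a correct/incorrect label, and that "total error" means false rejections plus misrecognitions. The names `error_counts`, `best_threshold`, and the sample data are hypothetical.

```python
from typing import Iterable, List, Tuple

# One logged utterance: (recognizer confidence 0-100, was the result correct?)
# Assumption: every utterance here was in-grammar and has been hand-labeled.
Utterance = Tuple[float, bool]


def error_counts(utterances: List[Utterance], threshold: float) -> Tuple[int, int]:
    """Return (false_rejections, misrecognitions) at the given threshold.

    An utterance is rejected when its confidence is below the threshold.
    """
    false_rejections = sum(
        1 for conf, correct in utterances if conf < threshold and correct
    )
    misrecognitions = sum(
        1 for conf, correct in utterances if conf >= threshold and not correct
    )
    return false_rejections, misrecognitions


def best_threshold(
    utterances: List[Utterance], candidates: Iterable[int] = range(0, 101)
) -> int:
    """Return the lowest candidate threshold that minimizes total error."""

    def total_error(threshold: int) -> int:
        false_rejections, misrecognitions = error_counts(utterances, threshold)
        return false_rejections + misrecognitions

    # min() returns the first candidate with the smallest total error.
    return min(candidates, key=total_error)


# Hypothetical labeled sample for one engine and one set of active grammars.
sample: List[Utterance] = [
    (92, True), (85, True), (70, True), (55, False),
    (48, True), (40, False), (30, False), (15, False),
]

if __name__ == "__main__":
    t = best_threshold(sample)
    print("threshold:", t, "errors (false rej, misrec):", error_counts(sample, t))
```

Because the optimum varies across recognition engines and sets of active grammars, a sweep like this would be repeated per engine and per grammar set rather than run once globally. Weighting the two error types differently is a reasonable variation if misrecognitions cost your callers more than re-prompts.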