SlideShare a Scribd company logo
1 of 84
Download to read offline
SpeechEnabling
Web Apps
CAN YOU SPEAK MAGIC?

!2
CAN YOU SPEAK MAGIC?

Ben Klang
!2
CAN YOU SPEAK MAGIC?

Ben Klang
!2
CAN YOU SPEAK MAGIC?

Ben Klang
!2
CAN YOU SPEAK MAGIC?

ADD SPEECH TO THE WEB

!3
CAN YOU SPEAK MAGIC?

ADD SPEECH TO THE WEB

!3
CAN YOU SPEAK MAGIC?

ADD SPEECH TO THE WEB
•Speech Input API

!3
CAN YOU SPEAK MAGIC?

ADD SPEECH TO THE WEB
•Speech Input API
•Text-To-Speech (<Audio/>)

!3
CAN YOU SPEAK MAGIC?

ADD SPEECH TO THE WEB
•Speech Input API
•Text-To-Speech (<Audio/>)
•WebRTC

!3
CAN YOU SPEAK MAGIC?

ADD SPEECH TO THE WEB
•Speech Input API
•Text-To-Speech (<Audio/>)
•WebRTC

http://bit.ly/HTML5_Speech_Input_API
http://www.w3.org/TR/webrtc/

!3
CAN YOU SPEAK MAGIC?

ADD SPEECH TO THE WEB
•Speech Input API
•Text-To-Speech (<Audio/>)
•WebRTC

http://bit.ly/HTML5_Speech_Input_API
http://www.w3.org/TR/webrtc/

!3
CAN YOU SPEAK MAGIC?

SPEECH INPUT API

!4
CAN YOU SPEAK MAGIC?

SPEECH INPUT API

!5
CAN YOU SPEAK MAGIC?

SPEECH INPUT API

!5
CAN YOU SPEAK MAGIC?

SPEECH INPUT API

<input type="text" x-webkit-speech />

!5
CAN YOU SPEAK MAGIC?

ANNYANG!

!6
CAN YOU SPEAK MAGIC?

!7
CAN YOU SPEAK MAGIC?

DEMO

!8
CAN YOU SPEAK MAGIC?

SPEECH INPUT API CAVEATS

!9
CAN YOU SPEAK MAGIC?

SPEECH INPUT API CAVEATS
•Chrome Only :(

!9
CAN YOU SPEAK MAGIC?

SPEECH INPUT API CAVEATS
•Chrome Only :(
•Uses Google ASR(duh)

!9
CAN YOU SPEAK MAGIC?

SPEECH INPUT API CAVEATS
•Chrome Only :(
•Uses Google ASR(duh)
•Partial Firefox implementation from GSoC

!9
CAN YOU SPEAK MAGIC?

SPEECH INPUT API CAVEATS
•Chrome Only :(
•Uses Google ASR(duh)
•Partial Firefox implementation from GSoC

•Requires ASR Server

!9
CAN YOU SPEAK MAGIC?

SPEECH INPUT API CAVEATS
•Chrome Only :(
•Uses Google ASR(duh)
•Partial Firefox implementation from GSoC

•Requires ASR Server
•Only Google runs one today

!9
CAN YOU SPEAK MAGIC?

SPEECH INPUT API CAVEATS
•Chrome Only :(
•Uses Google ASR(duh)
•Partial Firefox implementation from GSoC

•Requires ASR Server
•Only Google runs one today
•serviceURI attribute not yet implemented

!9
CAN YOU SPEAK MAGIC?

SPEECH INPUT API CAVEATS
•Chrome Only :(
•Uses Google ASR(duh)
•Partial Firefox implementation from GSoC

•Requires ASR Server
•Only Google runs one today
•serviceURI attribute not yet implemented

•Specification maturity seems slow

!9
CAN YOU SPEAK MAGIC?

TEXT-TO-SPEECH

!10
CAN YOU SPEAK MAGIC?

TTS API + <AUDIO/>

!11
CAN YOU SPEAK MAGIC?

TTS API OPTIONS

!12
CAN YOU SPEAK MAGIC?

TTS API OPTIONS
•AT&T: http://developer.att.com

!12
CAN YOU SPEAK MAGIC?

TTS API OPTIONS
•AT&T: http://developer.att.com
•Nuance NDEV

http://nuancemobiledeveloper.com/

!12
CAN YOU SPEAK MAGIC?

TTS API OPTIONS
•AT&T: http://developer.att.com
•Nuance NDEV

http://nuancemobiledeveloper.com/
•Google:

http://translate.google.com/translate_tts?
tl=en&q=TEXT
!12
CAN YOU SPEAK MAGIC?

<AUDIO/> CAVEATS

!13
CAN YOU SPEAK MAGIC?

<AUDIO/> CAVEATS
•You can’t pay for Google TTS

!13
CAN YOU SPEAK MAGIC?

<AUDIO/> CAVEATS
•You can’t pay for Google TTS
•No specified Mandatory To
Implement (MTI) codecs

!13
CAN YOU SPEAK MAGIC?

<AUDIO/> CAVEATS
•You can’t pay for Google TTS
•No specified Mandatory To
Implement (MTI) codecs
•Broad consensus

!13
CAN YOU SPEAK MAGIC?

<AUDIO/> CAVEATS
•You can’t pay for Google TTS
•No specified Mandatory To
Implement (MTI) codecs
•Broad consensus
•Everyone: MP3 (+containers H.264, MP4)

!13
CAN YOU SPEAK MAGIC?

<AUDIO/> CAVEATS
•You can’t pay for Google TTS
•No specified Mandatory To
Implement (MTI) codecs
•Broad consensus
•Everyone: MP3 (+containers H.264, MP4)
•Except IE: Ogg/Vorbis, Opus, WebM

!13
CAN YOU SPEAK MAGIC?

<AUDIO/> CAVEATS
•You can’t pay for Google TTS
•No specified Mandatory To
Implement (MTI) codecs
•Broad consensus
•Everyone: MP3 (+containers H.264, MP4)
•Except IE: Ogg/Vorbis, Opus, WebM

•http://bit.ly/Browser_Audio_Codecs

!13
CAN YOU SPEAK MAGIC?

!14
CAN YOU SPEAK MAGIC?

WHAT IS WEBRTC TO ME?

!15
CAN YOU SPEAK MAGIC?

WHAT IS WEBRTC TO ME?

Telephones in
Web Browsers!

!15
CAN YOU SPEAK MAGIC?

WHAT IS WEBRTC TO ME?

Telephones in
Web Browsers!

!15
CAN YOU SPEAK MAGIC?

How does WebRTC Work?

!16
CAN YOU SPEAK MAGIC?

tp://
ht

!17
CAN YOU SPEAK MAGIC?

tp://
ht

Alice
!17
CAN YOU SPEAK MAGIC?

tp://
ht

Alice

Bob
!17
b

Bo

se
!

pl
ea



o=
al
s=
ic
e2
t= 0
05
m 0
18
=a
ud
0
IN
io
54
IP
60
4
0.
9
0.
RT
0.
P/
0
SA
VP
F
10
9

SD t m
v= P
e
0
:

Ge

CAN YOU SPEAK MAGIC?

tp://
ht

Alice
Bob
!17
b

Bo

se
!

pl
ea



o=
al
s=
ic
e2
t= 0
05
m 0
18
=a
ud
0
IN
io
54
IP
60
4
0.
9
0.
RT
0.
P/
0
SA
VP
F
10
9

SD t m
v= P
e
0
:

Ge

CAN YOU SPEAK MAGIC?

tp://
ht

Alice
Bob
!17
CAN YOU SPEAK MAGIC?

Ge

SD t m
v= P
e
0
:

Bo

b



o=
al
s=
ic
e2
t= 0
05
m 0
18
=a
ud
0
IN
io
54
IP
60
4
0.
9
0.
RT
0.
P/
0
SA
VP
F
10
9

pl
ea

se
!

tp://
ht

Alice

SD

v
P:
o= =0


s= bo
t= - b 1
99
m 00
15
=a
ud
0
IN
io
61
IP
00
4
0.
1
RT
0.
0.
P/
0
SA
V
PF
10
9

Bob
!17
CAN YOU SPEAK MAGIC?

Ge

SD t m
v= P
e
0
:

Bo

b



o=
al
s=
ic
e2
t= 0
05
m 0
18
=a
ud
0
IN
io
54
IP
60
4
0.
9
0.
RT
0.
P/
0
SA
VP
F
10
9

pl
ea

se
!

tp://
ht

Alice

SD

v
P:
o= =0


s= bo
t= - b 1
99
m 00
15
=a
ud
0
IN
io
61
IP
00
4
0.
1
RT
0.
0.
P/
0
SA
V
PF
10
9

Bob
!17
CAN YOU SPEAK MAGIC?

Ge

SD t m
v= P
e
0
:

Bo

b



o=
al
s=
ic
e2
t= 0
05
m 0
18
=a
ud
0
IN
io
54
IP
60
4
0.
9
0.
RT
0.
P/
0
SA
VP
F
10
9

pl
ea

se
!

tp://
ht

Alice

SD

v
P:
o= =0


s= bo
t= - b 1
99
m 00
15
=a
ud
0
IN
io
61
IP
00
4
0.
1
RT
0.
0.
P/
0
SA
V
PF
10
9

Bob
!17
CAN YOU SPEAK MAGIC?

Ge

SD t m
v= P
e
0
:

Bo

b



o=
al
s=
ic
e2
t= 0
05
m 0
18
=a
ud
0
IN
io
54
IP
60
4
0.
9
0.
RT
0.
P/
0
SA
VP
F
10
9

pl
ea

se
!

tp://
ht

Alice

SD

v
P:
o= =0


s= bo
t= - b 1
99
m 00
15
=a
ud
0
IN
io
61
IP
00
4
0.
1
RT
0.
0.
P/
0
SA
V
PF
10
9

Bob
!17
CAN YOU SPEAK MAGIC?

tp://
ht
se
!

SD

Ge

SD t m
v= P
e
0
:

Bo

b



o=
al
s=
ic
e2
t= 0
05
m 0
18
=a
ud
0
IN
io
54
IP
60
4
0.
9
0.
RT
0.
P/
0
SA
VP
F
10
9

pl
ea

v
P:
o= =0


s= bo
t= - b 1
99
m 00
15
=a
ud
0
IN
io
61
IP
00
4
0.
1
RT
0.
0.
P/
0
SA
V
PF
10
9

SRTP
SRTP

Alice

Bob
!17
CAN YOU SPEAK MAGIC?

tp://
ht

Ge

SD t m
v= P
e
0
:

Bo

b



o=
al
s=
ic
e2
t= 0
05
m 0
18
=a
ud
0
IN
io
54
IP
60
4
0.
9
0.
RT
0.
P/
0
SA
VP
F
10
9

pl
ea

se
!

SD

X

v
P:
o= =0


s= bo
t= - b 1
99
m 00
15
=a
ud
0
IN
io
61
IP
00
4
0.
1
RT
0.
0.
P/
0
SA
V
PF
10
9

SRTP
SRTP

Alice

Bob
!17
CAN YOU SPEAK MAGIC?

Alice

Bob
!18
b

Bo

se
!

pl
ea



o=
al
s=
ic
e2
t= 0
05
m 0
18
=a
ud
0
IN
io
54
IP
60
4
0.
9
0.
RT
0.
P/
0
SA
VP
F
10
9

SD t m
v= P
e
0
:

Ge

CAN YOU SPEAK MAGIC?

Alice
Bob
!18
b

Bo

se
!

pl
ea



o=
al
s=
ic
e2
t= 0
05
m 0
18
=a
ud
0
IN
io
54
IP
60
4
0.
9
0.
RT
0.
P/
0
SA
VP
F
10
9

SD t m
v= P
e
0
:

Ge

CAN YOU SPEAK MAGIC?

Alice
Bob
!18
!
ng

Bo

b

se
!
pl
ea



o=
al
s=
ic
e2
t= 0
05
m 0
18
=a
ud
0
IN
io
54
IP
60
4
0.
9
0.
RT
0.
P/
0
SA
VP
F
10
9

lli
Ca

SD t m
v= P
e
0
:

e
lic

A

0
0.
0.
0.
4
IP
9
IN
0
10
5
F
91
VP
19


SA
ch
P:
P/
it
w
RT
SD
1
0 es
v= fre
00
61
o= o
s= 0 0 di
t= =au

Bob
Alice

m

Ge

CAN YOU SPEAK MAGIC?

!18
!
ng

Bo

b

se
!
pl
ea



o=
al
s=
ic
e2
t= 0
05
m 0
18
=a
ud
0
IN
io
54
IP
60
4
0.
9
0.
RT
0.
P/
0
SA
VP
F
10
9

lli
Ca

SD t m
v= P
e
0
:

e
lic

A

0
0.
0.
0.
4
IP
9
IN
0
10
5
F
91
VP
19


SA
ch
P:
P/
it
w
RT
SD
1
0 es
v= fre
00
61
o= o
s= 0 0 di
t= =au

Bob
Alice

m

Ge

CAN YOU SPEAK MAGIC?

!18
se
!
pl
ea



o=
al
s=
ic
e2
t= 0
05
m 0
18
=a
ud
0
IN
io
54
IP
60
4
0.
9
0.
RT
0.
P/
0
SA
VP
F
10
9

SR
TP

b

!
ng

Bo

lli
Ca

SD t m
v= P
e
0
:

e
lic

A

0
0.
0.
0.
4
IP
9
IN
0
10
5
F
91
VP
19


SA
ch
P:
P/
it
w
RT
SD
1
0 es
v= fre
00
61
o= o
s= 0 0 di
t= =au
m

Bob
Alice

TP
SR

Ge

CAN YOU SPEAK MAGIC?

!18
CAN YOU SPEAK MAGIC?

Example RTC Apps

!19
CAN YOU SPEAK MAGIC?

Example RTC Apps
2 Examples

!19
CAN YOU SPEAK MAGIC?

“Communicating isn’t going
to be what you’re doing it’s what you’ll be doing
while you’re doing
something else”

- Geoff Hollingworth
Ericsson Head of AT&T Foundry
!20
CAN YOU SPEAK MAGIC?

1. Incident Response

!21
CAN YOU SPEAK MAGIC?

!22
CAN YOU SPEAK MAGIC?

INCIDENT RESPONSE

!23
CAN YOU SPEAK MAGIC?

INCIDENT RESPONSE
•Timely, Contextual Information
•Adapt for mobile vs. desktop users
•Group-based communication
•Inherit from existing organizational groups
•Allow ad-hoc participants (“guest” parties)
•Federate with external services

•Incident recording/logging
•“Lessons learned” and process improvement
•Links from/to issue tracking systems
!23
CAN YOU SPEAK MAGIC?

2. Medical Records
Management

!24
CAN YOU SPEAK MAGIC?

!25
CAN YOU SPEAK MAGIC?

MEDICAL RECORDS MGMT

!26
CAN YOU SPEAK MAGIC?

MEDICAL RECORDS MGMT
•Automate Medical Claims
•Secure Caller Authentication
•Reuse primary auth via website
•Verify with voice biometrics
•Cross-check against caller location

•Call recording/transcription
•Medical advice given to patient
automatically added to patient file
•Auditing/Service Quality Assurance
!26
CAN YOU SPEAK MAGIC?

HTTPS://TALKY.IO/ATLRUG

!27
CAN YOU SPEAK MAGIC?

WEBRTC CAVEATS

!28
CAN YOU SPEAK MAGIC?

WEBRTC CAVEATS
•Bleeding edge, developing standard

!28
CAN YOU SPEAK MAGIC?

WEBRTC CAVEATS
•Bleeding edge, developing standard
•Only available on Chrome, Firefox

!28
CAN YOU SPEAK MAGIC?

WEBRTC CAVEATS
•Bleeding edge, developing standard
•Only available on Chrome, Firefox
•Only available on Desktop

!28
CAN YOU SPEAK MAGIC?

WEBRTC CAVEATS
•Bleeding edge, developing standard
•Only available on Chrome, Firefox
•Only available on Desktop
•Well funded/backed development

!28
CAN YOU SPEAK MAGIC?

WEBRTC CAVEATS
•Bleeding edge, developing standard
•Only available on Chrome, Firefox
•Only available on Desktop
•Well funded/backed development
•Expect to see it mainstream
(Desktop + Mobile) as soon as 2014

!28
CAN YOU SPEAK MAGIC?

WEBRTC CAVEATS
•Bleeding edge, developing standard
•Only available on Chrome, Firefox
•Only available on Desktop
•Well funded/backed development
•Expect to see it mainstream
(Desktop + Mobile) as soon as 2014
•http://iswebrtcreadyyet.com/

!28
CAN YOU SPEAK MAGIC?

!29
CAN YOU SPEAK MAGIC?

adhearsionconf.com
Early Bird Discount: atlrug
!30
CAN YOU SPEAK MAGIC?

@bklang
bklang@mojolingo.com
http://bit.ly/HTML5_Speech_Input_API
http://www.w3.org/TR/webrtc/
http://iswebrtcreadyyet.com/

http://mojolingo.com
@MojoLingo

Early Bird Discount: atlrug
!31

More Related Content

Similar to Speech-Enabling Web Apps

HTML5 is the Future of Mobile, PhoneGap Takes You There Today
HTML5 is the Future of Mobile, PhoneGap Takes You There TodayHTML5 is the Future of Mobile, PhoneGap Takes You There Today
HTML5 is the Future of Mobile, PhoneGap Takes You There Today
davyjones
 
CODE BLUE 2014 : マイクロソフトの脆弱性調査 : ベンダーでありながら発見者となるために by デイヴィッド・シードマン David Se...
CODE BLUE 2014 : マイクロソフトの脆弱性調査 : ベンダーでありながら発見者となるために by デイヴィッド・シードマン David Se...CODE BLUE 2014 : マイクロソフトの脆弱性調査 : ベンダーでありながら発見者となるために by デイヴィッド・シードマン David Se...
CODE BLUE 2014 : マイクロソフトの脆弱性調査 : ベンダーでありながら発見者となるために by デイヴィッド・シードマン David Se...
CODE BLUE
 
Philly ete-2011
Philly ete-2011Philly ete-2011
Philly ete-2011
davyjones
 

Similar to Speech-Enabling Web Apps (20)

ConnectJS 2015: Video Killed the Telephone Star
ConnectJS 2015: Video Killed the Telephone StarConnectJS 2015: Video Killed the Telephone Star
ConnectJS 2015: Video Killed the Telephone Star
 
Slack Apps: Best practices & Use Cases!
Slack Apps: Best practices & Use Cases!Slack Apps: Best practices & Use Cases!
Slack Apps: Best practices & Use Cases!
 
HTML5 is the Future of Mobile, PhoneGap Takes You There Today
HTML5 is the Future of Mobile, PhoneGap Takes You There TodayHTML5 is the Future of Mobile, PhoneGap Takes You There Today
HTML5 is the Future of Mobile, PhoneGap Takes You There Today
 
Prototyping: Helping to take away the suck
Prototyping: Helping to take away the suckPrototyping: Helping to take away the suck
Prototyping: Helping to take away the suck
 
Building A Great API - Evan Cooke, Cloudstock, December 2010
Building A Great API - Evan Cooke, Cloudstock, December 2010Building A Great API - Evan Cooke, Cloudstock, December 2010
Building A Great API - Evan Cooke, Cloudstock, December 2010
 
CODE BLUE 2014 : マイクロソフトの脆弱性調査 : ベンダーでありながら発見者となるために by デイヴィッド・シードマン David Se...
CODE BLUE 2014 : マイクロソフトの脆弱性調査 : ベンダーでありながら発見者となるために by デイヴィッド・シードマン David Se...CODE BLUE 2014 : マイクロソフトの脆弱性調査 : ベンダーでありながら発見者となるために by デイヴィッド・シードマン David Se...
CODE BLUE 2014 : マイクロソフトの脆弱性調査 : ベンダーでありながら発見者となるために by デイヴィッド・シードマン David Se...
 
Evaluation of Web Processing Service Frameworks
Evaluation of Web Processing Service FrameworksEvaluation of Web Processing Service Frameworks
Evaluation of Web Processing Service Frameworks
 
Keeping It Realtime!
Keeping It Realtime!Keeping It Realtime!
Keeping It Realtime!
 
Open source-secret-sauce-rit-2010
Open source-secret-sauce-rit-2010Open source-secret-sauce-rit-2010
Open source-secret-sauce-rit-2010
 
Voice Applications with Adhearsion
Voice Applications with AdhearsionVoice Applications with Adhearsion
Voice Applications with Adhearsion
 
Philly ete-2011
Philly ete-2011Philly ete-2011
Philly ete-2011
 
TDC2016SP - Otimização Prematura: a Raíz de Todo o Mal
TDC2016SP - Otimização Prematura: a Raíz de Todo o MalTDC2016SP - Otimização Prematura: a Raíz de Todo o Mal
TDC2016SP - Otimização Prematura: a Raíz de Todo o Mal
 
Stapling and patching the web of now - ForwardJS3, San Francisco
Stapling and patching the web of now - ForwardJS3, San FranciscoStapling and patching the web of now - ForwardJS3, San Francisco
Stapling and patching the web of now - ForwardJS3, San Francisco
 
Single Page Web Apps
Single Page Web AppsSingle Page Web Apps
Single Page Web Apps
 
Building websites with building blocks
Building websites with building blocksBuilding websites with building blocks
Building websites with building blocks
 
A Fullstack Future - Progressive Web Apps
A Fullstack Future - Progressive Web AppsA Fullstack Future - Progressive Web Apps
A Fullstack Future - Progressive Web Apps
 
Progressive Web Apps 101 - NationJS
Progressive Web Apps 101 - NationJSProgressive Web Apps 101 - NationJS
Progressive Web Apps 101 - NationJS
 
Of innovation and impatience - Future Decoded 2015
Of innovation and impatience - Future Decoded 2015Of innovation and impatience - Future Decoded 2015
Of innovation and impatience - Future Decoded 2015
 
RIPE 69 & IETF 91 Webinar - DNS-Privacy, IPv6, DANE and DHCP(v6)
RIPE 69 & IETF 91 Webinar - DNS-Privacy, IPv6, DANE and DHCP(v6)RIPE 69 & IETF 91 Webinar - DNS-Privacy, IPv6, DANE and DHCP(v6)
RIPE 69 & IETF 91 Webinar - DNS-Privacy, IPv6, DANE and DHCP(v6)
 
Mobile First - WCJ 2012
Mobile First - WCJ 2012Mobile First - WCJ 2012
Mobile First - WCJ 2012
 

More from Mojo Lingo

Infiltrando Telecoms Usando Ruby
Infiltrando Telecoms Usando RubyInfiltrando Telecoms Usando Ruby
Infiltrando Telecoms Usando Ruby
Mojo Lingo
 

More from Mojo Lingo (20)

Using Asterisk to Create "Her"
Using Asterisk to Create "Her"Using Asterisk to Create "Her"
Using Asterisk to Create "Her"
 
Tipping the Scales: Measuring and Scaling Asterisk
Tipping the Scales: Measuring and Scaling AsteriskTipping the Scales: Measuring and Scaling Asterisk
Tipping the Scales: Measuring and Scaling Asterisk
 
WebRTC Overview by Dan Burnett
WebRTC Overview by Dan BurnettWebRTC Overview by Dan Burnett
WebRTC Overview by Dan Burnett
 
WebRTC: What? How? Why? - ClueCon 2013
WebRTC: What? How? Why? - ClueCon 2013WebRTC: What? How? Why? - ClueCon 2013
WebRTC: What? How? Why? - ClueCon 2013
 
Infiltrando Telecoms Usando Ruby
Infiltrando Telecoms Usando RubyInfiltrando Telecoms Usando Ruby
Infiltrando Telecoms Usando Ruby
 
Enhancing FreePBX with Adhearsion
Enhancing FreePBX with AdhearsionEnhancing FreePBX with Adhearsion
Enhancing FreePBX with Adhearsion
 
Connecting Adhearsion
Connecting AdhearsionConnecting Adhearsion
Connecting Adhearsion
 
Testing Adhearsion Applications
Testing Adhearsion ApplicationsTesting Adhearsion Applications
Testing Adhearsion Applications
 
Testing Telephony: It's Not All Terrible
Testing Telephony: It's Not All TerribleTesting Telephony: It's Not All Terrible
Testing Telephony: It's Not All Terrible
 
Rayo for XMPP Folks
Rayo for XMPP FolksRayo for XMPP Folks
Rayo for XMPP Folks
 
Talking To Rails
Talking To RailsTalking To Rails
Talking To Rails
 
Building Real Life Applications with Adhearsion
Building Real Life Applications with AdhearsionBuilding Real Life Applications with Adhearsion
Building Real Life Applications with Adhearsion
 
Integrating Voice Through Adhearsion
Integrating Voice Through AdhearsionIntegrating Voice Through Adhearsion
Integrating Voice Through Adhearsion
 
Infiltrating Telecoms Using Ruby
Infiltrating Telecoms Using RubyInfiltrating Telecoms Using Ruby
Infiltrating Telecoms Using Ruby
 
Telephony Through Ruby Colored Lenses
Telephony Through Ruby Colored LensesTelephony Through Ruby Colored Lenses
Telephony Through Ruby Colored Lenses
 
Voice Applications for the Modern Open Source Hacker
Voice Applications for the Modern Open Source HackerVoice Applications for the Modern Open Source Hacker
Voice Applications for the Modern Open Source Hacker
 
Multidextrous Voice Application Framework
Multidextrous Voice Application FrameworkMultidextrous Voice Application Framework
Multidextrous Voice Application Framework
 
A Hackaton Focused on Call Control
A Hackaton Focused on Call ControlA Hackaton Focused on Call Control
A Hackaton Focused on Call Control
 
Supergluing Asterisk to the Web with Adhearsion
Supergluing Asterisk to the Web with AdhearsionSupergluing Asterisk to the Web with Adhearsion
Supergluing Asterisk to the Web with Adhearsion
 
Introduction to Adhearsion
Introduction to AdhearsionIntroduction to Adhearsion
Introduction to Adhearsion
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 

Speech-Enabling Web Apps

  • 2. CAN YOU SPEAK MAGIC? !2
  • 3. CAN YOU SPEAK MAGIC? Ben Klang !2
  • 4. CAN YOU SPEAK MAGIC? Ben Klang !2
  • 5. CAN YOU SPEAK MAGIC? Ben Klang !2
  • 6. CAN YOU SPEAK MAGIC? ADD SPEECH TO THE WEB !3
  • 7. CAN YOU SPEAK MAGIC? ADD SPEECH TO THE WEB !3
  • 8. CAN YOU SPEAK MAGIC? ADD SPEECH TO THE WEB •Speech Input API !3
  • 9. CAN YOU SPEAK MAGIC? ADD SPEECH TO THE WEB •Speech Input API •Text-To-Speech (<Audio/>) !3
  • 10. CAN YOU SPEAK MAGIC? ADD SPEECH TO THE WEB •Speech Input API •Text-To-Speech (<Audio/>) •WebRTC !3
  • 11. CAN YOU SPEAK MAGIC? ADD SPEECH TO THE WEB •Speech Input API •Text-To-Speech (<Audio/>) •WebRTC http://bit.ly/HTML5_Speech_Input_API http://www.w3.org/TR/webrtc/ !3
  • 12. CAN YOU SPEAK MAGIC? ADD SPEECH TO THE WEB •Speech Input API •Text-To-Speech (<Audio/>) •WebRTC http://bit.ly/HTML5_Speech_Input_API http://www.w3.org/TR/webrtc/ !3
  • 13. CAN YOU SPEAK MAGIC? SPEECH INPUT API !4
  • 14. CAN YOU SPEAK MAGIC? SPEECH INPUT API !5
  • 15. CAN YOU SPEAK MAGIC? SPEECH INPUT API !5
  • 16. CAN YOU SPEAK MAGIC? SPEECH INPUT API <input type="text" x-webkit-speech /> !5
  • 17. CAN YOU SPEAK MAGIC? ANNYANG! !6
  • 18. CAN YOU SPEAK MAGIC? !7
  • 19. CAN YOU SPEAK MAGIC? DEMO !8
  • 20. CAN YOU SPEAK MAGIC? SPEECH INPUT API CAVEATS !9
  • 21. CAN YOU SPEAK MAGIC? SPEECH INPUT API CAVEATS •Chrome Only :( !9
  • 22. CAN YOU SPEAK MAGIC? SPEECH INPUT API CAVEATS •Chrome Only :( •Uses Google ASR(duh) !9
  • 23. CAN YOU SPEAK MAGIC? SPEECH INPUT API CAVEATS •Chrome Only :( •Uses Google ASR(duh) •Partial Firefox implementation from GSoC !9
  • 24. CAN YOU SPEAK MAGIC? SPEECH INPUT API CAVEATS •Chrome Only :( •Uses Google ASR(duh) •Partial Firefox implementation from GSoC •Requires ASR Server !9
  • 25. CAN YOU SPEAK MAGIC? SPEECH INPUT API CAVEATS •Chrome Only :( •Uses Google ASR(duh) •Partial Firefox implementation from GSoC •Requires ASR Server •Only Google runs one today !9
  • 26. CAN YOU SPEAK MAGIC? SPEECH INPUT API CAVEATS •Chrome Only :( •Uses Google ASR(duh) •Partial Firefox implementation from GSoC •Requires ASR Server •Only Google runs one today •serviceURI attribute not yet implemented !9
  • 27. CAN YOU SPEAK MAGIC? SPEECH INPUT API CAVEATS •Chrome Only :( •Uses Google ASR(duh) •Partial Firefox implementation from GSoC •Requires ASR Server •Only Google runs one today •serviceURI attribute not yet implemented •Specification maturity seems slow !9
  • 28. CAN YOU SPEAK MAGIC? TEXT-TO-SPEECH !10
  • 29. CAN YOU SPEAK MAGIC? TTS API + <AUDIO/> !11
  • 30. CAN YOU SPEAK MAGIC? TTS API OPTIONS !12
  • 31. CAN YOU SPEAK MAGIC? TTS API OPTIONS •AT&T: http://developer.att.com !12
  • 32. CAN YOU SPEAK MAGIC? TTS API OPTIONS •AT&T: http://developer.att.com •Nuance NDEV
 http://nuancemobiledeveloper.com/ !12
  • 33. CAN YOU SPEAK MAGIC? TTS API OPTIONS •AT&T: http://developer.att.com •Nuance NDEV
 http://nuancemobiledeveloper.com/ •Google:
 http://translate.google.com/translate_tts? tl=en&q=TEXT !12
  • 34. CAN YOU SPEAK MAGIC? <AUDIO/> CAVEATS !13
  • 35. CAN YOU SPEAK MAGIC? <AUDIO/> CAVEATS •You can’t pay for Google TTS !13
  • 36. CAN YOU SPEAK MAGIC? <AUDIO/> CAVEATS •You can’t pay for Google TTS •No specified Mandatory To Implement (MTI) codecs !13
  • 37. CAN YOU SPEAK MAGIC? <AUDIO/> CAVEATS •You can’t pay for Google TTS •No specified Mandatory To Implement (MTI) codecs •Broad consensus !13
  • 38. CAN YOU SPEAK MAGIC? <AUDIO/> CAVEATS •You can’t pay for Google TTS •No specified Mandatory To Implement (MTI) codecs •Broad consensus •Everyone: MP3 (+containers H.264, MP4) !13
  • 39. CAN YOU SPEAK MAGIC? <AUDIO/> CAVEATS •You can’t pay for Google TTS •No specified Mandatory To Implement (MTI) codecs •Broad consensus •Everyone: MP3 (+containers H.264, MP4) •Except IE: Ogg/Vorbis, Opus, WebM !13
  • 40. CAN YOU SPEAK MAGIC? <AUDIO/> CAVEATS •You can’t pay for Google TTS •No specified Mandatory To Implement (MTI) codecs •Broad consensus •Everyone: MP3 (+containers H.264, MP4) •Except IE: Ogg/Vorbis, Opus, WebM •http://bit.ly/Browser_Audio_Codecs !13
  • 41. CAN YOU SPEAK MAGIC? !14
  • 42. CAN YOU SPEAK MAGIC? WHAT IS WEBRTC TO ME? !15
  • 43. CAN YOU SPEAK MAGIC? WHAT IS WEBRTC TO ME? Telephones in Web Browsers! !15
  • 44. CAN YOU SPEAK MAGIC? WHAT IS WEBRTC TO ME? Telephones in Web Browsers! !15
  • 45. CAN YOU SPEAK MAGIC? How does WebRTC Work? !16
  • 46. CAN YOU SPEAK MAGIC? tp:// ht !17
  • 47. CAN YOU SPEAK MAGIC? tp:// ht Alice !17
  • 48. CAN YOU SPEAK MAGIC? tp:// ht Alice Bob !17
  • 51. CAN YOU SPEAK MAGIC? Ge SD t m v= P e 0 : Bo b 
 o= al s= ic e2 t= 0 05 m 0 18 =a ud 0 IN io 54 IP 60 4 0. 9 0. RT 0. P/ 0 SA VP F 10 9 pl ea se ! tp:// ht Alice SD v P: o= =0 
 s= bo t= - b 1 99 m 00 15 =a ud 0 IN io 61 IP 00 4 0. 1 RT 0. 0. P/ 0 SA V PF 10 9 Bob !17
  • 52. CAN YOU SPEAK MAGIC? Ge SD t m v= P e 0 : Bo b 
 o= al s= ic e2 t= 0 05 m 0 18 =a ud 0 IN io 54 IP 60 4 0. 9 0. RT 0. P/ 0 SA VP F 10 9 pl ea se ! tp:// ht Alice SD v P: o= =0 
 s= bo t= - b 1 99 m 00 15 =a ud 0 IN io 61 IP 00 4 0. 1 RT 0. 0. P/ 0 SA V PF 10 9 Bob !17
  • 53. CAN YOU SPEAK MAGIC? Ge SD t m v= P e 0 : Bo b 
 o= al s= ic e2 t= 0 05 m 0 18 =a ud 0 IN io 54 IP 60 4 0. 9 0. RT 0. P/ 0 SA VP F 10 9 pl ea se ! tp:// ht Alice SD v P: o= =0 
 s= bo t= - b 1 99 m 00 15 =a ud 0 IN io 61 IP 00 4 0. 1 RT 0. 0. P/ 0 SA V PF 10 9 Bob !17
  • 54. CAN YOU SPEAK MAGIC? Ge SD t m v= P e 0 : Bo b 
 o= al s= ic e2 t= 0 05 m 0 18 =a ud 0 IN io 54 IP 60 4 0. 9 0. RT 0. P/ 0 SA VP F 10 9 pl ea se ! tp:// ht Alice SD v P: o= =0 
 s= bo t= - b 1 99 m 00 15 =a ud 0 IN io 61 IP 00 4 0. 1 RT 0. 0. P/ 0 SA V PF 10 9 Bob !17
  • 55. CAN YOU SPEAK MAGIC? tp:// ht se ! SD Ge SD t m v= P e 0 : Bo b 
 o= al s= ic e2 t= 0 05 m 0 18 =a ud 0 IN io 54 IP 60 4 0. 9 0. RT 0. P/ 0 SA VP F 10 9 pl ea v P: o= =0 
 s= bo t= - b 1 99 m 00 15 =a ud 0 IN io 61 IP 00 4 0. 1 RT 0. 0. P/ 0 SA V PF 10 9 SRTP SRTP Alice Bob !17
  • 56. CAN YOU SPEAK MAGIC? tp:// ht Ge SD t m v= P e 0 : Bo b 
 o= al s= ic e2 t= 0 05 m 0 18 =a ud 0 IN io 54 IP 60 4 0. 9 0. RT 0. P/ 0 SA VP F 10 9 pl ea se ! SD X v P: o= =0 
 s= bo t= - b 1 99 m 00 15 =a ud 0 IN io 61 IP 00 4 0. 1 RT 0. 0. P/ 0 SA V PF 10 9 SRTP SRTP Alice Bob !17
  • 57. CAN YOU SPEAK MAGIC? Alice Bob !18
  • 60. ! ng Bo b se ! pl ea 
 o= al s= ic e2 t= 0 05 m 0 18 =a ud 0 IN io 54 IP 60 4 0. 9 0. RT 0. P/ 0 SA VP F 10 9 lli Ca SD t m v= P e 0 : e lic A 0 0. 0. 0. 4 IP 9 IN 0 10 5 F 91 VP 19 
 SA ch P: P/ it w RT SD 1 0 es v= fre 00 61 o= o s= 0 0 di t= =au Bob Alice m Ge CAN YOU SPEAK MAGIC? !18
  • 61. ! ng Bo b se ! pl ea 
 o= al s= ic e2 t= 0 05 m 0 18 =a ud 0 IN io 54 IP 60 4 0. 9 0. RT 0. P/ 0 SA VP F 10 9 lli Ca SD t m v= P e 0 : e lic A 0 0. 0. 0. 4 IP 9 IN 0 10 5 F 91 VP 19 
 SA ch P: P/ it w RT SD 1 0 es v= fre 00 61 o= o s= 0 0 di t= =au Bob Alice m Ge CAN YOU SPEAK MAGIC? !18
  • 62. se ! pl ea 
 o= al s= ic e2 t= 0 05 m 0 18 =a ud 0 IN io 54 IP 60 4 0. 9 0. RT 0. P/ 0 SA VP F 10 9 SR TP b ! ng Bo lli Ca SD t m v= P e 0 : e lic A 0 0. 0. 0. 4 IP 9 IN 0 10 5 F 91 VP 19 
 SA ch P: P/ it w RT SD 1 0 es v= fre 00 61 o= o s= 0 0 di t= =au m Bob Alice TP SR Ge CAN YOU SPEAK MAGIC? !18
  • 63. CAN YOU SPEAK MAGIC? Example RTC Apps !19
  • 64. CAN YOU SPEAK MAGIC? Example RTC Apps 2 Examples !19
  • 65. CAN YOU SPEAK MAGIC? “Communicating isn’t going to be what you’re doing it’s what you’ll be doing while you’re doing something else”
 - Geoff Hollingworth Ericsson Head of AT&T Foundry !20
  • 66. CAN YOU SPEAK MAGIC? 1. Incident Response !21
  • 67. CAN YOU SPEAK MAGIC? !22
  • 68. CAN YOU SPEAK MAGIC? INCIDENT RESPONSE !23
  • 69. CAN YOU SPEAK MAGIC? INCIDENT RESPONSE •Timely, Contextual Information •Adapt for mobile vs. desktop users •Group-based communication •Inherit from existing organizational groups •Allow ad-hoc participants (“guest” parties) •Federate with external services •Incident recording/logging •“Lessons learned” and process improvement •Links from/to issue tracking systems !23
  • 70. CAN YOU SPEAK MAGIC? 2. Medical Records Management !24
  • 71. CAN YOU SPEAK MAGIC? !25
  • 72. CAN YOU SPEAK MAGIC? MEDICAL RECORDS MGMT !26
  • 73. CAN YOU SPEAK MAGIC? MEDICAL RECORDS MGMT •Automate Medical Claims •Secure Caller Authentication •Reuse primary auth via website •Verify with voice biometrics •Cross-check against caller location •Call recording/transcription •Medical advice given to patient automatically added to patient file •Auditing/Service Quality Assurance !26
  • 74. CAN YOU SPEAK MAGIC? HTTPS://TALKY.IO/ATLRUG !27
  • 75. CAN YOU SPEAK MAGIC? WEBRTC CAVEATS !28
  • 76. CAN YOU SPEAK MAGIC? WEBRTC CAVEATS •Bleeding edge, developing standard !28
  • 77. CAN YOU SPEAK MAGIC? WEBRTC CAVEATS •Bleeding edge, developing standard •Only available on Chrome, Firefox !28
  • 78. CAN YOU SPEAK MAGIC? WEBRTC CAVEATS •Bleeding edge, developing standard •Only available on Chrome, Firefox •Only available on Desktop !28
  • 79. CAN YOU SPEAK MAGIC? WEBRTC CAVEATS •Bleeding edge, developing standard •Only available on Chrome, Firefox •Only available on Desktop •Well funded/backed development !28
  • 80. CAN YOU SPEAK MAGIC? WEBRTC CAVEATS •Bleeding edge, developing standard •Only available on Chrome, Firefox •Only available on Desktop •Well funded/backed development •Expect to see it mainstream (Desktop + Mobile) as soon as 2014 !28
  • 81. CAN YOU SPEAK MAGIC? WEBRTC CAVEATS •Bleeding edge, developing standard •Only available on Chrome, Firefox •Only available on Desktop •Well funded/backed development •Expect to see it mainstream (Desktop + Mobile) as soon as 2014 •http://iswebrtcreadyyet.com/ !28
  • 82. CAN YOU SPEAK MAGIC? !29
  • 83. CAN YOU SPEAK MAGIC? adhearsionconf.com Early Bird Discount: atlrug !30
  • 84. CAN YOU SPEAK MAGIC? @bklang bklang@mojolingo.com http://bit.ly/HTML5_Speech_Input_API http://www.w3.org/TR/webrtc/ http://iswebrtcreadyyet.com/ http://mojolingo.com @MojoLingo Early Bird Discount: atlrug !31