SlideShare a Scribd company logo
1 of 30
Download to read offline
© 2011 IBM Corporation
Watson and the Social Web
Chris Welty
IBM Watson Group
ibmwatson.com
Do Not Record. Do Not Distribute.
© 2011 IBM Corporation
What is Cognitive Computing?
§  Increasingly, machines are being asked to add their computational
power to problems which are not inherently solvable
§  Traditionally, these problems came from AI
– The hardest AI problems are the easiest for human intelligence:
vision, speech, natural language – these are not actually associated
with “being intelligent”
– Human intelligence provides solutions, but does not scale
§  Cognitive Computing is founded on four principles
Learn & improve. Cognitive computing systems
focus on inexact solutions to unsolvable problems
that utilize machine learning and improve over time.
Often they combine multiple approaches and must
integrate them effectively. They must learn from
humans, in more and more seamless ways.
Speed&Scale. Cognitive computing harnesses the
clear advantage machines have over humans in
their ability to perform mundane tasks of arbitrary
complexity repeatedly, whether it is the scale of the
data or the complexity of the task.
Interact in a natural way. Cognitive computing
provides technologies that support a higher level of
human cognition by adapting to human approaches
and interfaces...over the next several decades it will
incorporate essentially all the ways humans sense
and interact.
Assist & augment human cognition. Cognitive
computing addresses problems that lie squarely in
the province of human intelligence, but where we
can't handle the volume of information, penetrate the
complexity or otherwise extend our reach
(physically).
The goal is to be useful, not universally correct.
or
Computers can be incorrect and still prove useful!
© 2011 IBM Corporation
Examples of Cognitive Computing
§ Web Search
§ Image Search
§ Event Search
§ Recommendations
§ Natural Language Processing
© 2011 IBM Corporation
What is Watson?
§  Open Domain Question-Answering Machine
§  Given
–  Rich Natural Language Questions
–  Over a Broad Domain of Knowledge
§  Delivers
–  Precise Answers: Determine what is being asked & give precise response
–  Accurate Confidences: Determine likelihood answer is correct
–  Consumable Justifications: Explain why the answer is right
–  Fast Response Time: Precision & Confidence in <3 seconds
–  At the level of human experts
– Proved its mettle in a televised match
–  Won a 2-game Jeopardy match against
the all-time winners
–  viewed by over 50,000,000
4
© 2011 IBM Corporation
What is Jeopardy?
§ Jeopardy! is an American quiz
show
– 1964 – Today
– Household name in U.S.
§ answer-and-question format
– contestants are presented with
clues in the form of answers
– must phrase their responses in
question form.
– Open domain trivia questions,
speed is a big factor
§  Example
–  Category: General Science
–  Clue: When hit by electrons, a
phosphor gives off electromagnetic
energy in this form
–  Answer: What is light?
© 2011 IBM Corporation
Social Computing: What’s the connection?
§ Social Web as Data Source:
– The vast majority of sources Watson
used to answer questions came from
community-created data
–  Adapting Watson to a new problem
requires the same kind of information
about that problem
§  Social Machines:
–  Watson combined with people is a
powerful proposition
§  Social Web as Application:
–  Watson’s major advance is in
understanding natural language, the
technology can be useful to augment
social interaction
© 2011 IBM Corporation
$200
If you are looking at
the wainscoating,
you are looking in
this direction.
$1000
The first person
mentioned by name in
‘The Man in the Iron
Mask’ is this hero of a
previous book by the
same author.
7
The Jeopardy! Challenge
Hard for humans, hard for machines
Broad/Open
Domain
Complex
Language
High
Precision
Accurate
Confidence
High
Speed
$600
In cell division, mitosis
splits the nucleus &
cytokinesis splits this
liquid cushioning the
nucleus
$800
The conspirators against
this man were wounded by
each other while they
stabbed at him
But hard for different reasons.
For people, the challenge is knowing the answer
For machines, the challenge is understanding the
question
What is down?
Who is
D’Artagnan?
What is
cytoplasm?
Who is Julius
Caesar?
© 2011 IBM Corporation
The Winner’s Cloud
What It Takes to compete against Top Human Jeopardy! Players
Winning Human
Performance
2007 QA Computer System
Grand Champion
Human Performance
Top human
players are
remarkably
good.
Each dot – actual historical human Jeopardy! games
More Confident Less Confident
Develop
against a
metric!
© 2011 IBM Corporation
2007 QA Computer System
In 2007, we committed to
making a Huge Leap!
More Confident Less Confident
Each dot – actual historical human Jeopardy! games
Computers?
Not So Good.
Winning Human
Performance
Grand Champion
Human Performance
The Winner’s Cloud
What It Takes to compete against Top Human Jeopardy! Players
© 2011 IBM Corporation
DeepQA: The Technology Behind Watson
An example of a new software paradigm
. . .
Answer
Scoring
Models
Answer &
Confidence
Question
Evidence
Sources
Models
Models
Models
Models
ModelsPrimary
Search
Candidate
Answer
Generation
Hypothesis
Generation
Hypothesis and
Evidence Scoring
Final Confidence
Merging &
Ranking
Synthesis
Answer
Sources
Question &
Topic
Analysis
Question
Decomposition
Evidence
Retrieval
Deep
Evidence
Scoring
Hypothesis
Generation
Hypothesis and Evidence
Scoring
Learned Models
help combine and
weigh the Evidence
DeepQA generates and scores many hypotheses using an extensible collection of
Natural Language Processing, Machine Learning and Reasoning Algorithms.
These gather and weigh evidence over both unstructured and structured content to
determine the answer with the best confidence.
Content from
Community
Resources!
© 2011 IBM Corporation
Example Question
In 1894 C.W. Post
created his warm
cereal drink Postum in
this Michigan city
Related Content
(Structured & Unstructured)
Primary
Search
1985
Post Foods
aramour
General Foods
Grand Rapids
…
Battle Creek
…
…
Candidate Answer Generation
1)  Battle Creek (0.85)
2)  Post Foods ( 0.20)
3)  1985 (0.05)
Merging &
Ranking
Evidence
Retrieval
Question
Analysis
Keywords: 1894, C.W. Post,
created …
Lexical AnswerType:
(Michingan city)
Date(1894)
Relations:
Create(Post, cereal drink)
…
[0.58 0 -1.3 … 0.97]
[0.71 1 13.4 … 0.72]
[0.12 0 2.0 … 0.40]
[0.84 1 10.6 … 0.21]
[0.33 0 6.3 … 0.83]
[0.21 1 11.1 … 0.92]
[0.91 0 -8.2 … 0.61]
[0.91 0 -1.7 … 0.60]
Evidence
Scoring
Need thousands of
Q/A pairs for training!
© 2011 IBM Corporation
Planet Fitness
Role	
  of	
  Answer	
  Typing	
  in	
  QA	
  
Type Information - a crucial hint to get the correct answer
ASTRONOMY:	
  In	
  1610	
  Galileo	
  named	
  the	
  moons	
  of	
  this	
  planet	
  
for	
  the	
  Medici	
  brothers	
  
Telescope
Giovanni Medici
Sidereus
Nuncius
Jupiter
Ganymede
Telescope
(Instrument)
Giovanni Medici
(Person)
Sidereus
Nuncius
(Book)
Jupiter
(Planet)
Ganymede
(Moon)
Terms	
  Associated	
  with	
  Clue	
  Context	
  
	
  (e.g.	
  via	
  Keyword	
  Search)	
  
Planet Fitness
(Planet)
© 2011 IBM Corporation
§  This	
  fish	
  was	
  thought	
  to	
  be	
  exLnct	
  millions	
  of	
  years	
  ago	
  	
  
unLl	
  one	
  was	
  found	
  off	
  South	
  Africa	
  in	
  1938	
  	
  	
  
§  Category:	
  ENDS	
  IN	
  "TH"	
  	
  
§  Answer:	
  
§  When	
  hit	
  by	
  electrons,	
  a	
  phosphor	
  gives	
  off	
  electromagneLc	
  
energy	
  in	
  this	
  form	
  
§  Category:	
  General	
  Science	
  
§  Answer:	
  
	
  
§  Secy.	
  Chase	
  just	
  submiXed	
  this	
  to	
  me	
  for	
  the	
  third	
  Lme-­‐-­‐guess	
  
what,	
  pal.	
  This	
  Lme	
  I'm	
  accepLng	
  it	
  	
  
§  Category:	
  Lincoln	
  Blogs	
  
§  Answer:	
  
	
  
The type of thing
being asked for is
often indicated but
can go from specific
to very vague
coelacanth	
  
light	
  (or	
  photons)	
  
his	
  resigna4on	
  
13	
  
Answer Typing for Jeopardy!?
© 2011 IBM Corporation
Broad Domain
Our Focus is on reusable NLP technology for analyzing vast volumes of as-is text.
Structured sources (DBs and KBs) provide background knowledge for interpreting the text.
We do NOT attempt to anticipate all
questions and build databases.
We do NOT try to build a formal
model of the world
© 2011 IBM Corporation
Sources for typing evidence
§ DbPedia & Freebase
– Wide coverage of well-known entities
– Taxonomy (MountainsOfNepal → Mountain)
– Good type coverage, but not many synonyms
•  E.g. what about “summit”
§  Wikpedia Categories
–  Wide coverage of entities and type name synonyms
–  Noisy (many errors)
§  Wikipedia Intro
–  First sentence always indicates the most common type of the entity
–  Highly reliable, low coverage of types
Communities can
scale data collection!
© 2011 IBM Corporation
Typing	
  Impact	
  on	
  Jeopardy!	
  clues	
  
61.5%
62.0%
62.5%
63.0%
63.5%
64.0%
64.5%
65.0%
65.5%
66.0%
66.5%
An ensemble of TyCor components
+ ~10%
© 2011 IBM Corporation
Many sources of evidence
In 1894 C.W. Post
created his warm
cereal drink Postum in
this Michigan city
Related Content
(Structured & Unstructured)
Primary
Search
1985
Post Foods
aramour
General Foods
Grand Rapids
…
Battle Creek
…
…
Candidate Answer Generation
1)  Battle Creek (0.85)
2)  Post Foods ( 0.20)
3)  1985 (0.05)
Merging &
Ranking
Evidence
Retrieval
Question
Analysis
Keywords: 1894, C.W. Post,
created …
Lexical AnswerType:
(Michingan city)
Date(1894)
Relations:
Create(Post, cereal drink)
…
[0.58 0 -1.3 … 0.97]
[0.71 1 13.4 … 0.72]
[0.12 0 2.0 … 0.40]
[0.84 1 10.6 … 0.21]
[0.33 0 6.3 … 0.83]
[0.21 1 11.1 … 0.92]
[0.91 0 -8.2 … 0.61]
[0.91 0 -1.7 … 0.60]
Evidence
Scoring
© 2011 IBM Corporation
Watson as part of a social machine
§  Watson makes mistakes:
– This woman was the first to witness her husband resign from the U.S. Presidency.
– This U.S. City’s largest airport is named for a world-war II hero; its second largest for a
world-war II battle.
§  These mistakes are typically obvious to people
– Even when they don’t know the answer
– Watson isn’t stupid, it solves problems differently
– Often these multiple perspectives can combine productively
•  E.g. add a “dismiss” button to the answer interface
Richard Nixon
Dolly Madison
Pat Nixon
Watson can
adapt and learn
from its users!
© 2011 IBM Corporation
Cut to the chase…..
Watson emerges victorious
© 2011 IBM Corporation
Technology marches forward…
© 2011 IBM Corporation
Adapt Watson
Models
Answer &
Confidence
Question
Evidence
Sources
Models
Models
Models
Models
Models
Answer
Sources
. . .
Answer
Scoring
Primary
Search
Candidate
Answer
Generation
Hypothesis
Generation
Hypothesis and
Evidence Scoring
Final Confidence
Merging &
Ranking
Synthesis
Question &
Topic
Analysis
Question
Decomposition
Evidence
Retrieval
Deep
Evidence
Scoring
Hypothesis
Generation
Hypothesis and Evidence
Scoring
Learned Models
help combine and
weigh the Evidence
What does it take to use Watson in a new domain?
(medical diagnosis, call centers, etc...)
Gathering significant numbers of
question-answer pairs is proving
to be one of the most significant
challenges for adapting Watson.
Can the social web help?
Community
created!
© 2011 IBM Corporation
Integrating Watson in Social Interaction?
Did you hear about Bob?
No
He’s taking a year off to climb
the tallest mountain!
The tallest mountain is
Mount Everest.
Wow.
me
me
Jeff
Watson
Jeff
© 2011 IBM Corporation
Privacy – a blessing and a curse
Need to protect our data, but…
Crime on the web, the social web, is very real
Identity theft
Credit card, bank, insurance fraud
Terrorist networks
Medical diagnosis
Monitoring your profile for health-related information
ICT for depression
Calendar, appointments, traffic, spreading disease
© 2011 IBM Corporation
The arrival of Cognitive Computing
Learn & improve. The core of Watson is a group of
over 100 independent algorithms that approximate a
solution to the “is this the right answer to the question”
problem. Achieving winning (human expert)
performance, required two hallmarks of cognitive
computing systems: a metric to measure improvements
to the system (the winners cloud), and a significant
ground truth (over 200K Q-A pairs).
Speed&Scale. Watson used big data, as well as a
3000 node cluster for massive computation to get
answering speeds down into the 2s range.
Interact in a natural way. Watson was a significant
step forward in natural language understanding, the
most basic interface for humans. Say goodbye to
your mouse…
Assist & augment human cognition. Watson
depended on primarily a set of background
documents (the corpus). The value of having access
to this kind of fact-finding power over a large (and
possibly changing) corpus provides a clear
augmentation to human abilities.
© 2011 IBM Corporation
The arrival of Cognitive Computing
Learn & improve. The core of Watson is a group of
over 100 independent algorithms that approximate a
solution to the “is this the right answer to the question”
problem. Achieving winning (human expert)
performance, required two hallmarks of cognitive
computing systems: a metric to measure improvements
to the system (the winners cloud), and a significant
ground truth (over 200K Q-A pairs).
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
% Answered
© 2011 IBM Corporation
The arrival of Cognitive Computing
Assist & augment human cognition. Watson
depended on primarily a set of background
documents (the corpus). The value of having access
to this kind of fact-finding power over a large (and
possibly changing) corpus provides a clear
augmentation to human abilities.
UTI
Diabetes
Influenza
hypokalemia
Renal failure
esophogitis
Diagnosis	
  Models	
   Confidence	
  
Most	
  Confident	
  Diagnosis:	
  UTI	
  
	
  
Symptoms	
  
Tests/Findings	
  
Medica4ons	
  
Family	
  History	
  
Notes/Hypotheses	
  
Huge	
  Volumes	
  of	
  Texts,	
  
Journals,	
  References,	
  DBs	
  
etc.	
  
Pa4ent	
  History	
  
© 2011 IBM Corporation
The arrival of Cognitive Computing
Speed&Scale. Watson used big data, as well as a
3000 node cluster for massive computation to get
answering speeds down into the 2s range.
© 2011 IBM Corporation
The arrival of Cognitive Computing
Interact in a natural way. Watson was a significant
step forward in natural language understanding, the
most basic interface for humans. Say goodbye to
your mouse…
© 2011 IBM Corporation
The arrival of Cognitive Computing
Learn & improve. The core of Watson is a group of
over 100 independent algorithms that approximate a
solution to the “is this the right answer to the question”
problem. Achieving winning (human expert)
performance, required two hallmarks of cognitive
computing systems: a metric to measure improvements
to the system (the winners cloud), and a significant
ground truth (over 200K Q-A pairs).
Speed&Scale. Watson used big data, as well as a
3000 node cluster for massive computation to get
answering speeds down into the 2s range.
Interact in a natural way. Watson was a significant
step forward in natural language understanding, the
most basic interface for humans. Say goodbye to
your mouse…
Assist & augment human cognition. Watson
depended on primarily a set of background
documents (the corpus). The value of having access
to this kind of fact-finding power over a large (and
possibly changing) corpus provides a clear
augmentation to human abilities.
© 2011 IBM Corporation
…and for Social Web
§  First and foremost, social web analytics (e.g. recommendations) and Social
Computing in general lie clearly in the realm of Cognitive Computing
– Uncertainty, natural language, human intelligence
– Inexact solutions that can improve with time, training
– Problems & solutions need metrics to be solvable
§  All cognitive computing systems require ground truth data
– This data is expensive to collect
– Crowdsourcing is a key new technology/approach
§  The user interface moving closer to people
– Natural language, speech, gestures
– In addition, integrating the collection of training data seamlessly into the interface
is a key development
§  Cognitive computing systems require integration of multiple, disparate, data
sources
– Structured, unstructured, semi-structured
– curated, crowdsourced

More Related Content

Similar to Lecture 6: Watson and the Social Web (2014), Chris Welty

Ibm watson - how it works, and what it means for society beyond winning jeo...
Ibm   watson - how it works, and what it means for society beyond winning jeo...Ibm   watson - how it works, and what it means for society beyond winning jeo...
Ibm watson - how it works, and what it means for society beyond winning jeo...Rick Bouter
 
Sis sat 1000 josh dreller
Sis sat 1000 josh drellerSis sat 1000 josh dreller
Sis sat 1000 josh drellerMediaPost
 
Fix What Matters
Fix What MattersFix What Matters
Fix What MattersEd Bellis
 
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!Tony Pearson
 
HarambeeNet: Data by the people, for the people
HarambeeNet: Data by the people, for the peopleHarambeeNet: Data by the people, for the people
HarambeeNet: Data by the people, for the peopleMichael Bernstein
 
20211103 jim spohrer oecd ai_science_productivity_panel v5
20211103 jim spohrer oecd ai_science_productivity_panel v520211103 jim spohrer oecd ai_science_productivity_panel v5
20211103 jim spohrer oecd ai_science_productivity_panel v5ISSIP
 
Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...
Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...
Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...Sri Ambati
 
Artificial Intelligence is back, Deep Learning Networks and Quantum possibili...
Artificial Intelligence is back, Deep Learning Networks and Quantum possibili...Artificial Intelligence is back, Deep Learning Networks and Quantum possibili...
Artificial Intelligence is back, Deep Learning Networks and Quantum possibili...John Mathon
 
Step-by-step approach to question answering
Step-by-step approach to question answeringStep-by-step approach to question answering
Step-by-step approach to question answeringNAVER Engineering
 
Sp14 cs188 lecture 1 - introduction
Sp14 cs188 lecture 1  - introductionSp14 cs188 lecture 1  - introduction
Sp14 cs188 lecture 1 - introductionAmer Noureddin
 
Big data new physics giga om structure conference ny - march 2011
Big data new physics   giga om structure conference ny - march 2011Big data new physics   giga om structure conference ny - march 2011
Big data new physics giga om structure conference ny - march 2011Jeff Jonas
 
The Near Future: AI in 2024
The Near Future: AI in 2024The Near Future: AI in 2024
The Near Future: AI in 2024JosiahSeaman1
 
AI and Education 20240327 v16 for Northeastern.pptx
AI and Education 20240327 v16 for Northeastern.pptxAI and Education 20240327 v16 for Northeastern.pptx
AI and Education 20240327 v16 for Northeastern.pptxISSIP
 
Spohrer SIRs 20230511 v16.pptx
Spohrer SIRs 20230511 v16.pptxSpohrer SIRs 20230511 v16.pptx
Spohrer SIRs 20230511 v16.pptxISSIP
 
AI Fables, Facts and Futures: Threat, Promise or Saviour
AI Fables, Facts and Futures: Threat, Promise or SaviourAI Fables, Facts and Futures: Threat, Promise or Saviour
AI Fables, Facts and Futures: Threat, Promise or SaviourUniversity of Hertfordshire
 

Similar to Lecture 6: Watson and the Social Web (2014), Chris Welty (20)

IBM Watson-How it works
IBM Watson-How it worksIBM Watson-How it works
IBM Watson-How it works
 
Ibm watson - how it works, and what it means for society beyond winning jeo...
Ibm   watson - how it works, and what it means for society beyond winning jeo...Ibm   watson - how it works, and what it means for society beyond winning jeo...
Ibm watson - how it works, and what it means for society beyond winning jeo...
 
Watson how it works?
Watson how it works?Watson how it works?
Watson how it works?
 
Sis sat 1000 josh dreller
Sis sat 1000 josh drellerSis sat 1000 josh dreller
Sis sat 1000 josh dreller
 
Watson System
Watson SystemWatson System
Watson System
 
Upmc tpdev7
Upmc tpdev7Upmc tpdev7
Upmc tpdev7
 
Artificial intelligence
Artificial intelligenceArtificial intelligence
Artificial intelligence
 
Fix What Matters
Fix What MattersFix What Matters
Fix What Matters
 
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
 
HarambeeNet: Data by the people, for the people
HarambeeNet: Data by the people, for the peopleHarambeeNet: Data by the people, for the people
HarambeeNet: Data by the people, for the people
 
20211103 jim spohrer oecd ai_science_productivity_panel v5
20211103 jim spohrer oecd ai_science_productivity_panel v520211103 jim spohrer oecd ai_science_productivity_panel v5
20211103 jim spohrer oecd ai_science_productivity_panel v5
 
Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...
Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...
Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...
 
Artificial Intelligence is back, Deep Learning Networks and Quantum possibili...
Artificial Intelligence is back, Deep Learning Networks and Quantum possibili...Artificial Intelligence is back, Deep Learning Networks and Quantum possibili...
Artificial Intelligence is back, Deep Learning Networks and Quantum possibili...
 
Step-by-step approach to question answering
Step-by-step approach to question answeringStep-by-step approach to question answering
Step-by-step approach to question answering
 
Sp14 cs188 lecture 1 - introduction
Sp14 cs188 lecture 1  - introductionSp14 cs188 lecture 1  - introduction
Sp14 cs188 lecture 1 - introduction
 
Big data new physics giga om structure conference ny - march 2011
Big data new physics   giga om structure conference ny - march 2011Big data new physics   giga om structure conference ny - march 2011
Big data new physics giga om structure conference ny - march 2011
 
The Near Future: AI in 2024
The Near Future: AI in 2024The Near Future: AI in 2024
The Near Future: AI in 2024
 
AI and Education 20240327 v16 for Northeastern.pptx
AI and Education 20240327 v16 for Northeastern.pptxAI and Education 20240327 v16 for Northeastern.pptx
AI and Education 20240327 v16 for Northeastern.pptx
 
Spohrer SIRs 20230511 v16.pptx
Spohrer SIRs 20230511 v16.pptxSpohrer SIRs 20230511 v16.pptx
Spohrer SIRs 20230511 v16.pptx
 
AI Fables, Facts and Futures: Threat, Promise or Saviour
AI Fables, Facts and Futures: Threat, Promise or SaviourAI Fables, Facts and Futures: Threat, Promise or Saviour
AI Fables, Facts and Futures: Threat, Promise or Saviour
 

More from Lora Aroyo

NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdfNeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdfLora Aroyo
 
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine LearningCATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine LearningLora Aroyo
 
Harnessing Human Semantics at Scale (updated)
Harnessing Human Semantics at Scale (updated)Harnessing Human Semantics at Scale (updated)
Harnessing Human Semantics at Scale (updated)Lora Aroyo
 
Data excellence: Better data for better AI
Data excellence: Better data for better AIData excellence: Better data for better AI
Data excellence: Better data for better AILora Aroyo
 
CHIP Demonstrator presentation @ CATCH Symposium
CHIP Demonstrator presentation @ CATCH SymposiumCHIP Demonstrator presentation @ CATCH Symposium
CHIP Demonstrator presentation @ CATCH SymposiumLora Aroyo
 
Semantic Web Challenge: CHIP Demonstrator
Semantic Web Challenge: CHIP DemonstratorSemantic Web Challenge: CHIP Demonstrator
Semantic Web Challenge: CHIP DemonstratorLora Aroyo
 
The Rijksmuseum Collection as Linked Data
The Rijksmuseum Collection as Linked DataThe Rijksmuseum Collection as Linked Data
The Rijksmuseum Collection as Linked DataLora Aroyo
 
Keynote at International Conference of Art Libraries 2018 @Rijksmuseum
Keynote at International Conference of Art Libraries 2018 @RijksmuseumKeynote at International Conference of Art Libraries 2018 @Rijksmuseum
Keynote at International Conference of Art Libraries 2018 @RijksmuseumLora Aroyo
 
FAIRview: Responsible Video Summarization @NYCML'18
FAIRview: Responsible Video Summarization @NYCML'18FAIRview: Responsible Video Summarization @NYCML'18
FAIRview: Responsible Video Summarization @NYCML'18Lora Aroyo
 
Understanding bias in video news & news filtering algorithms
Understanding bias in video news & news filtering algorithmsUnderstanding bias in video news & news filtering algorithms
Understanding bias in video news & news filtering algorithmsLora Aroyo
 
StorySourcing: Telling Stories with Humans & Machines
StorySourcing: Telling Stories with Humans & MachinesStorySourcing: Telling Stories with Humans & Machines
StorySourcing: Telling Stories with Humans & MachinesLora Aroyo
 
Data Science with Humans in the Loop
Data Science with Humans in the LoopData Science with Humans in the Loop
Data Science with Humans in the LoopLora Aroyo
 
Digital Humanities Benelux 2017: Keynote Lora Aroyo
Digital Humanities Benelux 2017: Keynote Lora AroyoDigital Humanities Benelux 2017: Keynote Lora Aroyo
Digital Humanities Benelux 2017: Keynote Lora AroyoLora Aroyo
 
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...Lora Aroyo
 
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017Lora Aroyo
 
My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort ZoneMy ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort ZoneLora Aroyo
 
Data Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityData Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityLora Aroyo
 
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New SearchSXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New SearchLora Aroyo
 
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age
Europeana GA 2016: Harnessing Crowds, Niches & Professionals  in the Digital AgeEuropeana GA 2016: Harnessing Crowds, Niches & Professionals  in the Digital Age
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital AgeLora Aroyo
 
"Video Killed the Radio Star": From MTV to Snapchat
"Video Killed the Radio Star": From MTV to Snapchat"Video Killed the Radio Star": From MTV to Snapchat
"Video Killed the Radio Star": From MTV to SnapchatLora Aroyo
 

More from Lora Aroyo (20)

NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdfNeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf
 
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine LearningCATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning
 
Harnessing Human Semantics at Scale (updated)
Harnessing Human Semantics at Scale (updated)Harnessing Human Semantics at Scale (updated)
Harnessing Human Semantics at Scale (updated)
 
Data excellence: Better data for better AI
Data excellence: Better data for better AIData excellence: Better data for better AI
Data excellence: Better data for better AI
 
CHIP Demonstrator presentation @ CATCH Symposium
CHIP Demonstrator presentation @ CATCH SymposiumCHIP Demonstrator presentation @ CATCH Symposium
CHIP Demonstrator presentation @ CATCH Symposium
 
Semantic Web Challenge: CHIP Demonstrator
Semantic Web Challenge: CHIP DemonstratorSemantic Web Challenge: CHIP Demonstrator
Semantic Web Challenge: CHIP Demonstrator
 
The Rijksmuseum Collection as Linked Data
The Rijksmuseum Collection as Linked DataThe Rijksmuseum Collection as Linked Data
The Rijksmuseum Collection as Linked Data
 
Keynote at International Conference of Art Libraries 2018 @Rijksmuseum
Keynote at International Conference of Art Libraries 2018 @RijksmuseumKeynote at International Conference of Art Libraries 2018 @Rijksmuseum
Keynote at International Conference of Art Libraries 2018 @Rijksmuseum
 
FAIRview: Responsible Video Summarization @NYCML'18
FAIRview: Responsible Video Summarization @NYCML'18FAIRview: Responsible Video Summarization @NYCML'18
FAIRview: Responsible Video Summarization @NYCML'18
 
Understanding bias in video news & news filtering algorithms
Understanding bias in video news & news filtering algorithmsUnderstanding bias in video news & news filtering algorithms
Understanding bias in video news & news filtering algorithms
 
StorySourcing: Telling Stories with Humans & Machines
StorySourcing: Telling Stories with Humans & MachinesStorySourcing: Telling Stories with Humans & Machines
StorySourcing: Telling Stories with Humans & Machines
 
Data Science with Humans in the Loop
Data Science with Humans in the LoopData Science with Humans in the Loop
Data Science with Humans in the Loop
 
Digital Humanities Benelux 2017: Keynote Lora Aroyo
Digital Humanities Benelux 2017: Keynote Lora AroyoDigital Humanities Benelux 2017: Keynote Lora Aroyo
Digital Humanities Benelux 2017: Keynote Lora Aroyo
 
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
 
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017
 
My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort ZoneMy ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone
 
Data Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityData Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden University
 
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New SearchSXSW2017 @NewDutchMedia Talk: Exploration is the New Search
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
 
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age
Europeana GA 2016: Harnessing Crowds, Niches & Professionals  in the Digital AgeEuropeana GA 2016: Harnessing Crowds, Niches & Professionals  in the Digital Age
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age
 
"Video Killed the Radio Star": From MTV to Snapchat
"Video Killed the Radio Star": From MTV to Snapchat"Video Killed the Radio Star": From MTV to Snapchat
"Video Killed the Radio Star": From MTV to Snapchat
 

Recently uploaded

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdfChristopherTHyatt
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 

Recently uploaded (20)

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 

Lecture 6: Watson and the Social Web (2014), Chris Welty

  • 1. © 2011 IBM Corporation Watson and the Social Web Chris Welty IBM Watson Group ibmwatson.com Do Not Record. Do Not Distribute.
  • 2. © 2011 IBM Corporation What is Cognitive Computing? §  Increasingly, machines are being asked to add their computational power to problems which are not inherently solvable §  Traditionally, these problems came from AI – The hardest AI problems are the easiest for human intelligence: vision, speech, natural language – these are not actually associated with “being intelligent” – Human intelligence provides solutions, but does not scale §  Cognitive Computing is founded on four principles Learn & improve. Cognitive computing systems focus on inexact solutions to unsolvable problems that utilize machine learning and improve over time. Often they combine multiple approaches and must integrate them effectively. They must learn from humans, in more and more seamless ways. Speed&Scale. Cognitive computing harnesses the clear advantage machines have over humans in their ability to perform mundane tasks of arbitrary complexity repeatedly, whether it is the scale of the data or the complexity of the task. Interact in a natural way. Cognitive computing provides technologies that support a higher level of human cognition by adapting to human approaches and interfaces...over the next several decades it will incorporate essentially all the ways humans sense and interact. Assist & augment human cognition. Cognitive computing addresses problems that lie squarely in the province of human intelligence, but where we can't handle the volume of information, penetrate the complexity or otherwise extend our reach (physically). The goal is to be useful, not universally correct. or Computers can be incorrect and still prove useful!
  • 3. © 2011 IBM Corporation Examples of Cognitive Computing § Web Search § Image Search § Event Search § Recommendations § Natural Language Processing
  • 4. © 2011 IBM Corporation What is Watson? §  Open Domain Question-Answering Machine §  Given –  Rich Natural Language Questions –  Over a Broad Domain of Knowledge §  Delivers –  Precise Answers: Determine what is being asked & give precise response –  Accurate Confidences: Determine likelihood answer is correct –  Consumable Justifications: Explain why the answer is right –  Fast Response Time: Precision & Confidence in <3 seconds –  At the level of human experts – Proved its mettle in a televised match –  Won a 2-game Jeopardy match against the all-time winners –  viewed by over 50,000,000 4
  • 5. © 2011 IBM Corporation What is Jeopardy? § Jeopardy! is an American quiz show – 1964 – Today – Household name in U.S. § answer-and-question format – contestants are presented with clues in the form of answers – must phrase their responses in question form. – Open domain trivia questions, speed is a big factor §  Example –  Category: General Science –  Clue: When hit by electrons, a phosphor gives off electromagnetic energy in this form –  Answer: What is light?
  • 6. © 2011 IBM Corporation Social Computing: What’s the connection? § Social Web as Data Source: – The vast majority of sources Watson used to answer questions came from community-created data –  Adapting Watson to a new problem requires the same kind of information about that problem §  Social Machines: –  Watson combined with people is a powerful proposition §  Social Web as Application: –  Watson’s major advance is in understanding natural language, the technology can be useful to augment social interaction
  • 7. © 2011 IBM Corporation $200 If you are looking at the wainscoating, you are looking in this direction. $1000 The first person mentioned by name in ‘The Man in the Iron Mask’ is this hero of a previous book by the same author. 7 The Jeopardy! Challenge Hard for humans, hard for machines Broad/Open Domain Complex Language High Precision Accurate Confidence High Speed $600 In cell division, mitosis splits the nucleus & cytokinesis splits this liquid cushioning the nucleus $800 The conspirators against this man were wounded by each other while they stabbed at him But hard for different reasons. For people, the challenge is knowing the answer For machines, the challenge is understanding the question What is down? Who is D’Artagnan? What is cytoplasm? Who is Julius Caesar?
  • 8. © 2011 IBM Corporation The Winner’s Cloud What It Takes to compete against Top Human Jeopardy! Players Winning Human Performance 2007 QA Computer System Grand Champion Human Performance Top human players are remarkably good. Each dot – actual historical human Jeopardy! games More Confident Less Confident Develop against a metric!
  • 9. © 2011 IBM Corporation 2007 QA Computer System In 2007, we committed to making a Huge Leap! More Confident Less Confident Each dot – actual historical human Jeopardy! games Computers? Not So Good. Winning Human Performance Grand Champion Human Performance The Winner’s Cloud What It Takes to compete against Top Human Jeopardy! Players
  • 10. © 2011 IBM Corporation DeepQA: The Technology Behind Watson An example of a new software paradigm . . . Answer Scoring Models Answer & Confidence Question Evidence Sources Models Models Models Models ModelsPrimary Search Candidate Answer Generation Hypothesis Generation Hypothesis and Evidence Scoring Final Confidence Merging & Ranking Synthesis Answer Sources Question & Topic Analysis Question Decomposition Evidence Retrieval Deep Evidence Scoring Hypothesis Generation Hypothesis and Evidence Scoring Learned Models help combine and weigh the Evidence DeepQA generates and scores many hypotheses using an extensible collection of Natural Language Processing, Machine Learning and Reasoning Algorithms. These gather and weigh evidence over both unstructured and structured content to determine the answer with the best confidence. Content from Community Resources!
  • 11. © 2011 IBM Corporation Example Question In 1894 C.W. Post created his warm cereal drink Postum in this Michigan city Related Content (Structured & Unstructured) Primary Search 1985 Post Foods aramour General Foods Grand Rapids … Battle Creek … … Candidate Answer Generation 1)  Battle Creek (0.85) 2)  Post Foods ( 0.20) 3)  1985 (0.05) Merging & Ranking Evidence Retrieval Question Analysis Keywords: 1894, C.W. Post, created … Lexical AnswerType: (Michingan city) Date(1894) Relations: Create(Post, cereal drink) … [0.58 0 -1.3 … 0.97] [0.71 1 13.4 … 0.72] [0.12 0 2.0 … 0.40] [0.84 1 10.6 … 0.21] [0.33 0 6.3 … 0.83] [0.21 1 11.1 … 0.92] [0.91 0 -8.2 … 0.61] [0.91 0 -1.7 … 0.60] Evidence Scoring Need thousands of Q/A pairs for training!
  • 12. © 2011 IBM Corporation Planet Fitness Role  of  Answer  Typing  in  QA   Type Information - a crucial hint to get the correct answer ASTRONOMY:  In  1610  Galileo  named  the  moons  of  this  planet   for  the  Medici  brothers   Telescope Giovanni Medici Sidereus Nuncius Jupiter Ganymede Telescope (Instrument) Giovanni Medici (Person) Sidereus Nuncius (Book) Jupiter (Planet) Ganymede (Moon) Terms  Associated  with  Clue  Context    (e.g.  via  Keyword  Search)   Planet Fitness (Planet)
  • 13. © 2011 IBM Corporation §  This  fish  was  thought  to  be  exLnct  millions  of  years  ago     unLl  one  was  found  off  South  Africa  in  1938       §  Category:  ENDS  IN  "TH"     §  Answer:   §  When  hit  by  electrons,  a  phosphor  gives  off  electromagneLc   energy  in  this  form   §  Category:  General  Science   §  Answer:     §  Secy.  Chase  just  submiXed  this  to  me  for  the  third  Lme-­‐-­‐guess   what,  pal.  This  Lme  I'm  accepLng  it     §  Category:  Lincoln  Blogs   §  Answer:     The type of thing being asked for is often indicated but can go from specific to very vague coelacanth   light  (or  photons)   his  resigna4on   13   Answer Typing for Jeopardy!?
  • 14. © 2011 IBM Corporation Broad Domain Our Focus is on reusable NLP technology for analyzing vast volumes of as-is text. Structured sources (DBs and KBs) provide background knowledge for interpreting the text. We do NOT attempt to anticipate all questions and build databases. We do NOT try to build a formal model of the world
  • 15. © 2011 IBM Corporation Sources for typing evidence § DbPedia & Freebase – Wide coverage of well-known entities – Taxonomy (MountainsOfNepal → Mountain) – Good type coverage, but not many synonyms •  E.g. what about “summit” §  Wikpedia Categories –  Wide coverage of entities and type name synonyms –  Noisy (many errors) §  Wikipedia Intro –  First sentence always indicates the most common type of the entity –  Highly reliable, low coverage of types Communities can scale data collection!
  • 16. © 2011 IBM Corporation Typing  Impact  on  Jeopardy!  clues   61.5% 62.0% 62.5% 63.0% 63.5% 64.0% 64.5% 65.0% 65.5% 66.0% 66.5% An ensemble of TyCor components + ~10%
  • 17. © 2011 IBM Corporation Many sources of evidence In 1894 C.W. Post created his warm cereal drink Postum in this Michigan city Related Content (Structured & Unstructured) Primary Search 1985 Post Foods aramour General Foods Grand Rapids … Battle Creek … … Candidate Answer Generation 1)  Battle Creek (0.85) 2)  Post Foods ( 0.20) 3)  1985 (0.05) Merging & Ranking Evidence Retrieval Question Analysis Keywords: 1894, C.W. Post, created … Lexical AnswerType: (Michingan city) Date(1894) Relations: Create(Post, cereal drink) … [0.58 0 -1.3 … 0.97] [0.71 1 13.4 … 0.72] [0.12 0 2.0 … 0.40] [0.84 1 10.6 … 0.21] [0.33 0 6.3 … 0.83] [0.21 1 11.1 … 0.92] [0.91 0 -8.2 … 0.61] [0.91 0 -1.7 … 0.60] Evidence Scoring
  • 18. © 2011 IBM Corporation Watson as part of a social machine §  Watson makes mistakes: – This woman was the first to witness her husband resign from the U.S. Presidency. – This U.S. City’s largest airport is named for a world-war II hero; its second largest for a world-war II battle. §  These mistakes are typically obvious to people – Even when they don’t know the answer – Watson isn’t stupid, it solves problems differently – Often these multiple perspectives can combine productively •  E.g. add a “dismiss” button to the answer interface Richard Nixon Dolly Madison Pat Nixon Watson can adapt and learn from its users!
  • 19. © 2011 IBM Corporation Cut to the chase….. Watson emerges victorious
  • 20. © 2011 IBM Corporation Technology marches forward…
  • 21. © 2011 IBM Corporation Adapt Watson Models Answer & Confidence Question Evidence Sources Models Models Models Models Models Answer Sources . . . Answer Scoring Primary Search Candidate Answer Generation Hypothesis Generation Hypothesis and Evidence Scoring Final Confidence Merging & Ranking Synthesis Question & Topic Analysis Question Decomposition Evidence Retrieval Deep Evidence Scoring Hypothesis Generation Hypothesis and Evidence Scoring Learned Models help combine and weigh the Evidence What does it take to use Watson in a new domain? (medical diagnosis, call centers, etc...) Gathering significant numbers of question-answer pairs is proving to be one of the most significant challenges for adapting Watson. Can the social web help? Community created!
  • 22. © 2011 IBM Corporation Integrating Watson in Social Interaction? Did you hear about Bob? No He’s taking a year off to climb the tallest mountain! The tallest mountain is Mount Everest. Wow. me me Jeff Watson Jeff
  • 23. © 2011 IBM Corporation Privacy – a blessing and a curse Need to protect our data, but… Crime on the web, the social web, is very real Identity theft Credit card, bank, insurance fraud Terrorist networks Medical diagnosis Monitoring your profile for health-related information ICT for depression Calendar, appointments, traffic, spreading disease
  • 24. © 2011 IBM Corporation The arrival of Cognitive Computing Learn & improve. The core of Watson is a group of over 100 independent algorithms that approximate a solution to the “is this the right answer to the question” problem. Achieving winning (human expert) performance, required two hallmarks of cognitive computing systems: a metric to measure improvements to the system (the winners cloud), and a significant ground truth (over 200K Q-A pairs). Speed&Scale. Watson used big data, as well as a 3000 node cluster for massive computation to get answering speeds down into the 2s range. Interact in a natural way. Watson was a significant step forward in natural language understanding, the most basic interface for humans. Say goodbye to your mouse… Assist & augment human cognition. Watson depended on primarily a set of background documents (the corpus). The value of having access to this kind of fact-finding power over a large (and possibly changing) corpus provides a clear augmentation to human abilities.
  • 25. © 2011 IBM Corporation The arrival of Cognitive Computing Learn & improve. The core of Watson is a group of over 100 independent algorithms that approximate a solution to the “is this the right answer to the question” problem. Achieving winning (human expert) performance, required two hallmarks of cognitive computing systems: a metric to measure improvements to the system (the winners cloud), and a significant ground truth (over 200K Q-A pairs). 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% % Answered
  • 26. © 2011 IBM Corporation The arrival of Cognitive Computing Assist & augment human cognition. Watson depended on primarily a set of background documents (the corpus). The value of having access to this kind of fact-finding power over a large (and possibly changing) corpus provides a clear augmentation to human abilities. UTI Diabetes Influenza hypokalemia Renal failure esophogitis Diagnosis  Models   Confidence   Most  Confident  Diagnosis:  UTI     Symptoms   Tests/Findings   Medica4ons   Family  History   Notes/Hypotheses   Huge  Volumes  of  Texts,   Journals,  References,  DBs   etc.   Pa4ent  History  
  • 27. © 2011 IBM Corporation The arrival of Cognitive Computing Speed&Scale. Watson used big data, as well as a 3000 node cluster for massive computation to get answering speeds down into the 2s range.
  • 28. © 2011 IBM Corporation The arrival of Cognitive Computing Interact in a natural way. Watson was a significant step forward in natural language understanding, the most basic interface for humans. Say goodbye to your mouse…
  • 29. © 2011 IBM Corporation The arrival of Cognitive Computing Learn & improve. The core of Watson is a group of over 100 independent algorithms that approximate a solution to the “is this the right answer to the question” problem. Achieving winning (human expert) performance, required two hallmarks of cognitive computing systems: a metric to measure improvements to the system (the winners cloud), and a significant ground truth (over 200K Q-A pairs). Speed&Scale. Watson used big data, as well as a 3000 node cluster for massive computation to get answering speeds down into the 2s range. Interact in a natural way. Watson was a significant step forward in natural language understanding, the most basic interface for humans. Say goodbye to your mouse… Assist & augment human cognition. Watson depended on primarily a set of background documents (the corpus). The value of having access to this kind of fact-finding power over a large (and possibly changing) corpus provides a clear augmentation to human abilities.
  • 30. © 2011 IBM Corporation …and for Social Web §  First and foremost, social web analytics (e.g. recommendations) and Social Computing in general lie clearly in the realm of Cognitive Computing – Uncertainty, natural language, human intelligence – Inexact solutions that can improve with time, training – Problems & solutions need metrics to be solvable §  All cognitive computing systems require ground truth data – This data is expensive to collect – Crowdsourcing is a key new technology/approach §  The user interface moving closer to people – Natural language, speech, gestures – In addition, integrating the collection of training data seamlessly into the interface is a key development §  Cognitive computing systems require integration of multiple, disparate, data sources – Structured, unstructured, semi-structured – curated, crowdsourced