SlideShare uma empresa Scribd logo
1 de 33
Baixar para ler offline
A	
  Hybrid	
  Diagnosis	
  Approach	
  Combining	
  
Black-­‐Box	
  and	
  White-­‐Box	
  Reasoning	
  	
  
Mingmin	
  Chen1	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Shizhuo	
  Yu1	
  	
  	
  	
  	
  
Nico	
  Franz2	
  	
  	
  	
  	
  	
  	
  Shawn	
  Bowers3	
  	
  	
  	
  	
  	
  	
  Bertram	
  Ludäscher1	
  4	
  
	
  
	
  
1	
  Department	
  of	
  Computer	
  Science	
  ,	
  University	
  of	
  California,	
  Davis	
  
2	
  School	
  of	
  Life	
  Sciences,	
  Arizona	
  State	
  University	
  
3	
  Department	
  of	
  Computer	
  Science,	
  Gonzaga	
  University	
  
4	
  GSLIS	
  &	
  NCSA,	
  University	
  of	
  Illinois	
  at	
  Urbana-­‐Champaign	
  
Outline	
  
•  The	
  Taxonomy	
  Alignment	
  Problem	
  
T1	
  +	
  T2	
  +	
  A	
  =>	
  T3	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  (ambiguous	
  ..	
  unique	
  ..	
  inconsistent)	
  
•  Model-­‐based	
  Diagnosis	
  [Reiter’87]	
  
– Black-­‐box	
  
•  Hybrid	
  Approach	
  
– Black-­‐box	
  &	
  White-­‐box	
  combined	
  
•  Benchmark	
  Results	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   2	
  
Meet	
  Prof.	
  Nico	
  Franz:	
  	
  
	
  Curator	
  of	
  Insects	
  @	
  ASU	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   3	
  
What	
  Nico	
  et	
  al.	
  do	
  for	
  a	
  living	
  …	
  	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   4	
  
 	
  What	
  Nico	
  does	
  for	
  a	
  living	
  (cont’d):	
  
The	
  Indoors	
  Part	
  
•  First:	
  go	
  fun	
  places,	
  find	
  new	
  bugs,	
  study	
  them	
  …	
  
–  “Bugs-­‐R-­‐Us”	
  (see	
  taxonbytes.org)	
  
•  Now:	
  Compare,	
  align	
  and	
  revise	
  taxonomies,	
  based	
  on	
  
careful	
  observafon,	
  “character”	
  data,	
  experfse	
  …	
  	
  	
  
•  Formally:	
  
–  Input:	
  T1	
  +	
  T2	
  (taxonomies)	
  +	
  A	
  (expert	
  ar.cula.ons)	
  
	
  
–  Output:	
  revised,	
  “merged”	
  taxonomy	
  (-­‐ies)	
  T3	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   5	
  
•  Given:
–  Taxonomies T1 , T2
•  incl. constraints (coverage, disjointness)
–  Set of articulations (alignment) A
•  Find:
–  Combined (“merged”) taxonomy T3 (= T1 + T2 + A)
•  Is it a taxonomy? Or a DAG?
–  Optional:
•  Final alignment (should be minimal)
6	
  
Taxonomy Alignment Problem (TAP)
T1	
  
T2	
  
A	
  
T3	
  
 	
  TAP:	
  Possible	
  Outcomes	
  
1.a
1.b
isa
1.c
isa
2.d
=
2.e<
<
2.f<
isa
isa
Input	
  Alignment	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   7	
  
 	
  TAP:	
  Possible	
  Outcomes	
  
1.a
1.b
isa
1.c
isa
2.d
=
2.e<
<
2.f<
isa
isa
Input	
  Alignment	
  
{A1, A2, A3, A4}
{A1, A2, A3} {A1, A2, A4} {A1, A3, A4} {A2, A3, A4}
{A1, A2} {A1, A3} {A2, A3} {A1, A4} {A2, A4} {A3, A4}
{A1} {A2} {A3} {A4}
{ }
Inconsistent!	
  	
  	
  
è	
  Diagnosis	
  (Reiter)	
  	
  
=	
  Black-­‐Box	
  
	
  Provenance	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   8	
  
 TAP:	
  Possible	
  Outcomes	
  
1.a
1.b
isa
1.c
isa
2.d
=
2.e<
<
2.f<
isa
isa
Input	
  Alignment	
  
{A1, A2, A3, A4}
{A1, A2, A3} {A1, A2, A4} {A1, A3, A4} {A2, A3, A4}
{A1, A2} {A1, A3} {A2, A3} {A1, A4} {A2, A4} {A3, A4}
{A1} {A2} {A3} {A4}
{ }
Inconsistent!	
  	
  	
  
è	
  Diagnosis	
  (Reiter)	
  	
  
=	
  Black-­‐Box	
  
	
  Provenance	
  
1.b2.e
1.c
1.a
2.d
2.f
Ambiguous!	
  	
  
è	
  Mul5ple	
  
Possible	
  	
  
Worlds	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   9	
  
 TAP:	
  Possible	
  Outcomes	
  
1.a
1.b
isa
1.c
isa
2.d
=
2.e<
<
2.f<
isa
isa
Input	
  Alignment	
  
{A1, A2, A3, A4}
{A1, A2, A3} {A1, A2, A4} {A1, A3, A4} {A2, A3, A4}
{A1, A2} {A1, A3} {A2, A3} {A1, A4} {A2, A4} {A3, A4}
{A1} {A2} {A3} {A4}
{ }
Inconsistent!	
  	
  	
  
è	
  Diagnosis	
  (Reiter)	
  	
  
=	
  Black-­‐Box	
  
	
  Provenance	
  
1.b2.e
1.c
1.a
2.d
2.f
Ambiguous!	
  	
  
è	
  Mul5ple	
  
Possible	
  	
  
Worlds	
  
1.c2.f
1.b
1.a
2.d
2.e
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   10	
  
 TAP:	
  Possible	
  Outcomes	
  
1.a
1.b
isa
1.c
isa
2.d
=
2.e<
<
2.f<
isa
isa
Input	
  Alignment	
  
{A1, A2, A3, A4}
{A1, A2, A3} {A1, A2, A4} {A1, A3, A4} {A2, A3, A4}
{A1, A2} {A1, A3} {A2, A3} {A1, A4} {A2, A4} {A3, A4}
{A1} {A2} {A3} {A4}
{ }
Inconsistent!	
  	
  	
  
è	
  Diagnosis	
  (Reiter)	
  	
  
=	
  Black-­‐Box	
  
	
  Provenance	
  
1.b2.e
1.c
1.a
2.d
2.f
Ambiguous!	
  	
  
è	
  Mul5ple	
  
Possible	
  	
  
Worlds	
  
1.c2.f
1.b
1.a
2.d
2.e
1.b
1.a
2.e
2.d
1.c
2.f
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   11	
  
•  FO	
  reasoning	
  about	
  
taxonomies	
  (MFOL)	
  
•  Earlier:	
  CleanTax	
  	
  
–  Prover9/Mace4	
  
•  Now:	
  Euler	
  
–  ASP	
  Reasoners	
  (DLV,	
  
Clingo)	
  
–  Specialized	
  reasoners	
  
(PyRCC)	
  
–  …	
  	
  
–  X	
  =	
  ASP,	
  RCC,	
  …	
  	
  
Euler/X	
  Toolkit	
  and	
  Workflow	
  	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   12	
  
Real-­‐world	
  examples:	
  Turn	
  this	
  …	
  	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   13	
  
… into this!
(Perellescus Alignment Result)
•  T3 := T1 and T2 are “merged”
–  Blue dashed: overlaps è resolve via “zoom-in view”
14	
  
1.16
1.14
2.40
2.44
2.47
1.11
2.38
2.35
1.20
1.23
2.52
2.53
2.54
1.17
2.41
1.22
2.46
1.25
2.48
1.12
2.36
1.26
2.49
1.13
2.37
1.18
2.42
1.19
2.43
1.15
2.39
1.21
2.45
1.12L
2.36L
1.27
2.50
1.24
2.51
Nodes
Taxonomy 1 5
Taxonomy 2 8
MERGED Taxa 13
Edges
Overlaps 10
Input 24
INFERRED 5
 	
  Possible	
  Outcome:	
  Inconsistency!	
  
1.a
1.b
isa
1.c
isa
2.d
=
2.e<
<
2.f<
isa
isa
Input	
  Alignment	
  
{A1, A2, A3, A4}
{A1, A2, A3} {A1, A2, A4} {A1, A3, A4} {A2, A3, A4}
{A1, A2} {A1, A3} {A2, A3} {A1, A4} {A2, A4} {A3, A4}
{A1} {A2} {A3} {A4}
{ }
Inconsistent!	
  	
  	
  
è	
  Diagnosis	
  (Reiter)	
  	
  
=	
  Black-­‐Box	
  
	
  Provenance	
  
•  Need	
  to	
  debug	
  the	
  
input	
  arfculafons	
  è	
  
(black-­‐box)	
  diagnosis!	
  
•  Focus:	
  
–  How	
  do	
  we	
  efficiently	
  
compute	
  the	
  diagnosfc	
  
lance?	
  
•  Also:	
  
–  How	
  to	
  visualize..	
  	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   15	
  
Example	
  Instance	
  	
  
(from	
  synthefc	
  benchmark	
  suite)	
  
•  Here:	
  N	
  =	
  10	
  taxa	
  in	
  T1,	
  T2	
  
•  Euler/X	
  finds:	
  
inconsistent!	
  
•  è	
  diagnosc	
  la]ce	
  of	
  210	
  
=	
  1024	
  nodes	
  
è Find	
  minimal	
  inconsistent	
  
subset	
  (MIS)	
  	
  
è maximal	
  consistent	
  subset	
  
(MCS)	
  ..	
  	
  
è show	
  to	
  user!	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   16	
  
Visualizing	
  Diagnoses:	
  Scalability	
  
Issues	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   17	
  
N	
  =	
  10	
  arfculafons	
  è	
  210	
  =	
  1024	
  node	
  diagnosfc	
  lance	
  (…	
  ouch!)	
  
	
   	
   	
   	
   	
  …	
  but	
  only	
  one	
  MIS	
  …	
  	
  	
  	
  	
  
Beaer	
  Idea:	
  Just	
  show	
  MIS,	
  MCS	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   18	
  
N	
  =	
  4	
  arfculafons	
  è	
  24	
  =	
  16	
  node	
  diagnosfc	
  lance,	
  
but	
  3	
  MCS	
  and	
  2	
  MIS	
  are	
  enough!	
  
..	
  but	
  4	
  MCS	
  and	
  
1	
  MIC	
  tell	
  it	
  all!	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   19	
  
1024	
  node	
  lance	
  
Visualizing	
  Diagnoses:	
  Focusing	
  on	
  
MIS	
  (and	
  MCS)	
  
Visualizing	
  Diagnoses	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   20	
  
Example	
  from	
  
paper:	
  N=12	
  è	
  
4096	
  nodes	
  
	
  
..	
  but	
  7	
  MCS	
  and	
  	
  
5	
  MIC	
  tell	
  it	
  all!	
  
Black-Box Inconsistency Analysis
(Diagnostic Lattice)
•  Then:
–  Repair: find & revise minimal inconsistent subsets (Min-Incons)
–  Expand: find maximal consistent subsets (Max-Cons) & revise outs
What happens if you
can’t have all (here: 4)
articulations together?
21	
  
•  Black-­‐box	
  Analysis	
  (Hinng	
  Set	
  algo.)	
  yields	
  a	
  Diagnosis	
  (lance)	
  
–  for	
  n=4	
  arfculafons,	
  there	
  are	
  168	
  possible	
  diagnoses	
  
–  depending	
  on	
  expected	
  “red/green	
  areas”	
  è	
  explore	
  space	
  differently	
  
•  |arfculafons|	
  =	
  n	
  è	
  	
  
	
   	
  |possible	
  diagnoses|	
  =	
  |monotonic	
  Boolean	
  funcfons|	
  	
  
	
   	
  =	
  Dedekind	
  Number	
  (n):	
  	
  2,	
  3,	
  6,	
  20,	
  168,	
  7581,	
  7828354,	
  ...	
  	
  
	
  
Inconsistency Analysis (Diagnostic Lattice)
•  The Min-Incons (MIS) and
Max-Cons (MCS) sets
determine all others
è  Repair MIS and/or Expand
MCS
22
Improving	
  Diagnosis	
  
•  Reiter’s	
  “black-­‐box”	
  (model-­‐based)	
  diagnosis	
  
helps	
  debug	
  the	
  arfculafons	
  
•  Limited	
  scalability	
  (inherent	
  complexity)	
  
•  But	
  every	
  bit	
  helps:	
  
– Hinng	
  Set	
  Algorithm	
  (“logarithmic	
  extracfon”)	
  
•  Our	
  idea:	
  
– Exploit	
  “white-­‐box”	
  reasoning	
  informafon	
  
è	
  RULES	
  to	
  the	
  rescue	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   23	
  
Key	
  Idea:	
  exploit	
  white-­‐box	
  info	
  
•  We	
  use	
  Answer	
  Set	
  Programming	
  (ASP)	
  to	
  solve	
  
Taxonomy	
  Alignment	
  Problem	
  (TAP)	
  
•  Inconsistency	
  =	
  “False”	
  is	
  derived	
  in	
  the	
  head:	
  
False	
  :-­‐	
  	
  	
  <denial	
  of	
  integrity	
  constraint>	
  	
  
•  Apply	
  provenance	
  trick	
  from	
  databases	
  J	
  	
  
–  What	
  arfculafons	
  contribute	
  to	
  a	
  derivafon	
  of	
  
“False”	
  ?	
  	
  
–  Eliminate	
  those	
  that	
  don’t!	
  
è	
  an	
  example	
  of	
  reusing	
  inferences	
  across	
  separate	
  
black-­‐box	
  tests!	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   24	
  
The	
  Provenance	
  “Trick”	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   25	
  
Hybrid	
  Provenance	
  
A3:	
  	
  c	
  <	
  f	
  
Black-­‐box	
  Provenance	
  
1.a
1.b
isa
1.c
isa
2.d
=
2.e<
<
2.f<
isa
isa
Input	
  Alignment	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   26	
  
Hybrid	
  Provenance	
  
A3:	
  	
  c	
  <	
  f	
  
Black-­‐box	
  Provenance	
  
r7: d = e ∪ f
a = e ∪ f
A1: a = d
r3: a = b ∪ c
f < c
r4: b ∩ c = ∅ r8: e ∩ f = ∅ A2: b < e
A1+A2	
  +	
  …	
  =>	
  f	
  <	
  c	
  
White-­‐box	
  	
  
Provenance	
  
1.a
1.b
isa
1.c
isa
2.d
=
2.e<
<
2.f<
isa
isa
Input	
  Alignment	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   27	
  
The	
  Hybrid	
  Approach	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   28	
  
Hybrid	
  Approach	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   29	
  
What	
  ar5cula5ons	
  
contribute	
  to	
  some	
  
inconsistency?	
  	
  
Good	
  old	
  black-­‐box	
  
(HST)	
  
Benchmark	
  
Results	
  	
  
•  White-­‐box	
  <	
  Hybrid	
  <	
  Black-­‐box	
  	
  	
  	
  	
  	
  	
  	
  	
   	
  (runfmes)	
  
•  Note:	
  white-­‐box	
  does	
  not	
  give	
  you	
  a	
  diagnosis	
  
•  Potassco	
  <	
  DLV	
  	
  	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   30	
  
Benchmark	
  
DLV	
  
•  White-­‐box	
  <	
  Hybrid	
  <	
  Black-­‐box	
  	
  	
  	
  	
  	
  	
  	
  	
   	
  (runfmes)	
  
•  Potassco	
  <	
  DLV	
  	
  	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   31	
  
Benchmark	
  
Clingo	
  
•  White-­‐box	
  <	
  Hybrid	
  <	
  Black-­‐box	
  	
  	
  	
  	
  	
  	
  	
  	
   	
  (runfmes)	
  
•  Potassco	
  <	
  DLV	
  	
  	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   32	
  
Conclusions	
  
•  ASP	
  rules	
  can	
  be	
  used	
  to	
  efficiently	
  solve	
  real-­‐
world	
  taxonomy	
  reasoning	
  problems	
  
•  Reiter’s	
  diagnosis	
  useful	
  to	
  debug	
  inconsistent	
  
alignments	
  
•  Adding	
  a	
  “white-­‐box”	
  provenance	
  approach	
  
speeds	
  up	
  state-­‐of-­‐the-­‐art	
  HST	
  algorithm	
  by	
  
eliminang	
  independent	
  arculaons	
  
•  Future	
  work:	
  
–  Further	
  improvements,	
  including	
  parallelism:	
  
•  Trade-­‐off	
  with	
  sharing	
  inferences	
  across	
  parallel	
  instances	
  	
  
Hybrid	
  Diagnosis	
  in	
  Euler	
  	
  @	
  	
  RuleML2014,	
  Prague	
   33	
  

Mais conteúdo relacionado

Semelhante a Bertram's talk on Hybrid (Black-box + White-box) Diagnosis at RuleML'14 in Prague

Lecture 8: Machine Learning in Practice (1)
Lecture 8: Machine Learning in Practice (1) Lecture 8: Machine Learning in Practice (1)
Lecture 8: Machine Learning in Practice (1) Marina Santini
 
Memorization of Various Calculator shortcuts
Memorization of Various Calculator shortcutsMemorization of Various Calculator shortcuts
Memorization of Various Calculator shortcutsPrincessNorberte
 
Str t-test1
Str   t-test1Str   t-test1
Str t-test1iamkim
 
Robust outlier detection
Robust outlier detection Robust outlier detection
Robust outlier detection vinnief
 
20130928 automated theorem_proving_harrison
20130928 automated theorem_proving_harrison20130928 automated theorem_proving_harrison
20130928 automated theorem_proving_harrisonComputer Science Club
 
Graph Methods for Generating Test Cases with Universal and Existential Constr...
Graph Methods for Generating Test Cases with Universal and Existential Constr...Graph Methods for Generating Test Cases with Universal and Existential Constr...
Graph Methods for Generating Test Cases with Universal and Existential Constr...Sylvain Hallé
 
optimizedBell.pptx
optimizedBell.pptxoptimizedBell.pptx
optimizedBell.pptxRichard Gill
 
Chi square tests
Chi square testsChi square tests
Chi square testswaqole
 
graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...Rothamsted Research, UK
 
ensembles_emptytemplate_v2
ensembles_emptytemplate_v2ensembles_emptytemplate_v2
ensembles_emptytemplate_v2Shrayes Ramesh
 

Semelhante a Bertram's talk on Hybrid (Black-box + White-box) Diagnosis at RuleML'14 in Prague (20)

qm2CHAP12.ppt
qm2CHAP12.pptqm2CHAP12.ppt
qm2CHAP12.ppt
 
Lecture 8: Machine Learning in Practice (1)
Lecture 8: Machine Learning in Practice (1) Lecture 8: Machine Learning in Practice (1)
Lecture 8: Machine Learning in Practice (1)
 
Memorization of Various Calculator shortcuts
Memorization of Various Calculator shortcutsMemorization of Various Calculator shortcuts
Memorization of Various Calculator shortcuts
 
Chi square test
Chi square test Chi square test
Chi square test
 
Str t-test1
Str   t-test1Str   t-test1
Str t-test1
 
Algorithm Exam Help
Algorithm Exam HelpAlgorithm Exam Help
Algorithm Exam Help
 
Chapter 4 ds
Chapter 4 dsChapter 4 ds
Chapter 4 ds
 
Robust outlier detection
Robust outlier detection Robust outlier detection
Robust outlier detection
 
20130928 automated theorem_proving_harrison
20130928 automated theorem_proving_harrison20130928 automated theorem_proving_harrison
20130928 automated theorem_proving_harrison
 
Graph Methods for Generating Test Cases with Universal and Existential Constr...
Graph Methods for Generating Test Cases with Universal and Existential Constr...Graph Methods for Generating Test Cases with Universal and Existential Constr...
Graph Methods for Generating Test Cases with Universal and Existential Constr...
 
optimizedBell.pptx
optimizedBell.pptxoptimizedBell.pptx
optimizedBell.pptx
 
Algorithm Assignment Help
Algorithm Assignment HelpAlgorithm Assignment Help
Algorithm Assignment Help
 
Chi square tests
Chi square testsChi square tests
Chi square tests
 
Inferential statistics nominal data
Inferential statistics   nominal dataInferential statistics   nominal data
Inferential statistics nominal data
 
CSC446: Pattern Recognition (LN3)
CSC446: Pattern Recognition (LN3)CSC446: Pattern Recognition (LN3)
CSC446: Pattern Recognition (LN3)
 
HPC_NIST_SHA3
HPC_NIST_SHA3HPC_NIST_SHA3
HPC_NIST_SHA3
 
The chi square_test
The chi square_testThe chi square_test
The chi square_test
 
graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...
 
Chapter12
Chapter12Chapter12
Chapter12
 
ensembles_emptytemplate_v2
ensembles_emptytemplate_v2ensembles_emptytemplate_v2
ensembles_emptytemplate_v2
 

Mais de Bertram Ludäscher

Games, Queries, and Argumentation Frameworks: Time for a Family Reunion
Games, Queries, and Argumentation Frameworks: Time for a Family ReunionGames, Queries, and Argumentation Frameworks: Time for a Family Reunion
Games, Queries, and Argumentation Frameworks: Time for a Family ReunionBertram Ludäscher
 
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!Bertram Ludäscher
 
[Flashback] Integration of Active and Deductive Database Rules
[Flashback] Integration of Active and Deductive Database Rules[Flashback] Integration of Active and Deductive Database Rules
[Flashback] Integration of Active and Deductive Database RulesBertram Ludäscher
 
[Flashback] Statelog: Integration of Active & Deductive Database Rules
[Flashback] Statelog: Integration of Active & Deductive Database Rules[Flashback] Statelog: Integration of Active & Deductive Database Rules
[Flashback] Statelog: Integration of Active & Deductive Database RulesBertram Ludäscher
 
Answering More Questions with Provenance and Query Patterns
Answering More Questions with Provenance and Query PatternsAnswering More Questions with Provenance and Query Patterns
Answering More Questions with Provenance and Query PatternsBertram Ludäscher
 
Computational Reproducibility vs. Transparency: Is It FAIR Enough?
Computational Reproducibility vs. Transparency: Is It FAIR Enough?Computational Reproducibility vs. Transparency: Is It FAIR Enough?
Computational Reproducibility vs. Transparency: Is It FAIR Enough?Bertram Ludäscher
 
Which Model Does Not Belong: A Dialogue
Which Model Does Not Belong: A DialogueWhich Model Does Not Belong: A Dialogue
Which Model Does Not Belong: A DialogueBertram Ludäscher
 
From Workflows to Transparent Research Objects and Reproducible Science Tales
From Workflows to Transparent Research Objects and Reproducible Science TalesFrom Workflows to Transparent Research Objects and Reproducible Science Tales
From Workflows to Transparent Research Objects and Reproducible Science TalesBertram Ludäscher
 
From Research Objects to Reproducible Science Tales
From Research Objects to Reproducible Science TalesFrom Research Objects to Reproducible Science Tales
From Research Objects to Reproducible Science TalesBertram Ludäscher
 
Possible Worlds Explorer: Datalog & Answer Set Programming for the Rest of Us
Possible Worlds Explorer: Datalog & Answer Set Programming for the Rest of UsPossible Worlds Explorer: Datalog & Answer Set Programming for the Rest of Us
Possible Worlds Explorer: Datalog & Answer Set Programming for the Rest of UsBertram Ludäscher
 
Deduktive Datenbanken & Logische Programme: Eine kleine Zeitreise
Deduktive Datenbanken & Logische Programme: Eine kleine ZeitreiseDeduktive Datenbanken & Logische Programme: Eine kleine Zeitreise
Deduktive Datenbanken & Logische Programme: Eine kleine ZeitreiseBertram Ludäscher
 
[Flashback 2005] Managing Scientific Data: From Data Integration to Scientifi...
[Flashback 2005] Managing Scientific Data: From Data Integration to Scientifi...[Flashback 2005] Managing Scientific Data: From Data Integration to Scientifi...
[Flashback 2005] Managing Scientific Data: From Data Integration to Scientifi...Bertram Ludäscher
 
Dissecting Reproducibility: A case study with ecological niche models in th...
Dissecting Reproducibility:  A case study with ecological niche models  in th...Dissecting Reproducibility:  A case study with ecological niche models  in th...
Dissecting Reproducibility: A case study with ecological niche models in th...Bertram Ludäscher
 
Incremental Recomputation: Those who cannot remember the past are condemned ...
Incremental Recomputation:  Those who cannot remember the past are condemned ...Incremental Recomputation:  Those who cannot remember the past are condemned ...
Incremental Recomputation: Those who cannot remember the past are condemned ...Bertram Ludäscher
 
Validation and Inference of Schema-Level Workflow Data-Dependency Annotations
Validation and Inference of Schema-Level Workflow Data-Dependency AnnotationsValidation and Inference of Schema-Level Workflow Data-Dependency Annotations
Validation and Inference of Schema-Level Workflow Data-Dependency AnnotationsBertram Ludäscher
 
An ontology-driven framework for data transformation in scientific workflows
An ontology-driven framework for data transformation in scientific workflowsAn ontology-driven framework for data transformation in scientific workflows
An ontology-driven framework for data transformation in scientific workflowsBertram Ludäscher
 
Knowledge Representation & Reasoning and the Hierarchy-of-Hypotheses Approach
Knowledge Representation & Reasoning and the Hierarchy-of-Hypotheses ApproachKnowledge Representation & Reasoning and the Hierarchy-of-Hypotheses Approach
Knowledge Representation & Reasoning and the Hierarchy-of-Hypotheses ApproachBertram Ludäscher
 
Whole-Tale: The Experience of Research
Whole-Tale: The Experience of ResearchWhole-Tale: The Experience of Research
Whole-Tale: The Experience of ResearchBertram Ludäscher
 
ETC & Authors in the Driver's Seat
ETC & Authors in the Driver's SeatETC & Authors in the Driver's Seat
ETC & Authors in the Driver's SeatBertram Ludäscher
 
From Provenance Standards and Tools to Queries and Actionable Provenance
From Provenance Standards and Tools to Queries and Actionable ProvenanceFrom Provenance Standards and Tools to Queries and Actionable Provenance
From Provenance Standards and Tools to Queries and Actionable ProvenanceBertram Ludäscher
 

Mais de Bertram Ludäscher (20)

Games, Queries, and Argumentation Frameworks: Time for a Family Reunion
Games, Queries, and Argumentation Frameworks: Time for a Family ReunionGames, Queries, and Argumentation Frameworks: Time for a Family Reunion
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion
 
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion!
 
[Flashback] Integration of Active and Deductive Database Rules
[Flashback] Integration of Active and Deductive Database Rules[Flashback] Integration of Active and Deductive Database Rules
[Flashback] Integration of Active and Deductive Database Rules
 
[Flashback] Statelog: Integration of Active & Deductive Database Rules
[Flashback] Statelog: Integration of Active & Deductive Database Rules[Flashback] Statelog: Integration of Active & Deductive Database Rules
[Flashback] Statelog: Integration of Active & Deductive Database Rules
 
Answering More Questions with Provenance and Query Patterns
Answering More Questions with Provenance and Query PatternsAnswering More Questions with Provenance and Query Patterns
Answering More Questions with Provenance and Query Patterns
 
Computational Reproducibility vs. Transparency: Is It FAIR Enough?
Computational Reproducibility vs. Transparency: Is It FAIR Enough?Computational Reproducibility vs. Transparency: Is It FAIR Enough?
Computational Reproducibility vs. Transparency: Is It FAIR Enough?
 
Which Model Does Not Belong: A Dialogue
Which Model Does Not Belong: A DialogueWhich Model Does Not Belong: A Dialogue
Which Model Does Not Belong: A Dialogue
 
From Workflows to Transparent Research Objects and Reproducible Science Tales
From Workflows to Transparent Research Objects and Reproducible Science TalesFrom Workflows to Transparent Research Objects and Reproducible Science Tales
From Workflows to Transparent Research Objects and Reproducible Science Tales
 
From Research Objects to Reproducible Science Tales
From Research Objects to Reproducible Science TalesFrom Research Objects to Reproducible Science Tales
From Research Objects to Reproducible Science Tales
 
Possible Worlds Explorer: Datalog & Answer Set Programming for the Rest of Us
Possible Worlds Explorer: Datalog & Answer Set Programming for the Rest of UsPossible Worlds Explorer: Datalog & Answer Set Programming for the Rest of Us
Possible Worlds Explorer: Datalog & Answer Set Programming for the Rest of Us
 
Deduktive Datenbanken & Logische Programme: Eine kleine Zeitreise
Deduktive Datenbanken & Logische Programme: Eine kleine ZeitreiseDeduktive Datenbanken & Logische Programme: Eine kleine Zeitreise
Deduktive Datenbanken & Logische Programme: Eine kleine Zeitreise
 
[Flashback 2005] Managing Scientific Data: From Data Integration to Scientifi...
[Flashback 2005] Managing Scientific Data: From Data Integration to Scientifi...[Flashback 2005] Managing Scientific Data: From Data Integration to Scientifi...
[Flashback 2005] Managing Scientific Data: From Data Integration to Scientifi...
 
Dissecting Reproducibility: A case study with ecological niche models in th...
Dissecting Reproducibility:  A case study with ecological niche models  in th...Dissecting Reproducibility:  A case study with ecological niche models  in th...
Dissecting Reproducibility: A case study with ecological niche models in th...
 
Incremental Recomputation: Those who cannot remember the past are condemned ...
Incremental Recomputation:  Those who cannot remember the past are condemned ...Incremental Recomputation:  Those who cannot remember the past are condemned ...
Incremental Recomputation: Those who cannot remember the past are condemned ...
 
Validation and Inference of Schema-Level Workflow Data-Dependency Annotations
Validation and Inference of Schema-Level Workflow Data-Dependency AnnotationsValidation and Inference of Schema-Level Workflow Data-Dependency Annotations
Validation and Inference of Schema-Level Workflow Data-Dependency Annotations
 
An ontology-driven framework for data transformation in scientific workflows
An ontology-driven framework for data transformation in scientific workflowsAn ontology-driven framework for data transformation in scientific workflows
An ontology-driven framework for data transformation in scientific workflows
 
Knowledge Representation & Reasoning and the Hierarchy-of-Hypotheses Approach
Knowledge Representation & Reasoning and the Hierarchy-of-Hypotheses ApproachKnowledge Representation & Reasoning and the Hierarchy-of-Hypotheses Approach
Knowledge Representation & Reasoning and the Hierarchy-of-Hypotheses Approach
 
Whole-Tale: The Experience of Research
Whole-Tale: The Experience of ResearchWhole-Tale: The Experience of Research
Whole-Tale: The Experience of Research
 
ETC & Authors in the Driver's Seat
ETC & Authors in the Driver's SeatETC & Authors in the Driver's Seat
ETC & Authors in the Driver's Seat
 
From Provenance Standards and Tools to Queries and Actionable Provenance
From Provenance Standards and Tools to Queries and Actionable ProvenanceFrom Provenance Standards and Tools to Queries and Actionable Provenance
From Provenance Standards and Tools to Queries and Actionable Provenance
 

Último

World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfsimulationsindia
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfPratikPatil591646
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etclalithasri22
 
knowledge representation in artificial intelligence
knowledge representation in artificial intelligenceknowledge representation in artificial intelligence
knowledge representation in artificial intelligencePriyadharshiniG41
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfNicoChristianSunaryo
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...boychatmate1
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...Jack Cole
 
Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformationAnnie Melnic
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfrahulyadav957181
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 

Último (20)

World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdf
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etc
 
knowledge representation in artificial intelligence
knowledge representation in artificial intelligenceknowledge representation in artificial intelligence
knowledge representation in artificial intelligence
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdf
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
 
Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformation
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdf
 
2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 

Bertram's talk on Hybrid (Black-box + White-box) Diagnosis at RuleML'14 in Prague

  • 1. A  Hybrid  Diagnosis  Approach  Combining   Black-­‐Box  and  White-­‐Box  Reasoning     Mingmin  Chen1                                  Shizhuo  Yu1           Nico  Franz2              Shawn  Bowers3              Bertram  Ludäscher1  4       1  Department  of  Computer  Science  ,  University  of  California,  Davis   2  School  of  Life  Sciences,  Arizona  State  University   3  Department  of  Computer  Science,  Gonzaga  University   4  GSLIS  &  NCSA,  University  of  Illinois  at  Urbana-­‐Champaign  
  • 2. Outline   •  The  Taxonomy  Alignment  Problem   T1  +  T2  +  A  =>  T3                      (ambiguous  ..  unique  ..  inconsistent)   •  Model-­‐based  Diagnosis  [Reiter’87]   – Black-­‐box   •  Hybrid  Approach   – Black-­‐box  &  White-­‐box  combined   •  Benchmark  Results   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   2  
  • 3. Meet  Prof.  Nico  Franz:      Curator  of  Insects  @  ASU   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   3  
  • 4. What  Nico  et  al.  do  for  a  living  …     Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   4  
  • 5.    What  Nico  does  for  a  living  (cont’d):   The  Indoors  Part   •  First:  go  fun  places,  find  new  bugs,  study  them  …   –  “Bugs-­‐R-­‐Us”  (see  taxonbytes.org)   •  Now:  Compare,  align  and  revise  taxonomies,  based  on   careful  observafon,  “character”  data,  experfse  …       •  Formally:   –  Input:  T1  +  T2  (taxonomies)  +  A  (expert  ar.cula.ons)     –  Output:  revised,  “merged”  taxonomy  (-­‐ies)  T3   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   5  
  • 6. •  Given: –  Taxonomies T1 , T2 •  incl. constraints (coverage, disjointness) –  Set of articulations (alignment) A •  Find: –  Combined (“merged”) taxonomy T3 (= T1 + T2 + A) •  Is it a taxonomy? Or a DAG? –  Optional: •  Final alignment (should be minimal) 6   Taxonomy Alignment Problem (TAP) T1   T2   A   T3  
  • 7.    TAP:  Possible  Outcomes   1.a 1.b isa 1.c isa 2.d = 2.e< < 2.f< isa isa Input  Alignment   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   7  
  • 8.    TAP:  Possible  Outcomes   1.a 1.b isa 1.c isa 2.d = 2.e< < 2.f< isa isa Input  Alignment   {A1, A2, A3, A4} {A1, A2, A3} {A1, A2, A4} {A1, A3, A4} {A2, A3, A4} {A1, A2} {A1, A3} {A2, A3} {A1, A4} {A2, A4} {A3, A4} {A1} {A2} {A3} {A4} { } Inconsistent!       è  Diagnosis  (Reiter)     =  Black-­‐Box    Provenance   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   8  
  • 9.  TAP:  Possible  Outcomes   1.a 1.b isa 1.c isa 2.d = 2.e< < 2.f< isa isa Input  Alignment   {A1, A2, A3, A4} {A1, A2, A3} {A1, A2, A4} {A1, A3, A4} {A2, A3, A4} {A1, A2} {A1, A3} {A2, A3} {A1, A4} {A2, A4} {A3, A4} {A1} {A2} {A3} {A4} { } Inconsistent!       è  Diagnosis  (Reiter)     =  Black-­‐Box    Provenance   1.b2.e 1.c 1.a 2.d 2.f Ambiguous!     è  Mul5ple   Possible     Worlds   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   9  
  • 10.  TAP:  Possible  Outcomes   1.a 1.b isa 1.c isa 2.d = 2.e< < 2.f< isa isa Input  Alignment   {A1, A2, A3, A4} {A1, A2, A3} {A1, A2, A4} {A1, A3, A4} {A2, A3, A4} {A1, A2} {A1, A3} {A2, A3} {A1, A4} {A2, A4} {A3, A4} {A1} {A2} {A3} {A4} { } Inconsistent!       è  Diagnosis  (Reiter)     =  Black-­‐Box    Provenance   1.b2.e 1.c 1.a 2.d 2.f Ambiguous!     è  Mul5ple   Possible     Worlds   1.c2.f 1.b 1.a 2.d 2.e Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   10  
  • 11.  TAP:  Possible  Outcomes   1.a 1.b isa 1.c isa 2.d = 2.e< < 2.f< isa isa Input  Alignment   {A1, A2, A3, A4} {A1, A2, A3} {A1, A2, A4} {A1, A3, A4} {A2, A3, A4} {A1, A2} {A1, A3} {A2, A3} {A1, A4} {A2, A4} {A3, A4} {A1} {A2} {A3} {A4} { } Inconsistent!       è  Diagnosis  (Reiter)     =  Black-­‐Box    Provenance   1.b2.e 1.c 1.a 2.d 2.f Ambiguous!     è  Mul5ple   Possible     Worlds   1.c2.f 1.b 1.a 2.d 2.e 1.b 1.a 2.e 2.d 1.c 2.f Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   11  
  • 12. •  FO  reasoning  about   taxonomies  (MFOL)   •  Earlier:  CleanTax     –  Prover9/Mace4   •  Now:  Euler   –  ASP  Reasoners  (DLV,   Clingo)   –  Specialized  reasoners   (PyRCC)   –  …     –  X  =  ASP,  RCC,  …     Euler/X  Toolkit  and  Workflow     Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   12  
  • 13. Real-­‐world  examples:  Turn  this  …     Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   13  
  • 14. … into this! (Perellescus Alignment Result) •  T3 := T1 and T2 are “merged” –  Blue dashed: overlaps è resolve via “zoom-in view” 14   1.16 1.14 2.40 2.44 2.47 1.11 2.38 2.35 1.20 1.23 2.52 2.53 2.54 1.17 2.41 1.22 2.46 1.25 2.48 1.12 2.36 1.26 2.49 1.13 2.37 1.18 2.42 1.19 2.43 1.15 2.39 1.21 2.45 1.12L 2.36L 1.27 2.50 1.24 2.51 Nodes Taxonomy 1 5 Taxonomy 2 8 MERGED Taxa 13 Edges Overlaps 10 Input 24 INFERRED 5
  • 15.    Possible  Outcome:  Inconsistency!   1.a 1.b isa 1.c isa 2.d = 2.e< < 2.f< isa isa Input  Alignment   {A1, A2, A3, A4} {A1, A2, A3} {A1, A2, A4} {A1, A3, A4} {A2, A3, A4} {A1, A2} {A1, A3} {A2, A3} {A1, A4} {A2, A4} {A3, A4} {A1} {A2} {A3} {A4} { } Inconsistent!       è  Diagnosis  (Reiter)     =  Black-­‐Box    Provenance   •  Need  to  debug  the   input  arfculafons  è   (black-­‐box)  diagnosis!   •  Focus:   –  How  do  we  efficiently   compute  the  diagnosfc   lance?   •  Also:   –  How  to  visualize..     Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   15  
  • 16. Example  Instance     (from  synthefc  benchmark  suite)   •  Here:  N  =  10  taxa  in  T1,  T2   •  Euler/X  finds:   inconsistent!   •  è  diagnosc  la]ce  of  210   =  1024  nodes   è Find  minimal  inconsistent   subset  (MIS)     è maximal  consistent  subset   (MCS)  ..     è show  to  user!   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   16  
  • 17. Visualizing  Diagnoses:  Scalability   Issues   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   17   N  =  10  arfculafons  è  210  =  1024  node  diagnosfc  lance  (…  ouch!)            …  but  only  one  MIS  …          
  • 18. Beaer  Idea:  Just  show  MIS,  MCS   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   18   N  =  4  arfculafons  è  24  =  16  node  diagnosfc  lance,   but  3  MCS  and  2  MIS  are  enough!  
  • 19. ..  but  4  MCS  and   1  MIC  tell  it  all!   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   19   1024  node  lance   Visualizing  Diagnoses:  Focusing  on   MIS  (and  MCS)  
  • 20. Visualizing  Diagnoses   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   20   Example  from   paper:  N=12  è   4096  nodes     ..  but  7  MCS  and     5  MIC  tell  it  all!  
  • 21. Black-Box Inconsistency Analysis (Diagnostic Lattice) •  Then: –  Repair: find & revise minimal inconsistent subsets (Min-Incons) –  Expand: find maximal consistent subsets (Max-Cons) & revise outs What happens if you can’t have all (here: 4) articulations together? 21  
  • 22. •  Black-­‐box  Analysis  (Hinng  Set  algo.)  yields  a  Diagnosis  (lance)   –  for  n=4  arfculafons,  there  are  168  possible  diagnoses   –  depending  on  expected  “red/green  areas”  è  explore  space  differently   •  |arfculafons|  =  n  è        |possible  diagnoses|  =  |monotonic  Boolean  funcfons|        =  Dedekind  Number  (n):    2,  3,  6,  20,  168,  7581,  7828354,  ...       Inconsistency Analysis (Diagnostic Lattice) •  The Min-Incons (MIS) and Max-Cons (MCS) sets determine all others è  Repair MIS and/or Expand MCS 22
  • 23. Improving  Diagnosis   •  Reiter’s  “black-­‐box”  (model-­‐based)  diagnosis   helps  debug  the  arfculafons   •  Limited  scalability  (inherent  complexity)   •  But  every  bit  helps:   – Hinng  Set  Algorithm  (“logarithmic  extracfon”)   •  Our  idea:   – Exploit  “white-­‐box”  reasoning  informafon   è  RULES  to  the  rescue   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   23  
  • 24. Key  Idea:  exploit  white-­‐box  info   •  We  use  Answer  Set  Programming  (ASP)  to  solve   Taxonomy  Alignment  Problem  (TAP)   •  Inconsistency  =  “False”  is  derived  in  the  head:   False  :-­‐      <denial  of  integrity  constraint>     •  Apply  provenance  trick  from  databases  J     –  What  arfculafons  contribute  to  a  derivafon  of   “False”  ?     –  Eliminate  those  that  don’t!   è  an  example  of  reusing  inferences  across  separate   black-­‐box  tests!   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   24  
  • 25. The  Provenance  “Trick”   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   25  
  • 26. Hybrid  Provenance   A3:    c  <  f   Black-­‐box  Provenance   1.a 1.b isa 1.c isa 2.d = 2.e< < 2.f< isa isa Input  Alignment   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   26  
  • 27. Hybrid  Provenance   A3:    c  <  f   Black-­‐box  Provenance   r7: d = e ∪ f a = e ∪ f A1: a = d r3: a = b ∪ c f < c r4: b ∩ c = ∅ r8: e ∩ f = ∅ A2: b < e A1+A2  +  …  =>  f  <  c   White-­‐box     Provenance   1.a 1.b isa 1.c isa 2.d = 2.e< < 2.f< isa isa Input  Alignment   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   27  
  • 28. The  Hybrid  Approach   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   28  
  • 29. Hybrid  Approach   Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   29   What  ar5cula5ons   contribute  to  some   inconsistency?     Good  old  black-­‐box   (HST)  
  • 30. Benchmark   Results     •  White-­‐box  <  Hybrid  <  Black-­‐box                    (runfmes)   •  Note:  white-­‐box  does  not  give  you  a  diagnosis   •  Potassco  <  DLV       Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   30  
  • 31. Benchmark   DLV   •  White-­‐box  <  Hybrid  <  Black-­‐box                    (runfmes)   •  Potassco  <  DLV       Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   31  
  • 32. Benchmark   Clingo   •  White-­‐box  <  Hybrid  <  Black-­‐box                    (runfmes)   •  Potassco  <  DLV       Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   32  
  • 33. Conclusions   •  ASP  rules  can  be  used  to  efficiently  solve  real-­‐ world  taxonomy  reasoning  problems   •  Reiter’s  diagnosis  useful  to  debug  inconsistent   alignments   •  Adding  a  “white-­‐box”  provenance  approach   speeds  up  state-­‐of-­‐the-­‐art  HST  algorithm  by   eliminang  independent  arculaons   •  Future  work:   –  Further  improvements,  including  parallelism:   •  Trade-­‐off  with  sharing  inferences  across  parallel  instances     Hybrid  Diagnosis  in  Euler    @    RuleML2014,  Prague   33