33
A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning Mingmin Chen 1 Shizhuo Yu 1 Nico Franz 2 Shawn Bowers 3 Bertram Ludäscher 1 4 1 Department of Computer Science , University of California, Davis 2 School of Life Sciences, Arizona State University 3 Department of Computer Science, Gonzaga University 4 GSLIS & NCSA, University of Illinois at Urbana-Champaign

Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Embed Size (px)

DESCRIPTION

Presentation given at RuleML 2014 conference (http://ruleml2014.vse.cz/) with updates on the Euler/X toolkit; see also http://taxonbytes.org/prior-work-on-concept-taxonomy-2013/

Citation preview

Page 1: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Mingmin Chen1 Shizhuo Yu1

Nico Franz2 Shawn Bowers3 Bertram Ludäscher1 4

1 Department of Computer Science , University of California, Davis2 School of Life Sciences, Arizona State University

3 Department of Computer Science, Gonzaga University4 GSLIS & NCSA, University of Illinois at Urbana-Champaign

Page 2: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Outline

• The Taxonomy Alignment Problem

T1 + T2 + A => T3 (ambiguous .. unique .. inconsistent)

• Model-based Diagnosis [Reiter’87]

– Black-box

• Hybrid Approach

– Black-box & White-box combined

• Benchmark Results

Hybrid Diagnosis in Euler @ RuleML2014, Prague 2

Page 3: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Meet Prof. Nico Franz: Curator of Insects @ ASU

Hybrid Diagnosis in Euler @ RuleML2014, Prague 3

Page 4: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

What Nico et al. do for a living …

Hybrid Diagnosis in Euler @ RuleML2014, Prague 4

Page 5: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

What Nico does for a living (cont’d): The Indoors Part

• First: go fun places, find new bugs, study them …– “Bugs-R-Us” (see taxonbytes.org)

• Now: Compare, align and revise taxonomies, based on careful observation, “character” data, expertise …

• Formally:– Input: T1 + T2 (taxonomies) + A (expert articulations)

– Output: revised, “merged” taxonomy (-ies) T3Hybrid Diagnosis in Euler @ RuleML2014, Prague 5

Page 6: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

• Given:

– Taxonomies T1 , T2

• incl. constraints (coverage, disjointness)

– Set of articulations (alignment) A• Find:

– Combined (“merged”) taxonomy T3 (= T1 + T2 + A)• Is it a taxonomy? Or a DAG?

– Optional:• Final alignment (should be minimal)

6

Taxonomy Alignment Problem (TAP)

T1

T2

T3

A

Page 7: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

TAP: Possible OutcomesInput Alignment

Hybrid Diagnosis in Euler @ RuleML2014, Prague 7

Page 8: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

TAP: Possible OutcomesInput Alignment

Inconsistent! Diagnosis (Reiter)= Black-BoxProvenance

Hybrid Diagnosis in Euler @ RuleML2014, Prague 8

Page 9: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

TAP: Possible OutcomesInput Alignment

Inconsistent! Diagnosis (Reiter)= Black-BoxProvenance

Ambiguous! Multiple Possible Worlds

Hybrid Diagnosis in Euler @ RuleML2014, Prague 9

Page 10: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

TAP: Possible OutcomesInput Alignment

Inconsistent! Diagnosis (Reiter)= Black-BoxProvenance

Ambiguous! Multiple Possible Worlds

Hybrid Diagnosis in Euler @ RuleML2014, Prague 10

Page 11: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

TAP: Possible OutcomesInput Alignment

Inconsistent! Diagnosis (Reiter)= Black-BoxProvenance

Ambiguous! Multiple Possible Worlds

Hybrid Diagnosis in Euler @ RuleML2014, Prague 11

Page 12: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

• FO reasoning about taxonomies (MFOL)

• Earlier: CleanTax– Prover9/Mace4

• Now: Euler– ASP Reasoners (DLV,

Clingo)

– Specialized reasoners(PyRCC)

– …

– X = ASP, RCC, …

Euler/X Toolkit and Workflow

Hybrid Diagnosis in Euler @ RuleML2014, Prague 12

Page 13: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Real-world examples: Turn this …

Hybrid Diagnosis in Euler @ RuleML2014, Prague 13

Page 14: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

… into this! (Perellescus Alignment Result)

• T3 := T1 and T2 are “merged”

– Blue dashed: overlaps resolve via “zoom-in view”

14

Page 15: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Common Outcome: Inconsistency!Input Alignment

Inconsistent! Diagnosis (Reiter)= Black-BoxProvenance

• Need to debug the input articulations (black-box) diagnosis!

• Focus:– How do we efficiently

compute the diagnostic lattice?

• Also:– How to visualize..

Hybrid Diagnosis in Euler @ RuleML2014, Prague 15

Page 16: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Example Instance (from synthetic benchmark suite)

• Here: N = 10 taxa in T1, T2

• Euler/X finds: inconsistent!

• diagnostic lattice of 210

= 1024 nodes Find minimal inconsistent

subset (MIS)

maximal consistent subset (MCS) ..

show to user!

Hybrid Diagnosis in Euler @ RuleML2014, Prague 16

Page 17: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Visualizing Diagnoses

Hybrid Diagnosis in Euler @ RuleML2014, Prague 17

N = 10 articulations 210 = 1024 node diagnostic lattice

Page 18: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Better Idea: Just show MIS, MCS

Hybrid Diagnosis in Euler @ RuleML2014, Prague 18

N = 4 articulations 24 = 16 node diagnostic lattice,but 3 MCS and 2 MIS are enough!

Page 19: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Visualizing Diagnoses

.. but 4 MCS and 1 MIC tell it all!

Hybrid Diagnosis in Euler @ RuleML2014, Prague 19

1024 node lattice

Page 20: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Visualizing Diagnoses

Hybrid Diagnosis in Euler @ RuleML2014, Prague 20

Example from paper: N=12 4096 nodes

.. but 7 MCS and

5 MIC tell it all!

Page 21: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Black-Box Inconsistency Analysis

(Diagnostic Lattice)

• Then:

– Repair: find & revise minimal inconsistent subsets (Min-Incons)

– Expand: find maximal consistent subsets (Max-Cons) & revise outs

What happens if you

can’t have all (here: 4)

articulations together?

21

Page 22: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

• Black-box Analysis (Hitting Set algo.) yields a Diagnosis (lattice)

– for n=4 articulations, there are 168 possible diagnoses

– depending on expected “red/green areas” explore space differently

• |articulations| = n

|possible diagnoses| = |monotonic Boolean functions|

= Dedekind Number (n): 2, 3, 6, 20, 168, 7581, 7828354, ...

Inconsistency Analysis (Diagnostic Lattice)

• The Min-Incons (MIS) and

Max-Cons (MCS) sets

determine all others

Repair MIS and/or Expand

MCS

22

Page 23: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Improving Diagnosis

• Reiter’s “black-box” (model-based) diagnosis helps debug the articulations

• Limited scalability (inherent complexity)

• But every bit helps:

– Hitting Set Algorithm (“logarithmic extraction”)

• Our idea:

– Exploit “white-box” reasoning information

RULES to the rescue

Hybrid Diagnosis in Euler @ RuleML2014, Prague 23

Page 24: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Key Idea: exploit white-box info

• We use Answer Set Programming (ASP) to solve Taxonomy Alignment Problem (TAP)

• Inconsistency = “False” is derived in the head:False :- <denial of integrity constraint>

• Apply provenance trick from databases – What articulations contribute to a derivation of

“False” ? – Eliminate those that don’t! an example of reusing inferences across separate black-box tests!

Hybrid Diagnosis in Euler @ RuleML2014, Prague 24

Page 25: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

The Provenance “Trick”

Hybrid Diagnosis in Euler @ RuleML2014, Prague 25

Page 26: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Hybrid ProvenanceA3: c < f Black-box Provenance

Input Alignment

Hybrid Diagnosis in Euler @ RuleML2014, Prague 26

Page 27: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Hybrid ProvenanceA3: c < f Black-box Provenance

A1+A2 + … => f < c

White-box Provenance

Input Alignment

Hybrid Diagnosis in Euler @ RuleML2014, Prague 27

Page 28: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

The Hybrid Approach

Hybrid Diagnosis in Euler @ RuleML2014, Prague 28

Page 29: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Hybrid Approach

Hybrid Diagnosis in Euler @ RuleML2014, Prague 29

What articulations contribute to some inconsistency?

Good old black-box(HST)

Page 30: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Benchmark Results

• White-box < Hybrid < Black-box (runtimes)

• Note: white-box does not give you a diagnosis

• Potassco < DLV

Hybrid Diagnosis in Euler @ RuleML2014, Prague 30

Page 31: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Benchmark DLV

• White-box < Hybrid < Black-box (runtimes)

• Potassco < DLV

Hybrid Diagnosis in Euler @ RuleML2014, Prague 31

Page 32: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Benchmark Clingo

• White-box < Hybrid < Black-box (runtimes)

• Potassco < DLV

Hybrid Diagnosis in Euler @ RuleML2014, Prague 32

Page 33: Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

Conclusions

• ASP rules can be used to efficiently solve real-world taxonomy reasoning problems

• Reiter’s diagnosis useful to debug inconsistent alignments

• Adding a “white-box” provenance approach speeds up state-of-the-art HST algorithm by eliminating independent articulations

• Future work:– Further improvements, including parallelism:

• Trade-off with sharing inferences across parallel instances

Hybrid Diagnosis in Euler @ RuleML2014, Prague 33