SlideShare uma empresa Scribd logo
1 de 61
Baixar para ler offline
INTERACTIVE EXPLORATION
OF EVENT SEQUENCES
IN TEMPORAL CATEGORICAL DATA

KRIST WONGSUPHASAWAT
DEPARTMENT OF COMPUTER SCIENCE &
HUMAN-COMPUTER INTERACTION LAB
UNIVERSITY OF MARYLAND                                   al!
                                                      pos
                                          at  ion Pro
                                   Dissert 2010!
                                            ,
                                   April 30
INTERACTIVE EXPLORATION
OF EVENT SEQUENCES
IN TEMPORAL CATEGORICAL DATA

KRIST WONGSUPHASAWAT
DEPARTMENT OF COMPUTER SCIENCE &
HUMAN-COMPUTER INTERACTION LAB
UNIVERSITY OF MARYLAND                                   al!
                                                      pos
                                          at  ion Pro
                                   Dissert 2010!
                                            ,
                                   April 30
TEMPORAL CATEGORICAL DATA
•  A type of time series                Category!
 Numerical                    Categorical
                                                          Event!
       Stock: Microsoft           Patient ID: 45851737
 04/26/2010&10:00   &31.03&   12/02/2008&14:26   &Admit&
 04/26/2010&10:15   &31.01&   12/02/2008&14:26   &Emergency&
 04/26/2010&10:30   &31.02&   12/02/2008&22:44   &ICU&
 04/26/2010&10:45   &31.08&   12/05/2008&05:07   &Floor&
 04/26/2010&11:00   &31.16&   12/08/2008&10:02   &Floor&
 04/26/2010&11:15   &31.15&   12/14/2008&06:19   &Exit&
                              &           Time
                                   Admit
                                   Emergency
                                        ICU
                                                  Floor
                                                           Exit
TEMPORAL CATEGORICAL DATA




•    Electronic Health Records: symptoms, treatment, lab test
•    Traffic incident logs: arrival/departure time of each unit
•    Student records: course, paper, proposal, defense, etc.
•    Usability study logs
•    Etc.
working with physicians at
WASHINGTON HOSPITAL CENTER
INTERACTIVE EXPLORATION
OF EVENT SEQUENCES
IN TEMPORAL CATEGORICAL DATA

KRIST WONGSUPHASAWAT
DEPARTMENT OF COMPUTER SCIENCE &
HUMAN-COMPUTER INTERACTION LAB
UNIVERSITY OF MARYLAND                                   al!
                                                      pos
                                          at  ion Pro
                                   Dissert 2010!
                                            ,
                                   April 30
EVENT SEQUENCES
•  Examples: “Bounce backs”

              ICU      Floor      ICU



                         within 2 days
RESEARCH STATEMENT
           “my research aims to design
             effective visualization
           and interaction techniques
          to support users in exploring
 event sequences in temporal categorical data.”

       •  a flexible temporal search approach
•  a novel visualization that provides an overview
                  of multiple records
RESEARCH
              QUESTION#1
             PRELIM. + PROPOSED
MOTIVATION         WORK             CONCLUSION




        RESEARCH           RESEARCH
        QUESTIONS         QUESTION#2
                         PRELIM. + PROPOSED
                               WORK
TRADITIONAL SEARCH MODEL

Know exactly what they want



                         Query                          Data     *
   How to specify the query?                            How to display the data?
   SQL, TSQL, GUI, ...

                         How to process the query?
                         How to make the search efficient?
                         Indexing methods, algorithms
EXPLORATORY SEARCH MODEL
Uncertain about what they
are looking for
Know exactly what they want
                                  Insight, Hypotheses   *
                         search
               Query                 Data
                         refine,
                         formulate
UNCERTAIN
                     Floor        ICU      Exit-Alive



                     QUERY            within 2 days

Find something
useful and display


                     RESULTS
                                                        Frustrated!
                     found 0 record
EXPLORATORY SEARCH MODEL
Uncertain about what they
are looking for
Know exactly what they want
                                  Insight, Hypotheses             *
                         search
               Query                  Data
                         refine,
                         formulate



                                       How to display the data?
                                  Difficult to understand the
                                  event sequences in the data
                                  quickly
NEEDS A STARTING POINT
 Show overview
 or summary




Where should I start?
RESEARCH QUESTIONS
1.  How to support the users when they are
    uncertain about what they are looking for?
2.  How to provide an overview of event
    sequences for temporal categorical data?
RESEARCH QUESTIONS
1.  How to support the users when they are
    uncertain about what they are looking for?
2.  How to provide an overview of event
    sequences for temporal categorical data?
APPROACHES
Exact Search!       Similarity-based Search!
MUST have A, B, C      SHOULD have A, B, C


      Query                  Query


    Record#1                Record#2

                                             more
    Record#2                Record#1
                                             similar

    Record#3                Record#3
SIMILARITY-BASED SEARCH


                     How to specify query?
What is “similar”?
                     How to display results?




                         User Interface!
Similarity Measure
                         & Visualization
SIMILARITY MEASURE
Gives a score that shows how much a record is similar to the query.
                       Min = 0 / Max = 1



             Query                     Record#1      0.80

                                       Record#2      0.63

                                       Record#3      0.96

                                       Record#4      0.77
SIMILARITY MEASURE (2)



      Query             Record#3   0.96

                        Record#1   0.80
                more
              similar
                        Record#4   0.77

                        Record#2   0.63
CHALLENGE
•  What is “similar”?       depends&on&users/tasks&
    Record#1
    (Query)       A                           B   C

    Record#2                    missing
                  A                           B   C

    Record#3                                              extra
                  A                           B   C   D

    Record#4
                            Time difference
                  A     B                         C
               time
MATCH & MISMATCH MEASURE
                time


  Record#1
  (Query)              A             C    B              C



  Record#2
                       A         B        C                   B


            Matched Events           Missing Event      Extra Event




                                          }
      Time Difference
      Number of Swapping                      Total Score
      Number of Missing Events                       0 to 1
      Number of Extra Events
RELATED WORK
•  Similarity measure
  –  For numerical time series
     •  [Ding et al. 2008], [Liu et al. 2006], [Berndt and Clifford 1994], ...

  –  Edit distance
     •  [Levenshtein 1966], [Winkler 1999], [Chen 2004], ...

  –  Biological sequence searching
     •  [Pearson and Lipman 1988], [Altschul et al. 1990], ...

  –  Approximate graph matching
     •  [Tian et al. 2007], [Tian et al. 2008] , ...

  –  Temporal Categorical Data
     •  [Sherkat 2006], ...
PROTOTYPE: SIMILAN
DEMO - DATA
•  Patient transfers in the emergency department
•  Real data from hospital (Jan – Mar 2010)

      ADMIT           Admission time
      EMERGENCY       Emergency room
      ICU             Intensive Care Unit
      INTERMEDIATE    Intermediate Medical Care
      FLOOR           Normal room
      EXIT-ALIVE      Leave the hospital alive
      EXIT-DEAD       Leave the hospital dead
DEMO - SIMILAN
This is when bugs are more likely to occur.!
CONTROLLED EXPERIMENT
•  18 participants
•  2 interfaces by 5 tasks
  –  Similarity-based Search (Similan)
  –  Exact Search (LifeLines2)




•  Time, error, preference
X = Exact Search , S = Similarity-based Search
RESULTS
         Sequence         Counting              Time const.           Uncertainty

        Sequence          CountSpeed (seconds)
                                            Time                      Uncertain
60                  15                  150                   150

40                  10                  100                   100

20                  5                     50                  50

    0               0                       0                     0
          X     S          X       S
                                   R                X    S                 X    S
                         Preference (7-points likert scale)
8                   8                   8                     8
6                   6                   6                     6
4                   4                   4                     4
2                   2                   2                     2
0                   0                   0                     0
         X      S         X        S            X       S
                                                        R              X       S
FLEXIBLE TEMPORAL SEARCH
(FTS)
PROPOSED WORK: FTS
•  Draw an example to specify query
•  Specify flexible and inflexible constraints

          MUST HAVE              RANGE      ?


                 MUST NOT HAVE           optional



•  Algorithms
PROPOSED WORK: FTS (2)
      •  Combine exact and similarity-based search



                                              Primary Bin
similar




                                           Pass all constraints




                                             Secondary Bin
similar




                                                The rest
RESEARCH QUESTIONS
1.  How to support the users when they are
    uncertain about what they are looking for?
2.  How to provide an overview of event
    sequences for temporal categorical data?
WHEN YOU READ A PAPER…
•  What is it about?




         INFORMATION VISUALIZATION MANTRA
         OVERVIEW FIRST,
         ZOOM AND FILTER,
         THEN DETAILS ON DEMAND
RELATED WORK
•  Single-record visualizations

      Patient ID: 45851737
   12/02/2008&14:26   &Admit&
   12/02/2008&14:26   &Emergency&   Single-record"
   12/02/2008&22:44   &ICU&         Visualization
   12/05/2008&05:07   &Floor&
   12/08/2008&10:02   &Floor&
   12/14/2008&06:19   &Exit&
   &

•  E.g. LifeLines, MIDGAARD, etc.
RELATED WORK (2)
•  Multiple instances of single-record visualizations

    Patient ID: 45851737
     Patient ID: 45851737
      Patient ID: 45851737
 12/02/2008&14:26 &Admit&
  12/02/2008&14:26 45851737
       Patient ID: &Admit&
 12/02/2008&14:26 &Emergency&
    12/02/2008&14:26 &Admit&
 12/02/2008&22:44 &Emergency&
   12/02/2008&14:26 &ICU&
      12/02/2008&14:26 &Admit&
    12/02/2008&14:26 &Emergency&
   12/02/2008&22:44 &ICU&
      12/02/2008&14:26 &Emergency&                Visualization
 12/05/2008&05:07 &Floor&
    12/02/2008&22:44 &ICU&                         Visualization
                                                     Single-record"
                                                     Visualization
   12/05/2008&05:07 &Floor&
      12/02/2008&22:44 &ICU&
 12/08/2008&10:02 &Floor&
    12/05/2008&05:07 &Floor&                         Visualization
   12/08/2008&10:02 &Floor&
      12/05/2008&05:07 &Floor&
 12/14/2008&06:19 &Exit&
    12/08/2008&10:02 &Floor&
 & 12/14/2008&06:19 &Exit&
      12/08/2008&10:02 &Floor&
    12/14/2008&06:19 &Exit&
   & 12/14/2008&06:19 &Exit&
    &
      &
•  E.g. LifeLines2, Continuum, ActiviTree, etc.
More space please....
RELATED WORK (3)
•  Multiple-record visualizations

    Patient ID: 45851737
     Patient ID: 45851737
      Patient ID: 45851737
 12/02/2008&14:26 &Admit&
  12/02/2008&14:26 45851737
       Patient ID: &Admit&
 12/02/2008&14:26 &Emergency&
    12/02/2008&14:26 &Admit&
 12/02/2008&22:44 &Emergency&
   12/02/2008&14:26 &ICU&
      12/02/2008&14:26 &Admit&
    12/02/2008&14:26 &Emergency&     Multiple-record"
   12/02/2008&22:44 &ICU&
      12/02/2008&14:26 &Emergency&
 12/05/2008&05:07 &Floor&
    12/02/2008&22:44 &ICU&            Visualization
   12/05/2008&05:07 &Floor&
      12/02/2008&22:44 &ICU&
 12/08/2008&10:02 &Floor&
    12/05/2008&05:07 &Floor&
   12/08/2008&10:02 &Floor&
      12/05/2008&05:07 &Floor&
 12/14/2008&06:19 &Exit&
    12/08/2008&10:02 &Floor&
 & 12/14/2008&06:19 &Exit&
      12/08/2008&10:02 &Floor&
    12/14/2008&06:19 &Exit&
   & 12/14/2008&06:19 &Exit&
    &
      &
•  E.g. LifeLines2, Continuum
LifeLines2’s Temporal Summary [Wang et al. 2009]




           Continuum’s Histogram [Andre 2007]



 Not so useful with many event types
Does not help exploring event sequences
OVERVIEW OF EVENT SEQUENCES
•  Scalability vs. Loss of information

    Patient ID: 45851737
     Patient ID: 45851737
      Patient ID: 45851737
 12/02/2008&14:26 &Admit&
  12/02/2008&14:26 &Admit&
 12/02/2008&14:26 &Emergency&


                                         ?
    12/02/2008&14:26 &Admit&
 12/02/2008&22:44 &Emergency&
  12/02/2008&14:26 &ICU&
    12/02/2008&14:26 &Emergency&
  12/02/2008&22:44 &ICU&
 12/05/2008&05:07 &Floor&
    12/02/2008&22:44 &ICU&
 12/08/2008&10:02 &Floor&
  12/05/2008&05:07 &Floor&
    12/05/2008&05:07 &Floor&
  12/08/2008&10:02 &Floor&
 12/14/2008&06:19 &Exit&
    12/08/2008&10:02 &Floor&
  12/14/2008&06:19 &Exit&
 & 12/14/2008&06:19 &Exit&
  &
    &

•  Goal: Show common trends, outliers
LIFEFLOW
         AGGREGATE

Merge multiple records into tree




          VISUALIZE
        Display the tree
AGGREGATE
•  Aggregate by prefix

     #1
     #2
     #3
     #4




               Example with 4 records
AGGREGATE
•  Aggregate by prefix

     #1
     #2
     #3
     #4
VISUALIZE
•  Inspired by the Icicle tree




        Number of files!
VISUALIZE (2)
•  Modify: Use horizontal axis to represent time
•  Video
DEMO - LIFEFLOW
When the lines are combined into flow!
LIFEFLOW
•  Overview
  –  See trends
  –  Spot outliers
•  Query
•  Compare data
COMPARISON                     ICU
    ICU         Intermediate               Intermediate




          Jan-Mar 2008               April-June 2008
Floor
PROPOSED WORK: LIFEFLOW
•  Designing interaction
PROPOSED WORK: LIFEFLOW
•  Primary
  –  Including non-temporal attributes
     •  Age, Gender, etc.
  –  Link with Flexible Temporal Search
•  Optional
  –  Rank-by-feature
  –  Complex aggregation
  –  Multiple alignments
PROPOSED WORK: LIFEFLOW
•  Including non-temporal attributes
  –  [Pinto et al. 2001]

           Male

           Male

           Female


                                       Male

                                  Male

                                   Female
PROPOSED WORK: LIFEFLOW
•  Complex aggregation


    #7
    #8


                    Not just merge by prefix

    #9

    #10


           Do not merge if the time gap is too different
PROPOSED WORK: LIFEFLOW
•  Multiple alignments
            ICU     Floor   ICU
EVALUATION PLAN
•  Flexible Temporal Search
   –  Controlled Experiment
      •  18-20 participants / 60-90 mins
•  LifeFlow
   –  Usability Study
      •  15-20 participants / 60-90 mins
   –  MILCs
      •  3-5 domain experts / 5-12 weeks
CONCLUSION
Summary and expected contributions!
EXPLORATORY SEARCH MODEL
Uncertain about what they
are looking for
Know exactly what they want
                                  Insight, Hypotheses             *
                         search
               Query                  Data
                         refine,
                         formulate



                                       How to display the data?
                                  Difficult to understand the
                                  event sequences in the data
                                  quickly
RESEARCH QUESTIONS

1.  How to support the users when they are
    uncertain about what they are looking for?
   Flexible Temporal Search

2.  How to provide an overview of event
    sequences for temporal categorical data?
   LifeFlow: an overview visualization
EXPECTED CONTRIBUTIONS
1.  Design of visual representations, user
    interfaces and interaction techniques
2.  Algorithms for flexible temporal search
3.  Evaluation results
4.  Open new directions for exploring
    temporal categorical data
ACKNOWLEDGEMENT
  DR. PHUONG HO, DR. MARK SMITH, DAVID ROSEMAN
           WASHINGTON HOSPITAL CENTER
             http://www.whcenter.org

       NATIONAL INSTITUTES OF HEALTH (NIH)
                http://www.nih.gov

        MICHAEL PACK, MICHAEL VANDANIKER
CENTER FOR ADVANCED TRANPORTATION TECHNOLOGY LAB
                   (CATT LAB)
            http://www.cattlab.umd.edu
Q&A
Questions?!
THANK YOU
Thank you!

Mais conteúdo relacionado

Semelhante a Krist Wongsuphasawat's Dissertation Proposal Slides: Interactive Exploration of Event Sequences in Temporal Categorical Data

Lecture-1-Introduction-to-Data-Mining.pdf
Lecture-1-Introduction-to-Data-Mining.pdfLecture-1-Introduction-to-Data-Mining.pdf
Lecture-1-Introduction-to-Data-Mining.pdfJojo314349
 
KDD, Data Mining, Data Science_I.pptx
KDD, Data Mining, Data Science_I.pptxKDD, Data Mining, Data Science_I.pptx
KDD, Data Mining, Data Science_I.pptxYogeshGairola2
 
The Semantic Web - This time... its Personal
The Semantic Web - This time... its PersonalThe Semantic Web - This time... its Personal
The Semantic Web - This time... its PersonalMark Wilkinson
 
Data Science: Origins, Methods, Challenges and the future?
Data Science: Origins, Methods, Challenges and the future?Data Science: Origins, Methods, Challenges and the future?
Data Science: Origins, Methods, Challenges and the future?Cagatay Turkay
 
Data Science Workshop - day 1
Data Science Workshop - day 1Data Science Workshop - day 1
Data Science Workshop - day 1Aseel Addawood
 
Learning from Time-to-Event Data from Online Learning Contexts
Learning from Time-to-Event Data from Online Learning Contexts Learning from Time-to-Event Data from Online Learning Contexts
Learning from Time-to-Event Data from Online Learning Contexts Shalin Hai-Jew
 
Csci 6530 2016 fall presentation
Csci 6530 2016 fall presentationCsci 6530 2016 fall presentation
Csci 6530 2016 fall presentationciakov
 
Humanizing bioinformatics
Humanizing bioinformaticsHumanizing bioinformatics
Humanizing bioinformaticsJan Aerts
 
Share and analyze geonomic data at scale by Andy Petrella and Xavier Tordoir
Share and analyze geonomic data at scale by Andy Petrella and Xavier TordoirShare and analyze geonomic data at scale by Andy Petrella and Xavier Tordoir
Share and analyze geonomic data at scale by Andy Petrella and Xavier TordoirSpark Summit
 
CLIR Fellows - Science Data - 14_0730
CLIR Fellows - Science Data - 14_0730CLIR Fellows - Science Data - 14_0730
CLIR Fellows - Science Data - 14_0730jeffreylancaster
 
Spark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scaleSpark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scaleAndy Petrella
 
Brainwave Feature Extraction, Classification & Prediction
Brainwave Feature Extraction, Classification & PredictionBrainwave Feature Extraction, Classification & Prediction
Brainwave Feature Extraction, Classification & PredictionOlivia Moran
 
Secondary Data presentation.pptx
Secondary Data presentation.pptxSecondary Data presentation.pptx
Secondary Data presentation.pptxmukundsha
 

Semelhante a Krist Wongsuphasawat's Dissertation Proposal Slides: Interactive Exploration of Event Sequences in Temporal Categorical Data (20)

The Business Research Method
The Business Research MethodThe Business Research Method
The Business Research Method
 
Lecture-1-Introduction-to-Data-Mining.pdf
Lecture-1-Introduction-to-Data-Mining.pdfLecture-1-Introduction-to-Data-Mining.pdf
Lecture-1-Introduction-to-Data-Mining.pdf
 
KDD, Data Mining, Data Science_I.pptx
KDD, Data Mining, Data Science_I.pptxKDD, Data Mining, Data Science_I.pptx
KDD, Data Mining, Data Science_I.pptx
 
Research and collection of data
Research and collection of dataResearch and collection of data
Research and collection of data
 
The Semantic Web - This time... its Personal
The Semantic Web - This time... its PersonalThe Semantic Web - This time... its Personal
The Semantic Web - This time... its Personal
 
Data Mining
Data MiningData Mining
Data Mining
 
Data Science: Origins, Methods, Challenges and the future?
Data Science: Origins, Methods, Challenges and the future?Data Science: Origins, Methods, Challenges and the future?
Data Science: Origins, Methods, Challenges and the future?
 
Data Science Workshop - day 1
Data Science Workshop - day 1Data Science Workshop - day 1
Data Science Workshop - day 1
 
Research
ResearchResearch
Research
 
Learning from Time-to-Event Data from Online Learning Contexts
Learning from Time-to-Event Data from Online Learning Contexts Learning from Time-to-Event Data from Online Learning Contexts
Learning from Time-to-Event Data from Online Learning Contexts
 
Csci 6530 2016 fall presentation
Csci 6530 2016 fall presentationCsci 6530 2016 fall presentation
Csci 6530 2016 fall presentation
 
Humanizing bioinformatics
Humanizing bioinformaticsHumanizing bioinformatics
Humanizing bioinformatics
 
Reproducibility for IR evaluation
Reproducibility for IR evaluationReproducibility for IR evaluation
Reproducibility for IR evaluation
 
Reproducibility for IR evaluation
Reproducibility for IR evaluationReproducibility for IR evaluation
Reproducibility for IR evaluation
 
Baselining Logs
Baselining LogsBaselining Logs
Baselining Logs
 
Share and analyze geonomic data at scale by Andy Petrella and Xavier Tordoir
Share and analyze geonomic data at scale by Andy Petrella and Xavier TordoirShare and analyze geonomic data at scale by Andy Petrella and Xavier Tordoir
Share and analyze geonomic data at scale by Andy Petrella and Xavier Tordoir
 
CLIR Fellows - Science Data - 14_0730
CLIR Fellows - Science Data - 14_0730CLIR Fellows - Science Data - 14_0730
CLIR Fellows - Science Data - 14_0730
 
Spark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scaleSpark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scale
 
Brainwave Feature Extraction, Classification & Prediction
Brainwave Feature Extraction, Classification & PredictionBrainwave Feature Extraction, Classification & Prediction
Brainwave Feature Extraction, Classification & Prediction
 
Secondary Data presentation.pptx
Secondary Data presentation.pptxSecondary Data presentation.pptx
Secondary Data presentation.pptx
 

Mais de Krist Wongsuphasawat

What I tell myself before visualizing
What I tell myself before visualizingWhat I tell myself before visualizing
What I tell myself before visualizingKrist Wongsuphasawat
 
Navigating the Wide World of Data Visualization Libraries
Navigating the Wide World of Data Visualization LibrariesNavigating the Wide World of Data Visualization Libraries
Navigating the Wide World of Data Visualization LibrariesKrist Wongsuphasawat
 
Encodable: Configurable Grammar for Visualization Components
Encodable: Configurable Grammar for Visualization ComponentsEncodable: Configurable Grammar for Visualization Components
Encodable: Configurable Grammar for Visualization ComponentsKrist Wongsuphasawat
 
6 things to expect when you are visualizing (2020 Edition)
6 things to expect when you are visualizing (2020 Edition)6 things to expect when you are visualizing (2020 Edition)
6 things to expect when you are visualizing (2020 Edition)Krist Wongsuphasawat
 
Increasing the Impact of Visualization Research
Increasing the Impact of Visualization ResearchIncreasing the Impact of Visualization Research
Increasing the Impact of Visualization ResearchKrist Wongsuphasawat
 
6 things to expect when you are visualizing
6 things to expect when you are visualizing6 things to expect when you are visualizing
6 things to expect when you are visualizingKrist Wongsuphasawat
 
What to expect when you are visualizing (v.2)
What to expect when you are visualizing (v.2)What to expect when you are visualizing (v.2)
What to expect when you are visualizing (v.2)Krist Wongsuphasawat
 
ร้อยเรื่องราวจากข้อมูล / Storytelling with Data
ร้อยเรื่องราวจากข้อมูล / Storytelling with Dataร้อยเรื่องราวจากข้อมูล / Storytelling with Data
ร้อยเรื่องราวจากข้อมูล / Storytelling with DataKrist Wongsuphasawat
 
Reveal the talking points of every episode of Game of Thrones from fans' conv...
Reveal the talking points of every episode of Game of Thrones from fans' conv...Reveal the talking points of every episode of Game of Thrones from fans' conv...
Reveal the talking points of every episode of Game of Thrones from fans' conv...Krist Wongsuphasawat
 
What to expect when you are visualizing
What to expect when you are visualizingWhat to expect when you are visualizing
What to expect when you are visualizingKrist Wongsuphasawat
 
Adventure in Data: A tour of visualization projects at Twitter
Adventure in Data: A tour of visualization projects at TwitterAdventure in Data: A tour of visualization projects at Twitter
Adventure in Data: A tour of visualization projects at TwitterKrist Wongsuphasawat
 
Data Visualization: A Quick Tour for Data Science Enthusiasts
Data Visualization: A Quick Tour for Data Science EnthusiastsData Visualization: A Quick Tour for Data Science Enthusiasts
Data Visualization: A Quick Tour for Data Science EnthusiastsKrist Wongsuphasawat
 
Using Visualizations to Monitor Changes and Harvest Insights from a Global-sc...
Using Visualizations to Monitor Changes and Harvest Insights from a Global-sc...Using Visualizations to Monitor Changes and Harvest Insights from a Global-sc...
Using Visualizations to Monitor Changes and Harvest Insights from a Global-sc...Krist Wongsuphasawat
 
Making Sense of Millions of Thoughts: Finding Patterns in the Tweets
Making Sense of Millions of Thoughts: Finding Patterns in the TweetsMaking Sense of Millions of Thoughts: Finding Patterns in the Tweets
Making Sense of Millions of Thoughts: Finding Patterns in the TweetsKrist Wongsuphasawat
 
From Data to Visualization, what happens in between?
From Data to Visualization, what happens in between?From Data to Visualization, what happens in between?
From Data to Visualization, what happens in between?Krist Wongsuphasawat
 
A Narrative Display for Sports Tournament Recap
A Narrative Display for Sports Tournament RecapA Narrative Display for Sports Tournament Recap
A Narrative Display for Sports Tournament RecapKrist Wongsuphasawat
 
Visualization for Event Sequences Exploration
Visualization for Event Sequences ExplorationVisualization for Event Sequences Exploration
Visualization for Event Sequences ExplorationKrist Wongsuphasawat
 

Mais de Krist Wongsuphasawat (20)

What I tell myself before visualizing
What I tell myself before visualizingWhat I tell myself before visualizing
What I tell myself before visualizing
 
Navigating the Wide World of Data Visualization Libraries
Navigating the Wide World of Data Visualization LibrariesNavigating the Wide World of Data Visualization Libraries
Navigating the Wide World of Data Visualization Libraries
 
Encodable: Configurable Grammar for Visualization Components
Encodable: Configurable Grammar for Visualization ComponentsEncodable: Configurable Grammar for Visualization Components
Encodable: Configurable Grammar for Visualization Components
 
6 things to expect when you are visualizing (2020 Edition)
6 things to expect when you are visualizing (2020 Edition)6 things to expect when you are visualizing (2020 Edition)
6 things to expect when you are visualizing (2020 Edition)
 
Increasing the Impact of Visualization Research
Increasing the Impact of Visualization ResearchIncreasing the Impact of Visualization Research
Increasing the Impact of Visualization Research
 
6 things to expect when you are visualizing
6 things to expect when you are visualizing6 things to expect when you are visualizing
6 things to expect when you are visualizing
 
What to expect when you are visualizing (v.2)
What to expect when you are visualizing (v.2)What to expect when you are visualizing (v.2)
What to expect when you are visualizing (v.2)
 
ร้อยเรื่องราวจากข้อมูล / Storytelling with Data
ร้อยเรื่องราวจากข้อมูล / Storytelling with Dataร้อยเรื่องราวจากข้อมูล / Storytelling with Data
ร้อยเรื่องราวจากข้อมูล / Storytelling with Data
 
Reveal the talking points of every episode of Game of Thrones from fans' conv...
Reveal the talking points of every episode of Game of Thrones from fans' conv...Reveal the talking points of every episode of Game of Thrones from fans' conv...
Reveal the talking points of every episode of Game of Thrones from fans' conv...
 
What to expect when you are visualizing
What to expect when you are visualizingWhat to expect when you are visualizing
What to expect when you are visualizing
 
Adventure in Data: A tour of visualization projects at Twitter
Adventure in Data: A tour of visualization projects at TwitterAdventure in Data: A tour of visualization projects at Twitter
Adventure in Data: A tour of visualization projects at Twitter
 
Logs & Visualizations at Twitter
Logs & Visualizations at TwitterLogs & Visualizations at Twitter
Logs & Visualizations at Twitter
 
d3Kit
d3Kitd3Kit
d3Kit
 
Data Visualization: A Quick Tour for Data Science Enthusiasts
Data Visualization: A Quick Tour for Data Science EnthusiastsData Visualization: A Quick Tour for Data Science Enthusiasts
Data Visualization: A Quick Tour for Data Science Enthusiasts
 
Using Visualizations to Monitor Changes and Harvest Insights from a Global-sc...
Using Visualizations to Monitor Changes and Harvest Insights from a Global-sc...Using Visualizations to Monitor Changes and Harvest Insights from a Global-sc...
Using Visualizations to Monitor Changes and Harvest Insights from a Global-sc...
 
Data Visualization at Twitter
Data Visualization at TwitterData Visualization at Twitter
Data Visualization at Twitter
 
Making Sense of Millions of Thoughts: Finding Patterns in the Tweets
Making Sense of Millions of Thoughts: Finding Patterns in the TweetsMaking Sense of Millions of Thoughts: Finding Patterns in the Tweets
Making Sense of Millions of Thoughts: Finding Patterns in the Tweets
 
From Data to Visualization, what happens in between?
From Data to Visualization, what happens in between?From Data to Visualization, what happens in between?
From Data to Visualization, what happens in between?
 
A Narrative Display for Sports Tournament Recap
A Narrative Display for Sports Tournament RecapA Narrative Display for Sports Tournament Recap
A Narrative Display for Sports Tournament Recap
 
Visualization for Event Sequences Exploration
Visualization for Event Sequences ExplorationVisualization for Event Sequences Exploration
Visualization for Event Sequences Exploration
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 

Último (20)

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 

Krist Wongsuphasawat's Dissertation Proposal Slides: Interactive Exploration of Event Sequences in Temporal Categorical Data

  • 1. INTERACTIVE EXPLORATION OF EVENT SEQUENCES IN TEMPORAL CATEGORICAL DATA KRIST WONGSUPHASAWAT DEPARTMENT OF COMPUTER SCIENCE & HUMAN-COMPUTER INTERACTION LAB UNIVERSITY OF MARYLAND al! pos at ion Pro Dissert 2010! , April 30
  • 2. INTERACTIVE EXPLORATION OF EVENT SEQUENCES IN TEMPORAL CATEGORICAL DATA KRIST WONGSUPHASAWAT DEPARTMENT OF COMPUTER SCIENCE & HUMAN-COMPUTER INTERACTION LAB UNIVERSITY OF MARYLAND al! pos at ion Pro Dissert 2010! , April 30
  • 3. TEMPORAL CATEGORICAL DATA •  A type of time series Category! Numerical Categorical Event! Stock: Microsoft Patient ID: 45851737 04/26/2010&10:00 &31.03& 12/02/2008&14:26 &Admit& 04/26/2010&10:15 &31.01& 12/02/2008&14:26 &Emergency& 04/26/2010&10:30 &31.02& 12/02/2008&22:44 &ICU& 04/26/2010&10:45 &31.08& 12/05/2008&05:07 &Floor& 04/26/2010&11:00 &31.16& 12/08/2008&10:02 &Floor& 04/26/2010&11:15 &31.15& 12/14/2008&06:19 &Exit& & Time Admit Emergency ICU Floor Exit
  • 4. TEMPORAL CATEGORICAL DATA •  Electronic Health Records: symptoms, treatment, lab test •  Traffic incident logs: arrival/departure time of each unit •  Student records: course, paper, proposal, defense, etc. •  Usability study logs •  Etc.
  • 5. working with physicians at WASHINGTON HOSPITAL CENTER
  • 6. INTERACTIVE EXPLORATION OF EVENT SEQUENCES IN TEMPORAL CATEGORICAL DATA KRIST WONGSUPHASAWAT DEPARTMENT OF COMPUTER SCIENCE & HUMAN-COMPUTER INTERACTION LAB UNIVERSITY OF MARYLAND al! pos at ion Pro Dissert 2010! , April 30
  • 7. EVENT SEQUENCES •  Examples: “Bounce backs” ICU Floor ICU within 2 days
  • 8. RESEARCH STATEMENT “my research aims to design effective visualization and interaction techniques to support users in exploring event sequences in temporal categorical data.” •  a flexible temporal search approach •  a novel visualization that provides an overview of multiple records
  • 9. RESEARCH QUESTION#1 PRELIM. + PROPOSED MOTIVATION WORK CONCLUSION RESEARCH RESEARCH QUESTIONS QUESTION#2 PRELIM. + PROPOSED WORK
  • 10. TRADITIONAL SEARCH MODEL Know exactly what they want Query Data * How to specify the query? How to display the data? SQL, TSQL, GUI, ... How to process the query? How to make the search efficient? Indexing methods, algorithms
  • 11. EXPLORATORY SEARCH MODEL Uncertain about what they are looking for Know exactly what they want Insight, Hypotheses * search Query Data refine, formulate
  • 12. UNCERTAIN Floor ICU Exit-Alive QUERY within 2 days Find something useful and display RESULTS Frustrated! found 0 record
  • 13. EXPLORATORY SEARCH MODEL Uncertain about what they are looking for Know exactly what they want Insight, Hypotheses * search Query Data refine, formulate How to display the data? Difficult to understand the event sequences in the data quickly
  • 14. NEEDS A STARTING POINT Show overview or summary Where should I start?
  • 15. RESEARCH QUESTIONS 1.  How to support the users when they are uncertain about what they are looking for? 2.  How to provide an overview of event sequences for temporal categorical data?
  • 16. RESEARCH QUESTIONS 1.  How to support the users when they are uncertain about what they are looking for? 2.  How to provide an overview of event sequences for temporal categorical data?
  • 17. APPROACHES Exact Search! Similarity-based Search! MUST have A, B, C SHOULD have A, B, C Query Query Record#1 Record#2 more Record#2 Record#1 similar Record#3 Record#3
  • 18. SIMILARITY-BASED SEARCH How to specify query? What is “similar”? How to display results? User Interface! Similarity Measure & Visualization
  • 19. SIMILARITY MEASURE Gives a score that shows how much a record is similar to the query. Min = 0 / Max = 1 Query Record#1 0.80 Record#2 0.63 Record#3 0.96 Record#4 0.77
  • 20. SIMILARITY MEASURE (2) Query Record#3 0.96 Record#1 0.80 more similar Record#4 0.77 Record#2 0.63
  • 21. CHALLENGE •  What is “similar”? depends&on&users/tasks& Record#1 (Query) A B C Record#2 missing A B C Record#3 extra A B C D Record#4 Time difference A B C time
  • 22. MATCH & MISMATCH MEASURE time Record#1 (Query) A C B C Record#2 A B C B Matched Events Missing Event Extra Event } Time Difference Number of Swapping Total Score Number of Missing Events 0 to 1 Number of Extra Events
  • 23. RELATED WORK •  Similarity measure –  For numerical time series •  [Ding et al. 2008], [Liu et al. 2006], [Berndt and Clifford 1994], ... –  Edit distance •  [Levenshtein 1966], [Winkler 1999], [Chen 2004], ... –  Biological sequence searching •  [Pearson and Lipman 1988], [Altschul et al. 1990], ... –  Approximate graph matching •  [Tian et al. 2007], [Tian et al. 2008] , ... –  Temporal Categorical Data •  [Sherkat 2006], ...
  • 25. DEMO - DATA •  Patient transfers in the emergency department •  Real data from hospital (Jan – Mar 2010) ADMIT Admission time EMERGENCY Emergency room ICU Intensive Care Unit INTERMEDIATE Intermediate Medical Care FLOOR Normal room EXIT-ALIVE Leave the hospital alive EXIT-DEAD Leave the hospital dead
  • 26. DEMO - SIMILAN This is when bugs are more likely to occur.!
  • 27. CONTROLLED EXPERIMENT •  18 participants •  2 interfaces by 5 tasks –  Similarity-based Search (Similan) –  Exact Search (LifeLines2) •  Time, error, preference
  • 28. X = Exact Search , S = Similarity-based Search RESULTS Sequence Counting Time const. Uncertainty Sequence CountSpeed (seconds) Time Uncertain 60 15 150 150 40 10 100 100 20 5 50 50 0 0 0 0 X S X S R X S X S Preference (7-points likert scale) 8 8 8 8 6 6 6 6 4 4 4 4 2 2 2 2 0 0 0 0 X S X S X S R X S
  • 30. PROPOSED WORK: FTS •  Draw an example to specify query •  Specify flexible and inflexible constraints MUST HAVE RANGE ? MUST NOT HAVE optional •  Algorithms
  • 31. PROPOSED WORK: FTS (2) •  Combine exact and similarity-based search Primary Bin similar Pass all constraints Secondary Bin similar The rest
  • 32. RESEARCH QUESTIONS 1.  How to support the users when they are uncertain about what they are looking for? 2.  How to provide an overview of event sequences for temporal categorical data?
  • 33. WHEN YOU READ A PAPER… •  What is it about? INFORMATION VISUALIZATION MANTRA OVERVIEW FIRST, ZOOM AND FILTER, THEN DETAILS ON DEMAND
  • 34. RELATED WORK •  Single-record visualizations Patient ID: 45851737 12/02/2008&14:26 &Admit& 12/02/2008&14:26 &Emergency& Single-record" 12/02/2008&22:44 &ICU& Visualization 12/05/2008&05:07 &Floor& 12/08/2008&10:02 &Floor& 12/14/2008&06:19 &Exit& & •  E.g. LifeLines, MIDGAARD, etc.
  • 35.
  • 36. RELATED WORK (2) •  Multiple instances of single-record visualizations Patient ID: 45851737 Patient ID: 45851737 Patient ID: 45851737 12/02/2008&14:26 &Admit& 12/02/2008&14:26 45851737 Patient ID: &Admit& 12/02/2008&14:26 &Emergency& 12/02/2008&14:26 &Admit& 12/02/2008&22:44 &Emergency& 12/02/2008&14:26 &ICU& 12/02/2008&14:26 &Admit& 12/02/2008&14:26 &Emergency& 12/02/2008&22:44 &ICU& 12/02/2008&14:26 &Emergency& Visualization 12/05/2008&05:07 &Floor& 12/02/2008&22:44 &ICU& Visualization Single-record" Visualization 12/05/2008&05:07 &Floor& 12/02/2008&22:44 &ICU& 12/08/2008&10:02 &Floor& 12/05/2008&05:07 &Floor& Visualization 12/08/2008&10:02 &Floor& 12/05/2008&05:07 &Floor& 12/14/2008&06:19 &Exit& 12/08/2008&10:02 &Floor& & 12/14/2008&06:19 &Exit& 12/08/2008&10:02 &Floor& 12/14/2008&06:19 &Exit& & 12/14/2008&06:19 &Exit& & & •  E.g. LifeLines2, Continuum, ActiviTree, etc.
  • 38. RELATED WORK (3) •  Multiple-record visualizations Patient ID: 45851737 Patient ID: 45851737 Patient ID: 45851737 12/02/2008&14:26 &Admit& 12/02/2008&14:26 45851737 Patient ID: &Admit& 12/02/2008&14:26 &Emergency& 12/02/2008&14:26 &Admit& 12/02/2008&22:44 &Emergency& 12/02/2008&14:26 &ICU& 12/02/2008&14:26 &Admit& 12/02/2008&14:26 &Emergency& Multiple-record" 12/02/2008&22:44 &ICU& 12/02/2008&14:26 &Emergency& 12/05/2008&05:07 &Floor& 12/02/2008&22:44 &ICU& Visualization 12/05/2008&05:07 &Floor& 12/02/2008&22:44 &ICU& 12/08/2008&10:02 &Floor& 12/05/2008&05:07 &Floor& 12/08/2008&10:02 &Floor& 12/05/2008&05:07 &Floor& 12/14/2008&06:19 &Exit& 12/08/2008&10:02 &Floor& & 12/14/2008&06:19 &Exit& 12/08/2008&10:02 &Floor& 12/14/2008&06:19 &Exit& & 12/14/2008&06:19 &Exit& & & •  E.g. LifeLines2, Continuum
  • 39. LifeLines2’s Temporal Summary [Wang et al. 2009] Continuum’s Histogram [Andre 2007] Not so useful with many event types Does not help exploring event sequences
  • 40. OVERVIEW OF EVENT SEQUENCES •  Scalability vs. Loss of information Patient ID: 45851737 Patient ID: 45851737 Patient ID: 45851737 12/02/2008&14:26 &Admit& 12/02/2008&14:26 &Admit& 12/02/2008&14:26 &Emergency& ? 12/02/2008&14:26 &Admit& 12/02/2008&22:44 &Emergency& 12/02/2008&14:26 &ICU& 12/02/2008&14:26 &Emergency& 12/02/2008&22:44 &ICU& 12/05/2008&05:07 &Floor& 12/02/2008&22:44 &ICU& 12/08/2008&10:02 &Floor& 12/05/2008&05:07 &Floor& 12/05/2008&05:07 &Floor& 12/08/2008&10:02 &Floor& 12/14/2008&06:19 &Exit& 12/08/2008&10:02 &Floor& 12/14/2008&06:19 &Exit& & 12/14/2008&06:19 &Exit& & & •  Goal: Show common trends, outliers
  • 41. LIFEFLOW AGGREGATE Merge multiple records into tree VISUALIZE Display the tree
  • 42. AGGREGATE •  Aggregate by prefix #1 #2 #3 #4 Example with 4 records
  • 43. AGGREGATE •  Aggregate by prefix #1 #2 #3 #4
  • 44. VISUALIZE •  Inspired by the Icicle tree Number of files!
  • 45. VISUALIZE (2) •  Modify: Use horizontal axis to represent time •  Video
  • 46. DEMO - LIFEFLOW When the lines are combined into flow!
  • 47. LIFEFLOW •  Overview –  See trends –  Spot outliers •  Query •  Compare data
  • 48. COMPARISON ICU ICU Intermediate Intermediate Jan-Mar 2008 April-June 2008 Floor
  • 49. PROPOSED WORK: LIFEFLOW •  Designing interaction
  • 50. PROPOSED WORK: LIFEFLOW •  Primary –  Including non-temporal attributes •  Age, Gender, etc. –  Link with Flexible Temporal Search •  Optional –  Rank-by-feature –  Complex aggregation –  Multiple alignments
  • 51. PROPOSED WORK: LIFEFLOW •  Including non-temporal attributes –  [Pinto et al. 2001] Male Male Female Male Male Female
  • 52. PROPOSED WORK: LIFEFLOW •  Complex aggregation #7 #8 Not just merge by prefix #9 #10 Do not merge if the time gap is too different
  • 53. PROPOSED WORK: LIFEFLOW •  Multiple alignments ICU Floor ICU
  • 54. EVALUATION PLAN •  Flexible Temporal Search –  Controlled Experiment •  18-20 participants / 60-90 mins •  LifeFlow –  Usability Study •  15-20 participants / 60-90 mins –  MILCs •  3-5 domain experts / 5-12 weeks
  • 56. EXPLORATORY SEARCH MODEL Uncertain about what they are looking for Know exactly what they want Insight, Hypotheses * search Query Data refine, formulate How to display the data? Difficult to understand the event sequences in the data quickly
  • 57. RESEARCH QUESTIONS 1.  How to support the users when they are uncertain about what they are looking for? Flexible Temporal Search 2.  How to provide an overview of event sequences for temporal categorical data? LifeFlow: an overview visualization
  • 58. EXPECTED CONTRIBUTIONS 1.  Design of visual representations, user interfaces and interaction techniques 2.  Algorithms for flexible temporal search 3.  Evaluation results 4.  Open new directions for exploring temporal categorical data
  • 59. ACKNOWLEDGEMENT DR. PHUONG HO, DR. MARK SMITH, DAVID ROSEMAN WASHINGTON HOSPITAL CENTER http://www.whcenter.org NATIONAL INSTITUTES OF HEALTH (NIH) http://www.nih.gov MICHAEL PACK, MICHAEL VANDANIKER CENTER FOR ADVANCED TRANPORTATION TECHNOLOGY LAB (CATT LAB) http://www.cattlab.umd.edu