Cagatay Turkay
Erdem Kaya
Selim Balcisoy
Helwig Hauser www.gicentre.net/vis2016
Designing Progressive and Interactive
Analytics Processes
for High-Dimensional Data Analysis
Visual analytics (VA) can be considered as an
interactive and iterative dialogue between the
human and the computer where the interactive
analysis process is a sequence of actions by the
user and responses by the computer motivated by
an analytical question …
Based on several works, e.g., [Keim et al. 2008], [Green et al. 2008]
…. this iterative discourse serves as the fundamental
mechanism through which analysts make observations
… we need to ensure that this dialogue takes place at a
pace that respects the temporal cognitive capabilities
of users …
Please wait, while I construct my next sentence ...
Card, S.K., Robertson, G.G. and Mackinlay, J.D., 1991, The information visualizer, an information workspace. In Proceedings of the ACM SIGCHI
THURSDAY, 4:15
InfoVis: Scalable
Algorithms
This paper …
… visual data analysis processes where a computational
tool is integrated to support high-dimensional data analysis
… instead of forcing the user to wait for an interactive computation to
finish, we present a best possible result within an acceptable time
frame.
In essence ..
…. techniques and design considerations to
incorporate progressive methods within interactive
analysis processes that involve high-dimensional data ….
Online algorithms
Visual representations
Levels of operation
Interactions
Human time constants to govern the pace of interaction….
Levels of Operation
Level 1 (0.1 sec.)
What: (animated) transitions between (computation) results
Why: ensures perceptually smooth transitions
Level 2 (1 sec.)
What: guaranteed response time for intermediate results
Why: Maintains dialog nature
Level 3 (10 – 30 sec.)
What: analytical unit task completion
Why: Answer a specific question
e.g., finding groups, locating outliers
A framework to implement human time constants
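As a rough illustration of the three levels above, the time constants could be written down as per-level budgets that a progressive computation checks its elapsed time against. This is only a sketch: the names `Level`, `LEVELS` and `level_for` are hypothetical and do not come from the paper, and the 30-second value is simply the upper end of the 10 – 30 sec. range on the slide.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Level:
    name: str
    budget_sec: float  # upper time bound for this level (values from the slide)
    purpose: str


# Hypothetical encoding of the three levels of operation.
LEVELS = (
    Level("transition", 0.1, "perceptually smooth animated transitions"),
    Level("intermediate result", 1.0, "guaranteed response that keeps the dialogue going"),
    Level("unit task", 30.0, "answer one specific analytical question"),
)


def level_for(elapsed_sec: float) -> Optional[Level]:
    """Return the tightest level whose budget still covers elapsed_sec, or None."""
    for level in LEVELS:
        if elapsed_sec <= level.budget_sec:
            return level
    return None  # slower than every level: the dialogue is broken
```

A caller could use `level_for(elapsed)` to decide, for example, whether an update is fast enough to animate directly or should be delivered as an intermediate result.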
Integrating online algorithms
Can operate on small batches of data (on random sample subsets)
Produce approximate results
Updates can be done efficiently
Online PCA (Ross et al., 2008)
Online clustering (Sculley et al., 2010)
online PCA
- Incremental SVD computation
- Intermediate results every 1 sec.
- Immediate response
- Compute on subset but project all
- Colouring & improved transitions
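The idea of "compute on subset but project all" can be sketched without any dependencies. Note the substitution: the slide names incremental SVD, whereas this sketch keeps a running mean and scatter matrix and extracts only the first principal axis by power iteration. The class `RunningPCA` and its methods are hypothetical names chosen for illustration.

```python
class RunningPCA:
    """Running mean/scatter over batches; first principal axis by power iteration.

    A simplified stand-in for incremental SVD, not the method of Ross et al.
    """

    def __init__(self, dim):
        self.n = 0
        self.mean = [0.0] * dim
        self.scatter = [[0.0] * dim for _ in range(dim)]

    def update(self, batch):
        """Fold a batch of items into the running statistics (Welford-style)."""
        for x in batch:
            self.n += 1
            before = [xi - mi for xi, mi in zip(x, self.mean)]
            self.mean = [mi + di / self.n for mi, di in zip(self.mean, before)]
            after = [xi - mi for xi, mi in zip(x, self.mean)]
            for i, di in enumerate(before):
                for j, dj in enumerate(after):
                    self.scatter[i][j] += di * dj

    def first_axis(self, iters=100):
        """Approximate the dominant eigenvector of the scatter matrix."""
        dim = len(self.mean)
        # Arbitrary non-uniform start vector; an unlucky start orthogonal
        # to the true axis would not converge.
        v = [1.0 / (i + 1) for i in range(dim)]
        for _ in range(iters):
            w = [sum(row[j] * v[j] for j in range(dim)) for row in self.scatter]
            norm = sum(wi * wi for wi in w) ** 0.5
            if norm == 0.0:
                break
            v = [wi / norm for wi in w]
        return v

    def project(self, items):
        """Project ALL items onto the axis estimated from the subset seen so far."""
        axis = self.first_axis()
        return [sum((xi - mi) * ai for xi, mi, ai in zip(x, self.mean, axis))
                for x in items]
```

After each `update` with a new random batch, calling `project` on the full data set gives an intermediate layout that can be refined as more batches arrive.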
Adaptive random sampling
Guarantee response in a fixed period of time (i.e. 1 sec.)
Faster convergence
1st batch size: 8%, time taken: 0.3 sec.
2nd batch size: 16%, time taken: 0.7 sec.
3rd batch size: 33%, time taken: 1.3 sec.
4th batch size: 25%, time taken: 0.9 sec.
5th batch size: 25%, time taken: 0.9 sec.
Let’s say these are your data items
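One simple way to adapt the batch size to a fixed time budget is a proportional rule: grow the batch while the last step finished early, shrink it when the step overshot. The sketch below is an assumption for illustration, not the estimator used in the paper; `next_batch_fraction`, `safety` and `max_growth` are hypothetical names and values, and the rule does not reproduce the exact percentages listed above.

```python
def next_batch_fraction(fraction, elapsed_sec, budget_sec=1.0,
                        safety=0.9, max_growth=2.0):
    """Estimate the next batch size as a fraction of all items, in (0, 1].

    fraction    -- batch size used in the last step (fraction of the data)
    elapsed_sec -- how long that step took
    budget_sec  -- response-time guarantee (1 sec. on the slide)
    safety      -- aim slightly under the budget (assumed value)
    max_growth  -- never more than double the batch in one step (assumed value)
    """
    if elapsed_sec <= 0:
        scale = max_growth
    else:
        scale = min(max_growth, safety * budget_sec / elapsed_sec)
    return min(1.0, fraction * scale)


# Replay the first four timings from the list above.
fraction = 0.08
for elapsed in (0.3, 0.7, 1.3, 0.9):
    fraction = next_batch_fraction(fraction, elapsed)
    print(f"next batch: {fraction:.1%}")
```

Capping the growth factor keeps a single fast step from causing a large overshoot on the next one.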
online clustering
- Cluster only the subset
- Incrementally grow clusters
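Clustering only a subset and growing the clusters incrementally can be sketched as a mini-batch k-means update in the style of Sculley (2010). This is a dependency-free illustration under stated assumptions, not the authors' implementation; the function names, the random initialisation and the `seed` parameter are hypothetical.

```python
import random


def nearest(centers, x):
    """Index of the center closest to x (squared Euclidean distance)."""
    return min(range(len(centers)),
               key=lambda j: sum((a - b) ** 2 for a, b in zip(centers[j], x)))


def minibatch_kmeans_step(centers, counts, batch):
    """One mini-batch update; mutates centers and counts in place."""
    assigned = [nearest(centers, x) for x in batch]  # assign with centers fixed
    for x, j in zip(batch, assigned):
        counts[j] += 1
        eta = 1.0 / counts[j]  # per-center learning rate shrinks over time
        centers[j] = [(1 - eta) * c + eta * v for c, v in zip(centers[j], x)]


def progressive_kmeans(data, k, batch_size, steps, seed=0):
    """Yield a copy of the centers after every mini-batch (intermediate results)."""
    rng = random.Random(seed)
    centers = [list(x) for x in rng.sample(data, k)]
    counts = [0] * k
    for _ in range(steps):
        minibatch_kmeans_step(centers, counts, rng.sample(data, batch_size))
        yield [list(c) for c in centers]
```

Because `progressive_kmeans` is a generator, a visualization can redraw after each yielded result and stop consuming whenever the analyst pauses or changes the selection.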
progress & certainty
Increasing sample size
Interaction methods to moderate the process
Key-framed brushing [Turkay, 2014]
Well-defined sequences that can be represented in 30 sec.
Help define analytical unit task
Evaluation Workshops
Problem: Credit card transactions segmentation -- groups of expenditures with similar
characteristics
Data: 300K+ CC transactions, 5K customers (demog., location, financial metrics, etc.)
Methodology:
- 2-month-long case study, 4 analysis sessions
(1 for training) with 4 CRM analysts
- Fly-on-the-wall observations
- Semi-structured interviews
- Video and sound recorded, renounce times noted,
32 hours of video processed for the extraction
of inference moments and quotes transcribed.
- Insights, questions, hypotheses identified
Observed/reported positive aspects
- Generation and verification of hypotheses in short time
- Continuous engagement
“We could generate so many new hypotheses in a very short time without
waiting for the whole calculation to end.”
“..., [Visualization] is quite engaging as we don’t have to wait for even a
moment to get some initial results.”
- Stability is key in decisions
“… It seems like the clustering will not change. ... let’s switch to some
other set …”
Observed/reported issues
- Continuous update of the visualization can be distracting
“... it can be distracting to look at an ever-changing visualization. [If we were]
able to set the step size, … then we can have some time to talk about
intermediate results.”
- Uncertainty and instability are an issue
“… I’ve just seen a high response score for the selected cluster, but it has just
gone away.”
- Early decisions might be wrong
- Multiple views operating concurrently can be problematic
Unaligned convergence
Ten Design Recommendations
DR1: Employ human time constants as the underlying theoretical
framework that governs the pace of interaction in analytical processes
DR2: Employ online learning algorithms that are capable of handling
data in sub-batches to perform computational tasks.
DR3: Employ an adaptive sampling mechanism that estimates suitable sample sizes for
computations to ensure efficiency in convergence
while still respecting the temporal constraints.
DR4: Facilitate the immediate initiation of computations in response
to user interactions that limit the domain of the algorithms.
DR5: Provide users with interaction mechanisms enabling management (pause, step size,
re-run) of the progression.
DR6: During the interaction design of visual analytic solutions, consider the effects of possible
fluctuations due to unaligned progression
in multiple progressive views.
DR7: Provide interaction mechanisms to define structured investigation sequences for
systematic generation and comparisons of computational results.
DR8: Support the interpretation of the evolution of the results through
suitable visualization techniques.
DR9: Inform analysts on the progress of computations and indications
of time-to-completion.
DR10: Inform analysts on the uncertainty in the computations and the
way the computations develop.
Future challenges & opportunities
Better heuristics/quality metrics
Reproducibility?
- different samples in each run
Provenance
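One possible way to approach the reproducibility question is to derive every random sample from a recorded seed and step number, so that a run can be replayed and the seed stored as provenance. This is a hypothetical sketch, not something proposed on the slides; `sample_batch` and its parameters are illustrative names.

```python
import random


def sample_batch(n_items, batch_size, seed, step):
    """Return the item indices for batch `step`, reproducible from (seed, step)."""
    # A string seed is hashed deterministically by random.Random,
    # so the same (seed, step) pair always gives the same sample.
    rng = random.Random(f"{seed}:{step}")
    return sorted(rng.sample(range(n_items), batch_size))
```

Logging `seed` together with the sequence of steps would then be enough to regenerate the same samples in a later session.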
…. instead of forcing the user to adjust to the temporal and
cognitive capabilities of visual analysis solutions, we orient the
technical solutions at the communication characteristics of
the users.
To conclude …
Full list of giCentre VIS 2016 contributions
www.gicentre.net/vis2016

Designing Progressive and Interactive Analytics Processes for High-Dimensional Data Analysis

  • 1. Cagatay Turkay Erdem Kaya Selim Balcisoy Helwig Hauser www.gicentre.net/vis2016 Designing Progressive and Interactive Analytics Processes for High-Dimensional Data Analysis
  • 2. Visual analytics (VA) can be considered as an interactive and iterative dialogue between the human and the computer where the interactive analysis process is a sequence of actions by the user and responses by the computer motivated by an analytical question … Based on several works, e.g., [Keim et al. 2008], [Green et al. 2008]
  • 3. …. this iterative discourse serves as the fundamental mechanism through which analysts make observations … we need to ensure that this dialogue takes place at a pace that respects the temporal cognitive capabilities of users …
  • 5. Card, S.K., Robertson, G.G. and Mackinlay, J.D., 1991, The information visualizer, an information workspace. In Proceedings of the ACM SIGCHI
  • 8. This paper … … visual data analysis processes where a computational tool is integrated to support high-dimensional data analysis
  • 9. … instead of forcing the user to wait for an interactive computation to finish, we present a best possible result within an acceptable time frame. In essence ..
  • 10. …. techniques and design considerations to incorporate progressive methods within interactive analysis processes that involve high-dimensional data ….
      - Online algorithms
      - Visual representations
      - Levels of operation
      - Interactions
  • 11. Human time constants to govern the pace of interaction….
  • 12. Levels of Operation: a framework to implement human time constants
      - Level 1 (0.1 sec.). What: (animated) transitions between (computation) results. Why: ensures perceptually smooth transitions.
      - Level 2 (1 sec.). What: guaranteed response time for intermediate results. Why: maintains the dialogue nature.
      - Level 3 (10–30 sec.). What: analytical unit task completion. Why: answers a specific question, e.g. finding groups, locating outliers.
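
The Level 2 guarantee (an intermediate result within about 1 sec.) can be pictured as a loop that consumes steps of a progressive computation until a deadline passes and then hands back whatever it has. The following is a minimal sketch under stated assumptions, not the authors' implementation: `run_within_budget`, its parameters, and the injectable `clock` are hypothetical names introduced here for illustration.

```python
import time


def run_within_budget(steps, budget=1.0, clock=time.monotonic):
    """Consume intermediate results from `steps` until `budget` seconds elapse.

    `steps` is any iterator that yields successively better results.
    Returns the most recent result, or None if `steps` yields nothing.
    Assumption: each individual step is short; a single long step cannot
    be interrupted by this loop and would overrun the budget.
    """
    deadline = clock() + budget
    latest = None
    for latest in steps:
        # Check the clock after every step; stop as soon as time is up.
        if clock() >= deadline:
            break
    return latest
```

Passing the clock as a parameter keeps the sketch testable without real waiting; a caller would normally leave the default in place.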
  • 13. Integrating online algorithms
      - Can operate on small batches of data (on random sample subsets)
      - Produce approximate results
      - Updates can be done efficiently
      - Online PCA (Ross et al., 2008)
      - Online clustering (Sculley et al., 2010)
  • 14. Online PCA
      - Incremental SVD computation
      - Intermediate results every 1 sec.
      - Immediate response
      - Compute on subset but project all
      - Colouring & improved transitions
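
The idea of "compute on subset but project all" can be illustrated with a small standard-library sketch. Note the substitution: this does not implement the incremental SVD of Ross et al. that the slide refers to. It keeps a running mean and scatter matrix that are updated batch by batch, and extracts only the first principal component by power iteration. The class name `RunningPCA` and its methods are hypothetical.

```python
import math


class RunningPCA:
    """Running mean/scatter over batches; first component via power iteration."""

    def __init__(self, dim):
        self.n = 0
        self.mean = [0.0] * dim
        # Accumulated sum of outer products of deviations (scatter matrix).
        self.scatter = [[0.0] * dim for _ in range(dim)]

    def partial_fit(self, batch):
        """Fold one batch of points into the running statistics."""
        for x in batch:
            self.n += 1
            delta_old = [xi - mi for xi, mi in zip(x, self.mean)]
            self.mean = [mi + di / self.n for mi, di in zip(self.mean, delta_old)]
            delta_new = [xi - mi for xi, mi in zip(x, self.mean)]
            for i, di in enumerate(delta_old):
                for j, dj in enumerate(delta_new):
                    self.scatter[i][j] += di * dj

    def top_component(self, iterations=100):
        """Approximate the leading eigenvector of the scatter matrix.

        Limitation: the uniform start vector fails if it happens to be
        orthogonal to the leading eigenvector.
        """
        dim = len(self.mean)
        v = [1.0 / math.sqrt(dim)] * dim
        for _ in range(iterations):
            w = [sum(row[j] * v[j] for j in range(dim)) for row in self.scatter]
            norm = math.sqrt(sum(wi * wi for wi in w))
            if norm == 0.0:
                break
            v = [wi / norm for wi in w]
        return v

    def project(self, points):
        """Project any points (not only the fitted subset) onto the component."""
        v = self.top_component()
        return [sum((xi - mi) * vi for xi, mi, vi in zip(x, self.mean, v))
                for x in points]
```

In use, `partial_fit` would be called on each sampled batch and `project` on the full data set after every batch, so that all items stay visible while the projection is refined.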
  • 15. Adaptive random sampling
      - Guarantee response in a fixed period of time (i.e. 1 sec.)
      - Faster convergence
      - 1st batch size: 8%, time taken: 0.3 sec.
      - 2nd batch size: 16%, time taken: 0.7 sec.
      - 3rd batch size: 33%, time taken: 1.3 sec.
      - 4th batch size: 25%, time taken: 0.9 sec.
      - 5th batch size: 25%, time taken: 0.9 sec.
      - Let’s say these are your data items
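
The slide does not state the rule used to pick each batch size, so the following is only a sketch of one plausible controller. It assumes that computation time grows roughly linearly with the sample fraction, scales the fraction so that the next batch would just fill the time budget, and caps growth at a factor of two per step. The names `next_batch_fraction` and `simulate_batches`, and all parameter values, are hypothetical.

```python
def next_batch_fraction(fraction, elapsed, budget=1.0, max_growth=2.0,
                        min_fraction=0.01, max_fraction=1.0):
    """Return the sample fraction to use for the next batch.

    Assumes cost is roughly linear in batch size, so the fraction that
    would just fill the budget is fraction * budget / elapsed.
    """
    if elapsed <= 0:
        target = fraction * max_growth
    else:
        target = fraction * budget / elapsed
    # Never grow by more than max_growth in a single step.
    target = min(target, fraction * max_growth)
    return max(min_fraction, min(max_fraction, target))


def simulate_batches(cost_per_unit, start, steps, budget=1.0):
    """Simulate the controller with a made-up linear cost model.

    cost_per_unit is the (hypothetical) time in seconds needed to
    process the whole data set; a batch costs cost_per_unit * fraction.
    """
    fractions = [start]
    fraction = start
    for _ in range(steps):
        elapsed = cost_per_unit * fraction
        fraction = next_batch_fraction(fraction, elapsed, budget)
        fractions.append(fraction)
    return fractions
```

Under this rule the fraction grows while batches finish early and shrinks after an overrun, which matches the qualitative pattern on the slide (growth up to 33%, then a step back to 25%) without claiming to reproduce its exact numbers.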
  • 16. Online clustering
      - Cluster only the subset
      - Incrementally grow clusters
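
As an illustration of clustering only a sampled subset and refining the clusters incrementally, here is a minimal standard-library sketch in the spirit of mini-batch k-means (the per-centre learning-rate update described by Sculley, 2010). It is not the authors' code: the function names, the random initialisation, and the choice to yield the centres after every batch are assumptions made for this example.

```python
import random


def nearest(centers, point):
    """Index of the centre closest to `point` (squared Euclidean distance)."""
    return min(range(len(centers)),
               key=lambda i: sum((c - p) ** 2 for c, p in zip(centers[i], point)))


def minibatch_kmeans(data, k, batch_size, n_batches, seed=0):
    """Yield the current centres after each mini-batch update.

    `data` is a list of equal-length numeric tuples. Centres are
    initialised from k randomly chosen data points (an assumption).
    """
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(data, k)]
    counts = [0] * k
    for _ in range(n_batches):
        batch = rng.sample(data, min(batch_size, len(data)))
        # Cache assignments first, then update centres point by point.
        assigned = [nearest(centers, p) for p in batch]
        for p, j in zip(batch, assigned):
            counts[j] += 1
            eta = 1.0 / counts[j]  # per-centre learning rate
            centers[j] = [(1 - eta) * c + eta * x for c, x in zip(centers[j], p)]
        yield [list(c) for c in centers]
```

Because the function is a generator, a visualization could redraw after every yielded set of centres, which fits the progressive setting described in the slides.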
  • 18. Interaction methods to moderate the process
      - Key-framed brushing [Turkay, 2014]
      - Well-defined sequences that can be represented in 30 sec.
      - Help define analytical unit task
  • 19. Evaluation Workshops
      - Problem: credit card transaction segmentation (groups of expenditures with similar characteristics)
      - Data: 300K+ CC transactions, 5K customers (demog., location, financial metrics, etc.)
      - Methodology:
          - 2-month-long case study, 4 analysis sessions (1 for training) with 4 CRM analysts
          - Fly-on-the-wall observations
          - Semi-structured interviews
          - Video and sound recorded, renounce times noted, 32 hours of video processed for the extraction of inference moments, and quotes transcribed
          - Insights, questions, hypotheses identified
  • 20. Observed/reported positive aspects - Generation and verification of hypotheses in short time ..... ..... .....
  • 21. Observed/reported positive aspects
      - Generation and verification of hypotheses in short time
      - Continuous engagement
      - “We could generate so many new hypotheses in a very short time without waiting for the whole calculation to end.”
      - “..., [Visualization] is quite engaging as we don’t have to wait for even a moment to get some initial results.”
  • 22. Observed/reported positive aspects
      - Generation and verification of hypotheses in short time
      - Continuous engagement
      - Stability is key in decisions
      - “… It seems like the clustering will not change. ... let’s switch to some other set …”
  • 23. Observed/reported issues
      - Continuous update of the visualization can be distracting: “... it can be distracting to look at an ever-changing visualization. [If we were] able to set the step size, … then we can have some time to talk about intermediate results.”
      - Uncertainty and instability are an issue: “…. I’ve just seen a high response score for the selected cluster, but it has just gone away.”
      - Early decisions might be wrong
      - Multiple views operating concurrently can be problematic (unaligned convergence)
  • 24. Ten Design Recommendations
      - DR1: Employ human time constants as the underlying theoretical framework that governs the pace of interaction in analytical processes.
      - DR2: Employ online learning algorithms that are capable of handling data in sub-batches to perform computational tasks.
      - DR3: Employ an adaptive sampling mechanism that estimates suitable sample sizes for computations to ensure efficiency in convergence while still respecting the temporal constraints.
      - DR4: Facilitate the immediate initiation of computations in response to user interactions that limit the domain of the algorithms.
      - DR5: Provide users with interaction mechanisms enabling management (pause, step size, re-run) of the progression.
      - DR6: During the interaction design of visual analytic solutions, consider the effects of possible fluctuations due to unaligned progression in multiple progressive views.
      - DR7: Provide interaction mechanisms to define structured investigation sequences for systematic generation and comparisons of computational results.
      - DR8: Support the interpretation of the evolution of the results through suitable visualization techniques.
      - DR9: Inform analysts on the progress of computations and indications of time-to-completion.
      - DR10: Inform analysts on the uncertainty in the computations and the way the computations develop.
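
The controls named in DR5 (pause, step size, re-run) can be sketched as a thin wrapper around any progressive computation exposed as an iterator. This is an illustrative sketch only; `ProgressionController` and its methods are hypothetical names, and a real tool would call `tick` from its update timer.

```python
class ProgressionController:
    """Pause, step-size, and re-run controls for a progressive computation."""

    def __init__(self, make_steps, step_size=1):
        # make_steps: zero-argument callable returning a fresh iterator
        # of intermediate results (needed so that re-run can start over).
        self._make_steps = make_steps
        self.step_size = step_size
        self.paused = False
        self.rerun()

    def rerun(self):
        """Restart the computation from scratch."""
        self._steps = self._make_steps()
        self.latest = None
        self.finished = False

    def pause(self):
        self.paused = True

    def resume(self):
        self.paused = False

    def tick(self):
        """Advance by `step_size` results unless paused; return the latest."""
        if self.paused or self.finished:
            return self.latest
        for _ in range(self.step_size):
            try:
                self.latest = next(self._steps)
            except StopIteration:
                self.finished = True
                break
        return self.latest
```

A larger `step_size` means the view changes less often but in bigger jumps, which corresponds to the analysts' request (slide 23) for time to discuss intermediate results.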
  • 25. Future challenges & opportunities
      - Better heuristics/quality metrics
      - Reproducibility? (different samples in each run)
      - Provenance
  • 26. …. instead of forcing the user to adjust to the temporal and cognitive capabilities of visual analysis solutions, we orient the technical solutions at the communication characteristics of the users. To conclude …
  • 27. Cagatay Turkay Erdem Kaya Selim Balcisoy Helwig Hauser Designing Progressive and Interactive Analytics Processes for High-Dimensional Data Analysis Full list of giCentre VIS 2016 contributions www.gicentre.net/vis2016