SlideShare uma empresa Scribd logo
1 de 23
Mining Top-K Multidimensional Gradients Department of Informatics School of Engineering University of Minho PORTUGAL Ronnie Alves, Orlando Belo and Joel Ribeiro  9th International Conference on Data Warehousing and Knowledge Discovery (DaWaK 2007)  3-7 September 2007, Regensburg, Germany
Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Mining Top-K Multidimensional Gradients
Gradients ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],*Introduction Mining Top-K Multidimensional Gradients How  is the average of duration call affected  By  age , origin, weekday  in cubes with at least 1000 customers  and where the average of duration calls is between  300s and 720s ? > It goes  (75%)  up for middle-age and people in Porto area on Monday. Typical Cubegrade  “how”  query Imielinski  et al DMKD’02, vol.6
Gradients (A=a1, B=b1, C=c1) (A=a1, B=b1, C=c1, D=d1) (A=a1, B=b1) (A=a1, B=b1, C=c2) roll-up(C) drill-down(D=d1) mutate(C=c2) cubegrade operations Even when considering only  iceberg cells , It may still generate a  very large number of pairs . > Mining gradients with constraints: a)  significance , b)  probe  and  c)  gradient > LiveSet-Driven strategy   Constrained Gradients Mining Top-K Multidimensional Gradients Dong  et al TKDM’02, vol.16 *Introduction
Gradients ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],*Introduction Mining Top-K Multidimensional Gradients Find the  Top-K  highest changes situations related to  average of duration call  originated  in the  Porto  area during the  week . > Find  maximum gradient regions (MGRs)  in  the cube that  maximize  the task of mining Top-K gradient cells . Top-K Gradient Query Alves  et al DaWaK’07
What’s New with Top-K Gradients ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],*Introduction Mining Top-K Multidimensional Gradients
Gradient Regions *Top-K Gradients Mining Top-K Multidimensional Gradients countXY( ) sumXY( ) avgXY() convex non-convex gradient region (GR) > Avg() is an  algebraic function  and It also has an  unpredictable spreading factor  regarding its distribution value > There are also  sets of GRs to looking for Different shapes of  aggregating  functions
Gradient Regions ,[object Object],[object Object],[object Object],[object Object],[object Object],Mining Top-K Multidimensional Gradients *Top-K Gradients GR1 GR2 We expect that GRs with largest aggregating values will provide higher gradient cells
Definitions *Top-K Gradients Mining Top-K Multidimensional Gradients Base Table closed   cell maximal   cell maximal probe cell matchable   cells A cell  cg  is said to be  gradient   cell  of a  probe   cell   cp , when they are  matchable cells  and their delta change, given by  Δg(cg, cp)    (g(cg, cp) ≥   )  is true,  where    is a constant value and  g  is a  gradient function .
Gradient Ascent Approach ,[object Object],[object Object],*Top-K Gradients Mining Top-K Multidimensional Gradients When evaluating a GR we first  search for the   maximal   probe   cells , i.e. the highest aggregating values on it and  then calculates its gradients  from all possible matchable cells.
Gradient-based Cubing ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],*Top-K Gradients Mining Top-K Multidimensional Gradients
Cubing *Top-K Gradients Mining Top-K Multidimensional Gradients X,Y,Z: Selecting dimensions Value list Inverted index Spreading factors C i ={x1,y3,*}={4} Cuboid cell {1,4} {4} U Count (Ci)=1 Intersect tids aggregating function > Assembling high-dimensional cubes from low-dimensional ones  > Follows Frag-Cubing ideas Li  et al VLDB’04
*Top-K Gradients Set Enumeration Tree Mining Top-K Multidimensional Gradients Gradient Region Top-K sets Min_sf>0.25, valid GR > Lattice is formed by projecting GR[x1] >> GR[y2] >> GR[z2] > Find local gradients Agg_value Probe cells 1
*Top-K Gradients Mining Top-K Multidimensional Gradients 2 Projecting probe cells GR[x1] >> GR[y3] Top-K sets Matchable links Bin x1  = [1,4] Min_avg>2.7,  valid Top-KGR
*Top-K Gradients Mining Top-K Multidimensional Gradients 3 Projecting probe cells GR[x1] >> GR[z1] Top-3 = {i, L, j}  {x1,y2,*} -> {x1,y3,*} {x1,*,z3} -> {x1,*,z1} {x1,*,*} -> {x1,y3,*} Top-K sets Matchable links That’s it!!
Mining Top-K Gradients ,[object Object],*Top-K Gradients Mining Top-K Multidimensional Gradients Min_sf Min_avg
Min_ sf  pruning effects *Evaluation Study Mining Top-K Multidimensional Gradients Datasets Running time(s) Min_sf() D2 D1
Min_avg pruning effects *Evaluation Study Mining Top-K Multidimensional Gradients Datasets D2 D1 Running time(s) Min_avg()
K effects *Evaluation Study Mining Top-K Multidimensional Gradients Running time(s) K-cells D2
General pruning effects *Evaluation Study Mining Top-K Multidimensional Gradients D1 & D2 Valid cells Min_sf() K=5, avg>200 1.3M cells 420K cells 200s 170s 1Gb Ram 1M GRs
Conclusions ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Mining Top-K Multidimensional Gradients QUESTIONS??? Department of Informatics School of Engineering University of Minho PORTUGAL Ronnie Alves, Orlando Belo and Joel Ribeiro 9th International Conference on Data Warehousing and Knowledge Discovery (DaWaK 2007)  3-7 September 2007, Regensburg, Germany  Web : http://alfa.di.uminho.pt/~ronnie/
Frag-Cubing ,[object Object],ABCD ABC ABD ACD BCD AC BC AD BD CD A D B C AB Partition dimensions into several groups Materialize low dimensional cuboids offline Assembly high dimensional cuboids online Mining Cube Approach [Li et al, VLDB’04] *Top-K Gradients Mining Top-K Multidimensional Gradients

Mais conteúdo relacionado

Mais procurados

Ch 5: Introduction to heap overflows
Ch 5: Introduction to heap overflowsCh 5: Introduction to heap overflows
Ch 5: Introduction to heap overflowsSam Bowne
 
A TRAINING METHOD USING
 DNN-GUIDED LAYERWISE PRETRAINING
 FOR DEEP GAUSSIAN ...
A TRAINING METHOD USING
 DNN-GUIDED LAYERWISE PRETRAINING
 FOR DEEP GAUSSIAN ...A TRAINING METHOD USING
 DNN-GUIDED LAYERWISE PRETRAINING
 FOR DEEP GAUSSIAN ...
A TRAINING METHOD USING
 DNN-GUIDED LAYERWISE PRETRAINING
 FOR DEEP GAUSSIAN ...Tomoki Koriyama
 
Implementing parallel evolutionary algorithms in concurrent and functional pa...
Implementing parallel evolutionary algorithms in concurrent and functional pa...Implementing parallel evolutionary algorithms in concurrent and functional pa...
Implementing parallel evolutionary algorithms in concurrent and functional pa...José Albert
 
QA Fest 2018. Никита Кричко. Методология использования машинного обучения в н...
QA Fest 2018. Никита Кричко. Методология использования машинного обучения в н...QA Fest 2018. Никита Кричко. Методология использования машинного обучения в н...
QA Fest 2018. Никита Кричко. Методология использования машинного обучения в н...QAFest
 
pMatlab on BlueGene
pMatlab on BlueGenepMatlab on BlueGene
pMatlab on BlueGenevsachde
 
LocationTech Projects
LocationTech ProjectsLocationTech Projects
LocationTech ProjectsJody Garnett
 
From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources
From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources
From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources Thomas Gottron
 
Secondary Spectrum Usage for Mobile Devices
Secondary Spectrum Usage for Mobile DevicesSecondary Spectrum Usage for Mobile Devices
Secondary Spectrum Usage for Mobile DevicesAmjed Majid
 
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQL
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQLModeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQL
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQLKostis Kyzirakos
 
06 how to write a map reduce version of k-means clustering
06 how to write a map reduce version of k-means clustering06 how to write a map reduce version of k-means clustering
06 how to write a map reduce version of k-means clusteringSubhas Kumar Ghosh
 

Mais procurados (14)

Ch 5: Introduction to heap overflows
Ch 5: Introduction to heap overflowsCh 5: Introduction to heap overflows
Ch 5: Introduction to heap overflows
 
A TRAINING METHOD USING
 DNN-GUIDED LAYERWISE PRETRAINING
 FOR DEEP GAUSSIAN ...
A TRAINING METHOD USING
 DNN-GUIDED LAYERWISE PRETRAINING
 FOR DEEP GAUSSIAN ...A TRAINING METHOD USING
 DNN-GUIDED LAYERWISE PRETRAINING
 FOR DEEP GAUSSIAN ...
A TRAINING METHOD USING
 DNN-GUIDED LAYERWISE PRETRAINING
 FOR DEEP GAUSSIAN ...
 
Implementing parallel evolutionary algorithms in concurrent and functional pa...
Implementing parallel evolutionary algorithms in concurrent and functional pa...Implementing parallel evolutionary algorithms in concurrent and functional pa...
Implementing parallel evolutionary algorithms in concurrent and functional pa...
 
QA Fest 2018. Никита Кричко. Методология использования машинного обучения в н...
QA Fest 2018. Никита Кричко. Методология использования машинного обучения в н...QA Fest 2018. Никита Кричко. Методология использования машинного обучения в н...
QA Fest 2018. Никита Кричко. Методология использования машинного обучения в н...
 
Making data storage more efficient
Making data storage more efficientMaking data storage more efficient
Making data storage more efficient
 
14 lab-planing
14 lab-planing14 lab-planing
14 lab-planing
 
Ch13
Ch13Ch13
Ch13
 
pMatlab on BlueGene
pMatlab on BlueGenepMatlab on BlueGene
pMatlab on BlueGene
 
LocationTech Projects
LocationTech ProjectsLocationTech Projects
LocationTech Projects
 
From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources
From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources
From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources
 
Secondary Spectrum Usage for Mobile Devices
Secondary Spectrum Usage for Mobile DevicesSecondary Spectrum Usage for Mobile Devices
Secondary Spectrum Usage for Mobile Devices
 
FTM tree
FTM treeFTM tree
FTM tree
 
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQL
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQLModeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQL
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQL
 
06 how to write a map reduce version of k-means clustering
06 how to write a map reduce version of k-means clustering06 how to write a map reduce version of k-means clustering
06 how to write a map reduce version of k-means clustering
 

Destaque

Latvia PowerPoint Content
Latvia PowerPoint Content Latvia PowerPoint Content
Latvia PowerPoint Content Andrew Schwartz
 
The New Seven Wonders Of The World
The New Seven Wonders Of The WorldThe New Seven Wonders Of The World
The New Seven Wonders Of The Worldsanctuary
 
Seminar University of Peking, October 2011
Seminar University of Peking, October 2011Seminar University of Peking, October 2011
Seminar University of Peking, October 2011Mike Sharples
 
Adobe et la stratégie multi-écrans
Adobe et la stratégie multi-écransAdobe et la stratégie multi-écrans
Adobe et la stratégie multi-écransMichael Chaize
 
我行·你行·大家行01
我行·你行·大家行01我行·你行·大家行01
我行·你行·大家行01liuruifeng
 
6308 Casper Presentationupdated2
6308 Casper Presentationupdated26308 Casper Presentationupdated2
6308 Casper Presentationupdated2Cma Mohd
 
NxTop Radio Episode 1
NxTop Radio Episode 1NxTop Radio Episode 1
NxTop Radio Episode 1Lexumo
 
Kõnepuue
KõnepuueKõnepuue
Kõnepuuekiq
 
Why stop Open Source in the Enterprise?
Why stop Open Source in the Enterprise?Why stop Open Source in the Enterprise?
Why stop Open Source in the Enterprise?John Newton
 
Multimediatag Heidelberg
Multimediatag HeidelbergMultimediatag Heidelberg
Multimediatag HeidelbergMsSchool
 
Urvalsproblemetihistoria
UrvalsproblemetihistoriaUrvalsproblemetihistoria
Urvalsproblemetihistoriahenriksvensson
 
A Journey Into Wholeness Final
A Journey Into Wholeness  FinalA Journey Into Wholeness  Final
A Journey Into Wholeness Finalmsainfo
 
Leading Without Being In Charge
Leading Without Being In ChargeLeading Without Being In Charge
Leading Without Being In ChargeSelena Deckelmann
 
Summary of LA5
Summary of LA5Summary of LA5
Summary of LA5Cma Mohd
 
O que aconteceu com os mundos virtuais no ensino?
O que aconteceu com os mundos virtuais no ensino?O que aconteceu com os mundos virtuais no ensino?
O que aconteceu com os mundos virtuais no ensino?Neli Maria Mengalli
 
Harry Pictures
Harry PicturesHarry Pictures
Harry Pictures경용 박
 

Destaque (20)

Latvia PowerPoint Content
Latvia PowerPoint Content Latvia PowerPoint Content
Latvia PowerPoint Content
 
The New Seven Wonders Of The World
The New Seven Wonders Of The WorldThe New Seven Wonders Of The World
The New Seven Wonders Of The World
 
Seminar University of Peking, October 2011
Seminar University of Peking, October 2011Seminar University of Peking, October 2011
Seminar University of Peking, October 2011
 
Matkalla metaverseen?
Matkalla metaverseen?Matkalla metaverseen?
Matkalla metaverseen?
 
Adobe et la stratégie multi-écrans
Adobe et la stratégie multi-écransAdobe et la stratégie multi-écrans
Adobe et la stratégie multi-écrans
 
我行·你行·大家行01
我行·你行·大家行01我行·你行·大家行01
我行·你行·大家行01
 
6308 Casper Presentationupdated2
6308 Casper Presentationupdated26308 Casper Presentationupdated2
6308 Casper Presentationupdated2
 
Civil War
Civil WarCivil War
Civil War
 
морфология
морфологияморфология
морфология
 
NxTop Radio Episode 1
NxTop Radio Episode 1NxTop Radio Episode 1
NxTop Radio Episode 1
 
Kõnepuue
KõnepuueKõnepuue
Kõnepuue
 
Why stop Open Source in the Enterprise?
Why stop Open Source in the Enterprise?Why stop Open Source in the Enterprise?
Why stop Open Source in the Enterprise?
 
Ch07
Ch07Ch07
Ch07
 
Multimediatag Heidelberg
Multimediatag HeidelbergMultimediatag Heidelberg
Multimediatag Heidelberg
 
Urvalsproblemetihistoria
UrvalsproblemetihistoriaUrvalsproblemetihistoria
Urvalsproblemetihistoria
 
A Journey Into Wholeness Final
A Journey Into Wholeness  FinalA Journey Into Wholeness  Final
A Journey Into Wholeness Final
 
Leading Without Being In Charge
Leading Without Being In ChargeLeading Without Being In Charge
Leading Without Being In Charge
 
Summary of LA5
Summary of LA5Summary of LA5
Summary of LA5
 
O que aconteceu com os mundos virtuais no ensino?
O que aconteceu com os mundos virtuais no ensino?O que aconteceu com os mundos virtuais no ensino?
O que aconteceu com os mundos virtuais no ensino?
 
Harry Pictures
Harry PicturesHarry Pictures
Harry Pictures
 

Semelhante a DaWaK'07

Applying Linear Optimization Using GLPK
Applying Linear Optimization Using GLPKApplying Linear Optimization Using GLPK
Applying Linear Optimization Using GLPKJeremy Chen
 
Sorry - How Bieber broke Google Cloud at Spotify
Sorry - How Bieber broke Google Cloud at SpotifySorry - How Bieber broke Google Cloud at Spotify
Sorry - How Bieber broke Google Cloud at SpotifyNeville Li
 
Latency Performance of Encoding with Random Linear Network Coding
Latency Performance of Encoding with Random Linear Network CodingLatency Performance of Encoding with Random Linear Network Coding
Latency Performance of Encoding with Random Linear Network CodingLars Nielsen
 
Konstandinos_Zamfes_PSI_Drillig_Cutting-Analysis-Geo-Algorithm
Konstandinos_Zamfes_PSI_Drillig_Cutting-Analysis-Geo-AlgorithmKonstandinos_Zamfes_PSI_Drillig_Cutting-Analysis-Geo-Algorithm
Konstandinos_Zamfes_PSI_Drillig_Cutting-Analysis-Geo-AlgorithmKonstandinos Zamfes
 
A Comprehensive Study of Clustering Algorithms for Big Data Mining with MapRe...
A Comprehensive Study of Clustering Algorithms for Big Data Mining with MapRe...A Comprehensive Study of Clustering Algorithms for Big Data Mining with MapRe...
A Comprehensive Study of Clustering Algorithms for Big Data Mining with MapRe...KamleshKumar394
 
Scalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceScalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceKyong-Ha Lee
 
On Sampling from Massive Graph Streams
On Sampling from Massive Graph StreamsOn Sampling from Massive Graph Streams
On Sampling from Massive Graph StreamsNesreen K. Ahmed
 
Aghora A High-Order DG Solver for Turbulent Flow Simulations.pdf
Aghora  A High-Order DG Solver for Turbulent Flow Simulations.pdfAghora  A High-Order DG Solver for Turbulent Flow Simulations.pdf
Aghora A High-Order DG Solver for Turbulent Flow Simulations.pdfSandra Valenzuela
 
Chris hill rps_postgis_threeoutoffouraintbad_20150505_1
Chris hill rps_postgis_threeoutoffouraintbad_20150505_1Chris hill rps_postgis_threeoutoffouraintbad_20150505_1
Chris hill rps_postgis_threeoutoffouraintbad_20150505_1Chris Hill
 
Accelerate Reed-Solomon coding for Fault-Tolerance in RAID-like system
Accelerate Reed-Solomon coding for Fault-Tolerance in RAID-like systemAccelerate Reed-Solomon coding for Fault-Tolerance in RAID-like system
Accelerate Reed-Solomon coding for Fault-Tolerance in RAID-like systemShuai Yuan
 
A Century Of Weather Data - Midwest.io
A Century Of Weather Data - Midwest.ioA Century Of Weather Data - Midwest.io
A Century Of Weather Data - Midwest.ioRandall Hunt
 
Efficient Design Exploration for Civil Aircraft Using a Kriging-Based Genetic...
Efficient Design Exploration for Civil Aircraft Using a Kriging-Based Genetic...Efficient Design Exploration for Civil Aircraft Using a Kriging-Based Genetic...
Efficient Design Exploration for Civil Aircraft Using a Kriging-Based Genetic...Masahiro Kanazaki
 
Data Time Travel by Delta Time Machine
Data Time Travel by Delta Time MachineData Time Travel by Delta Time Machine
Data Time Travel by Delta Time MachineDatabricks
 
Ece512 h1 20139_621386735458ece512_test2_solutions
Ece512 h1 20139_621386735458ece512_test2_solutionsEce512 h1 20139_621386735458ece512_test2_solutions
Ece512 h1 20139_621386735458ece512_test2_solutionsnadia abd
 
Modeling of heat transfer in 2 d slab
Modeling of heat transfer in 2 d slabModeling of heat transfer in 2 d slab
Modeling of heat transfer in 2 d slabAlexander Decker
 
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...Sease
 
Benchmarking Apache Druid
Benchmarking Apache Druid Benchmarking Apache Druid
Benchmarking Apache Druid Matt Sarrel
 

Semelhante a DaWaK'07 (20)

Understanding JVM GC: advanced!
Understanding JVM GC: advanced!Understanding JVM GC: advanced!
Understanding JVM GC: advanced!
 
Applying Linear Optimization Using GLPK
Applying Linear Optimization Using GLPKApplying Linear Optimization Using GLPK
Applying Linear Optimization Using GLPK
 
Sorry - How Bieber broke Google Cloud at Spotify
Sorry - How Bieber broke Google Cloud at SpotifySorry - How Bieber broke Google Cloud at Spotify
Sorry - How Bieber broke Google Cloud at Spotify
 
Latency Performance of Encoding with Random Linear Network Coding
Latency Performance of Encoding with Random Linear Network CodingLatency Performance of Encoding with Random Linear Network Coding
Latency Performance of Encoding with Random Linear Network Coding
 
Konstandinos_Zamfes_PSI_Drillig_Cutting-Analysis-Geo-Algorithm
Konstandinos_Zamfes_PSI_Drillig_Cutting-Analysis-Geo-AlgorithmKonstandinos_Zamfes_PSI_Drillig_Cutting-Analysis-Geo-Algorithm
Konstandinos_Zamfes_PSI_Drillig_Cutting-Analysis-Geo-Algorithm
 
A Comprehensive Study of Clustering Algorithms for Big Data Mining with MapRe...
A Comprehensive Study of Clustering Algorithms for Big Data Mining with MapRe...A Comprehensive Study of Clustering Algorithms for Big Data Mining with MapRe...
A Comprehensive Study of Clustering Algorithms for Big Data Mining with MapRe...
 
Scalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceScalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduce
 
rgDefense
rgDefensergDefense
rgDefense
 
On Sampling from Massive Graph Streams
On Sampling from Massive Graph StreamsOn Sampling from Massive Graph Streams
On Sampling from Massive Graph Streams
 
Aghora A High-Order DG Solver for Turbulent Flow Simulations.pdf
Aghora  A High-Order DG Solver for Turbulent Flow Simulations.pdfAghora  A High-Order DG Solver for Turbulent Flow Simulations.pdf
Aghora A High-Order DG Solver for Turbulent Flow Simulations.pdf
 
Chris hill rps_postgis_threeoutoffouraintbad_20150505_1
Chris hill rps_postgis_threeoutoffouraintbad_20150505_1Chris hill rps_postgis_threeoutoffouraintbad_20150505_1
Chris hill rps_postgis_threeoutoffouraintbad_20150505_1
 
Accelerate Reed-Solomon coding for Fault-Tolerance in RAID-like system
Accelerate Reed-Solomon coding for Fault-Tolerance in RAID-like systemAccelerate Reed-Solomon coding for Fault-Tolerance in RAID-like system
Accelerate Reed-Solomon coding for Fault-Tolerance in RAID-like system
 
A Century Of Weather Data - Midwest.io
A Century Of Weather Data - Midwest.ioA Century Of Weather Data - Midwest.io
A Century Of Weather Data - Midwest.io
 
Efficient Design Exploration for Civil Aircraft Using a Kriging-Based Genetic...
Efficient Design Exploration for Civil Aircraft Using a Kriging-Based Genetic...Efficient Design Exploration for Civil Aircraft Using a Kriging-Based Genetic...
Efficient Design Exploration for Civil Aircraft Using a Kriging-Based Genetic...
 
Avinash_PPT
Avinash_PPTAvinash_PPT
Avinash_PPT
 
Data Time Travel by Delta Time Machine
Data Time Travel by Delta Time MachineData Time Travel by Delta Time Machine
Data Time Travel by Delta Time Machine
 
Ece512 h1 20139_621386735458ece512_test2_solutions
Ece512 h1 20139_621386735458ece512_test2_solutionsEce512 h1 20139_621386735458ece512_test2_solutions
Ece512 h1 20139_621386735458ece512_test2_solutions
 
Modeling of heat transfer in 2 d slab
Modeling of heat transfer in 2 d slabModeling of heat transfer in 2 d slab
Modeling of heat transfer in 2 d slab
 
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
 
Benchmarking Apache Druid
Benchmarking Apache Druid Benchmarking Apache Druid
Benchmarking Apache Druid
 

Último

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 

Último (20)

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 

DaWaK'07

  • 1. Mining Top-K Multidimensional Gradients Department of Informatics School of Engineering University of Minho PORTUGAL Ronnie Alves, Orlando Belo and Joel Ribeiro 9th International Conference on Data Warehousing and Knowledge Discovery (DaWaK 2007) 3-7 September 2007, Regensburg, Germany
  • 2.
  • 3.
  • 4. Gradients (A=a1, B=b1, C=c1) (A=a1, B=b1, C=c1, D=d1) (A=a1, B=b1) (A=a1, B=b1, C=c2) roll-up(C) drill-down(D=d1) mutate(C=c2) cubegrade operations Even when considering only iceberg cells , It may still generate a very large number of pairs . > Mining gradients with constraints: a) significance , b) probe and c) gradient > LiveSet-Driven strategy Constrained Gradients Mining Top-K Multidimensional Gradients Dong et al TKDM’02, vol.16 *Introduction
  • 5.
  • 6.
  • 7. Gradient Regions *Top-K Gradients Mining Top-K Multidimensional Gradients countXY( ) sumXY( ) avgXY() convex non-convex gradient region (GR) > Avg() is an algebraic function and It also has an unpredictable spreading factor regarding its distribution value > There are also sets of GRs to looking for Different shapes of aggregating functions
  • 8.
  • 9. Definitions *Top-K Gradients Mining Top-K Multidimensional Gradients Base Table closed cell maximal cell maximal probe cell matchable cells A cell cg is said to be gradient cell of a probe cell cp , when they are matchable cells and their delta change, given by Δg(cg, cp)  (g(cg, cp) ≥  ) is true, where  is a constant value and g is a gradient function .
  • 10.
  • 11.
  • 12. Cubing *Top-K Gradients Mining Top-K Multidimensional Gradients X,Y,Z: Selecting dimensions Value list Inverted index Spreading factors C i ={x1,y3,*}={4} Cuboid cell {1,4} {4} U Count (Ci)=1 Intersect tids aggregating function > Assembling high-dimensional cubes from low-dimensional ones > Follows Frag-Cubing ideas Li et al VLDB’04
  • 13. *Top-K Gradients Set Enumeration Tree Mining Top-K Multidimensional Gradients Gradient Region Top-K sets Min_sf>0.25, valid GR > Lattice is formed by projecting GR[x1] >> GR[y2] >> GR[z2] > Find local gradients Agg_value Probe cells 1
  • 14. *Top-K Gradients Mining Top-K Multidimensional Gradients 2 Projecting probe cells GR[x1] >> GR[y3] Top-K sets Matchable links Bin x1 = [1,4] Min_avg>2.7, valid Top-KGR
  • 15. *Top-K Gradients Mining Top-K Multidimensional Gradients 3 Projecting probe cells GR[x1] >> GR[z1] Top-3 = {i, L, j} {x1,y2,*} -> {x1,y3,*} {x1,*,z3} -> {x1,*,z1} {x1,*,*} -> {x1,y3,*} Top-K sets Matchable links That’s it!!
  • 16.
  • 17. Min_ sf pruning effects *Evaluation Study Mining Top-K Multidimensional Gradients Datasets Running time(s) Min_sf() D2 D1
  • 18. Min_avg pruning effects *Evaluation Study Mining Top-K Multidimensional Gradients Datasets D2 D1 Running time(s) Min_avg()
  • 19. K effects *Evaluation Study Mining Top-K Multidimensional Gradients Running time(s) K-cells D2
  • 20. General pruning effects *Evaluation Study Mining Top-K Multidimensional Gradients D1 & D2 Valid cells Min_sf() K=5, avg>200 1.3M cells 420K cells 200s 170s 1Gb Ram 1M GRs
  • 21.
  • 22. Mining Top-K Multidimensional Gradients QUESTIONS??? Department of Informatics School of Engineering University of Minho PORTUGAL Ronnie Alves, Orlando Belo and Joel Ribeiro 9th International Conference on Data Warehousing and Knowledge Discovery (DaWaK 2007) 3-7 September 2007, Regensburg, Germany Web : http://alfa.di.uminho.pt/~ronnie/
  • 23.