SlideShare uma empresa Scribd logo
1 de 18
Discovering Spatial Co-location
Patterns : A summary of Results
Shashi Sekhar, Yan Huang
Dept of Computer Sciences, University of Minnesota, USA
AG 2414
Spatial Analysis
Seminar by-
Adrian C Prelipcean
Ipsit Dash
Outline
• Introduction
• Focus of the Research
• Background of the Problem
• Approaches of Modeling the Co-location
problem
• Co-location Miner Algorithm
• Conclusions
Data Data Data!!!!
• Data are being collected continuously for innumerable
phenomenon.
• Business applications
• Scientific Applications
• National Security Purposes
It is impossible to
analyze each strand of
data collected.
Calls for
Data
Mining
Automation
Hypothesis
Generation
Better Linking of
phenomenon
Data Mining
• Extraction of interesting (non-trivial, implicit, previously unknown and potentially useful) patterns or
knowledge from huge amount of data backed by complex computer based
decision systems ( A.I.,Business Intelligence, Machine learning)
• Spatial Data Mining is different from Classical Data Mining used with reference to RDBMS.
• Here attributes of the neighbours of some object of interest may have an
influence on the object and therefore have to be considered as well. The
explicit location and extension of spatial objects define implicit relations of
spatial neighbourhood (such as topological, distance and direction relations) which are used by
spatial data mining algorithms.
• Methods used in General Data Mining- Predictive (Classification, Regression)
and Descriptive (Clustering, Association)
• Methods used in determining Spatial Patterns-
 Location prediction model (to identify habitat of endangered species)
 Spatial clusters (crime hot-spots, cancer clusters)
 Spatial associations: co-locations (predator-prey species, symbiosis dental health and fluoride)
 Spatial outlier : discontinuities (bad traffic sensors on highways)
Spatial Associations : Co-locations
• Classical Association methodology : Given a set
of transactions, find rules that will predict the occurrence of
an item based on the occurrences of other items in the
transaction. Implication means co-occurrence, not causality!
• Now in case of Spatial Data, the transactions are not disjoint.
So classical approach is not ideal to be used here.
• Association vs Co-location
Focus of the Research
• To extract information from geospatial data and identify frequent co-
occurrence among Boolean spatial features like Draught, El Nino,
Substantial drop in vegetation etc. on Ecological Datasets.
• Approaches to discover co-location rules can be classified into 2 classes-
Spatial Statistics, Association Rules
• Spatial Statistics- Uses spatial correlation measures to characterize
different relations between spatial features ( chi-sq tests, Correlation
coefficients, regression models etc)
• Association Rules- Assumes that finite set of disjoint sets are given as input to the
Algorithm and they can find the most frequent items from the set and deduce
relationships on those items ( apriori algorithm and Independent Approaches-
Based on suitability of Reference Spatial Feature to mine all association rules of
nearby spatial features. )
Background of the Problem
• Given
1. A set of items T of K boolean spatial feature types T={f1,f2,…,fK)
2. A set of N instances P={p1…pN} each p, is a vector <instance-id, spatial feature type,
location>
3. A neighbor relation R over locations in S
4. Min prevalence threshold value, min conditional probability threshold
• Objectives
1. Completeness: it finds all spatial colocation rules that satisfy the threshold value(s)
2. Correctness: any spatial co-location rule found by the algorithm respects the threshold
value(s)
3. IO cost and CPU cost to generate the colocation rules should be acceptable
• Find
– Co-location rules with high prevalence and high conditional probability
• Constraints
– R is symmetric and reflexive
– Monotonic prevalence measure
– Conditional probability measures are specified by the event centric model
– Sparse data set, the number of instance of any spatial features is << cardinality (P)
Approaches of Modeling the
Co-location Rules Problem
• The reference feature centric model
– Is relevant to application domains focusing on a specific boolean
spatial feature
• The window centric model
– Is relevant to applications like mining, surveying and geology,
which focus on land-parcels
– One goal is to predict sets of spatial features likely to be
discovered in a land parcel given that some other features have
been found there
• The event-centric model
– Is relevant to applications that have interest in finding subsets of
spatial features likely to occur in a neighborhood around
instances of given subsets of event types
Reference feature centric model
• Let the reference feature be A
• The set of spatial predicates include one
predicate: close_to(a,b) which is true if and
only if b is a’s neighbour
Reference feature centric model
Association rule example:
is_type(i,A)∧∃ j is_type(j,B)∧close_to(j,i)
→∃ k is_type(k,C)∧close_to(k,i)
with 100% probability
Window centric model
Number of windows: 16
Number of windows containing A: 15
Number of windows containing A and B: 7
Association rule:
an instance of type A in a window → an
instance of type B in a window with
7/15=46.67% probability
Event centric model
Instances of type A: 4
Instances of type A that have some
instances of type B: 1
Conditional probability for the co-
location rule is:
spatial feature A at location l →
spatial feature type B in 9-neighbor
neighborhood is 25%
Concepts
Co-location Miner Algorithm
• Input
1. K boolean spatial instance and their instances
2. A symmetric and reflexive neighbor relation R
3. A user specified minimum threshold prevalence measure (min_prevalence)
4. A user specified minimum conditional probability (min_cond_prob)
• Output
– Co-location rule sets with participation index> min_prevalence and conditional
probability>min_cond_prob
• Method
1. Prevalent size 1 co-location set along with their table instances=P
2. Generate size 2 co-location rules
3. For size of co-locations in (2,3,…,K-1) do
4. Generate candidate prevalent co-locations using the generalized apriori_gen
algorithm
5. Generate table instances and prune based on neighborhood
6. Prune based on prevalence of co-locations
7. Generate co-location rules
8. end;
Co-location Miner Algorithm
Conclusions
• This paper gives a clear idea about co-
localization problem and its difference from
classical association problem.
• Co-location Miner algorithm was dealt with
easily with a comprehensive understanding
and detailed analysis.
• Future centric approach of the authors
promises development in field of Plane
Sweeping Algorithms.
Spatial Co-location Patterns: articles
• – S. Shekhar and Y. Huang, Discovering Spatial Co-location Patterns: A
Summary of Results, In Proc. of 7th Intl Symposium on Spatial and
Temporal Databases (SSTD), Springer-Verlag, Lecture Notes in Computer
Science, LNCS 2121, p.236 ff, July 2001
• – S. Shekhar and Y. Huang, Multi-resolution Co-location Miner: a New
Algorithm to Find Co-location Patterns from Spatial Datasets, SIAM
SDM02 Workshop on Mining Scientific Datasets, April 2002
• – Y. Huang, H. Xiong, S. Shekhar, and J. Pei, Mining Confident Co-location
Rules without A Support Threshold, in Proc. of 18th ACM Symposium on
Applied Computing (ACM SAC), March 2003
• – Y. Huang, S. Shekhar, and H. Xiong, Discovering Colocation Patterns from
Spatial Datasets: A General Approach, submitted to IEEE Transactions on
Knowledge and Data Engineering (TKDE), 2004

Mais conteúdo relacionado

Mais procurados

Data mining on Social Media
Data mining on Social MediaData mining on Social Media
Data mining on Social Mediahome
 
Remote Sensing Imagery & Artificial Intelligence
Remote Sensing Imagery & Artificial IntelligenceRemote Sensing Imagery & Artificial Intelligence
Remote Sensing Imagery & Artificial IntelligenceEsri Ireland
 
A Journey to the World of GIS
A Journey to the World of GISA Journey to the World of GIS
A Journey to the World of GISNishant Sinha
 
Introduction to ArcGIS 10.1
Introduction to ArcGIS 10.1Introduction to ArcGIS 10.1
Introduction to ArcGIS 10.1Claudio Montoni
 
Band ratioing presentation
Band ratioing presentationBand ratioing presentation
Band ratioing presentationsk asadul haque
 
WEB GIS AND WEB MAP.pptx
WEB GIS AND WEB MAP.pptxWEB GIS AND WEB MAP.pptx
WEB GIS AND WEB MAP.pptxAsim Pt
 
Spatial Data Infrastructure Best Practices with GeoNode
Spatial Data Infrastructure Best Practices with GeoNodeSpatial Data Infrastructure Best Practices with GeoNode
Spatial Data Infrastructure Best Practices with GeoNodeSebastian Benthall
 
Unit 1 - SNA QUESTION BANK
Unit 1 - SNA QUESTION BANKUnit 1 - SNA QUESTION BANK
Unit 1 - SNA QUESTION BANKUsha Rani M
 
Change detection using remote sensing and GIS
Change detection using remote sensing and GISChange detection using remote sensing and GIS
Change detection using remote sensing and GISTilok Chetri
 
Multi media Data mining
Multi media Data miningMulti media Data mining
Multi media Data mininghome
 
GIS Standards and Interoperability
GIS Standards and InteroperabilityGIS Standards and Interoperability
GIS Standards and InteroperabilityNasr Khashoggi
 
Spatial Data Mining
Spatial Data MiningSpatial Data Mining
Spatial Data MiningRashmi Bhat
 
DATA in GIS and DATA Query
DATA in GIS and DATA QueryDATA in GIS and DATA Query
DATA in GIS and DATA QueryKU Leuven
 

Mais procurados (20)

Spatial Database Systems
Spatial Database SystemsSpatial Database Systems
Spatial Database Systems
 
What is Machine Learning
What is Machine LearningWhat is Machine Learning
What is Machine Learning
 
Data mining on Social Media
Data mining on Social MediaData mining on Social Media
Data mining on Social Media
 
Remote Sensing Imagery & Artificial Intelligence
Remote Sensing Imagery & Artificial IntelligenceRemote Sensing Imagery & Artificial Intelligence
Remote Sensing Imagery & Artificial Intelligence
 
A Journey to the World of GIS
A Journey to the World of GISA Journey to the World of GIS
A Journey to the World of GIS
 
Introduction to ArcGIS 10.1
Introduction to ArcGIS 10.1Introduction to ArcGIS 10.1
Introduction to ArcGIS 10.1
 
Band ratioing presentation
Band ratioing presentationBand ratioing presentation
Band ratioing presentation
 
WEB GIS AND WEB MAP.pptx
WEB GIS AND WEB MAP.pptxWEB GIS AND WEB MAP.pptx
WEB GIS AND WEB MAP.pptx
 
Spatial Data Infrastructure Best Practices with GeoNode
Spatial Data Infrastructure Best Practices with GeoNodeSpatial Data Infrastructure Best Practices with GeoNode
Spatial Data Infrastructure Best Practices with GeoNode
 
Text MIning
Text MIningText MIning
Text MIning
 
Spatial Database
Spatial DatabaseSpatial Database
Spatial Database
 
Unit 1 - SNA QUESTION BANK
Unit 1 - SNA QUESTION BANKUnit 1 - SNA QUESTION BANK
Unit 1 - SNA QUESTION BANK
 
Change detection using remote sensing and GIS
Change detection using remote sensing and GISChange detection using remote sensing and GIS
Change detection using remote sensing and GIS
 
Multi media Data mining
Multi media Data miningMulti media Data mining
Multi media Data mining
 
GIS Standards and Interoperability
GIS Standards and InteroperabilityGIS Standards and Interoperability
GIS Standards and Interoperability
 
Spatial Data Mining
Spatial Data MiningSpatial Data Mining
Spatial Data Mining
 
Data mining
Data miningData mining
Data mining
 
Geodatabases
GeodatabasesGeodatabases
Geodatabases
 
Social Data Mining
Social Data MiningSocial Data Mining
Social Data Mining
 
DATA in GIS and DATA Query
DATA in GIS and DATA QueryDATA in GIS and DATA Query
DATA in GIS and DATA Query
 

Destaque

ppt spatial data
ppt spatial datappt spatial data
ppt spatial dataRahul Kumar
 
Spatial vs non spatial
Spatial vs non spatialSpatial vs non spatial
Spatial vs non spatialSumant Diwakar
 
The Do's and Don'ts of Data Mining
The Do's and Don'ts of Data MiningThe Do's and Don'ts of Data Mining
The Do's and Don'ts of Data MiningSalford Systems
 
Web mining slides
Web mining slidesWeb mining slides
Web mining slidesmahavir_a
 
Classification with Naive Bayes
Classification with Naive BayesClassification with Naive Bayes
Classification with Naive BayesJosh Patterson
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningRahul Jain
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningLior Rokach
 
Crop production ppt
Crop production pptCrop production ppt
Crop production pptvaggyaggy
 
Introduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningIntroduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningLars Marius Garshol
 

Destaque (16)

ppt spatial data
ppt spatial datappt spatial data
ppt spatial data
 
Temporal data mining
Temporal data miningTemporal data mining
Temporal data mining
 
Spatial vs non spatial
Spatial vs non spatialSpatial vs non spatial
Spatial vs non spatial
 
The Do's and Don'ts of Data Mining
The Do's and Don'ts of Data MiningThe Do's and Don'ts of Data Mining
The Do's and Don'ts of Data Mining
 
WEB MINING.
WEB MINING.WEB MINING.
WEB MINING.
 
Textmining Introduction
Textmining IntroductionTextmining Introduction
Textmining Introduction
 
Web mining slides
Web mining slidesWeb mining slides
Web mining slides
 
Classification with Naive Bayes
Classification with Naive BayesClassification with Naive Bayes
Classification with Naive Bayes
 
GIS Data Types
GIS Data TypesGIS Data Types
GIS Data Types
 
Machine Learning for Dummies
Machine Learning for DummiesMachine Learning for Dummies
Machine Learning for Dummies
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Crop production ppt
Crop production pptCrop production ppt
Crop production ppt
 
Data mining
Data miningData mining
Data mining
 
Introduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningIntroduction to Big Data/Machine Learning
Introduction to Big Data/Machine Learning
 
SlideShare 101
SlideShare 101SlideShare 101
SlideShare 101
 

Semelhante a Spatial Data Mining : Seminar

Spatial co location pattern mining
Spatial co location pattern miningSpatial co location pattern mining
Spatial co location pattern miningSeung Kwan Kim
 
Updating Ecological Niche Modeling Methodologies
Updating Ecological Niche Modeling MethodologiesUpdating Ecological Niche Modeling Methodologies
Updating Ecological Niche Modeling MethodologiesTown Peterson
 
D1T3 enm workflows updated
D1T3 enm workflows updatedD1T3 enm workflows updated
D1T3 enm workflows updatedTown Peterson
 
Moving object detection in complex scene
Moving object detection in complex sceneMoving object detection in complex scene
Moving object detection in complex sceneKumar Mayank
 
Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics OverviewTony Fast
 
Rank based similarity search reducing the dimensional dependence
Rank based similarity search reducing the dimensional dependenceRank based similarity search reducing the dimensional dependence
Rank based similarity search reducing the dimensional dependenceredpel dot com
 
UHDMML.pps
UHDMML.ppsUHDMML.pps
UHDMML.ppsbutest
 
report2.doc
report2.docreport2.doc
report2.docbutest
 
Graph Models for Deep Learning
Graph Models for Deep LearningGraph Models for Deep Learning
Graph Models for Deep LearningExperfy
 
Spatial data analysis 1
Spatial data analysis 1Spatial data analysis 1
Spatial data analysis 1Johan Blomme
 
Seminar on gis analysis functions
Seminar on gis analysis functionsSeminar on gis analysis functions
Seminar on gis analysis functionsPramoda Raj
 
COLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACH
COLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACHCOLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACH
COLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACHIJCI JOURNAL
 
Feature selection and microarray data
Feature selection and microarray dataFeature selection and microarray data
Feature selection and microarray dataGianluca Bontempi
 
Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization CS, NcState
 
Aitf 2014 pem_introduction_presentation_feb28_ram_version2
Aitf 2014 pem_introduction_presentation_feb28_ram_version2Aitf 2014 pem_introduction_presentation_feb28_ram_version2
Aitf 2014 pem_introduction_presentation_feb28_ram_version2Bob MacMillan
 
Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...
Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...
Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...Wilfried Elmenreich
 

Semelhante a Spatial Data Mining : Seminar (20)

Spatial co location pattern mining
Spatial co location pattern miningSpatial co location pattern mining
Spatial co location pattern mining
 
Updating Ecological Niche Modeling Methodologies
Updating Ecological Niche Modeling MethodologiesUpdating Ecological Niche Modeling Methodologies
Updating Ecological Niche Modeling Methodologies
 
D1T3 enm workflows updated
D1T3 enm workflows updatedD1T3 enm workflows updated
D1T3 enm workflows updated
 
DM_clustering.ppt
DM_clustering.pptDM_clustering.ppt
DM_clustering.ppt
 
Moving object detection in complex scene
Moving object detection in complex sceneMoving object detection in complex scene
Moving object detection in complex scene
 
Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics Overview
 
Rank based similarity search reducing the dimensional dependence
Rank based similarity search reducing the dimensional dependenceRank based similarity search reducing the dimensional dependence
Rank based similarity search reducing the dimensional dependence
 
UHDMML.pps
UHDMML.ppsUHDMML.pps
UHDMML.pps
 
report2.doc
report2.docreport2.doc
report2.doc
 
Graph Models for Deep Learning
Graph Models for Deep LearningGraph Models for Deep Learning
Graph Models for Deep Learning
 
Spatial data analysis 1
Spatial data analysis 1Spatial data analysis 1
Spatial data analysis 1
 
Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...
Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...
Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...
 
Seminar on gis analysis functions
Seminar on gis analysis functionsSeminar on gis analysis functions
Seminar on gis analysis functions
 
COLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACH
COLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACHCOLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACH
COLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACH
 
Fahroo - Optimization and Discrete Mathematics - Spring Review 2013
Fahroo - Optimization and Discrete Mathematics - Spring Review 2013Fahroo - Optimization and Discrete Mathematics - Spring Review 2013
Fahroo - Optimization and Discrete Mathematics - Spring Review 2013
 
UNIT_V_Cluster Analysis.pptx
UNIT_V_Cluster Analysis.pptxUNIT_V_Cluster Analysis.pptx
UNIT_V_Cluster Analysis.pptx
 
Feature selection and microarray data
Feature selection and microarray dataFeature selection and microarray data
Feature selection and microarray data
 
Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization 
 
Aitf 2014 pem_introduction_presentation_feb28_ram_version2
Aitf 2014 pem_introduction_presentation_feb28_ram_version2Aitf 2014 pem_introduction_presentation_feb28_ram_version2
Aitf 2014 pem_introduction_presentation_feb28_ram_version2
 
Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...
Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...
Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...
 

Mais de Ipsit Dash

Land Reforms : An overview
Land Reforms : An overviewLand Reforms : An overview
Land Reforms : An overviewIpsit Dash
 
Water Sector Debate
Water Sector DebateWater Sector Debate
Water Sector DebateIpsit Dash
 
Tirupur Water Supply and Sanitation
Tirupur Water Supply and SanitationTirupur Water Supply and Sanitation
Tirupur Water Supply and SanitationIpsit Dash
 
Change Detection Dubai
Change Detection DubaiChange Detection Dubai
Change Detection DubaiIpsit Dash
 
Average- An android project
Average- An android projectAverage- An android project
Average- An android projectIpsit Dash
 
Implementation of INS-GPS
Implementation of INS-GPSImplementation of INS-GPS
Implementation of INS-GPSIpsit Dash
 

Mais de Ipsit Dash (7)

Land Reforms : An overview
Land Reforms : An overviewLand Reforms : An overview
Land Reforms : An overview
 
Water Sector Debate
Water Sector DebateWater Sector Debate
Water Sector Debate
 
Tirupur Water Supply and Sanitation
Tirupur Water Supply and SanitationTirupur Water Supply and Sanitation
Tirupur Water Supply and Sanitation
 
Change Detection Dubai
Change Detection DubaiChange Detection Dubai
Change Detection Dubai
 
Average- An android project
Average- An android projectAverage- An android project
Average- An android project
 
Dark diamond
Dark diamondDark diamond
Dark diamond
 
Implementation of INS-GPS
Implementation of INS-GPSImplementation of INS-GPS
Implementation of INS-GPS
 

Último

Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...PsychoTech Services
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 

Último (20)

Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 

Spatial Data Mining : Seminar

  • 1. Discovering Spatial Co-location Patterns : A summary of Results Shashi Sekhar, Yan Huang Dept of Computer Sciences, University of Minnesota, USA AG 2414 Spatial Analysis Seminar by- Adrian C Prelipcean Ipsit Dash
  • 2. Outline • Introduction • Focus of the Research • Background of the Problem • Approaches of Modeling the Co-location problem • Co-location Miner Algorithm • Conclusions
  • 3. Data Data Data!!!! • Data are being collected continuously for innumerable phenomenon. • Business applications • Scientific Applications • National Security Purposes It is impossible to analyze each strand of data collected. Calls for Data Mining Automation Hypothesis Generation Better Linking of phenomenon
  • 4. Data Mining • Extraction of interesting (non-trivial, implicit, previously unknown and potentially useful) patterns or knowledge from huge amount of data backed by complex computer based decision systems ( A.I.,Business Intelligence, Machine learning) • Spatial Data Mining is different from Classical Data Mining used with reference to RDBMS. • Here attributes of the neighbours of some object of interest may have an influence on the object and therefore have to be considered as well. The explicit location and extension of spatial objects define implicit relations of spatial neighbourhood (such as topological, distance and direction relations) which are used by spatial data mining algorithms. • Methods used in General Data Mining- Predictive (Classification, Regression) and Descriptive (Clustering, Association) • Methods used in determining Spatial Patterns-  Location prediction model (to identify habitat of endangered species)  Spatial clusters (crime hot-spots, cancer clusters)  Spatial associations: co-locations (predator-prey species, symbiosis dental health and fluoride)  Spatial outlier : discontinuities (bad traffic sensors on highways)
  • 5. Spatial Associations : Co-locations • Classical Association methodology : Given a set of transactions, find rules that will predict the occurrence of an item based on the occurrences of other items in the transaction. Implication means co-occurrence, not causality! • Now in case of Spatial Data, the transactions are not disjoint. So classical approach is not ideal to be used here. • Association vs Co-location
  • 6. Focus of the Research • To extract information from geospatial data and identify frequent co- occurrence among Boolean spatial features like Draught, El Nino, Substantial drop in vegetation etc. on Ecological Datasets. • Approaches to discover co-location rules can be classified into 2 classes- Spatial Statistics, Association Rules • Spatial Statistics- Uses spatial correlation measures to characterize different relations between spatial features ( chi-sq tests, Correlation coefficients, regression models etc) • Association Rules- Assumes that finite set of disjoint sets are given as input to the Algorithm and they can find the most frequent items from the set and deduce relationships on those items ( apriori algorithm and Independent Approaches- Based on suitability of Reference Spatial Feature to mine all association rules of nearby spatial features. )
  • 7. Background of the Problem • Given 1. A set of items T of K boolean spatial feature types T={f1,f2,…,fK) 2. A set of N instances P={p1…pN} each p, is a vector <instance-id, spatial feature type, location> 3. A neighbor relation R over locations in S 4. Min prevalence threshold value, min conditional probability threshold • Objectives 1. Completeness: it finds all spatial colocation rules that satisfy the threshold value(s) 2. Correctness: any spatial co-location rule found by the algorithm respects the threshold value(s) 3. IO cost and CPU cost to generate the colocation rules should be acceptable • Find – Co-location rules with high prevalence and high conditional probability • Constraints – R is symmetric and reflexive – Monotonic prevalence measure – Conditional probability measures are specified by the event centric model – Sparse data set, the number of instance of any spatial features is << cardinality (P)
  • 8. Approaches of Modeling the Co-location Rules Problem • The reference feature centric model – Is relevant to application domains focusing on a specific boolean spatial feature • The window centric model – Is relevant to applications like mining, surveying and geology, which focus on land-parcels – One goal is to predict sets of spatial features likely to be discovered in a land parcel given that some other features have been found there • The event-centric model – Is relevant to applications that have interest in finding subsets of spatial features likely to occur in a neighborhood around instances of given subsets of event types
  • 9. Reference feature centric model • Let the reference feature be A • The set of spatial predicates include one predicate: close_to(a,b) which is true if and only if b is a’s neighbour
  • 10. Reference feature centric model Association rule example: is_type(i,A)∧∃ j is_type(j,B)∧close_to(j,i) →∃ k is_type(k,C)∧close_to(k,i) with 100% probability
  • 11. Window centric model Number of windows: 16 Number of windows containing A: 15 Number of windows containing A and B: 7 Association rule: an instance of type A in a window → an instance of type B in a window with 7/15=46.67% probability
  • 12. Event centric model Instances of type A: 4 Instances of type A that have some instances of type B: 1 Conditional probability for the co- location rule is: spatial feature A at location l → spatial feature type B in 9-neighbor neighborhood is 25%
  • 14. Co-location Miner Algorithm • Input 1. K boolean spatial instance and their instances 2. A symmetric and reflexive neighbor relation R 3. A user specified minimum threshold prevalence measure (min_prevalence) 4. A user specified minimum conditional probability (min_cond_prob) • Output – Co-location rule sets with participation index> min_prevalence and conditional probability>min_cond_prob • Method 1. Prevalent size 1 co-location set along with their table instances=P 2. Generate size 2 co-location rules 3. For size of co-locations in (2,3,…,K-1) do 4. Generate candidate prevalent co-locations using the generalized apriori_gen algorithm 5. Generate table instances and prune based on neighborhood 6. Prune based on prevalence of co-locations 7. Generate co-location rules 8. end;
  • 16.
  • 17. Conclusions • This paper gives a clear idea about co- localization problem and its difference from classical association problem. • Co-location Miner algorithm was dealt with easily with a comprehensive understanding and detailed analysis. • Future centric approach of the authors promises development in field of Plane Sweeping Algorithms.
  • 18. Spatial Co-location Patterns: articles • – S. Shekhar and Y. Huang, Discovering Spatial Co-location Patterns: A Summary of Results, In Proc. of 7th Intl Symposium on Spatial and Temporal Databases (SSTD), Springer-Verlag, Lecture Notes in Computer Science, LNCS 2121, p.236 ff, July 2001 • – S. Shekhar and Y. Huang, Multi-resolution Co-location Miner: a New Algorithm to Find Co-location Patterns from Spatial Datasets, SIAM SDM02 Workshop on Mining Scientific Datasets, April 2002 • – Y. Huang, H. Xiong, S. Shekhar, and J. Pei, Mining Confident Co-location Rules without A Support Threshold, in Proc. of 18th ACM Symposium on Applied Computing (ACM SAC), March 2003 • – Y. Huang, S. Shekhar, and H. Xiong, Discovering Colocation Patterns from Spatial Datasets: A General Approach, submitted to IEEE Transactions on Knowledge and Data Engineering (TKDE), 2004