Sequential Patterns
  Wen-Chih Peng (彭文志)
  Dept. of Computer Science, National Chiao-Tung University
Sequence Data
  Sequence Database:
  Object  Timestamp  Events
  A       10         2, 3, 5
  A       20         6, 1
  A       23         1
  B       11         4, 5, 6
  B       17         2
  B       21         7, 8, 1, 2
  B       28         1, 6
  C       14         1, 8, 7
Examples of Sequence Data
  Sequence Database | Sequence | Element (Transaction) | Event (Item)
  Customer | Purchase history of a given customer | A set of items bought by a customer at time t | Books, dairy products, CDs, etc.
  Web data | Browsing activity of a particular Web visitor | A collection of files viewed by a Web visitor after a single mouse click | Home page, index page, contact info, etc.
  Event data | History of events generated by a given sensor | Events triggered by a sensor at time t | Types of alarms generated by sensors
  Genome sequences | DNA sequence of a particular species | An element of the DNA sequence | Bases A, T, G, C
  [Figure: a sequence E1 E2 E1 E3 E2 E3 E4 E2, with labels marking an element (transaction) and an event (item)]
Formal Definition of a Sequence
Examples of Sequence
Formal Definition of a Subsequence
  Data sequence             Subsequence      Contain?
  < {2,4} {3,5,6} {8} >     < {2} {3,5} >    Yes
  < {1,2} {3,4} >           < {1} {2} >      No
  < {2,4} {2,4} {2,5} >     < {2} {4} >      Yes
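
The containment table above can be checked mechanically. The slide's formal definition did not survive extraction, so the sketch below assumes the usual one: every element of the subsequence must be a subset of a distinct element of the data sequence, with order preserved. The function name `contains` is illustrative only.

```python
def contains(data_seq, sub_seq):
    """Return True if sub_seq is a subsequence of data_seq.

    Both arguments are lists of sets (one set per element).
    Assumed definition: each element of sub_seq is a subset of a
    distinct element of data_seq, and the matches keep their order.
    """
    pos = 0
    for element in sub_seq:
        # Greedily advance to the earliest data element that covers `element`.
        while pos < len(data_seq) and not set(element) <= set(data_seq[pos]):
            pos += 1
        if pos == len(data_seq):
            return False
        pos += 1  # the next subsequence element must match strictly later
    return True


# The three rows of the table above.
examples = [
    ([{2, 4}, {3, 5, 6}, {8}], [{2}, {3, 5}]),
    ([{1, 2}, {3, 4}], [{1}, {2}]),
    ([{2, 4}, {2, 4}, {2, 5}], [{2}, {4}]),
]
results = [contains(data, sub) for data, sub in examples]
print(results)  # [True, False, True]
```

Matching each element at the earliest possible position is enough here: if any valid embedding exists, the greedy one does too.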
What Is Sequential Pattern Mining?
  A sequence database:
  SID  Sequence
  10   <a(abc)(ac)d(cf)>
  20   <(ad)c(bc)(ae)>
  30   <(ef)(ab)(df)cb>
  40   <eg(af)cbc>
  A sequence: <(ef)(ab)(df)cb>
  An element may contain a set of items. Items within an element are unordered, and we list them alphabetically.
  <a(bc)dc> is a subsequence of <a(abc)(ac)d(cf)>.
  Given support threshold min_sup = 2, <(ab)c> is a sequential pattern.
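
As a rough check of the claim that <(ab)c> is a sequential pattern at min_sup = 2, the sketch below counts how many sequences of the example database contain it. The encoding (each element as a string of item letters) and the names `db`, `contains` and `support` are assumptions made for this illustration.

```python
# Example database: each sequence is a list of elements,
# each element a string of item letters.
db = {
    10: ["a", "abc", "ac", "d", "cf"],
    20: ["ad", "c", "bc", "ae"],
    30: ["ef", "ab", "df", "c", "b"],
    40: ["e", "g", "af", "c", "b", "c"],
}


def contains(data_seq, pattern):
    """True if every pattern element is a subset of a later-and-later data element."""
    remaining = iter(data_seq)
    # any() consumes the shared iterator up to and including the first match,
    # so the next pattern element can only match a strictly later element.
    return all(any(set(p) <= set(e) for e in remaining) for p in pattern)


def support(pattern):
    """Number of database sequences that contain the pattern."""
    return sum(contains(seq, pattern) for seq in db.values())


min_sup = 2
print(contains(db[10], ["a", "bc", "d", "c"]))  # True
print(support(["ab", "c"]))                     # 2
```

Only sequences 10 and 30 have an element holding both a and b followed by a later c, which gives the support of 2.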
Sequential Pattern Mining: Definition
Extracting Sequential Patterns
Sequential Pattern Mining: Challenge
Mining Sequential Patterns
Sequential Patterns (cont’d)
Algorithm
Algorithm (cont’d)
Original Database
Sort Phase
Customer-Sequence Version  of the Database
Large itemset Phase (support:2)
Transformation Phase
Sequence Phase
  Large 3-sequences:
  Sequence   Support
  <1 2 3>    2
  <1 2 4>    2
  <1 3 4>    3
  <1 3 5>    2
  <2 3 4>    2
  Candidate 4-sequences:
  <1 2 3 4>
  <1 2 4 3>
  <1 3 4 5>
  <1 3 5 4>
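
The four 4-sequences listed above are consistent with an Apriori-style join of the large 3-sequences. The slide text describing the step was lost, so the following is a sketch under two assumptions: two sequences are joined when they agree on everything but their last item, and a candidate is pruned when any of its 3-subsequences is not large. The name `generate_candidates` is hypothetical.

```python
from itertools import combinations


def generate_candidates(large_prev):
    """Join and prune steps of Apriori-style candidate generation.

    large_prev: iterable of equal-length tuples (the large (k-1)-sequences).
    Returns (joined, pruned) as sorted lists of k-tuples.
    """
    large_prev = set(large_prev)
    # Join: p and q share the same first k-2 items; append q's last item to p.
    joined = {
        p + (q[-1],)
        for p in large_prev
        for q in large_prev
        if p != q and p[:-1] == q[:-1]
    }
    # Prune: every (k-1)-subsequence of a candidate must itself be large.
    pruned = [
        c for c in joined
        if all(sub in large_prev for sub in combinations(c, len(c) - 1))
    ]
    return sorted(joined), sorted(pruned)


large_3 = [(1, 2, 3), (1, 2, 4), (1, 3, 4), (1, 3, 5), (2, 3, 4)]
joined, pruned = generate_candidates(large_3)
print(joined)  # [(1, 2, 3, 4), (1, 2, 4, 3), (1, 3, 4, 5), (1, 3, 5, 4)]
print(pruned)  # [(1, 2, 3, 4)]
```

Under these assumptions only <1 2 3 4> survives pruning; for example, <1 2 4 3> is dropped because <2 4 3> is not a large 3-sequence.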
Customer Sequences
  <{1 5} {2} {3} {4}>
  <{1} {3} {4} {3 5}>
  <{1} {2} {3} {4}>
  <{1} {3} {5}>
  <{4} {5}>
Large 1-Sequences
  Sequence  Support
  <1>       4
  <2>       2
  <3>       4
  <4>       4
  <5>       4
Large 2-Sequences
  Sequence  Support
  <1 2>     2
  <1 3>     4
  <1 4>     3
  <1 5>     2
  <2 3>     2
  <2 4>     2
  <3 4>     3
  <3 5>     2
  <4 5>     2
Large 3-Sequences
  Sequence  Support
  <1 2 3>   2
  <1 2 4>   2
  <1 3 4>   3
  <1 3 5>   2
  <2 3 4>   2
Large 4-Sequences
  Sequence   Support
  <1 2 3 4>  2
Maximal Large Sequences
  Sequence   Support
  <1 2 3 4>  2
  <1 3 5>    2
  <4 5>      2
Maximal Sequence
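
The maximal large sequences in the preceding table can be reproduced by discarding every large sequence that is contained in another large sequence. This is a minimal sketch that treats each large sequence as a flat tuple of items, as the tables do; `is_subsequence` is an illustrative helper, not something named in the slides.

```python
def is_subsequence(short, long):
    """True if the items of `short` appear in `long` in the same order."""
    remaining = iter(long)
    # `item in remaining` consumes the iterator up to the first match.
    return all(item in remaining for item in short)


large = [
    (1,), (2,), (3,), (4,), (5,),
    (1, 2), (1, 3), (1, 4), (1, 5), (2, 3), (2, 4), (3, 4), (3, 5), (4, 5),
    (1, 2, 3), (1, 2, 4), (1, 3, 4), (1, 3, 5), (2, 3, 4),
    (1, 2, 3, 4),
]

# A large sequence is maximal if no other large sequence contains it.
maximal = [
    s for s in large
    if not any(s != t and is_subsequence(s, t) for t in large)
]
print(maximal)  # [(4, 5), (1, 3, 5), (1, 2, 3, 4)]
```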
Sequential Patterns With support > 25%
  The answer set:
  < (30) (90) >
  < (30) (40 70) >
Notes
Rule Discovery from Time Sequences
Episode Mining
  [Figure: event labels E, B, A, F, B, A, C]
Example of Episode Mining
  [Figure: event sequence D C A B D A B C A D C A B D A on a time axis from 0 to 150]
Event Sequence
Sliding Window
  [Figure: event sequence D C A B D A B C on a time axis from 0 to 90]
Frequency of an Episode
Find Frequent Episodes
FreeSpan
Example of FreeSpan
  Example database (min support = 2):
  Sequence id  Sequence
  10           <a(abc)(ac)d(cf)>
  20           <(ad)c(bc)(ae)>
  30           <(ef)(ab)(df)cb>
  40           <eg(af)cbc>
  f_list = a:4, b:4, c:4, d:3, e:3, f:3
  g is deleted because the support of g is less than 2.
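
The f_list above can be recomputed by counting, for each item, the number of sequences in which it occurs at least once and then dropping items below the minimum support. The sketch below assumes the same string encoding of elements as the slide's notation; the variable names are illustrative.

```python
from collections import Counter

# Example database: each element is a string of item letters.
db = {
    10: ["a", "abc", "ac", "d", "cf"],
    20: ["ad", "c", "bc", "ae"],
    30: ["ef", "ab", "df", "c", "b"],
    40: ["e", "g", "af", "c", "b", "c"],
}
min_sup = 2

# Count each item once per sequence, however often it repeats inside it.
counts = Counter()
for seq in db.values():
    counts.update({item for element in seq for item in element})

# Keep frequent items, ordered by descending support, then alphabetically.
f_list = sorted(
    ((item, n) for item, n in counts.items() if n >= min_sup),
    key=lambda pair: (-pair[1], pair[0]),
)
print(f_list)
# [('a', 4), ('b', 4), ('c', 4), ('d', 3), ('e', 3), ('f', 3)]
```

Item g occurs only in sequence 40, so its count of 1 falls below min_sup and it is left out.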
Example of FreeSpan (cont’d)
  Sequence id  Sequence            {a}-projected database
  10           <a(abc)(ac)d(cf)>   <aaa>
  20           <(ad)c(bc)(ae)>     <aa>
  30           <(ef)(ab)(df)cb>    <a>
  40           <e(af)cbc>          <a>
  Frequent patterns: <a>, <aa>
Example of FreeSpan (cont’d)
  Sequence id  Sequence            {b}-projected database
  10           <a(abc)(ac)d(cf)>   <a(ab)a>
  20           <(ad)c(bc)(ae)>     <aba>
  30           <(ef)(ab)(df)cb>    <(ab)b>
  40           <e(af)cbc>          <ab>
  Frequent patterns: <b>, <ab>, <ba>, <(ab)>
Example of FreeSpan (cont’d)
  Sequence id  Sequence            {c}-projected database
  10           <a(abc)(ac)d(cf)>   <a(abc)(ac)c>
  20           <(ad)c(bc)(ae)>     <ac(bc)a>
  30           <(ef)(ab)(df)cb>    <(ab)cb>
  40           <e(af)cbc>          <acbc>
  Frequent patterns: <c>, <ac>, <bc>, <(bc)>, <ca>, <cb>, <(ab)c>, <acc>, <acb>
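
The projected databases shown in these three examples can be reproduced by keeping, in each sequence, only the items up to a given position in the f_list (a, then a and b, then a, b and c) and dropping elements that become empty. The explanatory text of the slides was lost, so this is only one reading of the examples, not the full FreeSpan algorithm; `restrict` and `fmt` are hypothetical helper names.

```python
# Example database after removing the infrequent item g.
db = {
    10: ["a", "abc", "ac", "d", "cf"],
    20: ["ad", "c", "bc", "ae"],
    30: ["ef", "ab", "df", "c", "b"],
    40: ["e", "af", "c", "b", "c"],
}


def restrict(seq, items):
    """Keep only the given items in each element; drop elements left empty."""
    kept = ("".join(ch for ch in element if ch in items) for element in seq)
    return [element for element in kept if element]


def fmt(seq):
    """Render a sequence in the slide notation, e.g. <a(ab)a>."""
    return "<" + "".join(e if len(e) == 1 else f"({e})" for e in seq) + ">"


projected = {
    items: {sid: fmt(restrict(seq, items)) for sid, seq in db.items()}
    for items in ("a", "ab", "abc")
}
for items, table in projected.items():
    print(items, table)
```

Running it prints, for each item set, the same four projected sequences that the corresponding table lists.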
PrefixSpan
PrefixSpan - Concepts
PrefixSpan - Concepts
Example of PrefixSpan
  Sequence id  Sequence            <a>-projected database
  10           <a(abc)(ac)d(cf)>   <(abc)(ac)d(cf)>
  20           <(ad)c(bc)(ae)>     <(_d)c(bc)(ae)>
  30           <(ef)(ab)(df)cb>    <(_b)(df)cb>
  40           <e(af)cbc>          <(_f)cbc>
  By scanning the <a>-projected database once, all the length-2 sequential patterns having prefix <a> can be found:
  <aa>:2  <ab>:4  <(ab)>:2  <ac>:4  <ad>:2  <af>:2
  Recursively, patterns with prefix <a> can be partitioned into 6 subsets.
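
The <a>-projected database above can be derived with a short routine: find the first element containing the prefix item, keep the items that follow it inside that element (written with a leading "_"), and append the rest of the sequence. The sketch handles single-item prefixes only and assumes elements are strings of alphabetically ordered items; `project` and `fmt` are names chosen for this illustration.

```python
# Example database after removing the infrequent item g.
db = {
    10: ["a", "abc", "ac", "d", "cf"],
    20: ["ad", "c", "bc", "ae"],
    30: ["ef", "ab", "df", "c", "b"],
    40: ["e", "af", "c", "b", "c"],
}


def project(seq, item):
    """Suffix of seq with respect to the first occurrence of `item`.

    Returns None if the item does not occur in the sequence.
    """
    for i, element in enumerate(seq):
        if item in element:
            # Items after `item` in the same element form a partial element.
            rest = element[element.index(item) + 1:]
            return (["_" + rest] if rest else []) + seq[i + 1:]
    return None


def fmt(seq):
    """Render a sequence in the slide notation, e.g. <(_d)c(bc)(ae)>."""
    return "<" + "".join(e if len(e) == 1 else f"({e})" for e in seq) + ">"


a_projected = {sid: fmt(project(seq, "a")) for sid, seq in db.items()}
for sid, suffix in a_projected.items():
    print(sid, suffix)
# 10 <(abc)(ac)d(cf)>
# 20 <(_d)c(bc)(ae)>
# 30 <(_b)(df)cb>
# 40 <(_f)cbc>
```

Because each projected sequence is just a suffix of the original, the database shrinks with every recursive projection.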
Example of PrefixSpan (cont’d)
  <a>-projected database:
  Sequence id  Sequence
  10           <(abc)(ac)d(cf)>
  20           <(_d)c(bc)(ae)>
  30           <(_b)(df)cb>
  40           <(_f)cbc>
  <aa>-projected database:
  Sequence id  Sequence
  10           <(_bc)(ac)d(cf)>
  20           <(_e)>
  <ab>-projected database:
  Sequence id  Sequence
  10           <(_c)(ac)d(cf)>
  20           <(_c)(ae)>
  40           <c>
  Sequential patterns of <ab>-projected db: <(_c)>, <(_c)a>, <a>, <c>
Example of PrefixSpan (cont’d)
  Sequence id  Sequence            <b>-projected database
  10           <a(abc)(ac)d(cf)>   <(_c)(ac)d(cf)>
  20           <(ad)c(bc)(ae)>     <(_c)(ae)>
  30           <(ef)(ab)(df)cb>    <(df)cb>
  40           <e(af)cbc>          <c>
  Sequential patterns: <b>, <ba>, <bc>, <(bc)>, <(bc)a>, <bd>, <bdc>, <bf>
