SlideShare uma empresa Scribd logo
1 de 19
I- Extended Databases
Key words:Key words:
Knowledge Discovery inKnowledge Discovery in
Databases (KDD).Databases (KDD).
Data Mining (DM).Data Mining (DM).
Data Warehousing (DW) .Data Warehousing (DW) .
Query Optimization (QO).Query Optimization (QO).
Assistant Professor,
Computer Science Department,
Faculty of Science,
Al-Tahadi University,
P.O. Box 727,
Sirt ,Libya,
Dr. Zakaria Suliman ZubiDr. Zakaria Suliman Zubi
ByBy
3
I- Extended DatabasesI- Extended Databases
 Abstract .
 Introduction of the Indicative Databases .
 I-Extended Databases (IE) motivation.
 I-Extended Databases (IE) and KDD
processes .
 Example .
 Conclusions and Remarks .
 Questions.
4
AbstractAbstract (1)
 How we can handle generalizations in a very large
database using Association Rules (AR), and
inclusion Functional Dependencies (FD)?
 The answer is Inductive database.
 I- Extended database has a similar property to
inductive databases.
 I- Extended database contain exceedingly defined
generalizations about the data .
5
AbstractAbstract (2)
 It can be used in the process of Data Mining.
 It was proposed in ODBC_KDD(2) Model.
 The query will uses normal database terminology.
 The main aim of I-Extended database is to interact
with a spatial Data Mining query called Knowledge
Discovery Query Language (KDQL) described in
[22].
 The KDQL was demonstrated and introduced as a
query in the ODBC_KDD (2) model in [22].
6
Introduction of the Indicative DatabasesIntroduction of the Indicative Databases
 KDD process, contains several steps: understanding the domain,
preparing the data set, discovering patterns (i.e., computing a
theory), post-processing of discovered patterns, and putting the
results into use.
 KDD, we need a query language that not only enables the user
to select subsets of the data, but also to specify DM tasks and
select patterns from the corresponding theories.
 Considering the KDQL rules operator which was described in [
21] as a possible querying language on mining association rules
for i-extended database.
 Query should be an object of a similar type than its arguments.
7
The model was introduced at the Institute of Mathematics andThe model was introduced at the Institute of Mathematics and
Informatics at Debrecen University, Debrecen, Hungary 2002.Informatics at Debrecen University, Debrecen, Hungary 2002.
I-Extended Databases Motivation
Gateway
8
 I-Extended database is a pair R = (R, (PR, e, V))
 Where :
–R is a database schema.
–PRis a collection of patterns.
–V is a set of result values .
– e is the evaluation function that defines pattern semantics.
 This function maps each pair (r, θi) to an element of V, where
r is a database over R and θi P∊ R is a pattern.
 An instance of the schema, i-extended database (r, s) over the
schema R consists of a database r over the schema R and a
subset s ⊆ PR.
I-Extended Databases MotivationI-Extended Databases Motivation
continuecontinue
9
 Example :
If the patterns are Boolean formulae about the database, V is
{true, false},
And the evaluation function e(r, θ) has value true
iff the formula θ is true about r.
In practice, a user might be interested in selecting from the
intentionally defined collection of all Boolean formulas, the
formulas which are true or the formulas which are false.
I-Extended Databases MotivationI-Extended Databases Motivation
continuecontinue
10
I-Extended Databases MotivationI-Extended Databases Motivation
continuecontinue
 I-Extended Database : Is a database that in
addition to data also contain exceedingly defined
generalizations about the data. First we illustrate
the Association Rules, and then we Generalize the
approach and point out key issues for query
evaluation in general.
 I-Extended database is a database that has similar
properties that are in inductive database that
shows how it can be used throughout the whole
process of DM due to the closure property of the
framework.
11
I-Extended Databases MotivationI-Extended Databases Motivation
continuecontinue
 The aim of I-Extended Database is as follow:The aim of I-Extended Database is as follow:
– I-extended database consists of a normal database
associated to a subset of patterns from a class of
patterns, and an evaluation function that tells how the
patterns occur in the data.
– I-extended database can be queried (in principle) just
by using normal relational algebra or SQL, with the
added property of being able to refer to the values of
the evaluation function on the patterns.
– Modeling KDD processes as a sequence of queries on
i-extended database gives rise to chances for
reasoning and optimizing these processes
12
I-Extended Databases (IE) and KDD
processes
 KDD consists of several steps one of these steps is Data Mining.
 In Data Mining process we are concerned with unique class of
patterns for a real life mining processes presented in a dynamic
nature of knowledge acquisition scenario.
 These interesting patterns will be presented in I-Extended
Databases based on there captured frequency, confidence and
support values.
 Knowledge gathered often affects the search process, giving
rise to new goals in addition to the original ones.
13
I-Extended Databases (IE) and KDD processI-Extended Databases (IE) and KDD process
continuecontinue
 KDD processes can be described by sequences of
operations, i.e., queries over relevant i-extended database.
 Sequences of queries are abstract and concise descriptions
of DM processes.
 These descriptions can even be annotated by statistical
information about the size of selected dataset, the size of
intermediate collection of patterns etc..
 Providing knowledge for further use of these relevant
sequences.
14
Example/
Patterns in three instances of I-Extended
Database
 Schema R = {A1,…..,An} of attributes with
domain {0, 1}.
 Relation r over R, an association rule about r is
an expression of the form X⇒B where X ⊆ R
and B ∊R  X.
 The intuitive meaning of the rule is that if a
row of the matrix r has a 1 in each column of
X, then the row tends to have a 1 also in
column B.
 This semantics is captured by frequency and
confidence values. Given W ⊆ R, support (W, r)
denotes the fraction of rows of r that have a 1
in each column of W.
 The frequency of X ⇒ B in r is defined to be
support(X ⋃{B}, r) while its confidence is
support(X ⋃ {B}, r)/ support(X , r). Typically,
we are interested in association rules for which
the frequency and the confidence are greater
15
Conclusions and RemarksConclusions and Remarks
 I-Extended Databases enables the definition of mining process
as a sequences of queries by using a closure property.
 I-Extended Databases is a mandatory step towards to a
general purpose query languages for KDD applications.
 I-Extended Databases supports pattern generation, pattern
filtering and pattern combining operations.
 I-Extended Databases can uses standard database
terminology to carry out any significant patterns without
introducing any additional concepts .
16
Importance ReferencesImportance References
 [20] T. Imielinski and H. Mannila. A database
perspective on knowledge discovery. Communications
of ACM, 39:58-64, 1996.
 [21] Zakaria S. Zubi, Knowledge Discovery in Remote
Access Database, Ch. 9 , PhD dissertation, Debrecen
University, Hungary, 2002.
 [22] Zakaria S. Zubi, Fazekas Gábor, On ODBC_KDD
models, paper,5th International Conference on Applied
Informatics, , 28 January -3 February 2001, Eger,
Hungary,2001.
17
Thank you!!!
18
19

Mais conteúdo relacionado

Mais procurados

Cs583 info-retrieval
Cs583 info-retrievalCs583 info-retrieval
Cs583 info-retrievalBorseshweta
 
Mc0088 data mining
Mc0088  data miningMc0088  data mining
Mc0088 data miningsmumbahelp
 
Exploratory data analysis with Python
Exploratory data analysis with PythonExploratory data analysis with Python
Exploratory data analysis with PythonDavis David
 
Data Mining: Concepts and techniques: Chapter 13 trend
Data Mining: Concepts and techniques: Chapter 13 trendData Mining: Concepts and techniques: Chapter 13 trend
Data Mining: Concepts and techniques: Chapter 13 trendSalah Amean
 
Scalable keyword search on large rdf data
Scalable keyword search on large rdf dataScalable keyword search on large rdf data
Scalable keyword search on large rdf dataLeMeniz Infotech
 
An improvised frequent pattern tree
An improvised frequent pattern treeAn improvised frequent pattern tree
An improvised frequent pattern treeIJDKP
 
similarity measure
similarity measure similarity measure
similarity measure ZHAO Sam
 
Machine learning introduction
Machine learning introductionMachine learning introduction
Machine learning introductionAnas Jamil
 
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...Salah Amean
 
A Framework to Automatically Extract Funding Information from Text
A Framework to Automatically Extract Funding Information from TextA Framework to Automatically Extract Funding Information from Text
A Framework to Automatically Extract Funding Information from TextDeep Kayal
 
Data Mining: Classification and analysis
Data Mining: Classification and analysisData Mining: Classification and analysis
Data Mining: Classification and analysisDataminingTools Inc
 
Automated building of taxonomies for search engines
Automated building of taxonomies for search enginesAutomated building of taxonomies for search engines
Automated building of taxonomies for search enginesBoris Galitsky
 
Machine learning module 2
Machine learning module 2Machine learning module 2
Machine learning module 2Gokulks007
 

Mais procurados (20)

Ej36829834
Ej36829834Ej36829834
Ej36829834
 
Bt0066 dbms
Bt0066 dbmsBt0066 dbms
Bt0066 dbms
 
Warehousing
WarehousingWarehousing
Warehousing
 
Data structures
Data structuresData structures
Data structures
 
Cs583 info-retrieval
Cs583 info-retrievalCs583 info-retrieval
Cs583 info-retrieval
 
Mc0088 data mining
Mc0088  data miningMc0088  data mining
Mc0088 data mining
 
Exploratory data analysis with Python
Exploratory data analysis with PythonExploratory data analysis with Python
Exploratory data analysis with Python
 
Data Mining: Concepts and techniques: Chapter 13 trend
Data Mining: Concepts and techniques: Chapter 13 trendData Mining: Concepts and techniques: Chapter 13 trend
Data Mining: Concepts and techniques: Chapter 13 trend
 
Scalable keyword search on large rdf data
Scalable keyword search on large rdf dataScalable keyword search on large rdf data
Scalable keyword search on large rdf data
 
An improvised frequent pattern tree
An improvised frequent pattern treeAn improvised frequent pattern tree
An improvised frequent pattern tree
 
similarity measure
similarity measure similarity measure
similarity measure
 
Machine learning introduction
Machine learning introductionMachine learning introduction
Machine learning introduction
 
Data types ,variables,array
Data types ,variables,arrayData types ,variables,array
Data types ,variables,array
 
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
 
A Framework to Automatically Extract Funding Information from Text
A Framework to Automatically Extract Funding Information from TextA Framework to Automatically Extract Funding Information from Text
A Framework to Automatically Extract Funding Information from Text
 
Data Mining: Classification and analysis
Data Mining: Classification and analysisData Mining: Classification and analysis
Data Mining: Classification and analysis
 
Automated building of taxonomies for search engines
Automated building of taxonomies for search enginesAutomated building of taxonomies for search engines
Automated building of taxonomies for search engines
 
Data mining
Data miningData mining
Data mining
 
Machine learning module 2
Machine learning module 2Machine learning module 2
Machine learning module 2
 
Data Mining and Knowledge
Data Mining and KnowledgeData Mining and Knowledge
Data Mining and Knowledge
 

Destaque

Knowledge Discovery in Remote Access Databases
Knowledge Discovery in Remote Access Databases Knowledge Discovery in Remote Access Databases
Knowledge Discovery in Remote Access Databases Zakaria Zubi
 
Knowledge Discovery Query Language (KDQL)
Knowledge Discovery Query Language (KDQL)Knowledge Discovery Query Language (KDQL)
Knowledge Discovery Query Language (KDQL)Zakaria Zubi
 
A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...
A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...
A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...Zakaria Zubi
 
Arabic Text mining Classification
Arabic Text mining Classification Arabic Text mining Classification
Arabic Text mining Classification Zakaria Zubi
 
COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA
COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA
COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA Zakaria Zubi
 
Using Data Mining Techniques to Analyze Crime Pattern
Using Data Mining Techniques to Analyze Crime PatternUsing Data Mining Techniques to Analyze Crime Pattern
Using Data Mining Techniques to Analyze Crime PatternZakaria Zubi
 

Destaque (8)

Ismail&&ziko 2003
Ismail&&ziko 2003Ismail&&ziko 2003
Ismail&&ziko 2003
 
Knowledge Discovery in Remote Access Databases
Knowledge Discovery in Remote Access Databases Knowledge Discovery in Remote Access Databases
Knowledge Discovery in Remote Access Databases
 
Edi text
Edi textEdi text
Edi text
 
Knowledge Discovery Query Language (KDQL)
Knowledge Discovery Query Language (KDQL)Knowledge Discovery Query Language (KDQL)
Knowledge Discovery Query Language (KDQL)
 
A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...
A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...
A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...
 
Arabic Text mining Classification
Arabic Text mining Classification Arabic Text mining Classification
Arabic Text mining Classification
 
COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA
COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA
COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA
 
Using Data Mining Techniques to Analyze Crime Pattern
Using Data Mining Techniques to Analyze Crime PatternUsing Data Mining Techniques to Analyze Crime Pattern
Using Data Mining Techniques to Analyze Crime Pattern
 

Semelhante a I- Extended Databases

Probablistic information retrieval
Probablistic information retrievalProbablistic information retrieval
Probablistic information retrievalNisha Arankandath
 
Document ranking using qprp with concept of multi dimensional subspace
Document ranking using qprp with concept of multi dimensional subspaceDocument ranking using qprp with concept of multi dimensional subspace
Document ranking using qprp with concept of multi dimensional subspacePrakash Dubey
 
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATAEFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATAcsandit
 
HyperQA: A Framework for Complex Question-Answering
HyperQA: A Framework for Complex Question-AnsweringHyperQA: A Framework for Complex Question-Answering
HyperQA: A Framework for Complex Question-AnsweringJinho Choi
 
Information retrival system and PageRank algorithm
Information retrival system and PageRank algorithmInformation retrival system and PageRank algorithm
Information retrival system and PageRank algorithmRupali Bhatnagar
 
PowerPoint Template
PowerPoint TemplatePowerPoint Template
PowerPoint Templatebutest
 
A Document Exploring System on LDA Topic Model for Wikipedia Articles
A Document Exploring System on LDA Topic Model for Wikipedia ArticlesA Document Exploring System on LDA Topic Model for Wikipedia Articles
A Document Exploring System on LDA Topic Model for Wikipedia Articlesijma
 
IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsi...
IRS-Lecture-Notes irsirs    IRS-Lecture-Notes irsirs   IRS-Lecture-Notes irsi...IRS-Lecture-Notes irsirs    IRS-Lecture-Notes irsirs   IRS-Lecture-Notes irsi...
IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsi...onlmcq
 
G04124041046
G04124041046G04124041046
G04124041046IOSR-JEN
 
Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Miningbutest
 
Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)Enayat Rajabi
 
Dataminingoneducationaldomain (1)
Dataminingoneducationaldomain (1)Dataminingoneducationaldomain (1)
Dataminingoneducationaldomain (1)IJASCSE
 
4-IR Models_new.ppt
4-IR Models_new.ppt4-IR Models_new.ppt
4-IR Models_new.pptBereketAraya
 
4-IR Models_new.ppt
4-IR Models_new.ppt4-IR Models_new.ppt
4-IR Models_new.pptBereketAraya
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligencevini89
 
Data Science Using Scikit-Learn
Data Science Using Scikit-LearnData Science Using Scikit-Learn
Data Science Using Scikit-LearnDucat India
 
Project Presentation
Project PresentationProject Presentation
Project Presentationbutest
 

Semelhante a I- Extended Databases (20)

Probablistic information retrieval
Probablistic information retrievalProbablistic information retrieval
Probablistic information retrieval
 
Document ranking using qprp with concept of multi dimensional subspace
Document ranking using qprp with concept of multi dimensional subspaceDocument ranking using qprp with concept of multi dimensional subspace
Document ranking using qprp with concept of multi dimensional subspace
 
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATAEFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
 
HyperQA: A Framework for Complex Question-Answering
HyperQA: A Framework for Complex Question-AnsweringHyperQA: A Framework for Complex Question-Answering
HyperQA: A Framework for Complex Question-Answering
 
Information retrival system and PageRank algorithm
Information retrival system and PageRank algorithmInformation retrival system and PageRank algorithm
Information retrival system and PageRank algorithm
 
R tutorial
R tutorialR tutorial
R tutorial
 
PowerPoint Template
PowerPoint TemplatePowerPoint Template
PowerPoint Template
 
A Document Exploring System on LDA Topic Model for Wikipedia Articles
A Document Exploring System on LDA Topic Model for Wikipedia ArticlesA Document Exploring System on LDA Topic Model for Wikipedia Articles
A Document Exploring System on LDA Topic Model for Wikipedia Articles
 
IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsi...
IRS-Lecture-Notes irsirs    IRS-Lecture-Notes irsirs   IRS-Lecture-Notes irsi...IRS-Lecture-Notes irsirs    IRS-Lecture-Notes irsirs   IRS-Lecture-Notes irsi...
IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsi...
 
G04124041046
G04124041046G04124041046
G04124041046
 
Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Mining
 
Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)
 
Dataminingoneducationaldomain (1)
Dataminingoneducationaldomain (1)Dataminingoneducationaldomain (1)
Dataminingoneducationaldomain (1)
 
4-IR Models_new.ppt
4-IR Models_new.ppt4-IR Models_new.ppt
4-IR Models_new.ppt
 
4-IR Models_new.ppt
4-IR Models_new.ppt4-IR Models_new.ppt
4-IR Models_new.ppt
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Data Science Using Scikit-Learn
Data Science Using Scikit-LearnData Science Using Scikit-Learn
Data Science Using Scikit-Learn
 
Ir
IrIr
Ir
 
Ir
IrIr
Ir
 
Project Presentation
Project PresentationProject Presentation
Project Presentation
 

Mais de Zakaria Zubi

applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...
applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...
applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...Zakaria Zubi
 
Applying web mining application for user behavior understanding
Applying web mining application for user behavior understandingApplying web mining application for user behavior understanding
Applying web mining application for user behavior understandingZakaria Zubi
 
Ibtc dwt hybrid coding of digital images
Ibtc dwt hybrid coding of digital imagesIbtc dwt hybrid coding of digital images
Ibtc dwt hybrid coding of digital imagesZakaria Zubi
 
Information communication technology in libya for educational purposes
Information communication technology in libya for educational purposesInformation communication technology in libya for educational purposes
Information communication technology in libya for educational purposesZakaria Zubi
 

Mais de Zakaria Zubi (6)

applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...
applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...
applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...
 
Applying web mining application for user behavior understanding
Applying web mining application for user behavior understandingApplying web mining application for user behavior understanding
Applying web mining application for user behavior understanding
 
Model
ModelModel
Model
 
Ibtc dwt hybrid coding of digital images
Ibtc dwt hybrid coding of digital imagesIbtc dwt hybrid coding of digital images
Ibtc dwt hybrid coding of digital images
 
Deep Web mining
Deep Web miningDeep Web mining
Deep Web mining
 
Information communication technology in libya for educational purposes
Information communication technology in libya for educational purposesInformation communication technology in libya for educational purposes
Information communication technology in libya for educational purposes
 

Último

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 

Último (20)

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 

I- Extended Databases

  • 1. I- Extended Databases Key words:Key words: Knowledge Discovery inKnowledge Discovery in Databases (KDD).Databases (KDD). Data Mining (DM).Data Mining (DM). Data Warehousing (DW) .Data Warehousing (DW) . Query Optimization (QO).Query Optimization (QO).
  • 2. Assistant Professor, Computer Science Department, Faculty of Science, Al-Tahadi University, P.O. Box 727, Sirt ,Libya, Dr. Zakaria Suliman ZubiDr. Zakaria Suliman Zubi ByBy
  • 3. 3 I- Extended DatabasesI- Extended Databases  Abstract .  Introduction of the Indicative Databases .  I-Extended Databases (IE) motivation.  I-Extended Databases (IE) and KDD processes .  Example .  Conclusions and Remarks .  Questions.
  • 4. 4 AbstractAbstract (1)  How we can handle generalizations in a very large database using Association Rules (AR), and inclusion Functional Dependencies (FD)?  The answer is Inductive database.  I- Extended database has a similar property to inductive databases.  I- Extended database contain exceedingly defined generalizations about the data .
  • 5. 5 AbstractAbstract (2)  It can be used in the process of Data Mining.  It was proposed in ODBC_KDD(2) Model.  The query will uses normal database terminology.  The main aim of I-Extended database is to interact with a spatial Data Mining query called Knowledge Discovery Query Language (KDQL) described in [22].  The KDQL was demonstrated and introduced as a query in the ODBC_KDD (2) model in [22].
  • 6. 6 Introduction of the Indicative DatabasesIntroduction of the Indicative Databases  KDD process, contains several steps: understanding the domain, preparing the data set, discovering patterns (i.e., computing a theory), post-processing of discovered patterns, and putting the results into use.  KDD, we need a query language that not only enables the user to select subsets of the data, but also to specify DM tasks and select patterns from the corresponding theories.  Considering the KDQL rules operator which was described in [ 21] as a possible querying language on mining association rules for i-extended database.  Query should be an object of a similar type than its arguments.
  • 7. 7 The model was introduced at the Institute of Mathematics andThe model was introduced at the Institute of Mathematics and Informatics at Debrecen University, Debrecen, Hungary 2002.Informatics at Debrecen University, Debrecen, Hungary 2002. I-Extended Databases Motivation Gateway
  • 8. 8  I-Extended database is a pair R = (R, (PR, e, V))  Where : –R is a database schema. –PRis a collection of patterns. –V is a set of result values . – e is the evaluation function that defines pattern semantics.  This function maps each pair (r, θi) to an element of V, where r is a database over R and θi P∊ R is a pattern.  An instance of the schema, i-extended database (r, s) over the schema R consists of a database r over the schema R and a subset s ⊆ PR. I-Extended Databases MotivationI-Extended Databases Motivation continuecontinue
  • 9. 9  Example : If the patterns are Boolean formulae about the database, V is {true, false}, And the evaluation function e(r, θ) has value true iff the formula θ is true about r. In practice, a user might be interested in selecting from the intentionally defined collection of all Boolean formulas, the formulas which are true or the formulas which are false. I-Extended Databases MotivationI-Extended Databases Motivation continuecontinue
  • 10. 10 I-Extended Databases MotivationI-Extended Databases Motivation continuecontinue  I-Extended Database : Is a database that in addition to data also contain exceedingly defined generalizations about the data. First we illustrate the Association Rules, and then we Generalize the approach and point out key issues for query evaluation in general.  I-Extended database is a database that has similar properties that are in inductive database that shows how it can be used throughout the whole process of DM due to the closure property of the framework.
  • 11. 11 I-Extended Databases MotivationI-Extended Databases Motivation continuecontinue  The aim of I-Extended Database is as follow:The aim of I-Extended Database is as follow: – I-extended database consists of a normal database associated to a subset of patterns from a class of patterns, and an evaluation function that tells how the patterns occur in the data. – I-extended database can be queried (in principle) just by using normal relational algebra or SQL, with the added property of being able to refer to the values of the evaluation function on the patterns. – Modeling KDD processes as a sequence of queries on i-extended database gives rise to chances for reasoning and optimizing these processes
  • 12. 12 I-Extended Databases (IE) and KDD processes  KDD consists of several steps one of these steps is Data Mining.  In Data Mining process we are concerned with unique class of patterns for a real life mining processes presented in a dynamic nature of knowledge acquisition scenario.  These interesting patterns will be presented in I-Extended Databases based on there captured frequency, confidence and support values.  Knowledge gathered often affects the search process, giving rise to new goals in addition to the original ones.
  • 13. 13 I-Extended Databases (IE) and KDD processI-Extended Databases (IE) and KDD process continuecontinue  KDD processes can be described by sequences of operations, i.e., queries over relevant i-extended database.  Sequences of queries are abstract and concise descriptions of DM processes.  These descriptions can even be annotated by statistical information about the size of selected dataset, the size of intermediate collection of patterns etc..  Providing knowledge for further use of these relevant sequences.
  • 14. 14 Example/ Patterns in three instances of I-Extended Database  Schema R = {A1,…..,An} of attributes with domain {0, 1}.  Relation r over R, an association rule about r is an expression of the form X⇒B where X ⊆ R and B ∊R X.  The intuitive meaning of the rule is that if a row of the matrix r has a 1 in each column of X, then the row tends to have a 1 also in column B.  This semantics is captured by frequency and confidence values. Given W ⊆ R, support (W, r) denotes the fraction of rows of r that have a 1 in each column of W.  The frequency of X ⇒ B in r is defined to be support(X ⋃{B}, r) while its confidence is support(X ⋃ {B}, r)/ support(X , r). Typically, we are interested in association rules for which the frequency and the confidence are greater
  • 15. 15 Conclusions and RemarksConclusions and Remarks  I-Extended Databases enables the definition of mining process as a sequences of queries by using a closure property.  I-Extended Databases is a mandatory step towards to a general purpose query languages for KDD applications.  I-Extended Databases supports pattern generation, pattern filtering and pattern combining operations.  I-Extended Databases can uses standard database terminology to carry out any significant patterns without introducing any additional concepts .
  • 16. 16 Importance ReferencesImportance References  [20] T. Imielinski and H. Mannila. A database perspective on knowledge discovery. Communications of ACM, 39:58-64, 1996.  [21] Zakaria S. Zubi, Knowledge Discovery in Remote Access Database, Ch. 9 , PhD dissertation, Debrecen University, Hungary, 2002.  [22] Zakaria S. Zubi, Fazekas Gábor, On ODBC_KDD models, paper,5th International Conference on Applied Informatics, , 28 January -3 February 2001, Eger, Hungary,2001.
  • 18. 18
  • 19. 19