SlideShare uma empresa Scribd logo
1 de 31
Baixar para ler offline
Towards an Empirical Semantic
Web Science: Knowledge Pattern
Extraction and Usage
           Andrea Nuzzolese
                    Ph.D. Student
            Università di Bologna
               STLab, ISTC-CNR
Outline




•   Empirical Semantic Web Science and Knowledge Patterns (KPs)

•   A possible methodology for making KPs emerge from the Web of
    Data

•   The work done so far in KP extraction

•   Evaluating KPs' efficacy through Exploratory Search




                                2
Does a Web science exist?


•   A science usually is applied to clear research objects
    ✦   Physical and biological science analyzes the natural world, and tries to find
        microscopic laws that, extrapolated to the macroscopic realm, would
        generate the behavior observed

•   The Web is an engineered space created through formally
    specified languages and protocols

•   Web pages with their content and links are created by humans
    with a particular task governed by social conventions and laws

•   A Web science exists [Berners-Lee Et Al., 2006] and is oriented
    to:
    ✦   Growth of the engineered space;
    ✦   Human-web interaction patterns
                                         3
What about a Web of Data science?

•   Linked data offers huge data for empirical research




                                 4
What are the research objects of the empirical
                SW science?




 •   The Semantic Web and Linked data give us the chance to
     empirically study what are the patterns in organizing and
     representing knowledge

 •   The research objects of the Semantic Web as an empirical science
     are Knowledge Patterns (KPs)




                                  5
Knoweldge Patterns




•   KPs are small well connected units of meaning, which are
    ✦   task based
    ✦   well grounded
    ✦   cognitively sound

•   KPs find their theoretical grounding in frames
    ✦   “… a frame is a data-structure for representing a stereotyped
        situation.” [Minsky 1975]
    ✦   “...the availability of global patterns of knowledge cuts down on non-determinacy
        enough to offset idiosyncratic bottom-up input that might otherwise be
        confusing.” [Beaugrande 1980]



                                           6
An example of KP




         7
Empirical Semantic Web and KPs




•   KPs emerge from the knowledge soup deriving from the Web

•   A methodology for KP extraction from the Web




                              8
KP extraction



•   The Web is populated by heterogeneous sources

•   We can classify sources in two categories
    ✦   Formal and semi-formal sources modeled by adopting a top-down approach
        ✴   e.g., foundational ontologies, frames, thesauri, etc.
    ✦   Non-formal sources modeled by adopting a bottom-up approach
        ✴   e.g., RDBs, Linked Data, Web pages, XML documents, etc.

•   Our KP extraction methodology is based on two complementary
    approaches
    ✦   A top-down approach
    ✦   A bottom-up approach


                                               9
KP boundary




      10
KP detection and discovery




•   The top-down approach is aimed to extract KPs that already
    exists in a formal or semi-formal structure
    ✦   Possible techniques: reengineering, refactoring based on association rules,
        key concept identification, ontology mapping, etc.

•   The bottom-up approach is aimed to extract to discover or detect
    KPs from data
    ✦   Possible techniques: inductive techniques, machine learning, data mining,
        ontology mining, etc.




                                        11
KP validation



•   The top-down and the bottom-up approaches concur in the
    validation of KPs

•   KP extraction is a matter of understanding how the world or
    specific domains have been described from different perspectives
    ✦   The perspective of domain experts, ontologists, etc., which try to give
        formalizations either of the world or of specific domains
    ✦   The perspective of users, data entries, etc, which effectively populate and
        manage data that report facts about the world

•   For example it would be cognitively relevant if an occurrence of
    KP emerges both with the top-down and the bottom-up
    approach

                                        12
KP extraction methodology




             13
KP reengineering from FrameNet’s frames




•   FrameNet is a cognitive sound lexical knowledge base, which is
    grounded in a large corpus

•   FrameNet consists of a set of frames, which have frame elements
    lexical units, which pair words (lexemes) to frames, and relations
    to corpus elements
    ✦   Each frame can be interpreted as a class of situations




                                        14
An example of frame




          15
Using Semion for reengineering and
                refactoring FrameNet’s frame

!"#$%"$#&'(
!%)*+&(




,-./$-01%(
!%)*+&(




,-./$-01%(
2&"&(




34#5$0(
2&"&(




6*7*#*.1&'(
2&"&(



                                16
FrameNet as LOD




        17
FrameNet as KPs




        18
KP discovery from Wikipedia links




•   Hypothesis
    ✦   the types of linked resources that occur most often for a certain type of
        resource constitute its KP
    ✦   since we expect that any cognitive invariance in explaining/describing things
        is reflected in the wikilink graph, discovered KPs are cognitively sound

•   Contribution
    ✦   an EKP discovery procedure
    ✦   184 EKPs published in OWL2




                                        19
Collecting paths from wikilinks

                                                                              dbpedia:
     dbpo:Person            owl:Thing                        owl:Thing
                                                                             Organisation


                                                Path
        dbpo:                                                                 dbpedia:
                           db:Minnie_Mouse      db:The_Walt_Disney_Company    Company
 FictionalCharacter




dbpo:wikiPageWikiLink                           Path

       rdf:type
                                  dbpo: db:Mickey_Mouse
                           FictionalCharacter
    rdfs:subClassOf

                               dbpo:
                                                             owl:Thing
                        FictionalCharacter

                                             dbpo:Person
                                                       20
Path popularity


                                           Jackson_5
        Dave_Grohl          Michael_Jackson

                                                              Jackie_Jackson
                       Nirvana

                                Madonna
                                                 Prince
                       Charlie_Parker                     Keith_Jarrett

Foo Fighters                                Beatles
   nSubjectRes(Pi,j)/nRes(Si)

                                                              John_Lennon
                                Paul_McCartney



                                     21
Boundaries of KPs




•   An KP(Si) is a set of paths, such that


                  Pi,j ∈ KP(Si) !   pathPopularity(Pi,j, Si) ≥ t



•   t is a threshold, under which a path is not included in an KP

•   How to get a good value for t?



                                     22
Boundary induction


Step                        Description

 1     For each path, calculate the path popularity

       For each subject type, get the 40 top-ranked path popularity
 2
       values*
       Apply multiple correlation (Pearson ρ) between the paths of all
 3     subject types by rank, and check for homogeneity of ranks
       across subject types
       For each of the 40 path popularity ranks, calculate its mean
 4
       across all subject types

 5     Apply k-means clustering on the 40 ranks

       Decide threshold(s) based on k-means as well as other
 6
       indicators (e.g. FrameNet roles distribution)
                             23
Boundary induction




          24
How can be KPs evaluated and used?




•   The evaluation of KPs should be performed in terms of their
    capability to be cognitively sound in capturing and representing
    knowledge

•   A scenario that can be used as for evaluating the efficacy of KPs
    is the exploratory search combined with user studies.




                                 25
Why exploratory search?



•   Exploratory search is characterized “by uncertainty about the space
    being searched and the nature of the problem that motivates the
    search” [White Et Al., 2005]

•   KPs can be used for supporting exploratory search
    ✦   They can be used in order to filter knowledge by drawing a meaningful
        boundary around the retrieved data
    ✦   They allow to suggest exploratory paths based on cognitive criteria of
        relevance

•   We can investigate how KPs help users in exploratory search
    tasks


                                       26
Aemoo: KP-based exploratory search




•   A Web application that supports exploratory search on the Web
    based on KPs extracted from Wikipedia links

•   It aggregates knowledge from Linked Data, Wikipedia, Twitter and
    Google News by applying KPs as knowledge lenses over data

•   It provides an effective summary of knowledge about an entity,
    including explanations




                                27
Exploring knowledge with Aemoo (1)




                  28
Exploring knowledge with Aemoo (2)




                  29
Conclusions


•   We want to contribute to the realization of the Semantic Web as
    an empirical science by providing a methodology for KP
    extraction

•   Our methodology for extracting KPs is based on two approaches
    ✦   a top-down approach
    ✦   a bottom-up approach

•   We have seen our experience in KP extraction so far
    ✦   KPs from FrameNet’s frames
    ✦   KPs from Wikipedia links

•   The evaluation we have in mind should be performed by means of
    exploratory search tasks
    ✦   Aemoo
                                     30
Thanks




  31

Mais conteúdo relacionado

Destaque

Conference Linked Data: the ScholarlyData project
Conference Linked Data: the ScholarlyData projectConference Linked Data: the ScholarlyData project
Conference Linked Data: the ScholarlyData projectAndrea Nuzzolese
 
Winnie's presentation
Winnie's presentationWinnie's presentation
Winnie's presentationasiimwe1990
 
David aradillasppt
David aradillaspptDavid aradillasppt
David aradillaspptdeividex
 
Differentiated unit- plant CSI
Differentiated unit- plant CSIDifferentiated unit- plant CSI
Differentiated unit- plant CSImariatzifas
 
Winnie's presentation
Winnie's presentationWinnie's presentation
Winnie's presentationasiimwe1990
 
Winnie's presentation
Winnie's presentationWinnie's presentation
Winnie's presentationasiimwe1990
 
Leader brands analysis
Leader brands analysisLeader brands analysis
Leader brands analysisMark O'Connor
 
eCommerce Age: Come aumentare le vendite del tuo Shop Online con l'Email Mark...
eCommerce Age: Come aumentare le vendite del tuo Shop Online con l'Email Mark...eCommerce Age: Come aumentare le vendite del tuo Shop Online con l'Email Mark...
eCommerce Age: Come aumentare le vendite del tuo Shop Online con l'Email Mark...Guglielmo Arrigoni
 

Destaque (9)

Conference Linked Data: the ScholarlyData project
Conference Linked Data: the ScholarlyData projectConference Linked Data: the ScholarlyData project
Conference Linked Data: the ScholarlyData project
 
Winnie's presentation
Winnie's presentationWinnie's presentation
Winnie's presentation
 
David aradillasppt
David aradillaspptDavid aradillasppt
David aradillasppt
 
Differentiated unit- plant CSI
Differentiated unit- plant CSIDifferentiated unit- plant CSI
Differentiated unit- plant CSI
 
Winnie's presentation
Winnie's presentationWinnie's presentation
Winnie's presentation
 
Oke
OkeOke
Oke
 
Winnie's presentation
Winnie's presentationWinnie's presentation
Winnie's presentation
 
Leader brands analysis
Leader brands analysisLeader brands analysis
Leader brands analysis
 
eCommerce Age: Come aumentare le vendite del tuo Shop Online con l'Email Mark...
eCommerce Age: Come aumentare le vendite del tuo Shop Online con l'Email Mark...eCommerce Age: Come aumentare le vendite del tuo Shop Online con l'Email Mark...
eCommerce Age: Come aumentare le vendite del tuo Shop Online con l'Email Mark...
 

Semelhante a Towards an Empirical Semantic Web Science: Knowledge Pattern Extraction and Usage

ESWC 2011 BLOOMS+
ESWC 2011 BLOOMS+ ESWC 2011 BLOOMS+
ESWC 2011 BLOOMS+ Prateek Jain
 
Knowledge Patterns for the Web: extraction, transformation, and reuse
Knowledge Patterns for the Web: extraction, transformation, and reuseKnowledge Patterns for the Web: extraction, transformation, and reuse
Knowledge Patterns for the Web: extraction, transformation, and reuseAndrea Nuzzolese
 
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3Dr. Aparna Varde
 
Type inference through the analysis of Wikipedia links
Type inference through the analysis of Wikipedia linksType inference through the analysis of Wikipedia links
Type inference through the analysis of Wikipedia linksAndrea Nuzzolese
 
Introduction_to_knowledge_graph.pdf
Introduction_to_knowledge_graph.pdfIntroduction_to_knowledge_graph.pdf
Introduction_to_knowledge_graph.pdfJaberRad1
 
Linked Open Data Alignment and Enrichment Using Bootstrapping Based Techniques
Linked Open Data Alignment and Enrichment Using Bootstrapping Based TechniquesLinked Open Data Alignment and Enrichment Using Bootstrapping Based Techniques
Linked Open Data Alignment and Enrichment Using Bootstrapping Based TechniquesPrateek Jain
 
Wi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX toolWi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX toolLaura Po
 
Building AI Applications using Knowledge Graphs
Building AI Applications using Knowledge GraphsBuilding AI Applications using Knowledge Graphs
Building AI Applications using Knowledge GraphsAndre Freitas
 
An Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4jAn Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4jDebanjan Mahata
 
Blurring boundaries to spark motivation: collaborative approaches to teaching...
Blurring boundaries to spark motivation: collaborative approaches to teaching...Blurring boundaries to spark motivation: collaborative approaches to teaching...
Blurring boundaries to spark motivation: collaborative approaches to teaching...megan.fitzgibbons
 
Fueling the future with Semantic Web patterns - Keynote at WOP2014@ISWC
Fueling the future with Semantic Web patterns - Keynote at WOP2014@ISWCFueling the future with Semantic Web patterns - Keynote at WOP2014@ISWC
Fueling the future with Semantic Web patterns - Keynote at WOP2014@ISWCValentina Presutti
 
Prateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State UniversityPrateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State UniversityPrateek Jain
 
20130622 okfn hackathon t2
20130622 okfn hackathon t220130622 okfn hackathon t2
20130622 okfn hackathon t2Seonho Kim
 
Mining Web content for Enhanced Search
Mining Web content for Enhanced Search Mining Web content for Enhanced Search
Mining Web content for Enhanced Search Roi Blanco
 
A Knowledge Discovery Framework for Planetary Defense
A Knowledge Discovery Framework for Planetary DefenseA Knowledge Discovery Framework for Planetary Defense
A Knowledge Discovery Framework for Planetary DefenseYongyao Jiang
 
Book of the Dead Project
Book of the Dead ProjectBook of the Dead Project
Book of the Dead ProjectBarry Norton
 

Semelhante a Towards an Empirical Semantic Web Science: Knowledge Pattern Extraction and Usage (20)

ESWC 2011 BLOOMS+
ESWC 2011 BLOOMS+ ESWC 2011 BLOOMS+
ESWC 2011 BLOOMS+
 
Knowledge Patterns for the Web: extraction, transformation, and reuse
Knowledge Patterns for the Web: extraction, transformation, and reuseKnowledge Patterns for the Web: extraction, transformation, and reuse
Knowledge Patterns for the Web: extraction, transformation, and reuse
 
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
 
Type inference through the analysis of Wikipedia links
Type inference through the analysis of Wikipedia linksType inference through the analysis of Wikipedia links
Type inference through the analysis of Wikipedia links
 
Introduction_to_knowledge_graph.pdf
Introduction_to_knowledge_graph.pdfIntroduction_to_knowledge_graph.pdf
Introduction_to_knowledge_graph.pdf
 
PhD Proposal Defense - Prateek Jain
PhD Proposal Defense - Prateek JainPhD Proposal Defense - Prateek Jain
PhD Proposal Defense - Prateek Jain
 
Linked Open Data Alignment and Enrichment Using Bootstrapping Based Techniques
Linked Open Data Alignment and Enrichment Using Bootstrapping Based TechniquesLinked Open Data Alignment and Enrichment Using Bootstrapping Based Techniques
Linked Open Data Alignment and Enrichment Using Bootstrapping Based Techniques
 
Wi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX toolWi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX tool
 
Building AI Applications using Knowledge Graphs
Building AI Applications using Knowledge GraphsBuilding AI Applications using Knowledge Graphs
Building AI Applications using Knowledge Graphs
 
An Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4jAn Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4j
 
Blurring boundaries to spark motivation: collaborative approaches to teaching...
Blurring boundaries to spark motivation: collaborative approaches to teaching...Blurring boundaries to spark motivation: collaborative approaches to teaching...
Blurring boundaries to spark motivation: collaborative approaches to teaching...
 
Fueling the future with Semantic Web patterns - Keynote at WOP2014@ISWC
Fueling the future with Semantic Web patterns - Keynote at WOP2014@ISWCFueling the future with Semantic Web patterns - Keynote at WOP2014@ISWC
Fueling the future with Semantic Web patterns - Keynote at WOP2014@ISWC
 
Prateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State UniversityPrateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State University
 
20130622 okfn hackathon t2
20130622 okfn hackathon t220130622 okfn hackathon t2
20130622 okfn hackathon t2
 
Mining Web content for Enhanced Search
Mining Web content for Enhanced Search Mining Web content for Enhanced Search
Mining Web content for Enhanced Search
 
Domain-specific Knowledge Extraction from the Web of Data
Domain-specific Knowledge Extraction from the Web of DataDomain-specific Knowledge Extraction from the Web of Data
Domain-specific Knowledge Extraction from the Web of Data
 
Prateek Jain's Dissertation Defense - Linked Open Data Alignment and Querying
Prateek Jain's Dissertation Defense - Linked Open Data Alignment and QueryingPrateek Jain's Dissertation Defense - Linked Open Data Alignment and Querying
Prateek Jain's Dissertation Defense - Linked Open Data Alignment and Querying
 
A Knowledge Discovery Framework for Planetary Defense
A Knowledge Discovery Framework for Planetary DefenseA Knowledge Discovery Framework for Planetary Defense
A Knowledge Discovery Framework for Planetary Defense
 
Book of the Dead Project
Book of the Dead ProjectBook of the Dead Project
Book of the Dead Project
 
Deep Web mining
Deep Web miningDeep Web mining
Deep Web mining
 

Mais de Andrea Nuzzolese

Semantic Technologies in ST&DL
Semantic Technologies in ST&DLSemantic Technologies in ST&DL
Semantic Technologies in ST&DLAndrea Nuzzolese
 
Evaluating citation functions in CiTO: cognitive issues
Evaluating citation functions in CiTO: cognitive issuesEvaluating citation functions in CiTO: cognitive issues
Evaluating citation functions in CiTO: cognitive issuesAndrea Nuzzolese
 
Towards the automatic identification of the nature of citations
Towards the automatic identification of the nature of citationsTowards the automatic identification of the nature of citations
Towards the automatic identification of the nature of citationsAndrea Nuzzolese
 
Knowledge Representation and Reasoning with Apache Stanbol
Knowledge Representation and Reasoning with Apache StanbolKnowledge Representation and Reasoning with Apache Stanbol
Knowledge Representation and Reasoning with Apache StanbolAndrea Nuzzolese
 
Gathering Lexical Linked Data and Knowledge Patterns from FrameNet
Gathering Lexical Linked Data and Knowledge Patterns from FrameNetGathering Lexical Linked Data and Knowledge Patterns from FrameNet
Gathering Lexical Linked Data and Knowledge Patterns from FrameNetAndrea Nuzzolese
 
Aemoo: exploratory search based on knowledge patterns over the Semantic Web
Aemoo:  exploratory search based on knowledge patterns over the Semantic WebAemoo:  exploratory search based on knowledge patterns over the Semantic Web
Aemoo: exploratory search based on knowledge patterns over the Semantic WebAndrea Nuzzolese
 

Mais de Andrea Nuzzolese (8)

Semantic Technologies in ST&DL
Semantic Technologies in ST&DLSemantic Technologies in ST&DL
Semantic Technologies in ST&DL
 
Sheldon challenge
Sheldon challengeSheldon challenge
Sheldon challenge
 
Evaluating citation functions in CiTO: cognitive issues
Evaluating citation functions in CiTO: cognitive issuesEvaluating citation functions in CiTO: cognitive issues
Evaluating citation functions in CiTO: cognitive issues
 
Loditaly2014 new
Loditaly2014 newLoditaly2014 new
Loditaly2014 new
 
Towards the automatic identification of the nature of citations
Towards the automatic identification of the nature of citationsTowards the automatic identification of the nature of citations
Towards the automatic identification of the nature of citations
 
Knowledge Representation and Reasoning with Apache Stanbol
Knowledge Representation and Reasoning with Apache StanbolKnowledge Representation and Reasoning with Apache Stanbol
Knowledge Representation and Reasoning with Apache Stanbol
 
Gathering Lexical Linked Data and Knowledge Patterns from FrameNet
Gathering Lexical Linked Data and Knowledge Patterns from FrameNetGathering Lexical Linked Data and Knowledge Patterns from FrameNet
Gathering Lexical Linked Data and Knowledge Patterns from FrameNet
 
Aemoo: exploratory search based on knowledge patterns over the Semantic Web
Aemoo:  exploratory search based on knowledge patterns over the Semantic WebAemoo:  exploratory search based on knowledge patterns over the Semantic Web
Aemoo: exploratory search based on knowledge patterns over the Semantic Web
 

Último

Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the ClassroomPooky Knightsmith
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsKarakKing
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdfssuserdda66b
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSCeline George
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structuredhanjurrannsibayan2
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxcallscotland1987
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136
 

Último (20)

Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 

Towards an Empirical Semantic Web Science: Knowledge Pattern Extraction and Usage

  • 1. Towards an Empirical Semantic Web Science: Knowledge Pattern Extraction and Usage Andrea Nuzzolese Ph.D. Student Università di Bologna STLab, ISTC-CNR
  • 2. Outline • Empirical Semantic Web Science and Knowledge Patterns (KPs) • A possible methodology for making KPs emerge from the Web of Data • The work done so far in KP extraction • Evaluating KPs' efficacy through Exploratory Search 2
  • 3. Does a Web science exist? • A science usually is applied to clear research objects ✦ Physical and biological science analyzes the natural world, and tries to find microscopic laws that, extrapolated to the macroscopic realm, would generate the behavior observed • The Web is an engineered space created through formally specified languages and protocols • Web pages with their content and links are created by humans with a particular task governed by social conventions and laws • A Web science exists [Berners-Lee Et Al., 2006] and is oriented to: ✦ Growth of the engineered space; ✦ Human-web interaction patterns 3
  • 4. What about a Web of Data science? • Linked data offers huge data for empirical research 4
  • 5. What are the research objects of the empirical SW science? • The Semantic Web and Linked data give us the chance to empirically study what are the patterns in organizing and representing knowledge • The research objects of the Semantic Web as an empirical science are Knowledge Patterns (KPs) 5
  • 6. Knoweldge Patterns • KPs are small well connected units of meaning, which are ✦ task based ✦ well grounded ✦ cognitively sound • KPs find their theoretical grounding in frames ✦ “… a frame is a data-structure for representing a stereotyped situation.” [Minsky 1975] ✦ “...the availability of global patterns of knowledge cuts down on non-determinacy enough to offset idiosyncratic bottom-up input that might otherwise be confusing.” [Beaugrande 1980] 6
  • 8. Empirical Semantic Web and KPs • KPs emerge from the knowledge soup deriving from the Web • A methodology for KP extraction from the Web 8
  • 9. KP extraction • The Web is populated by heterogeneous sources • We can classify sources in two categories ✦ Formal and semi-formal sources modeled by adopting a top-down approach ✴ e.g., foundational ontologies, frames, thesauri, etc. ✦ Non-formal sources modeled by adopting a bottom-up approach ✴ e.g., RDBs, Linked Data, Web pages, XML documents, etc. • Our KP extraction methodology is based on two complementary approaches ✦ A top-down approach ✦ A bottom-up approach 9
  • 11. KP detection and discovery • The top-down approach is aimed to extract KPs that already exists in a formal or semi-formal structure ✦ Possible techniques: reengineering, refactoring based on association rules, key concept identification, ontology mapping, etc. • The bottom-up approach is aimed to extract to discover or detect KPs from data ✦ Possible techniques: inductive techniques, machine learning, data mining, ontology mining, etc. 11
  • 12. KP validation • The top-down and the bottom-up approaches concur in the validation of KPs • KP extraction is a matter of understanding how the world or specific domains have been described from different perspectives ✦ The perspective of domain experts, ontologists, etc., which try to give formalizations either of the world or of specific domains ✦ The perspective of users, data entries, etc, which effectively populate and manage data that report facts about the world • For example it would be cognitively relevant if an occurrence of KP emerges both with the top-down and the bottom-up approach 12
  • 14. KP reengineering from FrameNet’s frames • FrameNet is a cognitive sound lexical knowledge base, which is grounded in a large corpus • FrameNet consists of a set of frames, which have frame elements lexical units, which pair words (lexemes) to frames, and relations to corpus elements ✦ Each frame can be interpreted as a class of situations 14
  • 15. An example of frame 15
  • 16. Using Semion for reengineering and refactoring FrameNet’s frame !"#$%"$#&'( !%)*+&( ,-./$-01%( !%)*+&( ,-./$-01%( 2&"&( 34#5$0( 2&"&( 6*7*#*.1&'( 2&"&( 16
  • 19. KP discovery from Wikipedia links • Hypothesis ✦ the types of linked resources that occur most often for a certain type of resource constitute its KP ✦ since we expect that any cognitive invariance in explaining/describing things is reflected in the wikilink graph, discovered KPs are cognitively sound • Contribution ✦ an EKP discovery procedure ✦ 184 EKPs published in OWL2 19
  • 20. Collecting paths from wikilinks dbpedia: dbpo:Person owl:Thing owl:Thing Organisation Path dbpo: dbpedia: db:Minnie_Mouse db:The_Walt_Disney_Company Company FictionalCharacter dbpo:wikiPageWikiLink Path rdf:type dbpo: db:Mickey_Mouse FictionalCharacter rdfs:subClassOf dbpo: owl:Thing FictionalCharacter dbpo:Person 20
  • 21. Path popularity Jackson_5 Dave_Grohl Michael_Jackson Jackie_Jackson Nirvana Madonna Prince Charlie_Parker Keith_Jarrett Foo Fighters Beatles nSubjectRes(Pi,j)/nRes(Si) John_Lennon Paul_McCartney 21
  • 22. Boundaries of KPs • An KP(Si) is a set of paths, such that Pi,j ∈ KP(Si) ! pathPopularity(Pi,j, Si) ≥ t • t is a threshold, under which a path is not included in an KP • How to get a good value for t? 22
  • 23. Boundary induction Step Description 1 For each path, calculate the path popularity For each subject type, get the 40 top-ranked path popularity 2 values* Apply multiple correlation (Pearson ρ) between the paths of all 3 subject types by rank, and check for homogeneity of ranks across subject types For each of the 40 path popularity ranks, calculate its mean 4 across all subject types 5 Apply k-means clustering on the 40 ranks Decide threshold(s) based on k-means as well as other 6 indicators (e.g. FrameNet roles distribution) 23
  • 25. How can be KPs evaluated and used? • The evaluation of KPs should be performed in terms of their capability to be cognitively sound in capturing and representing knowledge • A scenario that can be used as for evaluating the efficacy of KPs is the exploratory search combined with user studies. 25
  • 26. Why exploratory search? • Exploratory search is characterized “by uncertainty about the space being searched and the nature of the problem that motivates the search” [White Et Al., 2005] • KPs can be used for supporting exploratory search ✦ They can be used in order to filter knowledge by drawing a meaningful boundary around the retrieved data ✦ They allow to suggest exploratory paths based on cognitive criteria of relevance • We can investigate how KPs help users in exploratory search tasks 26
  • 27. Aemoo: KP-based exploratory search • A Web application that supports exploratory search on the Web based on KPs extracted from Wikipedia links • It aggregates knowledge from Linked Data, Wikipedia, Twitter and Google News by applying KPs as knowledge lenses over data • It provides an effective summary of knowledge about an entity, including explanations 27
  • 28. Exploring knowledge with Aemoo (1) 28
  • 29. Exploring knowledge with Aemoo (2) 29
  • 30. Conclusions • We want to contribute to the realization of the Semantic Web as an empirical science by providing a methodology for KP extraction • Our methodology for extracting KPs is based on two approaches ✦ a top-down approach ✦ a bottom-up approach • We have seen our experience in KP extraction so far ✦ KPs from FrameNet’s frames ✦ KPs from Wikipedia links • The evaluation we have in mind should be performed by means of exploratory search tasks ✦ Aemoo 30