SlideShare uma empresa Scribd logo
1 de 49
Ontologies for multimedia:Ontologies for multimedia:
the Semantic Culture Webthe Semantic Culture Web
Guus SchreiberGuus Schreiber
Free University AmsterdamFree University Amsterdam
Co-chair W3C Semantic Web Deployment WGCo-chair W3C Semantic Web Deployment WG
Overview
• My target: a Semantic Culture Web
• Ontology perspective:
– Principles for ontology engineering on Web scale
– Some remarks about web standards
• Technologies for realizing a Culture Web
– Ontology-based methods
– Image analysis
– NLP / information extraction
– Combinations are key!
Acknowledgements
• MultimediaN E-Culture Project:
– Alia Amin, Mark van Assem, Victor de Boer, Lynda
Hardman, Michiel Hildebrand, Laura Hollink, Zhisheng
Huang, Marco de Niet, Borys Omelayenko, Jacco van
Ossenbruggen, Ronny Siebes, Jos Taekema, Anna Tordai,
Jan Wielemaker, Bob Wielinga
• CHOICE Project @ Sound & Vision
– Hennie Brugman, Luit Gazendam, Veronique Malaise,
Johan Oomen, Mettina Veenstra
• MuNCH project @ Sound & Vision
– Laura Hollink, Bouke Hunning, Michiel van Liempt, Johan
Oomen Maarten de Rijke, Arnold Smeulders, Cees Snoek,
Marcel Worring,
Culture Web
Principles for ontology
engineering in a distributed world
1. Modesty principle
• Ontology engineers should refrain from
developing their own idiosyncratic ontologies
• Instead, they should make the available rich
vocabularies, thesauri and databases
available in web format
• Initially, only add the originally intended
semantics
Implicit WordNet semantics
“The ent operator specifies that the second
synset is an entailment of first synset. This
relation only holds for verbs. “
• Example: [breathe, inhale] entails [sneeze,
exhale]
• Semantics (OWL statements):
– Transitive property
– Inverse property: entailedBy
– Value restrictions for VerbSynSet (subclass of
SynSet)
Recipes for vocabulary URIs
• Simplified rule:
– Use “hash" variant” for vocabularies that are
relatively small and require frequent access
http://www.w3.org/2004/02/skos/core#Concept
– Use “slash” variant for large vocabularies, where
you do not want always the whole vocabulary to
be retrieved
http://xmlns.com/foaf/0.1/Person
• For more information and other recipes, see:
http://www.w3.org/TR/swbp-vocab-pub/
Query for WordNet URI returns
“concept-bounded description”
How useful are RDF and OWL?
• RDF: basic level of interoperability
• Some constructs of OWL are key:
– Logical characteristics of properties: symmetric,
transitive, inverse
– Identity: sameAs
• OWL pitfalls
– Bad: if it is written in OWL it is an ontology
– Worse: if it is not in OWL, then it is not an
ontology
2. Scale principle: “Think large!”
"Once you have a truly massive amount of
information integrated as knowledge, then the
human-software system will be superhuman, in
the same sense that mankind with writing is
superhuman compared to mankind before
writing."
Doug Lenat
Applications require many ontologies
3. Pattern principle:
don’t try to be too creative!
• Ontology engineering should not be an art
but a discipline
• Patterns play a key role in methodology for
ontology engineering
• See for example patterns developed by the
W3C Semantic Web Best Practices group
http://www.w3.org/2001/sw/BestPractices/
SKOS:
pattern for thesaurus modeling
• Based on ISO standard
• RDF representation
• Documentation:
http://www.w3.org/TR/swbp-skos-core-guide/
• Base class: SKOS Concept
Multi-lingual labels for concepts
Semantic relation:
broader and narrower
• No subclass semantics assumed!
4. Enrichment principle
• Don’t modify, but add!
• Techniques:
– Learning ontology relations/mappings
– Semantic analysis, e.g. OntoClean
– Processing of scope notes in thesauri
Example enrichment
• Learning relations between art styles in AAT
and artists in ULAN through NLP of
art0historic texts
• But don’t learn things that already exist!
DERAIN, Andre
The Turning Road
MATISSE, Henri
Le Bonheur de vivre
Extracting additional knowledge from
scope notes
Hypothesis underlying Culture Web
• Semantic Web technology is in particular
useful in knowledge-rich domains
or formulated differently
• If we cannot show added value in
knowledge-rich domains, then it may have no
value at all
Baseline architecture for a
Semantic Culture Web
• Should be fully based on web standards
– XML, RDF/OWL, SVG, AJAX
• OWL use is typically limited
• Methodology for metadata conversion
– Information extraction
– Should be professional service
• Scalability is key issue
– 100+ collections is minimum
• New search paradigms
• Public annotation facilities
• Evaluation studies with stakeholders!
Culture Web demonstrator
http://e-culture.multimedian.nl
Small datasets already give
scalability issues
??
New search paradigms:
Relation search
Search in digital media archives:
typical use case
• A person searches for
photos of an “orange ape”
• An image collection of
animal photographs
contains snapshots of
orang-utans.
• The search engine finds
the photos, despite the fact
that the words “orange”
and “ape” do not appear in
annotations
Techniques that can be used
• Ontologies: explicit background knowledge
plus semantic annotation: semantic link
between annotated concept and vocabulary
• Natural-language processing: co-occurrence
of ‘orange”, “ape” and “orangutans”
• Image processing: e.g. detectors for “orange”
and “ape”
Observation: no single technique can solve
every problem!
Supporting annotation of broadcasts
through information extraction
• Current situation:
mainly manual
• Not feasible for large-
scale digital archiving
• Context documents
for programs can be
identified
• Can we generate
candidate
annotation?
• Example from
CHOICE project
ranked keywords rank
Governments 1
Soldiers 1
Prisoners of war 3
Ministers 3
Prime ministers 3
Prisons 4
Civil servants 4
Camps 5
Voting 5
Democratization 5
Missions 6
Agreements 7
Christians 8
Lakes 9
News papers 9
Writing 9
Users keywords N
User study Peace troops 6
Military operations 5
Armed forces 3
Government policy 2
soldiers 2
Expert
description Peace troops E
Military operations E
Ranking based on semantic
distance In thesaurus
Supporting annotation: Automatically
deriving spatial relations
Object1
left
Object2
Supporting annotation:
Recognizing color of cloths
Requires
Reliable
segmentation
Color value
from AAT
Visual WordNet (Stein et al.)
• Adding knowledge about visual
characteristics to WordNet: mobility,
color, …
• Build detectors for the visual features
• Use visual data to prune the tree of
categories when analyzing a visual
object
Sample visual features and their
mapping to WordNet
Experiment: pruning the search for
“conveyance” concepts
6 concepts found
Including taxi cab
12 concepts found
Including passenger train
and commuter train
Three visual features: material, motion, environment
Assumption is that these work perfectly
Concept detectors in video (Snoek et
al)
Lexicon is specific for news domain
LSCOM lexicon: 229 - Weather
LSCOM enrichment: mapping to WordNet
• 365 concept detectors (MediaMil/LSCOM)
• Manual mapping process, 2 subjects per
concept, 65% inter-subject overlap
• 273 matched to 1 WordNet concept
• 39 were union of 2+ concepts
Fish => wn:Fish OR wn:AquaticMammal
• 45 were intersection of 2+ concepts
MaleNewsSubject => wn:Male AND wn:Subject
• 8 were instances of a concept
John Kerry => wn:Senator
LSCOM lexicon: 110 – Female Anchor
• Combination of “Female”
and “Anchor person”
• Link to WordNet enables
use of WorldNet's semantic
network for LSCOM
concepts
Combining NLP, image analysis and
ontologies for selecting detectors
Building Finder: integrating image
analysis and textual sources
• Knoblock et al. (USC/ISI)
• Multiple heterogeneous sources
– Satellite images (Microsoft Terraservice)
– Road map info (US)
– Address information (white pages)
• Image analysis techniques to map satellite
data to road map
• RDF used for syntactic interoperability
Take home message
• There’s lots of existing semantics out there.
Use it!
• Think multi-disciplinary!
– Realistic applications require combination of
techniques
• In open knowledge-rich environments there
are loys of opportunities for SW technology
– But we have to take them
Ontologies for multimedia: the Semantic Culture Web
Ontologies for multimedia: the Semantic Culture Web

Mais conteúdo relacionado

Mais procurados

Principles and pragmatics of a Semantic Culture Web
 Principles and pragmatics of a Semantic Culture Web Principles and pragmatics of a Semantic Culture Web
Principles and pragmatics of a Semantic Culture WebGuus Schreiber
 
Agora User Committee Meeting 2013
Agora User Committee Meeting 2013Agora User Committee Meeting 2013
Agora User Committee Meeting 2013Lora Aroyo
 
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...Lora Aroyo
 
UVA MDST 3703 Thematic Research Collections 2012-09-18
UVA MDST 3703 Thematic Research Collections 2012-09-18UVA MDST 3703 Thematic Research Collections 2012-09-18
UVA MDST 3703 Thematic Research Collections 2012-09-18Rafael Alvarado
 
Bloggen dhd (von Laurent Romary)
Bloggen dhd  (von Laurent Romary)Bloggen dhd  (von Laurent Romary)
Bloggen dhd (von Laurent Romary)MaxWeberStiftung
 
Federating Cultures: Human Knowledge, Teachers, Students
Federating Cultures: Human Knowledge, Teachers, StudentsFederating Cultures: Human Knowledge, Teachers, Students
Federating Cultures: Human Knowledge, Teachers, StudentsJack Park
 
Dh presentation helig 2014
Dh presentation helig 2014Dh presentation helig 2014
Dh presentation helig 2014HELIGLIASA
 
Columbia.lippincott.2012
Columbia.lippincott.2012Columbia.lippincott.2012
Columbia.lippincott.2012JoanLippincott
 
Creating and Processing Digital Humanities Data
Creating and Processing Digital Humanities DataCreating and Processing Digital Humanities Data
Creating and Processing Digital Humanities DataAngela Zoss
 
WebART in 10 minutes
WebART in 10 minutesWebART in 10 minutes
WebART in 10 minutesJaap Kamps
 
Balboa Park Commons: Collaborative Digitization for a Public Resource
Balboa Park Commons: Collaborative Digitization for a Public ResourceBalboa Park Commons: Collaborative Digitization for a Public Resource
Balboa Park Commons: Collaborative Digitization for a Public ResourceAnna Chiaretta Lavatelli
 
Toward a World Wise Web
Toward a World Wise WebToward a World Wise Web
Toward a World Wise Webszpak
 
Digital Humanities: An Introduction
Digital Humanities: An IntroductionDigital Humanities: An Introduction
Digital Humanities: An IntroductionDilip Barad
 
Digital Humanities and “Digital” Social Sciences
Digital Humanities and “Digital” Social SciencesDigital Humanities and “Digital” Social Sciences
Digital Humanities and “Digital” Social SciencesChantal van Son
 
Digital collections and humanities research
Digital collections and humanities researchDigital collections and humanities research
Digital collections and humanities researchHarriett Green
 
HyperMembrane Structures for Open Source Cognitive Computing
HyperMembrane Structures for Open Source Cognitive ComputingHyperMembrane Structures for Open Source Cognitive Computing
HyperMembrane Structures for Open Source Cognitive ComputingJack Park
 
Adaptive Educational Hypermedia: From generation to generation
Adaptive Educational Hypermedia: From generation to generationAdaptive Educational Hypermedia: From generation to generation
Adaptive Educational Hypermedia: From generation to generationPeter Brusilovsky
 
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...Lora Aroyo
 

Mais procurados (20)

Principles and pragmatics of a Semantic Culture Web
 Principles and pragmatics of a Semantic Culture Web Principles and pragmatics of a Semantic Culture Web
Principles and pragmatics of a Semantic Culture Web
 
Agora User Committee Meeting 2013
Agora User Committee Meeting 2013Agora User Committee Meeting 2013
Agora User Committee Meeting 2013
 
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
 
UVA MDST 3703 Thematic Research Collections 2012-09-18
UVA MDST 3703 Thematic Research Collections 2012-09-18UVA MDST 3703 Thematic Research Collections 2012-09-18
UVA MDST 3703 Thematic Research Collections 2012-09-18
 
Bloggen dhd (von Laurent Romary)
Bloggen dhd  (von Laurent Romary)Bloggen dhd  (von Laurent Romary)
Bloggen dhd (von Laurent Romary)
 
Federating Cultures: Human Knowledge, Teachers, Students
Federating Cultures: Human Knowledge, Teachers, StudentsFederating Cultures: Human Knowledge, Teachers, Students
Federating Cultures: Human Knowledge, Teachers, Students
 
Dh presentation helig 2014
Dh presentation helig 2014Dh presentation helig 2014
Dh presentation helig 2014
 
Columbia.lippincott.2012
Columbia.lippincott.2012Columbia.lippincott.2012
Columbia.lippincott.2012
 
Creating and Processing Digital Humanities Data
Creating and Processing Digital Humanities DataCreating and Processing Digital Humanities Data
Creating and Processing Digital Humanities Data
 
Granada0611 digital humanities
Granada0611 digital humanitiesGranada0611 digital humanities
Granada0611 digital humanities
 
WebART in 10 minutes
WebART in 10 minutesWebART in 10 minutes
WebART in 10 minutes
 
Balboa Park Commons: Collaborative Digitization for a Public Resource
Balboa Park Commons: Collaborative Digitization for a Public ResourceBalboa Park Commons: Collaborative Digitization for a Public Resource
Balboa Park Commons: Collaborative Digitization for a Public Resource
 
Toward a World Wise Web
Toward a World Wise WebToward a World Wise Web
Toward a World Wise Web
 
Digital Humanities
Digital HumanitiesDigital Humanities
Digital Humanities
 
Digital Humanities: An Introduction
Digital Humanities: An IntroductionDigital Humanities: An Introduction
Digital Humanities: An Introduction
 
Digital Humanities and “Digital” Social Sciences
Digital Humanities and “Digital” Social SciencesDigital Humanities and “Digital” Social Sciences
Digital Humanities and “Digital” Social Sciences
 
Digital collections and humanities research
Digital collections and humanities researchDigital collections and humanities research
Digital collections and humanities research
 
HyperMembrane Structures for Open Source Cognitive Computing
HyperMembrane Structures for Open Source Cognitive ComputingHyperMembrane Structures for Open Source Cognitive Computing
HyperMembrane Structures for Open Source Cognitive Computing
 
Adaptive Educational Hypermedia: From generation to generation
Adaptive Educational Hypermedia: From generation to generationAdaptive Educational Hypermedia: From generation to generation
Adaptive Educational Hypermedia: From generation to generation
 
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...
 

Semelhante a Ontologies for multimedia: the Semantic Culture Web

Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataShenghui Wang
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsJon Voss
 
Fri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringFri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringeswcsummerschool
 
Lecture: Semantic Word Clouds
Lecture: Semantic Word CloudsLecture: Semantic Word Clouds
Lecture: Semantic Word CloudsMarina Santini
 
Social Semantic (Sensor) Web
Social Semantic (Sensor) WebSocial Semantic (Sensor) Web
Social Semantic (Sensor) WebDavid Crowley
 
From adaptive hypermedia to the adaptive Web
From adaptive hypermedia to the adaptive WebFrom adaptive hypermedia to the adaptive Web
From adaptive hypermedia to the adaptive WebPeter Brusilovsky
 
Doing DH in Theological Libraries
Doing DH in Theological LibrariesDoing DH in Theological Libraries
Doing DH in Theological LibrariesClifford Anderson
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and TechniquesBernhard Haslhofer
 
Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsJon Voss
 
Web-Scale Discovery: Post Implementation
Web-Scale Discovery: Post ImplementationWeb-Scale Discovery: Post Implementation
Web-Scale Discovery: Post ImplementationRachel Vacek
 
The Semantic Web: status and prospects
The Semantic Web: status and prospectsThe Semantic Web: status and prospects
The Semantic Web: status and prospectsGuus Schreiber
 
A Framework for Ontology Usage Analysis
A Framework for Ontology Usage AnalysisA Framework for Ontology Usage Analysis
A Framework for Ontology Usage AnalysisJamshaid Ashraf
 
Institutional Repository (IR) and Open Access in Academic Libraries
Institutional Repository (IR) and Open Access in Academic LibrariesInstitutional Repository (IR) and Open Access in Academic Libraries
Institutional Repository (IR) and Open Access in Academic LibrariesHong (Jenny) Jing
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Jon Voss
 
Breaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social SemanticsBreaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social SemanticsJohn Breslin
 
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...TimelessFuture
 

Semelhante a Ontologies for multimedia: the Semantic Culture Web (20)

Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadata
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
 
2014_WWW_BTOR
2014_WWW_BTOR2014_WWW_BTOR
2014_WWW_BTOR
 
Our World is Socio-technical
Our World is Socio-technicalOur World is Socio-technical
Our World is Socio-technical
 
Fri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringFri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineering
 
Lecture: Semantic Word Clouds
Lecture: Semantic Word CloudsLecture: Semantic Word Clouds
Lecture: Semantic Word Clouds
 
Social Semantic (Sensor) Web
Social Semantic (Sensor) WebSocial Semantic (Sensor) Web
Social Semantic (Sensor) Web
 
From adaptive hypermedia to the adaptive Web
From adaptive hypermedia to the adaptive WebFrom adaptive hypermedia to the adaptive Web
From adaptive hypermedia to the adaptive Web
 
NISO Virtual Conference: Web-Scale Discovery Services: Transforming Access to...
NISO Virtual Conference: Web-Scale Discovery Services: Transforming Access to...NISO Virtual Conference: Web-Scale Discovery Services: Transforming Access to...
NISO Virtual Conference: Web-Scale Discovery Services: Transforming Access to...
 
Doing DH in Theological Libraries
Doing DH in Theological LibrariesDoing DH in Theological Libraries
Doing DH in Theological Libraries
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and Techniques
 
Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & Museums
 
Web-Scale Discovery: Post Implementation
Web-Scale Discovery: Post ImplementationWeb-Scale Discovery: Post Implementation
Web-Scale Discovery: Post Implementation
 
The Semantic Web: status and prospects
The Semantic Web: status and prospectsThe Semantic Web: status and prospects
The Semantic Web: status and prospects
 
A Framework for Ontology Usage Analysis
A Framework for Ontology Usage AnalysisA Framework for Ontology Usage Analysis
A Framework for Ontology Usage Analysis
 
Institutional Repository (IR) and Open Access in Academic Libraries
Institutional Repository (IR) and Open Access in Academic LibrariesInstitutional Repository (IR) and Open Access in Academic Libraries
Institutional Repository (IR) and Open Access in Academic Libraries
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.
 
Breaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social SemanticsBreaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social Semantics
 
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
 
Ir1
Ir1Ir1
Ir1
 

Mais de Guus Schreiber

Ontologies: vehicles for reuse
Ontologies: vehicles for reuseOntologies: vehicles for reuse
Ontologies: vehicles for reuseGuus Schreiber
 
Linking historical ship records to a newspaper archive
Linking historical ship records to a newspaper archiveLinking historical ship records to a newspaper archive
Linking historical ship records to a newspaper archiveGuus Schreiber
 
CommonKADS project management
CommonKADS project managementCommonKADS project management
CommonKADS project managementGuus Schreiber
 
UML notations used by CommonKADS
UML notations used by CommonKADSUML notations used by CommonKADS
UML notations used by CommonKADSGuus Schreiber
 
Advanced knowledge modelling
Advanced knowledge modellingAdvanced knowledge modelling
Advanced knowledge modellingGuus Schreiber
 
CommonKADS design and implementation
CommonKADS design and implementationCommonKADS design and implementation
CommonKADS design and implementationGuus Schreiber
 
CommonKADS communication model
CommonKADS communication modelCommonKADS communication model
CommonKADS communication modelGuus Schreiber
 
CommonKADS knowledge modelling process
CommonKADS knowledge modelling processCommonKADS knowledge modelling process
CommonKADS knowledge modelling processGuus Schreiber
 
CommonKADS knowledge model templates
CommonKADS knowledge model templatesCommonKADS knowledge model templates
CommonKADS knowledge model templatesGuus Schreiber
 
CommonKADS knowledge modelling basics
CommonKADS knowledge modelling basicsCommonKADS knowledge modelling basics
CommonKADS knowledge modelling basicsGuus Schreiber
 
CommonKADS knowledge management
CommonKADS knowledge managementCommonKADS knowledge management
CommonKADS knowledge managementGuus Schreiber
 
CommonKADS context models
CommonKADS context modelsCommonKADS context models
CommonKADS context modelsGuus Schreiber
 
Semantic Web: From Representations to Applications
Semantic Web: From Representations to ApplicationsSemantic Web: From Representations to Applications
Semantic Web: From Representations to ApplicationsGuus Schreiber
 
E-Culture semantic search pilot
E-Culture semantic search pilotE-Culture semantic search pilot
E-Culture semantic search pilotGuus Schreiber
 
Ontology Engineering: Ontology Use
Ontology Engineering: Ontology UseOntology Engineering: Ontology Use
Ontology Engineering: Ontology UseGuus Schreiber
 
Ontology engineering: Ontology alignment
Ontology engineering: Ontology alignmentOntology engineering: Ontology alignment
Ontology engineering: Ontology alignmentGuus Schreiber
 
Ontology Engineering: Ontology evaluation
Ontology Engineering: Ontology evaluationOntology Engineering: Ontology evaluation
Ontology Engineering: Ontology evaluationGuus Schreiber
 
Ontology Engineering: ontology construction II
Ontology Engineering: ontology construction IIOntology Engineering: ontology construction II
Ontology Engineering: ontology construction IIGuus Schreiber
 

Mais de Guus Schreiber (20)

Ontologies: vehicles for reuse
Ontologies: vehicles for reuseOntologies: vehicles for reuse
Ontologies: vehicles for reuse
 
Linking historical ship records to a newspaper archive
Linking historical ship records to a newspaper archiveLinking historical ship records to a newspaper archive
Linking historical ship records to a newspaper archive
 
CommonKADS project management
CommonKADS project managementCommonKADS project management
CommonKADS project management
 
UML notations used by CommonKADS
UML notations used by CommonKADSUML notations used by CommonKADS
UML notations used by CommonKADS
 
Advanced knowledge modelling
Advanced knowledge modellingAdvanced knowledge modelling
Advanced knowledge modelling
 
CommonKADS design and implementation
CommonKADS design and implementationCommonKADS design and implementation
CommonKADS design and implementation
 
CommonKADS communication model
CommonKADS communication modelCommonKADS communication model
CommonKADS communication model
 
CommonKADS knowledge modelling process
CommonKADS knowledge modelling processCommonKADS knowledge modelling process
CommonKADS knowledge modelling process
 
CommonKADS knowledge model templates
CommonKADS knowledge model templatesCommonKADS knowledge model templates
CommonKADS knowledge model templates
 
CommonKADS knowledge modelling basics
CommonKADS knowledge modelling basicsCommonKADS knowledge modelling basics
CommonKADS knowledge modelling basics
 
CommonKADS knowledge management
CommonKADS knowledge managementCommonKADS knowledge management
CommonKADS knowledge management
 
CommonKADS context models
CommonKADS context modelsCommonKADS context models
CommonKADS context models
 
Introduction
IntroductionIntroduction
Introduction
 
Semantic Web: From Representations to Applications
Semantic Web: From Representations to ApplicationsSemantic Web: From Representations to Applications
Semantic Web: From Representations to Applications
 
E-Culture semantic search pilot
E-Culture semantic search pilotE-Culture semantic search pilot
E-Culture semantic search pilot
 
Vista-TV overview
Vista-TV overviewVista-TV overview
Vista-TV overview
 
Ontology Engineering: Ontology Use
Ontology Engineering: Ontology UseOntology Engineering: Ontology Use
Ontology Engineering: Ontology Use
 
Ontology engineering: Ontology alignment
Ontology engineering: Ontology alignmentOntology engineering: Ontology alignment
Ontology engineering: Ontology alignment
 
Ontology Engineering: Ontology evaluation
Ontology Engineering: Ontology evaluationOntology Engineering: Ontology evaluation
Ontology Engineering: Ontology evaluation
 
Ontology Engineering: ontology construction II
Ontology Engineering: ontology construction IIOntology Engineering: ontology construction II
Ontology Engineering: ontology construction II
 

Último

A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 

Último (20)

A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 

Ontologies for multimedia: the Semantic Culture Web

  • 1. Ontologies for multimedia:Ontologies for multimedia: the Semantic Culture Webthe Semantic Culture Web Guus SchreiberGuus Schreiber Free University AmsterdamFree University Amsterdam Co-chair W3C Semantic Web Deployment WGCo-chair W3C Semantic Web Deployment WG
  • 2. Overview • My target: a Semantic Culture Web • Ontology perspective: – Principles for ontology engineering on Web scale – Some remarks about web standards • Technologies for realizing a Culture Web – Ontology-based methods – Image analysis – NLP / information extraction – Combinations are key!
  • 3. Acknowledgements • MultimediaN E-Culture Project: – Alia Amin, Mark van Assem, Victor de Boer, Lynda Hardman, Michiel Hildebrand, Laura Hollink, Zhisheng Huang, Marco de Niet, Borys Omelayenko, Jacco van Ossenbruggen, Ronny Siebes, Jos Taekema, Anna Tordai, Jan Wielemaker, Bob Wielinga • CHOICE Project @ Sound & Vision – Hennie Brugman, Luit Gazendam, Veronique Malaise, Johan Oomen, Mettina Veenstra • MuNCH project @ Sound & Vision – Laura Hollink, Bouke Hunning, Michiel van Liempt, Johan Oomen Maarten de Rijke, Arnold Smeulders, Cees Snoek, Marcel Worring,
  • 5.
  • 6.
  • 7. Principles for ontology engineering in a distributed world
  • 8. 1. Modesty principle • Ontology engineers should refrain from developing their own idiosyncratic ontologies • Instead, they should make the available rich vocabularies, thesauri and databases available in web format • Initially, only add the originally intended semantics
  • 9.
  • 10. Implicit WordNet semantics “The ent operator specifies that the second synset is an entailment of first synset. This relation only holds for verbs. “ • Example: [breathe, inhale] entails [sneeze, exhale] • Semantics (OWL statements): – Transitive property – Inverse property: entailedBy – Value restrictions for VerbSynSet (subclass of SynSet)
  • 11. Recipes for vocabulary URIs • Simplified rule: – Use “hash" variant” for vocabularies that are relatively small and require frequent access http://www.w3.org/2004/02/skos/core#Concept – Use “slash” variant for large vocabularies, where you do not want always the whole vocabulary to be retrieved http://xmlns.com/foaf/0.1/Person • For more information and other recipes, see: http://www.w3.org/TR/swbp-vocab-pub/
  • 12. Query for WordNet URI returns “concept-bounded description”
  • 13. How useful are RDF and OWL? • RDF: basic level of interoperability • Some constructs of OWL are key: – Logical characteristics of properties: symmetric, transitive, inverse – Identity: sameAs • OWL pitfalls – Bad: if it is written in OWL it is an ontology – Worse: if it is not in OWL, then it is not an ontology
  • 14. 2. Scale principle: “Think large!” "Once you have a truly massive amount of information integrated as knowledge, then the human-software system will be superhuman, in the same sense that mankind with writing is superhuman compared to mankind before writing." Doug Lenat
  • 16. 3. Pattern principle: don’t try to be too creative! • Ontology engineering should not be an art but a discipline • Patterns play a key role in methodology for ontology engineering • See for example patterns developed by the W3C Semantic Web Best Practices group http://www.w3.org/2001/sw/BestPractices/
  • 17. SKOS: pattern for thesaurus modeling • Based on ISO standard • RDF representation • Documentation: http://www.w3.org/TR/swbp-skos-core-guide/ • Base class: SKOS Concept
  • 19. Semantic relation: broader and narrower • No subclass semantics assumed!
  • 20. 4. Enrichment principle • Don’t modify, but add! • Techniques: – Learning ontology relations/mappings – Semantic analysis, e.g. OntoClean – Processing of scope notes in thesauri
  • 21. Example enrichment • Learning relations between art styles in AAT and artists in ULAN through NLP of art0historic texts • But don’t learn things that already exist!
  • 22. DERAIN, Andre The Turning Road MATISSE, Henri Le Bonheur de vivre
  • 24. Hypothesis underlying Culture Web • Semantic Web technology is in particular useful in knowledge-rich domains or formulated differently • If we cannot show added value in knowledge-rich domains, then it may have no value at all
  • 25. Baseline architecture for a Semantic Culture Web • Should be fully based on web standards – XML, RDF/OWL, SVG, AJAX • OWL use is typically limited • Methodology for metadata conversion – Information extraction – Should be professional service • Scalability is key issue – 100+ collections is minimum • New search paradigms • Public annotation facilities • Evaluation studies with stakeholders!
  • 26.
  • 28. Small datasets already give scalability issues
  • 30.
  • 31. Search in digital media archives: typical use case • A person searches for photos of an “orange ape” • An image collection of animal photographs contains snapshots of orang-utans. • The search engine finds the photos, despite the fact that the words “orange” and “ape” do not appear in annotations
  • 32. Techniques that can be used • Ontologies: explicit background knowledge plus semantic annotation: semantic link between annotated concept and vocabulary • Natural-language processing: co-occurrence of ‘orange”, “ape” and “orangutans” • Image processing: e.g. detectors for “orange” and “ape” Observation: no single technique can solve every problem!
  • 33. Supporting annotation of broadcasts through information extraction • Current situation: mainly manual • Not feasible for large- scale digital archiving • Context documents for programs can be identified • Can we generate candidate annotation? • Example from CHOICE project
  • 34. ranked keywords rank Governments 1 Soldiers 1 Prisoners of war 3 Ministers 3 Prime ministers 3 Prisons 4 Civil servants 4 Camps 5 Voting 5 Democratization 5 Missions 6 Agreements 7 Christians 8 Lakes 9 News papers 9 Writing 9 Users keywords N User study Peace troops 6 Military operations 5 Armed forces 3 Government policy 2 soldiers 2 Expert description Peace troops E Military operations E Ranking based on semantic distance In thesaurus
  • 35. Supporting annotation: Automatically deriving spatial relations Object1 left Object2
  • 36. Supporting annotation: Recognizing color of cloths Requires Reliable segmentation Color value from AAT
  • 37. Visual WordNet (Stein et al.) • Adding knowledge about visual characteristics to WordNet: mobility, color, … • Build detectors for the visual features • Use visual data to prune the tree of categories when analyzing a visual object
  • 38. Sample visual features and their mapping to WordNet
  • 39. Experiment: pruning the search for “conveyance” concepts 6 concepts found Including taxi cab 12 concepts found Including passenger train and commuter train Three visual features: material, motion, environment Assumption is that these work perfectly
  • 40. Concept detectors in video (Snoek et al)
  • 41. Lexicon is specific for news domain LSCOM lexicon: 229 - Weather
  • 42. LSCOM enrichment: mapping to WordNet • 365 concept detectors (MediaMil/LSCOM) • Manual mapping process, 2 subjects per concept, 65% inter-subject overlap • 273 matched to 1 WordNet concept • 39 were union of 2+ concepts Fish => wn:Fish OR wn:AquaticMammal • 45 were intersection of 2+ concepts MaleNewsSubject => wn:Male AND wn:Subject • 8 were instances of a concept John Kerry => wn:Senator
  • 43. LSCOM lexicon: 110 – Female Anchor • Combination of “Female” and “Anchor person” • Link to WordNet enables use of WorldNet's semantic network for LSCOM concepts
  • 44. Combining NLP, image analysis and ontologies for selecting detectors
  • 45. Building Finder: integrating image analysis and textual sources • Knoblock et al. (USC/ISI) • Multiple heterogeneous sources – Satellite images (Microsoft Terraservice) – Road map info (US) – Address information (white pages) • Image analysis techniques to map satellite data to road map • RDF used for syntactic interoperability
  • 46.
  • 47. Take home message • There’s lots of existing semantics out there. Use it! • Think multi-disciplinary! – Realistic applications require combination of techniques • In open knowledge-rich environments there are loys of opportunities for SW technology – But we have to take them