Anna Karenina in Ontology Matching

•Transferir como PPTX, PDF•

1 gostou•364 visualizações

The document discusses the challenges of vocabulary alignment between ontologies. It notes that current systems use complex reasoning that does not scale well to large vocabularies and makes results difficult to explain. While alignment failures occur for different reasons each time, the document argues that interactive alignment involving domain experts can address this problem and be applied successfully even to large datasets. It suggests that the current evaluation protocol does not adequately assess interactive features or account for the human roles involved in ontology development and use. An example alignment between the AAT and WordNet ontologies is provided.

Diversão e humor

The Anna Karenina problem in vocabulary alignment:

“Happy alignments are all alike;
every unhappy alignment is
unhappy in its own way”

Jacco van Ossenbruggen Panel at the Ontology Matching
CWI & VU University Amsterdam workshop, ISWC 2012

OAEI VLCR track
• 2008: 1 participant
• 2009: 2 participants
• 2010, 2011

1

OAEI Library track
• 2008: 3 participants
• 2009: 1 participant
• 2010, 2011
• 2012: It’s back!

2

OAEI Directory track

from: Results of the Ontology Alignment Evaluation Initiative 2010
JérômeEuzenat, Alfio Ferrara, Christian Meilicke, Juan Pane, François Scharffe, PavelShvaiko,
HeinerStuckenschmidt, OndřejŠváb-Zamazal, VojtěchSvátek and CássiaTrojahn dos Santos
3

Observations
• Current systems are complex
reasoning engines that combine
multiple strategies in some “smart” way
• This “smartness” has major drawbacks:
– does not scale on large vocabularies
– hard to predict if it will work for your data
– hard to explain results afterwards:
what went wrong, why & how to fix it

4

Bad news, good news
• Bad news:
– alignments fail for different reasons every time
– solving this is an AI-complete problem
– requires knowledge that is in the heads of the
domain experts, not in the data
• Good news:
– with experts on board, it is not that difficult
– we can even do large datasets interactively
– users are willing to spend time to get it right

5

Evaluation
• Current evaluation protocol
– is not suited for evaluating interactive features
– has abstracted away all human parties involved
• ontology publishers
• application developers
• application users
– ignores that ontology publishers are often willing
to spend serious time & effort on alignment
process

http://semanticweb.cs.vu.nl/lod/tpdl2011/ 6

Example: AAT to WordNet
• aat:restorer
altLabels: restaurateur (fr), Restaurator (de) , hersteller (nl), ...
scopeNote: Those engaged in making changes to an object or structure so
that it will closely approximate its state at a specific time in its history.
(...)
When changes made are to prevent further deterioration, see
"preservationists." More generally, for those who undertake
treatment, preventive care, and research directed toward long-term
safekeeping of cultural and natural heritage, see "conservators."
• wn:restorer
synonyms: refinisher, renovator, restorer, preserver
gloss: a skilled worker who is employed to restore or refinish buildings or
antique furniture.

http://semanticweb.cs.vu.nl/lod/tpdl2011/ 7

Mais conteúdo relacionado

Semelhante a Anna Karenina in Ontology Matching

Report of the second FAIRDOM foundryFAIRDOM

Some perspectives from the Astropy ProjectKelle Cruz

Open hpi semweb-06-part2Nadine Ludwig

Big Data Standards - Workshop, ExpBio, Boston, 2015Susanna-Assunta Sansone

Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumAnita de Waard

Dey alexander usability_training_notes_01danamato

Designing for those digging rocks, pirouetting and saving lives: Our design p...Elizabeth Chesters

Fundamentals of software sustainabilityDaniel S. Katz

U mpresBui Hieu

Lecture: Semantic Word CloudsMarina Santini

eResearchDenis Gillet

Pal gov.tutorial4.session8 2.stepwisemethodologiesMustafa Jarrar

asdfasTianwei_liu

BlahTianwei_liu

S3 knowledge value-generation-discussion_reportDigital Business Innovation Community

Impact the UX of Your Website with Contextual InquiryRachel Vacek

Multi task learning stepping away from narrow expert models 7.11.18Cloudera, Inc.

OOR--Open-Ontology-Repository--jun2010Peter Yim

Scientific Information Management at the U.S. Geological SurveyDave Govoni

A Pragmatic Perspective on Software VisualizationArie van Deursen

Semelhante a Anna Karenina in Ontology Matching (20)

Report of the second FAIRDOM foundry

Some perspectives from the Astropy Project

Open hpi semweb-06-part2

Big Data Standards - Workshop, ExpBio, Boston, 2015

Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum

Dey alexander usability_training_notes_01

Designing for those digging rocks, pirouetting and saving lives: Our design p...

Fundamentals of software sustainability

U mpres

Lecture: Semantic Word Clouds

eResearch

Pal gov.tutorial4.session8 2.stepwisemethodologies

asdfas

Blah

S3 knowledge value-generation-discussion_report

Impact the UX of Your Website with Contextual Inquiry

Multi task learning stepping away from narrow expert models 7.11.18

OOR--Open-Ontology-Repository--jun2010

Scientific Information Management at the U.S. Geological Survey

A Pragmatic Perspective on Software Visualization

Mais de Jacco van Ossenbruggen

Cultural AI - KB College 2 july 2019 (Dutch)Jacco van Ossenbruggen

The Nature of Digitally-Produced Data: Towards Social-Scientific Tool CriticismJacco van Ossenbruggen

#kbdata: Exploring potential impact of technology limitations on DH researchJacco van Ossenbruggen

Gist 16-march-2015-jaccoJacco van Ossenbruggen

Using Semantic Web Technologies to Reproduce a Pharmacovigilance Case StudyJacco van Ossenbruggen

Een semantisch Web voor archieven:bouw bruggen, geen muren Jacco van Ossenbruggen

Mais de Jacco van Ossenbruggen (6)

Cultural AI - KB College 2 july 2019 (Dutch)

The Nature of Digitally-Produced Data: Towards Social-Scientific Tool Criticism

#kbdata: Exploring potential impact of technology limitations on DH research

Gist 16-march-2015-jacco

Using Semantic Web Technologies to Reproduce a Pharmacovigilance Case Study

Een semantisch Web voor archieven:bouw bruggen, geen muren

Último

Russian Escorts Agency In Goa 💚 9316020077 💚 Russian Call Girl Goasexy call girls service in goa

Call Girls Manjri Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi

Model Call Girls In Ariyalur WhatsApp Booking 7427069034 call girl service 24... Shivani Pandey

↑Top Model (Kolkata) Call Girls Behala ⟟ 8250192130 ⟟ High Class Call Girl In...noor ahmed

Independent Sonagachi Escorts ✔ 9332606886✔ Full Night With Room Online Booki...Riya Pathan

VIP Model Call Girls Budhwar Peth ( Pune ) Call ON 8005736733 Starting From 5...SUHANI PANDEY

Goa Call "Girls Service 9316020077 Call "Girls in Goasexy call girls service in goa

Goa Call Girls 9316020077 Call Girls In Goa By Russian Call Girl in goarussian goa call girl and escorts service

Nayabad Call Girls ✔ 8005736733 ✔ Hot Model With Sexy Bhabi Ready For Sex At ...aamir

College Call Girls Pune 8617697112 Short 1500 Night 6000 Best call girls ServiceNitya salvi

↑Top Model (Kolkata) Call Girls Sonagachi ⟟ 8250192130 ⟟ High Class Call Girl...noor ahmed

Model Call Girls In Pazhavanthangal WhatsApp Booking 7427069034 call girl ser... Shivani Pandey

Book Paid Sonagachi Call Girls Kolkata 𖠋 8250192130 𖠋Low Budget Full Independ...noor ahmed

Top Rated Pune Call Girls Dhayari ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...Call Girls in Nagpur High Profile

Dakshineswar Call Girls ✔ 8005736733 ✔ Hot Model With Sexy Bhabi Ready For Se...aamir

Top Rated Pune Call Girls Pimpri Chinchwad ⟟ 6297143586 ⟟ Call Me For Genuin...Call Girls in Nagpur High Profile

Hotel And Home Service Available Kolkata Call Girls South End Park ✔ 62971435...ritikasharma

𓀤Call On 6297143586 𓀤 Sonagachi Call Girls In All Kolkata 24/7 Provide Call W...rahim quresi

𓀤Call On 6297143586 𓀤 Ultadanga Call Girls In All Kolkata 24/7 Provide Call W...rahim quresi

↑Top Model (Kolkata) Call Girls Rajpur ⟟ 8250192130 ⟟ High Class Call Girl In...noor ahmed

Anna Karenina in Ontology Matching

1. The Anna Karenina problem in vocabulary alignment: “Happy alignments are all alike; every unhappy alignment is unhappy in its own way” Jacco van Ossenbruggen Panel at the Ontology Matching CWI & VU University Amsterdam workshop, ISWC 2012

2. OAEI VLCR track • 2008: 1 participant • 2009: 2 participants • 2010, 2011 1

3. OAEI Library track • 2008: 3 participants • 2009: 1 participant • 2010, 2011 • 2012: It’s back! 2

4. OAEI Directory track from: Results of the Ontology Alignment Evaluation Initiative 2010 JérômeEuzenat, Alfio Ferrara, Christian Meilicke, Juan Pane, François Scharffe, PavelShvaiko, HeinerStuckenschmidt, OndřejŠváb-Zamazal, VojtěchSvátek and CássiaTrojahn dos Santos 3

5. Observations • Current systems are complex reasoning engines that combine multiple strategies in some “smart” way • This “smartness” has major drawbacks: – does not scale on large vocabularies – hard to predict if it will work for your data – hard to explain results afterwards: what went wrong, why & how to fix it 4

6. Bad news, good news • Bad news: – alignments fail for different reasons every time – solving this is an AI-complete problem – requires knowledge that is in the heads of the domain experts, not in the data • Good news: – with experts on board, it is not that difficult – we can even do large datasets interactively – users are willing to spend time to get it right 5

7. Evaluation • Current evaluation protocol – is not suited for evaluating interactive features – has abstracted away all human parties involved • ontology publishers • application developers • application users – ignores that ontology publishers are often willing to spend serious time & effort on alignment process http://semanticweb.cs.vu.nl/lod/tpdl2011/ 6

8. Example: AAT to WordNet • aat:restorer altLabels: restaurateur (fr), Restaurator (de) , hersteller (nl), ... scopeNote: Those engaged in making changes to an object or structure so that it will closely approximate its state at a specific time in its history. (...) When changes made are to prevent further deterioration, see "preservationists." More generally, for those who undertake treatment, preventive care, and research directed toward long-term safekeeping of cultural and natural heritage, see "conservators." • wn:restorer synonyms: refinisher, renovator, restorer, preserver gloss: a skilled worker who is employed to restore or refinish buildings or antique furniture. http://semanticweb.cs.vu.nl/lod/tpdl2011/ 7

Notas do Editor

Teaser slide for conference announcementsIn Europeana we have tried to align vocabularies using off the shelf tools.That is, we tried to find for each term in a source vocabulary, a similar term in the target vocabularyFor the first two vocabularies this failed, post mortem analysis learned this was because there was something unique in the data the tool could deal with.The next pair failed too, but for different even more unique, reasons. Etc etc,This became known as the Anna Karenina problem
State of the art in alignment:Ontology Alignment Evaluation InitiativeBad omen: “our” tracks, Library and VLRC, had ≤2 participants in 2008 and 2009 and were not organized or cancelled in 2010…
State of the art in alignment:Ontology Alignment Evaluation InitiativeBad omen: “our” tracks, Library and VLRC, had ≤2 participants in 2008 and 2009, and were not organized or cancelled in 2010…
State of the art in alignment:Ontology Alignment Evaluation InitiativeBad omen: “our” tracks, Library and VLRC, had ≤2 participants in 2008 and 2009, and were not organized or cancelled in 2010…
Sound like pretty similar, untill you read the full AAT scope note

Anna Karenina in Ontology Matching

Recomendados

Recomendados

Mais conteúdo relacionado

Semelhante a Anna Karenina in Ontology Matching

Semelhante a Anna Karenina in Ontology Matching (20)

Mais de Jacco van Ossenbruggen

Mais de Jacco van Ossenbruggen (6)

Último

Último (20)

Anna Karenina in Ontology Matching

Notas do Editor