SlideShare uma empresa Scribd logo
1 de 37
Baixar para ler offline
Evaluating Data Quality in Europeana:
Metrics for Multilinguality
Péter Király1
, Juliane Stiller2
, Valentine Charles3
, Werner Bailer4
, Nuno Freire5
1
Gesellschaft für wissenschaftliche Datenverarbeitung mbH Göttingen
2
Berlin School of Library and Information Science, Humboldt-Universität zu Berlin
3
Europeana Foundation, The Hague
4
Joanneum Research Forschungsgesellschaft mbH, Graz
5
INESC-ID, Lisbon
MTSR 2018 - Track on Cultural Collections and Applications, Limassol, Oct. 24, 2018
1
Nummertjes by Fabio (CC BY-NC 2.0)
Agenda
1. Europeana
2. Multilingual Information in Europeana’s Metadata
3. Multilinguality as a Facet of Quality Dimensions
4. Results
5. Demo
2
Europeana - Platform for
Cultural Heritage Material
www.europeana.eu
○ Books, newspapers, letters, paintings, photographs, radio shows, films,
etc.
○ Text, images, video, audio, sounds, 3D
○ Over 58 million objects
○ > 50 languages
Europeana - Facts
4
Multilingual Information in
Europeana’s Metadata
5
English cultural heritage object:
<dc:language>en</dc:language>
English cultural heritage object:
<dc:language>en</dc:language>
German metadata
Multilinguality on Field Level
<#record> a ore:Proxy ;
dc:subject “Ballet”, “Opera”@en
<#record> a ore:Proxy ; edm:europeanaProxy true ;
dc:subject <http://data.europeana.eu/concept/base/264>.
<http://data.europeana.eu/concept/base/264> a skos:Concept .
skos:prefLabel "Ballett"@no, "बैले"@hi, "Ballett"@de, "Балет"@be, "Балет"@ru,
"Balé"@pt, "Балет"@bg, "Baletas"@lt, "Balet"@hr, "Balets"@lv .
Europeana Dereferencing
Literal, literal with language tag
Processes Contributing to Multilinguality
dc: subject
“subject”@en
dc:creator
<http://vocab.getty.edu/...>
dc:type
<http://voc.example./…>
dc:subject
<http://dbpedia.org/
aSubjectID>
dc:subject
“Subject”
Data from Provider
dc:creator
new labels in
different languages
Data added by Europeana: dereferencing step
Quantifiable: “term”@language annotation
dc:subject
New labels in different
languages
Quantify Multilinguality of Data to:
○ Establish a sense of the multilingual reach of Europeana, incl.
distribution of languages
○ Identify the impact of different workflows / processes on
multilinguality of data
○ Take measures to improve multilinguality in data
○ Devise strategies for underrepresented languages
What Could be Measured?
○ Number of (distinct) languages in the metadata
○ Number of language-tagged literals
○ Tagged literals per language
○ Existence of language information fields such as dc:language
○ Consistency and conformity of language information
Multilinguality as a Facet of
Quality Dimensions
12
Completeness
○ This dimension:
○ expresses the number (fraction) of fields present in a dataset
○ identifies non-empty values in a record or (sub-)collection.
○ Multilingual completeness is captured by:
○ Presence of value in dc:language
○ Share of fields with language tags to overall available fields
Consistency
○ Describes the logical coherence of metadata
○ Assesses variety of language values in the dc:language field:
how many distinct values?
○ Contributes to features like language-based facet
Conformity
○ Describes the conformity to a given standard such as ISO-639-2
○ Example: English is expressed as: English, ENG, en, en-uk, …
○ Share of values that comply or do not comply
Accessibility
○ Access to information and data across languages
○ Distribution of linguistic information in metadata
○ Quantifying the language tag
○ The more language tags, the higher the multilingual reach
Dimensions, Criteria & Measures
Dimension Criteria Measure
Completeness Presence or absence of values in fields
relating to the language of the object or
the metadata
Share of multilingual fields to overall
fields
Presence or absence of dc:language
field
Consistency Variance in language notation Distinct language notations
Conformity Compliance to ISO-639-2 Share of values that comply
Accessibility Accessibility across languages
expressed through language tags
Number of distinct languages
Number of languages/Number of
tagged literals
tagged literals per language
Results
18
Data processing workflow
web interfacestatistical analysismeasuringingestion
★ OAI-PMH
★ Europeana API
★ Hadoop
★ NoSQL
★ Spark
★ Hadoop
★ Java
★ Apache Solr
★ Spark
★ R
★ PHP
★ D3.js
★ highchart.js
★ NoSQL
json csv json, png html, svg
20
DEMO
Questions
★ Contact
valentine.charles@europeana.eu
juliane.stiller@ibi.hu-berlin.de
werner.bailer@joanneum.at
peter.kiraly@gwdg.de
nfreire@gmail.com
★ Metadata Quality Assurance Framework
http://144.76.218.178/europeana-qa
★ Europeana Data Quality Committee
https://pro.europeana.eu/project/data-qu
ality-committee
22

Mais conteúdo relacionado

Mais procurados

Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Antoine Isaac
 
Multilingual challenges in Europeana
Multilingual challenges in EuropeanaMultilingual challenges in Europeana
Multilingual challenges in EuropeanaAntoine Isaac
 
L&P Dominique Berube & Tanja Niemann - Usability and Visibility: Adding Value...
L&P Dominique Berube & Tanja Niemann - Usability and Visibility: Adding Value...L&P Dominique Berube & Tanja Niemann - Usability and Visibility: Adding Value...
L&P Dominique Berube & Tanja Niemann - Usability and Visibility: Adding Value...CASRAI
 
A portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseA portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseAntoine Isaac
 
Talk of Europe: Linked data of the European Parliament
Talk of Europe:  Linked data of the European ParliamentTalk of Europe:  Linked data of the European Parliament
Talk of Europe: Linked data of the European ParliamentLaura Hollink
 
Multilinguality of Metadata. Measuring the Multilingual Degree of Europeana‘s...
Multilinguality of Metadata. Measuring the Multilingual Degree of Europeana‘s...Multilinguality of Metadata. Measuring the Multilingual Degree of Europeana‘s...
Multilinguality of Metadata. Measuring the Multilingual Degree of Europeana‘s...Péter Király
 
Bringing parliamentary debates to the Semantic Web
Bringing parliamentary debates to the Semantic WebBringing parliamentary debates to the Semantic Web
Bringing parliamentary debates to the Semantic WebLaura Hollink
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015Antoine Isaac
 
Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Antoine Isaac
 
Semantic Web, Linked Data: the Europeana case(s)
Semantic Web, Linked Data: the Europeana case(s)Semantic Web, Linked Data: the Europeana case(s)
Semantic Web, Linked Data: the Europeana case(s)Antoine Isaac
 
AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session Antoine Isaac
 
Building the Biblissima Observatory
Building the Biblissima ObservatoryBuilding the Biblissima Observatory
Building the Biblissima ObservatoryEquipex Biblissima
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaAntoine Isaac
 
Links, languages and semantics: linked data approaches in The European Libra...
Links, languages and semantics: linked data approaches in The European Libra...Links, languages and semantics: linked data approaches in The European Libra...
Links, languages and semantics: linked data approaches in The European Libra...Valentine Charles
 
Stiller & Király, Multilinguality of Metadata
Stiller & Király, Multilinguality of MetadataStiller & Király, Multilinguality of Metadata
Stiller & Király, Multilinguality of MetadataPéter Király
 
EHRI Project: Developing a Pan-European Archival Infrastructure for Holocaust...
EHRI Project: Developing a Pan-European Archival Infrastructure for Holocaust...EHRI Project: Developing a Pan-European Archival Infrastructure for Holocaust...
EHRI Project: Developing a Pan-European Archival Infrastructure for Holocaust...EHRI
 
Europeana DSI - LT-Accelerate 14
Europeana DSI -  LT-Accelerate 14Europeana DSI -  LT-Accelerate 14
Europeana DSI - LT-Accelerate 14Antoine Isaac
 

Mais procurados (20)

Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15Multilingual challenges for accessing digitized culture online - Riga Summit 15
Multilingual challenges for accessing digitized culture online - Riga Summit 15
 
Multilingual challenges in Europeana
Multilingual challenges in EuropeanaMultilingual challenges in Europeana
Multilingual challenges in Europeana
 
L&P Dominique Berube & Tanja Niemann - Usability and Visibility: Adding Value...
L&P Dominique Berube & Tanja Niemann - Usability and Visibility: Adding Value...L&P Dominique Berube & Tanja Niemann - Usability and Visibility: Adding Value...
L&P Dominique Berube & Tanja Niemann - Usability and Visibility: Adding Value...
 
A portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data caseA portrait of Europeana as a Linked Open Data case
A portrait of Europeana as a Linked Open Data case
 
Talk of Europe: Linked data of the European Parliament
Talk of Europe:  Linked data of the European ParliamentTalk of Europe:  Linked data of the European Parliament
Talk of Europe: Linked data of the European Parliament
 
Multilinguality of Metadata. Measuring the Multilingual Degree of Europeana‘s...
Multilinguality of Metadata. Measuring the Multilingual Degree of Europeana‘s...Multilinguality of Metadata. Measuring the Multilingual Degree of Europeana‘s...
Multilinguality of Metadata. Measuring the Multilingual Degree of Europeana‘s...
 
Bringing parliamentary debates to the Semantic Web
Bringing parliamentary debates to the Semantic WebBringing parliamentary debates to the Semantic Web
Bringing parliamentary debates to the Semantic Web
 
EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015EuropeanaTech update - Europeana AGM 2015
EuropeanaTech update - Europeana AGM 2015
 
NECTAR_VRE1
NECTAR_VRE1NECTAR_VRE1
NECTAR_VRE1
 
2013 05-23-knowledge triangle
2013 05-23-knowledge triangle2013 05-23-knowledge triangle
2013 05-23-knowledge triangle
 
Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13Data modelling at Europeana and DM2E - SMW13
Data modelling at Europeana and DM2E - SMW13
 
Semantic Web, Linked Data: the Europeana case(s)
Semantic Web, Linked Data: the Europeana case(s)Semantic Web, Linked Data: the Europeana case(s)
Semantic Web, Linked Data: the Europeana case(s)
 
AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session
 
Building the Biblissima Observatory
Building the Biblissima ObservatoryBuilding the Biblissima Observatory
Building the Biblissima Observatory
 
Enriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpediaEnriching Cultural Heritage Data with DBpedia
Enriching Cultural Heritage Data with DBpedia
 
Links, languages and semantics: linked data approaches in The European Libra...
Links, languages and semantics: linked data approaches in The European Libra...Links, languages and semantics: linked data approaches in The European Libra...
Links, languages and semantics: linked data approaches in The European Libra...
 
Stiller & Király, Multilinguality of Metadata
Stiller & Király, Multilinguality of MetadataStiller & Király, Multilinguality of Metadata
Stiller & Király, Multilinguality of Metadata
 
EHRI Project: Developing a Pan-European Archival Infrastructure for Holocaust...
EHRI Project: Developing a Pan-European Archival Infrastructure for Holocaust...EHRI Project: Developing a Pan-European Archival Infrastructure for Holocaust...
EHRI Project: Developing a Pan-European Archival Infrastructure for Holocaust...
 
Europeana DSI - LT-Accelerate 14
Europeana DSI -  LT-Accelerate 14Europeana DSI -  LT-Accelerate 14
Europeana DSI - LT-Accelerate 14
 
Organising a GLAM wiki
Organising a GLAM wikiOrganising a GLAM wiki
Organising a GLAM wiki
 

Semelhante a Evaluating Data Quality in Europeana: Metrics for Multilinguality

Evaluating Data Quality in Europeana: Metrics for Multilinguality (MTSR 2018)
Evaluating Data Quality in Europeana: Metrics for Multilinguality (MTSR 2018)Evaluating Data Quality in Europeana: Metrics for Multilinguality (MTSR 2018)
Evaluating Data Quality in Europeana: Metrics for Multilinguality (MTSR 2018)Péter Király
 
Data Quality Assessment in Europeana: Metrics for Multilinguality
Data Quality Assessment in Europeana:  Metrics for MultilingualityData Quality Assessment in Europeana:  Metrics for Multilinguality
Data Quality Assessment in Europeana: Metrics for MultilingualityJuliane Stiller
 
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...Europeana
 
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...Europeana
 
Is MT ready for e-Government? The Latvian Story. Indra Samite, Tilde
Is MT ready for e-Government? The Latvian Story. Indra Samite, TildeIs MT ready for e-Government? The Latvian Story. Indra Samite, Tilde
Is MT ready for e-Government? The Latvian Story. Indra Samite, TildeABBYY Language Serivces
 
Europeana 1914-1918, User-Generated Content and Linked Open Data
Europeana 1914-1918, User-Generated Content and Linked Open DataEuropeana 1914-1918, User-Generated Content and Linked Open Data
Europeana 1914-1918, User-Generated Content and Linked Open DataValentine Charles
 
Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...The European Library
 
2015-11-18 research seminar
2015-11-18 research seminar2015-11-18 research seminar
2015-11-18 research seminarifi8106tlu
 
Big (Language) Data – From research strategies to proof-of-concept and implem...
Big (Language) Data – From research strategies to proof-of-concept and implem...Big (Language) Data – From research strategies to proof-of-concept and implem...
Big (Language) Data – From research strategies to proof-of-concept and implem...LEARN Project
 
Digital humanities in Estonia: digital divide or linguistic isolation?
Digital humanities in Estonia: digital divide or linguistic isolation?Digital humanities in Estonia: digital divide or linguistic isolation?
Digital humanities in Estonia: digital divide or linguistic isolation?Mari Sarv
 
Hernani-iCorpora-PosterA1
Hernani-iCorpora-PosterA1Hernani-iCorpora-PosterA1
Hernani-iCorpora-PosterA1hpcosta
 
Annotated Bibliography Of Language Documentation
Annotated Bibliography Of Language DocumentationAnnotated Bibliography Of Language Documentation
Annotated Bibliography Of Language DocumentationSarah Marie
 
European Language Technologies – Past, Present and Future
European Language Technologies – Past, Present and FutureEuropean Language Technologies – Past, Present and Future
European Language Technologies – Past, Present and FutureGeorg Rehm
 
CLARIN Supporting Horizon Europe proposals
CLARIN Supporting Horizon Europe proposalsCLARIN Supporting Horizon Europe proposals
CLARIN Supporting Horizon Europe proposalsMartin Wynne
 
Human Language Technologies in a Multilingual Europe
Human Language Technologies in a Multilingual EuropeHuman Language Technologies in a Multilingual Europe
Human Language Technologies in a Multilingual EuropeGeorg Rehm
 
Models and Tools for Knowledge Reconstruction
Models and Tools for Knowledge ReconstructionModels and Tools for Knowledge Reconstruction
Models and Tools for Knowledge ReconstructionPaolo Nesi
 
Framing quality indicators for multilingual repositories of Open Educational ...
Framing quality indicators for multilingual repositories of Open Educational ...Framing quality indicators for multilingual repositories of Open Educational ...
Framing quality indicators for multilingual repositories of Open Educational ...Web2Learn
 
Framing quality indicators for multilingual repositories of Open Educational ...
Framing quality indicators for multilingual repositories of Open Educational ...Framing quality indicators for multilingual repositories of Open Educational ...
Framing quality indicators for multilingual repositories of Open Educational ...LangOER
 

Semelhante a Evaluating Data Quality in Europeana: Metrics for Multilinguality (20)

Evaluating Data Quality in Europeana: Metrics for Multilinguality (MTSR 2018)
Evaluating Data Quality in Europeana: Metrics for Multilinguality (MTSR 2018)Evaluating Data Quality in Europeana: Metrics for Multilinguality (MTSR 2018)
Evaluating Data Quality in Europeana: Metrics for Multilinguality (MTSR 2018)
 
Data Quality Assessment in Europeana: Metrics for Multilinguality
Data Quality Assessment in Europeana:  Metrics for MultilingualityData Quality Assessment in Europeana:  Metrics for Multilinguality
Data Quality Assessment in Europeana: Metrics for Multilinguality
 
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...
 
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
 
Is MT ready for e-Government? The Latvian Story. Indra Samite, Tilde
Is MT ready for e-Government? The Latvian Story. Indra Samite, TildeIs MT ready for e-Government? The Latvian Story. Indra Samite, Tilde
Is MT ready for e-Government? The Latvian Story. Indra Samite, Tilde
 
E-ARK: Open Data Mining for Government Archives
E-ARK: Open Data Mining for Government ArchivesE-ARK: Open Data Mining for Government Archives
E-ARK: Open Data Mining for Government Archives
 
Europeana 1914-1918, User-Generated Content and Linked Open Data
Europeana 1914-1918, User-Generated Content and Linked Open DataEuropeana 1914-1918, User-Generated Content and Linked Open Data
Europeana 1914-1918, User-Generated Content and Linked Open Data
 
Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...
 
2015-11-18 research seminar
2015-11-18 research seminar2015-11-18 research seminar
2015-11-18 research seminar
 
Big (Language) Data – From research strategies to proof-of-concept and implem...
Big (Language) Data – From research strategies to proof-of-concept and implem...Big (Language) Data – From research strategies to proof-of-concept and implem...
Big (Language) Data – From research strategies to proof-of-concept and implem...
 
Digital humanities in Estonia: digital divide or linguistic isolation?
Digital humanities in Estonia: digital divide or linguistic isolation?Digital humanities in Estonia: digital divide or linguistic isolation?
Digital humanities in Estonia: digital divide or linguistic isolation?
 
Hernani-iCorpora-PosterA1
Hernani-iCorpora-PosterA1Hernani-iCorpora-PosterA1
Hernani-iCorpora-PosterA1
 
Annotated Bibliography Of Language Documentation
Annotated Bibliography Of Language DocumentationAnnotated Bibliography Of Language Documentation
Annotated Bibliography Of Language Documentation
 
European Language Technologies – Past, Present and Future
European Language Technologies – Past, Present and FutureEuropean Language Technologies – Past, Present and Future
European Language Technologies – Past, Present and Future
 
CLARIN Supporting Horizon Europe proposals
CLARIN Supporting Horizon Europe proposalsCLARIN Supporting Horizon Europe proposals
CLARIN Supporting Horizon Europe proposals
 
Human Language Technologies in a Multilingual Europe
Human Language Technologies in a Multilingual EuropeHuman Language Technologies in a Multilingual Europe
Human Language Technologies in a Multilingual Europe
 
Models and Tools for Knowledge Reconstruction
Models and Tools for Knowledge ReconstructionModels and Tools for Knowledge Reconstruction
Models and Tools for Knowledge Reconstruction
 
Session5 03.george rehm
Session5 03.george rehmSession5 03.george rehm
Session5 03.george rehm
 
Framing quality indicators for multilingual repositories of Open Educational ...
Framing quality indicators for multilingual repositories of Open Educational ...Framing quality indicators for multilingual repositories of Open Educational ...
Framing quality indicators for multilingual repositories of Open Educational ...
 
Framing quality indicators for multilingual repositories of Open Educational ...
Framing quality indicators for multilingual repositories of Open Educational ...Framing quality indicators for multilingual repositories of Open Educational ...
Framing quality indicators for multilingual repositories of Open Educational ...
 

Mais de Juliane Stiller

KOBV-Forum 2022 - Digitale Inklusion von Menschen mit Fluchtbiografie
KOBV-Forum 2022 - Digitale Inklusion von Menschen mit FluchtbiografieKOBV-Forum 2022 - Digitale Inklusion von Menschen mit Fluchtbiografie
KOBV-Forum 2022 - Digitale Inklusion von Menschen mit FluchtbiografieJuliane Stiller
 
KOBV-Forum 2022 - Desinformationen im Gesundheitsbereich
KOBV-Forum 2022 - Desinformationen im GesundheitsbereichKOBV-Forum 2022 - Desinformationen im Gesundheitsbereich
KOBV-Forum 2022 - Desinformationen im GesundheitsbereichJuliane Stiller
 
Open Access in Museen. Vorteile der Offenheit und wie Museen mehr Offenheit w...
Open Access in Museen. Vorteile der Offenheit und wie Museen mehr Offenheit w...Open Access in Museen. Vorteile der Offenheit und wie Museen mehr Offenheit w...
Open Access in Museen. Vorteile der Offenheit und wie Museen mehr Offenheit w...Juliane Stiller
 
Berlin auf dem Weg zu Open Research
Berlin auf dem Weg zu Open ResearchBerlin auf dem Weg zu Open Research
Berlin auf dem Weg zu Open ResearchJuliane Stiller
 
Transfer informationswissenschaftlicher Fachkompetenz in die Praxis: Erfahrun...
Transfer informationswissenschaftlicher Fachkompetenz in die Praxis: Erfahrun...Transfer informationswissenschaftlicher Fachkompetenz in die Praxis: Erfahrun...
Transfer informationswissenschaftlicher Fachkompetenz in die Praxis: Erfahrun...Juliane Stiller
 
Cross-Lingual Bibliographic Search (CLuBS)
Cross-Lingual Bibliographic Search (CLuBS)Cross-Lingual Bibliographic Search (CLuBS)
Cross-Lingual Bibliographic Search (CLuBS)Juliane Stiller
 
Zur Bedeutung digitaler Kompetenzen von Geflüchteten bei der Jobsuche
Zur Bedeutung digitaler Kompetenzen von Geflüchteten bei der JobsucheZur Bedeutung digitaler Kompetenzen von Geflüchteten bei der Jobsuche
Zur Bedeutung digitaler Kompetenzen von Geflüchteten bei der JobsucheJuliane Stiller
 
Die Rolle digitaler Kompetenzen bei der Jobsuche: Ergebnisse aus einer Studie...
Die Rolle digitaler Kompetenzen bei der Jobsuche: Ergebnisse aus einer Studie...Die Rolle digitaler Kompetenzen bei der Jobsuche: Ergebnisse aus einer Studie...
Die Rolle digitaler Kompetenzen bei der Jobsuche: Ergebnisse aus einer Studie...Juliane Stiller
 
The Role of Information Literacy for the Integration of Refugees
The Role of Information Literacy for the Integration of RefugeesThe Role of Information Literacy for the Integration of Refugees
The Role of Information Literacy for the Integration of RefugeesJuliane Stiller
 
Query Translation for Cross-lingual Search in the Academic Search Engine PubP...
Query Translation for Cross-lingual Search in the Academic Search Engine PubP...Query Translation for Cross-lingual Search in the Academic Search Engine PubP...
Query Translation for Cross-lingual Search in the Academic Search Engine PubP...Juliane Stiller
 
Have You Hired a Refugee? - Hiring Success 2018 Europe
 Have You Hired a Refugee? - Hiring Success 2018 Europe  Have You Hired a Refugee? - Hiring Success 2018 Europe
Have You Hired a Refugee? - Hiring Success 2018 Europe Juliane Stiller
 
Integrating Refugee Migrants into the Labour Market: the Necessity of Digital...
Integrating Refugee Migrants into the Labour Market: the Necessity of Digital...Integrating Refugee Migrants into the Labour Market: the Necessity of Digital...
Integrating Refugee Migrants into the Labour Market: the Necessity of Digital...Juliane Stiller
 
Iconference 2018 stiller trkulja-digital literacy session-27-03
Iconference 2018 stiller trkulja-digital literacy session-27-03Iconference 2018 stiller trkulja-digital literacy session-27-03
Iconference 2018 stiller trkulja-digital literacy session-27-03Juliane Stiller
 
A Decade of Evaluating Europeana: Constructs, Contexts, Methods & Criteria
A Decade of Evaluating Europeana: Constructs, Contexts, Methods & CriteriaA Decade of Evaluating Europeana: Constructs, Contexts, Methods & Criteria
A Decade of Evaluating Europeana: Constructs, Contexts, Methods & CriteriaJuliane Stiller
 

Mais de Juliane Stiller (14)

KOBV-Forum 2022 - Digitale Inklusion von Menschen mit Fluchtbiografie
KOBV-Forum 2022 - Digitale Inklusion von Menschen mit FluchtbiografieKOBV-Forum 2022 - Digitale Inklusion von Menschen mit Fluchtbiografie
KOBV-Forum 2022 - Digitale Inklusion von Menschen mit Fluchtbiografie
 
KOBV-Forum 2022 - Desinformationen im Gesundheitsbereich
KOBV-Forum 2022 - Desinformationen im GesundheitsbereichKOBV-Forum 2022 - Desinformationen im Gesundheitsbereich
KOBV-Forum 2022 - Desinformationen im Gesundheitsbereich
 
Open Access in Museen. Vorteile der Offenheit und wie Museen mehr Offenheit w...
Open Access in Museen. Vorteile der Offenheit und wie Museen mehr Offenheit w...Open Access in Museen. Vorteile der Offenheit und wie Museen mehr Offenheit w...
Open Access in Museen. Vorteile der Offenheit und wie Museen mehr Offenheit w...
 
Berlin auf dem Weg zu Open Research
Berlin auf dem Weg zu Open ResearchBerlin auf dem Weg zu Open Research
Berlin auf dem Weg zu Open Research
 
Transfer informationswissenschaftlicher Fachkompetenz in die Praxis: Erfahrun...
Transfer informationswissenschaftlicher Fachkompetenz in die Praxis: Erfahrun...Transfer informationswissenschaftlicher Fachkompetenz in die Praxis: Erfahrun...
Transfer informationswissenschaftlicher Fachkompetenz in die Praxis: Erfahrun...
 
Cross-Lingual Bibliographic Search (CLuBS)
Cross-Lingual Bibliographic Search (CLuBS)Cross-Lingual Bibliographic Search (CLuBS)
Cross-Lingual Bibliographic Search (CLuBS)
 
Zur Bedeutung digitaler Kompetenzen von Geflüchteten bei der Jobsuche
Zur Bedeutung digitaler Kompetenzen von Geflüchteten bei der JobsucheZur Bedeutung digitaler Kompetenzen von Geflüchteten bei der Jobsuche
Zur Bedeutung digitaler Kompetenzen von Geflüchteten bei der Jobsuche
 
Die Rolle digitaler Kompetenzen bei der Jobsuche: Ergebnisse aus einer Studie...
Die Rolle digitaler Kompetenzen bei der Jobsuche: Ergebnisse aus einer Studie...Die Rolle digitaler Kompetenzen bei der Jobsuche: Ergebnisse aus einer Studie...
Die Rolle digitaler Kompetenzen bei der Jobsuche: Ergebnisse aus einer Studie...
 
The Role of Information Literacy for the Integration of Refugees
The Role of Information Literacy for the Integration of RefugeesThe Role of Information Literacy for the Integration of Refugees
The Role of Information Literacy for the Integration of Refugees
 
Query Translation for Cross-lingual Search in the Academic Search Engine PubP...
Query Translation for Cross-lingual Search in the Academic Search Engine PubP...Query Translation for Cross-lingual Search in the Academic Search Engine PubP...
Query Translation for Cross-lingual Search in the Academic Search Engine PubP...
 
Have You Hired a Refugee? - Hiring Success 2018 Europe
 Have You Hired a Refugee? - Hiring Success 2018 Europe  Have You Hired a Refugee? - Hiring Success 2018 Europe
Have You Hired a Refugee? - Hiring Success 2018 Europe
 
Integrating Refugee Migrants into the Labour Market: the Necessity of Digital...
Integrating Refugee Migrants into the Labour Market: the Necessity of Digital...Integrating Refugee Migrants into the Labour Market: the Necessity of Digital...
Integrating Refugee Migrants into the Labour Market: the Necessity of Digital...
 
Iconference 2018 stiller trkulja-digital literacy session-27-03
Iconference 2018 stiller trkulja-digital literacy session-27-03Iconference 2018 stiller trkulja-digital literacy session-27-03
Iconference 2018 stiller trkulja-digital literacy session-27-03
 
A Decade of Evaluating Europeana: Constructs, Contexts, Methods & Criteria
A Decade of Evaluating Europeana: Constructs, Contexts, Methods & CriteriaA Decade of Evaluating Europeana: Constructs, Contexts, Methods & Criteria
A Decade of Evaluating Europeana: Constructs, Contexts, Methods & Criteria
 

Último

Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 

Último (20)

Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 

Evaluating Data Quality in Europeana: Metrics for Multilinguality

  • 1. Evaluating Data Quality in Europeana: Metrics for Multilinguality Péter Király1 , Juliane Stiller2 , Valentine Charles3 , Werner Bailer4 , Nuno Freire5 1 Gesellschaft für wissenschaftliche Datenverarbeitung mbH Göttingen 2 Berlin School of Library and Information Science, Humboldt-Universität zu Berlin 3 Europeana Foundation, The Hague 4 Joanneum Research Forschungsgesellschaft mbH, Graz 5 INESC-ID, Lisbon MTSR 2018 - Track on Cultural Collections and Applications, Limassol, Oct. 24, 2018 1 Nummertjes by Fabio (CC BY-NC 2.0)
  • 2. Agenda 1. Europeana 2. Multilingual Information in Europeana’s Metadata 3. Multilinguality as a Facet of Quality Dimensions 4. Results 5. Demo 2
  • 3. Europeana - Platform for Cultural Heritage Material www.europeana.eu
  • 4. ○ Books, newspapers, letters, paintings, photographs, radio shows, films, etc. ○ Text, images, video, audio, sounds, 3D ○ Over 58 million objects ○ > 50 languages Europeana - Facts 4
  • 6. English cultural heritage object: <dc:language>en</dc:language>
  • 7. English cultural heritage object: <dc:language>en</dc:language> German metadata
  • 8. Multilinguality on Field Level <#record> a ore:Proxy ; dc:subject “Ballet”, “Opera”@en <#record> a ore:Proxy ; edm:europeanaProxy true ; dc:subject <http://data.europeana.eu/concept/base/264>. <http://data.europeana.eu/concept/base/264> a skos:Concept . skos:prefLabel "Ballett"@no, "बैले"@hi, "Ballett"@de, "Балет"@be, "Балет"@ru, "Balé"@pt, "Балет"@bg, "Baletas"@lt, "Balet"@hr, "Balets"@lv . Europeana Dereferencing Literal, literal with language tag
  • 9. Processes Contributing to Multilinguality dc: subject “subject”@en dc:creator <http://vocab.getty.edu/...> dc:type <http://voc.example./…> dc:subject <http://dbpedia.org/ aSubjectID> dc:subject “Subject” Data from Provider dc:creator new labels in different languages Data added by Europeana: dereferencing step Quantifiable: “term”@language annotation dc:subject New labels in different languages
  • 10. Quantify Multilinguality of Data to: ○ Establish a sense of the multilingual reach of Europeana, incl. distribution of languages ○ Identify the impact of different workflows / processes on multilinguality of data ○ Take measures to improve multilinguality in data ○ Devise strategies for underrepresented languages
  • 11. What Could be Measured? ○ Number of (distinct) languages in the metadata ○ Number of language-tagged literals ○ Tagged literals per language ○ Existence of language information fields such as dc:language ○ Consistency and conformity of language information
  • 12. Multilinguality as a Facet of Quality Dimensions 12
  • 13. Completeness ○ This dimension: ○ expresses the number (fraction) of fields present in a dataset ○ identifies non-empty values in a record or (sub-)collection. ○ Multilingual completeness is captured by: ○ Presence of value in dc:language ○ Share of fields with language tags to overall available fields
  • 14. Consistency ○ Describes the logical coherence of metadata ○ Assesses variety of language values in the dc:language field: how many distinct values? ○ Contributes to features like language-based facet
  • 15. Conformity ○ Describes the conformity to a given standard such as ISO-639-2 ○ Example: English is expressed as: English, ENG, en, en-uk, … ○ Share of values that comply or do not comply
  • 16. Accessibility ○ Access to information and data across languages ○ Distribution of linguistic information in metadata ○ Quantifying the language tag ○ The more language tags, the higher the multilingual reach
  • 17. Dimensions, Criteria & Measures Dimension Criteria Measure Completeness Presence or absence of values in fields relating to the language of the object or the metadata Share of multilingual fields to overall fields Presence or absence of dc:language field Consistency Variance in language notation Distinct language notations Conformity Compliance to ISO-639-2 Share of values that comply Accessibility Accessibility across languages expressed through language tags Number of distinct languages Number of languages/Number of tagged literals tagged literals per language
  • 19.
  • 20. Data processing workflow web interfacestatistical analysismeasuringingestion ★ OAI-PMH ★ Europeana API ★ Hadoop ★ NoSQL ★ Spark ★ Hadoop ★ Java ★ Apache Solr ★ Spark ★ R ★ PHP ★ D3.js ★ highchart.js ★ NoSQL json csv json, png html, svg 20
  • 21. DEMO
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35.
  • 36.
  • 37. Questions ★ Contact valentine.charles@europeana.eu juliane.stiller@ibi.hu-berlin.de werner.bailer@joanneum.at peter.kiraly@gwdg.de nfreire@gmail.com ★ Metadata Quality Assurance Framework http://144.76.218.178/europeana-qa ★ Europeana Data Quality Committee https://pro.europeana.eu/project/data-qu ality-committee 22