SlideShare uma empresa Scribd logo
1 de 40
Best Practices for Multilingual
Linked Open Data


       Jose Emilio Labra Gayo
      University of Oviedo, Spain
          http://www.di.uniovi.es/~labra
Towards the web of data




                              Web of documents                    Web of Data
      Unit of information: Web page (HTML)               Unit of information: data (RDF)
      Human readable                                     Machine readable
      Challenge: Multilingual pages                      Intrinsically Multilingual


Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
Example
     <html lang="en">          English                                     <html lang="es">            Espanish
     <body>                                                                <body>
     <h1>Juan's Home page</h1>                                             <h1>Página personal de Juan</h1>

     <p>Juan is a Professor at the                                         <p>Juan es Catedrático en la
     University of Oviedo, Spain</p>                                          Universidad de Oviedo, España</p>

     <p>Phone: +34-1234567</p>                                             <p>Tlfno: +34-1234567</p>
     </body>                                                               </body>
     </html>                                                               </html>




                      http://uniovi.es/people#juan

                                        foaf:phone
                                                         tel:+34-1234567



Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
Multilingual data
          Data that appears in a multilingual context
                    It contains labels/comments
                    Human-readable information
                    Using different languages/conventions




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
Example of Multilingual Data
     <html lang="en">          English                                      <html lang="es">             Spanish
     <body>                                                                 <body>
     <h1>Juan's Home page</h1>                                              <h1>Página personal de Juan</h1>

     <p>Juan is a Professor at the                                          <p>Juan es Catedrático en la
     University of Oviedo, Spain</p>                                           Universidad de Oviedo, España</p>

     <p>Phone: +34-1234567</p>                                              <p>Tlfno: +34-1234567</p>
     </body>                                                                </body>
     </html>                                                                </html>


            http://uniovi.es/people#juan
                                                                                          Web of Data
    ex:position                                               ex:position
                                                                               Unit of information: data (RDF)
                                                                               Human + Machine readable
         "Professor"@en                                  "Catedrático"@es
                                                                               New Challenge: Multilingual


Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
Linked Open Data
          Principles on how to publish data
          Increasing adoption




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
Best practices for LOD
          Several proposals:
                    Linked data book [Heath, Bizer, 2011]
                    Linked data patterns [Dodds, Davis, 2012]
                    Best Practices for Publishing Linked Data [Hyland et al]
                    SemWeb Rules of thumb [R. Cyganiak]
                    etc. . .
          In this talk
                    Best practices affected by multilinguality



Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
Multilingual LOD patterns
          1. Design a good IRI scheme
          2. Separate domains by language
          3. Model resources, not labels
          4. Provide human-readable info
          5. Labels for all
          6. Use Multilingual literals
          7. Language Content negotiation
          8. Literals without language
          9. Multilingual vocabularies
          10. Interlanguage links
Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
1. Design a good IRI scheme
          Cool URIs                                      Cool IRIs
                    Don't change
                    Identify things
                    If possible, use human-readable IRIs

                                                http://dbpedia.org/resource/Armenia

                                               http://դբպեդիա.օրգ/րեսօուրսե/Հայաստան




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
1. Design a good IRI scheme
          Most datasets use only URIs
          IRIs may be difficult to maintain
                    Domain names, phising, …
                    IRI support in current libraries
                    Some hybrid solutions

                            http://dbpedia.org/resource/Armenia

                            http://dbpedia.org/resource/Հայաստան

                            http://դբպեդիա.օրգ/րեսօուրսե/Հայաստան
Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
2. Separate domains by language
                    Instead of

                         http://dbpedia.org/resource/Հայաստան

                    Language dependant URIs
                         http://en.dbpedia.org/resource/Armenia

                         http://hy.dbpedia.org/resource/Հայաստան


                              Language identifiers ≠ Country identifiers
                                 Example: Use "hy" instead of "am"

Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
2. Separate domains by language
          Where should we put the language tag?
                  http://hy.dbpedia.org/resource/Հայաստան
                  http://dbpedia.org/resource/hy/Հայաստան

                  http://dbpedia.org/resource/Հայաստան/hy




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
3. Interlanguage links
          Provide links between concepts in different
            languages


         http://dbpedia.org/resource/Armenia
                                                         owl:sameAs

                                  http://hy.dbpedia.org/resource/Հայաստան




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
3. Interlanguage links
          Beware of cross-lingual mappings
                  Example:

                                         Concept of             Concept of
                                         professor in     ≠    professor in
                                        english culture       spanish culture


                             "Professor"@en                        "Profesor"@es

                Use other properties to link:
                          dbo:interlanguageLink
                          rdfs:seeAlso
                          skos:related
                          ...
Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
4. Model resources, not labels
          Define URIs only for resources
                  Resources do not depend on a given language
                  Assign labels to those resources
          Do not mint separate URIs for labels




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
4. Model resources, not labels
                          e:workPlace                           http://example.org/UniversityOfOviedo

     http://uniovi.es/people#juan

                       e:workPlace                              http://example.org/UniversidadDeOviedo




      http://uniovi.es/people#juan

                         e:workPlace                              http://example.org/Uniovi
                                                   rdfs:label                          rdfs:label

                              “University of Oviedo”@en                    “Universidad de Oviedo”@es


Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
4. Model resources, not labels
          Some domains may want to model labels
                    Linguistic resources
                              Thesaurus, corpora, etc.
                              Assertions and relations between labels
                              Examples:
                                        SKOS-XL labels *
                                             Resources of type sxosxl:Label
                                        Strings URI-identifiable: NIF



                                                         *http://www.w3.org/TR/skos-reference/skos-xl.html


Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
5. Provide human-readable info
          Not only machine-readable information
                    Combine machine & human-readable info
                    Human-readable info must be multilingual




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
5. Provide human-readable info
          Facilitates search over the web of data
          Linked data browsing
                    Applications can display labels instead of URIs
          Common properties:
                    rdfs:label
                    skos:prefLabel
                    dcterms:title
                    dcterms:description
                    rdfs:comment
                    etc.


Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
5. Provide human-readable info
          What is the right level of textual information?
          Balance between RDF/Textual world
          Example:           rdfs:label  “University of Oviedo"@en
          http://example.org/Uniovi
                                                         rdfs:label    “Universidad de Oviedo"@es


          http://example.org/Uniovi                                   http://...University
                                                          rdf:type
                           e:city
                                                                      rdfs:label           rdfs:label
                       http://...Oviedo
                rdfs:label                                  “University"@en        “Universidad"@es

                                  “Oviedo"
Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
6. Labels for all
          Provide labels for all URIs
                    Individuals / Concepts / Properties
                    Not just the main entities
          Displaying labels becomes easier and faster

                                                    http://uniovi.org/people#juan

                                                          rdfs:label
                                                                                    “Juan"



Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
6. Labels for all
          It may be difficult to select the right label
                    Don't provide more than one preferred label
                    Not feasible for some datasets
                              Only 38% non-information resources have labels
                                                                             [B. Ell et al, 2011]

          Labels are for humans
                    Avoid camel case or similar notations
                    Guidelines for labelling
                              Upper case, space delimitters, etc

Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra   [Montiel et al, 2011]
7. Use Multilingual literals
          Use language tags
                    Select the right IETF language tag (RFC 5646)
          Example:
                 "University of Oviedo"@en
                 "Universidad de Oviedo"@es
                 "Universidá d'Uvieu"@ast
                 "Օվիեդոյի համալսարանում"@hy



Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
7. Use Multilingual literals
     Multilingual literals & SPARQL
                                                          http://uniovi.es/people#juan

                                                         ex:position            ex:position

                                                         "Professor"@en     "Catedrático"@es


             SELECT * WHERE {
               ?x ex:position "Professor" .                               Returns Nothing
             }

             SELECT * WHERE {
                                                                          Returns <...#juan>
               ?x ex:position "Professor"@en .
             }
Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
7. Use Multilingual literals
     Underused feature
               4.78% non info-resources have one language tag
               Only 0.7% datasets contain several language tags


                          Most commonly language used:
                                    44.72% (en), 5.22% (de), 5.11% (fr), 3.96% (it),...
                                                                                  [B.Ell et al, 2011]




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
7. Use Multilingual literals
          What about longer descriptions:
                              dcterms:description, rdfs:comment…
          CDATA like or XML literals ?
          Reuse existing practices in XML I18n
          Problems:
                    Gap between descriptions and RDF model
                    SPARQLing may be a challenge



Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
8. Literals without language tag
          Include literals without language-tag
          SPARQL queries are easier
          Example:
                                                         http://uniovi.es/people#juan

                        ex:position                                                     ex:position
                                                           ex:position

                              "Professor"                       "Professor"@en       "Catedrático"@es


                                 SELECT * WHERE {
                                   ?x ex:position "Professor" .                  Returns <...#juan>
                                 }

Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
8. Literals without language tag

          Selecting a default language = controversial
                    Declare the primary language of a dataset
                              Some properties: lexvo:language
                    Consumers may not be aware of the default
                     language




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
9. Language Content negotiation
          Use HTTP Accept-Language
          Return different sets of labels
          Reduce load in client applications




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
9. Language Content negotiation
          No Accept-Language declaration (all)

                                                     http://uniovi.es/people#juan
                     ex:position                                                            ex:country
                                                    ex:position        ex:country

          "Professor"@en                                 "Catedrático"@es    "Spain"@en   "España"@es




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
9. Language Content negotiation
          Accept-language: es

                                                     http://uniovi.es/people#juan

                                                                                      ex:country
                                                    ex:position

                                                         "Catedrático"@es           "España"@es




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
9. Language Content negotiation
          Accept-language: en

                                                     http://uniovi.es/people#juan
                     ex:position
                                                                     ex:country

          "Professor"@en                                                   "Spain"@en




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
9. Languge Content negotiation
          Not done in practice.
                    Implementation issues?
          Ensure equivalent representations for each
           language
                                  Content                                  Content
                                represented                              represented
                                 by spanish              equivalent to    by english
                                   labels                                   labels




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
10. Multilingual vocabularies
          Link to existing vocabularies
          Quality selection criteria for vocabularies
                    Use vocabularies that contain descriptions in
                     more than one language
                                                         [Hyland et al, 2012]




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
10. Multilingual vocabularies
          Popular vocabularies are not localized
                  Example: FOAF, DC, etc.
          Should we extend it?
                  Example:
                            dc:contributor rdfs:label "Colaborador"@es .




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
Other issues, not covered
        Unicode support in RDF
        Language declarations & Microdata
        Internationalization topics:
                  Text direction
                                                                                           “Botón PUSH"@es
                  Ruby annotations                                 rdfs:label

                  Notes for localizers                                 http://.../id23                     translate = no

                  Translation rules                                 rdfs:label               “PUSH Buttom"@en
        Linguistic topics
                Ontology-lexicon, Lemon Model
                                                         [Gracia et al, 2011, Buitelaar et al, 2011, McCrae et al 2011]

Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
Conclusions
          Web of data is not just for machines
          LOD applications will be used by humans
                    ...and
                    Human users talk many different languages


          Best? practices for Multilingual LOD




Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
Acknowledgements
          Jose María Álvarez Rodríguez
          Sören Auer
          Richard Cyganiak
          Basil Ell
          Sebastian Hellmann
          Aidan Hogan
          Dimitris Kontokostas
          Pablo Mendes
          Elena Montiel
          Jeni Tennison
          Boris Villazón-Terrazas



Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
References
         [Buitelaar et al, 2011] Ontology Lexicalisation: The lemon Perspective, 9th International Conference on
              Terminology and Artificial Intelligence, 2011
         [Cyganiak] SemWeb Rules of thumb
                   http://www.w3.org/wiki/User:Rcygania2/RulesOfThumb
         [Dodds, Davis, 2012] Linked data patterns
                   http://patterns.dataincubator.org/book/
         [Ell et al, 2011] Labels in the Web of Data, ISWC 2011
         [Gracia et al, 2011] Challenges for the Multilingual Web of Data, International Jounal on Semantic
              Web and Information Systems, 2011
         [Hogan et al, 2012] An empirical study of Linked Data Conformance, Journal of Web Semantics, to
              appear.
         [Heath, Bizer, 2011] Linked data: Evolving the Web into a Global Data Space
                   http://linkeddatabook.com/editions/1.0/
         [Hyland et al] Best Practices for Publishing Linked Data
                   https://dvcs.w3.org/hg/gld/raw-file/default/bp/index.html#internationalized-resource-identifiers
         [Hyland et al] Linked data cookbook. Open Government Linked Data
                   http://www.w3.org/2011/gld/wiki/Linked_Data_Cookbook
         [McCrae et al, 2011] Linking Lexical Resources and Ontologies on the Semantic Web with lemon,
            ESWC, 2011
         [Montiel et al, 11] Style Guidelines for Naming and Labeling Ontologies in the Multilingual Web, Elena
            Montiel-Ponsoda et al, DC-2011


Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
End of presentation




                      http://purl.org/weso

Mais conteúdo relacionado

Mais procurados

Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011
Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011
Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011François Scharffe
 
Multiplicity and Publishing in Open Annotation (tutorial)
Multiplicity and Publishing in Open Annotation (tutorial)Multiplicity and Publishing in Open Annotation (tutorial)
Multiplicity and Publishing in Open Annotation (tutorial)Robert Sanderson
 
TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22jodischneider
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsJon Voss
 
Linked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and MuseumsLinked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and Museumstrevorthornton
 
Understanding the Standards Gap
Understanding the Standards GapUnderstanding the Standards Gap
Understanding the Standards GapDan Brickley
 
Building the New Open Linked Library
Building the New Open Linked LibraryBuilding the New Open Linked Library
Building the New Open Linked LibraryJoel Richard
 
RDF, SPARQL and Semantic Repositories
RDF, SPARQL and Semantic RepositoriesRDF, SPARQL and Semantic Repositories
RDF, SPARQL and Semantic RepositoriesMarin Dimitrov
 
Forging New Links: Libraries in the Semantic Web
Forging New Links: Libraries in the Semantic WebForging New Links: Libraries in the Semantic Web
Forging New Links: Libraries in the Semantic WebGillian Byrne
 
20141112 courtot big_datasemwebontologies
20141112 courtot big_datasemwebontologies20141112 courtot big_datasemwebontologies
20141112 courtot big_datasemwebontologiesMelanie Courtot
 
BIBFRAME : the future of cataloguing?
BIBFRAME : the future of cataloguing?BIBFRAME : the future of cataloguing?
BIBFRAME : the future of cataloguing?Thomas Meehan
 
FOAF for Social Network Portability
FOAF for Social Network PortabilityFOAF for Social Network Portability
FOAF for Social Network PortabilityUldis Bojars
 
Adding Meaning To Your Data
Adding Meaning To Your DataAdding Meaning To Your Data
Adding Meaning To Your DataDuncan Hull
 
Hiberlink: Investigating Reference Rot, December 2013
Hiberlink: Investigating Reference Rot, December 2013Hiberlink: Investigating Reference Rot, December 2013
Hiberlink: Investigating Reference Rot, December 2013Herbert Van de Sompel
 
Evolving the Web into a Giant Global Database
Evolving the Web into a Giant Global DatabaseEvolving the Web into a Giant Global Database
Evolving the Web into a Giant Global DatabaseMarko Rodriguez
 
Generous Interfaces - rich websites for digital collections
Generous Interfaces - rich websites for digital collections Generous Interfaces - rich websites for digital collections
Generous Interfaces - rich websites for digital collections Mitchell Whitelaw
 
Wuhan Wednesday Discussion Breakout Session Keiser
Wuhan Wednesday Discussion Breakout Session KeiserWuhan Wednesday Discussion Breakout Session Keiser
Wuhan Wednesday Discussion Breakout Session KeiserBEKINC
 

Mais procurados (20)

20110728 datalift-rpi-troy
20110728 datalift-rpi-troy20110728 datalift-rpi-troy
20110728 datalift-rpi-troy
 
Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011
Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011
Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011
 
Multiplicity and Publishing in Open Annotation (tutorial)
Multiplicity and Publishing in Open Annotation (tutorial)Multiplicity and Publishing in Open Annotation (tutorial)
Multiplicity and Publishing in Open Annotation (tutorial)
 
Hacking with Semantic Web
Hacking with Semantic WebHacking with Semantic Web
Hacking with Semantic Web
 
TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
 
RDF data model
RDF data modelRDF data model
RDF data model
 
Linked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and MuseumsLinked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and Museums
 
Understanding the Standards Gap
Understanding the Standards GapUnderstanding the Standards Gap
Understanding the Standards Gap
 
Building the New Open Linked Library
Building the New Open Linked LibraryBuilding the New Open Linked Library
Building the New Open Linked Library
 
RDF, SPARQL and Semantic Repositories
RDF, SPARQL and Semantic RepositoriesRDF, SPARQL and Semantic Repositories
RDF, SPARQL and Semantic Repositories
 
Forging New Links: Libraries in the Semantic Web
Forging New Links: Libraries in the Semantic WebForging New Links: Libraries in the Semantic Web
Forging New Links: Libraries in the Semantic Web
 
20141112 courtot big_datasemwebontologies
20141112 courtot big_datasemwebontologies20141112 courtot big_datasemwebontologies
20141112 courtot big_datasemwebontologies
 
BIBFRAME : the future of cataloguing?
BIBFRAME : the future of cataloguing?BIBFRAME : the future of cataloguing?
BIBFRAME : the future of cataloguing?
 
FOAF for Social Network Portability
FOAF for Social Network PortabilityFOAF for Social Network Portability
FOAF for Social Network Portability
 
Adding Meaning To Your Data
Adding Meaning To Your DataAdding Meaning To Your Data
Adding Meaning To Your Data
 
Hiberlink: Investigating Reference Rot, December 2013
Hiberlink: Investigating Reference Rot, December 2013Hiberlink: Investigating Reference Rot, December 2013
Hiberlink: Investigating Reference Rot, December 2013
 
Evolving the Web into a Giant Global Database
Evolving the Web into a Giant Global DatabaseEvolving the Web into a Giant Global Database
Evolving the Web into a Giant Global Database
 
Generous Interfaces - rich websites for digital collections
Generous Interfaces - rich websites for digital collections Generous Interfaces - rich websites for digital collections
Generous Interfaces - rich websites for digital collections
 
Wuhan Wednesday Discussion Breakout Session Keiser
Wuhan Wednesday Discussion Breakout Session KeiserWuhan Wednesday Discussion Breakout Session Keiser
Wuhan Wednesday Discussion Breakout Session Keiser
 

Destaque (6)

ShEx vs SHACL
ShEx vs SHACLShEx vs SHACL
ShEx vs SHACL
 
SHACL by example
SHACL by exampleSHACL by example
SHACL by example
 
RDF Validation Future work and applications
RDF Validation Future work and applicationsRDF Validation Future work and applications
RDF Validation Future work and applications
 
Multlingual Linked Data Patterns
Multlingual Linked Data PatternsMultlingual Linked Data Patterns
Multlingual Linked Data Patterns
 
RDF validation tutorial
RDF validation tutorialRDF validation tutorial
RDF validation tutorial
 
Arquitectura de la Web y Computación en el Servidor
Arquitectura de la Web y Computación en el ServidorArquitectura de la Web y Computación en el Servidor
Arquitectura de la Web y Computación en el Servidor
 

Semelhante a Best Practices for multilingual linked open data

Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...
Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...
Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...Alannah Fitzgerald
 
Hickok boost englishonline
Hickok boost englishonlineHickok boost englishonline
Hickok boost englishonlineloclexuan
 
Opensource and openlearning
Opensource and openlearningOpensource and openlearning
Opensource and openlearningAjaigururaj R
 
Spanish in Texas: Open learning tools for exploring language diversity
Spanish in Texas: Open learning tools for exploring language diversitySpanish in Texas: Open learning tools for exploring language diversity
Spanish in Texas: Open learning tools for exploring language diversitySpanish in Texas Project
 
Delhi University OER for ELT Collections Building
Delhi University OER for ELT Collections BuildingDelhi University OER for ELT Collections Building
Delhi University OER for ELT Collections BuildingAlannah Fitzgerald
 
Flexible Open Language Education for a MultiLingual World
Flexible Open Language Education for a MultiLingual WorldFlexible Open Language Education for a MultiLingual World
Flexible Open Language Education for a MultiLingual WorldAlannah Fitzgerald
 
Methodology for Linguistic Linked Open Data generation. The Apertium RDF case
Methodology for Linguistic Linked Open Data generation. The Apertium RDF caseMethodology for Linguistic Linked Open Data generation. The Apertium RDF case
Methodology for Linguistic Linked Open Data generation. The Apertium RDF caseJorge Gracia
 
Is linked data something for me?
Is linked data something for me?Is linked data something for me?
Is linked data something for me?Christophe Guéret
 
Downstream with Open Educational Resources and Practices: rEAPing the rewards...
Downstream with Open Educational Resources and Practices: rEAPing the rewards...Downstream with Open Educational Resources and Practices: rEAPing the rewards...
Downstream with Open Educational Resources and Practices: rEAPing the rewards...Alannah Fitzgerald
 
Open English Language Resources and Practices for Professional and Academic S...
Open English Language Resources and Practices for Professional and Academic S...Open English Language Resources and Practices for Professional and Academic S...
Open English Language Resources and Practices for Professional and Academic S...Alannah Fitzgerald
 
Elt age internet
Elt age internetElt age internet
Elt age internetElif Selli
 
Building Open Educational Resources for EAP at Hanoi Open University
Building Open Educational Resources for EAP at Hanoi Open UniversityBuilding Open Educational Resources for EAP at Hanoi Open University
Building Open Educational Resources for EAP at Hanoi Open UniversityAlannah Fitzgerald
 
Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...
Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...
Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...Daniel Vila Suero
 
Linking Data, Linking People
Linking Data, Linking PeopleLinking Data, Linking People
Linking Data, Linking PeoplefereiraJ
 
Scholarly Communication, Open Access & Repositories
Scholarly Communication, Open Access & RepositoriesScholarly Communication, Open Access & Repositories
Scholarly Communication, Open Access & RepositoriesGaz Johnson
 
Creation to Classroom OETC 2017
Creation to Classroom OETC 2017Creation to Classroom OETC 2017
Creation to Classroom OETC 2017OhioLINK
 

Semelhante a Best Practices for multilingual linked open data (20)

Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...
Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...
Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...
 
OSS & Libraries
OSS & LibrariesOSS & Libraries
OSS & Libraries
 
RDF, linked data and semantic web
RDF, linked data and semantic webRDF, linked data and semantic web
RDF, linked data and semantic web
 
The OER in COERLL: Defining Open Education
The OER in COERLL: Defining Open EducationThe OER in COERLL: Defining Open Education
The OER in COERLL: Defining Open Education
 
Hickok boost englishonline
Hickok boost englishonlineHickok boost englishonline
Hickok boost englishonline
 
Opensource and openlearning
Opensource and openlearningOpensource and openlearning
Opensource and openlearning
 
Spanish in Texas: Open learning tools for exploring language diversity
Spanish in Texas: Open learning tools for exploring language diversitySpanish in Texas: Open learning tools for exploring language diversity
Spanish in Texas: Open learning tools for exploring language diversity
 
Delhi University OER for ELT Collections Building
Delhi University OER for ELT Collections BuildingDelhi University OER for ELT Collections Building
Delhi University OER for ELT Collections Building
 
Flexible Open Language Education for a MultiLingual World
Flexible Open Language Education for a MultiLingual WorldFlexible Open Language Education for a MultiLingual World
Flexible Open Language Education for a MultiLingual World
 
Methodology for Linguistic Linked Open Data generation. The Apertium RDF case
Methodology for Linguistic Linked Open Data generation. The Apertium RDF caseMethodology for Linguistic Linked Open Data generation. The Apertium RDF case
Methodology for Linguistic Linked Open Data generation. The Apertium RDF case
 
Is linked data something for me?
Is linked data something for me?Is linked data something for me?
Is linked data something for me?
 
Downstream with Open Educational Resources and Practices: rEAPing the rewards...
Downstream with Open Educational Resources and Practices: rEAPing the rewards...Downstream with Open Educational Resources and Practices: rEAPing the rewards...
Downstream with Open Educational Resources and Practices: rEAPing the rewards...
 
Open English Language Resources and Practices for Professional and Academic S...
Open English Language Resources and Practices for Professional and Academic S...Open English Language Resources and Practices for Professional and Academic S...
Open English Language Resources and Practices for Professional and Academic S...
 
Elt age internet
Elt age internetElt age internet
Elt age internet
 
Building Open Educational Resources for EAP at Hanoi Open University
Building Open Educational Resources for EAP at Hanoi Open UniversityBuilding Open Educational Resources for EAP at Hanoi Open University
Building Open Educational Resources for EAP at Hanoi Open University
 
Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...
Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...
Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...
 
Roar Presentation To School Of Psychology
Roar Presentation To School Of PsychologyRoar Presentation To School Of Psychology
Roar Presentation To School Of Psychology
 
Linking Data, Linking People
Linking Data, Linking PeopleLinking Data, Linking People
Linking Data, Linking People
 
Scholarly Communication, Open Access & Repositories
Scholarly Communication, Open Access & RepositoriesScholarly Communication, Open Access & Repositories
Scholarly Communication, Open Access & Repositories
 
Creation to Classroom OETC 2017
Creation to Classroom OETC 2017Creation to Classroom OETC 2017
Creation to Classroom OETC 2017
 

Mais de Jose Emilio Labra Gayo

Introducción a la investigación/doctorado
Introducción a la investigación/doctoradoIntroducción a la investigación/doctorado
Introducción a la investigación/doctoradoJose Emilio Labra Gayo
 
Challenges and applications of RDF shapes
Challenges and applications of RDF shapesChallenges and applications of RDF shapes
Challenges and applications of RDF shapesJose Emilio Labra Gayo
 
Legislative data portals and linked data quality
Legislative data portals and linked data qualityLegislative data portals and linked data quality
Legislative data portals and linked data qualityJose Emilio Labra Gayo
 
Validating RDF data: Challenges and perspectives
Validating RDF data: Challenges and perspectivesValidating RDF data: Challenges and perspectives
Validating RDF data: Challenges and perspectivesJose Emilio Labra Gayo
 
Legislative document content extraction based on Semantic Web technologies
Legislative document content extraction based on Semantic Web technologiesLegislative document content extraction based on Semantic Web technologies
Legislative document content extraction based on Semantic Web technologiesJose Emilio Labra Gayo
 
Como publicar datos: hacia los datos abiertos enlazados
Como publicar datos: hacia los datos abiertos enlazadosComo publicar datos: hacia los datos abiertos enlazados
Como publicar datos: hacia los datos abiertos enlazadosJose Emilio Labra Gayo
 

Mais de Jose Emilio Labra Gayo (20)

Publicaciones de investigación
Publicaciones de investigaciónPublicaciones de investigación
Publicaciones de investigación
 
Introducción a la investigación/doctorado
Introducción a la investigación/doctoradoIntroducción a la investigación/doctorado
Introducción a la investigación/doctorado
 
Challenges and applications of RDF shapes
Challenges and applications of RDF shapesChallenges and applications of RDF shapes
Challenges and applications of RDF shapes
 
Legislative data portals and linked data quality
Legislative data portals and linked data qualityLegislative data portals and linked data quality
Legislative data portals and linked data quality
 
Validating RDF data: Challenges and perspectives
Validating RDF data: Challenges and perspectivesValidating RDF data: Challenges and perspectives
Validating RDF data: Challenges and perspectives
 
Wikidata
WikidataWikidata
Wikidata
 
Legislative document content extraction based on Semantic Web technologies
Legislative document content extraction based on Semantic Web technologiesLegislative document content extraction based on Semantic Web technologies
Legislative document content extraction based on Semantic Web technologies
 
ShEx by Example
ShEx by ExampleShEx by Example
ShEx by Example
 
Introduction to SPARQL
Introduction to SPARQLIntroduction to SPARQL
Introduction to SPARQL
 
Introducción a la Web Semántica
Introducción a la Web SemánticaIntroducción a la Web Semántica
Introducción a la Web Semántica
 
RDF Data Model
RDF Data ModelRDF Data Model
RDF Data Model
 
2017 Tendencias en informática
2017 Tendencias en informática2017 Tendencias en informática
2017 Tendencias en informática
 
Introduction to SPARQL
Introduction to SPARQLIntroduction to SPARQL
Introduction to SPARQL
 
19 javascript servidor
19 javascript servidor19 javascript servidor
19 javascript servidor
 
Como publicar datos: hacia los datos abiertos enlazados
Como publicar datos: hacia los datos abiertos enlazadosComo publicar datos: hacia los datos abiertos enlazados
Como publicar datos: hacia los datos abiertos enlazados
 
16 Alternativas XML
16 Alternativas XML16 Alternativas XML
16 Alternativas XML
 
XSLT
XSLTXSLT
XSLT
 
XPath
XPathXPath
XPath
 
Máster en Ingeniería Web
Máster en Ingeniería WebMáster en Ingeniería Web
Máster en Ingeniería Web
 
2016 temuco tecnologias_websemantica
2016 temuco tecnologias_websemantica2016 temuco tecnologias_websemantica
2016 temuco tecnologias_websemantica
 

Último

Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 

Último (20)

Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 

Best Practices for multilingual linked open data

  • 1. Best Practices for Multilingual Linked Open Data Jose Emilio Labra Gayo University of Oviedo, Spain http://www.di.uniovi.es/~labra
  • 2. Towards the web of data Web of documents Web of Data Unit of information: Web page (HTML) Unit of information: data (RDF) Human readable Machine readable Challenge: Multilingual pages Intrinsically Multilingual Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 3. Example <html lang="en"> English <html lang="es"> Espanish <body> <body> <h1>Juan's Home page</h1> <h1>Página personal de Juan</h1> <p>Juan is a Professor at the <p>Juan es Catedrático en la University of Oviedo, Spain</p> Universidad de Oviedo, España</p> <p>Phone: +34-1234567</p> <p>Tlfno: +34-1234567</p> </body> </body> </html> </html> http://uniovi.es/people#juan foaf:phone tel:+34-1234567 Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 4. Multilingual data Data that appears in a multilingual context It contains labels/comments Human-readable information Using different languages/conventions Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 5. Example of Multilingual Data <html lang="en"> English <html lang="es"> Spanish <body> <body> <h1>Juan's Home page</h1> <h1>Página personal de Juan</h1> <p>Juan is a Professor at the <p>Juan es Catedrático en la University of Oviedo, Spain</p> Universidad de Oviedo, España</p> <p>Phone: +34-1234567</p> <p>Tlfno: +34-1234567</p> </body> </body> </html> </html> http://uniovi.es/people#juan Web of Data ex:position ex:position Unit of information: data (RDF) Human + Machine readable "Professor"@en "Catedrático"@es New Challenge: Multilingual Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 6. Linked Open Data Principles on how to publish data Increasing adoption Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 7. Best practices for LOD Several proposals: Linked data book [Heath, Bizer, 2011] Linked data patterns [Dodds, Davis, 2012] Best Practices for Publishing Linked Data [Hyland et al] SemWeb Rules of thumb [R. Cyganiak] etc. . . In this talk Best practices affected by multilinguality Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 8. Multilingual LOD patterns 1. Design a good IRI scheme 2. Separate domains by language 3. Model resources, not labels 4. Provide human-readable info 5. Labels for all 6. Use Multilingual literals 7. Language Content negotiation 8. Literals without language 9. Multilingual vocabularies 10. Interlanguage links Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 9. 1. Design a good IRI scheme Cool URIs Cool IRIs Don't change Identify things If possible, use human-readable IRIs http://dbpedia.org/resource/Armenia http://դբպեդիա.օրգ/րեսօուրսե/Հայաստան Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 10. 1. Design a good IRI scheme Most datasets use only URIs IRIs may be difficult to maintain Domain names, phising, … IRI support in current libraries Some hybrid solutions http://dbpedia.org/resource/Armenia http://dbpedia.org/resource/Հայաստան http://դբպեդիա.օրգ/րեսօուրսե/Հայաստան Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 11. 2. Separate domains by language Instead of http://dbpedia.org/resource/Հայաստան Language dependant URIs http://en.dbpedia.org/resource/Armenia http://hy.dbpedia.org/resource/Հայաստան Language identifiers ≠ Country identifiers Example: Use "hy" instead of "am" Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 12. 2. Separate domains by language Where should we put the language tag? http://hy.dbpedia.org/resource/Հայաստան http://dbpedia.org/resource/hy/Հայաստան http://dbpedia.org/resource/Հայաստան/hy Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 13. 3. Interlanguage links Provide links between concepts in different languages http://dbpedia.org/resource/Armenia owl:sameAs http://hy.dbpedia.org/resource/Հայաստան Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 14. 3. Interlanguage links Beware of cross-lingual mappings Example: Concept of Concept of professor in ≠ professor in english culture spanish culture "Professor"@en "Profesor"@es Use other properties to link: dbo:interlanguageLink rdfs:seeAlso skos:related ... Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 15. 4. Model resources, not labels Define URIs only for resources Resources do not depend on a given language Assign labels to those resources Do not mint separate URIs for labels Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 16. 4. Model resources, not labels e:workPlace http://example.org/UniversityOfOviedo http://uniovi.es/people#juan e:workPlace http://example.org/UniversidadDeOviedo http://uniovi.es/people#juan e:workPlace http://example.org/Uniovi rdfs:label rdfs:label “University of Oviedo”@en “Universidad de Oviedo”@es Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 17. 4. Model resources, not labels Some domains may want to model labels Linguistic resources Thesaurus, corpora, etc. Assertions and relations between labels Examples: SKOS-XL labels * Resources of type sxosxl:Label Strings URI-identifiable: NIF *http://www.w3.org/TR/skos-reference/skos-xl.html Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 18. 5. Provide human-readable info Not only machine-readable information Combine machine & human-readable info Human-readable info must be multilingual Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 19. 5. Provide human-readable info Facilitates search over the web of data Linked data browsing Applications can display labels instead of URIs Common properties: rdfs:label skos:prefLabel dcterms:title dcterms:description rdfs:comment etc. Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 20. 5. Provide human-readable info What is the right level of textual information? Balance between RDF/Textual world Example: rdfs:label “University of Oviedo"@en http://example.org/Uniovi rdfs:label “Universidad de Oviedo"@es http://example.org/Uniovi http://...University rdf:type e:city rdfs:label rdfs:label http://...Oviedo rdfs:label “University"@en “Universidad"@es “Oviedo" Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 21. 6. Labels for all Provide labels for all URIs Individuals / Concepts / Properties Not just the main entities Displaying labels becomes easier and faster http://uniovi.org/people#juan rdfs:label “Juan" Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 22. 6. Labels for all It may be difficult to select the right label Don't provide more than one preferred label Not feasible for some datasets Only 38% non-information resources have labels [B. Ell et al, 2011] Labels are for humans Avoid camel case or similar notations Guidelines for labelling Upper case, space delimitters, etc Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra [Montiel et al, 2011]
  • 23. 7. Use Multilingual literals Use language tags Select the right IETF language tag (RFC 5646) Example: "University of Oviedo"@en "Universidad de Oviedo"@es "Universidá d'Uvieu"@ast "Օվիեդոյի համալսարանում"@hy Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 24. 7. Use Multilingual literals Multilingual literals & SPARQL http://uniovi.es/people#juan ex:position ex:position "Professor"@en "Catedrático"@es SELECT * WHERE { ?x ex:position "Professor" . Returns Nothing } SELECT * WHERE { Returns <...#juan> ?x ex:position "Professor"@en . } Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 25. 7. Use Multilingual literals Underused feature 4.78% non info-resources have one language tag Only 0.7% datasets contain several language tags Most commonly language used: 44.72% (en), 5.22% (de), 5.11% (fr), 3.96% (it),... [B.Ell et al, 2011] Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 26. 7. Use Multilingual literals What about longer descriptions: dcterms:description, rdfs:comment… CDATA like or XML literals ? Reuse existing practices in XML I18n Problems: Gap between descriptions and RDF model SPARQLing may be a challenge Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 27. 8. Literals without language tag Include literals without language-tag SPARQL queries are easier Example: http://uniovi.es/people#juan ex:position ex:position ex:position "Professor" "Professor"@en "Catedrático"@es SELECT * WHERE { ?x ex:position "Professor" . Returns <...#juan> } Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 28. 8. Literals without language tag Selecting a default language = controversial Declare the primary language of a dataset Some properties: lexvo:language Consumers may not be aware of the default language Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 29. 9. Language Content negotiation Use HTTP Accept-Language Return different sets of labels Reduce load in client applications Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 30. 9. Language Content negotiation No Accept-Language declaration (all) http://uniovi.es/people#juan ex:position ex:country ex:position ex:country "Professor"@en "Catedrático"@es "Spain"@en "España"@es Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 31. 9. Language Content negotiation Accept-language: es http://uniovi.es/people#juan ex:country ex:position "Catedrático"@es "España"@es Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 32. 9. Language Content negotiation Accept-language: en http://uniovi.es/people#juan ex:position ex:country "Professor"@en "Spain"@en Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 33. 9. Languge Content negotiation Not done in practice. Implementation issues? Ensure equivalent representations for each language Content Content represented represented by spanish equivalent to by english labels labels Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 34. 10. Multilingual vocabularies Link to existing vocabularies Quality selection criteria for vocabularies Use vocabularies that contain descriptions in more than one language [Hyland et al, 2012] Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 35. 10. Multilingual vocabularies Popular vocabularies are not localized Example: FOAF, DC, etc. Should we extend it? Example: dc:contributor rdfs:label "Colaborador"@es . Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 36. Other issues, not covered Unicode support in RDF Language declarations & Microdata Internationalization topics: Text direction “Botón PUSH"@es Ruby annotations rdfs:label Notes for localizers http://.../id23 translate = no Translation rules rdfs:label “PUSH Buttom"@en Linguistic topics Ontology-lexicon, Lemon Model [Gracia et al, 2011, Buitelaar et al, 2011, McCrae et al 2011] Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 37. Conclusions Web of data is not just for machines LOD applications will be used by humans ...and Human users talk many different languages Best? practices for Multilingual LOD Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 38. Acknowledgements Jose María Álvarez Rodríguez Sören Auer Richard Cyganiak Basil Ell Sebastian Hellmann Aidan Hogan Dimitris Kontokostas Pablo Mendes Elena Montiel Jeni Tennison Boris Villazón-Terrazas Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 39. References [Buitelaar et al, 2011] Ontology Lexicalisation: The lemon Perspective, 9th International Conference on Terminology and Artificial Intelligence, 2011 [Cyganiak] SemWeb Rules of thumb http://www.w3.org/wiki/User:Rcygania2/RulesOfThumb [Dodds, Davis, 2012] Linked data patterns http://patterns.dataincubator.org/book/ [Ell et al, 2011] Labels in the Web of Data, ISWC 2011 [Gracia et al, 2011] Challenges for the Multilingual Web of Data, International Jounal on Semantic Web and Information Systems, 2011 [Hogan et al, 2012] An empirical study of Linked Data Conformance, Journal of Web Semantics, to appear. [Heath, Bizer, 2011] Linked data: Evolving the Web into a Global Data Space http://linkeddatabook.com/editions/1.0/ [Hyland et al] Best Practices for Publishing Linked Data https://dvcs.w3.org/hg/gld/raw-file/default/bp/index.html#internationalized-resource-identifiers [Hyland et al] Linked data cookbook. Open Government Linked Data http://www.w3.org/2011/gld/wiki/Linked_Data_Cookbook [McCrae et al, 2011] Linking Lexical Resources and Ontologies on the Semantic Web with lemon, ESWC, 2011 [Montiel et al, 11] Style Guidelines for Naming and Labeling Ontologies in the Multilingual Web, Elena Montiel-Ponsoda et al, DC-2011 Jose Emilio Labra Gayo, http://www.di.uniovi.es/~labra
  • 40. End of presentation http://purl.org/weso