Methodological Guidelines for
   Publishing Linked Data



Boris Villazón-Terrazas, Asunción Gómez-Pérez, and Óscar Corcho

    Facultad de Informática, Universidad Politécnica de Madrid
  Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid
                      http://www.oeg-upm.net
               {bvillazon,asun,ocorcho}@fi.upm.es
           Phone: 34.91.3366605, Fax: 34.91.3524819



                CONSEGI 2011 – Brasília, Brazil
                      12th May, 2011
ToC




• Introduction to Linked Data

• Guidelines for Publishing Linked Data

• Demo




                           2
ToC


• Introduction to Linked Data

• Guidelines for Publishing Linked Data

• Demo




                           3
Classic Web



          MovieDB




                                                 Data exposed to
                                                  the Web via
                                                 HTML, pdf, etc.

            CIA
           World
          FactBook




© Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig
                                                                       4
Classic Web




                                                  Information from
                                                  single pages can be
                                                  found via search engines

                                                  Complex queries over
                                                  multiple pages / data
                                                  sources??




© Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig
                                                                       5
What do we actually want?

      • Use the Web like a single global database




    CIA
   World                                                                                     MovieDB
  FactBook




© Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig
                                                                       6
Linked Data enables such Web of Data
Global Identifier: URI (Uniform Resource Identifier), which is a string of characters used
               to identify a name or a resource on the Internet.
Data Model: RDF (Resource Description Framework), which is a standard model
               for data interchange on the Web
Access Mechanism: HTTP
Connection: Typed Links


           8000000
                                                                                                         “Even the Rain”

       http://.../population
                                                                                                          http://.../name
                                                                     http://.../filming_location
     http://cia.../Bolivia
                                                                                                   http://imdb.../TLLuvia




        CIA
       World
                                                                                                           MovieDB
      FactBook



    © Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig
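
The figure can be read as three statements (triples). The sketch below is only an illustration: the slide elides the real URIs ("http://.../..."), so the example.org identifiers and the helper names are hypothetical placeholders.

```python
# Hypothetical identifiers: the slide elides the real URIs,
# so example.org placeholders are used here.
CIA = "http://example.org/cia/"
IMDB = "http://example.org/imdb/"

# Each statement is a (subject, predicate, object) triple.
triples = [
    (CIA + "Bolivia", CIA + "population", 8000000),
    (IMDB + "TLLuvia", IMDB + "name", "Even the Rain"),
    # Typed link connecting the two data sources.
    (IMDB + "TLLuvia", IMDB + "filming_location", CIA + "Bolivia"),
]


def objects(subject, predicate):
    """Return every object stated for the given subject and predicate."""
    return [o for s, p, o in triples if s == subject and p == predicate]


def population_of_filming_locations(film):
    """Follow filming_location links, then read each location's population."""
    result = {}
    for place in objects(film, IMDB + "filming_location"):
        for value in objects(place, CIA + "population"):
            result[place] = value
    return result
```

Because the link between the film and the country is itself data, a query can cross from one source to the other without any source-specific glue code.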
                                                                           7
In a nutshell
• An extension of the current
  Web…
   • … where information, data and services
     are given well-defined and explicitly
     represented meaning, …
   • … so that it can be shared and used
     by humans and machines, …
   • … better enabling them to work in
     cooperation


• How?
   • Promoting information exchange by
     tagging web content with machine
     processable descriptions of its
     meaning.
   • And technologies and infrastructure
     to do this
   • And clear principles on how to
     publish data


                                             8
The four principles (Tim Berners-Lee, 2006)

• http://www.w3.org/DesignIssues/LinkedData.html
• http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html

1. Use URIs as names for things.
2. Use HTTP URIs so that people can look up those names.
3. When someone looks up a URI, provide useful information,
   using the standards (RDF*, SPARQL).
4. Include links to other URIs, so that they can
   discover more things.
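
Principles 2 and 3 amount to an HTTP lookup that asks for RDF rather than HTML. A minimal sketch with the Python standard library follows; the helper name is hypothetical, and whether a given server actually answers `text/turtle` is an assumption.

```python
from urllib.request import Request


def build_rdf_request(uri, media_type="text/turtle"):
    """Prepare (but do not send) an HTTP GET that asks for RDF instead of HTML."""
    return Request(uri, headers={"Accept": media_type}, method="GET")


request = build_rdf_request("http://geo.linkeddata.es/resource/Provincia/Madrid")
```

Passing `request` to `urllib.request.urlopen` would perform the lookup; the `Accept` header is what lets the same URI serve people (HTML) and machines (RDF).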

                            9
So does that mean I have to publish my data as Linked Data, now?

          • But, why?

                        • What was your incentive to publish an HTML page in 1990?
                           • Share data in documents and because your neighbor
                             was doing it


                         • So, why should we publish Linked Data in 2011?
                            • Share data as data and because your neighbor is doing it




© Slide adapted from “Introduction to Linked Data”- Juan Sequeda
                                                                   10
And guess who is starting to publish Linked Data now?


 •   UK Government
 •   US Government
 •   BBC
 •   Open Calais
 •   Freebase
 •   NY Times
 •   CNET
 •   DBpedia
 •   ….




                       11
Linked Open Data evolution

 2007


          2008

                             2009




           12
Linked Open Data

2010




http://richard.cyganiak.de/2007/10/lod/
                                          13
ToC




• Introduction to Linked Data

• Guidelines for Publishing Linked Data

• Demo




                           14
Linked Data in OEG

• GeoLinkedData is an open initiative whose aim is to
  enrich the Web of Data with Spanish geospatial data.
   http://geo.linkeddata.es

• El Viajero Linked Data is a project that focuses on the
  integration of the contents produced by newspapers
  and digital platforms belonging to Prisa Group.
   http://webenemasuno.linkeddata.es/

• A project with the Biblioteca Nacional to publish the
  library information as Linked Data.
    http://cultura.linkeddata.es/visualizer/



                           15
Linked Data in OEG

• Tools for generating and consuming Linked Data, e.g.,
   • geometry2rdf http://www.oeg-upm.net/index.php/downloads/151-geometry2rdf

   • map4rdf http://oegdev.dia.fi.upm.es/projects/map4rdf/


• Spanish Thematic Network of Linked Data
        http://red.linkeddata.es

                                 » Group leader: Ontology Engineering Group

                                 » 19 Research Groups

                                 » 4 companies




                                        16
Guidelines for Publishing Linked Data




      17
Guidelines for Publishing Linked Data




      18
Identification of the data sources



• Guidelines based on the Open Data Manual 1




• Two possibilities

   • To find the data sources already available in a public data
     catalog, e.g., Aporta project 2

   • To get an agreement with a particular government body to
     publish its data sources, e.g., GeoLinkedData - IGN



   1   http://opendatamanual.org/
   2   http://aporta.es
                                    19
Identification of the data sources
                                                             GeoLinkedData

                                                            Agreement with the IGN
                IGN
National Geographic Institute of Spain

        Oracle & MySQL




                                                             Data sources available
                                                            in a public data catalog
         INE
National Statistics Institute of Spain




                                         20
Identification of the data sources
                                                IGN & INE




           Year




Province                         Industry Production Index




                  21
Guidelines for Publishing Linked Data




      22
Vocabulary Modelling
                                                                            Ontology




•   An ontology is an engineering artifact, which provides:
     •   A set of terms
     •   A set of explicit assumptions regarding the intended meaning of the terms.
           • Almost always including concepts and their classification
           • Almost always including properties between concepts




•   Shared understanding of a domain of interest




                                          23
Vocabulary Modelling
                                 Reuse available vocabularies



1. Search for suitable vocabularies (e.g., in Linked Open Vocabularies).
2. Are there suitable vocabularies?
   • Yes: build the vocabulary by reusing available vocabularies.
   • No: …
                            24
Vocabulary Modelling
                 Reuse available non-ontological resources

1. Search for suitable non-ontological resources:
   • Highly reliable Web Sites
   • Domain-related sites
   • Government Catalogs
2. Are there suitable resources?
   • Yes: build the vocabulary by transforming available resources.
   • No: build the vocabulary from scratch.



                                  25
Vocabulary Modelling
                                                                                                                 GeoLinkedData
Vocabularies shown in the figure:
   • WGS84 Geo Positioning: an RDF vocabulary
   • scv:Dimension, scv:Item, scv:Dataset
   • Hydrographical phenomena (rivers, lakes, etc.)
   • Vocabulary for instants, intervals, durations, etc.
   • Names and international code systems for territories and groups
   • Ontology for OGC Geography Markup Language

Classes             33
Object Properties   44
Data Properties    318

http://neon-toolkit.org/


                                                                      26
Guidelines for Publishing Linked Data




      27
Generation of the RDF Data




   • INE → NOR2O
   • IGN → ODEMapster
   • IGN (geospatial column) → Geometry2RDF




                                       28
Generation of the RDF Data
                                                            NOR2O
Industry Production Index   Year




Province




                                   NOR2O




                                   29
Generation of the RDF Data
                                                                       R2O & ODEMapster
•   R2O is an extensible, fully declarative language to describe
    mappings between relational database schemas and ontologies.
•   The ODEMapster processor generates RDF instances from
    relational instances based on the mapping description
    expressed in the R2O document.
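
The idea of a declarative mapping driving RDF generation can be sketched as follows. This is not R2O syntax and not how ODEMapster is implemented; the mapping layout, the example.org namespace, the table, and the row values are all hypothetical.

```python
# Hypothetical mapping description: one entry per table, naming the target
# class, the key column used to mint URIs, and column-to-property mappings.
BASE = "http://example.org/"
MAPPING = {
    "province": {
        "class": BASE + "ontology/Province",
        "key": "name",
        "properties": {"population": BASE + "ontology/population"},
    }
}
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def rows_to_triples(table, rows, mapping=MAPPING):
    """Generate (subject, predicate, object) triples from relational rows."""
    rule = mapping[table]
    triples = []
    for row in rows:
        subject = BASE + "resource/" + str(row[rule["key"]])
        triples.append((subject, RDF_TYPE, rule["class"]))
        for column, prop in rule["properties"].items():
            if row.get(column) is not None:
                triples.append((subject, prop, row[column]))
    return triples


# Made-up row, for illustration only.
triples = rows_to_triples("province", [{"name": "Madrid", "population": 100}])
```

Keeping the mapping as data, separate from the processor, is what allows the same processor to be reused across databases.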




    www.oeg-upm.net/index.php/en/downloads/9-r2o-odempaster
                                                              30
Generation of the RDF Data
                                     R2O & ODEMapster
• Creation of the R2O Mappings




                         31
Generation of the RDF Data
         R2O & ODEMapster


         Excerpt of the R2O document




32
Generation of the RDF Data
                                                                             geometry2rdf

• Tool for generating RDF from geometrical information

• The geometry could be available in GML or WKT

• The RDF generated follows our Geometry Model




  http://www.oeg-upm.net/index.php/en/downloads/151-geometry2rdf

                                                           33
Generation of the RDF Data
                                geometry2rdf



                   Oracle SDO_UTIL package




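-- Serialise the geometry of every row whose Etiqueta (label) is 'Arroyo'
-- as GML 3.1.1 text, using Oracle's SDO_UTIL package.
SELECT TO_CHAR(SDO_UTIL.TO_GML311GEOMETRY(geometry))
          AS Gml311Geometry
FROM "BCN200"."BCN200_0301L_RIO" c
WHERE c.Etiqueta='Arroyo'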




     34
Generation of the RDF Data
              geometry2rdf
Generation of the RDF Data
                                                                                        Geometry Model
Prefixes:
   geoes: http://geo.linkeddata.es/
   geo:   http://www.w3.org/2003/01/geo/wgs84_pos#

geoes:ontology/Geometría
   • geo:Point (rdfs:subClassOf geoes:ontology/Geometría)
        geo:lat, geo:long (both shown with the value 39)
   • geoes:ontology/Curva (rdfs:subClassOf geoes:ontology/Geometría)
        formadoPor: collection of 2 or more geo:Points
   • geoes:ontology/Polígono (rdfs:subClassOf geoes:ontology/Geometría)
        formadoPor: collection of 3 or more geo:Points
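
A rough sketch of how a WKT line could be described with this model is shown below. It is not the geometry2rdf implementation: the namespace given to formadoPor, the per-point URI scheme, and the reading of WKT coordinates as (longitude latitude) are assumptions, and the order of the points is not preserved in the output.

```python
import re

GEO = "http://www.w3.org/2003/01/geo/wgs84_pos#"
GEOES = "http://geo.linkeddata.es/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
# Assumption: the slide shows "formadoPor" without a namespace; it is placed
# under geoes:ontology/ here purely for illustration.
FORMADO_POR = GEOES + "ontology/formadoPor"


def parse_linestring(wkt):
    """Parse 'LINESTRING (x y, x y, ...)' into a list of (x, y) floats."""
    match = re.fullmatch(r"\s*LINESTRING\s*\((.*)\)\s*", wkt, flags=re.IGNORECASE)
    if match is None:
        raise ValueError("not a WKT LINESTRING: %r" % wkt)
    points = []
    for pair in match.group(1).split(","):
        x, y = pair.split()
        points.append((float(x), float(y)))
    if len(points) < 2:
        raise ValueError("a curve needs at least 2 points")
    return points


def curve_to_triples(curve_uri, wkt):
    """Describe a curve as a geoes Curva made of geo:Point resources."""
    triples = [(curve_uri, RDF_TYPE, GEOES + "ontology/Curva")]
    for index, (x, y) in enumerate(parse_linestring(wkt)):
        point = "%s/point/%d" % (curve_uri, index)  # hypothetical URI scheme
        triples.append((curve_uri, FORMADO_POR, point))
        triples.append((point, RDF_TYPE, GEO + "Point"))
        # Assumption: WKT order is (longitude latitude), i.e. x = long, y = lat.
        triples.append((point, GEO + "long", x))
        triples.append((point, GEO + "lat", y))
    return triples
```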




                                                          36
Generation of the RDF Data
RDF generated according to our Geometry Model




                              1   2




                          0


                 0

           37
Generation of the RDF Data
                                                                                    URI Generation

• URIs are extremely relevant in this process since
  they are the key for the alignment of heterogeneous
  resources that come from different data sources.
      • Cool URIs 1
      • UK Cabinet Office 2


• Examples:
  http://geo.linkeddata.es/ontology/{class/property}
        http://geo.linkeddata.es/ontology/Lago

  http://geo.linkeddata.es/resource/dataset/type/{resourcename}
            http://geo.linkeddata.es/resource/Provincia/Madrid

  1   http://www.w3.org/TR/cooluris/
  2   http://www.cabinetoffice.gov.uk/media/301253/puiblic sector uri.pdf
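
A minimal sketch of minting URIs that follow the shape of the two example URIs above (.../ontology/Lago and .../resource/Provincia/Madrid). The function names are hypothetical, and percent-encoding every path segment is a design choice made here, not something the slide prescribes.

```python
from urllib.parse import quote

BASE = "http://geo.linkeddata.es/"


def ontology_uri(term):
    """URI for a class or property, e.g. .../ontology/Lago."""
    return BASE + "ontology/" + quote(term, safe="")


def resource_uri(resource_type, name):
    """URI for an instance, e.g. .../resource/Provincia/Madrid."""
    return (BASE + "resource/" + quote(resource_type, safe="")
            + "/" + quote(name, safe=""))
```

Generating every URI through one pair of functions keeps the scheme consistent, which matters because the URIs are the join keys between data sources.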

                                                                38
Generation of the RDF Data
                                                       Provenance Information

• It is relevant
    • to manage the provenance information of the resources
    • to establish the license of the information


• Example




  Pubby: http://www4.wiwiss.fu-berlin.de/pubby/


                                                  39
Guidelines for Publishing Linked Data




      40
Publication of the RDF data

Tools:
   • map4rdf: http://oegdev.dia.fi.upm.es/projects/map4rdf/
   • Pubby 0.3, including provenance support: http://www4.wiwiss.fu-berlin.de/pubby/
   • Virtuoso 6.1.0

Access: HTML, Linked Data, SPARQL

                                                               41
Guidelines for Publishing Linked Data




      42
Data Cleansing

• To find possible errors, identified by Hogan et al.
   • http-level issues such as accessibility and dereferenceability,
     e.g., HTTP URIs return 40x/50x errors
   • reasoning issues such as namespace without vocabulary,
     e.g., rss:item term invented
   • malformed/incompatible datatypes, e.g., “true” as xsd:int


• To fix the identified errors

• Example, encoding URIs
   • Special characters: á, é, ñ
       • http://geo.linkeddata.es/resource/Provincia/M%C3%A1laga
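
Two of the checks listed above (malformed datatypes and http-level errors) can be sketched as simple predicates. The function names are hypothetical, and the integer check is deliberately strict: it does not normalise surrounding whitespace.

```python
import re

# Lexical form of xsd:int: optional sign followed by ASCII digits.
_INT_LEXICAL = re.compile(r"[+-]?[0-9]+")
# xsd:int is a 32-bit signed integer.
_INT_MIN, _INT_MAX = -2147483648, 2147483647


def is_valid_xsd_int(lexical):
    """True if the string is a valid xsd:int literal (32-bit signed)."""
    if _INT_LEXICAL.fullmatch(lexical) is None:
        return False
    return _INT_MIN <= int(lexical) <= _INT_MAX


def is_dereference_error(status_code):
    """True for the 40x/50x responses that signal a dereferencing problem."""
    return 400 <= status_code <= 599
```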




                                 43
Guidelines for Publishing Linked Data




      44
Linking the RDF Data




                     Identify suitable data sets                                       http://ckan.net
                         as linking targets




                       Discover relationships
                        between data items
LIMES                                              Silk Framework
http://aksw.org/Projects/limes                     http://www4.wiwiss.fu-berlin.de/bizer/silk/




                     Validate the relationships
                            discovered              sameAs Validator
                                                    http://oegdev.dia.fi.upm.es:8080/sameAs/
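
The "discover relationships" step can be illustrated with a naive label comparison. This is a toy stand-in, not how LIMES or the Silk Framework work; the threshold, the function names, and the label data are assumptions.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Case-insensitive string similarity in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def candidate_links(source, target, threshold=0.9):
    """Propose (source_uri, target_uri, score) pairs whose labels are similar.

    `source` and `target` map URIs to labels.
    """
    links = []
    for s_uri, s_label in source.items():
        for t_uri, t_label in target.items():
            score = similarity(s_label, t_label)
            if score >= threshold:
                links.append((s_uri, t_uri, score))
    return links


local = {"http://geo.linkeddata.es/resource/Provincia/Madrid": "Madrid"}
remote = {
    "http://dbpedia.org/resource/Madrid": "Madrid",
    "http://dbpedia.org/resource/Barcelona": "Barcelona",
}
links = candidate_links(local, remote)
```

Candidates found this way are only proposals: identical labels can denote different things (a city and a province, for instance), which is why the validation step follows.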




                                                                45
Linking the RDF Data
                                                                        GeoLinkedData


   • DBpedia: http://dbpedia.org/resource/Madrid
   • GeoLinkedData: http://geo.linkeddata.es/.../Madrid
   • GeoNames: http://sws.geonames.org/6355233/

                                                46
Linking the RDF Data
                                                sameAs Validator




http://oegdev.dia.fi.upm.es:8080/sameAs/




                                           47
Guidelines for Publishing Linked Data




      48
Enable Effective Discovery
                                 Register the dataset into CKAN Registry

• Add the dataset to CKAN, the open registry of data
  and content packages

• Minimum information
    • Name, unique ID for your data set on CKAN
    • Title, full name of your data set
    • URL, link to the data set home page
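
A small sketch that checks a record for these three minimum fields before registration. The record values are hypothetical, and the character rule applied to the name (lowercase ASCII letters, digits, "-" and "_") is an assumption about what makes a safe identifier, not a quotation of the CKAN rules.

```python
import re

# Assumption: names are restricted to lowercase ASCII letters, digits,
# '-' and '_' so that they can serve as a URL-safe identifier.
_NAME = re.compile(r"[a-z0-9_-]{2,100}")


def missing_or_invalid(record):
    """Return the names of minimum fields that are missing or malformed."""
    problems = [f for f in ("name", "title", "url") if not record.get(f)]
    if record.get("name") and _NAME.fullmatch(record["name"]) is None:
        problems.append("name")
    return problems


record = {
    "name": "my-linked-dataset",          # hypothetical
    "title": "My Linked Dataset",         # hypothetical
    "url": "http://example.org/dataset",  # hypothetical
}
```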




  http://www.w3.org/wiki/TaskForces/CommunityProjects/LinkingOpenData/DataSets/CKANmetainformation


                                                       49
Enable Effective Discovery
                                                  Sitemap protocol

• Used by web crawlers
• Efficiently find all your content & discover
  what has been updated
             http://sitemaps.org/




A sitemap file contains information regarding one or more URLs on
   your Web site. The information that is stored there helps search
   engines better spider your website.
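
As an illustration, a sitemap with one entry can be produced with the Python standard library. The function name and the last-modified date are made up for the example; the namespace is the one defined by the sitemap protocol.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(entries):
    """Build a sitemap document from (url, last_modified) pairs.

    `last_modified` is a date string such as "2011-05-12", or None.
    """
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url, last_modified in entries:
        node = ET.SubElement(urlset, "url")
        ET.SubElement(node, "loc").text = url
        if last_modified:
            ET.SubElement(node, "lastmod").text = last_modified
    return ET.tostring(urlset, encoding="unicode")


xml_text = build_sitemap([
    # The date is a made-up value for illustration.
    ("http://geo.linkeddata.es/resource/Provincia/Madrid", "2011-05-12"),
])
```

The optional `lastmod` element is what lets a crawler discover what has been updated without refetching every URL.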


                                 50
Enable Effective Discovery
Sindice: the best RDF search engine




     51
Enable Effective Discovery
                                                                     sitemap4rdf


• Simple command line tool
• Sends a SPARQL query to list all URIs
• Generates sitemap

 sitemap4rdf http://yoursite/sparql http://yoursite/resource/

 Example:

 sitemap4rdf http://geo.linkeddata.es/sparql http://geo.linkeddata.es/


• Run sitemap4rdf specifying the SPARQL endpoint
  and the prefix of the URLs to include in the Sitemap

  http://lab.linkeddata.deri.ie/2010/sitemap4rdf/
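
The kind of query such a tool could send is sketched below. This is not the query sitemap4rdf actually issues; the function names are hypothetical, and the use of STRSTARTS assumes an endpoint that supports SPARQL 1.1.

```python
from urllib.parse import urlencode


def list_uris_query(prefix):
    """SPARQL query selecting distinct subject URIs that start with `prefix`."""
    escaped = prefix.replace("\\", "\\\\").replace('"', '\\"')
    return (
        "SELECT DISTINCT ?s WHERE { ?s ?p ?o . "
        'FILTER(isIRI(?s) && STRSTARTS(STR(?s), "%s")) }' % escaped
    )


def query_url(endpoint, prefix):
    """URL for sending the query to a SPARQL endpoint over HTTP GET."""
    return endpoint + "?" + urlencode({"query": list_uris_query(prefix)})


url = query_url("http://geo.linkeddata.es/sparql", "http://geo.linkeddata.es/")
```

The prefix filter matters because an endpoint usually also holds URIs from other sites (link targets), which do not belong in your own sitemap.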


                                                    52
Enable Effective Discovery
                    Submit the sitemap location - Sindice

• http://sindice.com/main/submit




                           53
Enable Effective Discovery
                   Submit the sitemap location - Google

• https://www.google.com/webmasters/tools/




                         54
ToC




• Introduction to Linked Data

• Guidelines for Publishing Linked Data

• Demo




                           55
DEMO
http://geo.linkeddata.es/browser




              56
Provinces




57
Capital of Province




58
Provinces – Industry Production Index




 59
Beaches




60
DEMO
http://webenemasuno.linkeddata.es/




                61
Trips




62
Guide Locations




63
Guide




64
Future Work




65

Mais conteúdo relacionado

Mais procurados

RDFa From Theory to Practice
RDFa From Theory to PracticeRDFa From Theory to Practice
RDFa From Theory to PracticeAdrian Stevenson
 
Solid: An Ecology of Digital Being [@SLA Europe October 28, 2020]
Solid: An Ecology of Digital Being [@SLA Europe October 28, 2020]Solid: An Ecology of Digital Being [@SLA Europe October 28, 2020]
Solid: An Ecology of Digital Being [@SLA Europe October 28, 2020]Teodora Petkova
 
Питер Мика "Making the web searchable"
Питер Мика "Making the web searchable"Питер Мика "Making the web searchable"
Питер Мика "Making the web searchable"Yandex
 
Query Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked DataQuery Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked DataOlaf Hartig
 
Linking Data: The Legal Implications - SemTech2010
Linking Data: The Legal Implications - SemTech2010Linking Data: The Legal Implications - SemTech2010
Linking Data: The Legal Implications - SemTech2010mleyden
 
Transcript - DOIs to support citation of grey literature
Transcript - DOIs to support citation of grey literatureTranscript - DOIs to support citation of grey literature
Transcript - DOIs to support citation of grey literatureARDC
 
Blogs, Wikis, & Flickr: Oh My!
Blogs, Wikis, & Flickr: Oh My!Blogs, Wikis, & Flickr: Oh My!
Blogs, Wikis, & Flickr: Oh My!GenealogyMedia.com
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
 
The right to be forgotten Bill Hannay
The right to be forgotten  Bill HannayThe right to be forgotten  Bill Hannay
The right to be forgotten Bill HannayCharleston Conference
 
SchemEX -- Building an Index for Linked Open Data
SchemEX -- Building an Index for Linked Open DataSchemEX -- Building an Index for Linked Open Data
SchemEX -- Building an Index for Linked Open DataAnsgar Scherp
 
What can linked data do for digital libraries
What can linked data do for digital librariesWhat can linked data do for digital libraries
What can linked data do for digital librariesSören Auer
 
Linked Data Tutorial (Florianópolis)
Linked Data Tutorial (Florianópolis)Linked Data Tutorial (Florianópolis)
Linked Data Tutorial (Florianópolis)Oscar Corcho
 
Cloud Computing and Genealogical Collaboration
Cloud Computing and Genealogical CollaborationCloud Computing and Genealogical Collaboration
Cloud Computing and Genealogical CollaborationGenealogyMedia.com
 

Mais procurados (16)

RDFa From Theory to Practice
RDFa From Theory to PracticeRDFa From Theory to Practice
RDFa From Theory to Practice
 
Solid: An Ecology of Digital Being [@SLA Europe October 28, 2020]
Solid: An Ecology of Digital Being [@SLA Europe October 28, 2020]Solid: An Ecology of Digital Being [@SLA Europe October 28, 2020]
Solid: An Ecology of Digital Being [@SLA Europe October 28, 2020]
 
LOD2 Webinar Series: PoolParty
LOD2 Webinar Series: PoolPartyLOD2 Webinar Series: PoolParty
LOD2 Webinar Series: PoolParty
 
Питер Мика "Making the web searchable"
Питер Мика "Making the web searchable"Питер Мика "Making the web searchable"
Питер Мика "Making the web searchable"
 
Query Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked DataQuery Processing and Trustworthiness in the Web of Linked Data
Query Processing and Trustworthiness in the Web of Linked Data
 
Linking Data: The Legal Implications - SemTech2010
Linking Data: The Legal Implications - SemTech2010Linking Data: The Legal Implications - SemTech2010
Linking Data: The Legal Implications - SemTech2010
 
Transcript - DOIs to support citation of grey literature
Transcript - DOIs to support citation of grey literatureTranscript - DOIs to support citation of grey literature
Transcript - DOIs to support citation of grey literature
 
Blogs, Wikis, & Flickr: Oh My!
Blogs, Wikis, & Flickr: Oh My!Blogs, Wikis, & Flickr: Oh My!
Blogs, Wikis, & Flickr: Oh My!
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
 
LOD2: State of Play WP5 - Linked Data Visualization, Browsing and Authoring
LOD2: State of Play WP5 - Linked Data Visualization, Browsing and AuthoringLOD2: State of Play WP5 - Linked Data Visualization, Browsing and Authoring
LOD2: State of Play WP5 - Linked Data Visualization, Browsing and Authoring
 
NISO DCMI Webinar bibframe-20130123
NISO DCMI Webinar bibframe-20130123NISO DCMI Webinar bibframe-20130123
NISO DCMI Webinar bibframe-20130123
 
The right to be forgotten Bill Hannay
The right to be forgotten  Bill HannayThe right to be forgotten  Bill Hannay
The right to be forgotten Bill Hannay
 
SchemEX -- Building an Index for Linked Open Data
SchemEX -- Building an Index for Linked Open DataSchemEX -- Building an Index for Linked Open Data
SchemEX -- Building an Index for Linked Open Data
 
What can linked data do for digital libraries
What can linked data do for digital librariesWhat can linked data do for digital libraries
What can linked data do for digital libraries
 
Linked Data Tutorial (Florianópolis)
Linked Data Tutorial (Florianópolis)Linked Data Tutorial (Florianópolis)
Linked Data Tutorial (Florianópolis)
 
Cloud Computing and Genealogical Collaboration
Cloud Computing and Genealogical CollaborationCloud Computing and Genealogical Collaboration
Cloud Computing and Genealogical Collaboration
 

Destaque

Towards a Commons RDF Java library
Towards a Commons RDF Java libraryTowards a Commons RDF Java library
Towards a Commons RDF Java librarySergio Fernández
 
SEEMP - Semantic Aspects and Interoperability
SEEMP - Semantic Aspects and InteroperabilitySEEMP - Semantic Aspects and Interoperability
SEEMP - Semantic Aspects and InteroperabilityBoris Villazón-Terrazas
 
Methodological Guidelines for Publishing Linked Data
Methodological Guidelines for Publishing Linked DataMethodological Guidelines for Publishing Linked Data
Methodological Guidelines for Publishing Linked DataBoris Villazón-Terrazas
 
A Method for Reusing and Re-engineering Non-ontological Resources for Buildin...
A Method for Reusing and Re-engineering Non-ontological Resources for Buildin...A Method for Reusing and Re-engineering Non-ontological Resources for Buildin...
A Method for Reusing and Re-engineering Non-ontological Resources for Buildin...Boris Villazón-Terrazas
 
Linguistic resources enhanced with geospatial Information
Linguistic resources enhanced with geospatial InformationLinguistic resources enhanced with geospatial Information
Linguistic resources enhanced with geospatial InformationBoris Villazón-Terrazas
 

Methodological Guidelines for Publishing Linked Data

  • 1. Methodological Guidelines for Publishing Linked Data Boris Villazón-Terrazas, Asunción Gómez-Pérez, and Óscar Corcho Facultad de Informática, Universidad Politécnica de Madrid Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid http://www.oeg-upm.net {bvillazon,asun,ocorcho}@fi.upm.es Phone: 34.91.3366605, Fax: 34.91.3524819 CONSEGI 2011 – Brasília, Brazil 12th May, 2011
  • 2. ToC • Introduction to Linked Data • Guidelines for Publishing Linked Data • Demo
  • 3. ToC • Introduction to Linked Data • Guidelines for Publishing Linked Data • Demo
  • 4. Classic Web: MovieDB and CIA World FactBook • Data exposed to the Web via HTML, PDF, etc. © Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig
  • 5. Classic Web • Information from single pages can be found via search engines • Complex queries over multiple pages / data sources?? © Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig
  • 6. What do we actually want? • Use the Web like a single global database (MovieDB, CIA World FactBook) © Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig
  • 7. Linked Data enables such a Web of Data • Global Identifier: URI (Uniform Resource Identifier), a string of characters used to identify a name or a resource on the Internet • Data Model: RDF (Resource Description Framework), a standard model for data interchange on the Web • Access Mechanism: HTTP • Connection: Typed Links • Example (CIA World FactBook and MovieDB): http://cia.../Bolivia has http://.../population 8000000; http://imdb.../TLLuvia has http://.../name “Even the Rain” and http://.../filming_location http://cia.../Bolivia © Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig
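The example on slide 7 can be sketched in a few lines of Python: each statement is a subject, a predicate and an object, and a typed link is simply a statement whose object is another URI. This is only an illustration. The slide elides its URIs ("http://cia.../Bolivia"), so every `example.org` URI below is a hypothetical placeholder, not the real identifier, and `to_ntriples` is a helper invented for this sketch.

```python
# Hypothetical placeholder URIs: the slide elides the real ones.
BOLIVIA = "http://example.org/factbook/Bolivia"
MOVIE = "http://example.org/moviedb/TLLuvia"
POPULATION = "http://example.org/vocab/population"
NAME = "http://example.org/vocab/name"
FILMING_LOCATION = "http://example.org/vocab/filming_location"

# (subject URI, predicate URI, object, object_is_uri)
# The last triple is a typed link: its object is another resource's URI.
triples = [
    (BOLIVIA, POPULATION, "8000000", False),
    (MOVIE, NAME, "Even the Rain", False),
    (MOVIE, FILMING_LOCATION, BOLIVIA, True),
]


def to_ntriples(subject, predicate, obj, obj_is_uri):
    """Serialize one triple as a line of N-Triples (plain literals only)."""
    if obj_is_uri:
        obj_term = f"<{obj}>"
    else:
        # Escape backslashes and double quotes inside the literal.
        escaped = obj.replace("\\", "\\\\").replace('"', '\\"')
        obj_term = f'"{escaped}"'
    return f"<{subject}> <{predicate}> {obj_term} ."


lines = [to_ntriples(*t) for t in triples]
print("\n".join(lines))
```

The third line is what connects the two data sources: a client that reads the movie description can follow the Bolivia URI over HTTP to find the population figure.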
  • 8. In a nutshell • An extension of the current Web… • … where information and services data are given well-defined and explicitly represented meaning, … • … so that it can be shared and used by humans and machines, ... • ... better enabling them to work in cooperation • How? • Promoting information exchange by tagging web content with machine processable descriptions of its meaning. • And technologies and infrastructure to do this • And clear principles on how to publish data
  • 9. The four principles (Tim Berners-Lee, 2006) 1. Use URIs as names for things 2. Use HTTP URIs so that people can look up those names. 3. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) 4. Include links to other URIs, so that they can discover more things. • http://www.w3.org/DesignIssues/LinkedData.html • http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html
  • 10. So does that mean I have to publish my data as Linked Data, now? • But, why? • What was your incentive to publish an HTML page in 1990? • Share data in documents and because your neighbor was doing it • So, why should we publish Linked Data in 2011? • Share data as data and because your neighbor is doing it © Slide adapted from “Introduction to Linked Data”- Juan Sequeda
  • 11. And guess who is starting to publish Linked Data now? • UK Government • US Government • BBC • Open Calais • Freebase • NY Times • CNET • DBpedia • …
  • 12. Linked Open Data evolution: 2007, 2008, 2009
  • 14. ToC • Introduction to Linked Data • Guidelines for Publishing Linked Data • Demo
  • 15. Linked Data in OEG • GeoLinkedData is an open initiative whose aim is to enrich the Web of Data with Spanish geospatial data. http://geo.linkeddata.es • El Viajero Linked Data is a project that focuses on the integration of the contents produced by newspapers and digital platforms belonging to Prisa Group. http://webenemasuno.linkeddata.es/ • A project with the Biblioteca Nacional to publish the library information as Linked Data. http://cultura.linkeddata.es/visualizer/
  • 16. Linked Data in OEG • Tools for generating and consuming Linked Data, e.g., • geometry2rdf http://www.oeg-upm.net/index.php/downloads/151-geometry2rdf • map4rdf http://oegdev.dia.fi.upm.es/projects/map4rdf/ • Spanish Thematic Network of Linked Data http://red.linkeddata.es » Group leader: Ontology Engineering Group » 19 Research Groups » 4 companies
  • 17. Guidelines for Publishing Linked Data
  • 18. Guidelines for Publishing Linked Data
  • 19. Identification of the data sources • Guidelines based on the Open Data Manual (http://opendatamanual.org/) • Two possibilities • To find the data sources already available in a public data catalog, e.g., Aporta project (http://aporta.es) • To get an agreement with a particular government body to publish its data sources, e.g., GeoLinkedData - IGN
  • 20. Identification of the data sources: GeoLinkedData • Agreement with the IGN (National Geographic Institute of Spain): Oracle & MySQL • Data sources available in a public data catalog: INE (National Statistics Institute of Spain)
  • 21. Identification of the data sources: IGN & INE (table: Year, Province, Industry Production Index)
  • 22. Guidelines for Publishing Linked Data
  • 23. Vocabulary Modelling: Ontology • An ontology is an engineering artifact, which provides: • A set of terms • A set of explicit assumptions regarding the intended meaning of the terms. • Almost always including concepts and their classification • Almost always including properties between concepts • Shared understanding of a domain of interest
  • 24. Vocabulary Modelling: Reuse available vocabularies • Search for suitable vocabularies (Linked Open Vocabularies) • Are there suitable vocabularies? • Yes: build the vocabulary by reusing available vocabularies • No: …
  • 25. Vocabulary Modelling: Reuse available non-ontological resources • Search for suitable non-ontological resources (highly reliable Web sites, domain-related sites, government catalogs) • Are there suitable resources? • Yes: build the vocabulary by transforming available resources • No: build the vocabulary from scratch
  • 26. Vocabulary Modelling: GeoLinkedData • WGS84 Geo Positioning: an RDF vocabulary • scv:Dimension, scv:Item, scv:Dataset • hydrographical phenomena (rivers, lakes, etc.) • Vocabulary for instants, intervals, durations, etc. • Names and international code systems for territories and groups • Ontology for OGC Geography Markup Language • Classes: 33 • Object Properties: 44 • Data Properties: 318 • http://neon-toolkit.org/
  • 27. Guidelines for Publishing Linked Data
  • 28. Generation of the RDF Data • NOR2O: INE • ODEMapster: IGN • Geometry2RDF: IGN geospatial column
  • 29. Generation of the RDF Data: NOR2O (table: Year, Province, Industry Production Index)
  • 30. Generation of the RDF Data: R2O & ODEMapster • R2O is an extensible, fully declarative language to describe mappings between relational database schemas and ontologies. • The ODEMapster processor generates RDF instances from relational instances based on the mapping description expressed in the R2O document • www.oeg-upm.net/index.php/en/downloads/9-r2o-odempaster
  • 31. Generation of the RDF Data: R2O & ODEMapster • Creation of the R2O Mappings
  • 32. Generation of the RDF Data: R2O & ODEMapster • Excerpt of the R2O document
  • 33. Generation of the RDF Data: geometry2rdf • Tool for generating RDF from geometrical information • The geometry could be available in GML or WKT • The RDF generated follows our Geometry Model • http://www.oeg-upm.net/index.php/en/downloads/151-geometry2rdf
  • 34. Generation of the RDF Data: geometry2rdf • Oracle SDO_UTIL package: SELECT TO_CHAR(SDO_UTIL.TO_GML311GEOMETRY(geometry)) AS Gml311Geometry FROM "BCN200"."BCN200_0301L_RIO" c WHERE c.Etiqueta='Arroyo'
  • 35. Generation of the RDF Data geometry2rdf
  • 36. Generation of the RDF Data: Geometry Model • Prefixes: geoes: http://geo.linkeddata.es/ and geo: http://www.w3.org/2003/01/geo/wgs84_pos# • geo:Point, geoes:ontology/Curva and geoes:ontology/Polígono are each rdfs:subClassOf geoes:ontology/Geometría • geo:Point has geo:lat and geo:long (the diagram shows the value 39 on each) • geoes:ontology/Curva: formadoPor a collection of 2 or more geo:Points • geoes:ontology/Polígono: formadoPor a collection of 3 or more geo:Points
  • 37. Generation of the RDF Data • RDF generated according to our Geometry Model
  • 38. Generation of the RDF Data: URI Generation • URIs are extremely relevant in this process since they are the key for the alignment of heterogeneous resources that come from different data sources. • Cool URIs (http://www.w3.org/TR/cooluris/) • UK Cabinet Office (http://www.cabinetoffice.gov.uk/media/301253/puiblic sector uri.pdf) • Examples: http://geo.linkeddata.es/ontology/{class/property} http://geo.linkeddata.es/ontology/Lago http://geo.linkeddata.es/resource/dataset/type/{resourcename} http://geo.linkeddata.es/resource/Provincia/Madrid
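The URI patterns on slide 38 can be sketched as two small Python helpers. This is a minimal illustration, not the project's actual tooling: the function names `ontology_uri` and `resource_uri` are invented here, and the resource pattern follows the shape of the slide's concrete example (`/resource/Provincia/Madrid`) rather than the longer `dataset/type` template, which is an assumption. Percent-encoding uses the standard library's `urllib.parse.quote`.

```python
from urllib.parse import quote

BASE = "http://geo.linkeddata.es"


def ontology_uri(term):
    """URI for a class or property, e.g. .../ontology/Lago."""
    return f"{BASE}/ontology/{quote(term, safe='')}"


def resource_uri(resource_type, name):
    """URI for an instance, e.g. .../resource/Provincia/Madrid.

    Assumption: type and name are single path segments, so every reserved
    character (including '/') is percent-encoded.
    """
    return f"{BASE}/resource/{quote(resource_type, safe='')}/{quote(name, safe='')}"


print(ontology_uri("Lago"))
print(resource_uri("Provincia", "Madrid"))
# Non-ASCII names are UTF-8 percent-encoded, matching the Málaga example on slide 43.
print(resource_uri("Provincia", "Málaga"))
```

Keeping URI construction in one place makes it easier to produce the same identifier for the same entity across data sources, which is the alignment concern the slide raises.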
  • 39. Generation of the RDF Data: Provenance Information • It is relevant • to manage the provenance information of the resources • to establish the license of the information • Example: Pubby http://www4.wiwiss.fu-berlin.de/pubby/
  • 40. Guidelines for Publishing Linked Data
  • 41. Publication of the RDF data • Virtuoso 6.1.0: SPARQL • Pubby 0.3 (http://www4.wiwiss.fu-berlin.de/pubby/): HTML and Linked Data, including provenance support • map4rdf (http://oegdev.dia.fi.upm.es/projects/map4rdf/)
  • 42. Guidelines for Publishing Linked Data
  • 43. Data Cleansing • To find possible errors, identified by Hogan et al. • HTTP-level issues such as accessibility and dereferenceability, e.g., HTTP URIs return 40x/50x errors • reasoning issues such as namespace without vocabulary, e.g., rss:item term invented • malformed/incompatible datatypes, e.g., “true” as xsd:int • To fix the identified errors • Example, encoding URIs • Special characters: á, é, ñ • http://geo.linkeddata.es/resource/Provincia/M%C3%A1laga
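Two of the checks listed on slide 43 are easy to illustrate in Python: flagging HTTP error statuses and detecting a literal whose lexical form does not fit its declared datatype. This is a hedged sketch, not the procedure used by Hogan et al. or by the authors; `is_http_error` and `is_valid_xsd_int` are names invented for this example, and the status code is passed in directly so the sketch needs no network access.

```python
import re

# xsd:int lexical form: optional sign followed by decimal digits,
# with a value that fits in a signed 32-bit integer.
_INT_PATTERN = re.compile(r"[+-]?[0-9]+")
INT_MIN, INT_MAX = -2**31, 2**31 - 1


def is_valid_xsd_int(lexical):
    """Return True if the string is a valid xsd:int lexical form."""
    if not _INT_PATTERN.fullmatch(lexical):
        return False
    return INT_MIN <= int(lexical) <= INT_MAX


def is_http_error(status_code):
    """Return True for 4xx/5xx responses, the slide's '40x/50x errors'."""
    return 400 <= status_code <= 599


# The slide's example: "true" typed as xsd:int is malformed.
for value in ["42", "true", "99999999999"]:
    print(value, is_valid_xsd_int(value))
```

A real cleansing pass would run such checks over every literal and every dereferenced URI in the dataset and report the failures for manual or automatic repair.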
  • 44. Guidelines for Publishing Linked Data
  • 45. Linking the RDF Data • Identify suitable data sets as linking targets: http://ckan.net • Discover relationships between data items: LIMES (http://aksw.org/Projects/limes), Silk Framework (http://www4.wiwiss.fu-berlin.de/bizer/silk/) • Validate the relationships discovered: sameAs Validator (http://oegdev.dia.fi.upm.es:8080/sameAs/)
  • 46. Linking the RDF Data: GeoLinkedData • GeoLinkedData: http://geo.linkeddata.es/.../Madrid • DBpedia: http://dbpedia.org/resource/Madrid • GeoNames: http://sws.geonames.org/6355233/
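The links on slide 46 are typically expressed as `owl:sameAs` statements. The Python sketch below writes them as N-Triples lines. It rests on two assumptions that the slide does not confirm: the slide elides the middle of the GeoLinkedData URI, so the URI from slide 38 is used as a stand-in, and the links are assumed to have already passed validation. `same_as_links` is a helper invented for this illustration.

```python
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"

# Assumption: stand-in for the elided http://geo.linkeddata.es/.../Madrid URI.
LOCAL = "http://geo.linkeddata.es/resource/Provincia/Madrid"

# Link targets as shown on the slide.
TARGETS = [
    "http://dbpedia.org/resource/Madrid",
    "http://sws.geonames.org/6355233/",
]


def same_as_links(subject, objects):
    """Return one owl:sameAs N-Triples line per target URI."""
    return [f"<{subject}> <{OWL_SAME_AS}> <{obj}> ." for obj in objects]


links = same_as_links(LOCAL, TARGETS)
print("\n".join(links))
```

Because `owl:sameAs` asserts that both URIs identify the same thing, a wrong link propagates wrong facts, which is why the slides place a validation step before publication.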
  • 47. Linking the RDF Data sameAs Validator http://oegdev.dia.fi.upm.es:8080/sameAs/ 47
  • 48. Guidelines for Publishing Linked Data
  • 49. Enable Effective Discovery: Register the dataset in the CKAN Registry • Add the dataset to CKAN, the open registry of data and content packages • Minimum information • Name: unique ID for your data set on CKAN • Title: full name of your data set • URL: link to the data set home page • http://www.w3.org/wiki/TaskForces/CommunityProjects/LinkingOpenData/DataSets/CKANmetainformation
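A small check for the minimum information could look like the following sketch. The lowercase keys `name`, `title`, and `url` mirror the three fields listed on the slide; the sample values and the helper name are hypothetical, and no CKAN API call is made.

```python
# Fields listed on the slide as the minimum information for a CKAN entry.
REQUIRED_FIELDS = ("name", "title", "url")


def missing_fields(metadata: dict) -> list:
    """Return the required fields that are absent or blank in metadata."""
    return [
        field
        for field in REQUIRED_FIELDS
        if not str(metadata.get(field) or "").strip()
    ]


# Hypothetical example values, for illustration only.
dataset = {
    "name": "geolinkeddata",
    "title": "GeoLinkedData",
    "url": "http://geo.linkeddata.es/",
}
print(missing_fields(dataset))
```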
  • 50. Enable Effective Discovery: Sitemap protocol • Used by web crawlers • Lets them efficiently find all your content and discover what has been updated • http://sitemaps.org/ • A sitemap file contains information regarding one or more URLs on your Web site. The information stored there helps search engines spider your website better.
  • 51. Enable Effective Discovery Sindice: the best RDF search engine
  • 52. Enable Effective Discovery: sitemap4rdf • Simple command-line tool • Sends a SPARQL query to list all URIs • Generates the sitemap • Run sitemap4rdf specifying the SPARQL endpoint and the prefix of the URLs to include in the sitemap • Usage: sitemap4rdf http://yoursite/sparql http://yoursite/resource/ • Example: sitemap4rdf http://geo.linkeddata.es/sparql http://geo.linkeddata.es/ • http://lab.linkeddata.deri.ie/2010/sitemap4rdf/
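To illustrate the kind of file such a tool produces, the sketch below builds a sitemap from a list of URIs filtered by prefix. It is not sitemap4rdf's implementation: the SPARQL step is replaced by a hard-coded list, and the function name is hypothetical. The `urlset`, `url`, and `loc` elements and the namespace follow the Sitemap protocol.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(uris, prefix):
    """Return sitemap XML listing every distinct URI that starts with prefix."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for uri in sorted(set(uris)):
        if uri.startswith(prefix):
            url = ET.SubElement(urlset, "url")
            ET.SubElement(url, "loc").text = uri
    return ET.tostring(urlset, encoding="unicode")


# Stand-in for the result of a SPARQL query listing all URIs.
found = [
    "http://geo.linkeddata.es/resource/Provincia/Madrid",
    "http://geo.linkeddata.es/resource/Provincia/M%C3%A1laga",
    "http://dbpedia.org/resource/Madrid",
]
sitemap_xml = build_sitemap(found, "http://geo.linkeddata.es/")
print(sitemap_xml)
```

The prefix filter keeps URIs from other data sets, such as link targets, out of the sitemap.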
  • 53. Enable Effective Discovery Submit the sitemap location - Sindice • http://sindice.com/main/submit
  • 54. Enable Effective Discovery Submit the sitemap location - Google • https://www.google.com/webmasters/tools/
  • 55. ToC • Introduction to Linked Data • Guidelines for Publishing Linked Data • Demo
  • 59. Provinces – Industry Production Index
  • 67. Methodological Guidelines for Publishing Linked Data • Boris Villazón-Terrazas, Asunción Gómez-Pérez, and Óscar Corcho • Facultad de Informática, Universidad Politécnica de Madrid, Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid • http://www.oeg-upm.net • {bvillazon,asun,ocorcho}@fi.upm.es • Phone: 34.91.3366605, Fax: 34.91.3524819 • CONSEGI 2011 – Brasília, Brazil, 12th May, 2011