SlideShare uma empresa Scribd logo
1 de 50
Baixar para ler offline
Semantic Representations
     for Research
          Rinke Hoekstra and Stefan Schlobach
   VU University Amsterdam/University of Amsterdam
               http://www.data2semantics.org
About us...
•   Knowledge Representation and Reasoning Group
    Frank van Harmelen


•   Modeling of complex domains
•   Querying and reasoning over these
    models
•   ... at a very large scale (the Web)
About us...
•   Knowledge Representation and Reasoning Group
    Frank van Harmelen



•   Experience
    a.o. CATCH, STICH, LarKC, CEDAR and Data2Semantics

•   Premier group for provenance and linked data at scale
Overview
•   Research Lifecycle
    Data2Semantics and LarKC


•   Historical Census Data
    CEDAR and Data2Semantics


•   Short Title Catalogue of The Netherlands (STCN)
    Inger Leemans, Fernie Maas, Paul Huygen, Albert Meroño-Peñuela
How to share, publish, access, analyse, interpret and reuse data?




    Increase the ease of sharing scientific data ...
    ... of accessing, analysing and interpreting data ...
    ... and thereby increasing the reuse of data
EASY Data Repository
Enrich datasets: census data
EASY Data Repository
Enrich datasets: census data

Large volumes of publications
Improve services to clients
Automated services
EASY Data Repository
Enrich datasets: census data

Large volumes of publications
Improve services to clients
Automated services

Build systems for hospitals
EASY Data Repository
Enrich datasets: census data

Large volumes of publications
Improve services to clients
Automated services

Build systems for hospitals
Linked Data
•   “Semantic Hyperlinks” between data items


•   Every data item has a global identifier ...
•   ... that looks like a web address (URI) ...
•   ... is linked and described using shared vocabularies


•   Resource Description Framework (RDF)
•   SPARQL query language & endpoint
Linked Data
                                                                                                                                                                                                                  Linked
                                                                                                                                                                                                      LOV          User            Slideshare         tags2con
                                                                                                                                                                                    Audio
                                                                                                                                                                                                                 Feedback             2RDF            delicious
                                                                                                                                                                 Moseley          Scrobbler                                                                             Bricklink        Sussex
                                                                                                                                                                  Folk             (DBTune)                                                                                              Reading            St.
                                                                                                                                                 GTAA
                                                                                                                                Magna-                                                                                                                                                    Lists          Andrews
                                                                                                                                                                                                                          Klapp-
                                                                                                                                 tune                                                                                     stuhl-                                                                         Resource           NTU
                                                                                                                 DB                                                                                                        club                                                                            Lists          Resource
                                                                                                               Tropes                                                                                       Lotico                         Semantic        yovisto
                                                                                                                                          John                     Music                                                                                                     Man-                                           Lists
                                                                                                                                                                                        Music                                               Tweet                           chester
                                                                                           Hellenic                                       Peel                     Brainz                                                                                                                                                                       NDL
                                                                                                                                         (DBTune)                  (Data                Brainz                                                                              Reading
                                                                                                                                                                                                                                                                                                                                              subjects
                                                                                            FBD                                                                                        (zitgist)                                                                             Lists                     Open
                                                                                                           EUTC                                                  Incubator)                                                Linked
                                                                          Hellenic                                                                                                                                                                                                                    Library                    Open                           t4gm
                                                                                                          Produc-                                                                                                         Crunch-
                                                                            PD                                               Surge                                                                       RDF                                                                                                                                                     info
                                                                                                           tions
                                                                                                                                                 Discogs                                                                    base                                                                                                Library
                                                                                                                             Radio                                                                                                          Ontos          Source Code
                                                             Crime                                                                                                                                      ohloh                                                                          Plymouth                                 (Talis)
                                                                                                                                                   (Data                                                                                    News                                                                                                                            LEM
                                                                                                                                                                                                                                                            Ecosystem                   Reading                                                   RAMEAU
                                                            Reports                           business                                           Incubator)
                                                                                  Crime       data.gov.                                                                                                                                     Portal         Linked Data                    Lists                                                     SH
                                                              UK                                                                                                      Music             Jamendo
                                                                                   (En-          uk
                                                                                                                                                                     Brainz             (DBtune)                                                                                                              LinkedL
                                                Ox                               AKTing)                      FanHubz                                                                                                 gnoss                                                                                                                                                                  ntnusc
                                                                                                                                                                    (DBTune)                                                                                                    SSW                             CCN




                                                •
                                               Points                                                                                                                                                                                                                                                                             Thesau-
                                                                                                                                Last.FM                                                              Poké-                                                                     Thesaur
                                                                      Popula-                                                    artists                                                                                                       Didactal                          us                                                rus W



                                                                     “Semantic Hyperlinks” between data items
                                                                                                                                                                                                     pédia                                                                                                                                                                  LIBRIS
                                                                     tion (En-                                                  (DBTune)                Last.FM                                                                                   ia                                               theses.                                                LCSH                                         Rådata
                                  reegle                                                research          patents                                                                                                                                                                                                       MARC
                                                                      AKTing)                                                                           (rdfize)                                                                                                    my                                fr                                                                                                nå!
                                                                                        data.gov.         data.go                                                                                                                                                                                                       Codes
                       Ren.
                                                     NHS                                   uk              v.uk                                                                                                 Good-                                             Experi-
                                                                                                                                                                           Classical                                                                                                                                     List
                      Energy                         (En-                                                                                                                                                        win               flickr                          ment
                                                                                                                                                                             (DB             Pokedex                                                                                                                                                                                                             Norwe-
                      Genera-                       AKTing)                Mortality                                           BBC                                                                              Family            wrappr                                            Sudoc                                               PSH
                                                                                                                                                                            Tune)                                                                                                                                                                                                                                 gian
                                                                            (En-
                       tors                                                                                                  Program                                                                                                                                                                                                                                                                              MeSH
                                                                           AKTing)                                                                                                                                                              semantic
                                                                                                                               mes                  BBC                                                                                                                                                IdRef                                                               GND
                                                            CO2                         educatio          OpenEI                                                                                                                                web.org               SW
                                         Energy                                                                                                                                                                                                                                                        Sudoc                                           ndlna
                                                          Emission                      n.data.g                                                    Music                                                                                                            Dog                                                                                                                              VIAF
               EEA                        (En-                                                                                                                     Chronic-                             Linked
                                                            (En-                         ov.uk                                                                                                                            Portu-                                     Food                                                             UB
                                         AKTing)                                                                                                                     ling              Event             MDB
                                                          AKTing)                                                                                                                                                         guese                                                                                                      Mann-                                                                              Europeana
                                                                                                                                       BBC                         America             Media
                                                                                                                                                                                                                         DBpedia                                                                                     Calames         heim
                                                                           Ord-                                       Recht-          Wildlife                                                                                                                                                                                                                      Deutsche
               Open                                                                                                                                                                                                                          Revyu                                     DDC
                                                                                                     Openly           spraak.         Finder                                                                                                                                                                                                                           Bio-              lobid
              Election                                                    nance
                                 legislation                                                          Local              nl                                                                                                                                     RDF                                                                                                  graphie
                                                                                                                                                                                                                                                                                                                                                                                    Resources                 NSZL




                                                •
                Data                                                      Survey                                                                     Tele-                                                                                                                                            data                                            Ulm                                                                        Swedish
     EU                                                                                                                                                                     New                                                                                 Book
              Project           data.gov.uk                                                                                                         graphis                                                                                                                                           bnf.fr                                                                                                 Catalog              Open
    Insti-                                                                                                                                                                  York                                                                               Mashup



                                                                     Every data item has a global identifier ...
   tutions                                                                                                                                                                                    URI                Greek            Open                                                                                            P20                                                                                            Cultural
                                                        UK Post-                                                                                                           Times
                                                                                                                                                                                             Burner             DBpedia           Calais                                                                                                                                                                                         Heritage
                                                         codes                          statistics                                                                                                                                                                                                                                                          ECS             Wiki                    lobid
                GovWILD                                                                 data.gov.                                 Taxon                                                                                                        iServe                                                                                                      South-                                  Organi-
                                                                                                              LOIUS                                                                                                                                                                 BNB
Brazilian
                                                                                           uk                                    Concept                                                                                                                                                                                 ECS                               ampton                                  sations
                                                                                                                                                         Geo                 World                                                                                 OS                                 BibBase                                                                                                          STW            GESIS
  Poli-                           ESD                                                                                                                                                                                                                                                                                   South-                ECS
                                                                                                                                                        Names                Fact-                                                                                                                                                          (RKB
 ticians                         stan-         reference                                                                                                                                                                                                                                                                ampton
                                                                                                                                                                             book




                                                •
                                                                     data.gov.uk                                                                                                               Freebase                                                                                                                                   Explorer)                                  Budapest
                                 dards         data.gov.                                                               NASA                                                                                                                                                                                             EPrints
                                                   uk                 intervals                                                                                                                                                                      Project                                                                                                        OAI
                 Lichfield                                                                                             (Data



                                                                     ... that looks like a web address (URI) ...
                                                                                               transport                                                                                                                  DBpedia                                       data                                                                                                                             Pisa
                  Spen-                                                                                                Incu-                                                                                                                         Guten-             dcs
                                                                                               data.gov.                                                                                                                                                                                                                                                                                                                RESEX         Scholaro-
    ISTAT          ding                                                                                                bator)               Fishes                                                                                                    berg                              DBLP                 DBLP
                                                                                                  uk                                                            Geo
                                                                                                                                                                                                                                                                                                                                                                                                                                       meter
   Immi-                        Scotland                                                                                                   of Texas                                                                                                                                      (FU                 (L3S)
                                Pupils &                                                                                                                                       Uberblic                                                                                                                                      DBLP
   gration                                                                                                                                                     Species                                                                                                                 Berlin)                                                                                      IRIT
                                 Exams                                                                        Euro-                                                                                    dbpedia                                                     data-                                                     (RKB
                                               London                                                                                                                                                                                        TCM                                                                                                    ACM
                                                                                                               stat                                                                                      lite                                                      open-                                                   Explorer)                                                                                            NVD
                                               Gazette                                                        (FUB)                                                                                                                          Gene                                                                                                                                                       IBM
                      Traffic                                                                                                     Geo                                                                                                                              ac-uk




                                                •
                     Scotland                                      TWC LOGD                Eurostat                                                                                                                       Daily               DIT
                                                                                                                                 Linked                                                                                                                                                                    UN/
   Data                                                                                                                                             UMBEL                                                                 Med                                                                ERA
                                                                                                                                  Data                                                                                                                                                                   LOCODE



                                                                     ... is linked and described using shared vocabularies
                                                                                                                                                                                                                                                                                                                                                                                                                   DEPLOY
   Gov.ie                         CORDIS                                                                                                                             YAGO                                                                                                                                                                                                           New-
                                                                                                                                                                                          lingvoj                                                           Disea-
                                   (RKB                                                                                                                                                                                                                     some               SIDER                                                                         RAE2001                castle                                        LOCAH
                CORDIS           Explorer)                                                                   Linked                                                                                                                                                                                                   Eurécom
                                                                                   Eurostat                                                                                                                                                 Drug                                                                                   CiteSeer                                                            Roma
                 (FUB)                                                                                    Sensor Data
                                                   GovTrack                       (Ontology                (Kno.e.sis)                                    Open                                                                              Bank                                                      Pfam                                                                                                         Course-
                                                                                   Central)                                          riese                                                           Enipedia
                                                                                                                                                           Cyc              Lexvo                                      LinkedCT                                                                                                                                                                                     ware
                                    Linked                                                                                                                                                                                                                                          PDB
                                                                                                                                                                                                                                                           UniProt                                                         VIVO
             EURES                 EDGAR                                                                                                                                                                                                                                                                                                                                         dotAC
                                                                     US SEC                                                                                                                                                                                                                                               Indiana                ePrints                                                IEEE
                                  (Ontology                                                                                                                                             totl.net
                                                                   (rdfabout)
                                   Central)                                                                                                         WordNet                                                                                                                                                                                                                                                             RISKS
                                                                                                                                                     (VUA)                                                        Taxono               UniProt
                                                                                       US Census               EUNIS              Twarql                                                                                                                                                             HGNC
                                                Semantic                                                                                                               Cornetto                                                        (Bio2RDF)
                                                                                       (rdfabout)                                                                                                                   my                                                                                                                   VIVO
                      FTS                         XBRL                                                                                                                                                                                                         PRO-            ProDom                                 STITCH            Cornell                LAAS
                                                                                                                                                                                                                                                               SITE                                                                                                                        KISTI                NSF
                                  Scotland
                                                                GeoWord                                                                                                                       LODE




                                                •
                                    Geo-
                                   graphy                         Net                                                                                WordNet           WordNet                                                                                                                                                                                            JISC
                                                                                                                                                      (W3C)



                                                                     Resource Description Framework (RDF)
                                                                                                           Climbing                                                      (RKB                                                 Affy-                                                                                                KEGG
                                                                                                                                 Linked                                                                                                                                                                                                               VIVO UF
                                                                                         SMC                                                                           Explorer)                              SISVU           metrix                                                                           Pub                 Drug
                                                    Piedmont                           Journals                                 GeoData                                                                                                         PubMed                                        SGD                                                                                                ECCO-
                                    Finnish                                                                                                                                                                                                                               Gene                                Chem
                                    Munici-
                                                    Accomo-               El                                                                                                           AGROV                                                                             Ontology                                                                                                                 TCP                           Media
                                                     dations                                                                                         Alpine                                                                                                                                                                                                             bible
                                    palities                           Viajero                                                                                                          OC
                                                                                                                                                      Ski                                                                                                                                                                                                             ontology
                                                                       Tourism                                                                                                                                                                                                                                                                 KEGG
                                                                                        Ocean
                                                                                                                                                     Austria
                                                                                                                                                                                                                                                                                                                                              Enzyme                                     PBAC                           Geographic
                                                                                                                        Metoffice                                  GEMET                                             ChEMBL




                                                •
                                                           Italian                     Drilling                                                                                                                                        OMIM                                                                                KEGG
                                                                                                                         Weather                                                    Open
                                                            public                     Codices            AEMET                                                                                      Linked                                                                         MGI                                   Pathway
                                                                                                                                                                                    Data                                                                                                                                                                                                                                Publications


                                                                     SPARQL query language & endpoint
                                                           schools                                                      Forecasts                                                                     Open                                                  InterPro                                    GeneID                                                       KEGG
                                                                                                                                                 EARTh                             Thesau-
                                                                         Turismo
                                                                                                                                                                                     rus             Colors                                                                                                                                                         Reaction
                                                                            de
                                                                        Zaragoza                                                                                Product                                                Smart                                                                                                                           KEGG
                                                                                                                                                                                                                                                                                                                                                                                                       User-generated content
                                                                                                                                  Weather                         DB                                                    Link                                                                  Medi                                                     Glycan
                                                                                               Janus                              Stations                                    Product                                                                                                         Care                                       KEGG
                                                                                                AMP                                                                                                                                 UniParc             UniRef              UniSTS                                                                                                                                     Government
                                                                                                                                                                               Types                Italian
                                                                                                                                                                                                                                                                                                                        Homolo           Com-
                                                                                                                    Yahoo!                       Airports                                          Museums                                                                                                                               pound
                                                                                                                                                                              Ontology                          Google
                                                                                                                                                                                                                                                                                                                         Gene
                                                                                                                     Geo                                                                                          Art
                                                                                                                    Planet        National
                                                                                                                                                                                                                wrapper
                                                                                                                                                                                                                                                                                                         Chem2                                                                                                        Cross-domain
                                                                                                                                   Radio-                                                                                                                                                               Bio2RDF
                                                                                                                                  activity                                                                                                                                                UniPath
                                                                                                                                     JP                        Sears                Open                                            Linked                               OGOLOD            way
                                                                                                                                                                                                                                                                                                                                                                                                                       Life sciences
                                                                                                                                                                                   Corpo-           Amster-                                          Reactome
                                                                                                                                                                                                     dam              medu-          Open
                                                                                                                                                                                    rates                                          Numbers
                                                                                                                                                                                                    Museum            cator
                                                                                                                                                                                                                                                                                                                                                                                           As of September 2011
Research Lifecycle
                                                                                   Linked Data
                                                                                                    Cloud$        Analysis and
                                                                                      Cloud                         Metrics

                   acquiring$data$from$text?$                                                                                      Ana
                                                                                                                                    Me
           Semi8
     Semi-Automatic                                                                                              Querying and
        Automa;c$
       Annotation                                                                                                  Ranking
        Annota;on$       e.g.$GATE$
                                                                         Amalgame$                        SILK$
                        OpenCalais$
                                                                                                                                   Que
                                                Graph$Rewri;ng$        Graph$Rewri;ng$
                                                                                                                                  and$R
                                                                                   Link to Other
                             RDF Conversion         Internal Linking                                              Visualization
                                                                                       Data
                                   RDF$              RDF$                     Internal$              Link$to$
                                Conversion$        Cleaning$                   Linking$            Other$Data$
xml2rdf$
  d2rq$                                                                                                                           Visua
rdb2rdf$
     Semi-Automatic                                                                Provenance
   $ Conversion                                                                    Enrichment
                                                                                                              User Interfaces

                                                                                                   Provenance$
                                                                                                   Enrichment$
                                                                                                                                     U
                                                                                                                                  Inte
                                                                                 RDF Feedback
          Semi8
        Automa;c$
                                                        Provenance Tracking
        Conversion$

       “tablinker”$
Challenges

•   Build useful services and tools for data publishers ...
•   ... that maintain provenance information ...
•   ... and cater for the entire research cycle ...
•   ... including a feedback loop to new research
Challenges

•   Build useful services and tools for data publishers ...
•   ... that maintain provenance information ...
•   ... and cater for the entire research cycle ...
•   ... including a feedback loop to new research
Large Knowledge Collider
•   Data analysis pipeline
•   Custom workflows
•   Highly scalable


•   Query driven
•   Exposed as SPARQL endpoint
Historical Census Data
•   Gathered from 1795 - 1971
•   Demographics, houses, occupations
Historical Census Data
•   Gathered from 1795 - 1971
•   Demographics, houses, occupations
Historical Census Data
•   Gathered from 1795 - 1971
•   Demographics, houses, occupations
Historical Census Data
•   Gathered from 1795 - 1971
•   Demographics, houses, occupations


•   507 Excel files
•   2288 tables
•   33283 annotations
Annotations
•   Created at data entry time
•   Created as we speak


•   Corrections to original census tables
•   Corrections to excel version of census table
•   Any additonal remarks...
Harmonization
                                           ?
•   Enable historical research
    across census years




•   Query across multiple heterogeneous datasets
•   Accommodate multiple interpretations
Harmonization
•   Overcome structural heterogeneity


•   Overcome semantic heterogeneity
    •   Different categories (age groups, locations)
    •   Different values (names of religions, municipalities)
Current Situation
•   Iterative refinement of MySQL database tables
•   Harmonization against existing codifications


•   Expensive manual process
•   Loss of information between harmonization steps
•   Loss of detail in mapping to existing codification
•   Not repeatable
Requirements
•   (Semi-)automatic conversion and harmonization
•   Repeatable
•   Conservation of information (only add)
•   Provenance (who did what)
•   Flexible model
•   Linking to other datasets
•   Publish as open data
Research Cycle
                                                                                   Linked Data
                                                                                                    Cloud$        Analysis and
                                                                                      Cloud                         Metrics

                   acquiring$data$from$text?$                                                                                      Ana
                                                                                                                                    Me
           Semi8
     Semi-Automatic                                                                                              Querying and
        Automa;c$
       Annotation                                                                                                  Ranking
        Annota;on$       e.g.$GATE$
                                                                         Amalgame$                        SILK$
                        OpenCalais$
                                                                                                                                   Que
                                                Graph$Rewri;ng$        Graph$Rewri;ng$
                                                                                                                                  and$R
                                                                                   Link to Other
                             RDF Conversion         Internal Linking                                              Visualization
                                                                                       Data
                                   RDF$              RDF$                     Internal$              Link$to$
                                Conversion$        Cleaning$                   Linking$            Other$Data$
xml2rdf$
  d2rq$                                                                                                                           Visua
rdb2rdf$
     Semi-Automatic                                                                Provenance
   $ Conversion                                                                    Enrichment
                                                                                                              User Interfaces

                                                                                                   Provenance$
                                                                                                   Enrichment$
                                                                                                                                     U
                                                                                                                                  Inte
                                                                                 RDF Feedback
          Semi8
        Automa;c$
                                                        Provenance Tracking
        Conversion$

       “tablinker”$
TabLinker
http://github.com/Data2Semantics/TabLinker
TabLinker
http://github.com/Data2Semantics/TabLinker
TabLinker
http://github.com/Data2Semantics/TabLinker
12



                                                                       1878




          TabLinker
                                                                   M


                                                                           O
I
                                                 leeftijd   ?
        http://github.com/Data2Semantics/TabLinker
            nummer der beroepsklasse                                                             ?
                                                                                      geboortejaar

                                                                       ?
                                                                geslacht
                                                                                 ?
                                                                   huwelijkse staat


    E         pannenbakkers
                                                 beroep

                                                        positie
                                             D                             1



                  letter der beroepsklasse
TabLinker
•   Verbatim graph representation of spreadsheet

•   Separate layer for semantics of spreadsheet

•   Separate graphs for any annotations, interpretations and
    harmonizations of the underlying data

•   Round-tripping from Excel to RDF and back
Sheet1:E15   Sheet1:C14   Sheet1:B8                Sheet1:L15                Sheet1:L3   Sheet1:L4   Sheet1:L5




                                      Sheet1:F15                Sheet1:D15   Sheet1:L6
d2s:HierarchicalRowHeader                                   d2s:DataCell                          d2s:Header



                             rdf:type                                rdf:type                                rdf:type
                  rdf:type                                                                                              rdf:type
      rdf:type                                                                                                                     rdf:type


Sheet1:E15       Sheet1:C14             Sheet1:B8                 Sheet1:L15                            Sheet1:L3             Sheet1:L4       Sheet1:L5




                                                     Sheet1:F15                  Sheet1:D15             Sheet1:L6


                                                       rdf:type                        rdf:type   rdf:type



                                                    d2s:RowHeader                        d2s:Metadata
d2s:HierarchicalRowHeader
          d2s:HierarchicalRowHeader                                                       d2s:DataCell                                   d2s:Header



                                    rdf:type
                                    rdf:type                                                 rdf:type                                         rdf:type
                         rdf:type
                         rdf:type                                                                                                                        rdf:type
           rdf:type
           rdf:type                                                                                                                                                   rdf:type


  Sheet1:E15
  Sheet1:E15           Sheet1:C14
                       Sheet1:C14              Sheet1:B8
                                               Sheet1:B8                                  Sheet1:L15                                     Sheet1:L3             Sheet1:L4         Sheet1:L5


                                               d2s:isDimension


                                                       :I
                          d2s:isDimension
                                                                                         d2s:isObservation                             d2s:isDimension


                                                                                                                                                              d2s:isDimension




d2s:isDimension                                       :I/E                                     _:x                                 :14--15_1875--1874                  d2s:isDimension




                                                                                                                                                :M



                                                                                                                                                :O

                      Sheet1:I/E/Fabricage_van_dakpannen__pannenbakkers

                                                                                 :D                                :5                           :10



                                                                           d2s:isDimension                   d2s:isDimension           d2s:isDimension




                                                                           Sheet1:F15                        Sheet1:D15                  Sheet1:L6


                                                                              rdf:type                                  rdf:type   rdf:type



                                                                          d2s:RowHeader                                   d2s:Metadata
d2s:HierarchicalRowHeader
          d2s:HierarchicalRowHeader                                                                                  d2s:DataCell                                                 d2s:Header



                                    rdf:type
                                    rdf:type                                                                            rdf:type                                                     rdf:type
                         rdf:type
                         rdf:type                                                                                                                                                                  rdf:type
           rdf:type
           rdf:type                                                                                                                                                                                             rdf:type


  Sheet1:E15
  Sheet1:E15           Sheet1:C14
                       Sheet1:C14              Sheet1:B8
                                               Sheet1:B8                                                             Sheet1:L15                                                    Sheet1:L3             Sheet1:L4         Sheet1:L5


                                               d2s:isDimension


                                                       :I
                          d2s:isDimension                                                                                                     "1"^^xsd:int
                                                                                                                    d2s:isObservation                                            d2s:isDimension


                                                  skos:broader           :Nummer_der_beroepsklasse                                                                                                      d2s:isDimension
                                                                                                                                d2s:populationSize



d2s:isDimension                                       :I/E       :Letter__Onderdeel_beroepsklasse_                        _:x                   d2s:dimension             :14--15_1875--1874                     d2s:isDimension



                                                                                                                                                        d2s:dimension
                                                  skos:broader
                                                                                                                                                                                       :M

                         :BENAMING_van_de_onderdeelen_der_onderscheidene_beroepsklassen__met_de_daartoe_behoorende_beroepen                                      d2s:dimension
                                                                                                                                        :Regelnummer
                                                                                                                                                                                       :O
                                                                                       :Positie_in_het_beroep__aangeduid_met_A__B__C_of_D                        d2s:dimension
                      Sheet1:I/E/Fabricage_van_dakpannen__pannenbakkers

                                                                                                            :D                                           :5                            :10



                                                                                                      d2s:isDimension                            d2s:isDimension                 d2s:isDimension




                                                                                                      Sheet1:F15                                     Sheet1:D15                   Sheet1:L6


                                                                                                         rdf:type                                             rdf:type    rdf:type



                                                                                                     d2s:RowHeader                                              d2s:Metadata
Harmonization within a year                                       I



                                                                                skos:broader
                                                           skos:broader
                                   skos:broader


                          D                                        E                                    A



                                                  skos:broader         skos:broader                         skos:broader
           skos:broader



                                                                                                                        Fabricage van
                                Fabricage van steen                                                                    aardewerk (incl.
Fabricage van                                                                Fabricage van dakpannen
                             (molensteen, steenbakkers,                                                              porcelein, terracotta,
     kalk                                                                        (pannenbakkers)
                                   tegelbakkers)                                                                       kachelbakkers,
                                                                                                                     pottenbakkers, enz.)

                                                                  Sheet1:I



                                                                 skos:broader            skos:broader
                                 skos:broader


                  Sheet1:D                                       Sheet1:E                                    Sheet1:A



                                                     skos:broader         skos:broader                              skos:broader
          skos:broader


                                                                                                                       Sheet1:Fabricage van
                               Sheet1:Fabricage van steen                        Sheet1:Fabricage van                     aardewerk (incl.
 Sheet1:Fabricage
                               (molensteen, steenbakkers,                             dakpannen                         porcelein, terracotta,
     van kalk
                                     tegelbakkers)                                 (pannenbakkers)                        kachelbakkers,
                                                                                                                        pottenbakkers, enz.)
Harmonization across years                              I



                                                                              skos:broader
                                                           skos:broader
                                   skos:broader


                            D                                    E                                        A




1889         skos:broader
                                                  skos:broader       skos:broader                                skos:broader




                                                                                                                             Fabricage van
                               Fabricage van steen                                                                          aardewerk (incl.
 Fabricage van                                                             Fabricage van dakpannen
                            (molensteen, steenbakkers,                                                                    porcelein, terracotta,
      kalk                                                                     (pannenbakkers)
                                  tegelbakkers)                                                                             kachelbakkers,
                                                                                                                          pottenbakkers, enz.)




                                          skos:narrowMatch                                   I                skos:closeMatch


   skos:exactMatch
                                                                                                                                             skos:narrowMatch
                                                                                                         skos:broader
                                                                                     skos:broader
                                                            skos:broader


                                                    D                                        E                                     A



                                                                            skos:broader         skos:broader                           skos:broader             1899
                                  skos:broader



                                                                                                                                                      Fabricage van
                                                        Fabricage van steen                                                                          aardewerk (incl.
                       Fabricage van                                                                 Fabricage van dakpannen
                                                          (steenbakkers,                                                                                porcelein,
                            kalk                                                                         (pannenbakkers)
                                                           tegelbakkers)                                                                             kachelbakkers,
                                                                                                                                                   pottenbakkers, enz.)
Harmonization external linking
                                                                     I



                                                                                 skos:broader
                                                               skos:broader
                                       skos:broader


                                 D                                   E                                  A



                                                      skos:broader       skos:broader                        skos:broader
             skos:broader



                                                                                                                           Fabricage van
                                    Fabricage van steen                                                                   aardewerk (incl.
Fabricage van                                                                 Fabricage van dakpannen
                                 (molensteen, steenbakkers,                                                             porcelein, terracotta,
     kalk                                                                         (pannenbakkers)
                                       tegelbakkers)                                                                      kachelbakkers,
                                                                                                                        pottenbakkers, enz.)

skos:exactMatch                       skos:broadMatch                              skos:broadMatch                          skos:closeMatch
                  skos:exactMatch                                                                     skos:exactMatch
                                                              skos:exactMatch


 HISCO:23811                           HISCO:25281                                      HISCO:25281                          HISCO:26345



                   HISCO:23810                                HISCO:25281                               HISCO:26340




                                      HISCO: Historical International Standard Classification of Occupations
Curation & Annotation
<http://example.com/workbook1/sheet1>      <http://example.com/workbook1/sheet1/corrected>                                                              provo:Activity
                                                                                                                                          rdf:type
                                                                                                                 :curation20120126
             "1"^^xsd:int                              "11"^^xsd:int
                                                                                             provo:wasGeneratedBy                     provo:hadAgent

                                                                                                                        provo:startedAt
           d2s:populationSize d2s:populationSize                                                            provo:endedAt
                                                               "1889"^^xsd:int                                                                          :RinkeHoekstra
                                   d2s:censusYear
                  _:x
                                   d2s:birthYears
                                                                       :1875--1874                         _:b                      _:a
                                        d2s:gemeente
             d2s:dimension      d2s:ageGroup
                                                                                                    time:inXSDDateTime           time:inXSDDateTime
                                                                            :Assendelft

         :14--15_1875--1874                              :14-15
                                                                                                  "20120126T09:00:00"                 "20120126T08:30:00"
Open Issues
•   Create the necessary mappings between graphs
    ... this is historical research

•   Mappings are interpretations
•   Query within a specified interpretation space


•   How to reliably perform statistical analysis across
    mappings?
•   How to study concept drift across years?
Short Title Catalogue
•   All books published in NL until 1800
•   Digitized over a period of 30 years


•   139817 publications (KB says >190000)
•   9962 publishers
•   23627 authors
•   96024 links to scanned title pages
Redactiebladen
•   Redactiebladen
•   PPN identifiers
•   KMC codes
Requirements
•   (Semi-)automatic conversion and harmonization
•   Repeatable
•   Conservation of information (only add)
•   Provenance (who did what)
•   Flexible model
•   Linking to other datasets
•   Publish as open data
Research Cycle
                                                                                   Linked Data
                                                                                                    Cloud$        Analysis and
                                                                                      Cloud                         Metrics

                   acquiring$data$from$text?$                                                                                      Ana
                                                                                                                                    Me
           Semi8
     Semi-Automatic                                                                                              Querying and
        Automa;c$
       Annotation                                                                                                  Ranking
        Annota;on$       e.g.$GATE$
                                                                         Amalgame$                        SILK$
                        OpenCalais$
                                                                                                                                   Que
                                                Graph$Rewri;ng$        Graph$Rewri;ng$
                                                                                                                                  and$R
                                                                                   Link to Other
                             RDF Conversion         Internal Linking                                              Visualization
                                                                                       Data
                                   RDF$              RDF$                     Internal$              Link$to$
                                Conversion$        Cleaning$                   Linking$            Other$Data$
xml2rdf$
  d2rq$                                                                                                                           Visua
rdb2rdf$
     Semi-Automatic                                                                Provenance
   $ Conversion                                                                    Enrichment
                                                                                                              User Interfaces

                                                                                                   Provenance$
                                                                                                   Enrichment$
                                                                                                                                     U
                                                                                                                                  Inte
                                                                                 RDF Feedback
          Semi8
        Automa;c$
                                                        Provenance Tracking
        Conversion$

       “tablinker”$
Procedure
•   Convert to MySQL database
    Paul Huygen
•   Specify mapping to RDF
    D2RQ mapping language
•   Interlink with other datasources
    Bibliografish portaal, Rijksmuseum, Iconclass, Ecartico
•   Publish as browsable and queryable dataset
    http://stcn.data2semantics.org
Procedure
•   Convert to MySQL database ✓
    Paul Huygen
•   Specify mapping to RDF ✓
    D2RQ mapping language
•   Interlink with other datasources
    Bibliografish portaal, Rijksmuseum, Iconclass, Ecartico
•   Publish as browsable and queryable dataset ✓
    http://stcn.data2semantics.org
http://stcn.data2semantics.org/resource/publicatie/337778825
Fingerprints
    Wilhelmus Nakatenus S.J. (1617-1682)


                                     rdfs:label


                                                       STCN:auteur/070082960



                                           stcn:publicatie               stcn:publicatie



                           STCN:publicatie/                                                STCN:publicatie/
                                                           stcn:titeluitgave
                             336280211                                                       314125434



stcn:illustratie       stcn:vingerafdruk                                                                  stcn:vingerafdruk
                                                            skos:exactMatch

                                                                                                                                         stcn:illustratie
                                                    rdfs:label            rdfs:label
                   STCN:vingerafdruk/27                                                                         STCN:vingerafdruk/1207



                                              Hemels palm-hof, ofte Groot getyde-boek
                                       rdfs:label
                                                                                             rdfs:label




                                                  000012 - *b1 A4 ella : b2 2C7 ns$in
Co-authors betweenness centrality (Gephi)
Summary
•   We use a highly flexible modeling framework that ...
•   ... allows for rapid data publication and integration ...
•   ... that is extensible and distributed (DB = Web)...
•   ... allows for co-existing diverging interpretations ...
•   ... adheres to the law of conservation of information ..
•   ... offers existing methods for capturing provenance ...
•   ... allows for a closed loop research cycle.

Mais conteúdo relacionado

Destaque (20)

Nada é impossivel
Nada é impossivelNada é impossivel
Nada é impossivel
 
PixXx
PixXxPixXx
PixXx
 
Monetizing portfolio
Monetizing portfolioMonetizing portfolio
Monetizing portfolio
 
Fastsocket Linxiaofeng
Fastsocket LinxiaofengFastsocket Linxiaofeng
Fastsocket Linxiaofeng
 
Spelling beeeeeeeeee
Spelling beeeeeeeeeeSpelling beeeeeeeeee
Spelling beeeeeeeeee
 
Hadith Qudsi
Hadith QudsiHadith Qudsi
Hadith Qudsi
 
2011新年好
2011新年好2011新年好
2011新年好
 
Slideshare test
Slideshare testSlideshare test
Slideshare test
 
Learning Health Sciences Slides for MIDAS October 2015
Learning Health Sciences Slides for MIDAS October 2015Learning Health Sciences Slides for MIDAS October 2015
Learning Health Sciences Slides for MIDAS October 2015
 
Unit 10-tourism
Unit 10-tourismUnit 10-tourism
Unit 10-tourism
 
Meteorology
MeteorologyMeteorology
Meteorology
 
Laplace1
Laplace1Laplace1
Laplace1
 
Visão apresentação
Visão   apresentaçãoVisão   apresentação
Visão apresentação
 
The great wall of china is a series of fortifications made of stone
The great wall of china is a series of fortifications made of stoneThe great wall of china is a series of fortifications made of stone
The great wall of china is a series of fortifications made of stone
 
Unit 10. Materials. PPT
Unit 10. Materials. PPTUnit 10. Materials. PPT
Unit 10. Materials. PPT
 
Taborine 2014
Taborine 2014Taborine 2014
Taborine 2014
 
100 cau chia dong tu
100 cau chia dong tu100 cau chia dong tu
100 cau chia dong tu
 
SINO-CITYLINK LOGISTICS_SEP 2015
SINO-CITYLINK LOGISTICS_SEP 2015SINO-CITYLINK LOGISTICS_SEP 2015
SINO-CITYLINK LOGISTICS_SEP 2015
 
Orbitals filling in the periodic table
Orbitals filling in the periodic tableOrbitals filling in the periodic table
Orbitals filling in the periodic table
 
MPhil CS Curriculum 10 11 2014
MPhil CS Curriculum 10 11 2014MPhil CS Curriculum 10 11 2014
MPhil CS Curriculum 10 11 2014
 

Semelhante a Semantic Representations for Research

Ontology Alignment using Linked Data
Ontology Alignment using Linked DataOntology Alignment using Linked Data
Ontology Alignment using Linked DataTim Hodson
 
Krextor – An Extensible Framework for Contributing Content Math to the Web of...
Krextor – An Extensible Framework for Contributing Content Math to the Web of...Krextor – An Extensible Framework for Contributing Content Math to the Web of...
Krextor – An Extensible Framework for Contributing Content Math to the Web of...Christoph Lange
 
Euroscipy SemNews 2011
Euroscipy SemNews 2011Euroscipy SemNews 2011
Euroscipy SemNews 2011Logilab
 
20111110 LOD のご紹介
20111110 LOD のご紹介20111110 LOD のご紹介
20111110 LOD のご紹介Fumihiro Kato
 
Collection Ranking and Selection for Federated Entity Search
Collection Ranking and Selection for Federated Entity SearchCollection Ranking and Selection for Federated Entity Search
Collection Ranking and Selection for Federated Entity Searchkrisztianbalog
 
gStore: A Graph-based SPARQL Query Engine
gStore: A Graph-based SPARQL Query EnginegStore: A Graph-based SPARQL Query Engine
gStore: A Graph-based SPARQL Query EngineM. Tamer Özsu
 

Semelhante a Semantic Representations for Research (8)

Ontology Alignment using Linked Data
Ontology Alignment using Linked DataOntology Alignment using Linked Data
Ontology Alignment using Linked Data
 
Krextor – An Extensible Framework for Contributing Content Math to the Web of...
Krextor – An Extensible Framework for Contributing Content Math to the Web of...Krextor – An Extensible Framework for Contributing Content Math to the Web of...
Krextor – An Extensible Framework for Contributing Content Math to the Web of...
 
Euroscipy SemNews 2011
Euroscipy SemNews 2011Euroscipy SemNews 2011
Euroscipy SemNews 2011
 
ReDD-Observatory
ReDD-ObservatoryReDD-Observatory
ReDD-Observatory
 
Semantic Pingback (EKAW)
Semantic Pingback (EKAW)Semantic Pingback (EKAW)
Semantic Pingback (EKAW)
 
20111110 LOD のご紹介
20111110 LOD のご紹介20111110 LOD のご紹介
20111110 LOD のご紹介
 
Collection Ranking and Selection for Federated Entity Search
Collection Ranking and Selection for Federated Entity SearchCollection Ranking and Selection for Federated Entity Search
Collection Ranking and Selection for Federated Entity Search
 
gStore: A Graph-based SPARQL Query Engine
gStore: A Graph-based SPARQL Query EnginegStore: A Graph-based SPARQL Query Engine
gStore: A Graph-based SPARQL Query Engine
 

Mais de Rinke Hoekstra

Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the WebRinke Hoekstra
 
Managing Metadata for Science and Technology Studies: the RISIS case
Managing Metadata for Science and Technology Studies: the RISIS caseManaging Metadata for Science and Technology Studies: the RISIS case
Managing Metadata for Science and Technology Studies: the RISIS caseRinke Hoekstra
 
An Ecosystem for Linked Humanities Data
An Ecosystem for Linked Humanities DataAn Ecosystem for Linked Humanities Data
An Ecosystem for Linked Humanities DataRinke Hoekstra
 
QBer - Connect your data to the cloud
QBer - Connect your data to the cloudQBer - Connect your data to the cloud
QBer - Connect your data to the cloudRinke Hoekstra
 
Jurix 2014 welcome presentation
Jurix 2014 welcome presentationJurix 2014 welcome presentation
Jurix 2014 welcome presentationRinke Hoekstra
 
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)Rinke Hoekstra
 
Prov-O-Viz: Interactive Provenance Visualization
Prov-O-Viz: Interactive Provenance VisualizationProv-O-Viz: Interactive Provenance Visualization
Prov-O-Viz: Interactive Provenance VisualizationRinke Hoekstra
 
Linkitup: Link Discovery for Research Data
Linkitup: Link Discovery for Research DataLinkitup: Link Discovery for Research Data
Linkitup: Link Discovery for Research DataRinke Hoekstra
 
A Network Analysis of Dutch Regulations - Using the Metalex Document Server
A Network Analysis of Dutch Regulations - Using the Metalex Document ServerA Network Analysis of Dutch Regulations - Using the Metalex Document Server
A Network Analysis of Dutch Regulations - Using the Metalex Document ServerRinke Hoekstra
 
Linked (Open) Data - But what does it buy me?
Linked (Open) Data - But what does it buy me?Linked (Open) Data - But what does it buy me?
Linked (Open) Data - But what does it buy me?Rinke Hoekstra
 
Linked Science - Building a Web of Research Data
Linked Science - Building a Web of Research DataLinked Science - Building a Web of Research Data
Linked Science - Building a Web of Research DataRinke Hoekstra
 
The Knowledge Reengineering Bottleneck
The Knowledge Reengineering BottleneckThe Knowledge Reengineering Bottleneck
The Knowledge Reengineering BottleneckRinke Hoekstra
 
Concept- en Definitie Extractie
Concept- en Definitie ExtractieConcept- en Definitie Extractie
Concept- en Definitie ExtractieRinke Hoekstra
 
The MetaLex Document Server - Legal Documents as Versioned Linked Data
The MetaLex Document Server - Legal Documents as Versioned Linked DataThe MetaLex Document Server - Legal Documents as Versioned Linked Data
The MetaLex Document Server - Legal Documents as Versioned Linked DataRinke Hoekstra
 
Querying the Web of Data
Querying the Web of DataQuerying the Web of Data
Querying the Web of DataRinke Hoekstra
 
History of Knowledge Representation (SIKS Course 2010)
History of Knowledge Representation (SIKS Course 2010)History of Knowledge Representation (SIKS Course 2010)
History of Knowledge Representation (SIKS Course 2010)Rinke Hoekstra
 
Making Sense of Design Patterns
Making Sense of Design PatternsMaking Sense of Design Patterns
Making Sense of Design PatternsRinke Hoekstra
 
Publicatie van Linked Open Overheids Data
Publicatie van Linked Open Overheids DataPublicatie van Linked Open Overheids Data
Publicatie van Linked Open Overheids DataRinke Hoekstra
 

Mais de Rinke Hoekstra (20)

Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the Web
 
Managing Metadata for Science and Technology Studies: the RISIS case
Managing Metadata for Science and Technology Studies: the RISIS caseManaging Metadata for Science and Technology Studies: the RISIS case
Managing Metadata for Science and Technology Studies: the RISIS case
 
An Ecosystem for Linked Humanities Data
An Ecosystem for Linked Humanities DataAn Ecosystem for Linked Humanities Data
An Ecosystem for Linked Humanities Data
 
QBer - Connect your data to the cloud
QBer - Connect your data to the cloudQBer - Connect your data to the cloud
QBer - Connect your data to the cloud
 
Jurix 2014 welcome presentation
Jurix 2014 welcome presentationJurix 2014 welcome presentation
Jurix 2014 welcome presentation
 
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
 
Prov-O-Viz: Interactive Provenance Visualization
Prov-O-Viz: Interactive Provenance VisualizationProv-O-Viz: Interactive Provenance Visualization
Prov-O-Viz: Interactive Provenance Visualization
 
Linkitup: Link Discovery for Research Data
Linkitup: Link Discovery for Research DataLinkitup: Link Discovery for Research Data
Linkitup: Link Discovery for Research Data
 
A Network Analysis of Dutch Regulations - Using the Metalex Document Server
A Network Analysis of Dutch Regulations - Using the Metalex Document ServerA Network Analysis of Dutch Regulations - Using the Metalex Document Server
A Network Analysis of Dutch Regulations - Using the Metalex Document Server
 
Linked (Open) Data - But what does it buy me?
Linked (Open) Data - But what does it buy me?Linked (Open) Data - But what does it buy me?
Linked (Open) Data - But what does it buy me?
 
Linked Science - Building a Web of Research Data
Linked Science - Building a Web of Research DataLinked Science - Building a Web of Research Data
Linked Science - Building a Web of Research Data
 
COMMIT/VIVO
COMMIT/VIVOCOMMIT/VIVO
COMMIT/VIVO
 
The Knowledge Reengineering Bottleneck
The Knowledge Reengineering BottleneckThe Knowledge Reengineering Bottleneck
The Knowledge Reengineering Bottleneck
 
Linked Census Data
Linked Census DataLinked Census Data
Linked Census Data
 
Concept- en Definitie Extractie
Concept- en Definitie ExtractieConcept- en Definitie Extractie
Concept- en Definitie Extractie
 
The MetaLex Document Server - Legal Documents as Versioned Linked Data
The MetaLex Document Server - Legal Documents as Versioned Linked DataThe MetaLex Document Server - Legal Documents as Versioned Linked Data
The MetaLex Document Server - Legal Documents as Versioned Linked Data
 
Querying the Web of Data
Querying the Web of DataQuerying the Web of Data
Querying the Web of Data
 
History of Knowledge Representation (SIKS Course 2010)
History of Knowledge Representation (SIKS Course 2010)History of Knowledge Representation (SIKS Course 2010)
History of Knowledge Representation (SIKS Course 2010)
 
Making Sense of Design Patterns
Making Sense of Design PatternsMaking Sense of Design Patterns
Making Sense of Design Patterns
 
Publicatie van Linked Open Overheids Data
Publicatie van Linked Open Overheids DataPublicatie van Linked Open Overheids Data
Publicatie van Linked Open Overheids Data
 

Último

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 

Último (20)

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 

Semantic Representations for Research

  • 1. Semantic Representations for Research Rinke Hoekstra and Stefan Schlobach VU University Amsterdam/University of Amsterdam http://www.data2semantics.org
  • 2. About us... • Knowledge Representation and Reasoning Group Frank van Harmelen • Modeling of complex domains • Querying and reasoning over these models • ... at a very large scale (the Web)
  • 3. About us... • Knowledge Representation and Reasoning Group Frank van Harmelen • Experience a.o. CATCH, STICH, LarKC, CEDAR and Data2Semantics • Premier group for provenance and linked data at scale
  • 4. Overview • Research Lifecycle Data2Semantics and LarKC • Historical Census Data CEDAR and Data2Semantics • Short Title Catalogue of The Netherlands (STCN) Inger Leemans, Fernie Maas, Paul Huygen, Albert Meroño-Peñuela
  • 5. How to share, publish, access, analyse, interpret and reuse data? Increase the ease of sharing scientific data ... ... of accessing, analysing and interpreting data ... ... and thereby increasing the reuse of data
  • 6.
  • 7. EASY Data Repository Enrich datasets: census data
  • 8. EASY Data Repository Enrich datasets: census data Large volumes of publications Improve services to clients Automated services
  • 9. EASY Data Repository Enrich datasets: census data Large volumes of publications Improve services to clients Automated services Build systems for hospitals
  • 10. EASY Data Repository Enrich datasets: census data Large volumes of publications Improve services to clients Automated services Build systems for hospitals
  • 11. Linked Data • “Semantic Hyperlinks” between data items • Every data item has a global identifier ... • ... that looks like a web address (URI) ... • ... is linked and described using shared vocabularies • Resource Description Framework (RDF) • SPARQL query language & endpoint
  • 12. Linked Data Linked LOV User Slideshare tags2con Audio Feedback 2RDF delicious Moseley Scrobbler Bricklink Sussex Folk (DBTune) Reading St. GTAA Magna- Lists Andrews Klapp- tune stuhl- Resource NTU DB club Lists Resource Tropes Lotico Semantic yovisto John Music Man- Lists Music Tweet chester Hellenic Peel Brainz NDL (DBTune) (Data Brainz Reading subjects FBD (zitgist) Lists Open EUTC Incubator) Linked Hellenic Library Open t4gm Produc- Crunch- PD Surge RDF info tions Discogs base Library Radio Ontos Source Code Crime ohloh Plymouth (Talis) (Data News LEM Ecosystem Reading RAMEAU Reports business Incubator) Crime data.gov. Portal Linked Data Lists SH UK Music Jamendo (En- uk Brainz (DBtune) LinkedL Ox AKTing) FanHubz gnoss ntnusc (DBTune) SSW CCN • Points Thesau- Last.FM Poké- Thesaur Popula- artists Didactal us rus W “Semantic Hyperlinks” between data items pédia LIBRIS tion (En- (DBTune) Last.FM ia theses. LCSH Rådata reegle research patents MARC AKTing) (rdfize) my fr nå! data.gov. data.go Codes Ren. NHS uk v.uk Good- Experi- Classical List Energy (En- win flickr ment (DB Pokedex Norwe- Genera- AKTing) Mortality BBC Family wrappr Sudoc PSH Tune) gian (En- tors Program MeSH AKTing) semantic mes BBC IdRef GND CO2 educatio OpenEI web.org SW Energy Sudoc ndlna Emission n.data.g Music Dog VIAF EEA (En- Chronic- Linked (En- ov.uk Portu- Food UB AKTing) ling Event MDB AKTing) guese Mann- Europeana BBC America Media DBpedia Calames heim Ord- Recht- Wildlife Deutsche Open Revyu DDC Openly spraak. Finder Bio- lobid Election nance legislation Local nl RDF graphie Resources NSZL • Data Survey Tele- data Ulm Swedish EU New Book Project data.gov.uk graphis bnf.fr Catalog Open Insti- York Mashup Every data item has a global identifier ... tutions URI Greek Open P20 Cultural UK Post- Times Burner DBpedia Calais Heritage codes statistics ECS Wiki lobid GovWILD data.gov. Taxon iServe South- Organi- LOIUS BNB Brazilian uk Concept ECS ampton sations Geo World OS BibBase STW GESIS Poli- ESD South- ECS Names Fact- (RKB ticians stan- reference ampton book • data.gov.uk Freebase Explorer) Budapest dards data.gov. NASA EPrints uk intervals Project OAI Lichfield (Data ... that looks like a web address (URI) ... transport DBpedia data Pisa Spen- Incu- Guten- dcs data.gov. RESEX Scholaro- ISTAT ding bator) Fishes berg DBLP DBLP uk Geo meter Immi- Scotland of Texas (FU (L3S) Pupils & Uberblic DBLP gration Species Berlin) IRIT Exams Euro- dbpedia data- (RKB London TCM ACM stat lite open- Explorer) NVD Gazette (FUB) Gene IBM Traffic Geo ac-uk • Scotland TWC LOGD Eurostat Daily DIT Linked UN/ Data UMBEL Med ERA Data LOCODE ... is linked and described using shared vocabularies DEPLOY Gov.ie CORDIS YAGO New- lingvoj Disea- (RKB some SIDER RAE2001 castle LOCAH CORDIS Explorer) Linked Eurécom Eurostat Drug CiteSeer Roma (FUB) Sensor Data GovTrack (Ontology (Kno.e.sis) Open Bank Pfam Course- Central) riese Enipedia Cyc Lexvo LinkedCT ware Linked PDB UniProt VIVO EURES EDGAR dotAC US SEC Indiana ePrints IEEE (Ontology totl.net (rdfabout) Central) WordNet RISKS (VUA) Taxono UniProt US Census EUNIS Twarql HGNC Semantic Cornetto (Bio2RDF) (rdfabout) my VIVO FTS XBRL PRO- ProDom STITCH Cornell LAAS SITE KISTI NSF Scotland GeoWord LODE • Geo- graphy Net WordNet WordNet JISC (W3C) Resource Description Framework (RDF) Climbing (RKB Affy- KEGG Linked VIVO UF SMC Explorer) SISVU metrix Pub Drug Piedmont Journals GeoData PubMed SGD ECCO- Finnish Gene Chem Munici- Accomo- El AGROV Ontology TCP Media dations Alpine bible palities Viajero OC Ski ontology Tourism KEGG Ocean Austria Enzyme PBAC Geographic Metoffice GEMET ChEMBL • Italian Drilling OMIM KEGG Weather Open public Codices AEMET Linked MGI Pathway Data Publications SPARQL query language & endpoint schools Forecasts Open InterPro GeneID KEGG EARTh Thesau- Turismo rus Colors Reaction de Zaragoza Product Smart KEGG User-generated content Weather DB Link Medi Glycan Janus Stations Product Care KEGG AMP UniParc UniRef UniSTS Government Types Italian Homolo Com- Yahoo! Airports Museums pound Ontology Google Gene Geo Art Planet National wrapper Chem2 Cross-domain Radio- Bio2RDF activity UniPath JP Sears Open Linked OGOLOD way Life sciences Corpo- Amster- Reactome dam medu- Open rates Numbers Museum cator As of September 2011
  • 13. Research Lifecycle Linked Data Cloud$ Analysis and Cloud Metrics acquiring$data$from$text?$ Ana Me Semi8 Semi-Automatic Querying and Automa;c$ Annotation Ranking Annota;on$ e.g.$GATE$ Amalgame$ SILK$ OpenCalais$ Que Graph$Rewri;ng$ Graph$Rewri;ng$ and$R Link to Other RDF Conversion Internal Linking Visualization Data RDF$ RDF$ Internal$ Link$to$ Conversion$ Cleaning$ Linking$ Other$Data$ xml2rdf$ d2rq$ Visua rdb2rdf$ Semi-Automatic Provenance $ Conversion Enrichment User Interfaces Provenance$ Enrichment$ U Inte RDF Feedback Semi8 Automa;c$ Provenance Tracking Conversion$ “tablinker”$
  • 14. Challenges • Build useful services and tools for data publishers ... • ... that maintain provenance information ... • ... and cater for the entire research cycle ... • ... including a feedback loop to new research
  • 15. Challenges • Build useful services and tools for data publishers ... • ... that maintain provenance information ... • ... and cater for the entire research cycle ... • ... including a feedback loop to new research
  • 16. Large Knowledge Collider • Data analysis pipeline • Custom workflows • Highly scalable • Query driven • Exposed as SPARQL endpoint
  • 17. Historical Census Data • Gathered from 1795 - 1971 • Demographics, houses, occupations
  • 18. Historical Census Data • Gathered from 1795 - 1971 • Demographics, houses, occupations
  • 19. Historical Census Data • Gathered from 1795 - 1971 • Demographics, houses, occupations
  • 20. Historical Census Data • Gathered from 1795 - 1971 • Demographics, houses, occupations • 507 Excel files • 2288 tables • 33283 annotations
  • 21. Annotations • Created at data entry time • Created as we speak • Corrections to original census tables • Corrections to excel version of census table • Any additonal remarks...
  • 22. Harmonization ? • Enable historical research across census years • Query across multiple heterogeneous datasets • Accommodate multiple interpretations
  • 23. Harmonization • Overcome structural heterogeneity • Overcome semantic heterogeneity • Different categories (age groups, locations) • Different values (names of religions, municipalities)
  • 24. Current Situation • Iterative refinement of MySQL database tables • Harmonization against existing codifications • Expensive manual process • Loss of information between harmonization steps • Loss of detail in mapping to existing codification • Not repeatable
  • 25. Requirements • (Semi-)automatic conversion and harmonization • Repeatable • Conservation of information (only add) • Provenance (who did what) • Flexible model • Linking to other datasets • Publish as open data
  • 26. Research Cycle Linked Data Cloud$ Analysis and Cloud Metrics acquiring$data$from$text?$ Ana Me Semi8 Semi-Automatic Querying and Automa;c$ Annotation Ranking Annota;on$ e.g.$GATE$ Amalgame$ SILK$ OpenCalais$ Que Graph$Rewri;ng$ Graph$Rewri;ng$ and$R Link to Other RDF Conversion Internal Linking Visualization Data RDF$ RDF$ Internal$ Link$to$ Conversion$ Cleaning$ Linking$ Other$Data$ xml2rdf$ d2rq$ Visua rdb2rdf$ Semi-Automatic Provenance $ Conversion Enrichment User Interfaces Provenance$ Enrichment$ U Inte RDF Feedback Semi8 Automa;c$ Provenance Tracking Conversion$ “tablinker”$
  • 30. 12 1878 TabLinker M O I leeftijd ? http://github.com/Data2Semantics/TabLinker nummer der beroepsklasse ? geboortejaar ? geslacht ? huwelijkse staat E pannenbakkers beroep positie D 1 letter der beroepsklasse
  • 31. TabLinker • Verbatim graph representation of spreadsheet • Separate layer for semantics of spreadsheet • Separate graphs for any annotations, interpretations and harmonizations of the underlying data • Round-tripping from Excel to RDF and back
  • 32. Sheet1:E15 Sheet1:C14 Sheet1:B8 Sheet1:L15 Sheet1:L3 Sheet1:L4 Sheet1:L5 Sheet1:F15 Sheet1:D15 Sheet1:L6
  • 33. d2s:HierarchicalRowHeader d2s:DataCell d2s:Header rdf:type rdf:type rdf:type rdf:type rdf:type rdf:type rdf:type Sheet1:E15 Sheet1:C14 Sheet1:B8 Sheet1:L15 Sheet1:L3 Sheet1:L4 Sheet1:L5 Sheet1:F15 Sheet1:D15 Sheet1:L6 rdf:type rdf:type rdf:type d2s:RowHeader d2s:Metadata
  • 34. d2s:HierarchicalRowHeader d2s:HierarchicalRowHeader d2s:DataCell d2s:Header rdf:type rdf:type rdf:type rdf:type rdf:type rdf:type rdf:type rdf:type rdf:type rdf:type Sheet1:E15 Sheet1:E15 Sheet1:C14 Sheet1:C14 Sheet1:B8 Sheet1:B8 Sheet1:L15 Sheet1:L3 Sheet1:L4 Sheet1:L5 d2s:isDimension :I d2s:isDimension d2s:isObservation d2s:isDimension d2s:isDimension d2s:isDimension :I/E _:x :14--15_1875--1874 d2s:isDimension :M :O Sheet1:I/E/Fabricage_van_dakpannen__pannenbakkers :D :5 :10 d2s:isDimension d2s:isDimension d2s:isDimension Sheet1:F15 Sheet1:D15 Sheet1:L6 rdf:type rdf:type rdf:type d2s:RowHeader d2s:Metadata
  • 35. d2s:HierarchicalRowHeader d2s:HierarchicalRowHeader d2s:DataCell d2s:Header rdf:type rdf:type rdf:type rdf:type rdf:type rdf:type rdf:type rdf:type rdf:type rdf:type Sheet1:E15 Sheet1:E15 Sheet1:C14 Sheet1:C14 Sheet1:B8 Sheet1:B8 Sheet1:L15 Sheet1:L3 Sheet1:L4 Sheet1:L5 d2s:isDimension :I d2s:isDimension "1"^^xsd:int d2s:isObservation d2s:isDimension skos:broader :Nummer_der_beroepsklasse d2s:isDimension d2s:populationSize d2s:isDimension :I/E :Letter__Onderdeel_beroepsklasse_ _:x d2s:dimension :14--15_1875--1874 d2s:isDimension d2s:dimension skos:broader :M :BENAMING_van_de_onderdeelen_der_onderscheidene_beroepsklassen__met_de_daartoe_behoorende_beroepen d2s:dimension :Regelnummer :O :Positie_in_het_beroep__aangeduid_met_A__B__C_of_D d2s:dimension Sheet1:I/E/Fabricage_van_dakpannen__pannenbakkers :D :5 :10 d2s:isDimension d2s:isDimension d2s:isDimension Sheet1:F15 Sheet1:D15 Sheet1:L6 rdf:type rdf:type rdf:type d2s:RowHeader d2s:Metadata
  • 36. Harmonization within a year I skos:broader skos:broader skos:broader D E A skos:broader skos:broader skos:broader skos:broader Fabricage van Fabricage van steen aardewerk (incl. Fabricage van Fabricage van dakpannen (molensteen, steenbakkers, porcelein, terracotta, kalk (pannenbakkers) tegelbakkers) kachelbakkers, pottenbakkers, enz.) Sheet1:I skos:broader skos:broader skos:broader Sheet1:D Sheet1:E Sheet1:A skos:broader skos:broader skos:broader skos:broader Sheet1:Fabricage van Sheet1:Fabricage van steen Sheet1:Fabricage van aardewerk (incl. Sheet1:Fabricage (molensteen, steenbakkers, dakpannen porcelein, terracotta, van kalk tegelbakkers) (pannenbakkers) kachelbakkers, pottenbakkers, enz.)
  • 37. Harmonization across years I skos:broader skos:broader skos:broader D E A 1889 skos:broader skos:broader skos:broader skos:broader Fabricage van Fabricage van steen aardewerk (incl. Fabricage van Fabricage van dakpannen (molensteen, steenbakkers, porcelein, terracotta, kalk (pannenbakkers) tegelbakkers) kachelbakkers, pottenbakkers, enz.) skos:narrowMatch I skos:closeMatch skos:exactMatch skos:narrowMatch skos:broader skos:broader skos:broader D E A skos:broader skos:broader skos:broader 1899 skos:broader Fabricage van Fabricage van steen aardewerk (incl. Fabricage van Fabricage van dakpannen (steenbakkers, porcelein, kalk (pannenbakkers) tegelbakkers) kachelbakkers, pottenbakkers, enz.)
  • 38. Harmonization external linking I skos:broader skos:broader skos:broader D E A skos:broader skos:broader skos:broader skos:broader Fabricage van Fabricage van steen aardewerk (incl. Fabricage van Fabricage van dakpannen (molensteen, steenbakkers, porcelein, terracotta, kalk (pannenbakkers) tegelbakkers) kachelbakkers, pottenbakkers, enz.) skos:exactMatch skos:broadMatch skos:broadMatch skos:closeMatch skos:exactMatch skos:exactMatch skos:exactMatch HISCO:23811 HISCO:25281 HISCO:25281 HISCO:26345 HISCO:23810 HISCO:25281 HISCO:26340 HISCO: Historical International Standard Classification of Occupations
  • 39. Curation & Annotation <http://example.com/workbook1/sheet1> <http://example.com/workbook1/sheet1/corrected> provo:Activity rdf:type :curation20120126 "1"^^xsd:int "11"^^xsd:int provo:wasGeneratedBy provo:hadAgent provo:startedAt d2s:populationSize d2s:populationSize provo:endedAt "1889"^^xsd:int :RinkeHoekstra d2s:censusYear _:x d2s:birthYears :1875--1874 _:b _:a d2s:gemeente d2s:dimension d2s:ageGroup time:inXSDDateTime time:inXSDDateTime :Assendelft :14--15_1875--1874 :14-15 "20120126T09:00:00" "20120126T08:30:00"
  • 40. Open Issues • Create the necessary mappings between graphs ... this is historical research • Mappings are interpretations • Query within a specified interpretation space • How to reliably perform statistical analysis across mappings? • How to study concept drift across years?
  • 41. Short Title Catalogue • All books published in NL until 1800 • Digitized over a period of 30 years • 139817 publications (KB says >190000) • 9962 publishers • 23627 authors • 96024 links to scanned title pages
  • 42. Redactiebladen • Redactiebladen • PPN identifiers • KMC codes
  • 43. Requirements • (Semi-)automatic conversion and harmonization • Repeatable • Conservation of information (only add) • Provenance (who did what) • Flexible model • Linking to other datasets • Publish as open data
  • 44. Research Cycle Linked Data Cloud$ Analysis and Cloud Metrics acquiring$data$from$text?$ Ana Me Semi8 Semi-Automatic Querying and Automa;c$ Annotation Ranking Annota;on$ e.g.$GATE$ Amalgame$ SILK$ OpenCalais$ Que Graph$Rewri;ng$ Graph$Rewri;ng$ and$R Link to Other RDF Conversion Internal Linking Visualization Data RDF$ RDF$ Internal$ Link$to$ Conversion$ Cleaning$ Linking$ Other$Data$ xml2rdf$ d2rq$ Visua rdb2rdf$ Semi-Automatic Provenance $ Conversion Enrichment User Interfaces Provenance$ Enrichment$ U Inte RDF Feedback Semi8 Automa;c$ Provenance Tracking Conversion$ “tablinker”$
  • 45. Procedure • Convert to MySQL database Paul Huygen • Specify mapping to RDF D2RQ mapping language • Interlink with other datasources Bibliografish portaal, Rijksmuseum, Iconclass, Ecartico • Publish as browsable and queryable dataset http://stcn.data2semantics.org
  • 46. Procedure • Convert to MySQL database ✓ Paul Huygen • Specify mapping to RDF ✓ D2RQ mapping language • Interlink with other datasources Bibliografish portaal, Rijksmuseum, Iconclass, Ecartico • Publish as browsable and queryable dataset ✓ http://stcn.data2semantics.org
  • 48. Fingerprints Wilhelmus Nakatenus S.J. (1617-1682) rdfs:label STCN:auteur/070082960 stcn:publicatie stcn:publicatie STCN:publicatie/ STCN:publicatie/ stcn:titeluitgave 336280211 314125434 stcn:illustratie stcn:vingerafdruk stcn:vingerafdruk skos:exactMatch stcn:illustratie rdfs:label rdfs:label STCN:vingerafdruk/27 STCN:vingerafdruk/1207 Hemels palm-hof, ofte Groot getyde-boek rdfs:label rdfs:label 000012 - *b1 A4 ella : b2 2C7 ns$in
  • 50. Summary • We use a highly flexible modeling framework that ... • ... allows for rapid data publication and integration ... • ... that is extensible and distributed (DB = Web)... • ... allows for co-existing diverging interpretations ... • ... adheres to the law of conservation of information .. • ... offers existing methods for capturing provenance ... • ... allows for a closed loop research cycle.