SlideShare uma empresa Scribd logo
1 de 52
+

Semantic Web
Technologies 2012-2013
Part I
Mariano Rodriguez-Muro,
Free University of Bozen-Bolzano
+

Disclaimer

License

This work is licensed under the
Creative Commons Attribution-Share Alike 3.0 License
http://creativecommons.org/licenses/by-sa/3.0/
+

Intro


Course organization



Intro to Semantic Web



Intro to Semantic Technologies
+
Course organization
+

About me
Mariano Rodríguez-Muro
Assistant Professor at KRDB
Faculty of computer Science (POS Building, 202)
Tel. +390471016228
rodriguez =at= inf.unibz.it

Research interests:


Techniques for query answering optimization



SPARQL, Big RDFS, virtual RDF



Data integration with Semantic Tech and SemTech in the
enterprise.
+

About you


Which program?



Which semester?



Why are you here?




Topic relates to my area



Looking for project/thesis?



Just Interesting?





Topic is mandatory

Need some credits?

Special interests?
+

Course organization (Part I)


Website:




Moodle




…

Schedule






http://rodriguez-muro.com/courses/index.php?title=SWT12

Lecture: Tuesday:10:30 am to 12:30 pm
Lecture: Thursday 8:30 am to 10:30 am
Lab: Tuesday 2:00 to 4:00 pm

Office Hours



With appointment
Please use forums as main means of comunication
+

Reference Material


Slides, Papers



Foundations of Semantic Web. Pascal HItzler, Markus
Krotzsch and Sebastian Rudolph. Chapman & Hall/CRC, 2010.
(Code FSW)



Semantic Web Programming. John Hebeler et. al. Wiley.
2009. (Code SWP)



Programming the Semantic Web. Toby Segaran, Colin Evans
and Jamie Taylor. O‟Reilly. 2009. (Code PTSW)

Available at the library. SWP and PTSW available as ebooks.
+

Grading


Part I 50%, Part II 50%



Grading Part I



Lab exercises: 15%
Mid-term: 35%



Exercises: Each week a new assignment. All assignments are
graded. All assignments are mandatory. Delivery must be
done by the next week. Java and SQL/JDBC is required.
Projects must be packaged with Maven.



Midterm. Covers all material seen during the lectures. From
slides, presentation and selected book chapters/readings
(marked at the end of each slide)
+
Introduction
Semantic Web
+

Web of Documents



Primary objects: documents



Degree of structure in data: low



Semantics of content:Implicit



Designed for: human consumption

Links between documents
+

Web of documents: The problem
+

Example: Elvis
+

Web of data: The problem


How about this query:




How many romantic comedy Hollywood movies are directed by a
person who is born in a city that has average temperature above 15
degrees!?

You need to:




Find reliable sources containing facts about movies (genre &
director), birthplaces of famous artists/directors, average
temperature of cities across the world, etc.
 The result: several lists of thousands of facts
Integrate all the data, join the facts that come from heterogeneous
sources

Even if possible, it may take days to answer just a single query!
+

The Vision
I have a dream for the Web in which computers
become capable of analyzing all the data on the
Web - the content, links, and transactions
between people and computers. A Semantic
Web, which should make this possible, has yet
to emerge, but when it does, the day-to-day
mechanisms of trade, bureaucracy and our daily
lives will be handled by machines talking to machines. The intelligent agents people have
touted for ages will finally materialize.
Barners-Lee, 1999
+

The semantic web


Primary objects: things

Links between: things



Degree of Structure: high



Explicit semantics of contents and links



Designed for both machines and humans
+

Web of data
+
Semantic Technologies
+

Not only about the web


The semantic web vision has generated technologies that are
applied outside the web context including:



Government



Research (Bio, Geo, Cultural heritage, etc.)



Software development





Enterprise intelligence

…

Semantic technologies provide flexible and powerful tools to
accomplish things that were not possible or not practical in the
past.
+

22

Introduction to the Semantic Web
approach

How does a Semantic Web approach help us
merge data sets, infer new relations, and
integrate outside data sources?
+

23

The rough structure of data
integration with SWT
Map the various data onto an abstract data representation

1.
•

Make the data independent of its internal representation…

2.

Merge the resulting representations

3.

Start making queries on the whole
•

Queries not possible on the individual data sets
+

Data set “A”: A simplified book store

Books
ID

Author

ISBN0-00-651409-X id_xyz

Authors
ID

Title
The Glass Palace

Name

id_xyz

Ghosh, Amitav

Publishers
ID

Harper Collins

id_qpr

Home page

Publisher Name

id_qpr

Publisher

http://www.amitavghosh.com

City
London

Year
2000

24
+

25

1st: Export your data as a set of
relations
+

26

Some notes on the data export
Data export does not necessarily mean physical conversion of
the data
Relations can be virtual, generated on-the-fly at query time
via SQL “bridges”
scraping HTML pages

extracting data from Excel sheets
etc.

One can export part of the data
+

27

Data set “F”: Another book store‟s
data
A

1

7

11
12
13

D

E

ID
ISBN0 2020386682

Traducteur
Titre
Original
Le Palais A13
ISBN-0-00-651409-X
des
miroirs

ID
ISBN-0-00-651409-X

Auteur
A12

2
3

6

B

Nom
Ghosh, Amitav
Besse, Christianne
2nd: Export your second set of data
+

28
3rd: start merging your data
+

29
3rd: start merging your data (cont‟d)
+

30
4th: Merge identical resources
+

31
+

32

Start making queries…


User of data set “F” can now ask queries like:


“What is the title of the original version of Le Palais des miroirs?”



This information is not in the data set “F”...



…but can be retrieved after merging with data set “A”!
5th: Query the merged data set
+

33
+

34

However, more can be achieved…


We “know” that a:author and f:auteur are really the same



But our automatic merge does not know that!



Let us add some extra information to the merged data:


a:author is equivalent to f:auteur



Both identify a Person, a category (type) for certain resources



a:name and f:nom are equivalent to foaf:name
3rd revisited: Use the extra knowledge
+

35
+

36

Start making richer queries!


User of data set “F” can now query:


“What is the home page of Le Palais des miroirs’s „auteur‟?”



The information is not in data set “F” or “A”…



…but was made available by:


Merging data sets “A” and “F”



Adding three simple “glue” statements
6th: Richer queries
+

37
+

38

Bring in other data sources


We can integrate new information into our merged data set
from other sources




e.g. additional information about author Amitav Ghosh

Perhaps the largest public source of general knowledge is
Wikipedia


Structured data can be extracted from Wikipedia using dedicated
tools

May 12, 2009
7th: Merge with Wikipedia data
+

owl:sameAs

39
7th (cont‟d): Merge with Wikipedia data
+

owl:sameAs

40
7th (cont‟d): Merge with Wikipedia data
+

owl:sameAs

41
+

42

Is that surprising?


It may look like it but, in fact, it should not be…



What happened via automatic means is done every day by
Web users!



The difference: a bit of extra rigour so that machines could do
this, too
+

43

What did we do?


We combined different data sets that



...are of different formats (RDBMS, Excel spreadsheet, (X)HTML, etc)





...may be internal or somewhere on the Web
...have different names for the same relations

We could combine the data because some URIs were identical


i.e. the ISBNs in this case



We could add some simple additional information (the “glue”) to
help further merge data sets



The result? Answer queries that could not previously be asked
+

44

What did we do? (cont‟d)
+

45

The abstraction pays off because…


…the graph representation is independent of the details of the
native structures



…a change in local database schemas, HTML structures, etc.
do not affect the whole


“schema independence”



…new data, new connections can be added seamlessly &
incrementally



… it doesn‟t matter if you are at the enterprise level or at the
web level
+

46

So where is the Semantic Web?

Semantic Web technologies make such integration possible
+
Semantic Technologies
Today: Applications, Use cases, Technologies, Systems
+

Web of data today
+

Semantics today


Linked-in



Schema.org



Good-relations



Oracle (Server)



IBM (DB2, Watson)



Apple (Siri)



SAP



Evri, Linked-in, many startups



Many deployed systems
+

Semantic Web Technologies


A set of technologies and frameworks that enable semantic
data management, data integration and the web of data


Resource Description Framework (RDF)



A variety of data interchange formats (e.g., RDF/XML, N3, Turtle, NTriples)



Semantic languages such as RDF Schema (RDFS) and the Web
Ontology Language (OWL) and Rules (SWRL)



Query language (SPARQL)



Software infrastructure (RDF/SPARQL frameworks, Triple stores,
Data integrators, Query engines, Reasoners)



Publicly available connected dataset and open data initiatives
(LOD)
+

SWT Part I


The Data Model (RDF)



The query language (SPARQL)



Software Development (Architecture, Frameworks and Tools)



A little more semantics (RDFS, inference techniques, tools and
data integration)



Interacting with the enterprise (Legacy sources, XML, DBMS,
mappings)



More complex semantics (Rules, data integration and
reasoning with rules)
+

Reading material


PTSW Chapter 1



SWP Part I, Chapter 1



FTW Section 1.4

Mais conteúdo relacionado

Mais procurados

Semantic Web: Intro
Semantic Web: IntroSemantic Web: Intro
Semantic Web: IntroFariz Darari
 
The web is rotting and what to do about it
The web is rotting and what to do about itThe web is rotting and what to do about it
The web is rotting and what to do about itHerbert Van de Sompel
 
Machine Learning & Embeddings for Large Knowledge Graphs
Machine Learning & Embeddings  for Large Knowledge GraphsMachine Learning & Embeddings  for Large Knowledge Graphs
Machine Learning & Embeddings for Large Knowledge GraphsHeiko Paulheim
 
How links can make your open data even greater
How links can make your open data even greaterHow links can make your open data even greater
How links can make your open data even greaterCristina Sarasua
 
Semantic Technologies and Information Integration
Semantic Technologies and Information IntegrationSemantic Technologies and Information Integration
Semantic Technologies and Information IntegrationAI4BD GmbH
 
20110330 bruxelles doc_freedom
20110330 bruxelles doc_freedom20110330 bruxelles doc_freedom
20110330 bruxelles doc_freedomStefan Gradmann
 
09 semantic web & ontologies
09 semantic web & ontologies09 semantic web & ontologies
09 semantic web & ontologiesMarina Santini
 
Decentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic WebDecentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic Webhala Skaf
 
Persistent Identification: Easier Said than Done
Persistent Identification: Easier Said than DonePersistent Identification: Easier Said than Done
Persistent Identification: Easier Said than DoneHerbert Van de Sompel
 
What is hot on the web right now - A W3C perspective
What is hot on the web right now - A W3C perspectiveWhat is hot on the web right now - A W3C perspective
What is hot on the web right now - A W3C perspectiveArmin Haller
 
Researcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized WebResearcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized WebHerbert Van de Sompel
 
Wimmics Overview 2021
Wimmics Overview 2021Wimmics Overview 2021
Wimmics Overview 2021Fabien Gandon
 
Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)Herbert Van de Sompel
 
Beautifying Data in the real world
Beautifying Data in the real worldBeautifying Data in the real world
Beautifying Data in the real worldTan Tran
 
Web Search and SEO - Web Technologies (1019888BNR)
Web Search and SEO - Web Technologies (1019888BNR)Web Search and SEO - Web Technologies (1019888BNR)
Web Search and SEO - Web Technologies (1019888BNR)Beat Signer
 

Mais procurados (18)

Semantic Web: Intro
Semantic Web: IntroSemantic Web: Intro
Semantic Web: Intro
 
The web is rotting and what to do about it
The web is rotting and what to do about itThe web is rotting and what to do about it
The web is rotting and what to do about it
 
Machine Learning & Embeddings for Large Knowledge Graphs
Machine Learning & Embeddings  for Large Knowledge GraphsMachine Learning & Embeddings  for Large Knowledge Graphs
Machine Learning & Embeddings for Large Knowledge Graphs
 
Semantic Web Good News
Semantic Web Good NewsSemantic Web Good News
Semantic Web Good News
 
How links can make your open data even greater
How links can make your open data even greaterHow links can make your open data even greater
How links can make your open data even greater
 
Semantic Technologies and Information Integration
Semantic Technologies and Information IntegrationSemantic Technologies and Information Integration
Semantic Technologies and Information Integration
 
Linked Data Tutorial
Linked Data TutorialLinked Data Tutorial
Linked Data Tutorial
 
20110330 bruxelles doc_freedom
20110330 bruxelles doc_freedom20110330 bruxelles doc_freedom
20110330 bruxelles doc_freedom
 
Linked Data
Linked DataLinked Data
Linked Data
 
09 semantic web & ontologies
09 semantic web & ontologies09 semantic web & ontologies
09 semantic web & ontologies
 
Decentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic WebDecentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic Web
 
Persistent Identification: Easier Said than Done
Persistent Identification: Easier Said than DonePersistent Identification: Easier Said than Done
Persistent Identification: Easier Said than Done
 
What is hot on the web right now - A W3C perspective
What is hot on the web right now - A W3C perspectiveWhat is hot on the web right now - A W3C perspective
What is hot on the web right now - A W3C perspective
 
Researcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized WebResearcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized Web
 
Wimmics Overview 2021
Wimmics Overview 2021Wimmics Overview 2021
Wimmics Overview 2021
 
Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)
 
Beautifying Data in the real world
Beautifying Data in the real worldBeautifying Data in the real world
Beautifying Data in the real world
 
Web Search and SEO - Web Technologies (1019888BNR)
Web Search and SEO - Web Technologies (1019888BNR)Web Search and SEO - Web Technologies (1019888BNR)
Web Search and SEO - Web Technologies (1019888BNR)
 

Semelhante a Semantic Web Technologies 2012-2013 Part I

Introduction to Semantic Web Technologies
Introduction to Semantic Web TechnologiesIntroduction to Semantic Web Technologies
Introduction to Semantic Web TechnologiesIvan Herman
 
Spivack Blogtalk 2008
Spivack Blogtalk 2008Spivack Blogtalk 2008
Spivack Blogtalk 2008Blogtalk 2008
 
Nova Spivack - Semantic Web Talk
Nova Spivack - Semantic Web TalkNova Spivack - Semantic Web Talk
Nova Spivack - Semantic Web Talksyawal
 
2011 05-02 linked data intro
2011 05-02 linked data intro2011 05-02 linked data intro
2011 05-02 linked data introvafopoulos
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic WebGIS Colorado
 
2011 05-01 linked data
2011 05-01 linked data2011 05-01 linked data
2011 05-01 linked datavafopoulos
 
What do we want computers to do for us?
What do we want computers to do for us? What do we want computers to do for us?
What do we want computers to do for us? Andrea Volpini
 
The Semantic Web: It's for Real
The Semantic Web: It's for RealThe Semantic Web: It's for Real
The Semantic Web: It's for RealJames Hendler
 
Open belgium 2015 - open tourism
Open belgium 2015 - open tourismOpen belgium 2015 - open tourism
Open belgium 2015 - open tourismRaf Buyle
 
RDFa From Theory to Practice
RDFa From Theory to PracticeRDFa From Theory to Practice
RDFa From Theory to PracticeAdrian Stevenson
 
The Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataThe Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataJames Hendler
 
Semantic Web Science
Semantic Web ScienceSemantic Web Science
Semantic Web ScienceJames Hendler
 
SIOC: Semantic Web for Social Media Sites
SIOC: Semantic Web for Social Media SitesSIOC: Semantic Web for Social Media Sites
SIOC: Semantic Web for Social Media SitesUldis Bojars
 
Explaining The Semantic Web
Explaining The Semantic WebExplaining The Semantic Web
Explaining The Semantic WebAditya Tuli
 

Semelhante a Semantic Web Technologies 2012-2013 Part I (20)

Introduction to Semantic Web Technologies
Introduction to Semantic Web TechnologiesIntroduction to Semantic Web Technologies
Introduction to Semantic Web Technologies
 
Spivack Blogtalk 2008
Spivack Blogtalk 2008Spivack Blogtalk 2008
Spivack Blogtalk 2008
 
Slides.ppt
Slides.pptSlides.ppt
Slides.ppt
 
Slides.ppt
Slides.pptSlides.ppt
Slides.ppt
 
Nova Spivack - Semantic Web Talk
Nova Spivack - Semantic Web TalkNova Spivack - Semantic Web Talk
Nova Spivack - Semantic Web Talk
 
Semantic Web - Introduction
Semantic Web - IntroductionSemantic Web - Introduction
Semantic Web - Introduction
 
Slides
SlidesSlides
Slides
 
Slides.ppt
Slides.pptSlides.ppt
Slides.ppt
 
2011 05-02 linked data intro
2011 05-02 linked data intro2011 05-02 linked data intro
2011 05-02 linked data intro
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
2011 05-01 linked data
2011 05-01 linked data2011 05-01 linked data
2011 05-01 linked data
 
What do we want computers to do for us?
What do we want computers to do for us? What do we want computers to do for us?
What do we want computers to do for us?
 
The Semantic Web: It's for Real
The Semantic Web: It's for RealThe Semantic Web: It's for Real
The Semantic Web: It's for Real
 
Open belgium 2015 - open tourism
Open belgium 2015 - open tourismOpen belgium 2015 - open tourism
Open belgium 2015 - open tourism
 
RDFa From Theory to Practice
RDFa From Theory to PracticeRDFa From Theory to Practice
RDFa From Theory to Practice
 
The Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataThe Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of Metadata
 
Semantic Web Science
Semantic Web ScienceSemantic Web Science
Semantic Web Science
 
SIOC: Semantic Web for Social Media Sites
SIOC: Semantic Web for Social Media SitesSIOC: Semantic Web for Social Media Sites
SIOC: Semantic Web for Social Media Sites
 
Explaining The Semantic Web
Explaining The Semantic WebExplaining The Semantic Web
Explaining The Semantic Web
 
Web 3.0 Emerging
Web 3.0 EmergingWeb 3.0 Emerging
Web 3.0 Emerging
 

Mais de Mariano Rodriguez-Muro

SWT Lecture Session 9 - RDB2RDF direct mapping
SWT Lecture Session 9 - RDB2RDF direct mappingSWT Lecture Session 9 - RDB2RDF direct mapping
SWT Lecture Session 9 - RDB2RDF direct mappingMariano Rodriguez-Muro
 
SWT Lecture Session 8 - Inference in jena
SWT Lecture Session 8 - Inference in jenaSWT Lecture Session 8 - Inference in jena
SWT Lecture Session 8 - Inference in jenaMariano Rodriguez-Muro
 
SWT Lecture Session 7 - Advanced uses of RDFS
SWT Lecture Session 7 - Advanced uses of RDFSSWT Lecture Session 7 - Advanced uses of RDFS
SWT Lecture Session 7 - Advanced uses of RDFSMariano Rodriguez-Muro
 
SWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfs
SWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfsSWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfs
SWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfsMariano Rodriguez-Muro
 
SWT Lecture Session 4 - SW architectures and SPARQL
SWT Lecture Session 4 - SW architectures and SPARQLSWT Lecture Session 4 - SW architectures and SPARQL
SWT Lecture Session 4 - SW architectures and SPARQLMariano Rodriguez-Muro
 

Mais de Mariano Rodriguez-Muro (20)

SWT Lecture Session 2 - RDF
SWT Lecture Session 2 - RDFSWT Lecture Session 2 - RDF
SWT Lecture Session 2 - RDF
 
SWT Lab 3
SWT Lab 3SWT Lab 3
SWT Lab 3
 
SWT Lab 5
SWT Lab 5SWT Lab 5
SWT Lab 5
 
SWT Lab 2
SWT Lab 2SWT Lab 2
SWT Lab 2
 
SWT Lab 1
SWT Lab 1SWT Lab 1
SWT Lab 1
 
SWT Lecture Session 11 - R2RML part 2
SWT Lecture Session 11 - R2RML part 2SWT Lecture Session 11 - R2RML part 2
SWT Lecture Session 11 - R2RML part 2
 
SWT Lecture Session 10 R2RML Part 1
SWT Lecture Session 10 R2RML Part 1SWT Lecture Session 10 R2RML Part 1
SWT Lecture Session 10 R2RML Part 1
 
SWT Lecture Session 9 - RDB2RDF direct mapping
SWT Lecture Session 9 - RDB2RDF direct mappingSWT Lecture Session 9 - RDB2RDF direct mapping
SWT Lecture Session 9 - RDB2RDF direct mapping
 
SWT Lecture Session 8 - Rules
SWT Lecture Session 8 - RulesSWT Lecture Session 8 - Rules
SWT Lecture Session 8 - Rules
 
SWT Lecture Session 8 - Inference in jena
SWT Lecture Session 8 - Inference in jenaSWT Lecture Session 8 - Inference in jena
SWT Lecture Session 8 - Inference in jena
 
SWT Lecture Session 7 - Advanced uses of RDFS
SWT Lecture Session 7 - Advanced uses of RDFSSWT Lecture Session 7 - Advanced uses of RDFS
SWT Lecture Session 7 - Advanced uses of RDFS
 
SWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfs
SWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfsSWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfs
SWT Lecture Session 6 - RDFS semantics, inference techniques, sesame rdfs
 
SWT Lecture Session 5 - RDFS
SWT Lecture Session 5 - RDFSSWT Lecture Session 5 - RDFS
SWT Lecture Session 5 - RDFS
 
SWT Lecture Session 4 - SW architectures and SPARQL
SWT Lecture Session 4 - SW architectures and SPARQLSWT Lecture Session 4 - SW architectures and SPARQL
SWT Lecture Session 4 - SW architectures and SPARQL
 
SWT Lecture Session 4 - Sesame
SWT Lecture Session 4 - SesameSWT Lecture Session 4 - Sesame
SWT Lecture Session 4 - Sesame
 
SWT Lecture Session 3 - SPARQL
SWT Lecture Session 3 - SPARQLSWT Lecture Session 3 - SPARQL
SWT Lecture Session 3 - SPARQL
 
7 advanced uses of rdfs
7 advanced uses of rdfs7 advanced uses of rdfs
7 advanced uses of rdfs
 
5 rdfs
5 rdfs5 rdfs
5 rdfs
 
4 sw architectures and sparql
4 sw architectures and sparql4 sw architectures and sparql
4 sw architectures and sparql
 
4 sesame
4 sesame4 sesame
4 sesame
 

Último

Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 

Último (20)

Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 

Semantic Web Technologies 2012-2013 Part I

  • 1. + Semantic Web Technologies 2012-2013 Part I Mariano Rodriguez-Muro, Free University of Bozen-Bolzano
  • 2. + Disclaimer License This work is licensed under the Creative Commons Attribution-Share Alike 3.0 License http://creativecommons.org/licenses/by-sa/3.0/
  • 3. + Intro  Course organization  Intro to Semantic Web  Intro to Semantic Technologies
  • 5. + About me Mariano Rodríguez-Muro Assistant Professor at KRDB Faculty of computer Science (POS Building, 202) Tel. +390471016228 rodriguez =at= inf.unibz.it Research interests:  Techniques for query answering optimization  SPARQL, Big RDFS, virtual RDF  Data integration with Semantic Tech and SemTech in the enterprise.
  • 6. + About you  Which program?  Which semester?  Why are you here?   Topic relates to my area  Looking for project/thesis?  Just Interesting?   Topic is mandatory Need some credits? Special interests?
  • 7. + Course organization (Part I)  Website:   Moodle   … Schedule     http://rodriguez-muro.com/courses/index.php?title=SWT12 Lecture: Tuesday:10:30 am to 12:30 pm Lecture: Thursday 8:30 am to 10:30 am Lab: Tuesday 2:00 to 4:00 pm Office Hours   With appointment Please use forums as main means of comunication
  • 8. + Reference Material  Slides, Papers  Foundations of Semantic Web. Pascal HItzler, Markus Krotzsch and Sebastian Rudolph. Chapman & Hall/CRC, 2010. (Code FSW)  Semantic Web Programming. John Hebeler et. al. Wiley. 2009. (Code SWP)  Programming the Semantic Web. Toby Segaran, Colin Evans and Jamie Taylor. O‟Reilly. 2009. (Code PTSW) Available at the library. SWP and PTSW available as ebooks.
  • 9. + Grading  Part I 50%, Part II 50%  Grading Part I   Lab exercises: 15% Mid-term: 35%  Exercises: Each week a new assignment. All assignments are graded. All assignments are mandatory. Delivery must be done by the next week. Java and SQL/JDBC is required. Projects must be packaged with Maven.  Midterm. Covers all material seen during the lectures. From slides, presentation and selected book chapters/readings (marked at the end of each slide)
  • 11. + Web of Documents  Primary objects: documents  Degree of structure in data: low  Semantics of content:Implicit  Designed for: human consumption Links between documents
  • 12. + Web of documents: The problem
  • 14.
  • 15.
  • 16. + Web of data: The problem  How about this query:   How many romantic comedy Hollywood movies are directed by a person who is born in a city that has average temperature above 15 degrees!? You need to:   Find reliable sources containing facts about movies (genre & director), birthplaces of famous artists/directors, average temperature of cities across the world, etc.  The result: several lists of thousands of facts Integrate all the data, join the facts that come from heterogeneous sources Even if possible, it may take days to answer just a single query!
  • 17. + The Vision I have a dream for the Web in which computers become capable of analyzing all the data on the Web - the content, links, and transactions between people and computers. A Semantic Web, which should make this possible, has yet to emerge, but when it does, the day-to-day mechanisms of trade, bureaucracy and our daily lives will be handled by machines talking to machines. The intelligent agents people have touted for ages will finally materialize. Barners-Lee, 1999
  • 18. + The semantic web  Primary objects: things Links between: things  Degree of Structure: high  Explicit semantics of contents and links  Designed for both machines and humans
  • 21. + Not only about the web  The semantic web vision has generated technologies that are applied outside the web context including:   Government  Research (Bio, Geo, Cultural heritage, etc.)  Software development   Enterprise intelligence … Semantic technologies provide flexible and powerful tools to accomplish things that were not possible or not practical in the past.
  • 22. + 22 Introduction to the Semantic Web approach How does a Semantic Web approach help us merge data sets, infer new relations, and integrate outside data sources?
  • 23. + 23 The rough structure of data integration with SWT Map the various data onto an abstract data representation 1. • Make the data independent of its internal representation… 2. Merge the resulting representations 3. Start making queries on the whole • Queries not possible on the individual data sets
  • 24. + Data set “A”: A simplified book store Books ID Author ISBN0-00-651409-X id_xyz Authors ID Title The Glass Palace Name id_xyz Ghosh, Amitav Publishers ID Harper Collins id_qpr Home page Publisher Name id_qpr Publisher http://www.amitavghosh.com City London Year 2000 24
  • 25. + 25 1st: Export your data as a set of relations
  • 26. + 26 Some notes on the data export Data export does not necessarily mean physical conversion of the data Relations can be virtual, generated on-the-fly at query time via SQL “bridges” scraping HTML pages extracting data from Excel sheets etc. One can export part of the data
  • 27. + 27 Data set “F”: Another book store‟s data A 1 7 11 12 13 D E ID ISBN0 2020386682 Traducteur Titre Original Le Palais A13 ISBN-0-00-651409-X des miroirs ID ISBN-0-00-651409-X Auteur A12 2 3 6 B Nom Ghosh, Amitav Besse, Christianne
  • 28. 2nd: Export your second set of data + 28
  • 29. 3rd: start merging your data + 29
  • 30. 3rd: start merging your data (cont‟d) + 30
  • 31. 4th: Merge identical resources + 31
  • 32. + 32 Start making queries…  User of data set “F” can now ask queries like:  “What is the title of the original version of Le Palais des miroirs?”  This information is not in the data set “F”...  …but can be retrieved after merging with data set “A”!
  • 33. 5th: Query the merged data set + 33
  • 34. + 34 However, more can be achieved…  We “know” that a:author and f:auteur are really the same  But our automatic merge does not know that!  Let us add some extra information to the merged data:  a:author is equivalent to f:auteur  Both identify a Person, a category (type) for certain resources  a:name and f:nom are equivalent to foaf:name
  • 35. 3rd revisited: Use the extra knowledge + 35
  • 36. + 36 Start making richer queries!  User of data set “F” can now query:  “What is the home page of Le Palais des miroirs’s „auteur‟?”  The information is not in data set “F” or “A”…  …but was made available by:  Merging data sets “A” and “F”  Adding three simple “glue” statements
  • 38. + 38 Bring in other data sources  We can integrate new information into our merged data set from other sources   e.g. additional information about author Amitav Ghosh Perhaps the largest public source of general knowledge is Wikipedia  Structured data can be extracted from Wikipedia using dedicated tools May 12, 2009
  • 39. 7th: Merge with Wikipedia data + owl:sameAs 39
  • 40. 7th (cont‟d): Merge with Wikipedia data + owl:sameAs 40
  • 41. 7th (cont‟d): Merge with Wikipedia data + owl:sameAs 41
  • 42. + 42 Is that surprising?  It may look like it but, in fact, it should not be…  What happened via automatic means is done every day by Web users!  The difference: a bit of extra rigour so that machines could do this, too
  • 43. + 43 What did we do?  We combined different data sets that   ...are of different formats (RDBMS, Excel spreadsheet, (X)HTML, etc)   ...may be internal or somewhere on the Web ...have different names for the same relations We could combine the data because some URIs were identical  i.e. the ISBNs in this case  We could add some simple additional information (the “glue”) to help further merge data sets  The result? Answer queries that could not previously be asked
  • 44. + 44 What did we do? (cont‟d)
  • 45. + 45 The abstraction pays off because…  …the graph representation is independent of the details of the native structures  …a change in local database schemas, HTML structures, etc. do not affect the whole  “schema independence”  …new data, new connections can be added seamlessly & incrementally  … it doesn‟t matter if you are at the enterprise level or at the web level
  • 46. + 46 So where is the Semantic Web? Semantic Web technologies make such integration possible
  • 47. + Semantic Technologies Today: Applications, Use cases, Technologies, Systems
  • 48. + Web of data today
  • 49. + Semantics today  Linked-in  Schema.org  Good-relations  Oracle (Server)  IBM (DB2, Watson)  Apple (Siri)  SAP  Evri, Linked-in, many startups  Many deployed systems
  • 50. + Semantic Web Technologies  A set of technologies and frameworks that enable semantic data management, data integration and the web of data  Resource Description Framework (RDF)  A variety of data interchange formats (e.g., RDF/XML, N3, Turtle, NTriples)  Semantic languages such as RDF Schema (RDFS) and the Web Ontology Language (OWL) and Rules (SWRL)  Query language (SPARQL)  Software infrastructure (RDF/SPARQL frameworks, Triple stores, Data integrators, Query engines, Reasoners)  Publicly available connected dataset and open data initiatives (LOD)
  • 51. + SWT Part I  The Data Model (RDF)  The query language (SPARQL)  Software Development (Architecture, Frameworks and Tools)  A little more semantics (RDFS, inference techniques, tools and data integration)  Interacting with the enterprise (Legacy sources, XML, DBMS, mappings)  More complex semantics (Rules, data integration and reasoning with rules)
  • 52. + Reading material  PTSW Chapter 1  SWP Part I, Chapter 1  FTW Section 1.4

Notas do Editor

  1. http://creativecommons.org/licenses/by-sa/3.0/You are free:to Share — to copy, distribute and transmit the workto Remix — to adapt the workto make commercial use of the workUnder the following conditions:Attribution — You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work).Share Alike — If you alter, transform, or build upon this work, you may distribute the resulting work only under the same or similar license to this one.With the understanding that:Waiver — Any of the above conditions can be waived if you get permission from the copyright holder.Public Domain — Where the work or any of its elements is in the public domain under applicable law, that status is in no way affected by the license.Other Rights — In no way are any of the following rights affected by the license:Your fair dealing or fair use rights, or other applicable copyright exceptions and limitations;The author's moral rights;Rights other persons may have either in the work itself or in how the work is used, such as publicity or privacy rights.Notice — For any reuse or distribution, you must make clear to others the license terms of this work. The best way to do this is with a link to this web page.
  2. We need A data modelA query languageStandards and tools to publish the dataStandards and tools to consume the data
  3. Big players are betting on this