SlideShare uma empresa Scribd logo
1 de 48
Baixar para ler offline
INSTITUTE OF APPLIED INFORMATICS AND FORMAL DESCRIPTION METHODS (AIFB), FACULTY OF ECONOMICS AND 
BUSINESS ENGINEERING, DEPARTMENT OF INFORMATICS 
KIT – University of the State of Baden-Wuerttemberg and 
National Research Center of the Helmholtz Association 
www.kit.edu 
Using Entity Matching to enrich wiki content 
Semi-automatic enrichment of a semantic virtual research environment 
with external data. Development of a solution in form of a SMW Plugin.
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
2 
Acknowledgement 
This talk is funded by the the 
Institute of applied Informatics and formal Description Methods (AIFB), 
Faculty of Economics and Business Engineering, 
Department of Informatics, 
Karlsruhe Institute of Technology (KIT) 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
3 
Overview 
1. Introduction 
1. Problem defnition 
2. Objectives 
2. Solution architecture 
3. Implementation 
4. Evaluation 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
4 
Introduction 
● Background 
– Science becoming more global 
● Collaboration of institutes, across different cities, countries 
– More interconnections at the same time through global collaboration 
Wilsdon, James. Knowledge, networks and nations: global scientifc collaboration in the 21st century. The Royal Society, 2011. 
10/7/14 
optional
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
5 
Introduction 
● Virtual Research Environment (VRE) 
– Defnition: „A VRE is best viewed as a framework into which tools, services and 
resources can be plugged.“ M. Fraser. Virtual research environments: overview and activity. Ariadne, 44:31–40, 2005. 
– Defnition: „Virtual Research Environments are innovative, web-based, community-oriented, 
comprehensive, fexible, and secure working environments conceived to 
serve the needs of modern science.“ Candela, Leonardo, Donatella Castelli, and Pasquale Pagano. Virtual Research 
Environments: An Overview and a Research Agenda. Data Science Journal 12.0 (2013): GRDI75-GRDI81. 
– Synonyms: 
Virtual Research Environments (Carusi & Reimer, 2010) 
Science Gateways (Wilkins-Diehr, 2007) 
Collaboratories (Wulf, 1993) 
Digital Libraries (Candela, Castelli, & Pagano, 2011) 
Inhabited Information Spaces (Snowdon, Churchill, & Frécon, 2004) 
L. Candela. Virtual Research Environments, GRDI2020, http://www.grdi2020.eu/Repository/FileScaricati/eb0e8fea-c496-45b7-a0c5-831b90fe0045.pdf, 
Date: 06.04.2014 
– Provides the means to work on research questions collaboratively 
– Overcoming e.g. locational or temporal barriers 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
6 
Introduction 
● Virtual Research Environment (VRE) 
– https://nanoHUB.org/ „Online simulation and more for nanotechnology“ 
10/7/14 
optional
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
7 
Introduction 
● Virtual Research Environment (VRE) 
– http://smw-cora.org/ 
„Semantic collaborative Corpora Analysis for Humanities and Social Sciences“ 
Developed by: 
10/7/14 
Funded by:
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
8 
Overview 
1. Introduction 
1. Problem defnition 
2. Objectives 
2. Solution architecture 
3. Implementation 
4. Evaluation 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
9 
Problem defnition 
● Example: two Datasets taken from Semantic CorA Date: 17.02.2014 
– Different amounts of attributes 
● New research question arises: occupation now in focus 
● Problem: Occupation attribute set only on 375 of 4014 persons 
10/7/14
1) Identifcation of corresponding entities in external data sources 
● Example: Deutsche Nationalbibliothek (German National Library) 
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
10 
Problem defnition 
10/7/14
1) Identifcation of corresponding entities in external data sources 
● Example: Deutsche Nationalbibliothek (German National Library) 
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
11 
Problem defnition 
10/7/14 
Entity matching
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
12 
Problem defnition 
2) Data integration 
10/7/14 
● Example: Occupation 
Integration
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
13 
Overview 
1. Introduction 
1. Problem defnition 
2. Objectives 
2. Solution architecture 
3. Implementation 
4. Evaluation 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
14 
Objectives 
● Formulation of objectives 
10/7/14 
Development of a solution to the presented problems for a Virtual Research 
Environment. Important aspects: 
1) Semi-automatic identifcation of corresponding entities 
2) Semi-automatic integration of data from identifed entities 
3) Usability to support researchers in regard to 1) an 2) 
● Note to 3) Usability 
– Implication: Need to support expressing new matching strategies 
– Implication: Need to support data integration process
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
15 
Overview 
1. Introduction 
1. Problem defnition 
2. Objectives 
2. Solution architecture 
3. Implementation 
4. Evaluation 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
16 
Solution architecture 
1. Identifcation of various requirements 
2. Defne basic workfow: Entity selection 
10/7/14 
Set data source 
Specify 
reference 
links 
Entity matching 
Review Result review 
Links correct? 
Data integration 
no 
yes
1. Identifcation of various requirements 
2. Defne basic workfow 
3. Chose basic architecture alternative in regard to Entity Matching software. 
Assessment of the degree of software reuse, scalability, maintainability and 
development effort. 
Primary architecture options: 
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
17 
Solution architecture 
10/7/14 
A1 Stand-alone execution 
A2 Reuse existing implementations 
A3 Webservice 
A4 Own implementation 
optional
1. Identifcation of various requirements 
2. Defne basic workfow 
3. Chose basic architecture alternative in regard to Entity Matching software. 
Assessment of the degree of software reuse, scalability, maintainability and 
development effort. 
Primary architecture options: 
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
18 
Solution architecture 
10/7/14 
A1 Stand-alone execution ✓ 
A2 Reuse existing implementations 
A3 Webservice 
A4 Own implementation 
optional
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
19 
Overview 
1. Introduction 
1. Problem defnition 
2. Objectives 
2. Solution architecture 
3. Implementation 
4. Evaluation 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
20 
Implementation 
● Selection of Entity Matching frameworks 
– Identifcation of 33 Entity Matching frameworks 
10/7/14 
Swoosh, SERF, Silk - Linking Framework, LIMES, RelDC Engine, AgreementMakerLight, CIDER-CL, CroMatcher, IAMA, LogMap, LogMapLt, 
MaasMatch, StringsAuto MapSSS, ODGOMS, RiMOM, ServOMap, SLINT+, SPHeRe (System for Parallel Heterogeneity Resolution), SYNTHESIS, 
WeSeE-Match, XmapGen, XmapSiG, YAM++, … 
– Selection of suitable frameworks, based on... 
● Availability 
● Features 
● Integrability 
● Development/maintenance status 
– Candidates: LIMES (completed), Silk (prepared) 
– Abstraction over Entity Matching frameworks 
● Improves software evolution 
● Opportunity to combine different Entity Matching approaches, to improve 
results and lower context dependence [CKM09] 
optional
– Semantic Web Browser (SWB) for review and correction, as well as displaying 
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
21 
Implementation 
● Methods of Data integration 
10/7/14 
external data in local entities 
– Parser function to query job information und results
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
22 
Implementation 
● Front-end implementation 
– GUI structured in sections 
10/7/14 
● Job management, entity selection, data sources, reference links, 
result review and correction 
– Working in different sections independently is possible
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
23 
Overview 
1. Introduction 
1. Problem defnition 
2. Objectives 
2. Solution architecture 
3. Implementation 
4. Evaluation 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
24 
Evaluation 
● Formative and summative evaluation 
● Theoretical review 
– Achievement of objectives 
– Requirements covering 
● Assessment of usability 
– Usability test 
– Questionnaire ISONORM 9241/110 [JP97] (long version) 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
25 
Evaluation 
● Achievement of objectives 
Development of a solution for a virtual research environment. Essential aspects: 
1) Semi-automatic identifcation of corresponding entities 
2) Semi-automatic integration of data from identifed entities 
3) Usability to support researchers in regard to 1) und 2) 
– Generic solution as MediaWiki / SMW extension for all virtual research 
environments, based on MediaWiki / SMW. 
– Solution covers aspect 1) completely. 
– Solution covers aspect 2) partially: Concept takes semi-automatic data integration 
into account, though implementation in prototype not yet completed. 
– Solution covers aspect 3) for the most part. 
Results of a usability evaluation on following slides. 
10/7/14 
optional
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
26 
Evaluation 
● Usability test 
– Co-Discovery method in combination with the Coaching Method [SB11]. 
● Subjects should work through the main success scenario from use case 1 
(identifcation of external entities) together and communicate their thoughts in 
the process 
● Asking questions was allowed, but help on tasks was only given, if the 
subjects were not able to solve a task at all. 
● Problems were discovered by observation and feedback. 
– 2 Subjects: Researchers who were familiar with Semantic CorA. 
– “Real” data taken from Semantic CorA. 
– Overall result: 
With only little help the subjects were able to complete all steps of the main 
success scenario of use case 1 and thereby achive the goal of this use case. 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
27 
Evaluation 
● Usability test 
– Usability problems 
● Unintuitive adding of elements 
● Missing descriptions or explanations 
● No options for working on multiple elements simultaneously 
● Confusing layout 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
28 
Evaluation 
● ISONORM 9241/110 (long version) [JP97] 
– Standardized questionnaire, concerning the series of standards Ergonomie der 
Mensch-System-Interaktion, Section 110 of DIN EN ISO 9241 (German Institute for 
Standardization). 
– 35 Questions which are answered by 7-step gamut ranging from very negative to 
very positive. 
– Suitable for summative as well as formative evaluation. 
– Comparability with other software. 
– No special training for subjects required. 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
29 
Evaluation 
● Open problems 
– Functionality of data integration 
● Semantic Web Browser integration not yet completed 
● Prototype does not allow to create mappings from data to an internal 
vocabulary of the virtual research environment 
– Further integration of Entity Matching frameworks 
● LIMES Entity Matching framework integrated 
Silk framework selected but not yet implemented in the prototype 
– Extension of confguration options 
● Many technical details hidden from users 
● Evaluation for default values needed 
– Provenance and access control 
● No rights management, access control or provenance information apart from 
default SMW/MediaWiki functionality 
10/7/14
Reference value 
Evaluation result 
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
30 
Evaluation 
● ISONORM 9241/110 (long version) [JP97] 
– Result 
10/7/14 
Conformance to 
expectations 
Self-descriptiveness 
Suitablity for learning 
Controllability 
Suitablity for the task 
Fault tolerance 
Customizability
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
31 
Evaluation 
● Opportunities for improvement / outlook 
– Investigation on other implementation options (slide 17) 
A2 Reuse of existing implementations 
A3 Webservice 
A4 Own implementation 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
32 
Evaluation 
● Opportunities for improvement / outlook 
– Automatic selection of candidates for reference links (e.g. by approaches such as 
Silk [VBGK09] or SAIM [LHS+13]) 
10/7/14 
Quelle: Silk Link Discovery Framework Wiki 
https://www.assembla.com/wiki/show/silk/Managing_Reference_Links 
Quelle: SAIM Projekt Seite 
http://aksw.org/Projects/SAIM.html
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
33 
Evaluation 
● Opportunities for improvement / outlook 
– Entity Matching by crowdsourcing 
Interesting within the context of virtual research environments or wikis as some of these 
have large user groups and therefore have a great potential for utilizing crowdsourcing. 
10/7/14 
Game with a purpose (GWAP): Veri-Links, [LNE13]
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
34 
Thank you for your attention! 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
35 
Appendix 
● Use Case 1, notation by [ACB03] 
VREU – Virtual Research Environment User 
Primary Actor: VREU 
Level: User goal 
Precondition: The Virtual Research Environment contains at least one entity. 
Postcondition: The amount of entities in the Virtual Research Environment has not 
changed. 
Main success scenario: 
1. VREU selects entities which are to be identifed in external data sources. 
2. VREU sets the storage location of the external data source (SPARQL Endpoint or 
RDF File in local fle system). 
3. VREU provides some reference-links for the Entity Matching system. 
4. VREU reviews links generated by the Entity Matching system. 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
36 
Appendix 
● Use Case 1 (continuation) 
Extensions: 
2a. External data not accessible (unable to read input fle or SPARQL endpoint 
unavailable). 
2a1. Error is shown to the VREU. 
2a2. VREU cancels this use case or restarts. 
Variations: 
3’. Reference-links are proposed by the Entity Matching system and only have to 
be accepted or denied by the VREU. 
4’. Links are integrated into the Virtual Research Environment without review. 
4”. Links are only reviewed on selected entities. 
4”’. No links were found. 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
37 
Appendix 
● Use Case 2 
VREU – Virtual Research Environment User 
Primary Actor: VREU 
Level: User goal 
Includes: Use Case 1 
Precondition: At least 1 link was generated in Use Case 1. 
Invariant: The amount of entities does not change. 
Postcondition: Selected entities were enriched with external data. 
Main success scenario: 
1. VREU specifes a mapping from the vocabulary of the external dataset to the 
vocabulary of the Virtual Research Environment. 
2. VREU selects data types which are to be integrated. 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
38 
Appendix 
● Use Case 2 (continuation) 
Extensions: 2a. VREU defnes rules, how data should be modifed uppon import (e.g. 
by automatic translation). 
Variations: 1a. Mapping on the identity, to adopt attributes without change. 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
39 
Appendix 
● Related Work 
– Becker, Christian, Bizer, Christian, Erdmann, Michael and Greaves, Mark. 
Extending SMW+ with a Linked Data Integration Framework. Paper presented at 
the meeting of the Posters & Demos at the International Semantic Web Conference 
(ISWC2010), Shanghai, 2010. 
● Architecture: (experpt) 
– Web Data Access Module (LDSpider, SPARQL, RDF dumps) 
– Vocabulary Mapping Module (R2R Framework) 
– Identity Resolution Module (LDIF, Silk Server) 
– Quality Evaluation Module (Trust Policy Evaluation Engine) 
– Integrated Web Data (Named Graph Data model and provenance 
information) 
● Front-End: 
– Ontology Browser of SMW+ 
– Inline queries and interactive query interface 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
40 
Appendix 
● Related Work 
– Becker, Christian, Bizer, Christian, Erdmann, Michael and Greaves, Mark. 
Extending SMW+ with a Linked Data Integration Framework. Paper presented at 
the meeting of the Posters & Demos at the International Semantic Web Conference 
(ISWC2010), Shanghai, 2010. 
● Differences to this work (selection) 
– Specifc domain: Genes ↔ domain independence 
– Restricting matching to specifc entities 
– Supporting the user upon matching in new domains 
– Different workfows when matching 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
41 
Appendix 
● Implemented standards 
– XML 
Server ↔ client communication 
– Ontology Alignment Format 
Specifcation of reference-links 
Output of results 
– RDF with serializations RDF/XML, N-Triples, Turtle, OWL, N-Quads 
Input format of datasets 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
42 
Appendix 
● ISONORM 9241/110 (long version) evaluation results 
Result Reference value 
Conformance to the task 5.6 4.87 
Self-descriptiveness 2.7 4.55 
Conformance to expectations 4.6 4.94 
Suitability for learning 5.6 4.52 
Controllability 5.9 5.01 
Fault-tolerance 5.6 4.29 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
43 
Appendix 
● GUI – Entity Selection 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
44 
Appendix 
● GUI – Data Sources 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
45 
Appendix 
● GUI – Reference Links 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
46 
Appendix 
● GUI – Ergebnisse 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
47 
Sources 
[HB11] Heath, Tom und Christian Bizer: Linked data: Evolving the web into a global 
data space. Synthesis lectures on the semantic web: theory and technology, 
1(1):1–136, 2011. 
[Mar03] Martin, Robert Cecil: Agile Software Development: Principles, Patterns, and 
Practices. Prentice Hall PTR, Upper Saddle River, NJ, USA, 2003. 
[CMK09] Chen, Zhaoqi, Dmitri V. Kalashnikov und Sharad Mehrotra: Exploiting context 
analysis for combining multiple entity resolution systems. In: Proceedings of 
the 2009 ACM SIGMOD International Conference on Management of data, 
Seiten 207–218. ACM, 2009. 
[JP97] Prumper, Jochen: Der Benutzungsfragebogen ISONORM 9241/10: 
Ergebnisse Zur Reliabilitat und Validitat. In: Liskowsky, R., B.M. Velichkowsky 
und W. Wunschmann (Herausgeber): Software-Ergonomie ’97 – Usability 
Engineering: Integration von Mensch-Computer-Interaktion und Software- 
Entwicklung, Seiten 253–261, Stuttgart, Marz 1997. Teubner. 
[SB11] Sarodnick, Florian und Henning Brau: Methoden der Usability Evaluation. 
Verlag Hans Huber, 2011. 
10/7/14
Institute of applied Informatics and formal Description 
Methods (AIFB), Department of Informatics 
48 
Sources 
[VBGK09] Volz, Julius, Christian Bizer, Martin Gaedke und Georgi Kobilarov: 
Discovering and Maintaining Links on the Web of Data. In: Bernstein, 
Abraham, David R. Kar- ger, Tom Heath, Lee Feigenbaum, Diana 
Maynard, Enrico Motta und Krishnaprasad Thirunarayan (Herausgeber): 
International Semantic Web Conference, Band 5823 der Reihe Lecture 
Notes in Computer Science, Seiten 650–665. Springer, 2009. 
[LHS+13] Lyko, Klaus, Konrad Höffner, René Speck, Axel- Cyrille Ngonga Ngomo 
und Jens Lehmann: SAIM – One Step Closer to Zero-Confguration Link 
Discovery. In: The Semantic Web: ESWC 2013 Satellite Events, Seiten 
167–172. Springer, 2013. 
[ACB03] Adolph, Steve, Alistair Cockburn und Paul Bramble: Use cases effektiv 
erstellen : [das Fundament für gute Software- Entwicklung, 
Geschaftsprozesse mit uses cases modellieren, die Regeln für uses 
cases sicher beherrschen] / Alistair Cockburn. Übers. aus dem Amerikan. 
von Rudiger Dieterle. mitp, 2003. 
10/7/14

Mais conteúdo relacionado

Semelhante a Enriching SMW based Virtual Research Environments with external data, Jan Novacek, SMWCon Fall 2014

“Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services” “Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services” diannepatricia
 
SustainablePlaces_ifcOWL_applications_2015-09-17
SustainablePlaces_ifcOWL_applications_2015-09-17SustainablePlaces_ifcOWL_applications_2015-09-17
SustainablePlaces_ifcOWL_applications_2015-09-17Pieter Pauwels
 
ENVRIPLUS Data for Science Theme
ENVRIPLUS Data for Science ThemeENVRIPLUS Data for Science Theme
ENVRIPLUS Data for Science ThemeEUDAT
 
Minor Project Synopsis on Data Structure Visualizer
Minor Project Synopsis on Data Structure VisualizerMinor Project Synopsis on Data Structure Visualizer
Minor Project Synopsis on Data Structure VisualizerRonitShrivastava057
 
Using ADO.NET Entity Framework in Domain Driven Design: A Pattern Approach
Using ADO.NET Entity Framework in Domain Driven Design: A Pattern ApproachUsing ADO.NET Entity Framework in Domain Driven Design: A Pattern Approach
Using ADO.NET Entity Framework in Domain Driven Design: A Pattern ApproachHoan Phuc
 
Adopting a situated learning framework for (big) data projects
Adopting a situated learning framework for (big) data projectsAdopting a situated learning framework for (big) data projects
Adopting a situated learning framework for (big) data projectsCranfield University
 
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen Technologien
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen TechnologienTFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen Technologien
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen TechnologienTourismFastForward
 
Data for Science Service Portfolio
Data for Science Service PortfolioData for Science Service Portfolio
Data for Science Service PortfolioEUDAT
 
Proof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics InteroperabilityProof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics InteroperabilityOpen Cyber University of Korea
 
MSR 2022 Foundational Contribution Award Talk: Software Analytics: Reflection...
MSR 2022 Foundational Contribution Award Talk: Software Analytics: Reflection...MSR 2022 Foundational Contribution Award Talk: Software Analytics: Reflection...
MSR 2022 Foundational Contribution Award Talk: Software Analytics: Reflection...Tao Xie
 
Special Purpose IBM Center of excellence lab
Special Purpose IBM Center of excellence lab Special Purpose IBM Center of excellence lab
Special Purpose IBM Center of excellence lab Ganesan Narayanasamy
 

Semelhante a Enriching SMW based Virtual Research Environments with external data, Jan Novacek, SMWCon Fall 2014 (20)

“Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services” “Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services”
 
SustainablePlaces_ifcOWL_applications_2015-09-17
SustainablePlaces_ifcOWL_applications_2015-09-17SustainablePlaces_ifcOWL_applications_2015-09-17
SustainablePlaces_ifcOWL_applications_2015-09-17
 
ENVRIPLUS Data for Science Theme
ENVRIPLUS Data for Science ThemeENVRIPLUS Data for Science Theme
ENVRIPLUS Data for Science Theme
 
Sub1583
Sub1583Sub1583
Sub1583
 
Minor Project Synopsis on Data Structure Visualizer
Minor Project Synopsis on Data Structure VisualizerMinor Project Synopsis on Data Structure Visualizer
Minor Project Synopsis on Data Structure Visualizer
 
CV2015
CV2015CV2015
CV2015
 
Towards a Linked Data Publishing Methodology
Towards a Linked Data Publishing MethodologyTowards a Linked Data Publishing Methodology
Towards a Linked Data Publishing Methodology
 
MTech- Viva_Voce
MTech- Viva_VoceMTech- Viva_Voce
MTech- Viva_Voce
 
Using ADO.NET Entity Framework in Domain Driven Design: A Pattern Approach
Using ADO.NET Entity Framework in Domain Driven Design: A Pattern ApproachUsing ADO.NET Entity Framework in Domain Driven Design: A Pattern Approach
Using ADO.NET Entity Framework in Domain Driven Design: A Pattern Approach
 
IBM Think Milano
IBM Think MilanoIBM Think Milano
IBM Think Milano
 
BI Developer, NTU
BI Developer, NTUBI Developer, NTU
BI Developer, NTU
 
IDS 2013 - ROSKO 3
IDS 2013 - ROSKO 3IDS 2013 - ROSKO 3
IDS 2013 - ROSKO 3
 
Be computer-engineering-2012
Be computer-engineering-2012Be computer-engineering-2012
Be computer-engineering-2012
 
Adopting a situated learning framework for (big) data projects
Adopting a situated learning framework for (big) data projectsAdopting a situated learning framework for (big) data projects
Adopting a situated learning framework for (big) data projects
 
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen Technologien
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen TechnologienTFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen Technologien
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen Technologien
 
FDS_dept_ppt.pptx
FDS_dept_ppt.pptxFDS_dept_ppt.pptx
FDS_dept_ppt.pptx
 
Data for Science Service Portfolio
Data for Science Service PortfolioData for Science Service Portfolio
Data for Science Service Portfolio
 
Proof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics InteroperabilityProof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics Interoperability
 
MSR 2022 Foundational Contribution Award Talk: Software Analytics: Reflection...
MSR 2022 Foundational Contribution Award Talk: Software Analytics: Reflection...MSR 2022 Foundational Contribution Award Talk: Software Analytics: Reflection...
MSR 2022 Foundational Contribution Award Talk: Software Analytics: Reflection...
 
Special Purpose IBM Center of excellence lab
Special Purpose IBM Center of excellence lab Special Purpose IBM Center of excellence lab
Special Purpose IBM Center of excellence lab
 

Mais de KDZ - Zentrum für Verwaltungsforschung

Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014KDZ - Zentrum für Verwaltungsforschung
 
Interactive Pollutant Mapping Powered by SMW, Franz Borrmann, Alexander Gesin...
Interactive Pollutant Mapping Powered by SMW, Franz Borrmann, Alexander Gesin...Interactive Pollutant Mapping Powered by SMW, Franz Borrmann, Alexander Gesin...
Interactive Pollutant Mapping Powered by SMW, Franz Borrmann, Alexander Gesin...KDZ - Zentrum für Verwaltungsforschung
 
Wikis and knowlege management, Bernhard Krabina, SMWCon Fall 2014, Vienna
Wikis and knowlege management, Bernhard Krabina, SMWCon Fall 2014, ViennaWikis and knowlege management, Bernhard Krabina, SMWCon Fall 2014, Vienna
Wikis and knowlege management, Bernhard Krabina, SMWCon Fall 2014, ViennaKDZ - Zentrum für Verwaltungsforschung
 
Historical Wiki of Vienna - the largest city wiki, Christoph Sonnlechner, SMW...
Historical Wiki of Vienna - the largest city wiki, Christoph Sonnlechner, SMW...Historical Wiki of Vienna - the largest city wiki, Christoph Sonnlechner, SMW...
Historical Wiki of Vienna - the largest city wiki, Christoph Sonnlechner, SMW...KDZ - Zentrum für Verwaltungsforschung
 
SMW Use Cases at the Provincial Government of Lower Austria, Gerald Streimelw...
SMW Use Cases at the Provincial Government of Lower Austria, Gerald Streimelw...SMW Use Cases at the Provincial Government of Lower Austria, Gerald Streimelw...
SMW Use Cases at the Provincial Government of Lower Austria, Gerald Streimelw...KDZ - Zentrum für Verwaltungsforschung
 
The Canton of Zurich will deploy a SMW-based portal to manage historical monu...
The Canton of Zurich will deploy a SMW-based portal to manage historical monu...The Canton of Zurich will deploy a SMW-based portal to manage historical monu...
The Canton of Zurich will deploy a SMW-based portal to manage historical monu...KDZ - Zentrum für Verwaltungsforschung
 
Semantic MediaWiki as a part of an Industry 4.0 Landscape, Alexander Gesinn, ...
Semantic MediaWiki as a part of an Industry 4.0 Landscape, Alexander Gesinn, ...Semantic MediaWiki as a part of an Industry 4.0 Landscape, Alexander Gesinn, ...
Semantic MediaWiki as a part of an Industry 4.0 Landscape, Alexander Gesinn, ...KDZ - Zentrum für Verwaltungsforschung
 
Approved Revs v1.0 with Semantic Approvers, James Montalvo, SMWCon Fall 2014,...
Approved Revs v1.0 with Semantic Approvers, James Montalvo, SMWCon Fall 2014,...Approved Revs v1.0 with Semantic Approvers, James Montalvo, SMWCon Fall 2014,...
Approved Revs v1.0 with Semantic Approvers, James Montalvo, SMWCon Fall 2014,...KDZ - Zentrum für Verwaltungsforschung
 
SMW between OpenData, OpenGLAM, Linked Data and the Semantic Web, Bernhard Kr...
SMW between OpenData, OpenGLAM, Linked Data and the Semantic Web, Bernhard Kr...SMW between OpenData, OpenGLAM, Linked Data and the Semantic Web, Bernhard Kr...
SMW between OpenData, OpenGLAM, Linked Data and the Semantic Web, Bernhard Kr...KDZ - Zentrum für Verwaltungsforschung
 
Digital sustainability of open source communities, Matthias Stürmer, SMWCon F...
Digital sustainability of open source communities, Matthias Stürmer, SMWCon F...Digital sustainability of open source communities, Matthias Stürmer, SMWCon F...
Digital sustainability of open source communities, Matthias Stürmer, SMWCon F...KDZ - Zentrum für Verwaltungsforschung
 

Mais de KDZ - Zentrum für Verwaltungsforschung (17)

Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
 
Interactive Pollutant Mapping Powered by SMW, Franz Borrmann, Alexander Gesin...
Interactive Pollutant Mapping Powered by SMW, Franz Borrmann, Alexander Gesin...Interactive Pollutant Mapping Powered by SMW, Franz Borrmann, Alexander Gesin...
Interactive Pollutant Mapping Powered by SMW, Franz Borrmann, Alexander Gesin...
 
Wikis and knowlege management, Bernhard Krabina, SMWCon Fall 2014, Vienna
Wikis and knowlege management, Bernhard Krabina, SMWCon Fall 2014, ViennaWikis and knowlege management, Bernhard Krabina, SMWCon Fall 2014, Vienna
Wikis and knowlege management, Bernhard Krabina, SMWCon Fall 2014, Vienna
 
Wikimedia Community Effort, Claudia Garad, SMWCon Fall 2014
Wikimedia Community Effort, Claudia Garad, SMWCon Fall 2014Wikimedia Community Effort, Claudia Garad, SMWCon Fall 2014
Wikimedia Community Effort, Claudia Garad, SMWCon Fall 2014
 
Meeting Minutes & Group Communication, Daren Welsh, SMWCon Fall 2014
Meeting Minutes & Group Communication, Daren Welsh, SMWCon Fall 2014Meeting Minutes & Group Communication, Daren Welsh, SMWCon Fall 2014
Meeting Minutes & Group Communication, Daren Welsh, SMWCon Fall 2014
 
Historical Wiki of Vienna - the largest city wiki, Christoph Sonnlechner, SMW...
Historical Wiki of Vienna - the largest city wiki, Christoph Sonnlechner, SMW...Historical Wiki of Vienna - the largest city wiki, Christoph Sonnlechner, SMW...
Historical Wiki of Vienna - the largest city wiki, Christoph Sonnlechner, SMW...
 
SMW Use Cases at the Provincial Government of Lower Austria, Gerald Streimelw...
SMW Use Cases at the Provincial Government of Lower Austria, Gerald Streimelw...SMW Use Cases at the Provincial Government of Lower Austria, Gerald Streimelw...
SMW Use Cases at the Provincial Government of Lower Austria, Gerald Streimelw...
 
GrazWiki / Baugeschichtewiki, Manfred Brunner, SMWCon Fall 2014
GrazWiki / Baugeschichtewiki, Manfred Brunner, SMWCon Fall 2014GrazWiki / Baugeschichtewiki, Manfred Brunner, SMWCon Fall 2014
GrazWiki / Baugeschichtewiki, Manfred Brunner, SMWCon Fall 2014
 
Introducing HR Metadata, Tom Kronenburg, SMWCon Fall 2014
Introducing HR Metadata, Tom Kronenburg, SMWCon Fall 2014Introducing HR Metadata, Tom Kronenburg, SMWCon Fall 2014
Introducing HR Metadata, Tom Kronenburg, SMWCon Fall 2014
 
History of the EVA Wiki, Daren Welsh, James Montalvo, SMWCon 2014
History of the EVA Wiki, Daren Welsh, James Montalvo, SMWCon 2014History of the EVA Wiki, Daren Welsh, James Montalvo, SMWCon 2014
History of the EVA Wiki, Daren Welsh, James Montalvo, SMWCon 2014
 
The Canton of Zurich will deploy a SMW-based portal to manage historical monu...
The Canton of Zurich will deploy a SMW-based portal to manage historical monu...The Canton of Zurich will deploy a SMW-based portal to manage historical monu...
The Canton of Zurich will deploy a SMW-based portal to manage historical monu...
 
Visualisualisation of Semantic Relations, Tim Stein, SMWCon Fall 2014
Visualisualisation of Semantic Relations, Tim Stein, SMWCon Fall 2014Visualisualisation of Semantic Relations, Tim Stein, SMWCon Fall 2014
Visualisualisation of Semantic Relations, Tim Stein, SMWCon Fall 2014
 
Relaunch of HaloACL, Wojtek Breiter, SMWCon Fall 2014
Relaunch of HaloACL, Wojtek Breiter, SMWCon Fall 2014Relaunch of HaloACL, Wojtek Breiter, SMWCon Fall 2014
Relaunch of HaloACL, Wojtek Breiter, SMWCon Fall 2014
 
Semantic MediaWiki as a part of an Industry 4.0 Landscape, Alexander Gesinn, ...
Semantic MediaWiki as a part of an Industry 4.0 Landscape, Alexander Gesinn, ...Semantic MediaWiki as a part of an Industry 4.0 Landscape, Alexander Gesinn, ...
Semantic MediaWiki as a part of an Industry 4.0 Landscape, Alexander Gesinn, ...
 
Approved Revs v1.0 with Semantic Approvers, James Montalvo, SMWCon Fall 2014,...
Approved Revs v1.0 with Semantic Approvers, James Montalvo, SMWCon Fall 2014,...Approved Revs v1.0 with Semantic Approvers, James Montalvo, SMWCon Fall 2014,...
Approved Revs v1.0 with Semantic Approvers, James Montalvo, SMWCon Fall 2014,...
 
SMW between OpenData, OpenGLAM, Linked Data and the Semantic Web, Bernhard Kr...
SMW between OpenData, OpenGLAM, Linked Data and the Semantic Web, Bernhard Kr...SMW between OpenData, OpenGLAM, Linked Data and the Semantic Web, Bernhard Kr...
SMW between OpenData, OpenGLAM, Linked Data and the Semantic Web, Bernhard Kr...
 
Digital sustainability of open source communities, Matthias Stürmer, SMWCon F...
Digital sustainability of open source communities, Matthias Stürmer, SMWCon F...Digital sustainability of open source communities, Matthias Stürmer, SMWCon F...
Digital sustainability of open source communities, Matthias Stürmer, SMWCon F...
 

Último

Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfproinshot.com
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdfPearlKirahMaeRagusta1
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesVictorSzoltysek
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...software pro Development
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplatePresentation.STUDIO
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...kalichargn70th171
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 

Último (20)

Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdf
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 

Enriching SMW based Virtual Research Environments with external data, Jan Novacek, SMWCon Fall 2014

  • 1. INSTITUTE OF APPLIED INFORMATICS AND FORMAL DESCRIPTION METHODS (AIFB), FACULTY OF ECONOMICS AND BUSINESS ENGINEERING, DEPARTMENT OF INFORMATICS KIT – University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association www.kit.edu Using Entity Matching to enrich wiki content Semi-automatic enrichment of a semantic virtual research environment with external data. Development of a solution in form of a SMW Plugin.
  • 2. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 2 Acknowledgement This talk is funded by the the Institute of applied Informatics and formal Description Methods (AIFB), Faculty of Economics and Business Engineering, Department of Informatics, Karlsruhe Institute of Technology (KIT) 10/7/14
  • 3. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 3 Overview 1. Introduction 1. Problem defnition 2. Objectives 2. Solution architecture 3. Implementation 4. Evaluation 10/7/14
  • 4. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 4 Introduction ● Background – Science becoming more global ● Collaboration of institutes, across different cities, countries – More interconnections at the same time through global collaboration Wilsdon, James. Knowledge, networks and nations: global scientifc collaboration in the 21st century. The Royal Society, 2011. 10/7/14 optional
  • 5. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 5 Introduction ● Virtual Research Environment (VRE) – Defnition: „A VRE is best viewed as a framework into which tools, services and resources can be plugged.“ M. Fraser. Virtual research environments: overview and activity. Ariadne, 44:31–40, 2005. – Defnition: „Virtual Research Environments are innovative, web-based, community-oriented, comprehensive, fexible, and secure working environments conceived to serve the needs of modern science.“ Candela, Leonardo, Donatella Castelli, and Pasquale Pagano. Virtual Research Environments: An Overview and a Research Agenda. Data Science Journal 12.0 (2013): GRDI75-GRDI81. – Synonyms: Virtual Research Environments (Carusi & Reimer, 2010) Science Gateways (Wilkins-Diehr, 2007) Collaboratories (Wulf, 1993) Digital Libraries (Candela, Castelli, & Pagano, 2011) Inhabited Information Spaces (Snowdon, Churchill, & Frécon, 2004) L. Candela. Virtual Research Environments, GRDI2020, http://www.grdi2020.eu/Repository/FileScaricati/eb0e8fea-c496-45b7-a0c5-831b90fe0045.pdf, Date: 06.04.2014 – Provides the means to work on research questions collaboratively – Overcoming e.g. locational or temporal barriers 10/7/14
  • 6. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 6 Introduction ● Virtual Research Environment (VRE) – https://nanoHUB.org/ „Online simulation and more for nanotechnology“ 10/7/14 optional
  • 7. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 7 Introduction ● Virtual Research Environment (VRE) – http://smw-cora.org/ „Semantic collaborative Corpora Analysis for Humanities and Social Sciences“ Developed by: 10/7/14 Funded by:
  • 8. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 8 Overview 1. Introduction 1. Problem defnition 2. Objectives 2. Solution architecture 3. Implementation 4. Evaluation 10/7/14
  • 9. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 9 Problem defnition ● Example: two Datasets taken from Semantic CorA Date: 17.02.2014 – Different amounts of attributes ● New research question arises: occupation now in focus ● Problem: Occupation attribute set only on 375 of 4014 persons 10/7/14
  • 10. 1) Identifcation of corresponding entities in external data sources ● Example: Deutsche Nationalbibliothek (German National Library) Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 10 Problem defnition 10/7/14
  • 11. 1) Identifcation of corresponding entities in external data sources ● Example: Deutsche Nationalbibliothek (German National Library) Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 11 Problem defnition 10/7/14 Entity matching
  • 12. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 12 Problem defnition 2) Data integration 10/7/14 ● Example: Occupation Integration
  • 13. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 13 Overview 1. Introduction 1. Problem defnition 2. Objectives 2. Solution architecture 3. Implementation 4. Evaluation 10/7/14
  • 14. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 14 Objectives ● Formulation of objectives 10/7/14 Development of a solution to the presented problems for a Virtual Research Environment. Important aspects: 1) Semi-automatic identifcation of corresponding entities 2) Semi-automatic integration of data from identifed entities 3) Usability to support researchers in regard to 1) an 2) ● Note to 3) Usability – Implication: Need to support expressing new matching strategies – Implication: Need to support data integration process
  • 15. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 15 Overview 1. Introduction 1. Problem defnition 2. Objectives 2. Solution architecture 3. Implementation 4. Evaluation 10/7/14
  • 16. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 16 Solution architecture 1. Identifcation of various requirements 2. Defne basic workfow: Entity selection 10/7/14 Set data source Specify reference links Entity matching Review Result review Links correct? Data integration no yes
  • 17. 1. Identifcation of various requirements 2. Defne basic workfow 3. Chose basic architecture alternative in regard to Entity Matching software. Assessment of the degree of software reuse, scalability, maintainability and development effort. Primary architecture options: Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 17 Solution architecture 10/7/14 A1 Stand-alone execution A2 Reuse existing implementations A3 Webservice A4 Own implementation optional
  • 18. 1. Identifcation of various requirements 2. Defne basic workfow 3. Chose basic architecture alternative in regard to Entity Matching software. Assessment of the degree of software reuse, scalability, maintainability and development effort. Primary architecture options: Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 18 Solution architecture 10/7/14 A1 Stand-alone execution ✓ A2 Reuse existing implementations A3 Webservice A4 Own implementation optional
  • 19. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 19 Overview 1. Introduction 1. Problem defnition 2. Objectives 2. Solution architecture 3. Implementation 4. Evaluation 10/7/14
  • 20. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 20 Implementation ● Selection of Entity Matching frameworks – Identifcation of 33 Entity Matching frameworks 10/7/14 Swoosh, SERF, Silk - Linking Framework, LIMES, RelDC Engine, AgreementMakerLight, CIDER-CL, CroMatcher, IAMA, LogMap, LogMapLt, MaasMatch, StringsAuto MapSSS, ODGOMS, RiMOM, ServOMap, SLINT+, SPHeRe (System for Parallel Heterogeneity Resolution), SYNTHESIS, WeSeE-Match, XmapGen, XmapSiG, YAM++, … – Selection of suitable frameworks, based on... ● Availability ● Features ● Integrability ● Development/maintenance status – Candidates: LIMES (completed), Silk (prepared) – Abstraction over Entity Matching frameworks ● Improves software evolution ● Opportunity to combine different Entity Matching approaches, to improve results and lower context dependence [CKM09] optional
  • 21. – Semantic Web Browser (SWB) for review and correction, as well as displaying Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 21 Implementation ● Methods of Data integration 10/7/14 external data in local entities – Parser function to query job information und results
  • 22. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 22 Implementation ● Front-end implementation – GUI structured in sections 10/7/14 ● Job management, entity selection, data sources, reference links, result review and correction – Working in different sections independently is possible
  • 23. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 23 Overview 1. Introduction 1. Problem defnition 2. Objectives 2. Solution architecture 3. Implementation 4. Evaluation 10/7/14
  • 24. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 24 Evaluation ● Formative and summative evaluation ● Theoretical review – Achievement of objectives – Requirements covering ● Assessment of usability – Usability test – Questionnaire ISONORM 9241/110 [JP97] (long version) 10/7/14
  • 25. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 25 Evaluation ● Achievement of objectives Development of a solution for a virtual research environment. Essential aspects: 1) Semi-automatic identifcation of corresponding entities 2) Semi-automatic integration of data from identifed entities 3) Usability to support researchers in regard to 1) und 2) – Generic solution as MediaWiki / SMW extension for all virtual research environments, based on MediaWiki / SMW. – Solution covers aspect 1) completely. – Solution covers aspect 2) partially: Concept takes semi-automatic data integration into account, though implementation in prototype not yet completed. – Solution covers aspect 3) for the most part. Results of a usability evaluation on following slides. 10/7/14 optional
  • 26. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 26 Evaluation ● Usability test – Co-Discovery method in combination with the Coaching Method [SB11]. ● Subjects should work through the main success scenario from use case 1 (identifcation of external entities) together and communicate their thoughts in the process ● Asking questions was allowed, but help on tasks was only given, if the subjects were not able to solve a task at all. ● Problems were discovered by observation and feedback. – 2 Subjects: Researchers who were familiar with Semantic CorA. – “Real” data taken from Semantic CorA. – Overall result: With only little help the subjects were able to complete all steps of the main success scenario of use case 1 and thereby achive the goal of this use case. 10/7/14
  • 27. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 27 Evaluation ● Usability test – Usability problems ● Unintuitive adding of elements ● Missing descriptions or explanations ● No options for working on multiple elements simultaneously ● Confusing layout 10/7/14
  • 28. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 28 Evaluation ● ISONORM 9241/110 (long version) [JP97] – Standardized questionnaire, concerning the series of standards Ergonomie der Mensch-System-Interaktion, Section 110 of DIN EN ISO 9241 (German Institute for Standardization). – 35 Questions which are answered by 7-step gamut ranging from very negative to very positive. – Suitable for summative as well as formative evaluation. – Comparability with other software. – No special training for subjects required. 10/7/14
  • 29. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 29 Evaluation ● Open problems – Functionality of data integration ● Semantic Web Browser integration not yet completed ● Prototype does not allow to create mappings from data to an internal vocabulary of the virtual research environment – Further integration of Entity Matching frameworks ● LIMES Entity Matching framework integrated Silk framework selected but not yet implemented in the prototype – Extension of confguration options ● Many technical details hidden from users ● Evaluation for default values needed – Provenance and access control ● No rights management, access control or provenance information apart from default SMW/MediaWiki functionality 10/7/14
  • 30. Reference value Evaluation result Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 30 Evaluation ● ISONORM 9241/110 (long version) [JP97] – Result 10/7/14 Conformance to expectations Self-descriptiveness Suitablity for learning Controllability Suitablity for the task Fault tolerance Customizability
  • 31. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 31 Evaluation ● Opportunities for improvement / outlook – Investigation on other implementation options (slide 17) A2 Reuse of existing implementations A3 Webservice A4 Own implementation 10/7/14
  • 32. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 32 Evaluation ● Opportunities for improvement / outlook – Automatic selection of candidates for reference links (e.g. by approaches such as Silk [VBGK09] or SAIM [LHS+13]) 10/7/14 Quelle: Silk Link Discovery Framework Wiki https://www.assembla.com/wiki/show/silk/Managing_Reference_Links Quelle: SAIM Projekt Seite http://aksw.org/Projects/SAIM.html
  • 33. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 33 Evaluation ● Opportunities for improvement / outlook – Entity Matching by crowdsourcing Interesting within the context of virtual research environments or wikis as some of these have large user groups and therefore have a great potential for utilizing crowdsourcing. 10/7/14 Game with a purpose (GWAP): Veri-Links, [LNE13]
  • 34. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 34 Thank you for your attention! 10/7/14
  • 35. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 35 Appendix ● Use Case 1, notation by [ACB03] VREU – Virtual Research Environment User Primary Actor: VREU Level: User goal Precondition: The Virtual Research Environment contains at least one entity. Postcondition: The amount of entities in the Virtual Research Environment has not changed. Main success scenario: 1. VREU selects entities which are to be identifed in external data sources. 2. VREU sets the storage location of the external data source (SPARQL Endpoint or RDF File in local fle system). 3. VREU provides some reference-links for the Entity Matching system. 4. VREU reviews links generated by the Entity Matching system. 10/7/14
  • 36. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 36 Appendix ● Use Case 1 (continuation) Extensions: 2a. External data not accessible (unable to read input fle or SPARQL endpoint unavailable). 2a1. Error is shown to the VREU. 2a2. VREU cancels this use case or restarts. Variations: 3’. Reference-links are proposed by the Entity Matching system and only have to be accepted or denied by the VREU. 4’. Links are integrated into the Virtual Research Environment without review. 4”. Links are only reviewed on selected entities. 4”’. No links were found. 10/7/14
  • 37. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 37 Appendix ● Use Case 2 VREU – Virtual Research Environment User Primary Actor: VREU Level: User goal Includes: Use Case 1 Precondition: At least 1 link was generated in Use Case 1. Invariant: The amount of entities does not change. Postcondition: Selected entities were enriched with external data. Main success scenario: 1. VREU specifes a mapping from the vocabulary of the external dataset to the vocabulary of the Virtual Research Environment. 2. VREU selects data types which are to be integrated. 10/7/14
  • 38. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 38 Appendix ● Use Case 2 (continuation) Extensions: 2a. VREU defnes rules, how data should be modifed uppon import (e.g. by automatic translation). Variations: 1a. Mapping on the identity, to adopt attributes without change. 10/7/14
  • 39. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 39 Appendix ● Related Work – Becker, Christian, Bizer, Christian, Erdmann, Michael and Greaves, Mark. Extending SMW+ with a Linked Data Integration Framework. Paper presented at the meeting of the Posters & Demos at the International Semantic Web Conference (ISWC2010), Shanghai, 2010. ● Architecture: (experpt) – Web Data Access Module (LDSpider, SPARQL, RDF dumps) – Vocabulary Mapping Module (R2R Framework) – Identity Resolution Module (LDIF, Silk Server) – Quality Evaluation Module (Trust Policy Evaluation Engine) – Integrated Web Data (Named Graph Data model and provenance information) ● Front-End: – Ontology Browser of SMW+ – Inline queries and interactive query interface 10/7/14
  • 40. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 40 Appendix ● Related Work – Becker, Christian, Bizer, Christian, Erdmann, Michael and Greaves, Mark. Extending SMW+ with a Linked Data Integration Framework. Paper presented at the meeting of the Posters & Demos at the International Semantic Web Conference (ISWC2010), Shanghai, 2010. ● Differences to this work (selection) – Specifc domain: Genes ↔ domain independence – Restricting matching to specifc entities – Supporting the user upon matching in new domains – Different workfows when matching 10/7/14
  • 41. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 41 Appendix ● Implemented standards – XML Server ↔ client communication – Ontology Alignment Format Specifcation of reference-links Output of results – RDF with serializations RDF/XML, N-Triples, Turtle, OWL, N-Quads Input format of datasets 10/7/14
  • 42. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 42 Appendix ● ISONORM 9241/110 (long version) evaluation results Result Reference value Conformance to the task 5.6 4.87 Self-descriptiveness 2.7 4.55 Conformance to expectations 4.6 4.94 Suitability for learning 5.6 4.52 Controllability 5.9 5.01 Fault-tolerance 5.6 4.29 10/7/14
  • 43. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 43 Appendix ● GUI – Entity Selection 10/7/14
  • 44. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 44 Appendix ● GUI – Data Sources 10/7/14
  • 45. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 45 Appendix ● GUI – Reference Links 10/7/14
  • 46. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 46 Appendix ● GUI – Ergebnisse 10/7/14
  • 47. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 47 Sources [HB11] Heath, Tom und Christian Bizer: Linked data: Evolving the web into a global data space. Synthesis lectures on the semantic web: theory and technology, 1(1):1–136, 2011. [Mar03] Martin, Robert Cecil: Agile Software Development: Principles, Patterns, and Practices. Prentice Hall PTR, Upper Saddle River, NJ, USA, 2003. [CMK09] Chen, Zhaoqi, Dmitri V. Kalashnikov und Sharad Mehrotra: Exploiting context analysis for combining multiple entity resolution systems. In: Proceedings of the 2009 ACM SIGMOD International Conference on Management of data, Seiten 207–218. ACM, 2009. [JP97] Prumper, Jochen: Der Benutzungsfragebogen ISONORM 9241/10: Ergebnisse Zur Reliabilitat und Validitat. In: Liskowsky, R., B.M. Velichkowsky und W. Wunschmann (Herausgeber): Software-Ergonomie ’97 – Usability Engineering: Integration von Mensch-Computer-Interaktion und Software- Entwicklung, Seiten 253–261, Stuttgart, Marz 1997. Teubner. [SB11] Sarodnick, Florian und Henning Brau: Methoden der Usability Evaluation. Verlag Hans Huber, 2011. 10/7/14
  • 48. Institute of applied Informatics and formal Description Methods (AIFB), Department of Informatics 48 Sources [VBGK09] Volz, Julius, Christian Bizer, Martin Gaedke und Georgi Kobilarov: Discovering and Maintaining Links on the Web of Data. In: Bernstein, Abraham, David R. Kar- ger, Tom Heath, Lee Feigenbaum, Diana Maynard, Enrico Motta und Krishnaprasad Thirunarayan (Herausgeber): International Semantic Web Conference, Band 5823 der Reihe Lecture Notes in Computer Science, Seiten 650–665. Springer, 2009. [LHS+13] Lyko, Klaus, Konrad Höffner, René Speck, Axel- Cyrille Ngonga Ngomo und Jens Lehmann: SAIM – One Step Closer to Zero-Confguration Link Discovery. In: The Semantic Web: ESWC 2013 Satellite Events, Seiten 167–172. Springer, 2013. [ACB03] Adolph, Steve, Alistair Cockburn und Paul Bramble: Use cases effektiv erstellen : [das Fundament für gute Software- Entwicklung, Geschaftsprozesse mit uses cases modellieren, die Regeln für uses cases sicher beherrschen] / Alistair Cockburn. Übers. aus dem Amerikan. von Rudiger Dieterle. mitp, 2003. 10/7/14