SlideShare uma empresa Scribd logo
1 de 11
Baixar para ler offline
STI – Semantic Technology Institute Innsbruck
STI Community – An approach
for a semantically enhanced
company platform
Carmen Brenner, Andreas Thalhammer,
and Dieter Fensel
STI Innsbruck Technical Report
September 11, 2012
STI Innsbruck
University of Innsbruck
Technologiepark,
Technikerstraße 21
6020 Innsbruck
Austria
http://www.sti-innsbruck.at/
STI Innsbruck TR 2012-08-06
2
This technical report will introduce the background and
the goal of the STI International Community platform, an
open online platform with the goal to collect information
on company and business entities that are active in the
field of semantic technologies. Based on the ideas of
Crowdsourcing the data is created and enriched by the
semantic community itself therefore introducing the
community's knowledge into the dataset. The platform is
accessible at http://community.sti2.at.
STI Innsbruck TR 2012-08-06
3
Introduction ............................................................................................4	
  
The platform structure..............................................................................5	
  
Technologies ............................................................................ 6	
  
Vocabulary of the platform ..................................................... 6	
  
Crowdsourcing the data........................................................... 9	
  
Accessing the data..................................................................10	
  
The status .............................................................................................10	
  
STI Innsbruck TR 2012-08-06
4
Introduction
The interest in and the number of applications enriched by semantic technology
have grown fast in the last years. No longer are terms like Semantic Web, Big
Data or Linked Open Data only associated with research, but semantic
technology is on the cusp of mainstream adoption and represents a huge
potential for advances in all areas of ICT. But it has also become more and
more difficult for interested parties to keep track of all these companies that try
to bring semantic technology into our daily lives with a variety of products.
Information on companies that are active in the field of semantic technologies is
usually diverse and widely distributed, and hard to get by. Additionally, in
many cases, the data on a company or a product is not provided in a machine-
readable format like RDF, but rather in an only human-readable way.
Out of this dilemma the idea was born to create a list of companies that are
active in the field of semantic technology in order to have a central point where
interested parties can easily browse for companies and their products. We
initiated the project in December 2010 collecting information on companies
working in the semantic technology field and the first draft was publicly
announced to the semantic technology community on December 20, 2010 over
the W3C Semantic Web mailing list1
. The feedback received was huge and
mostly very positive, and a great number of companies were proposed to be
added to the list2
.
1
http://lists.w3.org/Archives/Public/semantic-web/
2
The whole conversation on the W3C Semantic Web mailing lists is available at:
http://lists.w3.org/Archives/Public/semantic-web/2010Dec/0190.html
STI Innsbruck TR 2012-08-06
5
Such change and update requests to the list were numerous, which showed that
a simple list that only offers a short description of the companies with a link to
their Web sites as it started out at the beginning was simply not enough. There
was demand to allow the semantics community to easily add new companies or
edit existing profiles, and therefore maintain their public data themselves.
The idea to build an open online platform was therefore initiated in co-
operation with STI Research and Consultancy3
. The focus and vision of STI
Research and Consultancy is to ensure that semantics will become part of
everyday life, and such an open online platform was a huge step in supporting
the community.
The platform structure
This section will shortly describe the structure of the platform including the
technologies used and the vocabulary.
The STI Community platform was designed to be an online platform with the
goals
1. To collect data on companies that are active in the field of semantic
technologies,
2. To support developers and vendors of semantic technology in their wish
to position their businesses and products in the Internet, and
3. To give users who are searching for semantic applications and semantic
technology vendors a place to start their search.
3
http://research.sti2.org
STI Innsbruck TR 2012-08-06
6
The term “open” in this context refers to the concept of Crowdsourcing4
where
the profiles of companies are added and maintained by the community.
Technologies
The STI International Community platform makes by itself heavy use of
semantic technology. All information on companies is stored as data triples in
an underlying triple store. For this purpose we use OWLIM5
, a well-known
semantic repository that supports the load of tens of billions of RDF statements,
uses non-trivial inference and delivers outstanding multi-user query performance.
It is entirely developed in Java and uses Forest6
, a lightweight web interface
that brings together tools for execution of SPARQL queries, visual exploration
of RDF triples, and full text. Forest allows easy connectivity to the underlying
OWLIM triple store via Sesame a de-facto standard framework for processing
RDF data that includes parsing, storing, inferencing and querying of/over such
data.
Vocabulary of the platform
The vocabulary used to describe companies is based on several existing and
well-distributed vocabularies including RDF Schema 7
, Friend of a Friend
(FOAF)8
, Dublin Core9
and GoodRelations10
, but has also a small vocabulary of
its own where the category and the attributes of a company are specified. The
4
http://en.wikipedia.org/wiki/Crowdsourcing
5
http://www.ontotext.com/owlim
6
http://www.ontotext.com/
7
http://www.w3.org/TR/rdf-schema/
8
http://www.foaf-project.org/
9
http://dublincore.org/documents/dcmi-terms/
10
http://www.heppnetz.de/ontologies/goodrelations/v1.html
STI Innsbruck TR 2012-08-06
7
mini-vocabulary of the STI International Community platform specifies the
different categories and the properties a company can have.
Company categories (e.g. Vendor, User or Marketplace) are in this vocabulary
defined as subclasses of RDF Schema’s Organization. We have pre-defined the
following company attributes: companyProperty, description, logo, name and
homepage. companyProperty will used by the platform to describe all user
generated company attributes.
In the following your will see the XML for the vocabulary as described above.
<?xml version="1.0" encoding="utf-8"?>
<rdf:RDF xml:base="http://community.sti2.at/company/"
xmlns:dc="http://purl.org/dc/terms/"
xmlns:foaf="http://xmlns.com/foaf/0.1/"
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
xmlns:stc="http://community.sti2.at/spec.rdf#"
xmlns:xhv="http://www.w3.org/1999/xhtml/vocab#"
xmlns:xml="http://www.w3.org/XML/1998/namespace"
xmlns:owl="http://www.w3.org/2002/07/owl">
<owl:Ontology
rdf:about="http://community.sti2.at/spec.rdf">
<dc:title>Semantic Technology Companies vocabulary</dc:title>
<dc:description>Semantic Technology Companies RDF vocabulary, described
using W3C RDF Schema and the Web Ontology Language.</dc:description>
</owl:Ontology>
<rdfs:Class rdf:about="http://community.sti2.at/spec.rdf#Vendor">
<rdfs:isDefinedBy rdf:resource="http://community.sti2.at/spec.rdf"/>
<rdfs:label>Vendor</rdfs:label>
<rdfs:comment>A vendor of Semantic Technologies.</rdfs:comment>
STI Innsbruck TR 2012-08-06
8
<rdfs:subClassOf rdf:resource="http://xmlns.com/foaf/0.1/Organization"/>
</rdfs:Class>
<rdfs:Class rdf:about="http://community.sti2.at/spec.rdf#Marketplace">
<rdfs:isDefinedBy rdf:resource="http://community.sti2.at/spec.rdf"/>
<rdfs:label>Marketplace</rdfs:label>
<rdfs:comment>A Semantic Data marketplace.</rdfs:comment>
<rdfs:subClassOf rdf:resource="http://xmlns.com/foaf/0.1/Organization"/>
</rdfs:Class>
<rdfs:Class rdf:about="http://community.sti2.at/spec.rdf#User">
<rdfs:isDefinedBy rdf:resource="http://community.sti2.at/spec.rdf"/>
<rdfs:label>User</rdfs:label>
<rdfs:comment>A user of Semantic Technologies.</rdfs:comment>
<rdfs:subClassOf rdf:resource="http://xmlns.com/foaf/0.1/Organization"/>
</rdfs:Class>
<rdf:Property rdf:about="http://community.sti2.at/spec.rdf#companyProperty">
<rdfs:label>Super property of Semantic Technology Companies</rdfs:label>
</rdf:Property>
<rdf:Property rdf:about="http://purl.org/dc/terms/description">
<rdfs:subPropertyOf
rdf:resource="http://community.sti2.at/spec.rdf#companyProperty"/>
</rdf:Property>
<rdf:Property rdf:about="http://xmlns.com/foaf/0.1/logo">
<rdfs:subPropertyOf
rdf:resource="http://community.sti2.at/spec.rdf#companyProperty"/>
</rdf:Property>
<rdf:Property rdf:about="http://xmlns.com/foaf/0.1/name">
<rdfs:subPropertyOf
rdf:resource="http://community.sti2.at/spec.rdf#companyProperty"/>
</rdf:Property>
<rdf:Property rdf:about="http://xmlns.com/foaf/0.1/homepage">
STI Innsbruck TR 2012-08-06
9
<rdfs:subPropertyOf
rdf:resource="http://community.sti2.at/spec.rdf#companyProperty"/>
</rdf:Property>
Crowdsourcing the data
As stated before, the platform encourages the users to extend this vocabulary
and therefore offers the possibility of adding new categories and new company
attributes. New attributes and categories are modeled as RDF properties having
certain domains, but as the determination of the range has proved to be a non-
trivial task in other applications like DBpedia11
, we decided to use simple
strings for the initial system. An extension on a later point is possible and
planned.
A basic profile of a company represented in RDF/XML can look like this:
<stc:Vendor rdf:about="DA0ED2E7-F1E8-4464-AE72-88F2755F5BFE">
<dc:description>Ontotext provides core semantic technology distinctive
for its performance, scale, and compliance with standards.
Its semantic database OWLIM is proven to be the fastest and most
scalable RDF(S)/OWL engine. Ontotext develops KIM - the most popular semantic
annotation and search platform. WSMO Studio and wsmo4j are at the heart of the
recent developments in Semantic WS and BPM.
Ontotext's technology delivers real-world applications in Life Sciences,
Web Search, BI, Telecommunications, Media Monitoring, Online Recruitment, and
other areas.</dc:description>
<foaf:name>Ontotext</foaf:name>
<foaf:homepage rdf:resource="http://www.ontotext.com/"/>
</stc:Vendor>
11
http://www.dbpedia.org
STI Innsbruck TR 2012-08-06
10
The example describes the company Ontotext. It is categorized into the
category stc:Vendor with the unique ID DA0ED2E7-F1E8-4464-AE72-
88F2755F5BFE. It has a company description (dc:description), a company legal
name (foaf:name) and a link to its Web site (foaf:homepage).
Accessing the data
The complete data set is accessible through a Web interface but also through a
SPARQL endpoint. The Web interface of the community platform allows the
user to browse through the list of company profiles, add new profiles or modify
existing ones. An additional search interface with filtering options for different
company categories simplifies the search for a specific company.
Users with experience in the query language SPARQL can use the provided
SPARQL endpoint for more sophisticated queries, or explore the complete data
set as RDF dumb.
The status
The current dataset of the platform includes the company profiles of around
500 companies from all over the globe that are active in the field of semantic
technology. The data has been collected with the help of the community (e.g.
over the W3C Semantic Web mailing list) and by investigating the data of
contributors of research projects in the area of semantic technology.
STI Innsbruck TR 2012-08-06
11
Future Work
For the future, it might be interesting to allow users of the platform to
introduce existing vocabularies into the data set. So instead of having
companyProperty as an abstract description, users would be able to select
specific attributes for company properties. E.g. the address of a company might
be added as a Location in GoodRelations gr:location12
. This seems to be no
trivial task and thorough implementation.
Addressing recent problems with the technology behind the platform
(specifically Forest), it seems to be reasonable to change the technologies used.
It might be worth looking into the matter of using Drupal 7.x as a basis for a
re-implementation of the STI Community platform.
Drupal 7.x is an open source content management system (CMS) with a big
community that is very active in the semantic field. It offers out-of-the-box user
management, logging, versioning and RDFa support, and can be easily extended
e.g. with a SPARQL endpoint and other annotations. A big advantage would
be that it is more easily maintained than the current platform implementation.
12
http://www.heppnetz.de/ontologies/goodrelations/v1.html#Location

Mais conteúdo relacionado

Semelhante a Sti community – an approach for a semantically enhanced company platform 2012 09 11 0

Godfrey Rust - Rights.com - Linked Content Coalition 062013
Godfrey Rust - Rights.com - Linked Content Coalition 062013Godfrey Rust - Rights.com - Linked Content Coalition 062013
Godfrey Rust - Rights.com - Linked Content Coalition 062013
Europeana Licensing
 
How useful are Weblogs, RSS-Newsfeeds Wikis and Podcasting to information spe...
How useful are Weblogs, RSS-Newsfeeds Wikis and Podcasting to information spe...How useful are Weblogs, RSS-Newsfeeds Wikis and Podcasting to information spe...
How useful are Weblogs, RSS-Newsfeeds Wikis and Podcasting to information spe...
Michael Fanning
 

Semelhante a Sti community – an approach for a semantically enhanced company platform 2012 09 11 0 (20)

Intellectual Property in the Social Networking Space, 2004
Intellectual Property in the Social Networking Space, 2004Intellectual Property in the Social Networking Space, 2004
Intellectual Property in the Social Networking Space, 2004
 
Technology And Market Watch Tool Passas
Technology And Market Watch Tool PassasTechnology And Market Watch Tool Passas
Technology And Market Watch Tool Passas
 
Use of Open Data in Hong Kong (LegCo 2014)
Use of Open Data in Hong Kong (LegCo 2014)Use of Open Data in Hong Kong (LegCo 2014)
Use of Open Data in Hong Kong (LegCo 2014)
 
Godfrey Rust - Rights.com - Linked Content Coalition 062013
Godfrey Rust - Rights.com - Linked Content Coalition 062013Godfrey Rust - Rights.com - Linked Content Coalition 062013
Godfrey Rust - Rights.com - Linked Content Coalition 062013
 
Linked Open Data (LOD) Pilot Austria
Linked Open Data (LOD) Pilot AustriaLinked Open Data (LOD) Pilot Austria
Linked Open Data (LOD) Pilot Austria
 
Resources (Links) for 2016
Resources (Links) for 2016Resources (Links) for 2016
Resources (Links) for 2016
 
How useful are Weblogs, RSS-Newsfeeds Wikis and Podcasting to information spe...
How useful are Weblogs, RSS-Newsfeeds Wikis and Podcasting to information spe...How useful are Weblogs, RSS-Newsfeeds Wikis and Podcasting to information spe...
How useful are Weblogs, RSS-Newsfeeds Wikis and Podcasting to information spe...
 
Weaving SIOC into the Web of Linked Data
Weaving SIOC into the Web of Linked DataWeaving SIOC into the Web of Linked Data
Weaving SIOC into the Web of Linked Data
 
Thewiki4opentech
Thewiki4opentechThewiki4opentech
Thewiki4opentech
 
Linked Open Data Principles, Technologies and Examples
Linked Open Data Principles, Technologies and ExamplesLinked Open Data Principles, Technologies and Examples
Linked Open Data Principles, Technologies and Examples
 
Semic 2012 highlights report
Semic 2012 highlights report Semic 2012 highlights report
Semic 2012 highlights report
 
Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)
Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)
Semantics on services: the story so far (SALAD2015 keynote at ESWC2015)
 
Bibliotheken en cloud computing
Bibliotheken en cloud computingBibliotheken en cloud computing
Bibliotheken en cloud computing
 
Harvesting&Metadata Enrich Project EVA 2009
Harvesting&Metadata Enrich Project   EVA 2009Harvesting&Metadata Enrich Project   EVA 2009
Harvesting&Metadata Enrich Project EVA 2009
 
Intro to Web 3.0 and the Internet of Things
Intro to Web 3.0 and the Internet of ThingsIntro to Web 3.0 and the Internet of Things
Intro to Web 3.0 and the Internet of Things
 
Sharing Advisory Board newsletter #8
Sharing Advisory Board newsletter #8Sharing Advisory Board newsletter #8
Sharing Advisory Board newsletter #8
 
SIOC: Semantic Web for Social Media Sites
SIOC: Semantic Web for Social Media SitesSIOC: Semantic Web for Social Media Sites
SIOC: Semantic Web for Social Media Sites
 
Semantic Web 2.0: Creating Social Semantic Information Spaces
Semantic Web 2.0: Creating Social Semantic Information SpacesSemantic Web 2.0: Creating Social Semantic Information Spaces
Semantic Web 2.0: Creating Social Semantic Information Spaces
 
Tech Trans as Learning
Tech Trans as LearningTech Trans as Learning
Tech Trans as Learning
 
ESUR and Web 2.0 / social media.pptx
ESUR and Web 2.0 / social media.pptxESUR and Web 2.0 / social media.pptx
ESUR and Web 2.0 / social media.pptx
 

Mais de STIinnsbruck (20)

Unister
UnisterUnister
Unister
 
Twoo
TwooTwoo
Twoo
 
Twibes
TwibesTwibes
Twibes
 
Tweet deck 2012-01-02
Tweet deck 2012-01-02Tweet deck 2012-01-02
Tweet deck 2012-01-02
 
Tv handbook revised_100120141
Tv handbook revised_100120141Tv handbook revised_100120141
Tv handbook revised_100120141
 
Tv feratel 13032014
Tv feratel 13032014Tv feratel 13032014
Tv feratel 13032014
 
Tv evaluation 12032014
Tv evaluation 12032014Tv evaluation 12032014
Tv evaluation 12032014
 
T vb publication_rules_11032014
T vb publication_rules_11032014T vb publication_rules_11032014
T vb publication_rules_11032014
 
T vb mapping_implementation_25032014
T vb mapping_implementation_25032014T vb mapping_implementation_25032014
T vb mapping_implementation_25032014
 
T vb alignment_022814_0
T vb alignment_022814_0T vb alignment_022814_0
T vb alignment_022814_0
 
Ttr 20130701
Ttr 20130701Ttr 20130701
Ttr 20130701
 
Ttg mapping to_schema.org_
Ttg mapping to_schema.org_Ttg mapping to_schema.org_
Ttg mapping to_schema.org_
 
Ttb 08042014
Ttb 08042014Ttb 08042014
Ttb 08042014
 
Trust you
Trust youTrust you
Trust you
 
Tripwolf
TripwolfTripwolf
Tripwolf
 
Tripbirds
TripbirdsTripbirds
Tripbirds
 
Traveltainment
TraveltainmentTraveltainment
Traveltainment
 
Travelaudience
TravelaudienceTravelaudience
Travelaudience
 
Tourismuszukunft
TourismuszukunftTourismuszukunft
Tourismuszukunft
 
Tourismusverband innsbruck 24.09.2013
Tourismusverband innsbruck 24.09.2013Tourismusverband innsbruck 24.09.2013
Tourismusverband innsbruck 24.09.2013
 

Último

Uncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac FolorunsoUncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac Folorunso
Kayode Fayemi
 

Último (20)

SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, YardstickSaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
 
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
 
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdfThe workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
 
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
 
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdfAWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
 
Uncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac FolorunsoUncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac Folorunso
 
Dreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio IIIDreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio III
 
Causes of poverty in France presentation.pptx
Causes of poverty in France presentation.pptxCauses of poverty in France presentation.pptx
Causes of poverty in France presentation.pptx
 
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
 
Air breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animalsAir breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animals
 
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
 
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxMohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
 
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
 
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesVVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
 
My Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle BaileyMy Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle Bailey
 
Presentation on Engagement in Book Clubs
Presentation on Engagement in Book ClubsPresentation on Engagement in Book Clubs
Presentation on Engagement in Book Clubs
 
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
 
Dreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video TreatmentDreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video Treatment
 
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyCall Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
 
ICT role in 21st century education and it's challenges.pdf
ICT role in 21st century education and it's challenges.pdfICT role in 21st century education and it's challenges.pdf
ICT role in 21st century education and it's challenges.pdf
 

Sti community – an approach for a semantically enhanced company platform 2012 09 11 0

  • 1. STI – Semantic Technology Institute Innsbruck STI Community – An approach for a semantically enhanced company platform Carmen Brenner, Andreas Thalhammer, and Dieter Fensel STI Innsbruck Technical Report September 11, 2012 STI Innsbruck University of Innsbruck Technologiepark, Technikerstraße 21 6020 Innsbruck Austria http://www.sti-innsbruck.at/
  • 2. STI Innsbruck TR 2012-08-06 2 This technical report will introduce the background and the goal of the STI International Community platform, an open online platform with the goal to collect information on company and business entities that are active in the field of semantic technologies. Based on the ideas of Crowdsourcing the data is created and enriched by the semantic community itself therefore introducing the community's knowledge into the dataset. The platform is accessible at http://community.sti2.at.
  • 3. STI Innsbruck TR 2012-08-06 3 Introduction ............................................................................................4   The platform structure..............................................................................5   Technologies ............................................................................ 6   Vocabulary of the platform ..................................................... 6   Crowdsourcing the data........................................................... 9   Accessing the data..................................................................10   The status .............................................................................................10  
  • 4. STI Innsbruck TR 2012-08-06 4 Introduction The interest in and the number of applications enriched by semantic technology have grown fast in the last years. No longer are terms like Semantic Web, Big Data or Linked Open Data only associated with research, but semantic technology is on the cusp of mainstream adoption and represents a huge potential for advances in all areas of ICT. But it has also become more and more difficult for interested parties to keep track of all these companies that try to bring semantic technology into our daily lives with a variety of products. Information on companies that are active in the field of semantic technologies is usually diverse and widely distributed, and hard to get by. Additionally, in many cases, the data on a company or a product is not provided in a machine- readable format like RDF, but rather in an only human-readable way. Out of this dilemma the idea was born to create a list of companies that are active in the field of semantic technology in order to have a central point where interested parties can easily browse for companies and their products. We initiated the project in December 2010 collecting information on companies working in the semantic technology field and the first draft was publicly announced to the semantic technology community on December 20, 2010 over the W3C Semantic Web mailing list1 . The feedback received was huge and mostly very positive, and a great number of companies were proposed to be added to the list2 . 1 http://lists.w3.org/Archives/Public/semantic-web/ 2 The whole conversation on the W3C Semantic Web mailing lists is available at: http://lists.w3.org/Archives/Public/semantic-web/2010Dec/0190.html
  • 5. STI Innsbruck TR 2012-08-06 5 Such change and update requests to the list were numerous, which showed that a simple list that only offers a short description of the companies with a link to their Web sites as it started out at the beginning was simply not enough. There was demand to allow the semantics community to easily add new companies or edit existing profiles, and therefore maintain their public data themselves. The idea to build an open online platform was therefore initiated in co- operation with STI Research and Consultancy3 . The focus and vision of STI Research and Consultancy is to ensure that semantics will become part of everyday life, and such an open online platform was a huge step in supporting the community. The platform structure This section will shortly describe the structure of the platform including the technologies used and the vocabulary. The STI Community platform was designed to be an online platform with the goals 1. To collect data on companies that are active in the field of semantic technologies, 2. To support developers and vendors of semantic technology in their wish to position their businesses and products in the Internet, and 3. To give users who are searching for semantic applications and semantic technology vendors a place to start their search. 3 http://research.sti2.org
  • 6. STI Innsbruck TR 2012-08-06 6 The term “open” in this context refers to the concept of Crowdsourcing4 where the profiles of companies are added and maintained by the community. Technologies The STI International Community platform makes by itself heavy use of semantic technology. All information on companies is stored as data triples in an underlying triple store. For this purpose we use OWLIM5 , a well-known semantic repository that supports the load of tens of billions of RDF statements, uses non-trivial inference and delivers outstanding multi-user query performance. It is entirely developed in Java and uses Forest6 , a lightweight web interface that brings together tools for execution of SPARQL queries, visual exploration of RDF triples, and full text. Forest allows easy connectivity to the underlying OWLIM triple store via Sesame a de-facto standard framework for processing RDF data that includes parsing, storing, inferencing and querying of/over such data. Vocabulary of the platform The vocabulary used to describe companies is based on several existing and well-distributed vocabularies including RDF Schema 7 , Friend of a Friend (FOAF)8 , Dublin Core9 and GoodRelations10 , but has also a small vocabulary of its own where the category and the attributes of a company are specified. The 4 http://en.wikipedia.org/wiki/Crowdsourcing 5 http://www.ontotext.com/owlim 6 http://www.ontotext.com/ 7 http://www.w3.org/TR/rdf-schema/ 8 http://www.foaf-project.org/ 9 http://dublincore.org/documents/dcmi-terms/ 10 http://www.heppnetz.de/ontologies/goodrelations/v1.html
  • 7. STI Innsbruck TR 2012-08-06 7 mini-vocabulary of the STI International Community platform specifies the different categories and the properties a company can have. Company categories (e.g. Vendor, User or Marketplace) are in this vocabulary defined as subclasses of RDF Schema’s Organization. We have pre-defined the following company attributes: companyProperty, description, logo, name and homepage. companyProperty will used by the platform to describe all user generated company attributes. In the following your will see the XML for the vocabulary as described above. <?xml version="1.0" encoding="utf-8"?> <rdf:RDF xml:base="http://community.sti2.at/company/" xmlns:dc="http://purl.org/dc/terms/" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns:stc="http://community.sti2.at/spec.rdf#" xmlns:xhv="http://www.w3.org/1999/xhtml/vocab#" xmlns:xml="http://www.w3.org/XML/1998/namespace" xmlns:owl="http://www.w3.org/2002/07/owl"> <owl:Ontology rdf:about="http://community.sti2.at/spec.rdf"> <dc:title>Semantic Technology Companies vocabulary</dc:title> <dc:description>Semantic Technology Companies RDF vocabulary, described using W3C RDF Schema and the Web Ontology Language.</dc:description> </owl:Ontology> <rdfs:Class rdf:about="http://community.sti2.at/spec.rdf#Vendor"> <rdfs:isDefinedBy rdf:resource="http://community.sti2.at/spec.rdf"/> <rdfs:label>Vendor</rdfs:label> <rdfs:comment>A vendor of Semantic Technologies.</rdfs:comment>
  • 8. STI Innsbruck TR 2012-08-06 8 <rdfs:subClassOf rdf:resource="http://xmlns.com/foaf/0.1/Organization"/> </rdfs:Class> <rdfs:Class rdf:about="http://community.sti2.at/spec.rdf#Marketplace"> <rdfs:isDefinedBy rdf:resource="http://community.sti2.at/spec.rdf"/> <rdfs:label>Marketplace</rdfs:label> <rdfs:comment>A Semantic Data marketplace.</rdfs:comment> <rdfs:subClassOf rdf:resource="http://xmlns.com/foaf/0.1/Organization"/> </rdfs:Class> <rdfs:Class rdf:about="http://community.sti2.at/spec.rdf#User"> <rdfs:isDefinedBy rdf:resource="http://community.sti2.at/spec.rdf"/> <rdfs:label>User</rdfs:label> <rdfs:comment>A user of Semantic Technologies.</rdfs:comment> <rdfs:subClassOf rdf:resource="http://xmlns.com/foaf/0.1/Organization"/> </rdfs:Class> <rdf:Property rdf:about="http://community.sti2.at/spec.rdf#companyProperty"> <rdfs:label>Super property of Semantic Technology Companies</rdfs:label> </rdf:Property> <rdf:Property rdf:about="http://purl.org/dc/terms/description"> <rdfs:subPropertyOf rdf:resource="http://community.sti2.at/spec.rdf#companyProperty"/> </rdf:Property> <rdf:Property rdf:about="http://xmlns.com/foaf/0.1/logo"> <rdfs:subPropertyOf rdf:resource="http://community.sti2.at/spec.rdf#companyProperty"/> </rdf:Property> <rdf:Property rdf:about="http://xmlns.com/foaf/0.1/name"> <rdfs:subPropertyOf rdf:resource="http://community.sti2.at/spec.rdf#companyProperty"/> </rdf:Property> <rdf:Property rdf:about="http://xmlns.com/foaf/0.1/homepage">
  • 9. STI Innsbruck TR 2012-08-06 9 <rdfs:subPropertyOf rdf:resource="http://community.sti2.at/spec.rdf#companyProperty"/> </rdf:Property> Crowdsourcing the data As stated before, the platform encourages the users to extend this vocabulary and therefore offers the possibility of adding new categories and new company attributes. New attributes and categories are modeled as RDF properties having certain domains, but as the determination of the range has proved to be a non- trivial task in other applications like DBpedia11 , we decided to use simple strings for the initial system. An extension on a later point is possible and planned. A basic profile of a company represented in RDF/XML can look like this: <stc:Vendor rdf:about="DA0ED2E7-F1E8-4464-AE72-88F2755F5BFE"> <dc:description>Ontotext provides core semantic technology distinctive for its performance, scale, and compliance with standards. Its semantic database OWLIM is proven to be the fastest and most scalable RDF(S)/OWL engine. Ontotext develops KIM - the most popular semantic annotation and search platform. WSMO Studio and wsmo4j are at the heart of the recent developments in Semantic WS and BPM. Ontotext's technology delivers real-world applications in Life Sciences, Web Search, BI, Telecommunications, Media Monitoring, Online Recruitment, and other areas.</dc:description> <foaf:name>Ontotext</foaf:name> <foaf:homepage rdf:resource="http://www.ontotext.com/"/> </stc:Vendor> 11 http://www.dbpedia.org
  • 10. STI Innsbruck TR 2012-08-06 10 The example describes the company Ontotext. It is categorized into the category stc:Vendor with the unique ID DA0ED2E7-F1E8-4464-AE72- 88F2755F5BFE. It has a company description (dc:description), a company legal name (foaf:name) and a link to its Web site (foaf:homepage). Accessing the data The complete data set is accessible through a Web interface but also through a SPARQL endpoint. The Web interface of the community platform allows the user to browse through the list of company profiles, add new profiles or modify existing ones. An additional search interface with filtering options for different company categories simplifies the search for a specific company. Users with experience in the query language SPARQL can use the provided SPARQL endpoint for more sophisticated queries, or explore the complete data set as RDF dumb. The status The current dataset of the platform includes the company profiles of around 500 companies from all over the globe that are active in the field of semantic technology. The data has been collected with the help of the community (e.g. over the W3C Semantic Web mailing list) and by investigating the data of contributors of research projects in the area of semantic technology.
  • 11. STI Innsbruck TR 2012-08-06 11 Future Work For the future, it might be interesting to allow users of the platform to introduce existing vocabularies into the data set. So instead of having companyProperty as an abstract description, users would be able to select specific attributes for company properties. E.g. the address of a company might be added as a Location in GoodRelations gr:location12 . This seems to be no trivial task and thorough implementation. Addressing recent problems with the technology behind the platform (specifically Forest), it seems to be reasonable to change the technologies used. It might be worth looking into the matter of using Drupal 7.x as a basis for a re-implementation of the STI Community platform. Drupal 7.x is an open source content management system (CMS) with a big community that is very active in the semantic field. It offers out-of-the-box user management, logging, versioning and RDFa support, and can be easily extended e.g. with a SPARQL endpoint and other annotations. A big advantage would be that it is more easily maintained than the current platform implementation. 12 http://www.heppnetz.de/ontologies/goodrelations/v1.html#Location