SlideShare uma empresa Scribd logo
1 de 35
Some thoughts about the gaps across
languages and domains
through the experience on building the
core common vocabularies
Hideaki Takeda
National Institute of Informatics
takeda@nii.ac.jp
Glocal KO Workshop, Thursday August 13, 2015, Copenhagen
Who am I?
Hideaki Takeda, Dr., Eng.
• Professor, National Institute of Informatics
– Research Institute mainly for Computer Science
• Background: Computer Science, in particular, Artificial
Intelligence
• Current interest: Semantic Web, Ontology, Linked Open
Data (LOD), Social Media Analysis
• Social activities
– President, Linked Open Data Initiative (NPO)
– Founder, Dbpedia Japanese Chapter
– Specialist, Information-technology Promotion Agency,
Japan (IPA)
– Chair, Japan Link Center (Registration Agency of
International DOI Foundation)
– Board, ORCID
Core Vocabularies
• Background
– Everything is on infosphere, i.e., web
– Lots of information, lots of data, lots of systems
• Problems
– Misunderstanding/mis-matching/”missing
links“ across different domains
– Gap between human and machines (computers)
Core Vocabularies
• Aim
– Increase interoperability of information/data
– Bridge human and machine understanding
• Target
– Governmental documents/data
• Method
– Define a set of concepts which bridge (human-
readable) terms and (computer-processable) symbols
(URIs)
– Starting from the most common concepts
Core Vocabularies
• Activities worldwide
– USA: NIEM Core
• NIEM (National Information Exchange Model)
– Europe: ISA Core Vocabularies
– UN: United Nations Centre for Trade Facilitation
and Electronic Business (UN/CEFACT)
• Core Components Library (UN/CCL)
– Japan: IMI Core Vocabulary
ISA Core Vocabularies v 1.1
NIEM Architecture
http://niem.github.io/technical/iepd-versions/
NIEM
http://reference.niem.gov/niem/guidance/user-guide/vol1/user-guide-vol1.pdf
http://www.epa.gov/oei/symposium/2010/roy.pdf
IMI Project
• Supported by
– Ministry of Economy, Trade,
and Industry, Japan
• Technical Framework
– Data Model
– Core Vocabulary
– Design Rules
• Support Framework
– Tools
• for data developer
• for schema developer
– Database
• schema / tools / templates/ …
Person Type
Name
Gender
Gender Code
Birth Date
Address
…
Name Type
Type
Name
Family Name
Given Name
…
Address Type
Type
Notation
Zip Code
Prefecture
City
…
String
String
String
Code TypeString
String
String
String
String
String
Code Type
Type
Value
Name Type
Address Type
Codelist Type
String
Thing Type
10
IMI as a template for schema
Registration form for Confere
Name:
Address:
Gender:
Affiliation:
Affiliation
Address:
Attending date: -
M /
Person Type
Name
Gender
Gender Code
Birth Date
Address
…
Name Type
Type
Name
Family Name
Given Name
…
Address Type
Type
Notation
Zip Code
Prefecture
City
…
String
String
String
Code Type
String String
String
String
String
String
Code Type
Type
Value
Name Type
Address Type
Codelist Type
String
Thing Type
IMI Individual Form
Person Type
Name
Gender
Address
Affiliation
Name Type
Name
Address Type
Notation
Zip-code
String
String
String
String
Name
Address
Org.
Person
Date
Event Participation Type
Participant
Date
Design Schema
Remove unnecessary items
Add necessary items
Roles of IMI
• Structured concept dictionary
– Concept dictionary
• Terms as notation of concepts
– The entry is concept, not term
• Class concept and relation concept
• General-specific relation
– Structured dictionary
• Concepts form a network of concepts which in tern represents
meaning of individual concepts
• A class concept consists of relation concepts representing
attributes and general/specific relations
• A relation concept consists of class concepts connected as
domains and ranges and general/specific relations
• Template for schemata
– Add or remove items for the specific needs
Use of IMI
• Define the concept model
• “Serialize” it into specific “physical” forms
• Use suitable a physical form
IMI Concept
Model
RDF XML
Natural
Language Form
For Open Data For data exchange For spread sheets and documents
• Relax definition
• Interoperability
with other open data
schemata
• Strict definition
• Interoperability with DB
schemata
• Relax definition with simple
structure
• Readability by humans
IMI Core vocabulary v2.2
• Published on Feb.3 2015
• 48 core class terms
– person, address, facility, location, date, …
• 206 core property terms
– name of person, birth date, birth country, …
• Multi format
– rdf schema, xml schema
and documents for human
http://imi.ipa.go.jp/ns/core/2/ 14
Class definition (person class)
person 人
説明:人の情報を表現するためのデータ型 Data Type to describe a person
継承(inherit from) : ic:実体型
property Data type cardinality 説明 (ja) Description (en)
ID ID ic:ID型 0..n ID Identification of a Person
Name of person 氏名 ic:氏名型 0..n 氏名 Name of a Person
Gender 性別 xsd:string 0..1 性別の表記 Gender of a Person
Gender code 性別コード ic:コード型 0..1 性別コード Gender of a Person
Birth date 生年月日 ic:日付型 0..1 生年月日 Date of Birth of a Person
Death date 死亡年月日 ic:日付型 0..1 死亡年月日 Date of Death of a Person
Residence address 住所 ic:住所型 0..n 現住所 Present address of a Person
Domicile of origin 本籍 ic:住所型 0..1 本籍 Legal residence address of a Person
Contact information 連絡先 ic:連絡先型 0..n 連絡先 Contact information of a Person
Nationality 国籍 xsd:string 0..n 国籍の表記
A county that assigns rights, duties, and
privileges to a person because of the birth or
naturalization of the person in that country.
Nationality code 国籍コード ic:コード型 0..n
住民基本台帳で利用さ
れている国籍コード
A county that assigns rights, duties, and
privileges to a person because of the birth or
naturalization of the person in that country.
Birth country 出生国 xsd:string 0..1 生まれた国名 A location where a person was born.
Birth country code 出生国コード ic:コード型 0..1 生まれた国のコード A location where a person was born.
Birth place 出生地 ic:住所型 0..1 生まれた場所 A location where a person was born.
16
Class Structure
person 人
name ic:氏名型
Contact ic:連絡先型
: :
氏名
Family name xsd:string
Romanized Family name xsd:string
: :
contact 連絡先
Phone number ic:電話番号型
Address ic:住所型
: :
電話番号
: :
address 住所
Country xsd:string
Prefecture xsd:string
: :
 A class term has a property term as a sub element and the property term can refer a class
term. Again, the class term has a list of property terms. That constructs a layered structure
of terms as the following figure.
phone number
name
Concept of the IMI framework
International interoperability is highly
considered in preparing IMI.
Core
Vocabulary
Shelter
Location
Hospital
Station
Geographical Space
/Facilities
Transportation
Disaster
Prevention
Finance
Domain-specific
Vocabularies
Disaster
Restoration
Cost
Cross Domain
Vocabulary
IMI
Japanese
Local
government
Standard
(APPLIC)
DE fact
Standards
(DC, foaf,
etc)
NIEM
(US)
ISA
(EU)
Schema.org
18
Mapping between concepts in
different core vocabularies
• Difficulty of concept-concept mapping
– Matching of meaning tends to be very abstract
discussion
Concept
reference
Ontology
Real world
Concept
reference
?
Mapping between concepts in
different core vocabularies
• Difficulty of concept-concept mapping
– Matching of meaning tends to be very abstract
discussion
– Matching of references is easier
Concept
reference
Ontology
Real world
Concept
reference
?
Mapping between concepts in
different core vocabularies
• Difficulty of concept-concept mapping
– Syntactical mapping vs. semantic mapping
• Just consider what it refers in the real world, not how it
is represented in systems.
Concept
reference
Ontology
Concept
reference
?
Systems World
Cognitive World
Person
person 人
説明:人の情報を表現するためのデータ型 Data Type to describe a
person
継承(inherit from) : ic:実体型
prop
erty
Data
type
cardi
nalit
y
説明 (ja) Description (en)
ID ID ic:ID型 0..n ID Identification of a Person
Name of
person
氏名
ic:氏名
型
0..n 氏名 Name of a Person
Gender 性別
xsd:strin
g
0..1 性別の表記 Gender of a Person
ender code
性別
コード
ic:コード
型
0..1 性別コード Gender of a Person
Birth date
生年月
日
ic:日付
型
0..1 生年月日 Date of Birth of a Person
Death date
死亡年
月日
ic:日付
型
0..1 死亡年月日 Date of Death of a Person
Residence
address
住所
ic:住所
型
0..n 現住所 Present address of a Person
Domicile of
origin
本籍
ic:住所
型
0..1 本籍
Legal residence address of a
Person
Contact
nformation
連絡先
ic:連絡
先型
0..n 連絡先
Contact information of a
Person
Nationality 国籍
xsd:strin
g
0..n 国籍の表記
A county that assigns rights,
duties, and privileges to a
person because of the birth or
naturalization of the person in
that country.
住民基本台帳
A county that assigns rights,
duties, and privileges to a
?
?
Systems World
Cognitive World
Postal Code
?
?
“101-8430” ^^xsd:string “SW1A 0AA”@en
(postal code in Japan) (postal code in Europe)
Systems World
Cognitive World
Semantic Mapping
• Semantic Mapping
– Mapping on the cognitive layer
– Two ways of judging mapping
• Extensional Mapping
– Check whether ‘things’ are shared
– e.g., person
– Mostly for Class Mapping
• Intensional Mapping
– Check whether ‘values’ are shared
– e.g., postal-code
– Mostly for Property Mapping
• Syntactical Mapping
– Mapping on the systems layer
Types of matching: SKOS
• Exact Match
• Close Match
• Broad/Narrow Match
• Related Match
Close match
• Close match: nearly matched but not exactly
matched.
• Extensional mapping
– Coverage of ‘things’ are overlapped so much
• Coverage of ‘Country’ is slightly different
– ‘things’ are close
• Reference of ‘Person’ is slightly different (person vs. legal
Person)
• Intensional mapping
– Coverage of ‘values’ are overlapped so much
Broad match/narrow match
• Broad/narrow match
– One subsumes the other
• Extensional mapping
– Coverage of ‘things’ are subsumed, i.e., the subset
is exact match
• Intensional mapping
– Coverage of ‘values’ are subsumed, i.e., the subset
is exact match
More different matching
• Complicated match
– An element of a system matches a combination of
two or more elements.
– “Pathway” match
• A single property matches the combination of two or
more properties
– “Conditional” match
• An element matches the other element if some
condition is hold
IdentifierIssuingAuthority Link Has related match IMI ic:ID型.ic:ID体系.ic:発行者
LegalEntityRegisteredAddress Link Has broad match IMI ic:法人型.ic:住所 It is exact match if the value of ic:住所.種別 should be "登記住所".
Results
Core Vocabulary Identifier Link Mapping relation Data model Identifier
Address Link Has exact match IMI ic:住所型
AddressAddressArea Link Has narrow match IMI ic:住所型.ic:町名
AddressAddressArea Link Has narrow match IMI ic:住所型.ic:丁目
AddressAddressArea Link Has narrow match IMI ic:住所型.ic:番地補足
AddressAddressArea Link Has narrow match IMI ic:住所型.ic:番地
AddressAddressArea Link Has narrow match IMI ic:住所型.ic:号
AddressAddressID Link Has exact match IMI ic:住所型.ic:ID
AddressAdminUnitL1 Link Has exact match IMI ic:住所型.ic:国
AddressAdminUnitL2 Link Has narrow match IMI ic:住所型.ic:都道府県
AddressFullAddress Link Has exact match IMI ic:住所型.ic:表記
AddressLocatorDesignator Link Has narrow match IMI ic:住所型.ic:ビル番号
AddressLocatorDesignator Link Has narrow match IMI ic:住所型.ic:部屋番号
AddressLocatorName Link Has narrow match IMI ic:住所型.ic:ビル名
AddressPOBox Link Has related match IMI ic:住所型.ic:方書
AddressPostCode Link Has exact match IMI ic:住所型.ic:郵便番号
AddressPostName Link Has narrow match IMI ic:住所型.ic:市区町村
AddressPostName Link Has narrow match IMI ic:住所型.ic:区
AddressThoroughfare Link Has no match IMI
Agent Link Has exact match IMI ic:実体型
Results
Identifier Link Has exact match IMI ic:ID型
IdentifierIdentifier Link Has exact match IMI ic:ID型.ic:識別値
IdentifierIssueDate Link Has no match IMI
IdentifierIssuingAuthority Link Has related match IMI ic:ID型.ic:ID体系.ic:発行者
IdentifierIssuingAuthorityURI Link Has exact match IMI ic:ID型.ic:ID体系.ic:URI
IdentifierType Link Has no match IMI
JurisdictionIdentifier Link Has related match IMI ic:国籍コード
JurisdictionName Link Has related match IMI ic:国籍
LegalEntity Link Has exact match IMI ic:法人型
LegalEntityAddress Link Has broad match IMI ic:法人型.ic:住所
LegalEntityAlternativeName Link Has no match IMI
LegalEntityCompanyActivity Link Has close match IMI ic:法人型.ic:事業種目
LegalEntityCompanyStatus Link Has related match IMI ic:法人型.ic:活動状況
LegalEntityCompanyType Link Has exact match IMI ic:法人型.ic:組織種別
LegalEntityIdentifier Link Has exact match IMI ic:法人型.ic:ID
LegalEntityLegalIdentifier Link Has no match IMI
LegalEntityLegalName Link Has broad match IMI ic:法人型.ic:名称.表記
LegalEntityLocation Link Has related match IMI ic:法人型.ic:地物.説明
LegalEntityRegisteredAddress Link Has broad match IMI ic:法人型.ic:住所
Location Link Has exact match IMI ic:場所型
LocationAddress Link Has exact match IMI ic:場所型.ic:住所
LocationGeographicIdentifier Link Has broad match IMI ic:場所型.ic:地理識別子
LocationGeographicName Link Has exact match IMI ic:場所型.ic:名称.ic:表記
LocationGeometry Link Has exact match IMI ic:場所型.ic:地理座標
Results
Person Link Has exact match IMI ic:人型
PersonAddress Link Has exact match IMI ic:人型.ic:住所
PersonAlternativeName Link Has broad match IMI ic:人型.ic:氏名.ic:姓名
PersonBirthName Link Has broad match IMI ic:人型.ic:氏名.ic:姓名
PersonCitizenship Link Has no match IMI
PersonCountryOfBirth Link Has exact match IMI ic:人型.ic:出生国
PersonCountryOfDeath Link Has no match IMI
PersonDateOfBirth Link Has exact match IMI ic:人型.ic:生年月日
PersonDateOfDeath Link Has exact match IMI ic:人型.ic:死亡年月日
PersonFamilyName Link Has exact match IMI ic:人型.ic:氏名.ic:姓
PersonFullName Link Has exact match IMI ic:人型.ic:氏名.ic:姓名
PersonGender Link Has exact match IMI ic:人型.ic:性別コード
PersonGivenName Link Has exact match IMI ic:人型.ic:氏名.ic:名
PersonIdentifier Link Has broad match IMI ic:人型.ic:ID
PersonPatronymicName Link Has no match IMI ic:人型.ic:氏名.ic:姓名
PersonPlaceOfBirth Link Has narrow match IMI ic:人型.ic:出生地
Bridging core and domain vocabularies
(working in progress)
• Aim: Core vocabulary would be extended to
domain vocabularies
– Agriculture
– Finance
– Traffic
– …
• Task:
– Can concepts be shared between core and domains?
really?
Agricultural Activity Ontology (AAO)
Agricultural activity
crop production activity
activity for propagation
activity in the vegetative growth stage
activity in the reproductive growth stage
activity for environment control
activity for soil control
activity for climate control
activity for water control
activity for biotic control
activity for chemical control
post production activity
activity for harvesting
activity for processing
activity for extending shelf-life
activity for wrapping
indirect activity
activity for preparing materials
activity for cleaning
activity for transport
activity for monitoring
activity for maintaining farm equipment
administrative activity
activity for business administration
http://cavoc.org/aao/
An example: “activity” (and “event”)
• S: (n) activity (any specific behavior) "they avoided all recreational activity"
– direct hyponym / full hyponym
– direct hypernym / inherited hypernym / sister term
• S: (n) act, deed, human action, human activity (something that people do or cause to happen)
– S: (n) event (something that happens at a given place and time)
– [WordNet]
• Each activity is a Happening which involves volition and participants. It has
temporal dimension. It is distinguished from Events by the fact that the activity
does not trigger change of state and does not have a conceptual end point.
– [PROTON Extent module (a lightweight upper-level ontology)]
• Activity: This class represents the abstract content of an event, which may be
repeated many times, once or never. For example a training course, or a play.
– [The Event Programme Vocabulary (prog)]
• E5 Event
– Subclass of: E4 Period
– Superclass of: E7 Activity, E63 Beginning of Existence, E64 End of Existence
• E7 Activity
– Subclass of: E5 Event
– Superclass of: E8 Acquisition, E9 Move, E10 Transfer of Custody, E11 Modification,
E13 Attribute Assignment, E65 Creation …
– [CIDOC Conceptual Reference Model]
Summary
• Sharing concepts is a very long way
• No ground truth
– Step-by-step understanding of the world
– Careful consensus making
• More flexible framework is needed
– Simple mapping is not so happy

Mais conteúdo relacionado

Destaque

Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2
Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2
Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2Amrita Prasad
 
Marketing na Internet
Marketing na InternetMarketing na Internet
Marketing na Internetrenatofrigo
 
Mitos y errores en las relaciones de pareja
Mitos y errores en las relaciones de parejaMitos y errores en las relaciones de pareja
Mitos y errores en las relaciones de parejaAdmingac
 
Roteiro de estudo de caso simulação do processo de compras
Roteiro de estudo de caso simulação do processo de comprasRoteiro de estudo de caso simulação do processo de compras
Roteiro de estudo de caso simulação do processo de comprasAntonio Marcos Montai Messias
 
Problemas de aprendizaje
Problemas de aprendizajeProblemas de aprendizaje
Problemas de aprendizajeLISS
 
Seo proposal for tensator group
Seo proposal for tensator groupSeo proposal for tensator group
Seo proposal for tensator groupParixit Dwivedi
 
Aplicación de los estudios de Métodos y Tiempos
Aplicación de los estudios de Métodos y TiemposAplicación de los estudios de Métodos y Tiempos
Aplicación de los estudios de Métodos y TiemposZadecon
 
Conferencia: Como montar tu primera tienda online
Conferencia: Como montar tu primera tienda onlineConferencia: Como montar tu primera tienda online
Conferencia: Como montar tu primera tienda onlineDario Schilman
 

Destaque (14)

corripio
corripio corripio
corripio
 
Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2
Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2
Puzzle ITC Talk @Docker CH meetup CI CD_with_Openshift_0.2
 
Sigmund freud obras completas - lopez ballesteros
Sigmund freud   obras completas - lopez ballesterosSigmund freud   obras completas - lopez ballesteros
Sigmund freud obras completas - lopez ballesteros
 
Augmenter la satisfaction de l'utilisateur
Augmenter la satisfaction de l'utilisateurAugmenter la satisfaction de l'utilisateur
Augmenter la satisfaction de l'utilisateur
 
Manual dqp
Manual dqpManual dqp
Manual dqp
 
Estrategia nal. obesidad 1
Estrategia nal. obesidad 1Estrategia nal. obesidad 1
Estrategia nal. obesidad 1
 
Marketing na Internet
Marketing na InternetMarketing na Internet
Marketing na Internet
 
Mitos y errores en las relaciones de pareja
Mitos y errores en las relaciones de parejaMitos y errores en las relaciones de pareja
Mitos y errores en las relaciones de pareja
 
Roteiro de estudo de caso simulação do processo de compras
Roteiro de estudo de caso simulação do processo de comprasRoteiro de estudo de caso simulação do processo de compras
Roteiro de estudo de caso simulação do processo de compras
 
Problemas de aprendizaje
Problemas de aprendizajeProblemas de aprendizaje
Problemas de aprendizaje
 
Seo proposal for tensator group
Seo proposal for tensator groupSeo proposal for tensator group
Seo proposal for tensator group
 
Aplicación de los estudios de Métodos y Tiempos
Aplicación de los estudios de Métodos y TiemposAplicación de los estudios de Métodos y Tiempos
Aplicación de los estudios de Métodos y Tiempos
 
Conferencia: Como montar tu primera tienda online
Conferencia: Como montar tu primera tienda onlineConferencia: Como montar tu primera tienda online
Conferencia: Como montar tu primera tienda online
 
Mai2010 einladung doktorandenkolloquium
Mai2010 einladung doktorandenkolloquiumMai2010 einladung doktorandenkolloquium
Mai2010 einladung doktorandenkolloquium
 

Semelhante a Bridging gaps across languages and domains through core vocabularies

141021 ipa global vocab
141021 ipa global vocab141021 ipa global vocab
141021 ipa global vocabKenji Hiramoto
 
Semtech 2011, Saltlux, Tony Lee
Semtech 2011, Saltlux, Tony LeeSemtech 2011, Saltlux, Tony Lee
Semtech 2011, Saltlux, Tony LeeSaltlux Inc.
 
SemTech 2011, Saltlux, Tony Lee
SemTech 2011, Saltlux, Tony LeeSemTech 2011, Saltlux, Tony Lee
SemTech 2011, Saltlux, Tony LeeSaltlux Inc.
 
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.dri_ireland
 
IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)Marina Santini
 
Construction of Authority Information for Personal Names Focused on the Forme...
Construction of Authority Information for Personal Names Focused on the Forme...Construction of Authority Information for Personal Names Focused on the Forme...
Construction of Authority Information for Personal Names Focused on the Forme...tmra
 
Introduction to Application Profiles
Introduction to Application ProfilesIntroduction to Application Profiles
Introduction to Application ProfilesDiane Hillmann
 
PPT slides
PPT slidesPPT slides
PPT slidesbutest
 
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...Leon Derczynski
 
Text mining introduction-1
Text mining   introduction-1Text mining   introduction-1
Text mining introduction-1Sumit Sony
 
What Business Innovators Need to Know about Content Analytics
What Business Innovators Need to Know about Content AnalyticsWhat Business Innovators Need to Know about Content Analytics
What Business Innovators Need to Know about Content AnalyticsSeth Grimes
 
Effective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsEffective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsAndre Freitas
 
Web 3 Expert System
Web 3 Expert SystemWeb 3 Expert System
Web 3 Expert Systemguest4513a7
 
Web 3 Expert System
Web 3 Expert SystemWeb 3 Expert System
Web 3 Expert SystemMediabistro
 
Leveraging Semantic Parsing for Relation Linking over Knowledge Bases
Leveraging Semantic Parsing for Relation Linking over Knowledge BasesLeveraging Semantic Parsing for Relation Linking over Knowledge Bases
Leveraging Semantic Parsing for Relation Linking over Knowledge BasesNandana Mihindukulasooriya
 
Topic models, vector semantics and applications
Topic models, vector semantics and applicationsTopic models, vector semantics and applications
Topic models, vector semantics and applicationsVasileios Lampos
 
Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)Bhaskar Mitra
 
Broad Twitter Corpus: A Diverse Named Entity Recognition Resource
Broad Twitter Corpus: A Diverse Named Entity Recognition ResourceBroad Twitter Corpus: A Diverse Named Entity Recognition Resource
Broad Twitter Corpus: A Diverse Named Entity Recognition ResourceLeon Derczynski
 
Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)Bianca Pereira
 

Semelhante a Bridging gaps across languages and domains through core vocabularies (20)

141021 ipa global vocab
141021 ipa global vocab141021 ipa global vocab
141021 ipa global vocab
 
Semtech 2011, Saltlux, Tony Lee
Semtech 2011, Saltlux, Tony LeeSemtech 2011, Saltlux, Tony Lee
Semtech 2011, Saltlux, Tony Lee
 
SemTech 2011, Saltlux, Tony Lee
SemTech 2011, Saltlux, Tony LeeSemTech 2011, Saltlux, Tony Lee
SemTech 2011, Saltlux, Tony Lee
 
NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...
NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...
NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...
 
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
 
IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)
 
Construction of Authority Information for Personal Names Focused on the Forme...
Construction of Authority Information for Personal Names Focused on the Forme...Construction of Authority Information for Personal Names Focused on the Forme...
Construction of Authority Information for Personal Names Focused on the Forme...
 
Introduction to Application Profiles
Introduction to Application ProfilesIntroduction to Application Profiles
Introduction to Application Profiles
 
PPT slides
PPT slidesPPT slides
PPT slides
 
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
 
Text mining introduction-1
Text mining   introduction-1Text mining   introduction-1
Text mining introduction-1
 
What Business Innovators Need to Know about Content Analytics
What Business Innovators Need to Know about Content AnalyticsWhat Business Innovators Need to Know about Content Analytics
What Business Innovators Need to Know about Content Analytics
 
Effective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsEffective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP Systems
 
Web 3 Expert System
Web 3 Expert SystemWeb 3 Expert System
Web 3 Expert System
 
Web 3 Expert System
Web 3 Expert SystemWeb 3 Expert System
Web 3 Expert System
 
Leveraging Semantic Parsing for Relation Linking over Knowledge Bases
Leveraging Semantic Parsing for Relation Linking over Knowledge BasesLeveraging Semantic Parsing for Relation Linking over Knowledge Bases
Leveraging Semantic Parsing for Relation Linking over Knowledge Bases
 
Topic models, vector semantics and applications
Topic models, vector semantics and applicationsTopic models, vector semantics and applications
Topic models, vector semantics and applications
 
Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)
 
Broad Twitter Corpus: A Diverse Named Entity Recognition Resource
Broad Twitter Corpus: A Diverse Named Entity Recognition ResourceBroad Twitter Corpus: A Diverse Named Entity Recognition Resource
Broad Twitter Corpus: A Diverse Named Entity Recognition Resource
 
Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)Reading Group 2013 (DERI NUIG)
Reading Group 2013 (DERI NUIG)
 

Mais de National Institute of Informatics (NII)

趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)
趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)
趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)National Institute of Informatics (NII)
 
趙簡単LOD入門 〜デジタル庁をデジタル化する〜
趙簡単LOD入門 〜デジタル庁をデジタル化する〜趙簡単LOD入門 〜デジタル庁をデジタル化する〜
趙簡単LOD入門 〜デジタル庁をデジタル化する〜National Institute of Informatics (NII)
 
セマンティックWeb技術を用いた農業分野の標準語彙の構築
セマンティックWeb技術を用いた農業分野の標準語彙の構築セマンティックWeb技術を用いた農業分野の標準語彙の構築
セマンティックWeb技術を用いた農業分野の標準語彙の構築National Institute of Informatics (NII)
 
NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ
NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ
NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ National Institute of Informatics (NII)
 
Presenting and Preserving the Change in Taxonomic Knowledge for Linked Data
Presenting and Preserving the Change in Taxonomic Knowledge for Linked DataPresenting and Preserving the Change in Taxonomic Knowledge for Linked Data
Presenting and Preserving the Change in Taxonomic Knowledge for Linked DataNational Institute of Informatics (NII)
 
共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜
共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜
共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜National Institute of Informatics (NII)
 
研究データ利活用に関する国内活動及び国際動向について
研究データ利活用に関する国内活動及び国際動向について研究データ利活用に関する国内活動及び国際動向について
研究データ利活用に関する国内活動及び国際動向についてNational Institute of Informatics (NII)
 

Mais de National Institute of Informatics (NII) (20)

趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)
趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)
趙簡単LOD入門 〜デジタル庁をデジタル化する〜 (改訂版)
 
趙簡単LOD入門 〜デジタル庁をデジタル化する〜
趙簡単LOD入門 〜デジタル庁をデジタル化する〜趙簡単LOD入門 〜デジタル庁をデジタル化する〜
趙簡単LOD入門 〜デジタル庁をデジタル化する〜
 
"分人"型社会とAI
"分人"型社会とAI"分人"型社会とAI
"分人"型社会とAI
 
セマンティックWeb技術を用いた農業分野の標準語彙の構築
セマンティックWeb技術を用いた農業分野の標準語彙の構築セマンティックWeb技術を用いた農業分野の標準語彙の構築
セマンティックWeb技術を用いた農業分野の標準語彙の構築
 
研究オープンデータにおける大学と研究者の役割
研究オープンデータにおける大学と研究者の役割研究オープンデータにおける大学と研究者の役割
研究オープンデータにおける大学と研究者の役割
 
NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ
NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ
NII研究100連発 ウェブと人工知能の融合 -人間の創造性を刺激するコンピュータ
 
Presenting and Preserving the Change in Taxonomic Knowledge for Linked Data
Presenting and Preserving the Change in Taxonomic Knowledge for Linked DataPresenting and Preserving the Change in Taxonomic Knowledge for Linked Data
Presenting and Preserving the Change in Taxonomic Knowledge for Linked Data
 
Crop vocabulary (CVO): Core vocabulary of crop names
Crop vocabulary (CVO): Core vocabulary of crop namesCrop vocabulary (CVO): Core vocabulary of crop names
Crop vocabulary (CVO): Core vocabulary of crop names
 
ORCIDとオープンサイエンス
ORCIDとオープンサイエンスORCIDとオープンサイエンス
ORCIDとオープンサイエンス
 
How to build ontologies - a case study of Agriculture Activity Ontology
How to build ontologies - a case study of Agriculture Activity OntologyHow to build ontologies - a case study of Agriculture Activity Ontology
How to build ontologies - a case study of Agriculture Activity Ontology
 
LODとオープンデータ (DBpediaとIMIの周辺を中心に)
LODとオープンデータ(DBpediaとIMIの周辺を中心に)LODとオープンデータ(DBpediaとIMIの周辺を中心に)
LODとオープンデータ (DBpediaとIMIの周辺を中心に)
 
共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜
共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜
共通語彙の構築の基本的な考え方と方法 〜研究データのために語彙・スキーマを作るには〜
 
Working with Global Infrastructure at a National Level
Working with Global Infrastructure at a National LevelWorking with Global Infrastructure at a National Level
Working with Global Infrastructure at a National Level
 
Activities of JaLC as a national service
Activities of JaLC as a national serviceActivities of JaLC as a national service
Activities of JaLC as a national service
 
Development and Application of Agriculture Ontologies
Development and Application of Agriculture Ontologies Development and Application of Agriculture Ontologies
Development and Application of Agriculture Ontologies
 
Design Process of Agriculture Ontologies
Design Process of Agriculture OntologiesDesign Process of Agriculture Ontologies
Design Process of Agriculture Ontologies
 
AIの未来 ~技術と社会の関係のダイナミクス~
AIの未来~技術と社会の関係のダイナミクス~AIの未来~技術と社会の関係のダイナミクス~
AIの未来 ~技術と社会の関係のダイナミクス~
 
Towards Knowledge-Enabled Society
Towards Knowledge-Enabled SocietyTowards Knowledge-Enabled Society
Towards Knowledge-Enabled Society
 
研究データ利活用に関する国内活動及び国際動向について
研究データ利活用に関する国内活動及び国際動向について研究データ利活用に関する国内活動及び国際動向について
研究データ利活用に関する国内活動及び国際動向について
 
オープンサイエンスとオープンデータ
オープンサイエンスとオープンデータオープンサイエンスとオープンデータ
オープンサイエンスとオープンデータ
 

Último

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 

Último (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

Bridging gaps across languages and domains through core vocabularies

  • 1. Some thoughts about the gaps across languages and domains through the experience on building the core common vocabularies Hideaki Takeda National Institute of Informatics takeda@nii.ac.jp Glocal KO Workshop, Thursday August 13, 2015, Copenhagen
  • 2. Who am I? Hideaki Takeda, Dr., Eng. • Professor, National Institute of Informatics – Research Institute mainly for Computer Science • Background: Computer Science, in particular, Artificial Intelligence • Current interest: Semantic Web, Ontology, Linked Open Data (LOD), Social Media Analysis • Social activities – President, Linked Open Data Initiative (NPO) – Founder, Dbpedia Japanese Chapter – Specialist, Information-technology Promotion Agency, Japan (IPA) – Chair, Japan Link Center (Registration Agency of International DOI Foundation) – Board, ORCID
  • 3. Core Vocabularies • Background – Everything is on infosphere, i.e., web – Lots of information, lots of data, lots of systems • Problems – Misunderstanding/mis-matching/”missing links“ across different domains – Gap between human and machines (computers)
  • 4. Core Vocabularies • Aim – Increase interoperability of information/data – Bridge human and machine understanding • Target – Governmental documents/data • Method – Define a set of concepts which bridge (human- readable) terms and (computer-processable) symbols (URIs) – Starting from the most common concepts
  • 5. Core Vocabularies • Activities worldwide – USA: NIEM Core • NIEM (National Information Exchange Model) – Europe: ISA Core Vocabularies – UN: United Nations Centre for Trade Facilitation and Electronic Business (UN/CEFACT) • Core Components Library (UN/CCL) – Japan: IMI Core Vocabulary
  • 6.
  • 10. IMI Project • Supported by – Ministry of Economy, Trade, and Industry, Japan • Technical Framework – Data Model – Core Vocabulary – Design Rules • Support Framework – Tools • for data developer • for schema developer – Database • schema / tools / templates/ … Person Type Name Gender Gender Code Birth Date Address … Name Type Type Name Family Name Given Name … Address Type Type Notation Zip Code Prefecture City … String String String Code TypeString String String String String String Code Type Type Value Name Type Address Type Codelist Type String Thing Type 10
  • 11. IMI as a template for schema Registration form for Confere Name: Address: Gender: Affiliation: Affiliation Address: Attending date: - M / Person Type Name Gender Gender Code Birth Date Address … Name Type Type Name Family Name Given Name … Address Type Type Notation Zip Code Prefecture City … String String String Code Type String String String String String String Code Type Type Value Name Type Address Type Codelist Type String Thing Type IMI Individual Form Person Type Name Gender Address Affiliation Name Type Name Address Type Notation Zip-code String String String String Name Address Org. Person Date Event Participation Type Participant Date Design Schema Remove unnecessary items Add necessary items
  • 12. Roles of IMI • Structured concept dictionary – Concept dictionary • Terms as notation of concepts – The entry is concept, not term • Class concept and relation concept • General-specific relation – Structured dictionary • Concepts form a network of concepts which in tern represents meaning of individual concepts • A class concept consists of relation concepts representing attributes and general/specific relations • A relation concept consists of class concepts connected as domains and ranges and general/specific relations • Template for schemata – Add or remove items for the specific needs
  • 13. Use of IMI • Define the concept model • “Serialize” it into specific “physical” forms • Use suitable a physical form IMI Concept Model RDF XML Natural Language Form For Open Data For data exchange For spread sheets and documents • Relax definition • Interoperability with other open data schemata • Strict definition • Interoperability with DB schemata • Relax definition with simple structure • Readability by humans
  • 14. IMI Core vocabulary v2.2 • Published on Feb.3 2015 • 48 core class terms – person, address, facility, location, date, … • 206 core property terms – name of person, birth date, birth country, … • Multi format – rdf schema, xml schema and documents for human http://imi.ipa.go.jp/ns/core/2/ 14
  • 15.
  • 16. Class definition (person class) person 人 説明:人の情報を表現するためのデータ型 Data Type to describe a person 継承(inherit from) : ic:実体型 property Data type cardinality 説明 (ja) Description (en) ID ID ic:ID型 0..n ID Identification of a Person Name of person 氏名 ic:氏名型 0..n 氏名 Name of a Person Gender 性別 xsd:string 0..1 性別の表記 Gender of a Person Gender code 性別コード ic:コード型 0..1 性別コード Gender of a Person Birth date 生年月日 ic:日付型 0..1 生年月日 Date of Birth of a Person Death date 死亡年月日 ic:日付型 0..1 死亡年月日 Date of Death of a Person Residence address 住所 ic:住所型 0..n 現住所 Present address of a Person Domicile of origin 本籍 ic:住所型 0..1 本籍 Legal residence address of a Person Contact information 連絡先 ic:連絡先型 0..n 連絡先 Contact information of a Person Nationality 国籍 xsd:string 0..n 国籍の表記 A county that assigns rights, duties, and privileges to a person because of the birth or naturalization of the person in that country. Nationality code 国籍コード ic:コード型 0..n 住民基本台帳で利用さ れている国籍コード A county that assigns rights, duties, and privileges to a person because of the birth or naturalization of the person in that country. Birth country 出生国 xsd:string 0..1 生まれた国名 A location where a person was born. Birth country code 出生国コード ic:コード型 0..1 生まれた国のコード A location where a person was born. Birth place 出生地 ic:住所型 0..1 生まれた場所 A location where a person was born. 16
  • 17. Class Structure person 人 name ic:氏名型 Contact ic:連絡先型 : : 氏名 Family name xsd:string Romanized Family name xsd:string : : contact 連絡先 Phone number ic:電話番号型 Address ic:住所型 : : 電話番号 : : address 住所 Country xsd:string Prefecture xsd:string : :  A class term has a property term as a sub element and the property term can refer a class term. Again, the class term has a list of property terms. That constructs a layered structure of terms as the following figure. phone number name
  • 18. Concept of the IMI framework International interoperability is highly considered in preparing IMI. Core Vocabulary Shelter Location Hospital Station Geographical Space /Facilities Transportation Disaster Prevention Finance Domain-specific Vocabularies Disaster Restoration Cost Cross Domain Vocabulary IMI Japanese Local government Standard (APPLIC) DE fact Standards (DC, foaf, etc) NIEM (US) ISA (EU) Schema.org 18
  • 19. Mapping between concepts in different core vocabularies • Difficulty of concept-concept mapping – Matching of meaning tends to be very abstract discussion Concept reference Ontology Real world Concept reference ?
  • 20. Mapping between concepts in different core vocabularies • Difficulty of concept-concept mapping – Matching of meaning tends to be very abstract discussion – Matching of references is easier Concept reference Ontology Real world Concept reference ?
  • 21. Mapping between concepts in different core vocabularies • Difficulty of concept-concept mapping – Syntactical mapping vs. semantic mapping • Just consider what it refers in the real world, not how it is represented in systems. Concept reference Ontology Concept reference ? Systems World Cognitive World
  • 22. Person person 人 説明:人の情報を表現するためのデータ型 Data Type to describe a person 継承(inherit from) : ic:実体型 prop erty Data type cardi nalit y 説明 (ja) Description (en) ID ID ic:ID型 0..n ID Identification of a Person Name of person 氏名 ic:氏名 型 0..n 氏名 Name of a Person Gender 性別 xsd:strin g 0..1 性別の表記 Gender of a Person ender code 性別 コード ic:コード 型 0..1 性別コード Gender of a Person Birth date 生年月 日 ic:日付 型 0..1 生年月日 Date of Birth of a Person Death date 死亡年 月日 ic:日付 型 0..1 死亡年月日 Date of Death of a Person Residence address 住所 ic:住所 型 0..n 現住所 Present address of a Person Domicile of origin 本籍 ic:住所 型 0..1 本籍 Legal residence address of a Person Contact nformation 連絡先 ic:連絡 先型 0..n 連絡先 Contact information of a Person Nationality 国籍 xsd:strin g 0..n 国籍の表記 A county that assigns rights, duties, and privileges to a person because of the birth or naturalization of the person in that country. 住民基本台帳 A county that assigns rights, duties, and privileges to a ? ? Systems World Cognitive World
  • 23. Postal Code ? ? “101-8430” ^^xsd:string “SW1A 0AA”@en (postal code in Japan) (postal code in Europe) Systems World Cognitive World
  • 24. Semantic Mapping • Semantic Mapping – Mapping on the cognitive layer – Two ways of judging mapping • Extensional Mapping – Check whether ‘things’ are shared – e.g., person – Mostly for Class Mapping • Intensional Mapping – Check whether ‘values’ are shared – e.g., postal-code – Mostly for Property Mapping • Syntactical Mapping – Mapping on the systems layer
  • 25. Types of matching: SKOS • Exact Match • Close Match • Broad/Narrow Match • Related Match
  • 26. Close match • Close match: nearly matched but not exactly matched. • Extensional mapping – Coverage of ‘things’ are overlapped so much • Coverage of ‘Country’ is slightly different – ‘things’ are close • Reference of ‘Person’ is slightly different (person vs. legal Person) • Intensional mapping – Coverage of ‘values’ are overlapped so much
  • 27. Broad match/narrow match • Broad/narrow match – One subsumes the other • Extensional mapping – Coverage of ‘things’ are subsumed, i.e., the subset is exact match • Intensional mapping – Coverage of ‘values’ are subsumed, i.e., the subset is exact match
  • 28. More different matching • Complicated match – An element of a system matches a combination of two or more elements. – “Pathway” match • A single property matches the combination of two or more properties – “Conditional” match • An element matches the other element if some condition is hold IdentifierIssuingAuthority Link Has related match IMI ic:ID型.ic:ID体系.ic:発行者 LegalEntityRegisteredAddress Link Has broad match IMI ic:法人型.ic:住所 It is exact match if the value of ic:住所.種別 should be "登記住所".
  • 29. Results Core Vocabulary Identifier Link Mapping relation Data model Identifier Address Link Has exact match IMI ic:住所型 AddressAddressArea Link Has narrow match IMI ic:住所型.ic:町名 AddressAddressArea Link Has narrow match IMI ic:住所型.ic:丁目 AddressAddressArea Link Has narrow match IMI ic:住所型.ic:番地補足 AddressAddressArea Link Has narrow match IMI ic:住所型.ic:番地 AddressAddressArea Link Has narrow match IMI ic:住所型.ic:号 AddressAddressID Link Has exact match IMI ic:住所型.ic:ID AddressAdminUnitL1 Link Has exact match IMI ic:住所型.ic:国 AddressAdminUnitL2 Link Has narrow match IMI ic:住所型.ic:都道府県 AddressFullAddress Link Has exact match IMI ic:住所型.ic:表記 AddressLocatorDesignator Link Has narrow match IMI ic:住所型.ic:ビル番号 AddressLocatorDesignator Link Has narrow match IMI ic:住所型.ic:部屋番号 AddressLocatorName Link Has narrow match IMI ic:住所型.ic:ビル名 AddressPOBox Link Has related match IMI ic:住所型.ic:方書 AddressPostCode Link Has exact match IMI ic:住所型.ic:郵便番号 AddressPostName Link Has narrow match IMI ic:住所型.ic:市区町村 AddressPostName Link Has narrow match IMI ic:住所型.ic:区 AddressThoroughfare Link Has no match IMI Agent Link Has exact match IMI ic:実体型
  • 30. Results Identifier Link Has exact match IMI ic:ID型 IdentifierIdentifier Link Has exact match IMI ic:ID型.ic:識別値 IdentifierIssueDate Link Has no match IMI IdentifierIssuingAuthority Link Has related match IMI ic:ID型.ic:ID体系.ic:発行者 IdentifierIssuingAuthorityURI Link Has exact match IMI ic:ID型.ic:ID体系.ic:URI IdentifierType Link Has no match IMI JurisdictionIdentifier Link Has related match IMI ic:国籍コード JurisdictionName Link Has related match IMI ic:国籍 LegalEntity Link Has exact match IMI ic:法人型 LegalEntityAddress Link Has broad match IMI ic:法人型.ic:住所 LegalEntityAlternativeName Link Has no match IMI LegalEntityCompanyActivity Link Has close match IMI ic:法人型.ic:事業種目 LegalEntityCompanyStatus Link Has related match IMI ic:法人型.ic:活動状況 LegalEntityCompanyType Link Has exact match IMI ic:法人型.ic:組織種別 LegalEntityIdentifier Link Has exact match IMI ic:法人型.ic:ID LegalEntityLegalIdentifier Link Has no match IMI LegalEntityLegalName Link Has broad match IMI ic:法人型.ic:名称.表記 LegalEntityLocation Link Has related match IMI ic:法人型.ic:地物.説明 LegalEntityRegisteredAddress Link Has broad match IMI ic:法人型.ic:住所 Location Link Has exact match IMI ic:場所型 LocationAddress Link Has exact match IMI ic:場所型.ic:住所 LocationGeographicIdentifier Link Has broad match IMI ic:場所型.ic:地理識別子 LocationGeographicName Link Has exact match IMI ic:場所型.ic:名称.ic:表記 LocationGeometry Link Has exact match IMI ic:場所型.ic:地理座標
  • 31. Results Person Link Has exact match IMI ic:人型 PersonAddress Link Has exact match IMI ic:人型.ic:住所 PersonAlternativeName Link Has broad match IMI ic:人型.ic:氏名.ic:姓名 PersonBirthName Link Has broad match IMI ic:人型.ic:氏名.ic:姓名 PersonCitizenship Link Has no match IMI PersonCountryOfBirth Link Has exact match IMI ic:人型.ic:出生国 PersonCountryOfDeath Link Has no match IMI PersonDateOfBirth Link Has exact match IMI ic:人型.ic:生年月日 PersonDateOfDeath Link Has exact match IMI ic:人型.ic:死亡年月日 PersonFamilyName Link Has exact match IMI ic:人型.ic:氏名.ic:姓 PersonFullName Link Has exact match IMI ic:人型.ic:氏名.ic:姓名 PersonGender Link Has exact match IMI ic:人型.ic:性別コード PersonGivenName Link Has exact match IMI ic:人型.ic:氏名.ic:名 PersonIdentifier Link Has broad match IMI ic:人型.ic:ID PersonPatronymicName Link Has no match IMI ic:人型.ic:氏名.ic:姓名 PersonPlaceOfBirth Link Has narrow match IMI ic:人型.ic:出生地
  • 32. Bridging core and domain vocabularies (working in progress) • Aim: Core vocabulary would be extended to domain vocabularies – Agriculture – Finance – Traffic – … • Task: – Can concepts be shared between core and domains? really?
  • 33. Agricultural Activity Ontology (AAO) Agricultural activity crop production activity activity for propagation activity in the vegetative growth stage activity in the reproductive growth stage activity for environment control activity for soil control activity for climate control activity for water control activity for biotic control activity for chemical control post production activity activity for harvesting activity for processing activity for extending shelf-life activity for wrapping indirect activity activity for preparing materials activity for cleaning activity for transport activity for monitoring activity for maintaining farm equipment administrative activity activity for business administration http://cavoc.org/aao/
  • 34. An example: “activity” (and “event”) • S: (n) activity (any specific behavior) "they avoided all recreational activity" – direct hyponym / full hyponym – direct hypernym / inherited hypernym / sister term • S: (n) act, deed, human action, human activity (something that people do or cause to happen) – S: (n) event (something that happens at a given place and time) – [WordNet] • Each activity is a Happening which involves volition and participants. It has temporal dimension. It is distinguished from Events by the fact that the activity does not trigger change of state and does not have a conceptual end point. – [PROTON Extent module (a lightweight upper-level ontology)] • Activity: This class represents the abstract content of an event, which may be repeated many times, once or never. For example a training course, or a play. – [The Event Programme Vocabulary (prog)] • E5 Event – Subclass of: E4 Period – Superclass of: E7 Activity, E63 Beginning of Existence, E64 End of Existence • E7 Activity – Subclass of: E5 Event – Superclass of: E8 Acquisition, E9 Move, E10 Transfer of Custody, E11 Modification, E13 Attribute Assignment, E65 Creation … – [CIDOC Conceptual Reference Model]
  • 35. Summary • Sharing concepts is a very long way • No ground truth – Step-by-step understanding of the world – Careful consensus making • More flexible framework is needed – Simple mapping is not so happy