1. When Culture Encounters Internet Conference, December 14th-15th, 2010, Taipei
Connecting Museums with Linked Data
以鏈結資料連結博物館
Hideaki Takeda 武田英明
takeda@nii.ac.jp
National Institute of Informatics
国立情報学研究所
With LODAC project team
I. Ohmukai, F. Kato, T. Kamura, T. Takahashi, H. Ueda
Hideaki Takeda / National Institute of Informatics
2. Outline
- Information Cycle
- Linked Data and Museum Data
- LODAC Museum
3. Information Cycle
- Information can only be created from existing information
  - No information is created out of nothing
  - Collect – Use & Create
- The value of information is how much it is used
  - Information that is not used has no value
  - Use & Create – Publish
- Accumulation of information is the wealth of society
  - Distribution of information is the health of society
  - Publish – Share – Collect
4. Information Cycle
- Before Gutenberg
  - Media
    - Handwritten books
    - Oral communication
  - The information cycle was
    - Slow
    - Small in volume
    - Limited to few people
- After Gutenberg, the age of mass media arrived …
5. Two social layers on information cycle with Mass Media
[Diagram labels: Writer, Artist, Scholar; Mass media; Government; Create]
6. Two social layers on information cycle with Mass Media
[Diagram labels: Writer, Artist, Scholar; Mass media; Government; Ordinary People; Collect; Use & Create]
7. Two social layers on information cycle with Mass Media
[Diagram labels: Writer, Artist, Scholar; Mass media; Government; Ordinary People; Create]
8. Information Cycle with Web
Open Door to Information Cycle for Ordinary People
[Diagram labels: Web; Internet; Search Engine; Web Server; Web Browser & HTML Editor; Create]
9. Information Cycle
- The Web accelerates the information cycle in
  - Speed
  - Quantity
  - Number of people
10. Information Cycle with Web
[Diagram labels: Web; Internet; Search Engine; Web Server; Web Browser & HTML Editor; Create]
11. Metadata is the platform of Information Cycle
[Diagram label: Metadata]
12. Linked Data will be the platform of Information Cycle on the content layer
[Diagram labels: Metadata; Linked Data]
14. Linked Data – Four Rules
- Linked Data is the “Web of Data”
  - The (traditional) Web is the “Web of Documents”
- What is Linked Data?
  - RDF triples
  - Can refer to other data
  - Can be referred to by other data
- Four Rules for Linked Data
  - Use URIs as names for things
  - Use HTTP URIs so that people can look up those names
  - When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL)
  - Include links to other URIs, so that they can discover more things
Source: Linked Data, TBL, http://www.w3.org/DesignIssues/LinkedData.html
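As a rough illustration of the four rules, the sketch below models RDF triples in plain Python and writes them out as N-Triples. The example.org URIs and the title value are hypothetical placeholders, not LODAC data; only the dc namespace (http://purl.org/dc/terms/) is a real vocabulary.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    subject: str    # HTTP URI naming the thing being described (rules 1 and 2)
    predicate: str  # HTTP URI naming the property
    obj: str        # another URI (a link, rule 4) or a literal value
    is_uri: bool    # True when obj is a URI rather than a literal


DC = "http://purl.org/dc/terms/"
WORK = "http://example.org/work/1"        # hypothetical URI
CREATOR = "http://example.org/person/1"   # hypothetical URI

triples = [
    Triple(WORK, DC + "title", "Example Work", False),
    Triple(WORK, DC + "creator", CREATOR, True),
]


def to_ntriples(t: Triple) -> str:
    """Serialize one triple as an N-Triples line."""
    if t.is_uri:
        obj = f"<{t.obj}>"
    else:
        escaped = t.obj.replace("\\", "\\\\").replace('"', '\\"')
        obj = f'"{escaped}"'
    return f"<{t.subject}> <{t.predicate}> {obj} ."


def outgoing_links(ts: list) -> list:
    """URIs a client could follow to discover more things (rule 4)."""
    return sorted({t.obj for t in ts if t.is_uri})


for t in triples:
    print(to_ntriples(t))
```

Rule 3 (returning useful data when a URI is looked up) is a server-side concern and is not covered by this sketch.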
15. Importance of data in public sector as Linked Data
- In principle, public-sector data should be shared
- It is the basic knowledge of our society
- Data in the public sector
  - Library
  - Museum
  - Archive
  - Government
16. Challenges for Linked Data in Japan
- Lack of a culture of sharing
- Immature community for Linked Data
- Lack of a central data set
- Difficulty of multilingual data
Anyway, let’s start!
17. LODAC Project
- Open Social Semantic Web Platform for Academic Resources
  - Providing platforms for Linked Data
  - Practicing data accumulation and publishing
- Areas of interest
  - Museum information
  - Geographical information, especially geographical names
  - Local information
  - …
18. Museum data as LOD
- The current state of museum information in Japan
  - Distributed
    - Self-maintained
    - Isolated
  - Opaque
    - Self-designed
    - Messy
- Aggregating and associating museum information
  - LODAC-Museum (tentative)
19. Over 1.4 billion collections
Over 1,000 organizations
20. http://lod.ac/ (open on December 11)
21. LODAC Museum – Main work
- Gathering data
  - Thesauri, museum collections, etc.
- Standardizing data
  - Representing data from different sources in a single uniform form
- Integrating data
  - Identifying data
  - Associating records that describe the same thing
- Publishing and sharing data
22. Data sources
- Thesaurus and authority sources
  - 日本美術シソーラスDB 絵画編 (Thesaurus of Japanese Art)
  - 国指定文化財データベース (DB for National Designated Cultural Property)
  - 文化遺産オンライン (Cultural Heritage Online)
- Other sources
  - DBpedia Japan
  - GIS data
- Museum Collection (14 museums)
  - 国立美術館所蔵作品総合目録検索システム (国立国際美術館, 京都国立近代美術館, 東京国立近代美術館) (4 Nat’l Museums)
  - 国立西洋美術館 (Nat’l M. Western Art)
  - 京都国立博物館 (Kyoto Nat’l Museum)
  - 奈良国立博物館 (Nara Nat’l Museum)
  - 福島県立美術館 (Fukushima Pref. M. of Art)
  - 栃木県立美術館 (Tochigi Pref. M. of Fine Arts)
  - 秋田県立近代美術館 (Akita M. of Modern Art)
  - 岩手県立美術館 (Iwate M. of Art)
  - 徳島県立近代美術館 (Tokushima Modern Art M.)
  - 山梨県立美術館 (Yamanashi Pref. M. of Art)
  - 東京都現代美術館 (M. of Contemporary Art Tokyo)
  - 香川県立東山魁夷せとうち美術館 (Kagawa Pref. Higashiyama Kaii Setouchi Art M.)
23. Metadata design
- Basic structure
  - Work – Creator – Museum
- Interoperability is prioritized over domain-specific correctness
  - DC > DCTerm > FOAF > iCal > SKOS > NDLSH > RDA > CIDOC CRM
- Keep it flat as long as possible
- lodac:Work properties (some items omitted)
  - Classification (資料分類): lodac:genre
  - Cultural property (文化財): lodac:culturalAssets
  - Creator (制作者): dc:creator / dc11:creator
  - Nationality (国籍): crm:P7_took_place_at
  - Title (作品名): dc:title / skos:prefLabel
  - Title reading (作品名読み): dc:title @ja-hrkt / skos:altLabel
  - Title in English (作品名英語): dc:title @en / skos:altLabel
  - Inscription (銘文): crm:P62I_is_depicted_by
  - Seal (印章): crm:P65_shows_visual_item
  - Number of items (員数): crm:P57_has_number_of_parts
  - Collection (コレクション): dc:isPartOf
  - Year of creation (制作年): dc:created
  - Estimated start year (推定始年): lodac:estimatedStartYear
  - Material (材質): dc:medium / crm:P45_consists_of
- Prefixes and URIs
  - crm: http://purl.org/NET/cidoc-crm/core#
  - dc: http://purl.org/dc/terms/
  - dc11: http://purl.org/dc/elements/1.1/
  - foaf: http://xmlns.com/foaf/0.1/
  - skos: http://www.w3.org/2004/02/skos/core#
  - rdfs: http://www.w3.org/2000/01/rdf-schema#
  - ical: http://www.w3.org/2002/12/cal/ical#
  - rda2: http://RDVocab.info/ElementsGr2
  - lodac: http://lod.ac/ns/lodac#
- Number of metadata elements
  - Work: 46
  - Person: 23
  - Org.: 13
  - Bib.: 12
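To make the "keep it flat" idea concrete, here is a minimal sketch of a flat lodac:Work record keyed by prefixed property names, with a helper that expands each prefix to its full URI. The prefix table is taken from this slide (rda2 is left out); the record values and the example.org URI are hypothetical, and the helper name `expand` is introduced only for illustration.

```python
# Prefix table as listed on the slide.
PREFIXES = {
    "crm": "http://purl.org/NET/cidoc-crm/core#",
    "dc": "http://purl.org/dc/terms/",
    "dc11": "http://purl.org/dc/elements/1.1/",
    "foaf": "http://xmlns.com/foaf/0.1/",
    "skos": "http://www.w3.org/2004/02/skos/core#",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "ical": "http://www.w3.org/2002/12/cal/ical#",
    "lodac": "http://lod.ac/ns/lodac#",
}


def expand(qname: str) -> str:
    """Turn a prefixed name such as 'dc:title' into a full URI."""
    prefix, _, local = qname.partition(":")
    return PREFIXES[prefix] + local


# A flat record: one level of property/value pairs, no nested structures.
# All values below are hypothetical placeholders.
work = {
    "dc:title": "Example Work",
    "dc:creator": "http://example.org/person/1",
    "dc:created": "1900",
    "dc:medium": "oil on canvas",
    "lodac:genre": "painting",
}

expanded = {expand(prop): value for prop, value in work.items()}
for uri, value in expanded.items():
    print(uri, "=", value)
```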
24. Integration Policy
- How to integrate data from different sources
  - Sharing of responsibility
    - Each source is responsible for its data
    - LODAC is only responsible for integration
- Identifying IDs for data and managing data with the IDs
- Assigning original IDs and associating other IDs to them
[Diagram: Work, Museum, and Creator nodes appear in three columns: Data from Source A, Integrated data, and Data from Source B. Work and Museum are connected by crm:P55_has_current_location, Work and Creator by dc:creator, and integrated nodes are connected to the corresponding source nodes by dc:references.]
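The ID policy above can be sketched as a small registry that mints an original ID for each integrated entity and records the source IDs associated with it. This is an illustrative sketch only: the class name `Integrator`, the example.org base URI, and the source URIs are hypothetical, and the direction of the dc:references link (integrated node to source node) is an assumption read from the diagram.

```python
from itertools import count


class Integrator:
    """Assigns original IDs and associates source IDs with them."""

    def __init__(self, base: str = "http://example.org/id/"):  # hypothetical base URI
        self._base = base
        self._counter = count(1)
        self.references = {}  # integrated ID -> set of source IDs

    def new_entity(self, *source_ids: str) -> str:
        """Mint an original ID and attach any known source IDs to it."""
        integrated_id = f"{self._base}{next(self._counter)}"
        self.references[integrated_id] = set(source_ids)
        return integrated_id

    def associate(self, integrated_id: str, source_id: str) -> None:
        """Associate one more source ID with an existing integrated ID."""
        self.references[integrated_id].add(source_id)

    def triples(self) -> list:
        """Emit (integrated ID, dc:references, source ID) links, sorted."""
        return sorted(
            (iid, "dc:references", sid)
            for iid, sids in self.references.items()
            for sid in sids
        )


integ = Integrator()
work_id = integ.new_entity("http://a.example/work/10")   # record from source A
integ.associate(work_id, "http://b.example/item/7")      # same work in source B
for triple in integ.triples():
    print(triple)
```

Keeping the source records untouched and only adding links matches the division of responsibility on the slide: each source stays responsible for its own data, and the integrating side owns only the IDs and the associations.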
25. Integration of Person Data
- Matching of creators
  - Base: list of artists from the Thesaurus of Japanese Art
  - Target: creators of works in museum collections + DBpedia
  - Method: string matching of names
  - Result: links from artist nodes to work nodes are added
[Screenshot labels: LODAC data; Links; Link to Work; DBpedia; Basic Information for Creators]
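The matching step could look roughly like the sketch below, which links artists from a base list to works whose creator name matches as a string. The slide states only that names are matched as strings; the normalization used here (Unicode NFKC, whitespace removal, case folding) is an assumption, and all IDs and names are hypothetical sample data.

```python
import unicodedata


def normalize(name: str) -> str:
    """Fold width variants, drop all whitespace, and ignore case (assumed rules)."""
    folded = unicodedata.normalize("NFKC", name)
    return "".join(folded.split()).casefold()


def link_artists_to_works(base_artists: dict, works: list) -> list:
    """Return (artist_id, work_id) links for works whose creator name matches.

    base_artists maps artist_id -> name; works is a list of (work_id, creator_name).
    """
    index = {}
    for artist_id, name in base_artists.items():
        index.setdefault(normalize(name), []).append(artist_id)

    links = []
    for work_id, creator_name in works:
        for artist_id in index.get(normalize(creator_name), []):
            links.append((artist_id, work_id))
    return links


# Hypothetical sample data.
base_artists = {"artist/1": "山田 太郎", "artist/2": "Suzuki Hanako"}
works = [
    ("work/a", "山田太郎"),
    ("work/b", "suzuki  hanako"),
    ("work/c", "Unknown"),
]
links = link_artists_to_works(base_artists, works)
print(links)
```

Plain string matching cannot tell apart different people who share a name, and it misses variant spellings, so results from a sketch like this would need review before being published as links.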
32. 東京近代美術館 National Museum of Modern Art, Tokyo
33. 国指定文化財データベース DB for National Designated Cultural Property
34. [Diagram labels: Tokushima Pref. Museum; Thesaurus for Japanese Art; DB for National Designated Cultural Property; National Museum of Modern Art, Tokyo; Fukui Pref. Museum]
35. Data size and Integration Results
Data size

Source | Type | No.
国立美術館 (3 museums, excluding Western Art) | Work | 25180
国立西洋美術館 | Work | 4373
京都国立博物館 | Work | 5819
奈良国立博物館 | Work | 431
福島県立美術館 | Work | 20
栃木県立美術館 | Work | 32
秋田県立近代美術館 | Work | 22
岩手県立美術館 | Work | 1558
徳島県立近代美術館 | Work | 18482
山梨県立美術館 | Work | 262
東京都現代美術館 | Work | 5416
香川県立東山魁夷せとうち美術館 | Work | 266
Thesaurus for J. art | Work | 3800
Thesaurus for J. art | Person | 1332
Thesaurus for J. art | Group | 289
Thesaurus for J. art | Museum | 648
Cultural Heritage Online | Museum | 915
Designated Cultural Property DB | Work | 10115
Total | | 103096

Integration results

Type for integration | Sources (No.) | Results
Museum | Thesaurus for J. art (648); Cultural Heritage Online (915) | 77
Designated Cultural Property DB | Thesaurus for J. art (work) (3800); Designated Cultural Property (10115) | 74
Work | Thesaurus for J. art (work) (1332); Museum collections (work) (61861) | 15020
Person | Thesaurus for J. art (artist) (1332); Museum collections (work) (61861) | 615
36. What can LOD give Museum Data?
Connectivity!!
- Open connectivity creates new value for museum data
  - Connect to data in other areas
  - Connect to UGC (User Generated Content)
37. Local Information with Museum data
- Museum LOD + Local LOD / Sightseeing LOD / Geo LOD
- e.g.,
  - Tours of museums with a particular focus
  - Joint events with local festivals
  - Tours of food-related historical events
  - …
38. User Generated Contents for Museum Information
- Contributions by non-experts
- e.g.,
  - Personal comments on Buddha statues
  - Records of museum visits
  - Media-mix events
[Image captions: 1. Statue of Sarasvati (弁財天像); 2. Ryohoji Temple; 3. Theme Song for Ryohoji; 4. Event]
39. Publish museum data as LOD
- Let’s make museum data open and shareable
- Change “cultural heritage” to “cultural resources”
- (art/culture) * information = Promotion of the Nation
- Beyond collaboration of Museums, Libraries, and Archives (MLA)
  - MLA3 (Museum Library Archives, Arts and Academia)
- More users, more varied types of usage
40. Make arts and culture more dynamic and more energetic
[Image label: Pop Culture]