Digital Medieval Data Curation
CLIR Postdoctoral Fellowship Seminar
Bryn Mawr, 2013
Benjamin Albritton, Stanford University Libraries
blalbrit@stanford.edu
@bla222
Current State: A World of Silos
• Roman de la Rose
• Parker on the Web
• e-codices
• And so on…
Data Interoperability
• Break down silos
• Separate data from applications
• Share data models and programming interfaces
• Enable interactions at the tool and repository level
Designing Modular Repositories and Tools
[Diagram. Layers: Repository, User Interface, 3rd-Party Tools. Repository data: Image Data (Canonical), Non-image data (Canonical). Tools shown: Image Viewer, Discovery, Annotation, Transcription, Image Analysis, Tool X?]
Designing Modular Repositories and Tools
Iterative Interactions
Multiple Data Sources
• Existing structured data (catalogs)
• User-added
– Comments
– Transcriptions
– Etc.
• Digital images
• Machine processing
Motivating Questions
What does this mean for medieval data?
• How do we rethink medieval object data in a shared, distributed, global space?
• How do we enable collaboration and encourage engagement?
• How do we deal with tools that are producing new data on digital surrogates that are implicitly about a real world object?
Transcribing from Digital Surrogates
La Terre de Secille
Naïve Approach: Attach Transcription to Image
One problem example: Multiple Representations
• CCC 26 f. iiiR
• Fold A Open
• Fold A and B Open
• f. iiiV
The Shared Canvas
• Represents a real world thing we want to “talk” about
• Has a unique name
• http://dms-data.stanford.edu/Parker/CCC026/canvas-12
Data Model: SharedCanvas
http://www.shared-canvas.org
Data is “about” a real thing
Canvas Paradigm
• A Canvas is an empty space in which to build up a display
• Makes explicit that the image is a surrogate
Open Annotation Model
• Annotation (a document)
• Body (the ‘comment’ of the annotation)
• Target (the resource the Body is ‘about’)
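The three parts listed above can be sketched as a small data structure. This is a minimal illustration in Python, not the Open Annotation serialization itself: the class name, the field names, and the example.org body URI are assumptions made for the example. Only the canvas URI comes from the slides.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    """A document that says: this Body is 'about' this Target."""
    body: str    # URI of the comment, image, or transcription
    target: str  # URI of the resource the body is about


# Canvas URI taken from the slides; the body URI is hypothetical.
CANVAS = "http://dms-data.stanford.edu/Parker/CCC026/canvas-12"

note = Annotation(
    body="http://example.org/transcriptions/transcription-1.txt",
    target=CANVAS,
)

print(f"{note.body} is about {note.target}")
```

Because the target is a name for the page rather than for any one image file, the same annotation stays valid if the image is replaced or re-photographed.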
Model: Annotations to Paint Canvas
• The Canvas represents the empty page
• Annotation links Image with Canvas
• Annotation links Text with Canvas
Model: Missing Pages
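One way to picture the canvas model in code: the Canvas stands for the page, and images or text are attached by annotations that target it. The sketch below is illustrative only. The class and method names and all example.org URIs are hypothetical, and it is not the SharedCanvas data format. It shows two images of the same folio sharing one Canvas, and a Canvas for a missing page that has nothing painted on it.

```python
from dataclasses import dataclass, field


@dataclass
class Canvas:
    uri: str
    label: str
    annotations: list = field(default_factory=list)  # (kind, body_uri) pairs

    def paint(self, kind: str, body_uri: str) -> None:
        """Attach a resource to this canvas via an annotation."""
        self.annotations.append((kind, body_uri))

    def bodies(self, kind: str) -> list:
        """Return the body URIs of all annotations of one kind."""
        return [uri for k, uri in self.annotations if k == kind]


# One canvas for the page, several digital surrogates of it.
page = Canvas("http://example.org/canvas/f-iii-r", "CCC 26 f. iiiR")
page.paint("image", "http://example.org/img/f-iii-r.jpg")
page.paint("image", "http://example.org/img/f-iii-r-fold-a-open.jpg")
page.paint("text", "http://example.org/text/f-iii-r.txt")

# A page known to be missing can still have a canvas with no image.
missing = Canvas("http://example.org/canvas/missing-1", "missing leaf")

print(len(page.bodies("image")), len(page.bodies("text")), len(missing.annotations))
```

The transcription targets the Canvas once, so it does not need to be repeated for each photograph of the folio.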
Medieval Data Use-Cases: A Sampler
• Structured data from existing sources
• Transcription and glyphs
• Structured data from new sources
Structured Data from Existing Sources
A Catalog of the Manuscripts of Salisbury Cathedral Library
Drives Discovery
Transcription:
T-PEN (Saint Louis University) http://t-pen.org
• Transcription tool
• Provides image parsing
– Columns
– Lines
BNF fr. 9221 – column parsing
BNF fr. 9221 – line parsing
BNF fr. 9221 – transcription view
Drives Full-Text Search
http://t-pen.org/TPEN
… and other interfaces
http://stanford.edu/~blalbrit/v-machine-2/samples/DamedequiRF5.xml
T-PEN’s PaleoTool
BNF fr. 1586 – glyph parsing
Results for “matching” glyphs
Glyphs with multiple letters
Comparing results across manuscripts
BNF fr. 1586 / CCCC 324
User-created Structured Data
Beinecke MS 310, f. 1r
• Each row = 1 day (January 1, here)
• Lists the feast of the Circumcision
• Optionally provides additional information
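The row structure described above could be captured as one record per day. This is a minimal sketch with assumed field names, not the format T-PEN or the kalendar project actually uses. The values are the ones given on the slide.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class KalendarEntry:
    manuscript: str
    month: int
    day: int
    feast: str
    note: Optional[str] = None  # additional information, when the row gives any


# January 1 in Beinecke MS 310, as described on the slide.
entry = KalendarEntry(
    manuscript="Beinecke MS 310",
    month=1,
    day=1,
    feast="Circumcision",
)

print(entry)
```

Keeping the optional information in its own field means rows without it still share the same shape as rows with it.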
Distributed Resources / Distributed Environments
Data capture in T-PEN
http://t-pen.org – Saint Louis University
Front-end: Exhibit
http://guillaumedemachaut.com/kalendar/sharedkalendar.html
Simple (really simple) Exhibit based on kalendar transcriptions
(Exhibit: http://www.simile-widgets.org/exhibit/)
For each record:
Enabling rapid comparison
Two mss. include the entry “Thimotheus apostel”
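A comparison like this amounts to grouping kalendar records by entry text and keeping the texts that occur in more than one manuscript. The sketch below assumes a simple tuple layout. The manuscript names, dates, and all entry texts other than "Thimotheus apostel" are placeholders, not real data, and the function name is hypothetical.

```python
from collections import defaultdict

# Placeholder records: (manuscript, month, day, entry text). Not real data.
records = [
    ("MS A", 1, 24, "Thimotheus apostel"),
    ("MS B", 1, 24, "Thimotheus apostel"),
    ("MS C", 1, 24, "Timothei"),
    ("MS A", 1, 1, "Circumcisio domini"),
]


def shared_entries(rows):
    """Map each entry text to the manuscripts containing it (2 or more only)."""
    by_text = defaultdict(set)
    for manuscript, _month, _day, text in rows:
        by_text[text].add(manuscript)
    return {text: sorted(mss) for text, mss in by_text.items() if len(mss) > 1}


print(shared_entries(records))
```

Matching on exact text is deliberately naive: variant spellings of the same feast would need normalization before they could be grouped together.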
Distributed Resources / Distributed Environments
SharedCanvas Demo Implementation
http://www.shared-canvas.org/impl/demodh
A Sea of Manuscript Data
• Thousands of manuscripts currently available interoperably, with more coming rapidly
• Discovery data is a mixed bag
• Tools provide data back into the system that can be re-used
• New data drives new discovery, new interfaces, and new visualization challenges
• Management and manipulation of that “wild” data is a serious challenge

Editor's Notes

  1. Allows filtering by date, item, and manuscript, as well as search across the items