SlideShare uma empresa Scribd logo
1 de 39
Managing Crowd sourced Cultural
Heritage Datasets
National Library of Wales
Glen Robson – Head of Systems
twitter: @glenrobson
Plan
• Background to the National Library of Wales
• Crowd Sourcing projects
– Cymru – 1900 – Wales
– Cynefin
– Shipping Records
– WW1 Book of Remembrance
• Providing storage and access
Content
Content
Content
Content
Welsh Newspapers
http://www.cymru1900wales.org/
Cynefin
cynefin.archiveswales.org.uk
Data
• Fields:
– Owner
– Tennant
– Use – arable/forest etc.
– Size (acre, rood, perches)
– Tithe Value (pounds, shilling, pence)
– Geo-coordinates
• Storing in Fedora
– ALTO
– Open Annotations
• JSON-LD
• RDF/XML
– Indexing in SOLR
– Website in the summer
Shipping Registers
• 544 merchant vessels registered at the port of
Aberystwyth
• 1856-1914
• Crew lists – name, position, birth date, reason
for leaving, location
• Transcribed by volunteers
• https://www.llgc.org.uk/blog/?p=5716
Data Preservation
• Where do we store this data?
– Catalogue – MARC
– Fedora 3 Repository
• Excel files / RDF
• Data being enhanced
– Currently:
• Triple store (sesame) – preservation?
• https://github.com/LlGC-NLW/shippingrecords
– Fedora 4?
Enhancements
• Linking out
– Places:- Birth and Ship arrival
• Volunteer using OpenRefine to group places
• Will try and match with GeoNames
– Ships :-
• Added to wikidata by NLW Wikipedian in Residence:
– https://tools.wmflabs.org/reasonator/?&q=23927955
– https://tools.wmflabs.org/reasonator/?&q=24027483
– Adding images, size, weight, creation, destruction, link to
newspapers
– Dutch Shipping to Newspaper linking:
http://bit.ly/1Talish/
Research Potential
• By publishing these datasets as Linked Open Data it allows research
that wasn’t possible when these items were physical or even when
they were standalone digital objects.
• Physical:
– Travel to Aberystwyth - x hours/days
– Transcribe data in the reading room – x months/years
– Process back home
• Standalone Digital Object
– Transcribe data at home – x months/years
– Process at home
• Linked Open Data Annotations
– Process at home results in minutes
• Have to take transcriptions with trust
Mirador
http://projectmirador.org/
Simple Annotation Server
• https://github.com/glenrobson/SimpleAnnotationServer
• Stores IIIF Annotations as Linked Open Data
Annotation (Transcription)
http://walesatwar.org
Newspapers
Future Projects
Future Projects
Future Projects
Providing Access
• Volunteers want to see results
• Cynefin – funded project
• Shipping records – independent website
• Cymru1900Wales – dataset (CSV + Linked Data)
• Mirador and IIIF options:
– IIIF Search API
– IIIF Ranges – table of contents
– Datasets for download
Universal Viewer
Dataset Intersection
• Example of dataset intersection
• John Williams
• Born 1891
Can we do this at scale?
Cynefin
Maps
1838 to 1947
Newspapers
1804 to 1919
Cymru 1914
1914 to 1918
General
Digitisation
Shipping Records
1856 to 1914
Crime and
Punishment
Database
1730 to 1830
Welsh Bibliography
0 to 1970
Summary
• Different methods of crowd sourcing:
– Excel
– Outsourcing – Cynefin and wales1900
– IIIF – Mirador & Simple Annotation Server
– WikiData
• Ideally crowd sourcing platform directly connected to access solution
(there will be corrections)
• Transcribing to linked data gives:
– Connection to external data sources (geonames, wikipedia)
– Connection to other resources (newspapers)
– Allows researchers to query the data
• IIIF gives:
– Easy to setup transcription platform
– Work with other peoples content

Mais conteúdo relacionado

Mais procurados

Eaa2021 476 izeta cattaneo idacordig and suquia
 Eaa2021 476 izeta cattaneo idacordig and suquia Eaa2021 476 izeta cattaneo idacordig and suquia
Eaa2021 476 izeta cattaneo idacordig and suquia
ariadnenetwork
 
Eaa2021 s476 ariadne-seadda
Eaa2021 s476 ariadne-seaddaEaa2021 s476 ariadne-seadda
Eaa2021 s476 ariadne-seadda
ariadnenetwork
 

Mais procurados (20)

Dynamics and partnerships with local associations involved in LoCloud: a case...
Dynamics and partnerships with local associations involved in LoCloud: a case...Dynamics and partnerships with local associations involved in LoCloud: a case...
Dynamics and partnerships with local associations involved in LoCloud: a case...
 
DRI Community Forum: Collection Focus - Transport Infrastructure Ireland
DRI Community Forum: Collection Focus - Transport Infrastructure IrelandDRI Community Forum: Collection Focus - Transport Infrastructure Ireland
DRI Community Forum: Collection Focus - Transport Infrastructure Ireland
 
From Open Acces to Open Collections to Open Minds
From Open Acces to Open Collections to Open MindsFrom Open Acces to Open Collections to Open Minds
From Open Acces to Open Collections to Open Minds
 
Europeana Generic Services Projects Meeting, 29-30 October 2018, The Hague, B...
Europeana Generic Services Projects Meeting, 29-30 October 2018, The Hague, B...Europeana Generic Services Projects Meeting, 29-30 October 2018, The Hague, B...
Europeana Generic Services Projects Meeting, 29-30 October 2018, The Hague, B...
 
2018 03-03 culture hack-bucharest-marco streefkerk
2018 03-03 culture hack-bucharest-marco streefkerk2018 03-03 culture hack-bucharest-marco streefkerk
2018 03-03 culture hack-bucharest-marco streefkerk
 
Open Cultural Heritage Data @ the Rijksmuseum
Open Cultural Heritage Data @ the RijksmuseumOpen Cultural Heritage Data @ the Rijksmuseum
Open Cultural Heritage Data @ the Rijksmuseum
 
LoCloud Overview
LoCloud OverviewLoCloud Overview
LoCloud Overview
 
LoCloud geolocation enrichment tools: On the Map
LoCloud geolocation enrichment tools: On the MapLoCloud geolocation enrichment tools: On the Map
LoCloud geolocation enrichment tools: On the Map
 
Digital Initiatives and Digital Scholarship at the British Library
Digital Initiatives and Digital Scholarship at the British LibraryDigital Initiatives and Digital Scholarship at the British Library
Digital Initiatives and Digital Scholarship at the British Library
 
Europeana Newspapers Aggregator Forum 2018 Berlin
Europeana Newspapers Aggregator Forum 2018 BerlinEuropeana Newspapers Aggregator Forum 2018 Berlin
Europeana Newspapers Aggregator Forum 2018 Berlin
 
2017 IIIF Conference - The Vatican - SACHA
2017 IIIF Conference - The Vatican - SACHA2017 IIIF Conference - The Vatican - SACHA
2017 IIIF Conference - The Vatican - SACHA
 
Eaa2021 476 izeta cattaneo idacordig and suquia
 Eaa2021 476 izeta cattaneo idacordig and suquia Eaa2021 476 izeta cattaneo idacordig and suquia
Eaa2021 476 izeta cattaneo idacordig and suquia
 
Digitisation and Digital Humanities - what is the role of Libraries?
Digitisation and Digital Humanities - what is the role of Libraries?Digitisation and Digital Humanities - what is the role of Libraries?
Digitisation and Digital Humanities - what is the role of Libraries?
 
#FAIRGLAM : Towards FAIR digital cultural heritage collections in the Rijksmu...
#FAIRGLAM : Towards FAIR digital cultural heritage collections in the Rijksmu...#FAIRGLAM : Towards FAIR digital cultural heritage collections in the Rijksmu...
#FAIRGLAM : Towards FAIR digital cultural heritage collections in the Rijksmu...
 
Eaa2021 s476 ariadne-seadda
Eaa2021 s476 ariadne-seaddaEaa2021 s476 ariadne-seadda
Eaa2021 s476 ariadne-seadda
 
21st Century Geospatial #HistEnv Data Management
21st Century Geospatial #HistEnv Data Management21st Century Geospatial #HistEnv Data Management
21st Century Geospatial #HistEnv Data Management
 
Heeren pan-seadda-leiden-17mrt2020
Heeren pan-seadda-leiden-17mrt2020Heeren pan-seadda-leiden-17mrt2020
Heeren pan-seadda-leiden-17mrt2020
 
Locating a National Collection
Locating a National CollectionLocating a National Collection
Locating a National Collection
 
Infrastructure - A necessary platform for user empowerment
Infrastructure - A necessary platform for user empowermentInfrastructure - A necessary platform for user empowerment
Infrastructure - A necessary platform for user empowerment
 
Geosemantic Tools for Archaeological Research
Geosemantic Tools for Archaeological ResearchGeosemantic Tools for Archaeological Research
Geosemantic Tools for Archaeological Research
 

Destaque

Destaque (10)

Baton slides from Open Repositories 2016
Baton slides from Open Repositories 2016Baton slides from Open Repositories 2016
Baton slides from Open Repositories 2016
 
Goethals Harvard Library's Digital Preservation Repository
Goethals Harvard Library's Digital Preservation RepositoryGoethals Harvard Library's Digital Preservation Repository
Goethals Harvard Library's Digital Preservation Repository
 
VanDyck Long-Term Preservation of Digital Scholarly Literature
VanDyck Long-Term Preservation of Digital Scholarly LiteratureVanDyck Long-Term Preservation of Digital Scholarly Literature
VanDyck Long-Term Preservation of Digital Scholarly Literature
 
Ferrante Durable Access to Digital Primary Sources
Ferrante Durable Access to Digital Primary SourcesFerrante Durable Access to Digital Primary Sources
Ferrante Durable Access to Digital Primary Sources
 
Wittenberg Portico: Lessons From a Community Supported Archive
Wittenberg Portico: Lessons From a Community Supported ArchiveWittenberg Portico: Lessons From a Community Supported Archive
Wittenberg Portico: Lessons From a Community Supported Archive
 
Herdrich -The Digital Library of the Middle East (DLME)
Herdrich -The Digital Library of the Middle East (DLME)Herdrich -The Digital Library of the Middle East (DLME)
Herdrich -The Digital Library of the Middle East (DLME)
 
Kettler Information Digitization in the Humanities
Kettler Information Digitization in the HumanitiesKettler Information Digitization in the Humanities
Kettler Information Digitization in the Humanities
 
Wheeler & Benedict -- Enabling the Preservation Relay
Wheeler & Benedict -- Enabling the Preservation RelayWheeler & Benedict -- Enabling the Preservation Relay
Wheeler & Benedict -- Enabling the Preservation Relay
 
Madsen Digital Preservation Policy & Strategy
Madsen Digital Preservation Policy & StrategyMadsen Digital Preservation Policy & Strategy
Madsen Digital Preservation Policy & Strategy
 
Waraksa Digital Library of the Middle East
Waraksa Digital Library of the Middle EastWaraksa Digital Library of the Middle East
Waraksa Digital Library of the Middle East
 

Semelhante a OR2016 - Managing Crowd sourced Cultural Heritage Datasets

Cynefin: A Sense of Place
Cynefin: A Sense of PlaceCynefin: A Sense of Place
Cynefin: A Sense of Place
Glen Robson
 
Inventory 1964-2014: Crowdsourcing the National Monuments Record: Jamie Davie...
Inventory 1964-2014: Crowdsourcing the National Monuments Record: Jamie Davie...Inventory 1964-2014: Crowdsourcing the National Monuments Record: Jamie Davie...
Inventory 1964-2014: Crowdsourcing the National Monuments Record: Jamie Davie...
RCAHMW
 

Semelhante a OR2016 - Managing Crowd sourced Cultural Heritage Datasets (20)

Discovery, Reuse, Research and Crowdsourcing: IIIF experiences from the NLW
Discovery, Reuse, Research and Crowdsourcing: IIIF experiences from the NLWDiscovery, Reuse, Research and Crowdsourcing: IIIF experiences from the NLW
Discovery, Reuse, Research and Crowdsourcing: IIIF experiences from the NLW
 
IIIF and NLW delivered at the 'Access to the World's Images' meeting in Ghent...
IIIF and NLW delivered at the 'Access to the World's Images' meeting in Ghent...IIIF and NLW delivered at the 'Access to the World's Images' meeting in Ghent...
IIIF and NLW delivered at the 'Access to the World's Images' meeting in Ghent...
 
IIIF and the National Library of Wales
IIIF and the National Library of WalesIIIF and the National Library of Wales
IIIF and the National Library of Wales
 
Black country collections online
Black country collections onlineBlack country collections online
Black country collections online
 
Effective information sharing - workshop session CILIP Cymru Wales Annual Con...
Effective information sharing - workshop session CILIP Cymru Wales Annual Con...Effective information sharing - workshop session CILIP Cymru Wales Annual Con...
Effective information sharing - workshop session CILIP Cymru Wales Annual Con...
 
NLW Linked Open Data Sets
NLW Linked Open Data SetsNLW Linked Open Data Sets
NLW Linked Open Data Sets
 
The Semantic Web and the Digital Archaeological Workflow: A Case Study from S...
The Semantic Web and the Digital Archaeological Workflow: A Case Study from S...The Semantic Web and the Digital Archaeological Workflow: A Case Study from S...
The Semantic Web and the Digital Archaeological Workflow: A Case Study from S...
 
Pratt/KCL Summer School 2017
Pratt/KCL Summer School 2017Pratt/KCL Summer School 2017
Pratt/KCL Summer School 2017
 
Integrating archaeological data: The ARIADNE Infrastructure, Achille Felicett...
Integrating archaeological data: The ARIADNE Infrastructure, Achille Felicett...Integrating archaeological data: The ARIADNE Infrastructure, Achille Felicett...
Integrating archaeological data: The ARIADNE Infrastructure, Achille Felicett...
 
NAA shake your family tree talk 2012
NAA shake your family tree talk 2012NAA shake your family tree talk 2012
NAA shake your family tree talk 2012
 
Archaeology Data Service (ADS)
Archaeology Data Service (ADS)Archaeology Data Service (ADS)
Archaeology Data Service (ADS)
 
Cynefin: A Sense of Place
Cynefin: A Sense of PlaceCynefin: A Sense of Place
Cynefin: A Sense of Place
 
Inventory 1964-2014: Crowdsourcing the National Monuments Record: Jamie Davie...
Inventory 1964-2014: Crowdsourcing the National Monuments Record: Jamie Davie...Inventory 1964-2014: Crowdsourcing the National Monuments Record: Jamie Davie...
Inventory 1964-2014: Crowdsourcing the National Monuments Record: Jamie Davie...
 
Linking Spaces with Places: Examples from the PastPlace Project
Linking Spaces with Places: Examples from thePastPlace ProjectLinking Spaces with Places: Examples from thePastPlace Project
Linking Spaces with Places: Examples from the PastPlace Project
 
Cymru1900 and the List of Historic Place Names in Wales
Cymru1900 and the List of Historic Place Names in WalesCymru1900 and the List of Historic Place Names in Wales
Cymru1900 and the List of Historic Place Names in Wales
 
Extracting and Sharing Data from Historical Maps
Extracting and Sharing Data from Historical MapsExtracting and Sharing Data from Historical Maps
Extracting and Sharing Data from Historical Maps
 
Geography resources at the State Library of Victoria
Geography resources at the State Library of VictoriaGeography resources at the State Library of Victoria
Geography resources at the State Library of Victoria
 
GTAV presentation 2010
GTAV presentation 2010GTAV presentation 2010
GTAV presentation 2010
 
Julian D. Richards - Open Data in European Archaeology
Julian D. Richards -  Open Data in European ArchaeologyJulian D. Richards -  Open Data in European Archaeology
Julian D. Richards - Open Data in European Archaeology
 
Glenn Cumiskey - UKAD 2016 forum
Glenn Cumiskey - UKAD 2016 forumGlenn Cumiskey - UKAD 2016 forum
Glenn Cumiskey - UKAD 2016 forum
 

Mais de Glen Robson

Mais de Glen Robson (11)

IIIF for Aggregators
IIIF for AggregatorsIIIF for Aggregators
IIIF for Aggregators
 
IIIF Introduction given in South Africa - 2019
IIIF Introduction given in South Africa - 2019IIIF Introduction given in South Africa - 2019
IIIF Introduction given in South Africa - 2019
 
South Africa IIIF Presentation API
South Africa IIIF Presentation APISouth Africa IIIF Presentation API
South Africa IIIF Presentation API
 
IIIF Image API - glen
IIIF Image API - glenIIIF Image API - glen
IIIF Image API - glen
 
Sweden IIIF Event - IIIF Community
Sweden IIIF Event - IIIF CommunitySweden IIIF Event - IIIF Community
Sweden IIIF Event - IIIF Community
 
SWIB 2018 - Visualising form data - NLW Transcription Projects
SWIB 2018 - Visualising form data - NLW Transcription ProjectsSWIB 2018 - Visualising form data - NLW Transcription Projects
SWIB 2018 - Visualising form data - NLW Transcription Projects
 
NISO REST Training IIIF
NISO REST Training IIIF NISO REST Training IIIF
NISO REST Training IIIF
 
Introduction to Annotation, Content Search, and IIIF Authentication from the ...
Introduction to Annotation, Content Search, and IIIF Authentication from the ...Introduction to Annotation, Content Search, and IIIF Authentication from the ...
Introduction to Annotation, Content Search, and IIIF Authentication from the ...
 
Intro to IIIF and IIIF @NLW
Intro to IIIF and IIIF @NLWIntro to IIIF and IIIF @NLW
Intro to IIIF and IIIF @NLW
 
Introduction to IIIF
Introduction to IIIFIntroduction to IIIF
Introduction to IIIF
 
Europeana Tech - IIIF in Action
Europeana Tech - IIIF in ActionEuropeana Tech - IIIF in Action
Europeana Tech - IIIF in Action
 

Último

Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
Chris Hunter
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 

Último (20)

Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
 

OR2016 - Managing Crowd sourced Cultural Heritage Datasets

  • 1. Managing Crowd sourced Cultural Heritage Datasets National Library of Wales Glen Robson – Head of Systems twitter: @glenrobson
  • 2. Plan • Background to the National Library of Wales • Crowd Sourcing projects – Cymru – 1900 – Wales – Cynefin – Shipping Records – WW1 Book of Remembrance • Providing storage and access
  • 3.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15. Data • Fields: – Owner – Tennant – Use – arable/forest etc. – Size (acre, rood, perches) – Tithe Value (pounds, shilling, pence) – Geo-coordinates • Storing in Fedora – ALTO – Open Annotations • JSON-LD • RDF/XML – Indexing in SOLR – Website in the summer
  • 16. Shipping Registers • 544 merchant vessels registered at the port of Aberystwyth • 1856-1914 • Crew lists – name, position, birth date, reason for leaving, location • Transcribed by volunteers • https://www.llgc.org.uk/blog/?p=5716
  • 17.
  • 18. Data Preservation • Where do we store this data? – Catalogue – MARC – Fedora 3 Repository • Excel files / RDF • Data being enhanced – Currently: • Triple store (sesame) – preservation? • https://github.com/LlGC-NLW/shippingrecords – Fedora 4?
  • 19. Enhancements • Linking out – Places:- Birth and Ship arrival • Volunteer using OpenRefine to group places • Will try and match with GeoNames – Ships :- • Added to wikidata by NLW Wikipedian in Residence: – https://tools.wmflabs.org/reasonator/?&q=23927955 – https://tools.wmflabs.org/reasonator/?&q=24027483 – Adding images, size, weight, creation, destruction, link to newspapers – Dutch Shipping to Newspaper linking: http://bit.ly/1Talish/
  • 20.
  • 21. Research Potential • By publishing these datasets as Linked Open Data it allows research that wasn’t possible when these items were physical or even when they were standalone digital objects. • Physical: – Travel to Aberystwyth - x hours/days – Transcribe data in the reading room – x months/years – Process back home • Standalone Digital Object – Transcribe data at home – x months/years – Process at home • Linked Open Data Annotations – Process at home results in minutes • Have to take transcriptions with trust
  • 23.
  • 24. Simple Annotation Server • https://github.com/glenrobson/SimpleAnnotationServer • Stores IIIF Annotations as Linked Open Data
  • 27.
  • 32. Providing Access • Volunteers want to see results • Cynefin – funded project • Shipping records – independent website • Cymru1900Wales – dataset (CSV + Linked Data) • Mirador and IIIF options: – IIIF Search API – IIIF Ranges – table of contents – Datasets for download
  • 34. Dataset Intersection • Example of dataset intersection • John Williams • Born 1891
  • 35.
  • 36.
  • 37.
  • 38. Can we do this at scale? Cynefin Maps 1838 to 1947 Newspapers 1804 to 1919 Cymru 1914 1914 to 1918 General Digitisation Shipping Records 1856 to 1914 Crime and Punishment Database 1730 to 1830 Welsh Bibliography 0 to 1970
  • 39. Summary • Different methods of crowd sourcing: – Excel – Outsourcing – Cynefin and wales1900 – IIIF – Mirador & Simple Annotation Server – WikiData • Ideally crowd sourcing platform directly connected to access solution (there will be corrections) • Transcribing to linked data gives: – Connection to external data sources (geonames, wikipedia) – Connection to other resources (newspapers) – Allows researchers to query the data • IIIF gives: – Easy to setup transcription platform – Work with other peoples content

Notas do Editor

  1. Due to our remote location we’ve focused on digitising as much as possible so people don’t have to come to Aberystwyth. Once digitised they are made freely aviliable online.
  2. Lots of different content
  3. One of our largest collections is our collection of digitised newspapers which consist of 15million articles and 1.1 million digitised images from 1804 to 1919
  4. First crowd source project involved working with Zooniverse to transcribe the place names on the Ordnance Survey’s six-inch to a mile maps c. 1900 Working with partners to expand to whole of UK
  5. The next project we worked on was a crowd sourcing project on Tithe Maps. Georeference Tithe maps from the 1800s Transcribe Apportionments The platform was developed by Klokan technologies Project coming to an end soon so get busy!
  6. More a volunteering project. We have a volunteer coordinator who organises projects and one of them was on transcribing shipping records.
  7. No digital images Take transcriptions on trust.
  8. File_221-1_vtls004662587 3 9 year olds Name: Richard E. James Age: 9 Born: Aberystwyth 1879-07-27 Joined: At sea Position: Cabin boy Stayed with them for 1 year and left to Antwerp Discharged N  
  9. One issue is that you have to take the transcriptions on trust. The items haven’t been digitised so you can’t check the quality of the transcription. So going forward we have been doing it slightly differently using Mirador
  10. May have already heard of Mirador but it is a tool developed by Stanford and Harford Universities and works with IIIF images.
  11. One thing it provides is an annotation tool for transcribing content.
  12. We’ve developed a annotation server that can be plugged into Mirador to store the transcriptions as linked open data in either a jena or seasme database.
  13. The first project to use it was for a project transcribing the Welsh WW1 book of rememberence containing a list of all Welsh soldiures who gave their lives in WW1. This was in collaboration with the Welsh center for international affairs. Contains information on Name, Rank, home town and Regiment or Ship serve red.
  14. And because the transcriptions are linked data it is possible to link them to other projects like the NLW wales at war project. This is another crowdsourcing effort working with school children to help them learn about the impacts of WW1. It asks them to add full biographies of soldiers including schools attended, birth dates etc.
  15. Because Mirador is built to use IIIF images it is possible to load different types of content for example this is facebook like tagging system for an image.
  16. And its also possible to import existing OCR and allow users to correct OCR.
  17. Going forward we are looking at running transcription projects with external partners including Aberystywth University on transcribing early student records.
  18. A project to transcribe latin manuscripts. One of these manuscripts is held by the British Library but it will be transcribed using Mirador hosted at the NLW.
  19. And finally a archival collection of WW1 tribunal records.
  20. Volunteers not unreasonably want to see results.
  21. 1843
  22. Tell stories through linked data