Distributed “Forms of Attention”:
eMOP and the Cobre Tool
Anton duPlessis, Laura Mandell, James Creel, and
Alexy Maslov
Texas A&M University
DH2014
July 10, 2014
Introduction
Distributed Reading
Causer, T., J. Tonra, and V. Wallace. “Transcription Maximized; Expense
Minimized?: Crowdsourcing and editing The Collected Works of Jeremy Bentham.”
Literary and Linguistic Computing 27.2 (2012), pp. 119-137.
Causer, T., and V. Wallace. “Building a Volunteer Community: Results and Findings
from Transcribe Bentham.” Digital Humanities Quarterly 6.2 (2012).
http://www.digitalhumanities.org/dhq/vol/6/2/000125/000125.html.
Gibbs, Frederick W. “New Textual Traditions from Community Transcription.” Digital
Medievalist 7 (2011). http://www.digitalmedievalist.org/journal/7/gibbs/
Holley, Rose. “How Good Can It Get: Analysing and Improving OCR Accuracy in
Large Scale Historic Newspaper Digitisation Programs.” D-Lib Magazine 15.3/4
(2009).
---. “Many Hands Make Light Work.” March 2009. National Library of Australia. ISBN
978‐0‐642‐27694‐0
Guillory, John. “Close Reading: Prologue and Epilogue,”
ADE Bulletin 149 (2010): 8-14.
Hayles, N. Katherine. “Hyper and Deep Attention: The
Generational Divide in Cognitive Models,” Profession 2007:
187-199.
Commentary vs. Contribution
Bruno Latour
• Developed for Los Primeros Libros Project
• an international collaboration to digitize and provide access to 16th-century
New World imprints (1539–1600)
• http://primeroslibros.org
• http://libros.library.tamu.edu
• Create opportunities for academic investigation and instruction
• Interface leverages scrolling filmstrip view of tiled thumbnails
• Magnification and comparison tools facilitate detailed examination
• View and compare multiple exemplars of the same work, which would be
impossible with the physical books
• Compare state, emission, edition, etc. of an exemplar
• Examine variations in print, missing / obstructed text, missing / misnumbered / misbound /
damaged pages, fire marks, marginalia, and other copy-specific attributes
• Synchronous examination of multiple books permits parallel comparison
• Reading Tools
– Book View
– Reading View
– Detailed View
– Repository View
– Comparison View
• Quick Comparison View
• Annotations
– Structural
• table of contents
– Non-structural
• copy-specific features
– Transcription
• capability to view and correct the
OCR output of a text
• Editing Tools
– Basic
– Canonical
• abstract construct that permits
alignment of different exemplars
of the same work by leveraging
the structural metadata
– Frankenbook
• application of the canonical
construct: images drawn from any
exemplar(s) replace the
placeholders to create custom
editions via a drag-and-drop
method
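
The canonical construct and the Frankenbook can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the `PageImage` record, the structural labels, the exemplar names, and the file names are hypothetical, and a fixed preference order stands in for the drag-and-drop selection that the tool itself provides.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class PageImage:
    exemplar: str  # which physical copy the image was taken from (hypothetical name)
    label: str     # structural label shared across exemplars, e.g. "f. 1r"
    image: str     # file name of the page image (hypothetical)


def align(canonical: List[str],
          exemplars: Dict[str, List[PageImage]]) -> Dict[str, List[Optional[PageImage]]]:
    """Give every exemplar one slot per canonical label; None marks a missing page."""
    aligned = {}
    for name, pages in exemplars.items():
        by_label = {page.label: page for page in pages}
        aligned[name] = [by_label.get(label) for label in canonical]
    return aligned


def frankenbook(canonical: List[str],
                aligned: Dict[str, List[Optional[PageImage]]],
                preference: List[str]) -> List[Optional[PageImage]]:
    """Fill each canonical placeholder with the first available image,
    trying exemplars in the given preference order."""
    book = []
    for slot in range(len(canonical)):
        candidates = (aligned[name][slot] for name in preference)
        book.append(next((page for page in candidates if page is not None), None))
    return book


# The canonical construct: an abstract page sequence for the work.
CANONICAL = ["title page", "f. 1r", "f. 1v", "f. 2r"]

# Two exemplars of the same work, each lacking a different page.
EXEMPLARS = {
    "copy_a": [PageImage("copy_a", "title page", "a_0001.jp2"),
               PageImage("copy_a", "f. 1r", "a_0002.jp2"),
               PageImage("copy_a", "f. 2r", "a_0003.jp2")],
    "copy_b": [PageImage("copy_b", "f. 1r", "b_0001.jp2"),
               PageImage("copy_b", "f. 1v", "b_0002.jp2"),
               PageImage("copy_b", "f. 2r", "b_0003.jp2")],
}

ALIGNED = align(CANONICAL, EXEMPLARS)
BOOK = frankenbook(CANONICAL, ALIGNED, preference=["copy_a", "copy_b"])
```

Because both exemplars are aligned to the same canonical sequence, a page missing from one copy shows up as an empty slot that an image from another copy can fill.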
• A systematic workflow for getting EEBO and
ECCO content and metadata into Cobre
• Accept existing OCR text as transcriptions in
XML import
• Editors for human transcription/revision of
pages
• Addition of transcriptions to XML export
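
The import, revise, and export steps above can be sketched as a round trip in Python. The XML element and attribute names (`book`, `page`, `ocr`, `transcription`, `source`) are hypothetical and are not the actual Cobre, EEBO, or ECCO formats; the sketch only assumes that each page arrives with OCR text that becomes its initial transcription.

```python
import xml.etree.ElementTree as ET

# Hypothetical import document: one <page> per image, carrying raw OCR text.
SAMPLE_IMPORT = """<book id="example-001">
  <page n="1" image="0001.jp2"><ocr>Tbe firft page</ocr></page>
  <page n="2" image="0002.jp2"><ocr>The second page</ocr></page>
</book>"""


def import_pages(xml_text):
    """Read pages and accept the existing OCR text as the starting transcription."""
    root = ET.fromstring(xml_text)
    pages = []
    for page in root.iter("page"):
        pages.append({
            "n": page.get("n"),
            "image": page.get("image"),
            "transcription": page.findtext("ocr", default=""),
            "source": "ocr",
        })
    return root.get("id"), pages


def revise(pages, n, text):
    """Record a human correction for page number n."""
    for page in pages:
        if page["n"] == n:
            page["transcription"] = text
            page["source"] = "human"
            return
    raise KeyError(n)


def export_pages(book_id, pages):
    """Write the pages back out, now including their transcriptions."""
    root = ET.Element("book", id=book_id)
    for page in pages:
        element = ET.SubElement(root, "page", n=page["n"], image=page["image"])
        transcription = ET.SubElement(element, "transcription", source=page["source"])
        transcription.text = page["transcription"]
    return ET.tostring(root, encoding="unicode")
```

A typical use would be `book_id, pages = import_pages(SAMPLE_IMPORT)`, then `revise(pages, "1", "The first page")`, then `export_pages(book_id, pages)`.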
• DSpace does not support bitstream-level (i.e. file-level)
metadata at the level of detail required for
annotation and transcription.
• We include an additional bitstream that
contains metadata about the page-image
bitstreams – the Bitstream Metadata Bitstream (BMB)
• The BMB is an XML file with “chunks” that
describe one or more pages.
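
To make the idea of “chunks” concrete, here is a small Python sketch that reads a BMB-like file and indexes its chunks by page. The schema is invented for illustration: the element names `bmb`, `chunk`, and `page` and the attributes `type`, `label`, and `bitstream` are assumptions, not the actual BMB format.

```python
import xml.etree.ElementTree as ET

# Hypothetical BMB content: each chunk describes one or more page-image bitstreams.
SAMPLE_BMB = """<bmb>
  <chunk type="structural" label="Table of contents">
    <page bitstream="0003.jp2"/>
    <page bitstream="0004.jp2"/>
  </chunk>
  <chunk type="non-structural" label="Marginalia">
    <page bitstream="0004.jp2"/>
  </chunk>
</bmb>"""


def chunks_by_page(bmb_xml):
    """Map each page-image bitstream name to the (type, label) of every chunk covering it."""
    index = {}
    for chunk in ET.fromstring(bmb_xml).iter("chunk"):
        description = (chunk.get("type"), chunk.get("label"))
        for page in chunk.iter("page"):
            index.setdefault(page.get("bitstream"), []).append(description)
    return index
```

Keeping all page-level metadata in one extra file means the repository item itself needs no schema changes; a viewer only has to load that one bitstream and build an index like the one above.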
A file attached to the item
Example snippet of its contents
Invoked with a click
Administrative users can indicate whether a transcription is vetted as acceptable
Click this
And export these
Results of Usability Studies
Confusions

Editor's Notes

  1. Cobre = Comparative Book Reader
  2. Utilizes open source technologies to display JPF / JP2 images via a filmstrip metaphor. User-driven and iterative design, implementation, and testing approach.
  3. * Copies of the same title in separate windows… Cobre prevents this!