Crowdsourced
Manuscript Transcription
         Ben Brumfield
     Roots and Routes 2012
Not just crowdsourcing...
● Collaborative work
● Off-site solo work
● Private work
Not just manuscripts...
●   Maps
●   Textiles
●   Music
●   Flawed OCR
Not just transcription...
● Indexing
● Editing
● Identification

Counting seals on Arctic ice caps.
What it isn't
We'll concentrate on web-based tools for
extracting text from images, not addressing:
● Oral History
● Video
● Audio Transcription
● Image Manipulation
● Transcription/Facsimile Display

Tools do exist for these tasks, however.
Break
What materials are you working with outside of
modern, printed books and websites?
Origins (Approaches)
Two Approaches and one Dead End
● Indexing
● Editing
● Tagging
Indexing
●   Structured Data
●   Extracts from Text vs. Representing Text
●   Databases for Search and Analysis
●   Granular Quality Control
●   Gamification
Editing
●   Books, Diaries, Letters, Articles
●   Representing Text
●   Traditional Editorial Workflow
●   Digital or Print Editions
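
The contrast between the two approaches can be sketched in code. This example is illustrative only: the menu text, the `IndexEntry` fields, and the parsing rule are assumptions made for this sketch, not the data model of any tool covered here.

```python
import re
from dataclasses import dataclass

# Hypothetical transcription of a menu page (invented sample text).
PAGE_TEXT = """Oysters on the half shell  .25
Consomme en tasse  .15
Roast beef, au jus  .60"""


@dataclass
class IndexEntry:
    """Indexing: one structured record extracted from the page."""
    dish: str
    price: float


def index_page(text: str) -> list[IndexEntry]:
    """Extract dish/price records; anything that does not match is dropped."""
    entries = []
    for line in text.splitlines():
        # Assumed layout: dish name, two or more spaces, then a price.
        match = re.match(r"^(.*?)\s{2,}(\.\d+|\d+\.\d+)$", line)
        if match:
            entries.append(IndexEntry(match.group(1), float(match.group(2))))
    return entries


# Indexing yields records that a database can search and total.
entries = index_page(PAGE_TEXT)
total = sum(entry.price for entry in entries)

# Editing keeps the text itself, line for line, as the product.
transcript = PAGE_TEXT
```

The indexing output can be sorted, counted, and checked field by field, while the edited transcript preserves wording and line order that the records discard.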
Tagging
● Too small
● Too imprecise
Origins (Traditions)
●   OCR Correction
●   Documentary Editing
●   Genealogy
●   Natural Science
●   Astronomy

Split this into 5 slides
Online Tools
● Recent (none older than 2005)
● Influenced by origin
● Still pretty raw
● Most require tech expertise for set-up and
  customization
● All require making trade-offs
Lab Session 1: Breadth
NYPL What's on the Menu
  Indexing

Wikisource
  Editing
Selection Factors
●   Source Material
●   Transcript Purpose
●   Organizational/Project Management Fit
●   Financial and Technical Resources
Source Material
Evaluating your source material:
● Is it of interest to anyone else?
● Is it under copyright?
● Does it need restricted access?
● Is it composed of documents or records?
● Is it non-textual?
● How complex is the layout? How important
  is that layout?
Purpose
How will you be using the transcribed data?
● Traditional print editions
● Searchable online editions
● Do you want to use the system to analyze
  the text?
● How do you want to analyze the text?
● Is public engagement a goal?
● Should the transcripts be open?
Organizational/Project Management Fit

● How important is traditional editorial
  workflow?
● Will you rely on volunteers? How will you
  motivate them?
● What is the duration of the project?
● Is there a "final version"?
● Is TEI a mandate?
Financial and Technical Resources
Do you have or need:
● System administrators to install non-hosted
  software?
● Money to pay hosting costs?
● Programming skills to customize a tool?
● Money to pay programmers for
  customization?
● Support for on-going costs to keep the site
  running, however small?
Lab Session 2: Markup Options
FromThePage

TranscribeBentham
Technical Questions to Answer
● Where are the images now?
● How do images get into the system?
● How do transcripts get out of the system?
● How mature is the underlying technology?
● How configurable is the technology?
● How does the system work with the public
  face of your project?
● Where does the metadata live?
● Who will maintain this? How long?
● How many sites are using this system?
Wikisource
Pro:
● MediaWiki plus its add-on modules (e.g.
  print-on-demand, export).
● Wikimedia community.
● Incredibly mature.
Con:
● Wikimedia policy.
● Public editing.
● Limited mark-up.
Bentham Transcription Desk
Pro:
● MediaWiki is very mature.
● TEI Toolbar (can also be used on other
  systems)
● Deployed outside original project.

Con:
● Development efforts halted.
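
For readers who have not seen TEI, here is a minimal sketch of the kind of markup a TEI toolbar inserts. It uses the standard TEI elements `del`, `add`, and `unclear`. The sentence and the helper name `tei_paragraph` are invented for illustration and are not taken from the Bentham project, and the result is a fragment rather than a complete, namespaced TEI document.

```python
import xml.etree.ElementTree as ET


def tei_paragraph() -> str:
    """Build a paragraph recording a deletion, an addition, and an unclear reading."""
    p = ET.Element("p")
    p.text = "The "

    deleted = ET.SubElement(p, "del")      # text struck out by the writer
    deleted.text = "greatest"
    deleted.tail = " "

    added = ET.SubElement(p, "add")        # text inserted by the writer
    added.text = "general"
    added.tail = " happiness of the "

    unclear = ET.SubElement(p, "unclear")  # transcriber is unsure of the reading
    unclear.text = "community"
    unclear.tail = "."

    return ET.tostring(p, encoding="unicode")


print(tei_paragraph())
```

A toolbar spares volunteers from typing these tags by hand: they select a word and click a button for the matching element.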
Scripto
Pro:
● Team at CHNM has a great track record.
● Your CMS is your public face.
● MediaWiki is very mature.
● Deployed and under active development.

Con:
● Your CMS handles all metadata.
● Mark-up is extremely limited.
FromThePage
Pro:
● Designed for intensive editing and indexing.
● Semantic mark-up and analysis.
● Hosting available.

Con:
● Single developer (me).
● No TEI mark-up.
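
One way semantic mark-up can feed an index is sketched below. The double-bracket `[[Canonical Name|text as written]]` link syntax is an assumption modeled on wiki links, and the sample pages and function names are hypothetical rather than FromThePage's actual code.

```python
import re
from collections import defaultdict

# Matches [[Subject]] or [[Subject|text as written]] (assumed syntax).
LINK = re.compile(r"\[\[([^\]|]+)(?:\|([^\]]+))?\]\]")


def build_index(pages: dict[str, str]) -> dict[str, list[str]]:
    """Map each canonical subject to the pages that mention it."""
    index = defaultdict(list)
    for page_id, text in pages.items():
        for match in LINK.finditer(text):
            subject = match.group(1).strip()
            if page_id not in index[subject]:
                index[subject].append(page_id)
    return dict(index)


def plain_text(text: str) -> str:
    """Strip the mark-up, keeping the words as written on the page."""
    return LINK.sub(lambda m: m.group(2) or m.group(1), text)


# Invented sample pages.
pages = {
    "p1": "Went to town with [[John Smith|Mr. Smith]] to buy [[tobacco]].",
    "p2": "[[John Smith|J. S.]] came by in the evening.",
}
index = build_index(pages)
```

Linking the variant spellings "Mr. Smith" and "J. S." to one canonical subject is what lets a single index entry collect every mention, while `plain_text` still recovers the verbatim reading.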
Islandora TEI Editor
Caveat: I don't know much about this tool or
this team.

● Based on Drupal and Fedora
● Supports TEI via friendly interface
● Many Drupal-based projects considering it.
T-PEN
Caveat: I don't know much about this tool.

●   Designed for medieval manuscripts.
●   Supports TEI natively.
●   Line-by-line interface.
●   Hosted version available.
Scribe
Pro:
● Excellent for complex layout or
  non-documentary transcription.
● Zooniverse team is large, well-funded,
  experienced.
● Configurable.
Con:
● No automated tool for loading images or
  viewing transcript database (yet!)
● No concept of image-as-a-text.
Pybossa
Caveat: I don't know much about this tool or
this team.

● Open Knowledge Foundation's
  crowdsourcing task management tool.
● Designed for tabular data.
● Google Spreadsheet data entry.
● Extremely young.
TextLab
Caveat: I don't know much about this tool or
this team.

● Melville Electronic Library.
● Direct addition of TEI tags to image.
Lab Session 3: Configuration
Scribe
  Old Weather,
  What's the Score,
  Development deployments
Find me
                Ben Brumfield
           benwbrum@gmail.com
 http://manuscripttranscription.blogspot.com/
                @benwbrum
