Introduction to Text Encoding and the TEI

Introduction to Text Encoding and the Text Encoding Initiative (TEI) Richard Wisneski Head, Bibliographic/Metadata Services Kelvin Smith Library Case Western Reserve University 2009-2010

[object Object],[object Object],[object Object],First, Some Ground Rules

[object Object],[object Object],[object Object],Sources to Consult

PART 1: Overview of Text Encoding

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],What Is Text Encoding?

Quick Example <lg> <head>After <del>an</del><add>the <del>unsolv’d</del></add> argument</head> <l><del>The</del><add><del>Coming in,</del> A group of</add> little children, and their <lb/>ways and chatter, flow in <del>upon me</del></l> <l>Like <add>welcome</add> rippling water o'er my <lb>heated <add>nerves and</add> flesh.</l> </lg>

[object Object],[object Object],[object Object],What Text Encoding Is NOT

[object Object],[object Object],[object Object],Why Do Text Encoding?

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Text Encoding Allows Users To…

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Who Does Text Encoding? Where Is It Found?

[object Object],[object Object],[object Object],[object Object],What Is the Text Encoding Initiative (TEI)?

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Text Encoding and XML

XML Documents Must Be: ,[object Object],[object Object],[object Object],[object Object]

XML Vocabulary ,[object Object],[object Object],[object Object],Element Attribute Value Content </titleStmt> Nested <titleStmt> is PARENT ELEMENT. <title> is the CHILD ELEMENT for <titleStmt>

[object Object],[object Object],[object Object],Quick Example

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Validity

Schema Examples ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

PART 3: Levels of TEI Encoding

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Five Levels

Level 1 Encoding: Fully Automated Conversion and Encoding ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Level 1 Encoding: Characteristics <div1> or <div> There should be only one child of <body>: a single <div> (or <div1>) <ab> There should be only one child of the <div> (or <div1>): a single <ab> wrapping all text OCR text. If the text is ever “upgraded” to a Level 3 or higher, the <ab> element will be replaced by structural elements like <p> and <table>. <pb> Required in Level 1. Page images can be linked to the text by specifying a jpeg or other image file as the value of the facs= attribute. Page numbers can be supplied with the n= attribute to record the number that is on the page. The Task Force sees the use of METS here as having a tremendous advantage. METS/TEI page turning documentation will be included in the near future.

Level 2 Encoding: Minimal Encoding ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Level 2 Encoding: Characteristics All elements specified in Level 1 plus the following: <front>, <back> Optional <div1> or <div> If no type= attribute is specified, a type= value of "section" should be presumed. <head> Required if present. <ab> At least one container element is required. <fw> Running heads; can be automatically generated

[object Object],[object Object],P5 Level 2 Encoding Template

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],P5 Level 2 Encoding Example

Level 3 Encoding: Simple Analysis ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Level 3 Encoding: Characteristics All elements specified in Levels 1 and 2 plus the following : <front>, <back> Required if present <div> Required if present; type attribute is recommended <floatingText> Recommended if present. <p> Required for paragraph breaks in prose. <lg> and <l> Required for identifying groups of lines and lines, respectively <list> and <item> May be used in this level to indicate ordered and unordered list structures <table>, <row>, and <cell> May be used to indicate table structures. <figure> Required to indicate figures other than page images <hi> Required to indicate changes in typeface; rend attribute is optional <note> All notes must be encoded. It is also recommended that notes that extend beyond one page be combined into one <note> element. Marginal notes, without reference, should occur at the beginning of the paragraph to which they refer, with the value of the place attribute as "margin"

Level 3 Encoding: General Recommendations ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Level 3 Encoding: Prose Example <TEI xmlns="http://www.tei-c.org/ns/1.0" xml:id="VAA2383"> <teiHeader> [stuff] </teiHeader> <text> <front> <div type="frontispiece">[figure]</div1> <titlePage>[text]</titlePage> <div type="dedication">[text]</div1> <div type="contents">[text]</div1> </front> <body> <div type="book"> <head>[book title]</head> <div type="chapter“> <pb n=“5” xml:id=“freear-p03” />[text] </div2> <div type="chapter"> <pb n=“12” xml:id=“freear-p12” />[text] </div2> <div type="chapter">[text]</div2> </div> </body> <back> <div type="appendix">[text]</div1> <div type="index">[text]</div1> </back> </text></TEI> Table of Contents:  <div type="contents"> <head>CONTENTS</head> <list type="simple"> <item>I. A Boy and His Dog <hi rend="right">3</hi> <ptr target="#freear-p03"/> </item> <item>II. Romance <hi rend="right">12</hi> <ptr target="#freear-p12"/> </item> </div>

Level 3 Encoding: Verse Example <TEI xmlns="http://www.tei-c.org/ns/1.0" xml:id="VAA2383"> <teiHeader> [stuff] </teiHeader> <text> <front> <titlePage>[text]</titlePage> <div type="dedication">[text]</div1> <div type="contents">[text]</div1> </front> <body> <div type="book"> <head>[book title]</head> <div type="part"> <head>[section title]</head> <div type="poem"> <head>THE DAYS GONE BY.</head> <lg> <l n="1">O the days gone by! O the days gone by!</l> <l n="2">The apples in the orchard, and the pathway through the rye;</l> <l n="3">The chirrup of the robin, and the whistle of the quail</l> <l n="4">As he piped across the meadows sweet as any nightingale;</l> </lg> <lg>[lines of poetry]</lg> <lg>[lines of poetry]</lg> </div> </div> </div> </body> </text> </TEI>

Level 4 Encoding: Basic Content Analysis ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Level 4 Encoding: Characteristics All elements specified in Levels 1, 2 and 3 plus the following : Et cetera; see TEI BPG Guidelines <titlePage> and child elements Required if present <group> Required to encode a collection of independent texts that are regarded as a single group for processing or other purposes <emph>, <foreign>, <gloss>, <term>, or <title> Recommended to identify typographically distinct text <epigraph>, <quote>, <said>, <mentioned>, or <soCalled> Recommended to represent speech, thought, quotation, etc. <sic>, <corr>, or <choice> Recommended to encode errors or typos. <add>, <del>, <gap>, and <unclear> Recommended to encode material that is omitted, added, marked for deletion, or is illegible, invisible, or inaudible <opener>, <dateline>, <salute> <closer>, <signed>, <postscript> Required to indicate specific parts of letters <sp>, <speaker>, and <stage> Required to encode different dramatic structures. <sp> and <speaker> Required to encode oral histories interviews

[object Object],Example of Level 4 Encoding

Level 5 Encoding: Scholarly Encoding Projects ,[object Object]

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Example: Variant Readings in Level 5 Apparatus; critical apparatus Lemma, or base text

General Recommendations ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

PART 4: Short Practice in Text Encoding

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],TEXT ETC . Chapter 1. The Kingdom of God. 1 Chapter 2. Lincoln-Hearted Men 9 Chapter 3. Taming the Wilderness 19

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Chapter Heading and Paragraph

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],P5 Level 2 Encoding

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],P5 Level 3 Encoding

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],P5 Level 3 Continued

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],P5 Level 4 Encoding

[object Object],[object Object],[object Object],[object Object],TEI Header

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Basic Components of TEI Header

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],TEI Header (continued)

Example: MARC to TEI Header ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Session 2: Text Encoding and the Text Encoding Initiative (TEI) Richard Wisneski Head, Bibliographic/Metadata Services Kelvin Smith Library Case Western Reserve University 2009-2010

PART 6: Some Common Practices in Text Encoding

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Using oXygen

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Common Practices (continued)

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Inserting Images <figDesc> is not required in Level 3, but we are using it to capture either an image caption or to describe the image if a caption is not present

[object Object],Footnote Encoded (and Marginalia) If this note were in the MARGIN of the page, it would be encoded, for example: <note type=“auth” place=“margin-left”> text, text,text </note> Type= and rend= attributes are optional

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Endnotes

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Encode with Pointer and Link

[object Object],[object Object],[object Object],[object Object],[object Object],Encoding Contextual Information

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Common Tags for Contextual Information

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Types of Contextual Information

[object Object],[object Object],[object Object],[object Object],[object Object],Personography

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Personography Encoding TEI header Participation description listPerson person

[object Object],[object Object],Placeography (Gazetteer)

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Placeography Encoding back div place

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Interpretative Keywords and Themes

[object Object],[object Object],[object Object],Future Trends in TEI

[object Object],[object Object],[object Object],Other Encoding Possibilities

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],References

Introduction to Text Encoding and the TEI

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Destaque

Destaque (6)

Semelhante a Introduction to Text Encoding and the TEI

Semelhante a Introduction to Text Encoding and the TEI (20)

Último

Último (20)

Introduction to Text Encoding and the TEI

Notas do Editor