Consider the Source
Textual criticism and digital techniques

Wednesday, March 18, 2009

How do we know what happened?


People aren’t machines


• the scriptorium is cold
• the food is bad
• the tea ladies are unfriendly
• they just don’t want to be there anymore


• Modern historical approaches are a recent thing
• Not unheard of before, but not standard
• This profoundly affects the way in which histories have reached us


600 years later


• a bunch of error-ridden copies
• or only a few error-ridden copies
• or only a name check in a book about something else


Textual criticism


Apparatus example
St. Stephenʼs Church in Nijmegen
Nobilis itaque comes Otto imperio et dominio Novimagensi sibi, ut praefer-
tur, impignoratis et commissis proinde praeesse cupiens, anno liiii superius 1254
descripto, mense Iunio, una cum iudice, scabinis ceterisque civibus civitatis
Novimagensis, pro ipsius et inhabitantium in ea necessitate, commodo et utili-
5 tate, ut ecclesia eius parochialis extra civitatem sita destrueretur et infra muros
transferretur ac de novo construeretur, a reverendo patre domino Conrado de
Hofsteden, archiepiscopo Coloniensi, licentiam, et a venerabilibus dominis de-
cano et capitulo sanctorum Apostolorum Coloniensi, ipsius ecclesiae ab antiquo
veris et pacificis patronis, consensum, citra tamen praeiudicium, damnum aut
10 gravamen iurium et bonorum eorundem, impetravit.
Et exinde liberum locum eiusdem civitatis qui dicitur Hundisbrug, de prae-
libati Wilhelmi Romanorum regis, ipsius fundi domini, consensu, ad aedifican-
dum et consecrandum ecclesiam et coemeterium, eisdem decano et capitulo de
expresso eiusdem civitatis assensu libera contradiderunt voluntate, obligantes
15 se ipsi comes et civitas dictis decano et capitulo, quod in recompensationem
illius areae infra castrum et portam, quae fuit dos ecclesiae, in qua plebanus
habitare solebat—quae tunc per novum fossatum civitatis est destructa—aliam
aream competentem et ecclesiae novae, ut praefertur, aedificandae satis conti-
guam, ipsi plebano darent et assignarent. Et desuper apud dictam ecclesiam
20 sanctorum Apostolorum est littera sigillis ipsorum Ottonis comitis et civitatis
Novimagensis sigillata.
3 p. 227 R 4 p. 97 N 6 p. 129 D 12 f. 72v M 13 p. 228 R 20 p. 130 D
2 proinde ] primum D
5 ecclesia eius ] ecclesia D: eius eius H   extra civitatem om. H   infra ] intra D
6 transferretur ] transferreretur NH
7 Hofsteden ] Hostede D: Hosteden H   Coloniensi ] Colononiensi H   dominis ] viris H
8 Coloniensi ] Coloniae H
10 iurium ] virium D
11 liberum ] librum H   qui ] quae D   Hundisbrug ] Hundisburch D: Hunsdisbrug R
12 regis ] imperatoris D
13 et consecrandum om. H   eisdem ] eiusdem D
15 comes ] comites D   dictis om. H
17 tunc ] nunc H
18 ut ... aedificandae om. H
18–19 contiguam ] contiguum M
19 apud om. H
20 est ] et H   littera ] litteram H
21 Novimagensis ] Novimagii D   sigillata ] sigillis communita H


Who needs this and why?

• Historians look for one thing
• Linguists look for other things
• Others will be interested too


The Chronicle of Matthew of Edessa


How to make an edition


Surviving manuscripts

• Oldest full manuscript is Venice 887
• Next oldest is Vienna 574
• 24 of 42 (4 of 6 fragments) copied before 1700


Extant manuscripts of the Chronicle
[Bar chart: number of manuscripts and fragments by century of copying (pre-16th, 16th, 17th, 18th, 19th); vertical axis from 0 to 28]


Two manuscript groups

Group 1: like Venice 887
• Text generally complete (to 1162)
• Transmitted with the Life of St. Nerses (Mesrop the Priest)

Group 2: like Vienna 574
• Text truncated near the year 1096/7
• Transmitted with a specific long sequence of texts


Matenadaran 1896


• Copied in 1689
• Uniquely preserves two passages of text
• These lacunae known to other copyists
• But lots of manuscripts are older. Hm.


Making the edition


• Transcription
• Collation and text analysis
• Editing the text
• Publication


Digital techniques


Transcription


• The most time-consuming part
• Ideal solution would be optical character recognition (OCR)
• No OCR for manuscripts, yet


Start with a manuscript

Into plain text
դ. Թուխտ սիրոյ և միաբանութեան, շարագրեցեալ կղէմէս աստուածաբան վարդապետէ։
Առաջաբանութիւն։

Նա զի արդ գրիչս իմ անյարմարս կարօղ լինիցի երբէկ պատմագրիլ, ըստ պատշաճի
զմեծամեծ յիշելիսն։ Որք ի վաղ ժամանակի անտի պատահեցան յեկեղեցին հայոց. և զի
արդ մեք անհընագէտս յանձնառնցուք ճառել զանցեալ ծածկագոյն խորհուրդս այլասեռ
ազգի, մինչև ցայժմ ո՜չ կարաց Ֆրանկ պատմիչ ոք զբուռն հարել ի սոյնպիսի օտար
պատմագրութիւնս։ Բայց սակայն յուսացեալ ի յօգնութիւն սրբոյ աստուածածնին
յօժարապէտս ախորժեսցուք համարձակիլ և ի յայս անհոռն ծովս մտանել...


XML solution: TEI
<!-- ... -->
<div n="4">
<head><hi rend="red">Թուխտ սիրոյ և միաբանութե<ex>ան</ex>, շարագրեց<lb/>
եալ կղէմէս ա<ex>ստուա</ex>ծաբան վարդապետէ։<lb/>
Առաջաբանութիւն։ </hi>
</head>
<p><hi rend="ornament">Ն</hi><hi rend="red">ա զի արդ գրիչս իմ անյարմարս</hi> <lb/>
կարօղ լինիցի երբէկ պատմագրիլ, <expan>ըստ</expan> <lb/>
պատշաճի զմեծամեծ յիշելիսն։ Որք <lb/>
ի վաղ ժամանակի անտի պատահեցան <lb/>
յեկեղեցին հայոց. և զի արդ մեք անհընագէտս <lb/>
յանձնառնցուք ճառել զանցեալ ծածկագոյն <lb/>
խորհուրդս այլասեռ ազգի, մինչև ցայժմ ո՜չ <lb/>
կարաց Ֆրանկ պատմիչ ոք զբուռն հարել <lb/>
ի սոյնպիսի օտար պատմագրութի<ex>ւն</ex>ս։ Բայց սա<lb/>
կայն յուսացեալ ի յօգնութի<ex>ւն</ex> սրբոյ ա<ex>ստուա</ex>ծածնին <lb/>
յօժարապէտս ախորժեսցուք համարձակիլ և ի <lb/>
յայս անհոռն ծովս մտանել։ ... <lb/>
</p>
</div>
<!-- ... -->


Better TEI XML text
<!-- ... -->
<div n="4">
<head rend="red"><w>Թուխտ</w> <w>սիրոյ</w> <w>և</w> <w>միաբանու<ex>թեան</ex>,</w> <w>շարագրեց<lb/>եալ</w> <w>կղէմէս</w>
<w><ex>աստուած</ex>աբան</w> <w>վարդապետէ։</w><lb/>
<w>Առաջաբանութիւն։</w>
</head>

<p><w><hi rend="ornament">Ն</hi><hi rend="red">ա</hi></w> <hi rend="red"><w>զի</w> <w>արդ</w> <w>գրիչս</w> <w>իմ</w>
<w>անյարմարս</w></hi> <lb/>
<w>կարօղ</w> <w>լինիցի</w> <w>երբէկ</w> <w>պատմագրիլ<c type="punct">,</c></w> <w><expan>ըստ</expan></w> <lb/>
<w>պատշաճի</w> <w>զմեծամեծ</w> <w>յիշելիսն<c type="punct">։</c></w> <w>Որք</w> <lb/>
<w>ի</w> <w>վաղ</w> <w>ժամանակի</w> <w>անտի</w> <w>պատահեցան</w> <lb/>
<w>յեկեղեցին</w> <w>հայոց<c type="punct">.</c></w> <w>և</w> <w>զի</w> <w>արդ</w> <w>մեք</w> <w>անհընագէտս</w> <lb/>
<w>յանձնառնցուք</w> <w>ճառել</w> <w>զանցեալ</w> <w>ծածկագոյն</w> <lb/>
<w>խորհուրդս</w> <w>այլասեռ</w> <w>ազգի<c type="punct">,</c></w> <w>մինչև</w> <w>ցայժմ ո<c type="punct">՜</c>չ</w> <lb/>
<w>կարաց</w> <w>Ֆրանկ</w> <w>պատմիչ</w> <w>ոք</w> <w>զբուռն</w> <w>հարել</w> <lb/>
<w>ի</w> <w>սոյնպիսի</w> <w>օտար</w> <w>պատմագրու<ex>թիւնս</ex><c type="punct">։</c></w> <w>Բայց</w> <w>սա<lb/>կայն</w>
<w>յուսացեալ</w> <w>ի յօգնու<ex>թիւն</ex></w> <w>սրբոյ</w> <w><ex>աստուած</ex>ածնին</w> <lb/>
<w>յօժարապէտս</w> <w>ախորժեսցուք</w> <w>համարձակիլ</w> <w>և</w> <w>ի</w> <lb/>
<w>յայս</w> <w>անհոռն</w> <w>ծովս</w> <w>մտանել<c type="punct">։</c></w> ... <lb/>
</p>
</div>
<!-- ... -->
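
The step from plain text to word-level tagging can be sketched roughly as follows. This is a minimal illustration in Python, not the tooling used for the edition: the function names `tag_word` and `tag_line` are hypothetical, and the punctuation set is an assumption covering only the marks visible in the sample. It does not handle line breaks inside words, abbreviations, or marks such as ՜ that sit in the middle of a word.

```python
import re
from xml.sax.saxutils import escape

# Assumption: only these trailing marks count as punctuation
# (comma, full stop, Armenian full stop).
TRAILING_PUNCT = re.compile(r"^(.*?)([,.։]+)$")


def tag_word(token):
    """Wrap one whitespace-delimited token in <w>, tagging trailing punctuation."""
    match = TRAILING_PUNCT.match(token)
    if match:
        body, punct = match.groups()
        marks = "".join(f'<c type="punct">{escape(p)}</c>' for p in punct)
        return f"<w>{escape(body)}{marks}</w>"
    return f"<w>{escape(token)}</w>"


def tag_line(line):
    """Tag every token of a plain-text line, keeping single spaces between words."""
    return " ".join(tag_word(token) for token in line.split())


if __name__ == "__main__":
    print(tag_line("յեկեղեցին հայոց. և զի"))
```

Keeping punctuation inside the `<w>` element but in its own `<c>` lets later steps compare words while ignoring punctuation.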


Perl to the rescue #1

• XML is a terrible thing to edit
• I want a transcription markup that I can convert to TEI XML later
• Not a solution you’ll like, but I’ll show it to you anyway


<seg type="word">հ<subst><del>այ</del><add place="overwrite"><ex resp="#tla">ո</ex>ռ<ex resp="#tla">ո</ex>մ</add></subst>ոց.</seg>


հ±-այ-+(overwrite)ոռոմ+ոց


TEI markup
[172]
զօ՛րացն և զօրավարացն և ազգն հոռոմոց իւրոց քաջութեան զան
դարձ փաղչելն արարին պարծանք նմանեացն վատ՛ հովուաց,
ո՛ր յորժամ զգայլն տեսանէ փաղչի, սակայն հոռոմք յան
ջանս ջանացին, ո՛ր լուր զպարիսպ ամրութեան տանս հայոց
քակեա՛լ կործանեցին, և զպարսիկք ի վերայ արձակեցին սրով, և
զամենայն յաղթու՛թիւնն իւրոց համարեցան, և ինքեանք անպատկառելի
երեսօք, կուրտ՛ զօրավարք, և ներքինի զօրօք զհայ՛ք պահել
ջանա+յ+ին, մինչև պարսիկք յան±-(blot)տ-+տ+±էր տեսին զ^ամենայն^ արևելք.
և յայնժամ մեծաւ՛ զօրութեամբ զօրացնն այ՛լազգիքն, որ ի
մէկ տարո՛յ հասան մինչև ի դու՛ռն կոստանդնուպօլիս, և
առին զամենայն աշխարհն ±-հայոց-+(overwrite)հոռոմոց+±, զքաղաքս
ծովեզերաց և զկղզիս նոցա,
և արա՛րին զազգն յունաց որպէս զբա՛նդարգեալս ի ներս
ի կոստանդնուպօլիս. և յորժամ առա՛ւ հայք ի յունաց, ար
գելաւ՛ ամենայն չարութիւնն հոռոմո՛ց, յազգէն հայոց, և զկնի այսօր
իկ հնարեցան այ՛լ կերպիւ պատերազմ յարուցանեալ ^ընդ^ ազգն
հա՛յոց, նստան ի քննութիւն հաւ՛ատոյ, և այսու ատեա՛լ
անարգեցին զհանդէս պատերազմի և զօրմարտի, և զկռիւս և
զաղմու՛կս յեկեղեցի աստուծոյ կարգեալ հաստատեցին. ի պարսից
պատերազմէն յօժարութեամբ փախչին, և զամենայն ճշմարիտ
հաւատացեալքս
քրիստոսի ի հաւատոյն ջանան խափանել և խաղխտել, վասն զի յորժամ
այր քաջ զօրաւ՛որ գտանէին, զաչսն խաւարեցուցանէին, և կամ
ի ծով ձգեա՛լ խեղդամահ սատակէին. և այ՛ն էր փո՛յթ յօժարութեան


TEI markup

հ±-այ-+(overwrite)ոռոմ+ոց,

<seg type="word">
հ<subst><del>այ</del>
<add place="overwrite">
<ex resp="#tla">ո</ex>ռ
<ex resp="#tla">ո</ex>մ</add>
</subst>ոց,</seg>
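
A conversion of this kind might look roughly like the sketch below. The talk's own converter was written in Perl and is not shown; this Python version is an illustration only. It assumes the shorthand `±-DELETED-+(PLACE)ADDED+`, optionally closed by a second `±`, as inferred from the examples on these slides, and the function names are hypothetical. It does not generate the `<ex resp="#tla">` expansions, which would need a separate step.

```python
import re
from xml.sax.saxutils import escape, quoteattr

# Assumed shorthand: ±-DELETED-+(PLACE)ADDED+ with an optional closing ±.
SUBST = re.compile(
    r"±-(?P<deleted>[^-+±]*)-\+(?:\((?P<place>[^)]*)\))?(?P<added>[^+±]*)\+±?"
)


def subst_to_tei(match):
    """Turn one shorthand substitution into a TEI <subst> element."""
    deleted = escape(match.group("deleted"))
    added = escape(match.group("added"))
    place = match.group("place")
    place_attr = f" place={quoteattr(place)}" if place else ""
    return f"<subst><del>{deleted}</del><add{place_attr}>{added}</add></subst>"


def word_to_tei(word):
    """Wrap a transcribed word in <seg type="word">, expanding substitutions."""
    parts = []
    pos = 0
    for match in SUBST.finditer(word):
        parts.append(escape(word[pos:match.start()]))  # text before the correction
        parts.append(subst_to_tei(match))
        pos = match.end()
    parts.append(escape(word[pos:]))  # text after the last correction
    return '<seg type="word">' + "".join(parts) + "</seg>"


if __name__ == "__main__":
    print(word_to_tei("հ±-այ-+(overwrite)ոռոմ+ոց"))
```

The point of the shorthand is that the transcriber types a few marker characters and leaves the verbose, nesting-sensitive XML to the script.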


TEI markup
հոռոմոց իւրոց քաջութեան

<seg type="word">հ
<ex resp="#tla">ո</ex>ռ
<ex resp="#tla">ո</ex>մոց</seg>
<seg type="word">իւր
<ex resp="#tla">ո</ex>ց</seg>
<seg type="word">ք
<ex resp="#tla">ա</ex>ջ
<ex resp="#tla">ո</ex>ւ
<ex resp="#tla">թ</ex>ե
<ex resp="#tla">ան</ex></seg>


TEI text: what now?


Collation
"The collation of manuscripts requires the infuriating accuracy of a pedant and the obsessive stamina of an idiot. It is therefore an ideal task for a computer."
—Peter Robinson, “Collation and Textual Criticism”, LLC vol. 4 no. 2, 1989


• need to align words with each other
• ...across many manuscripts
• ...even when the words aren’t exactly the same (e.g. “յաշխարհին” vs. “աշխարհն”)
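
One way to approach fuzzy alignment of two witnesses is a standard sequence alignment in which near-identical words count as matches. The sketch below is an illustration in Python, not the collation tool used for the edition. It uses Needleman-Wunsch alignment with `difflib.SequenceMatcher` as the similarity test; the 0.6 threshold and the three-word sample phrase are assumptions made up for the example.

```python
from difflib import SequenceMatcher


def similar(a, b, threshold=0.6):
    """Treat two words as 'the same' if their character overlap is high enough."""
    return SequenceMatcher(None, a, b).ratio() >= threshold


def align(base, other):
    """Align two word lists; None marks a word missing from one witness."""
    n, m = len(base), len(other)
    # score[i][j] = best score for aligning base[:i] with other[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = -i
    for j in range(1, m + 1):
        score[0][j] = -j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = 1 if similar(base[i - 1], other[j - 1]) else -1
            score[i][j] = max(
                score[i - 1][j - 1] + step,  # pair the two words
                score[i - 1][j] - 1,         # gap in `other`
                score[i][j - 1] - 1,         # gap in `base`
            )
    # Walk back from the bottom-right corner to recover the pairing.
    rows = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            step = 1 if similar(base[i - 1], other[j - 1]) else -1
            if score[i][j] == score[i - 1][j - 1] + step:
                rows.append((base[i - 1], other[j - 1]))
                i, j = i - 1, j - 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] - 1:
            rows.append((base[i - 1], None))
            i -= 1
        else:
            rows.append((None, other[j - 1]))
            j -= 1
    rows.reverse()
    return rows


if __name__ == "__main__":
    # Made-up sample phrase built around the example pair above.
    for pair in align(["ի", "յաշխարհին", "հայոց"], ["աշխարհն", "հայոց"]):
        print(pair)
```

Aligning many manuscripts at once is harder than this pairwise case; one simple option is to align each witness against a common base text in turn.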


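A | B | C | D | E
յայսմ | այս | յայսմ | այս | այս
ամենայն | ամենայն | ամի | ամենայն | ամենայն
եղելոցն, | եղելոց | եղելոց | եղելոց | եղելոցս
նստուցանեն | նստուցանեն | նստուցանեն | նստուցանեն | նստուցանեն
զաթոռ | զաթոռ | յաթոռ | զաթոռ | զաթոռ
հայրապետութեան | հայրապետութեան | հայրապետութեան | հայրապետութեանն | հայրապետութեան
ի | om. | ի | ի | om.
թաւբլուր | om. | թաւաբլուրն։ | թաւբլուր | om.
եւ | om. | եւ | եւ | om.
կացեալ | om. | կացեալ | կացեալ | om.
անդ | om. | անդ | անդ | om.
զամս | om. | զամս | զամս | om.
գ | om. | գ, | գ | om.
եւ | om. | եւ | եւ | om.
ընդ | om. | ընդ | ընդ | om.
ամենայն | om. | ամենայն | ամենայն | om.
զ | om. | զ | վեց | om.
ամ | om. | ամ, | ամ | om.
կալեալ | om. | կալեալ | կալեալ | om.
զաթոռ | om. | զաթոռ | զաթոռ | om.
հայրապետութեանն | om. | հայրապետութեան | հայրապետութեանն | om.
տէր | տէր | տէր | տէր | զտէր
խաչիկ։ | խաչիկ։ | խաչիկն։ | խաչիկ։ | խաչիկ։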


<!-- ... -->
<p>
<w><app>
<rdg wit="#A #C">յայսմ</rdg>
<rdg wit="#B #D #E">այս</rdg>
</app></w>
<w><app>
<rdg wit="#A #B #D #E">ամենայն</rdg>
<rdg wit="#C">ամի</rdg>
</app></w>
<w><app>
<rdg wit="#A">եղելոցն</rdg>
<rdg wit="#B #C #D">եղելոց</rdg>
<rdg wit="#E">եղելոցս</rdg>
</app></w>
<w>նստուցանեն</w>
<w type="prefix"><app>
<rdg wit="#A #B #D #E">զ</rdg>
<rdg wit="#C">յ</rdg>
</app></w>
<w>աթոռ</w>
<w>հայրապետութեան</w>
<w><app>
<rdg wit="#A #C #D">ի</rdg>
</app></w>
<!-- ... -->
</p>
<!-- ... -->


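<!-- ... -->
<p>
<w><app>
<rdg wit="#A #C">յայսմ</rdg>
<lem wit="#B #D #E">այս</lem>
</app></w>
<w><app>
<lem wit="#A #B #D #E">ամենայն</lem>
<rdg wit="#C">ամի</rdg>
</app></w>
<w><app>
<lem wit="#A">եղելոցն</lem>
<rdg wit="#B #C #D">եղելոց</rdg>
<rdg wit="#E">եղելոցս</rdg>
</app></w>
<w>նստուցանեն</w>
<w type="prefix"><app>
<lem wit="#A #B #D #E">զ</lem>
<rdg wit="#C">յ</rdg>
</app></w>
<w>աթոռ</w>
<w>հայրապետութեան</w>
<w><app>
<lem wit="#A #C #D">ի</lem>
</app></w>
<!-- ... -->
</p>
<!-- ... -->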


Our text apparatus

1  այս ամենայն եղելոցն նստուցանեն զաթոռ
2  ի թաւբլուր,

1 այս] յայսմ AC   1 ամենայն] ամի C   1 եղելոցն] եղելոց BCD եղելոցս E
1 զաթոռ] յաթոռ C   2 ի թաւբլուր] om. BE
...


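<!-- ... -->
<p>
<w><app>
<lem wit="#A #C">յայսմ</lem>
<rdg wit="#B #D #E">այս</rdg>
</app></w>
<w><app>
<rdg wit="#A #B #D #E">ամենայն</rdg>
<lem wit="#C">ամի</lem>
</app></w>
<w><app>
<lem wit="#A">եղելոցն</lem>
<rdg wit="#B #C #D">եղելոց</rdg>
<rdg wit="#E">եղելոցս</rdg>
</app></w>
<w>նստուցանեն</w>
<w type="prefix"><app>
<lem wit="#A #B #D #E">զ</lem>
<rdg wit="#C">յ</rdg>
</app></w>
<w>աթոռ</w>
<w>հայրապետութեան</w>
<w><app>
<lem wit="#A #C #D">ի</lem>
</app></w>
<!-- ... -->
</p>
<!-- ... -->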


New text apparatus

1  յայսմ ամի եղելոցն նստուցանեն զաթոռ
2  ի թաւբլուր,

1 յայսմ] այս BDE   1 ամի] ամենայն ABDE   1 եղելոցն] եղելոց BCD եղելոցս E
1 զաթոռ] յաթոռ C   2 ի թաւբլուր] om. BE
...
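
Once the readings are stored per witness, changing the lemma and regenerating the apparatus entry is mechanical. The following Python sketch illustrates the idea under stated assumptions: the function name `apparatus_entry` and the dictionary layout are hypothetical, the witness readings are taken from the collation shown earlier, and `None` stands for an omission.

```python
def apparatus_entry(line_no, readings, lemma):
    """Format one apparatus entry: 'LINE LEMMA] VARIANT SIGLA ...'.

    readings maps a witness siglum to its reading (None = omitted).
    Returns None when every witness agrees with the lemma.
    """
    variants = {}
    for siglum in sorted(readings):
        reading = readings[siglum]
        label = "om." if reading is None else reading
        if label != lemma:
            # Group witnesses that share the same variant reading.
            variants.setdefault(label, []).append(siglum)
    if not variants:
        return None
    listed = " ".join(
        f"{label} {''.join(sigla)}" for label, sigla in variants.items()
    )
    return f"{line_no} {lemma}] {listed}"


if __name__ == "__main__":
    first_word = {"A": "յայսմ", "B": "այս", "C": "յայսմ", "D": "այս", "E": "այս"}
    print(apparatus_entry(1, first_word, "այս"))    # lemma as in "Our text"
    print(apparatus_entry(1, first_word, "յայսմ"))  # lemma as in "New text"
```

The editor's decision is only the choice of lemma; the list of dissenting witnesses follows from the collation.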


Manuscript stemmas: the family tree


Stemma construction

• Better stemma through analysis of collation results
• Borrows statistical models from evolutionary biology
• “Maximum parsimony” based upon DNA of specimens
• Manuscripts are specimens
• Biologists have DNA sequences; we have words.


A B A B B
A A B A A
A B B B C
A A A A A
A A B A A
A A A B A
A O A A O
A O B A O
A O A A O
A O A A O
A O A A O
A O A A O
A O A A O
A O A A O
A O A A O
A O A A O
A O A B O
A O A A O
A O A A O
A O A A O
A O B A O
A A A A B
A A B A A
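
A matrix like this can be derived from the collation and then scored against candidate trees. The sketch below is a Python illustration, not the software used for the edition. It assumes that each row of the matrix is one aligned word, each column one witness, letters mark distinct readings in order of first appearance, "O" marks an omission, and punctuation is ignored. The scoring uses Fitch's small-parsimony count on a fixed binary tree; the two tree shapes in the example are hypothetical, and omissions are simply treated as one more state.

```python
STATE_LETTERS = "ABCDEFGHIJKLMNPQRSTUVWXYZ"  # 'O' is reserved for omissions
PUNCT = ",.։"  # assumption: punctuation ignored when comparing readings


def encode_row(readings):
    """Encode one collation row as state letters, e.g. 'A B A B B'."""
    seen = {}
    out = []
    for reading in readings:
        if reading is None:
            out.append("O")
            continue
        key = reading.strip(PUNCT)
        if key not in seen:
            seen[key] = STATE_LETTERS[len(seen)]
        out.append(seen[key])
    return " ".join(out)


def fitch_changes(tree, states):
    """Minimum number of changes one character needs on a binary tree (Fitch)."""
    changes = 0

    def visit(node):
        nonlocal changes
        if isinstance(node, str):          # leaf: a witness siglum
            return {states[node]}
        left, right = (visit(child) for child in node)
        common = left & right
        if common:
            return common
        changes += 1                       # children disagree: one change needed
        return left | right

    visit(tree)
    return changes


def parsimony_score(tree, sigla, matrix_rows):
    """Sum the Fitch counts over all rows; lower means a more parsimonious tree."""
    return sum(
        fitch_changes(tree, dict(zip(sigla, row.split()))) for row in matrix_rows
    )


if __name__ == "__main__":
    rows = [
        encode_row(["յայսմ", "այս", "յայսմ", "այս", "այս"]),
        encode_row(["ամենայն", "ամենայն", "ամի", "ամենայն", "ամենայն"]),
    ]
    tree_1 = (("A", "C"), (("B", "D"), "E"))  # hypothetical stemma shape
    tree_2 = (("A", "B"), (("C", "D"), "E"))  # hypothetical alternative
    print(rows)
    print(parsimony_score(tree_1, "ABCDE", rows), parsimony_score(tree_2, "ABCDE", rows))
```

A real analysis would search over many tree shapes rather than compare two by hand, which is the part usually delegated to phylogenetics software.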


[Stemma diagram]
Non-fragmentary manuscripts omitted: Paris 191, 200; Jerusalem 3651; Matenadaran 2855, 2899, 3380, 6605, 8159, 8232, 8894; Rome 25; Vienna 243, 246
Annotations on the tree: gaps appear; text truncated; chapter divisions appear
Witnesses shown: F (1617), B (1623), X (1669), A (1689), Matenadaran 3520 (17th c.), O (ca. 1702), W (1601), Matenadaran 2644 (1844), V (1590-1600), J (1617), D (1647), Jerusalem 1869 edition*, H (17th c.), Z (17th c.), Y (17th c.), K (1699), L (1660), I (1664), Matenadaran 3071 (1651-61), Bzommar 644 (1775-1805), Venice 986 (1830-35)
*Based on Jerusalem mss. 1051, 1107

Publication


Online publication

• XML can also be turned into HTML for online publication
• This gives:
  • searchable text
  • easy updates
  • configurable set of variants
  • links to manuscript images where available
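
As a rough illustration of the XML-to-HTML step, the Python sketch below renders the lemma of each word and puts the variant readings into a tooltip. It is a minimal example under stated assumptions, not the publication pipeline of the edition: the sample fragment has no TEI namespace, every `<app>` is assumed to contain a `<lem>`, and the `variant` class name and function names are hypothetical.

```python
import xml.etree.ElementTree as ET
from html import escape

# Small sample in the style of the apparatus XML above (no namespace, for brevity).
SAMPLE = """<p>
<w><app><lem wit="#B #D #E">այս</lem><rdg wit="#A #C">յայսմ</rdg></app></w>
<w><app><lem wit="#A #B #D #E">ամենայն</lem><rdg wit="#C">ամի</rdg></app></w>
<w>նստուցանեն</w>
</p>"""


def word_to_html(w):
    """Render one <w>: plain text, or the lemma with variants in a tooltip."""
    app = w.find("app")
    if app is None:
        return escape("".join(w.itertext()))
    lem = app.find("lem")  # assumption: each <app> has exactly one <lem>
    variants = "; ".join(
        f"{rdg.text} {rdg.get('wit', '').replace('#', '').replace(' ', '')}"
        for rdg in app.findall("rdg")
    )
    return f'<span class="variant" title="{escape(variants)}">{escape(lem.text)}</span>'


def paragraph_to_html(xml_text):
    """Convert one <p> of word-level XML into an HTML paragraph."""
    root = ET.fromstring(xml_text)
    return "<p>" + " ".join(word_to_html(w) for w in root.iter("w")) + "</p>"


if __name__ == "__main__":
    print(paragraph_to_html(SAMPLE))
```

Because the variants stay in the XML source, which of them appear on the page can be decided at conversion time rather than fixed in print.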


Questions?

