Visualizing the
Transcribe Bentham Corpus
Frédérique Mélanie, Estelle Tieberghien, Pablo Ruiz Fabo,
Thierry Poibeau
LATTICE Lab: ENS – CNRS – U Paris 3, PSL – USPC
Tim Causer, Melissa Terras
UCL Bentham Project, UCL Digital Humanities
UCLDH Seminar, December 2016
Outline
• UCL Bentham Project & Transcribe Bentham
• How to navigate this corpus? Visualizations
– Lexical extraction
– Co-occurrence networks
• Static view and Temporal evolution
• Evaluation and Challenges
• Other corpus explorations via visualization
• Distant Reading Module, WordTree
• Other lexical analyses
Jeremy Bentham (1748-1832)
• Jurist, philosopher, and legal and social reformer
• Leading theorist in Anglo-American philosophy of law
• Influenced the development of welfarism
• Advocated utilitarianism
• Animal rights
• Work on the “panopticon”
• Not founder of UCL, but...
• 60,000 folios in UCL Special Collections
• 40,000 untranscribed
• Auto-icon
The Bentham Project
• http://www.ucl.ac.uk/Bentham-Project/
• Since 1959
• “aims to produce a new scholarly
edition of the works and
correspondence of Jeremy Bentham”
• Twenty-six volumes of the new
Collected Works have been published
• 50 years to transcribe 20,000 folios
• Previous AHRC grant catalogued the
manuscripts
– http://www.benthampapers.ucl.ac.uk/
Facts and Figures (as of 1st July 2016)
• 16,205 manuscripts transcribed/partially-transcribed
• 15,351 (94%) checked and approved
• 83,955 visits
• 34,359 unique views
• Average session time: 14 minutes 13 seconds
• 140 countries
• 514 people have transcribed something
• Most of the work done by the 26 Super Transcribers
• Average of 54 transcripts edited per week since the start of the project
• Average of 56 per week during the last twelve months
• Greatest number of transcripts in any one week: 300 (week commencing 14 June 2014)
Transcribe Bentham progress, 8 September 2010 to 20 March 2015
[Figure: chart of progress from 8 Sep 2010 to 6 Mar 2015, vertical axis 0 to 12,000. Series: Manuscripts worked on; Completed transcripts. Annotated events: NYT article; BL manuscripts made available.]
With thanks to:
• Prof Philip Schofield (UCL Bentham Project, Principal Investigator)
• Dr Tim Causer (Bentham Project)
• Dr Kris Grint (Bentham Project)
• Richard Davis (University of London Computer Centre)
• José Martin (ULCC)
• Martin Moyle (UCL Library Services)
• Lesley Pitman (UCL Library Services)
• Tony Slade (UCL Creative Media)
• Miguel Faleiro Rodrigues, Alejandro Salinas Lopez, and Raheel Nabi (UCL Creative Media)
• Dr Arnold Hunt (British Library)
• Anna-Maria Sichani (Bentham Project)
• Dr Justin Tonra (National University of Ireland Galway) and Dr Valerie Wallace (Victoria University Wellington), both formerly of the Bentham Project
• All the partners in Transcriptorium http://transcriptorium.eu/consortium/
• And Transcribe Bentham’s volunteers!
• Project previously funded by the AHRC and the Andrew W. Mellon Foundation
Relevant access to a large corpus
• A search index?
• Topic models?
• Corpus cartography?
Challenges for this corpus
• Not an all-English corpus
• Difficulties posed by an historical variety
• Technical language
• Revision history, additions and deletions
Stats for analyzed corpus sample
• Total TEI files: 29,900
• In English: 29,400
• That we dated: 16,700
• We only visualized English transcripts that
we could date (with a simple heuristic)¹
• Work is based on ca. 55% of all the
TEI files in our sample
¹ We were not using the corpus’ date metadata for this exercise
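The slides do not say what the "simple heuristic" was. Purely as an illustration of what such a heuristic could look like, the sketch below guesses a year from the transcript text itself by picking the most frequent four-digit year inside Bentham's lifetime. The function names, the regular expression, and the 1748-1832 bounds are assumptions made for this example, not the authors' method.

```python
import re
from collections import Counter

# Assumption: candidate years are four-digit numbers starting with 17 or 18.
YEAR_RE = re.compile(r"\b(1[78]\d{2})\b")


def guess_year(text, lo=1748, hi=1832):
    """Return the most frequent year in [lo, hi] mentioned in text, or None.

    The default bounds are Bentham's birth and death years (hypothetical choice).
    """
    years = [int(y) for y in YEAR_RE.findall(text) if lo <= int(y) <= hi]
    if not years:
        return None
    # On ties, Counter.most_common keeps the year that was seen first.
    return Counter(years).most_common(1)[0][0]


def dated_share(texts):
    """Fraction of transcripts for which a year could be guessed."""
    if not texts:
        return 0.0
    dated = sum(1 for t in texts if guess_year(t) is not None)
    return dated / len(texts)
```

Applied to a list of transcript texts, `dated_share` gives the kind of coverage figure reported above (the share of files that could be dated).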
Corpus Cartography
• Lexical extraction (of relevant sequences)
• Clustering based on similarity measures
• Visual representation (map of the corpus)
based on layout algorithms
Cartography tool: CorText
• CorText Manager covers all cartography
steps:
– Lexical extraction
– Clustering
– Visualization
• Each step can be used independently,
thanks to standard import/export formats
Tools combined with CorText
CARTOGRAPHY STEP: TOOLS and RESOURCES
• Lexical Extraction: DBpedia Spotlight, YaTeA, human domain-expert
• Clustering: CorText Analysis
• Visualization
– Static: Gephi + Sigma JS plugin, CorText MapExplorer, Inkscape
– Dynamic: CorText Heatmaps, Tubes, Distant Reading
Lexical Extraction
• CorText native option
– Noun-Phrase chunks (based on TreeTagger)
• Our options:
– Entity Linking / Wikification to DBpedia
– Keyphrase extraction tools like YaTeA
• In all cases: manual selection of pre-ranked
candidate terms by a domain-expert
Entity Linking / Wikification
• Given a database with encyclopedic
knowledge (e.g. Wikipedia)
- Finds references (mentions) to DB terms in text
- Deals with variability in the mentions for a term
[Figure: corpus mentions linked to one database term]
- judicatory
- judicial
- judicature
- Judicatory
- Judicial
Entity Linking / Wikification
• Tool: DBpedia Spotlight
• Compares the context of sequences of
words in a text against DBpedia articles:
– Term definition’s text
– Links
– DBpedia structure (redirections etc.)
• Assigns a DBpedia term to the sequence if
a good match is found
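To make the idea of context comparison concrete, here is a toy sketch: it scores the words around a mention against short candidate descriptions with bag-of-words cosine similarity and assigns a term only above a threshold. This is not how DBpedia Spotlight is implemented; the candidate identifiers, descriptions, stopword list, and threshold are all made up for illustration.

```python
import math
import re
from collections import Counter

# Assumption: a tiny stopword list, enough for the example below.
STOPWORDS = {"the", "of", "a", "in", "by", "was", "to", "and"}


def bag(text):
    """Lower-cased bag of words without stopwords."""
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(w for w in words if w not in STOPWORDS)


def cosine(a, b):
    """Cosine similarity between two bags of words."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def link(context, candidates, threshold=0.1):
    """Return the best-matching candidate term, or None if no good match.

    candidates maps a term identifier to a short description text.
    """
    ctx = bag(context)
    best_term, best_score = None, 0.0
    for term, description in candidates.items():
        score = cosine(ctx, bag(description))
        if score > best_score:
            best_term, best_score = term, score
    return best_term if best_score >= threshold else None


# Hypothetical candidates for the ambiguous word "bill".
CANDIDATES = {
    "Bill_(law)": "proposed law under consideration by a legislature parliament",
    "Beak": "external part of the mouth of a bird",
}
```

With these candidates, a context about parliament selects the legal sense, and a context with no overlapping vocabulary returns no term at all.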
Entity Linking / Wikification
Example terms and their variants
Term: Variants
• Judiciary: judicature, judicatory, judicial
• Jury: jury, juries
• Monarch: king, monarch
• Quantity: amount, quantity
• Saint Peter: Simon Peter, Cephas
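The practical effect of linking variants to one term can be sketched with a plain lookup table built from the examples above. Real entity linking resolves variants from context rather than from a fixed dictionary, so treat this only as an illustration of the resulting normalization; the function names are hypothetical.

```python
from collections import Counter

# Terms and variants copied from the example slide.
VARIANTS = {
    "Judiciary": ["judicature", "judicatory", "judicial"],
    "Jury": ["jury", "juries"],
    "Monarch": ["king", "monarch"],
    "Quantity": ["amount", "quantity"],
    "Saint Peter": ["Simon Peter", "Cephas"],
}

# Inverted, case-insensitive index: variant -> term.
LOOKUP = {
    variant.lower(): term
    for term, variants in VARIANTS.items()
    for variant in variants
}


def normalize(mention):
    """Map a mention to its term, or None if the mention is unknown."""
    return LOOKUP.get(mention.lower())


def count_terms(mentions):
    """Count terms after normalization, ignoring unknown mentions."""
    terms = (normalize(m) for m in mentions)
    return Counter(t for t in terms if t is not None)
```

Counting after normalization means that "king" and "monarch" both contribute to the same node in a co-occurrence map.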
Entity Linking / Wikification
• Applying a current knowledge-base
(DBpedia) to 18th-19th century texts
• Is this a valid method?
Keyphrase extraction
• YaTeA (Aubin and Hamon, 2006)
• Extracts noun-phrases of configurable
structure and length
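As a rough illustration of extracting noun phrases with a configurable structure and length, the sketch below scans tokens that are already part-of-speech tagged and keeps runs of modifiers that end in a noun. It is not YaTeA's algorithm; the tag names (ADJ, NOUN, and so on), the pattern, and the `max_len` parameter are assumptions for this example.

```python
def noun_phrases(tagged, max_len=3, modifiers=("ADJ", "NOUN"), head="NOUN"):
    """Extract phrases of the shape (modifier)* head from (word, tag) pairs.

    Each phrase keeps at most max_len tokens, counted back from the head.
    """
    phrases = []
    run = []
    # A sentinel with an unmatched tag closes the last run.
    for word, tag in list(tagged) + [("", "END")]:
        if tag in modifiers or tag == head:
            run.append((word, tag))
            continue
        # The run has ended: drop trailing tokens that are not a head.
        while run and run[-1][1] != head:
            run.pop()
        if run:
            phrases.append(" ".join(w for w, _ in run[-max_len:]))
        run = []
    return phrases


# Hypothetical tagged input.
SENTENCE = [
    ("the", "DET"), ("greatest", "ADJ"), ("happiness", "NOUN"),
    ("of", "ADP"), ("the", "DET"), ("greatest", "ADJ"), ("number", "NOUN"),
]
```

Changing `max_len` or the `modifiers` tuple changes which candidate terms come out, which is the sense of "configurable" used here.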
Clustering
• CorText offers several similarity metrics
– we chose the default method (distributional)
for homogeneous networks (Weeds & Weir 2005)
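The intuition behind a distributional metric is that two terms are similar when they co-occur with the same other terms. The sketch below builds co-occurrence vectors from per-document term sets and compares them with cosine similarity. Cosine is used here as a simple stand-in; it is not the Weeds & Weir measure and not CorText's implementation, and the example documents are invented.

```python
import math
from collections import defaultdict
from itertools import combinations


def cooccurrence(docs):
    """Build term -> {other term: number of documents shared}."""
    vectors = defaultdict(lambda: defaultdict(int))
    for terms in docs:
        for a, b in combinations(sorted(set(terms)), 2):
            vectors[a][b] += 1
            vectors[b][a] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse co-occurrence vectors."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


# Hypothetical documents, each reduced to its extracted terms.
DOCS = [
    {"prison", "inspector", "panopticon"},
    {"prison", "inspector"},
    {"tax", "revenue"},
]
VECTORS = cooccurrence(DOCS)
```

Pairwise similarities like these become the weighted edges of the network, which a community-detection step can then group into clusters.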
Visualization
• Static (one map for all dated transcripts)
• Dynamic: temporal slices on the corpus
– Heatmaps
– “River” or Sankey networks (“Tubes layout”)
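Temporal slicing amounts to grouping the dated transcripts into periods before building one map per period. A minimal sketch, assuming fixed-width periods; the start year and width are illustrative defaults, not the settings used for the maps:

```python
from collections import defaultdict


def slice_by_period(dated_docs, start=1795, width=5):
    """Group (year, doc_id) pairs into periods of `width` years.

    Returns {period start year: [doc_id, ...]}, ordered by period.
    Documents dated before `start` are skipped.
    """
    slices = defaultdict(list)
    for year, doc_id in dated_docs:
        if year < start:
            continue
        period = start + ((year - start) // width) * width
        slices[period].append(doc_id)
    return dict(sorted(slices.items()))
```

Each resulting slice can then be processed like the full corpus, which yields one heatmap or one layer of the "tubes" layout per period.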
http://apps.lattice.cnrs.fr/bentham
Static visualization
[Figure: CorText network visualized with Gephi]
[Figure: CorText network made interactive thanks to Gephi’s Sigma JS Exporter]
Example terms: Bill, happiness, suffering, death
Examples: nodes linking clusters
Heatmaps: Saliency per subcorpus
[Figures: heatmaps for the 1800-1809 and 1810-1819 subcorpora]
Dynamic visualization
[Figure: temporal slices along a time axis marked 1795, 1800, 1805, 1810]
Evaluation
• Static maps: terms in the clusters
correspond closely to issues dealt with by
Bentham for the thematic areas of each
cluster
• Heatmaps: The evolution depicted
corresponds to the evolution of topics in
Bentham’s work
• DBpedia vs. keyphrase extraction: the
keyphrases provide more relevant
evidence for specialized scholars, while a
general encyclopedia can help other users
Challenges
[Figure: deleted material and additions]
Thematic Variety
• Animal Welfare
• Arts
• Capital punishment
• Civil Code
• Constitutional Code
• Convict transportation
• Correspondence
• Crime & Punishment
• Education
• Law
• Legislation
• Moral Philosophy
• New South Wales
• Panopticon
• Penal Code
• Political Economy
• Preventive Police
• Religion
• Science
• Sexual Morality
• Torture
Formal Variety
• Text sheets
• Copies / Fair copies
• Marginal summary sheets
• Correspondence
• Collectanea
• Rudiments
• Spencers
From http://www.transcribe-bentham.da.ulcc.ac.uk/td/Manuscripts and
http://www.benthampapers.ucl.ac.uk/help.aspx?subject=category
Distant Reading Module
• Follow evolution of selected lexical
sequences
Evolution of a lexical item
Temporal evolution
Temporal evolution profiles:
- Here: Rising, but present at all dates
- Other examples: falling, regular spikes etc.
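One simple way to label such profiles is to take a term's relative frequency in each period and look at the slope of a least-squares line through those values, together with a check that the term occurs in every period. This is only a sketch of that idea; the labels, the tolerance, and the function names are assumptions, not the module's actual method.

```python
def relative_frequencies(counts, totals):
    """Term count divided by total tokens, per period."""
    return [c / t if t else 0.0 for c, t in zip(counts, totals)]


def slope(values):
    """Least-squares slope of values against their period index."""
    n = len(values)
    if n < 2:
        return 0.0
    mean_x = (n - 1) / 2
    mean_y = sum(values) / n
    num = sum((i - mean_x) * (v - mean_y) for i, v in enumerate(values))
    den = sum((i - mean_x) ** 2 for i in range(n))
    return num / den


def profile(values, tol=1e-9):
    """Return (trend, present_at_all_dates) for a series of frequencies."""
    s = slope(values)
    if s > tol:
        trend = "rising"
    elif s < -tol:
        trend = "falling"
    else:
        trend = "flat"
    return trend, all(v > 0 for v in values)
```

A profile like the one on the slide would come out as `("rising", True)`. Detecting regular spikes would need more than a single slope, for example a look at the residuals around the fitted line.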
Contexts: WordTree
Context evolution: Bump Charts
• Example: evil
Neighbours evolution: Bump Charts
• Example: relations among neighbours of
evil
Relations in the context: Egonetworks
Evolution of neighbours’ relations
[Figures: Egonetworks for Period 2, Period 3, and Period 4]
Other Lexical Analyses
• TXM “textometry” tool
– Automatic part-of-speech tagging
– Partition texts according to metadata
– Query corpus using linguistic criteria
– Statistical analyses (overrepresentation, underrepresentation)
[ http://textometrie.ens-lyon.fr/?lang=en ]
Lexical Analysis with TXM
• Partition the corpus according to Category,
Year, Decade, Main headings, or other
available metadata
[Figure: Number of words per Category]
• Over- (or under-) representation of given
words per decade (after partitioning per decade)
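A common way to quantify over- or under-representation of a word in one part of a partition is a hypergeometric tail probability: how surprising is it to see k occurrences in a part of n tokens, given K occurrences in a corpus of N tokens? The sketch below implements that generic test with the standard library. It is not TXM's code, and the variable names are chosen for this example.

```python
from math import comb


def hypergeom_pmf(k, N, K, n):
    """P(X = k) when drawing n tokens from N, of which K are the word."""
    return comb(K, k) * comb(N - K, n - k) / comb(N, n)


def representation(k, N, K, n):
    """Return ('over' | 'under', tail probability) for k observed occurrences.

    N: corpus size in tokens, K: occurrences of the word in the corpus,
    n: size of the part (e.g. one decade), k: occurrences in that part.
    """
    lowest = max(0, n - (N - K))   # smallest possible count in the part
    highest = min(K, n)            # largest possible count in the part
    expected = n * K / N
    if k >= expected:
        p = sum(hypergeom_pmf(i, N, K, n) for i in range(k, highest + 1))
        return "over", p
    p = sum(hypergeom_pmf(i, N, K, n) for i in range(lowest, k + 1))
    return "under", p
```

A small probability with the label "over" suggests the word is characteristic of that decade; a small probability with "under" suggests it is unusually rare there.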
TXM linguistic queries
• Evil followed by a noun, per text-category
• Sentences containing an adjective + evil
Summary
• Accessing a large unedited corpus
– Cartography methods
• Lexical extraction
• Maps
– Static picture of the corpus
– Temporal evolution
– Other visualizations (Distant Reading, WordTree)
• Domain-expert feedback
• Challenges
• Other lexical analyses
http://apps.lattice.cnrs.fr/bentham
Bibliography
Aubin, S., and Hamon, T. (2006). Improving Term
Extraction with Terminological Resources. In
Advances in Natural Language Processing: 5th
International Conference on NLP, FinTAL 2006, pp.
380-387. LNAI 4139. Springer.
Auer, Sören, et al. (2007). DBpedia: A nucleus for a
web of open data. The Semantic Web. Springer.
Causer, Tim, and Terras, Melissa (2014a). Many
hands make light work. Many hands together
make merry work: Transcribe Bentham and
crowdsourcing manuscript collections, in
Crowdsourcing Our Cultural Heritage, ed. M. Ridge,
Ashgate.
Causer, Tim, and Terras, Melissa (2014b).
Crowdsourcing Bentham: Beyond the Traditional
Boundaries of Academic History, International
Journal of Humanities and Arts Computing, 8
Chavalarias, David, and Jean-Philippe Cointet. (2013).
Phylomemetic Patterns in Science Evolution—The
Rise and Fall of Scientific Fields. PLoS ONE 8 (2)
Cortext Manager Documentation (2016).
https://docs.cortext.net/.
Mendes, Pablo N., Max Jakob, Andrés García-Silva,
and Christian Bizer. (2011). DBpedia Spotlight:
Shedding Light on the Web of Documents. In
Proceedings of the 7th International Conference on
Semantic Systems, 1–8. ACM.
Mélanie, F., Tieberghien, E., Ruiz, P., Poibeau, T.,
Causer, T., and Terras, M. (2016). Mapping the Bentham
Corpus. In Digital Humanities Conference (DH
2016). Kraków, Poland.
Poibeau, T. and Ruiz, P. (2015). Generating Navigable
Semantic Maps from Social Sciences Corpora. In
Digital Humanities Conference (DH 2015). Sydney,
Australia.
Rule, Alix, Jean-Philippe Cointet, and Peter S.
Bearman. (2015). Lexical Shifts, Substantive
Changes, and Continuity in State of the Union
Discourse, 1790–2014. Proceedings of the National
Academy of Sciences 112 (35)
Venturini, T., N. Baya Laffite, J.-P. Cointet, I. Gray, V.
Zabban, and K. De Pryck. (2014). Three Maps and
Three Misunderstandings: A Digital Mapping of
Climate Diplomacy. Big Data & Society 1
Weeds J, Weir D (2005). Co-occurrence retrieval: A
flexible framework for lexical distributional similarity.
In Computational Linguistics 31(4), 439–475.
Wattenberg, M., and Viégas, F. B. (2008). The word tree,
an interactive visual concordance. IEEE
Transactions on Visualization and Computer Graphics,
14(6), pp. 1221-1228.
& return you all due thanks
pablo.ruiz.fabo@ens.fr http://www.lattice.cnrs.fr/Pablo-Ruiz-Fabo,541
http://apps.lattice.cnrs.fr/

National Trust 'For Everyone' strategyUCLDH
 
Digital Lives of People with Learning Disabilities
Digital Lives of People with Learning DisabilitiesDigital Lives of People with Learning Disabilities
Digital Lives of People with Learning DisabilitiesUCLDH
 
Digital Content and Disability - The Librarian Perspective
Digital Content and Disability - The Librarian PerspectiveDigital Content and Disability - The Librarian Perspective
Digital Content and Disability - The Librarian PerspectiveUCLDH
 
SensusAccess: Alternate Media Made Easy
SensusAccess: Alternate Media Made EasySensusAccess: Alternate Media Made Easy
SensusAccess: Alternate Media Made EasyUCLDH
 
Accessible Publishing
Accessible PublishingAccessible Publishing
Accessible PublishingUCLDH
 
What might a spoken corpus tell us about language
What might a spoken corpus tell us about languageWhat might a spoken corpus tell us about language
What might a spoken corpus tell us about languageUCLDH
 
“It is Time for the Slaves to Speak”: Transatlantic Abolitionism and African ...
“It is Time for the Slaves to Speak”: Transatlantic Abolitionism and African ...“It is Time for the Slaves to Speak”: Transatlantic Abolitionism and African ...
“It is Time for the Slaves to Speak”: Transatlantic Abolitionism and African ...UCLDH
 
Oceanic Exchanges presentation
Oceanic Exchanges presentationOceanic Exchanges presentation
Oceanic Exchanges presentationUCLDH
 
Digital Face project presentation
Digital Face project presentationDigital Face project presentation
Digital Face project presentationUCLDH
 
CrossCult presentation
CrossCult presentationCrossCult presentation
CrossCult presentationUCLDH
 
Computational History and the Transformation of Public Discourse in Finland, ...
Computational History and the Transformation of Public Discourse in Finland, ...Computational History and the Transformation of Public Discourse in Finland, ...
Computational History and the Transformation of Public Discourse in Finland, ...UCLDH
 
Where does the born- and reborn-digital material take the Digital Humanities?
Where does the born- and reborn-digital material take the Digital Humanities?Where does the born- and reborn-digital material take the Digital Humanities?
Where does the born- and reborn-digital material take the Digital Humanities?UCLDH
 
Humanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse PlatformHumanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse PlatformUCLDH
 
Managing library collections with friends, favours and a spoonful of sugar
Managing library collections with friends, favours and a spoonful of sugarManaging library collections with friends, favours and a spoonful of sugar
Managing library collections with friends, favours and a spoonful of sugarUCLDH
 
L taylor ucl_caribbean_digital_dreams_2017
L taylor ucl_caribbean_digital_dreams_2017L taylor ucl_caribbean_digital_dreams_2017
L taylor ucl_caribbean_digital_dreams_2017UCLDH
 

Mais de UCLDH (20)

Neil Tarrant Defining Nature’s Limits 9 March 2022.pptx
Neil Tarrant Defining Nature’s Limits 9 March 2022.pptxNeil Tarrant Defining Nature’s Limits 9 March 2022.pptx
Neil Tarrant Defining Nature’s Limits 9 March 2022.pptx
 
Archiving the Medici: History and Future (1370s-2020s)
Archiving the Medici: History and Future (1370s-2020s)Archiving the Medici: History and Future (1370s-2020s)
Archiving the Medici: History and Future (1370s-2020s)
 
The Pleasures and Sorrows of digitising primary source collections: The Case ...
The Pleasures and Sorrows of digitising primary source collections: The Case ...The Pleasures and Sorrows of digitising primary source collections: The Case ...
The Pleasures and Sorrows of digitising primary source collections: The Case ...
 
CVT Connect: Co-producing a digital platform for people with learning disabil...
CVT Connect: Co-producing a digital platform for people with learning disabil...CVT Connect: Co-producing a digital platform for people with learning disabil...
CVT Connect: Co-producing a digital platform for people with learning disabil...
 
The opportunity of accessibility: increasing impact and improving the user ex...
The opportunity of accessibility: increasing impact and improving the user ex...The opportunity of accessibility: increasing impact and improving the user ex...
The opportunity of accessibility: increasing impact and improving the user ex...
 
National Trust 'For Everyone' strategy
National Trust 'For Everyone' strategyNational Trust 'For Everyone' strategy
National Trust 'For Everyone' strategy
 
Digital Lives of People with Learning Disabilities
Digital Lives of People with Learning DisabilitiesDigital Lives of People with Learning Disabilities
Digital Lives of People with Learning Disabilities
 
Digital Content and Disability - The Librarian Perspective
Digital Content and Disability - The Librarian PerspectiveDigital Content and Disability - The Librarian Perspective
Digital Content and Disability - The Librarian Perspective
 
SensusAccess: Alternate Media Made Easy
SensusAccess: Alternate Media Made EasySensusAccess: Alternate Media Made Easy
SensusAccess: Alternate Media Made Easy
 
Accessible Publishing
Accessible PublishingAccessible Publishing
Accessible Publishing
 
What might a spoken corpus tell us about language
What might a spoken corpus tell us about languageWhat might a spoken corpus tell us about language
What might a spoken corpus tell us about language
 
“It is Time for the Slaves to Speak”: Transatlantic Abolitionism and African ...
“It is Time for the Slaves to Speak”: Transatlantic Abolitionism and African ...“It is Time for the Slaves to Speak”: Transatlantic Abolitionism and African ...
“It is Time for the Slaves to Speak”: Transatlantic Abolitionism and African ...
 
Oceanic Exchanges presentation
Oceanic Exchanges presentationOceanic Exchanges presentation
Oceanic Exchanges presentation
 
Digital Face project presentation
Digital Face project presentationDigital Face project presentation
Digital Face project presentation
 
CrossCult presentation
CrossCult presentationCrossCult presentation
CrossCult presentation
 
Computational History and the Transformation of Public Discourse in Finland, ...
Computational History and the Transformation of Public Discourse in Finland, ...Computational History and the Transformation of Public Discourse in Finland, ...
Computational History and the Transformation of Public Discourse in Finland, ...
 
Where does the born- and reborn-digital material take the Digital Humanities?
Where does the born- and reborn-digital material take the Digital Humanities?Where does the born- and reborn-digital material take the Digital Humanities?
Where does the born- and reborn-digital material take the Digital Humanities?
 
Humanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse PlatformHumanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse Platform
 
Managing library collections with friends, favours and a spoonful of sugar
Managing library collections with friends, favours and a spoonful of sugarManaging library collections with friends, favours and a spoonful of sugar
Managing library collections with friends, favours and a spoonful of sugar
 
L taylor ucl_caribbean_digital_dreams_2017
L taylor ucl_caribbean_digital_dreams_2017L taylor ucl_caribbean_digital_dreams_2017
L taylor ucl_caribbean_digital_dreams_2017
 

Último

Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)cama23
 
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxPoojaSen20
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxMaryGraceBautista27
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 

Último (20)

Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptx
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 

Visualizing the Transcribe Bentham Corpus

  • 1. Visualizing the Transcribe Bentham Corpus Frédérique Mélanie, Estelle Tieberghien, Pablo Ruiz Fabo, Thierry Poibeau LATTICE Lab: ENS – CNRS – U Paris 3, PSL – USPC Tim Causer, Melissa Terras UCL Bentham Project, UCL Digital Humanities UCLDH Seminar, December 2016
  • 2. Outline • UCL Bentham Project & Transcribe Bentham • How to navigate this corpus? Visualizations – Lexical extraction – Co-occurrence networks • Static view and Temporal evolution • Evaluation and Challenges • Other corpus explorations via visualization • Distant Reading Module, WordTree • Other lexical analyses
  • 3. Jeremy Bentham (1748-1832) • Jurist, philosopher, and legal and social reformer • Leading theorist in Anglo-American philosophy of law • Influenced the development of welfarism • Advocated utilitarianism • Animal rights • Work on the “panopticon” • Not founder of UCL, but... • 60,000 folios in UCL Special Collections • 40,000 untranscribed • Auto-icon
  • 4. The Bentham Project • http://www.ucl.ac.uk/Bentham-Project/ • Since 1959 • “aims to produce a new scholarly edition of the works and correspondence of Jeremy Bentham” • twenty six volumes of the new Collected Works have been published • 50 years to transcribe 20,000 folios • Previous AHRC grant catalogued the manuscripts – http://www.benthampapers.ucl.ac.uk/
  • 10. Facts and Figures (as of 1st July 2016) • 16,205 manuscripts transcribed/partially-transcribed • 15,351 (94%) checked and approved • 83,955 visits • 34,359 unique views • Average session time: 14 minutes 13 seconds • 140 countries • 514 people have transcribed something • Most of the work done by the 26 Super Transcribers • Average of 54 transcripts edited per week since the start of the project • Average of 56 per week during the last twelve months • Greatest number of transcripts in any one week: 300 (w/c 14 June 2014)
  • 11. Transcribe Bentham progress, 8 September 2010 to 20 March 2015 [Chart: manuscripts worked on and completed transcripts over time, with markers for the NYT article and for the BL manuscripts being made available]
  • 12. With thanks to: • Prof Philip Schofield (UCL Bentham Project, Principal Investigator) • Dr Tim Causer (Bentham Project) • Dr Kris Grint (Bentham Project) • Richard Davis (University of London Computer Centre) • José Martin (ULCC) • Martin Moyle (UCL Library Services) • Lesley Pitman (UCL Library Services) • Tony Slade (UCL Creative Media) • Miguel Faleiro Rodrigues, Alejandro Salinas Lopez, and Raheel Nabi (UCL Creative Media) • Dr Arnold Hunt (British Library) • Anna-Maria Sichani (Bentham Project) • Dr Justin Tonra (National University of Ireland Galway) and Dr Valerie Wallace (Victoria University Wellington), both formerly of the Bentham Project • All the partners in Transcriptorium http://transcriptorium.eu/consortium/ • And Transcribe Bentham’s volunteers! • Project previously funded by the AHRC and the Andrew W. Mellon Foundation
  • 15. Relevant access to a large corpus • A search index? • Topic models? • Corpus cartography? Challenges for this corpus • Not an all-English corpus • Difficulties posed by a historical language variety • Technical language • Revision history, additions and deletions
  • 16. Stats for analyzed corpus sample • Total TEI files: 29,900 • In English: 29,400 • That we dated: 16,700 • We only visualized English transcripts that we could date (with a simple heuristic; we were not using the corpus’ date metadata for this exercise) • Work is based on ca. 55% of all the TEI files in our sample
  • 17. Corpus Cartography • Lexical extraction (of relevant sequences) • Clustering based on similarity measures • Visual representation (map of the corpus) based on layout algorithms
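The step from extracted terms to a network can be sketched as follows. This is a minimal illustration, not the CorText implementation: the toy documents and the function name `cooccurrence_edges` are assumptions made for the example. It counts, for every pair of terms, how many documents contain both; those counts are the kind of raw material that similarity measures and layout algorithms then work on.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy input: each document reduced to its set of extracted terms.
docs = [
    {"happiness", "legislation", "punishment"},
    {"happiness", "legislation"},
    {"punishment", "panopticon"},
]

def cooccurrence_edges(term_sets):
    """Count how many documents each unordered pair of terms shares."""
    edges = Counter()
    for terms in term_sets:
        # Sorting gives every pair a single canonical (a, b) ordering.
        for a, b in combinations(sorted(terms), 2):
            edges[(a, b)] += 1
    return edges

edges = cooccurrence_edges(docs)
for (a, b), weight in sorted(edges.items()):
    print(a, b, weight)
```

Each `(a, b, weight)` triple can be read as a weighted edge, a shape that graph tools generally accept as an edge list.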
  • 18. Cartography tool: CorText • CorText Manager covers all cartography steps: – Lexical extraction – Clustering – Visualization • Each step can be used independently, thanks to standard import/export formats
  • 19. Tools combined with CorText (cartography step: tools and resources) • Lexical extraction: DBpedia Spotlight, YaTeA, human domain expert • Clustering: CorText Analysis • Visualization: Gephi + Sigma JS plugin – Static: CorText MapExplorer, Inkscape – Dynamic: CorText Heatmaps, Tubes, Distant Reading
  • 21. Lexical Extraction • CorText native option – Noun-phrase chunks (based on TreeTagger) • Our options: – Entity Linking / Wikification to DBpedia – Keyphrase extraction tools like YaTeA • In all cases: manual selection of pre-ranked candidate terms by a domain expert
  • 22. Entity Linking / Wikification • Given a database with encyclopedic knowledge (e.g. Wikipedia) – Find references (mentions) to DB terms in text – Deal with variability in the mentions for a term
  • 25. Entity Linking / Wikification • Given a database with encyclopedic knowledge (e.g. Wikipedia) – Find references (mentions) to DB terms in text – Deal with variability in the mentions for a term [Diagram labels: Database; Corpus mentions: judicatory, judicial, judicature, Judicatory, Judicial]
  • 26. Entity Linking / Wikification • Tool: DBpedia Spotlight • Compares the context of sequences of words in a text against DBpedia articles: – Term definition’s text – Links – DBpedia structure (redirections etc.) • Assigns a DBpedia term to the sequence if a good match is found
  • 27. Entity Linking / Wikification: example terms and their variants • Judiciary: judicature, judicatory, judicial • Jury: jury, juries • Monarch: king, monarch • Quantity: amount, quantity • Saint Peter: Simon Peter, Cephas
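The effect of grouping variants under one term can be illustrated with a plain lookup table built from the examples on this slide. This is only a sketch of the outcome: DBpedia Spotlight itself compares textual context against DBpedia, which a dictionary does not do, and the names `VARIANTS`, `LOOKUP` and `normalize` are invented for the illustration.

```python
# Term -> variants, copied from the example slide.
VARIANTS = {
    "Judiciary": ["judicature", "judicatory", "judicial"],
    "Jury": ["jury", "juries"],
    "Monarch": ["king", "monarch"],
    "Quantity": ["amount", "quantity"],
    "Saint Peter": ["Simon Peter", "Cephas"],
}

# Invert the table so that each lower-cased variant points to its term.
LOOKUP = {
    variant.lower(): term
    for term, variants in VARIANTS.items()
    for variant in variants
}

def normalize(mention):
    """Return the term for a mention, or None when the mention is unknown."""
    return LOOKUP.get(mention.lower())

print(normalize("Judicatory"))
print(normalize("juries"))
print(normalize("panopticon"))
```

Lower-casing both sides is what lets "Judicatory" and "judicatory" resolve to the same term; counting terms rather than surface forms keeps the variants from being split across several nodes in the map.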
  • 28. Entity Linking / Wikification • Applying a current knowledge-base (DBpedia) to 18th-19th century texts • Is this a valid method?
  • 29. Keyphrase extraction • YaTeA (Aubin and Hamon, 2006) • Extracts noun-phrases of configurable structure and length
  • 31. Clustering • CorText offers several similarity metrics – we chose the default method (distributional) for homogeneous networks (Weeds & Weir 2005)
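The idea behind a distributional measure is that two terms are similar when they co-occur with the same other terms. The sketch below uses cosine similarity over co-occurrence vectors as a simple stand-in; it is not the Weeds & Weir measure nor CorText's code, and the counts are invented for the example.

```python
import math

# Hypothetical co-occurrence counts: term -> {context term: count}.
vectors = {
    "judge": {"court": 4, "law": 3, "jury": 2},
    "magistrate": {"court": 3, "law": 2},
    "pleasure": {"pain": 5, "happiness": 4},
}

def cosine(u, v):
    """Cosine similarity between two sparse count vectors stored as dicts."""
    shared = u.keys() & v.keys()
    dot = sum(u[k] * v[k] for k in shared)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

print(round(cosine(vectors["judge"], vectors["magistrate"]), 3))
print(cosine(vectors["judge"], vectors["pleasure"]))
```

Terms with overlapping contexts score close to 1 and terms with no shared context score 0; a clustering step can then group terms whose pairwise scores are high.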
  • 32. Visualization • Static (one map for all dated transcripts) • Dynamic: temporal slices on the corpus – Heatmaps – “River” or Sankey networks (“Tubes layout”) • http://apps.lattice.cnrs.fr/bentham
  • 37. Example term: happiness • CorText network made interactive thanks to Gephi’s Sigma JS Exporter
  • 44. Examples: nodes linking clusters
  • 46. Heatmaps: Saliency per subcorpus
  • 55. Evaluation • Static maps: terms in the clusters correspond closely to issues dealt with by Bentham for the thematic areas of each cluster • Heatmaps: the evolution depicted corresponds to the evolution of topics in Bentham’s work • DBpedia vs. keyphrase extraction: the keyphrases provide more relevant evidence for specialized scholars; a general encyclopedia can help other users
  • 57. Challenges • Thematic variety: Animal Welfare, Arts, Capital punishment, Civil Code, Constitutional Code, Convict transportation, Correspondence, Crime & Punishment, Education, Law, Legislation, Moral Philosophy, New South Wales, Panopticon, Penal Code, Political Economy, Preventive Police, Religion, Science, Sexual Morality, Torture • Formal variety: Text sheets, Copies / Fair copies, Marginal summary sheets, Correspondence, Collectanea, Rudiments, Spencers • From http://www.transcribe-bentham.da.ulcc.ac.uk/td/Manuscripts and http://www.benthampapers.ucl.ac.uk/help.aspx?subject=category
  • 59. Distant Reading Module • Follow evolution of selected lexical sequences
  • 60. Evolution of a lexical item • Temporal evolution profiles: – Here: rising, but present at all dates – Other examples: falling, regular spikes, etc.
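A temporal profile of this kind can be approximated by the relative frequency of a term in each time slice. The sketch below is an assumption-laden illustration, not the Distant Reading Module itself: the dated toy documents, the choice of decades as slices and the function name are all invented for the example.

```python
from collections import defaultdict

# Hypothetical dated transcripts: (year, list of tokens).
dated_docs = [
    (1776, ["evil", "law", "evil"]),
    (1779, ["law"]),
    (1802, ["evil", "pain", "law", "code"]),
]

def relative_frequency_by_decade(docs, term):
    """Share of tokens equal to `term` in each decade that has documents."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for year, tokens in docs:
        decade = year // 10 * 10
        totals[decade] += len(tokens)
        hits[decade] += tokens.count(term)
    return {decade: hits[decade] / totals[decade] for decade in sorted(totals)}

print(relative_frequency_by_decade(dated_docs, "evil"))
```

Dividing by the slice size matters because the number of dated transcripts differs from period to period; raw counts would mostly reflect how much text survives for each decade.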
  • 64. Context evolution: Bump Charts • Example: evil
  • 67. Relations in the context: Egonetworks • Example: relations among neighbours of evil
  • 68. Evolution of neighbours’ relations • Egonetworks (Period 2)
  • 69. Evolution of neighbours’ relations • Egonetworks (Period 3)
  • 70. Evolution of neighbours’ relations • Egonetworks (Period 4)
  • 72. Other Lexical Analyses • TXM “textometry” tool – Automatic part-of-speech tagging – Partition texts according to metadata – Query corpus using linguistic criteria – Statistical analyses (overrepresentation, underrepresentation) • http://textometrie.ens-lyon.fr/?lang=en
  • 74. Lexical Analysis with TXM • Partition the corpus according to Category, Year, Decade, Main headings, or other available metadata
  • 75. Lexical Analysis with TXM • Number of words per Category
  • 76. Lexical Analyses with TXM • Over- (or under-) representation of given words per decade (after partitioning per decade)
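One common way to quantify over-representation of a word in one part of a partitioned corpus is a hypergeometric tail probability. The slides do not say how TXM computes its scores, so the sketch below is an illustration under that assumption; the function and parameter names are invented for the example.

```python
from math import comb

def overrepresentation_pvalue(k, n, K, N):
    """P(X >= k) under a hypergeometric model.

    N: tokens in the whole corpus
    K: occurrences of the word in the whole corpus
    n: tokens in the part (e.g. one decade)
    k: occurrences of the word observed in that part
    """
    upper = min(n, K)
    # math.comb returns 0 when asked to choose more items than are available,
    # so impossible combinations contribute nothing to the sum.
    favourable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favourable / comb(N, n)

# Toy numbers: a word occurs 3 times in a 10-token corpus,
# and all 3 occurrences fall inside a 4-token part.
print(overrepresentation_pvalue(3, 4, 3, 10))
```

A small value means the word appears in the part more often than chance would predict given its overall frequency; the lower tail, P(X <= k), can be used in the same way for under-representation.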
  • 77. TXM linguistic queries • Evil followed by a noun, per text-category
  • 78. TXM linguistic queries • Sentences containing an adjective + evil
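Both queries amount to matching a two-token pattern over part-of-speech-tagged text. The sketch below shows that logic in plain Python and does not use TXM's query language. It assumes Penn-Treebank-style tags (`NN…` for nouns, `JJ…` for adjectives), and the tagged sample and function names are invented.

```python
# Hypothetical tagged tokens: (word, part-of-speech tag).
tagged = [
    ("the", "DT"), ("greatest", "JJS"), ("evil", "NN"),
    ("of", "IN"), ("evil", "JJ"), ("deeds", "NNS"),
]

def evil_followed_by_noun(tokens):
    """Pairs where 'evil' is immediately followed by a noun."""
    return [
        (w1, w2)
        for (w1, _), (w2, pos2) in zip(tokens, tokens[1:])
        if w1.lower() == "evil" and pos2.startswith("NN")
    ]

def adjective_before_evil(tokens):
    """Pairs where an adjective immediately precedes 'evil'."""
    return [
        (w1, w2)
        for (w1, pos1), (w2, _) in zip(tokens, tokens[1:])
        if pos1.startswith("JJ") and w2.lower() == "evil"
    ]

print(evil_followed_by_noun(tagged))
print(adjective_before_evil(tagged))
```

Grouping the matches by a metadata field such as text category would give the "per text-category" view mentioned on the first of the two slides.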
  • 79. Summary • Accessing a large unedited corpus – Cartography methods • Lexical extraction • Maps – Static picture of the corpus – Temporal evolution – Other visualizations (Distant Reading, WordTree) • Domain-expert feedback • Challenges • Other lexical analyses • http://apps.lattice.cnrs.fr/bentham
  • 80. Bibliography • Aubin, S., and Hamon, T. (2006). Improving Term Extraction with Terminological Resources. In Advances in Natural Language Processing: 5th International Conference on NLP, FinTAL 2006, pp. 380-387. LNAI 4139. Springer. • Auer, Sören, et al. (2007). DBpedia: A nucleus for a web of open data. The Semantic Web. Springer. • Causer, Tim, and Terras, Melissa (2014a). Many hands make light work. Many hands together make merry work: Transcribe Bentham and crowdsourcing manuscript collections, in Crowdsourcing Our Cultural Heritage, ed. M. Ridge, Ashgate. • Causer, Tim, and Terras, Melissa (2014b). Crowdsourcing Bentham: Beyond the Traditional Boundaries of Academic History, International Journal of Humanities and Arts Computing, 8. • Chavalarias, David, and Jean-Philippe Cointet. (2013). Phylomemetic Patterns in Science Evolution: The Rise and Fall of Scientific Fields. PLoS ONE 8 (2). • Cortext Manager Documentation (2016). https://docs.cortext.net/. • Mendes, Pablo N., Max Jakob, Andrés García-Silva, and Christian Bizer. (2011). DBpedia Spotlight: Shedding Light on the Web of Documents. In Proceedings of the 7th International Conference on Semantic Systems, 1–8. ACM. • Mélanie, F., Tieberghien, E., Ruiz, P., Poibeau, T., Causer, T., and Terras, M. (2016). Mapping the Bentham Corpus. In Digital Humanities Conference (DH 2016). Kraków, Poland. • Poibeau, T., and Ruiz, P. (2015). Generating Navigable Semantic Maps from Social Sciences Corpora. In Digital Humanities Conference (DH 2015). Sydney, Australia. • Rule, Alix, Jean-Philippe Cointet, and Peter S. Bearman. (2015). Lexical Shifts, Substantive Changes, and Continuity in State of the Union Discourse, 1790–2014. Proceedings of the National Academy of Sciences 112 (35). • Venturini, T., N. Baya Laffite, J.-P. Cointet, I. Gray, V. Zabban, and K. De Pryck. (2014). Three Maps and Three Misunderstandings: A Digital Mapping of Climate Diplomacy. Big Data & Society 1. • Wattenberg, M., and Viégas, F. B. (2008). The word tree, an interactive visual concordance. IEEE Transactions on Visualization and Computer Graphics, 14(6), pp. 1221-1228. • Weeds, J., and Weir, D. (2005). Co-occurrence retrieval: A flexible framework for lexical distributional similarity. Computational Linguistics 31(4), 439–475.
  • 82. & return you all due thanks • pablo.ruiz.fabo@ens.fr • http://www.lattice.cnrs.fr/Pablo-Ruiz-Fabo,541 • http://apps.lattice.cnrs.fr/