SlideShare uma empresa Scribd logo
1 de 70
Using corpora to
enhance language
learning
Michael Barlow
Overview
wordlists
collocation lists
online concordancers
text analysis software
concordancers
ParaConc and Collocate
web-based exercises
data-driven learning materials
Wordlists – general and
specialised
Wordlists have been around since before the
invention of computers. General wordlists are
used for curriculum development, textbook
writing etc.
Also possible to produce a word list for a
reading (or a possibly textbook)
Wordlists – general
Use existing wordlists such as West's General
Service List and recent updates. Coxhead's
Academic Wordlist. Kilgarriff's Wordlists
based on the BNC.
Kilgarriff Page
Academic Word List
Academic Word List
Academic Word List
• receptive list (based on morphological
derivations)
• the list excludes words found in non-academic
texts (even if they occur in academic texts)
• do we need subject or genre-specific
wordlists? (Hyland)
Specialised Word List
• Create a wordlist from a corpus (using
concordancer or other utilities)
• May need to create your own corpus –
BootCaT ?? Silvia Bernadini
BootCaT
Vocab Profile
• Tom Cobb's Vocab Profile
• http://www.lextutor.ca/vp/eng/
Collocation lists
• More difficult to find – use Collocation
Dictionary??
• Biber's work on lexical bundles
• Use concordancer or utility to create ngram
lists or locate collocations
• Collocate – shown below
Concordancers
• Online concordancer
Concordancers
Concordancers –
americancorpus.org
Concordancers
• Using a concordancer in the classroom
• Corpus as a reference tool – query the corpus
– can you say “the government are”
– what is the difference between “for
instance” and “for example”
– Tim Johns – Data-driven Learning
• (...caused economic
development...)
Concordancers – text
reconstruction exercises
Data-driven learning
(deductive)
Data-driven learning
(inductive)
Concordance data
• DDL – highlighting/noticing/discovery learning
• Highlight unexpected (for the learner)
distinctions, uses etc.
• Sequence data to build up knowledge
Parallel concordance
data
• Parallel concordance works on translation
corpus
• Students need to have same L1
Concordance data
issues
• KWIC format
• Google effect
• Data overload
• Reauthenticating data
– Sabine Braun – includes discourse
perspective (Why did the speaker use
that form?)
Parallel Corpora – DDL
(CHUJO, Kiyomi)
Parallel Corpora – DDL
(Chujo, Kiyomi)
Collocate
Software to extract collocations/terms
Word search + Span (2 words, 3 words etc.)
n-gram (bigram, trigram) list
Full extract -- collocations in a corpus
Search for analysis
(Span = 2)
analysis - frequency
analysis - t-score
analysis - MI
Trigram search
Trigram -- by freq
Trigram -- alphabetical
Trigram -- by MI
Using batch mode –
Corpuslab.com
Familiar exercise authoring
Currently offline
Aims
avoid duplication of tasks -- identifying
common collocations in Business English
Provide corpus/analysis resources
Bring corpus resources together with
familiar exercise authoring
Student View
Student View
Student View
Student View
Exercise types
Matching
Fill-the-gap
Multiple Choice
Reorder
Categorise
Exercise types
Matching*
Fill-the-gap
Multiple Choice
Reorder
Categorise*
Teacher view
Teacher view
Teacher view
Teacher view -
Resources
Resources
Teacher-generated resources
uploaded frequency lists
worksheets
Tracking
Teachers can track their exercises
“Class teachers” track students in their class
Tracking
Report for exercise Cat1
Tracking of student
School view
Register as a school
Create class names
Assign teachers to classes
Track students in classes
School view
School view
Resources
Site resources
corpora and simple concordancer
text analysis utilities
Text analysis utilities
Create frequency lists
Text analysis in terms of frequency bands
Collocational analysis of texts
Corpora
Teacher/Author resource
Sample corpus -- CSPAE
Add other corpora such as MICASE
Create various options for searching that
make use of corpus annotation
Simple searching
Aims
Create a language learning site
Encourage and facilitate use of corpus data
Matching exercise (up to 5 columns)
Provide access to word lists etc
Provide text analysis tools
Aims
Use traditional exercise types that teachers
are familiar with
Give examples of creative uses of these
standard exercises
Thank you

Mais conteúdo relacionado

Semelhante a Enhancing Language Learning Using Corpora

Problem-based Learning & Resource-based Learning two complementary approac...
Problem-based Learning & Resource-based Learning  two complementary approac...Problem-based Learning & Resource-based Learning  two complementary approac...
Problem-based Learning & Resource-based Learning two complementary approac...
Wilco te Winkel
 
Chapter 6, curriculum development in language teaching. j.c. richards
Chapter 6, curriculum development in language teaching.  j.c. richardsChapter 6, curriculum development in language teaching.  j.c. richards
Chapter 6, curriculum development in language teaching. j.c. richards
Savaedi
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
Abdullah al Mamun
 

Semelhante a Enhancing Language Learning Using Corpora (20)

2021-0509_JAECS2021_Spring
2021-0509_JAECS2021_Spring2021-0509_JAECS2021_Spring
2021-0509_JAECS2021_Spring
 
Using do-it-yourself corpora in EAP-A tailore-made resource
Using do-it-yourself corpora in EAP-A tailore-made resourceUsing do-it-yourself corpora in EAP-A tailore-made resource
Using do-it-yourself corpora in EAP-A tailore-made resource
 
How to expand your nlp solution to new languages using transfer learning
How to expand your nlp solution to new languages using transfer learningHow to expand your nlp solution to new languages using transfer learning
How to expand your nlp solution to new languages using transfer learning
 
Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)
 
Academic Phrasebank Navigable PDF
Academic Phrasebank Navigable PDFAcademic Phrasebank Navigable PDF
Academic Phrasebank Navigable PDF
 
Data Driven Learning Lite Presentation
Data Driven Learning Lite PresentationData Driven Learning Lite Presentation
Data Driven Learning Lite Presentation
 
Edad 695 research methodology
Edad 695 research methodologyEdad 695 research methodology
Edad 695 research methodology
 
Problem-based Learning & Resource-based Learning two complementary approac...
Problem-based Learning & Resource-based Learning  two complementary approac...Problem-based Learning & Resource-based Learning  two complementary approac...
Problem-based Learning & Resource-based Learning two complementary approac...
 
Academic-Phrasebank.pdf
Academic-Phrasebank.pdfAcademic-Phrasebank.pdf
Academic-Phrasebank.pdf
 
Tips for teaching writing
Tips for teaching writingTips for teaching writing
Tips for teaching writing
 
Alannah fitzgerald The TOETOE project planning for impact
Alannah fitzgerald The TOETOE project planning for impactAlannah fitzgerald The TOETOE project planning for impact
Alannah fitzgerald The TOETOE project planning for impact
 
Effective research strategies
Effective research strategiesEffective research strategies
Effective research strategies
 
EBMgt Course Module 6: Searching for Scientific Evidence
EBMgt Course Module 6: Searching for Scientific EvidenceEBMgt Course Module 6: Searching for Scientific Evidence
EBMgt Course Module 6: Searching for Scientific Evidence
 
Using and learning phrases
Using and learning phrasesUsing and learning phrases
Using and learning phrases
 
semantic web & natural language
semantic web & natural languagesemantic web & natural language
semantic web & natural language
 
Chapter 6, curriculum development in language teaching. j.c. richards
Chapter 6, curriculum development in language teaching.  j.c. richardsChapter 6, curriculum development in language teaching.  j.c. richards
Chapter 6, curriculum development in language teaching. j.c. richards
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...
 
Natural Language Processing using Java
Natural Language Processing using JavaNatural Language Processing using Java
Natural Language Processing using Java
 
Hacks for academic writing
Hacks for academic writingHacks for academic writing
Hacks for academic writing
 

Último

Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Último (20)

TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptx
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 

Enhancing Language Learning Using Corpora