Corpus Linguistics
What is Corpus linguistics?
Corpus linguistics is the study of language as
  expressed in samples (corpora) or "real world"
  text. This method represents an inductive
  approach to deriving a set of abstract rules by
  which a natural language is governed or else
  relates to another language. Originally done
  by hand, corpora are now largely derived by
  an automated process.
One of the main contributions of corpus
 linguistics is in the area of exploring patterns
 of language use. Corpus linguistics provides an
 extremely powerful tool for the analysis of
  natural language and of how its use varies in different
 situations.
As a result of these advances there are typically
  four features that are seen as characteristic of
  corpus-based analyses of language:
o It’s empirical, analyzing the actual patterns of use
  in natural texts.
o It utilizes a large and principled collection of natural
  texts, known as a ‘corpus’, as the basis for analysis
o It makes extensive use of computers for analysis,
  using both automatic and interactive techniques
o It depends on both quantitative and qualitative
  analytical techniques
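On the quantitative side, counts from corpora of different sizes are usually compared as normalized rates, for example occurrences per million words. The sketch below illustrates that idea only; the function name `per_million` and all counts and corpus sizes are invented for illustration and are not taken from any real corpus.

```python
def per_million(count, corpus_size):
    """Normalize a raw count to a rate per million words."""
    return count * 1_000_000 / corpus_size

# Hypothetical figures: the same word counted in two sub-corpora of different sizes.
conversation_rate = per_million(count=150, corpus_size=50_000)
news_rate = per_million(count=400, corpus_size=200_000)

# The raw count is higher in the news sample, but the normalized rate is lower.
print(f"conversation: {conversation_rate:.0f} per million words")
print(f"news:         {news_rate:.0f} per million words")
```

Normalizing first makes it possible to compare use across situations even when the text samples differ greatly in length.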
Corpus Design and Compilation
A corpus is a large and principled collection of
  texts stored in electronic format. There is no
  minimum size for a text collection to be
  considered a corpus. Electronic storage is a significant
  development as it enables researchers all over
  the world to access the same sets of data,
  which not only encourages a higher degree of
  accountability in data analysis, but also
  permits collaborative work and follow-up
  studies by different researchers.
Types of Corpora
There are as many types of corpora as there are
  research topics in linguistics. General corpora,
  such as the Brown Corpus, the LOB, or the BNC,
  aim to represent language in its broadest sense
  and to serve as a widely available resource for
  baseline or comparative studies of general
  linguistic features.
A general corpus is designed to be balanced and
  include language samples from a wide range of
  registers or genres, including both fiction and
  nonfiction in all their diversity.
Corpus Compilation
When creating a corpus, data collection involves
  obtaining or creating electronic versions of the
  target texts, and storing and organizing them.
  Written corpora are far less labor-intensive to
  collect than spoken corpora.
The data collection phase of building a spoken
  corpus is lengthy and expensive. The first step
  is to decide on a transcription system.
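The storing and organizing step can be sketched as follows. This is a minimal illustration, not a standard corpus format: the helper `add_text`, the `index.json` file, and the metadata fields (`register`, `mode`, `words`) are all assumptions made for the example, and the two texts are invented.

```python
import json
import tempfile
from pathlib import Path

def add_text(corpus_dir, text_id, text, register, mode):
    """Save one text as a .txt file and record its metadata in index.json."""
    corpus_dir = Path(corpus_dir)
    (corpus_dir / f"{text_id}.txt").write_text(text, encoding="utf-8")

    # Load the existing index if there is one, otherwise start a new one.
    index_path = corpus_dir / "index.json"
    index = json.loads(index_path.read_text(encoding="utf-8")) if index_path.exists() else {}

    # Word count here is a rough whitespace split, enough for bookkeeping.
    index[text_id] = {"register": register, "mode": mode, "words": len(text.split())}
    index_path.write_text(json.dumps(index, indent=2), encoding="utf-8")
    return index

# Build a tiny two-text corpus in a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    add_text(tmp, "news_001", "The council met on Monday.", "news", "written")
    index = add_text(tmp, "conv_001", "well um I think so", "conversation", "spoken")
    print(json.dumps(index, indent=2))
```

Keeping metadata such as register and mode alongside each text is what later allows sub-corpora to be selected and compared.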
Word Counts and Basic Corpus Tools
There are many levels of information that can be
  gathered from a corpus. These levels range
  from simple word lists to more complex analyses
  that can reveal linguistic association patterns.
The tools that are used for these analyses range
  from basic concordance packages to complex
  interactive computer programs.
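The two most basic tools mentioned above, a word list and a concordance, can be sketched in a few lines. This is a simplified illustration rather than how any particular concordance package works: the tokenizer is a deliberately crude regular expression, the function names are hypothetical, and the sample text is invented.

```python
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and split it into word tokens (a very simple rule)."""
    return re.findall(r"[a-z']+", text.lower())

def word_list(tokens, n=5):
    """Return the n most frequent words with their counts."""
    return Counter(tokens).most_common(n)

def concordance(tokens, keyword, width=3):
    """Return keyword-in-context lines with `width` tokens on each side of a hit."""
    lines = []
    for i, tok in enumerate(tokens):
        if tok == keyword:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append(f"{left} [{tok}] {right}")
    return lines

sample = ("The corpus is large. A corpus is a principled collection of texts. "
          "The texts are stored.")
tokens = tokenize(sample)

print(word_list(tokens, 5))
for line in concordance(tokens, "corpus"):
    print(line)
```

The word list gives the quantitative overview, while the concordance lines put each occurrence back in context for qualitative reading.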
