SlideShare uma empresa Scribd logo
1 de 8
Baixar para ler offline
Module
          13
Natural Language
      Processing
        Version 2 CSE IIT, Kharagpur
Lesson
    41
   Parsing
Version 2 CSE IIT, Kharagpur
13.3 Natural Language Generation
The steps in natural language generation are as follows.


               Meaning representation

Utterance Planning

               Meaning representations for sentences

Sentence Planning and Lexical Choice

               Syntactic structures of sentences with lexical choices

Sentence Generation

               Morphologically analyzed words

Morphological Generation

               Words


13.4 Steps in Language Understanding and Generation
13.4.1 Morphological Analysis
   •   Analyzing words into their linguistic components (morphemes).
   •   Morphemes are the smallest meaningful units of language.
             cars                    car+PLU
             giving                  give+PROG
             geliyordum              gel+PROG+PAST+1SG          - I was coming

   •   Ambiguity: More than one alternatives
            flies                  flyVERB+PROG
                                   flyNOUN+PLU
            adam                   adam+ACC                - the man (accusative)
                                   adam+P1SG               - my man
                                   ada+P1SG+ACC             - my island (accusative)




                                                           Version 2 CSE IIT, Kharagpur
13.4.2 Parts-of-Speech (POS) Tagging
   •   Each word has a part-of-speech tag to describe its category.

   •   Part-of-speech tag of a word is one of major word groups
       (or its subgroups).
           – open classes -- noun, verb, adjective, adverb
           – closed classes -- prepositions, determiners, conjuctions, pronouns,
                particples

   •   POS Taggers try to find POS tags for the words.

   • duck is a verb or noun? (morphological analyzer cannot make decision).

   •   A POS tagger may make that decision by looking the surrounding words.
          – Duck! (verb)
          – Duck is delicious for dinner. (noun)

13.4.3 Lexical Processing
   •   The purpose of lexical processing is to determine meanings of individual words.

   •   Basic methods is to lookup in a database of meanings – lexicon

   •   We should also identify non-words such as punctuation marks.

   •   Word-level ambiguity -- words may have several meanings, and the correct one
       cannot be chosen based solely on the word itself.
          – bank in English

   •   Solution -- resolve the ambiguity on the spot by POS tagging (if possible) or pass-
       on the ambiguity to the other levels.

13.4.4 Syntactic Processing
   •   Parsing -- converting a flat input sentence into a hierarchical structure that
       corresponds to the units of meaning in the sentence.

   •   There are different parsing formalisms and algorithms.

   •   Most formalisms have two main components:
         – grammar -- a declarative representation describing the syntactic structure
              of sentences in the language.
         – parser -- an algorithm that analyzes the input and outputs its structural
              representation (its parse) consistent with the grammar specification.



                                                            Version 2 CSE IIT, Kharagpur
•   CFGs are in the center of many of the parsing mechanisms. But they are
       complemented by some additional features that make the formalism more suitable
       to handle natural languages.

13.4.5 Semantic Analysis
   •   Assigning meanings to the structures created by syntactic analysis.

   •   Mapping words and structures to particular domain objects in way consistent with
       our knowledge of the world.

   •   Semantic can play an import role in selecting among competing syntactic analyses
       and discarding illogical analyses.
          – I robbed the bank -- bank is a river bank or a financial institution

   •   We have to decide the formalisms which will be used in the meaning
       representation.


13.5 Knowledge Representation for NLP
   •   Which knowledge representation will be used depends on the application --
       Machine Translation, Database Query System.

   •   Requires the choice of representational framework, as well as the specific
       meaning vocabulary (what are concepts and relationship between these concepts
       -- ontology)

   •   Must be computationally effective.

   •   Common representational formalisms:
         – first order predicate logic
         – conceptual dependency graphs
         – semantic networks
         – Frame-based representations

13.6 Discourse
   •   Discourses are collection of coherent sentences (not arbitrary set of sentences)

   •   Discourses have also hierarchical structures (similar to sentences)

   •   anaphora resolution -- to resolve referring expression
          – Mary bought a book for Kelly. She didn’t like it.
                • She refers to Mary or Kelly. -- possibly Kelly
                • It refers to what -- book.
          – Mary had to lie for Kelly. She didn’t like it.

                                                           Version 2 CSE IIT, Kharagpur
•   Discourse structure may depend on application.
         – Monologue
         – Dialogue
         – Human-Computer Interaction

13.7 Applications of Natural Language Processing
  •   Machine Translation – Translation between two natural languages.
        – See the Babel Fish translations system on Alta Vista.

  •   Information Retrieval – Web search (uni-lingual or multi-lingual).

  •   Query Answering/Dialogue – Natural language interface with a database system,
      or a dialogue system.

  •   Report Generation – Generation of reports such as weather reports.

  •   Some Small Applications –
         – Grammar Checking, Spell Checking, Spell Corrector

13.8 Machine Translation
  •   Machine Translation refers to converting a text in language A into the
      corresponding text in language B (or speech).

  •   Different Machine Translation architectures are:
         – interlingua based systems
         – transfer based systems

  •   Challenges are to acquire the required knowledge resources such as mapping rules
      and bi-lingual dictionary? By hand or acquire them automatically from corpora.

  •   Example Based Machine Translation acquires the required knowledge (some of it
      or all of it) from corpora.




                                                         Version 2 CSE IIT, Kharagpur
Questions
1. Consider the following short story:

John went to the diner to eat lunch. He ordered a hamburger. But John wasn't very
hungry so he didn't _nish it. John told the waiter that he wanted a doggy bag. John gave
the waiter a tip. John then went to the hardware store and home.

Each inference below is based on a plausible interpretation of the story. For each
inference, briefly explain whether that inference was primarily based on syntactic,
semantic, pragmatic, discourse, or world knowledge. (Do not answer world knowledge
unless none of the other categories are appropriate.)

(a) John is the person who ordered a hamburger.

(b) John wasn't just stating a fact that he desired a doggy bag, but was requesting that the
waiter bring him a doggy bag.

(c) John went to the hardware store and then went to his house. (As opposed to going to
a hardware store and a hardware home.)

(d) John gave the waiter some money as a gratuity. (As opposed to giving him a
suggestion or hint.)

(e) John was wearing clothes.


2. Identify the thematic role associated with each noun phrase in the sentence below:

Mary went from Utah to Colorado with John by bicycle.


Solutions
1.a. Discourse knowledge. The inference comes from coreference resolution between
John” and “He” in the first and second sentences.

1.b. Pragmatics. Most people would assume that John was making a request of the waiter
and not merely stating a fact, which is a pragmatic issue because it reects the purpose of
John's statement.

1.c. Syntactic knowledge. This inference reflects one syntactic parse: ((hardware store)
and (home)), as opposed to an alternative parse: (hardware (store and home)).

1.d Semantic knowledge. Most people would assume that “tip” means gratuity, as
opposed to other meanings of the word “tip”, such as suggestion or hint.

                                                             Version 2 CSE IIT, Kharagpur
1.e. World Knowledge. There is nothing stated in the story that mentions clothes, but in
our culture people virtually always wear clothes when they leave their house. So we
make this assumption.

2. The roles are

agent = Mary
source (from-loc) = Utah
destination (to-loc) = Colorado
co-agent = John
instrument = bicycle




                                                           Version 2 CSE IIT, Kharagpur

Mais conteúdo relacionado

Mais procurados

Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)Yuriy Guts
 
Natural Language Processing in AI
Natural Language Processing in AINatural Language Processing in AI
Natural Language Processing in AISaurav Shrestha
 
Natural Language Processing: Parsing
Natural Language Processing: ParsingNatural Language Processing: Parsing
Natural Language Processing: ParsingRushdi Shams
 
Stemming And Lemmatization Tutorial | Natural Language Processing (NLP) With ...
Stemming And Lemmatization Tutorial | Natural Language Processing (NLP) With ...Stemming And Lemmatization Tutorial | Natural Language Processing (NLP) With ...
Stemming And Lemmatization Tutorial | Natural Language Processing (NLP) With ...Edureka!
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingRishikese MR
 
Natural language processing
Natural language processing Natural language processing
Natural language processing Md.Sumon Sarder
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingMariana Soffer
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with PythonBenjamin Bengfort
 
A Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingA Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingTed Xiao
 
Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing Mustafa Jarrar
 
Natural Language Processing glossary for Coders
Natural Language Processing glossary for CodersNatural Language Processing glossary for Coders
Natural Language Processing glossary for CodersAravind Mohanoor
 
Natural Language Processing - Unit 1
Natural Language Processing - Unit 1Natural Language Processing - Unit 1
Natural Language Processing - Unit 1Mithun B N
 
Natural language processing
Natural language processingNatural language processing
Natural language processingKarenVacca
 

Mais procurados (20)

Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
Natural Language Processing in AI
Natural Language Processing in AINatural Language Processing in AI
Natural Language Processing in AI
 
NLP_KASHK:Text Normalization
NLP_KASHK:Text NormalizationNLP_KASHK:Text Normalization
NLP_KASHK:Text Normalization
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
NLP
NLPNLP
NLP
 
Natural Language Processing: Parsing
Natural Language Processing: ParsingNatural Language Processing: Parsing
Natural Language Processing: Parsing
 
Stemming And Lemmatization Tutorial | Natural Language Processing (NLP) With ...
Stemming And Lemmatization Tutorial | Natural Language Processing (NLP) With ...Stemming And Lemmatization Tutorial | Natural Language Processing (NLP) With ...
Stemming And Lemmatization Tutorial | Natural Language Processing (NLP) With ...
 
Nlp
NlpNlp
Nlp
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural language processing
Natural language processing Natural language processing
Natural language processing
 
NLP_KASHK:POS Tagging
NLP_KASHK:POS TaggingNLP_KASHK:POS Tagging
NLP_KASHK:POS Tagging
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with Python
 
A Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingA Panorama of Natural Language Processing
A Panorama of Natural Language Processing
 
Formal languages
Formal languagesFormal languages
Formal languages
 
Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing
 
Natural Language Processing glossary for Coders
Natural Language Processing glossary for CodersNatural Language Processing glossary for Coders
Natural Language Processing glossary for Coders
 
Natural Language Processing - Unit 1
Natural Language Processing - Unit 1Natural Language Processing - Unit 1
Natural Language Processing - Unit 1
 
NLP
NLPNLP
NLP
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 

Semelhante a Parsing and Applications of NLP

Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4DigiGurukul
 
NLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptNLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptOlusolaTop
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)Kuppusamy P
 
Natural Language Processing Course in AI
Natural Language Processing Course in AINatural Language Processing Course in AI
Natural Language Processing Course in AISATHYANARAYANAKB
 
Natural Language Processing (NLP).pptx
Natural Language Processing (NLP).pptxNatural Language Processing (NLP).pptx
Natural Language Processing (NLP).pptxSHIBDASDUTTA
 
Presentation generative-transformational grammar
Presentation generative-transformational grammar Presentation generative-transformational grammar
Presentation generative-transformational grammar Nailun Naja
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingToine Bogers
 
CNN for NLP using text analysis by using deep learning
CNN for NLP using text analysis by using deep learningCNN for NLP using text analysis by using deep learning
CNN for NLP using text analysis by using deep learningKv Sagar
 
Sanskrit in Natural Language Processing
Sanskrit in Natural Language ProcessingSanskrit in Natural Language Processing
Sanskrit in Natural Language ProcessingHitesh Joshi
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)Abdullah al Mamun
 
Jarrar: Introduction to Natural Language Processing
Jarrar: Introduction to Natural Language ProcessingJarrar: Introduction to Natural Language Processing
Jarrar: Introduction to Natural Language ProcessingMustafa Jarrar
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language ProcessingPranav Gupta
 
Wide Coverage Semantic Representations from a CCG Parser
Wide Coverage Semantic Representations from a CCG ParserWide Coverage Semantic Representations from a CCG Parser
Wide Coverage Semantic Representations from a CCG ParserMark Chang
 

Semelhante a Parsing and Applications of NLP (20)

Lesson 41.pdf
Lesson 41.pdfLesson 41.pdf
Lesson 41.pdf
 
AI Lesson 40
AI Lesson 40AI Lesson 40
AI Lesson 40
 
intro.ppt
intro.pptintro.ppt
intro.ppt
 
Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4
 
NLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptNLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.ppt
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)
 
Natural Language Processing Course in AI
Natural Language Processing Course in AINatural Language Processing Course in AI
Natural Language Processing Course in AI
 
Natural Language Processing (NLP).pptx
Natural Language Processing (NLP).pptxNatural Language Processing (NLP).pptx
Natural Language Processing (NLP).pptx
 
NLP_KASHK: Introduction
NLP_KASHK: Introduction NLP_KASHK: Introduction
NLP_KASHK: Introduction
 
Presentation generative-transformational grammar
Presentation generative-transformational grammar Presentation generative-transformational grammar
Presentation generative-transformational grammar
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
CNN for NLP using text analysis by using deep learning
CNN for NLP using text analysis by using deep learningCNN for NLP using text analysis by using deep learning
CNN for NLP using text analysis by using deep learning
 
Sanskrit in Natural Language Processing
Sanskrit in Natural Language ProcessingSanskrit in Natural Language Processing
Sanskrit in Natural Language Processing
 
L1 nlp intro
L1 nlp introL1 nlp intro
L1 nlp intro
 
Nlp (1)
Nlp (1)Nlp (1)
Nlp (1)
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
Jarrar: Introduction to Natural Language Processing
Jarrar: Introduction to Natural Language ProcessingJarrar: Introduction to Natural Language Processing
Jarrar: Introduction to Natural Language Processing
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Wide Coverage Semantic Representations from a CCG Parser
Wide Coverage Semantic Representations from a CCG ParserWide Coverage Semantic Representations from a CCG Parser
Wide Coverage Semantic Representations from a CCG Parser
 

Mais de Avijit Kumar (20)

Lesson 18
Lesson 18Lesson 18
Lesson 18
 
Lesson 19
Lesson 19Lesson 19
Lesson 19
 
Lesson 20
Lesson 20Lesson 20
Lesson 20
 
Lesson 21
Lesson 21Lesson 21
Lesson 21
 
Lesson 23
Lesson 23Lesson 23
Lesson 23
 
Lesson 25
Lesson 25Lesson 25
Lesson 25
 
Lesson 24
Lesson 24Lesson 24
Lesson 24
 
Lesson 22
Lesson 22Lesson 22
Lesson 22
 
Lesson 26
Lesson 26Lesson 26
Lesson 26
 
Lesson 27
Lesson 27Lesson 27
Lesson 27
 
Lesson 28
Lesson 28Lesson 28
Lesson 28
 
Lesson 29
Lesson 29Lesson 29
Lesson 29
 
Lesson 30
Lesson 30Lesson 30
Lesson 30
 
Lesson 31
Lesson 31Lesson 31
Lesson 31
 
Lesson 32
Lesson 32Lesson 32
Lesson 32
 
Lesson 33
Lesson 33Lesson 33
Lesson 33
 
Lesson 36
Lesson 36Lesson 36
Lesson 36
 
Lesson 35
Lesson 35Lesson 35
Lesson 35
 
Lesson 37
Lesson 37Lesson 37
Lesson 37
 
Lesson 39
Lesson 39Lesson 39
Lesson 39
 

Último

Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 

Último (20)

Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 

Parsing and Applications of NLP

  • 1. Module 13 Natural Language Processing Version 2 CSE IIT, Kharagpur
  • 2. Lesson 41 Parsing Version 2 CSE IIT, Kharagpur
  • 3. 13.3 Natural Language Generation The steps in natural language generation are as follows. Meaning representation Utterance Planning Meaning representations for sentences Sentence Planning and Lexical Choice Syntactic structures of sentences with lexical choices Sentence Generation Morphologically analyzed words Morphological Generation Words 13.4 Steps in Language Understanding and Generation 13.4.1 Morphological Analysis • Analyzing words into their linguistic components (morphemes). • Morphemes are the smallest meaningful units of language. cars car+PLU giving give+PROG geliyordum gel+PROG+PAST+1SG - I was coming • Ambiguity: More than one alternatives flies flyVERB+PROG flyNOUN+PLU adam adam+ACC - the man (accusative) adam+P1SG - my man ada+P1SG+ACC - my island (accusative) Version 2 CSE IIT, Kharagpur
  • 4. 13.4.2 Parts-of-Speech (POS) Tagging • Each word has a part-of-speech tag to describe its category. • Part-of-speech tag of a word is one of major word groups (or its subgroups). – open classes -- noun, verb, adjective, adverb – closed classes -- prepositions, determiners, conjuctions, pronouns, particples • POS Taggers try to find POS tags for the words. • duck is a verb or noun? (morphological analyzer cannot make decision). • A POS tagger may make that decision by looking the surrounding words. – Duck! (verb) – Duck is delicious for dinner. (noun) 13.4.3 Lexical Processing • The purpose of lexical processing is to determine meanings of individual words. • Basic methods is to lookup in a database of meanings – lexicon • We should also identify non-words such as punctuation marks. • Word-level ambiguity -- words may have several meanings, and the correct one cannot be chosen based solely on the word itself. – bank in English • Solution -- resolve the ambiguity on the spot by POS tagging (if possible) or pass- on the ambiguity to the other levels. 13.4.4 Syntactic Processing • Parsing -- converting a flat input sentence into a hierarchical structure that corresponds to the units of meaning in the sentence. • There are different parsing formalisms and algorithms. • Most formalisms have two main components: – grammar -- a declarative representation describing the syntactic structure of sentences in the language. – parser -- an algorithm that analyzes the input and outputs its structural representation (its parse) consistent with the grammar specification. Version 2 CSE IIT, Kharagpur
  • 5. CFGs are in the center of many of the parsing mechanisms. But they are complemented by some additional features that make the formalism more suitable to handle natural languages. 13.4.5 Semantic Analysis • Assigning meanings to the structures created by syntactic analysis. • Mapping words and structures to particular domain objects in way consistent with our knowledge of the world. • Semantic can play an import role in selecting among competing syntactic analyses and discarding illogical analyses. – I robbed the bank -- bank is a river bank or a financial institution • We have to decide the formalisms which will be used in the meaning representation. 13.5 Knowledge Representation for NLP • Which knowledge representation will be used depends on the application -- Machine Translation, Database Query System. • Requires the choice of representational framework, as well as the specific meaning vocabulary (what are concepts and relationship between these concepts -- ontology) • Must be computationally effective. • Common representational formalisms: – first order predicate logic – conceptual dependency graphs – semantic networks – Frame-based representations 13.6 Discourse • Discourses are collection of coherent sentences (not arbitrary set of sentences) • Discourses have also hierarchical structures (similar to sentences) • anaphora resolution -- to resolve referring expression – Mary bought a book for Kelly. She didn’t like it. • She refers to Mary or Kelly. -- possibly Kelly • It refers to what -- book. – Mary had to lie for Kelly. She didn’t like it. Version 2 CSE IIT, Kharagpur
  • 6. Discourse structure may depend on application. – Monologue – Dialogue – Human-Computer Interaction 13.7 Applications of Natural Language Processing • Machine Translation – Translation between two natural languages. – See the Babel Fish translations system on Alta Vista. • Information Retrieval – Web search (uni-lingual or multi-lingual). • Query Answering/Dialogue – Natural language interface with a database system, or a dialogue system. • Report Generation – Generation of reports such as weather reports. • Some Small Applications – – Grammar Checking, Spell Checking, Spell Corrector 13.8 Machine Translation • Machine Translation refers to converting a text in language A into the corresponding text in language B (or speech). • Different Machine Translation architectures are: – interlingua based systems – transfer based systems • Challenges are to acquire the required knowledge resources such as mapping rules and bi-lingual dictionary? By hand or acquire them automatically from corpora. • Example Based Machine Translation acquires the required knowledge (some of it or all of it) from corpora. Version 2 CSE IIT, Kharagpur
  • 7. Questions 1. Consider the following short story: John went to the diner to eat lunch. He ordered a hamburger. But John wasn't very hungry so he didn't _nish it. John told the waiter that he wanted a doggy bag. John gave the waiter a tip. John then went to the hardware store and home. Each inference below is based on a plausible interpretation of the story. For each inference, briefly explain whether that inference was primarily based on syntactic, semantic, pragmatic, discourse, or world knowledge. (Do not answer world knowledge unless none of the other categories are appropriate.) (a) John is the person who ordered a hamburger. (b) John wasn't just stating a fact that he desired a doggy bag, but was requesting that the waiter bring him a doggy bag. (c) John went to the hardware store and then went to his house. (As opposed to going to a hardware store and a hardware home.) (d) John gave the waiter some money as a gratuity. (As opposed to giving him a suggestion or hint.) (e) John was wearing clothes. 2. Identify the thematic role associated with each noun phrase in the sentence below: Mary went from Utah to Colorado with John by bicycle. Solutions 1.a. Discourse knowledge. The inference comes from coreference resolution between John” and “He” in the first and second sentences. 1.b. Pragmatics. Most people would assume that John was making a request of the waiter and not merely stating a fact, which is a pragmatic issue because it reects the purpose of John's statement. 1.c. Syntactic knowledge. This inference reflects one syntactic parse: ((hardware store) and (home)), as opposed to an alternative parse: (hardware (store and home)). 1.d Semantic knowledge. Most people would assume that “tip” means gratuity, as opposed to other meanings of the word “tip”, such as suggestion or hint. Version 2 CSE IIT, Kharagpur
  • 8. 1.e. World Knowledge. There is nothing stated in the story that mentions clothes, but in our culture people virtually always wear clothes when they leave their house. So we make this assumption. 2. The roles are agent = Mary source (from-loc) = Utah destination (to-loc) = Colorado co-agent = John instrument = bicycle Version 2 CSE IIT, Kharagpur