SlideShare uma empresa Scribd logo
1 de 18
Baixar para ler offline
Language
                           Modeling in
                         Turner&Charniak
                              (2007)

                            Kilian Evang


                         Language Models
                         N-gram LMs

 Language Modeling in    Charniak’s LM

                         Determiner

Turner&Charniak (2007)   Selection
                         Method
                         Results
                         Reasons for Success

                         References
      Kilian Evang


       2009-11-30
Language
Recap: Language Models                                           Modeling in
                                                               Turner&Charniak
                                                                    (2007)

                                                                  Kilian Evang


                                                               Language Models
                                                               N-gram LMs
                                                               Charniak’s LM
  ◮   LMs assign probabilities to sentences                    Determiner
                                                               Selection
  ◮   a sentence is a complex event                            Method
                                                               Results
  ◮   LMs break it up into a sequence of “atomic” events       Reasons for Success

                                                               References
  ◮   each “atomic” event conditioned on certain previous
      events
  ◮   conditional probabilities approximated by counting and
      smoothing
Language
N-gram LMs                                             Modeling in
                                                     Turner&Charniak
                                                          (2007)

                                                        Kilian Evang


                                                     Language Models
                       n-gram LMs    Charniak’s LM   N-gram LMs
                                                     Charniak’s LM
 sequence represents   sentence                      Determiner
                                                     Selection
 p(sent) =             p(seq)                        Method

 events are            words,                        Results
                                                     Reasons for Success

                       end symbols                   References




 conditioned on        the n − 1
                       previous
                       events
Language
A Sentence – a Sequence of Events                    Modeling in
                                                   Turner&Charniak
                                                        (2007)

                                                      Kilian Evang


                                                   Language Models
                                                   N-gram LMs

Sentence                                           Charniak’s LM

                                                   Determiner
                                                   Selection
                                                   Method
                                                   Results
                                                   Reasons for Success

   put       the      ball        in   the   box   References




Event sequence
put, the, ball, in, the, box, ∆
Language
A Sentence – a Sequence of Events                    Modeling in
                                                   Turner&Charniak
                                                        (2007)

                                                      Kilian Evang


                                                   Language Models
                                                   N-gram LMs

Sentence                                           Charniak’s LM

                                                   Determiner
                                                   Selection
                                                   Method
                                                   Results
                                                   Reasons for Success

   put      the       ball      in     the   box   References




Conditional probability
p(wi = the|wi −2 = ball, wi −1 = in)
Language
N-gram LMs vs. Charniak’s Parsing LM                       Modeling in
                                                         Turner&Charniak
                                                              (2007)

                                                            Kilian Evang


                       n-gram LMs    Charniak’s LM       Language Models
                                                         N-gram LMs
 sequence represents   sentence      parse tree          Charniak’s LM

 p(sent) =             p(seq)           p(seq)           Determiner
                                                         Selection
                                     seq                 Method
 events are            words,        pre-terminals,      Results
                                                         Reasons for Success
                       end symbols   terminals,          References
                                     constituents,
                                     end symbols
 conditioned on        the n − 1     certain previous
                       previous      events, depending
                       events        on type
Language
A Parse Tree – a Sequence of Events                        Modeling in
                                                         Turner&Charniak
                                                              (2007)

                                                            Kilian Evang
Parse tree
         vp                                              Language Models
                                                         N-gram LMs
                                                         Charniak’s LM

                                                         Determiner
                    np                 pp                Selection
                                                         Method
                                                         Results
                                                         Reasons for Success
                                             np          References



  verb        det        noun   prep   det        noun

   put        the        ball    in    the        box
Language
A Parse Tree – a Sequence of Events                                  Modeling in
                                                                   Turner&Charniak
                                                                        (2007)

                                                                      Kilian Evang
Parse tree
           vp                                                      Language Models
                                                                   N-gram LMs
                                                                   Charniak’s LM

                                                                   Determiner
                      np                      pp                   Selection
                                                                   Method
                                                                   Results
                                                                   Reasons for Success
                                                    np             References



  verb          det        noun   prep        det        noun

   put          the        ball     in        the        box


Event sequence
verb, put, M, ∆, M, np, pp, ∆, noun, ball, M, det, ∆, M, ∆, the,
prep, in, M, ∆, M, np, ∆, noun, box, M, det, ∆, M, ∆, the
Language
Digression: Non-head Constituents                           Modeling in
                                                          Turner&Charniak
                                                               (2007)

                                                             Kilian Evang


                                                          Language Models
Tree fragment                                             N-gram LMs
                                                          Charniak’s LM

                                  l                       Determiner
                                                          Selection
                                                          Method
                                                          Results
            Lm     ...    L1      t      R1    ...   Rn   Reasons for Success

                                                          References


                                  h



Event sequence fragment
M, L1 , . . ., Lm , ∆, M, R1 , . . ., Rn , ∆
Language
A Parse Tree – a Sequence of Events                        Modeling in
                                                         Turner&Charniak
                                                              (2007)

                                                            Kilian Evang
Parse tree
                                                         Language Models
         vp                                              N-gram LMs
                                                         Charniak’s LM

                                                         Determiner
                    np                 pp                Selection
                                                         Method
                                                         Results
                                                         Reasons for Success
                                             np          References



  verb        det        noun   prep   det        noun

   put        the        ball    in    the        box


Conditional probability for a head pre-terminal
p(t = noun|l = np, m = vp, u = verb, i = put)
Language
A Parse Tree – a Sequence of Events                         Modeling in
                                                          Turner&Charniak
                                                               (2007)

                                                             Kilian Evang
Parse tree
                                                          Language Models
         vp                                               N-gram LMs
                                                          Charniak’s LM

                                                          Determiner
                    np                  pp                Selection
                                                          Method
                                                          Results
                                                          Reasons for Success
                                              np          References



  verb        det        noun   prep    det        noun

   put        the        ball    in     the        box


Conditional probability for a head terminal
p(h = ball|t = noun, l = np, m = vp, u = verb, i = put)
Language
A Parse Tree – a Sequence of Events                        Modeling in
                                                         Turner&Charniak
                                                              (2007)


Parse tree                                                  Kilian Evang


         vp                                              Language Models
                                                         N-gram LMs
                                                         Charniak’s LM

                                                         Determiner
                    np                  pp               Selection
                                                         Method
                                                         Results
                                                         Reasons for Success
                                             np
                                                         References



  verb        det        noun   prep   det        noun

   put        the        ball    in    the        box


Conditional probability for a non-head constituent
p(Li = det|Li −1 = M, h = ball, t = noun, l = np, m =
vp, u = verb)
Language
Overview: Conditioning                                                        Modeling in
                                                                            Turner&Charniak
                                                                                 (2007)

                                                                               Kilian Evang
 event type                    conditioned on
                                                                            Language Models
 head pre-terminal t           constituent label l,
                                                                            N-gram LMs
                               mother constituent label m,                  Charniak’s LM

                               mother constituent head pre-terminal u       Determiner
                                                                            Selection
                               mother consitutent head terminal i
                                                                            Method
 head terminal h               head pre-terminal t,                         Results
                                                                            Reasons for Success
                               constituent label l,
                                                                            References
                               mother constituent label m,
                               mother constituent head pre-terminal u
                               mother consitutent head terminal i
 non-head                      (part of) L1...i −1 (L1...m , R1...i −1 ),
 constituent label Li (Ri ),   head terminal h,
 end symbol ∆                  head pre-terminal t,
                               constituent label l,
                               mother constituent label m,
                               mother constituent head pre-terminal u
Language
Determiner Selection                                               Modeling in
                                                                 Turner&Charniak
                                                                      (2007)

                                                                    Kilian Evang


                                                                 Language Models
                                                                 N-gram LMs
                                                                 Charniak’s LM

                                                                 Determiner
  ◮   for each NP,                                               Selection
                                                                 Method
        ◮   for each possible determiner (the, a/an, null),      Results
                                                                 Reasons for Success
              ◮   determine probability of NP
                                                                 References
        ◮   choose determiner resulting in highest probability
  ◮   note: sufficient to determine probabilities for events
      that differ
Language
Determiner Selection – Example                                         Modeling in
                                                                     Turner&Charniak
                                                                          (2007)

                                                                        Kilian Evang
  ◮   “put [NP the ball] in the box”
                                                                     Language Models
        ◮   p(L1 = det|m = vp, u = verb, l = np, t = noun, h =       N-gram LMs
                                                                     Charniak’s LM
            ball) × p(L2 = ∆|m = vp, u = verb, l = np, t =
                                                                     Determiner
            noun, h = ball, L1 = det) × p(det → the|m = vp, u =      Selection
            verb, l = np, t = noun, h = ball, L1 = det)              Method
                                                                     Results
                                                                     Reasons for Success
  ◮   “put [NP a/an ball] in the box”
                                                                     References
        ◮   p(L1 = det|m = vp, u = verb, l = np, t = noun, h =
            ball) × p(L2 = ∆|m = vp, u = verb, l = np, t =
            noun, h = ball, L1 = det) × (p(det → a|m = vp, u =
            verb, l = np, t = noun, h = ball, L1 = det) + p(det →
            an|m = vp, u = verb, l = np, t = noun, h = ball, L1 =
            det))
  ◮   “put [NP ball] in the box”
        ◮   p(L1 = ∆|m = vp, u = verb, l = np, t = noun, h = ball)
Language
Results     Modeling in
          Turner&Charniak
               (2007)

             Kilian Evang


          Language Models
          N-gram LMs
          Charniak’s LM

          Determiner
          Selection
          Method
          Results
          Reasons for Success

          References
Language
Reasons for Success                                                      Modeling in
                                                                       Turner&Charniak
                                                                            (2007)

                                                                          Kilian Evang


                                                                       Language Models
                                                                       N-gram LMs
                                                                       Charniak’s LM

                                                                       Determiner
  ◮   syntactic structure allows for long-distance conditioning,       Selection

      e.g.                                                             Method
                                                                       Results
                                                                       Reasons for Success
            ◮   he [VP gave [NP the sultan of Brunei] [NP a cactus]]
                                                                       References
  ◮   constituent head enforces selectional preferences,
      reflected in head-first strategy
  ◮   ...
Language
References                                                       Modeling in
                                                               Turner&Charniak
                                                                    (2007)

                                                                  Kilian Evang


                                                               Language Models
    Eugene Charniak (2000)                                     N-gram LMs

    A Maximum-Entropy-Inspired Parser                          Charniak’s LM

                                                               Determiner
    Proceedings of the First Meeting of the North American     Selection

    Chapter of the Association for Computational Linguistics   Method
                                                               Results
                                                               Reasons for Success

    Eugene Charniak (2001)                                     References

    Immediate-Head Parsing for Language Models
    Proceedings of the 39th Annual Meeting of the
    Association for Computational Linguistics
    Jenine Turner & Eugene Charniak (2007)
    Language Modeling for Determiner Selection
    Proceedings of NAACL HLT 2007, Companion Volume

Mais conteúdo relacionado

Destaque

Smoky Mountain Secrets
Smoky Mountain SecretsSmoky Mountain Secrets
Smoky Mountain SecretsJeanMarlowe
 
2011 05 09_배달통
2011 05 09_배달통2011 05 09_배달통
2011 05 09_배달통Sanghoon Kim
 
Joshua Brewer - Lesson on Formulating Hypotheses - Final Draft
Joshua Brewer - Lesson on Formulating Hypotheses - Final DraftJoshua Brewer - Lesson on Formulating Hypotheses - Final Draft
Joshua Brewer - Lesson on Formulating Hypotheses - Final DraftMorningMaestro
 
I Npd Mfei 5 10
I Npd Mfei 5 10I Npd Mfei 5 10
I Npd Mfei 5 10kbmcgourty
 
Pdf negro 19-mayo-2014-01
Pdf negro 19-mayo-2014-01Pdf negro 19-mayo-2014-01
Pdf negro 19-mayo-2014-01imm
 
Vaya con dios1_(pp_tminimizer)
Vaya con dios1_(pp_tminimizer)Vaya con dios1_(pp_tminimizer)
Vaya con dios1_(pp_tminimizer)Vesdo 1
 
앱리뷰 말하는웨더알람
앱리뷰 말하는웨더알람앱리뷰 말하는웨더알람
앱리뷰 말하는웨더알람Sanghoon Kim
 
Documento compraventa terreno campeche
Documento compraventa terreno campecheDocumento compraventa terreno campeche
Documento compraventa terreno campecheErmilo De la Cruz
 
Cross Media for non-profits
Cross Media for non-profitsCross Media for non-profits
Cross Media for non-profitsRCAcrossmedia
 
Doha 2006 sukan asia
Doha 2006 sukan asiaDoha 2006 sukan asia
Doha 2006 sukan asiaOlimpikini
 
Increasing food security through strategies and investments to increase incom...
Increasing food security through strategies and investments to increase incom...Increasing food security through strategies and investments to increase incom...
Increasing food security through strategies and investments to increase incom...Bangladesh Food Security Investment Forum
 
elmuellocalmon
elmuellocalmonelmuellocalmon
elmuellocalmoncristian
 
Final project
Final project Final project
Final project imm
 
Funcionaris versus emprenedors
Funcionaris versus emprenedorsFuncionaris versus emprenedors
Funcionaris versus emprenedorsaleixclosmari
 
La nit del terror. Euclides
La nit del terror. Euclides La nit del terror. Euclides
La nit del terror. Euclides joan
 
Peru presentation
Peru presentationPeru presentation
Peru presentationanchefu
 
Presentatie Ssis Part I
Presentatie Ssis Part IPresentatie Ssis Part I
Presentatie Ssis Part Isecuserve1
 

Destaque (20)

Smoky Mountain Secrets
Smoky Mountain SecretsSmoky Mountain Secrets
Smoky Mountain Secrets
 
2011 05 09_배달통
2011 05 09_배달통2011 05 09_배달통
2011 05 09_배달통
 
Joshua Brewer - Lesson on Formulating Hypotheses - Final Draft
Joshua Brewer - Lesson on Formulating Hypotheses - Final DraftJoshua Brewer - Lesson on Formulating Hypotheses - Final Draft
Joshua Brewer - Lesson on Formulating Hypotheses - Final Draft
 
I Npd Mfei 5 10
I Npd Mfei 5 10I Npd Mfei 5 10
I Npd Mfei 5 10
 
Pdf negro 19-mayo-2014-01
Pdf negro 19-mayo-2014-01Pdf negro 19-mayo-2014-01
Pdf negro 19-mayo-2014-01
 
Dip.IFRS ACCA(UK)
Dip.IFRS ACCA(UK)Dip.IFRS ACCA(UK)
Dip.IFRS ACCA(UK)
 
Vaya con dios1_(pp_tminimizer)
Vaya con dios1_(pp_tminimizer)Vaya con dios1_(pp_tminimizer)
Vaya con dios1_(pp_tminimizer)
 
앱리뷰 말하는웨더알람
앱리뷰 말하는웨더알람앱리뷰 말하는웨더알람
앱리뷰 말하는웨더알람
 
Documento compraventa terreno campeche
Documento compraventa terreno campecheDocumento compraventa terreno campeche
Documento compraventa terreno campeche
 
Cross Media for non-profits
Cross Media for non-profitsCross Media for non-profits
Cross Media for non-profits
 
Doha 2006 sukan asia
Doha 2006 sukan asiaDoha 2006 sukan asia
Doha 2006 sukan asia
 
앱리뷰 Dayta
앱리뷰 Dayta앱리뷰 Dayta
앱리뷰 Dayta
 
Increasing food security through strategies and investments to increase incom...
Increasing food security through strategies and investments to increase incom...Increasing food security through strategies and investments to increase incom...
Increasing food security through strategies and investments to increase incom...
 
elmuellocalmon
elmuellocalmonelmuellocalmon
elmuellocalmon
 
Final project
Final project Final project
Final project
 
2011 05 30_curing
2011 05 30_curing2011 05 30_curing
2011 05 30_curing
 
Funcionaris versus emprenedors
Funcionaris versus emprenedorsFuncionaris versus emprenedors
Funcionaris versus emprenedors
 
La nit del terror. Euclides
La nit del terror. Euclides La nit del terror. Euclides
La nit del terror. Euclides
 
Peru presentation
Peru presentationPeru presentation
Peru presentation
 
Presentatie Ssis Part I
Presentatie Ssis Part IPresentatie Ssis Part I
Presentatie Ssis Part I
 

Último

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 

Último (20)

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 

Language Modeling in Turner&Charniak (2007)

  • 1. Language Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Language Modeling in Charniak’s LM Determiner Turner&Charniak (2007) Selection Method Results Reasons for Success References Kilian Evang 2009-11-30
  • 2. Language Recap: Language Models Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Charniak’s LM ◮ LMs assign probabilities to sentences Determiner Selection ◮ a sentence is a complex event Method Results ◮ LMs break it up into a sequence of “atomic” events Reasons for Success References ◮ each “atomic” event conditioned on certain previous events ◮ conditional probabilities approximated by counting and smoothing
  • 3. Language N-gram LMs Modeling in Turner&Charniak (2007) Kilian Evang Language Models n-gram LMs Charniak’s LM N-gram LMs Charniak’s LM sequence represents sentence Determiner Selection p(sent) = p(seq) Method events are words, Results Reasons for Success end symbols References conditioned on the n − 1 previous events
  • 4. Language A Sentence – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Sentence Charniak’s LM Determiner Selection Method Results Reasons for Success put the ball in the box References Event sequence put, the, ball, in, the, box, ∆
  • 5. Language A Sentence – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Sentence Charniak’s LM Determiner Selection Method Results Reasons for Success put the ball in the box References Conditional probability p(wi = the|wi −2 = ball, wi −1 = in)
  • 6. Language N-gram LMs vs. Charniak’s Parsing LM Modeling in Turner&Charniak (2007) Kilian Evang n-gram LMs Charniak’s LM Language Models N-gram LMs sequence represents sentence parse tree Charniak’s LM p(sent) = p(seq) p(seq) Determiner Selection seq Method events are words, pre-terminals, Results Reasons for Success end symbols terminals, References constituents, end symbols conditioned on the n − 1 certain previous previous events, depending events on type
  • 7. Language A Parse Tree – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Parse tree vp Language Models N-gram LMs Charniak’s LM Determiner np pp Selection Method Results Reasons for Success np References verb det noun prep det noun put the ball in the box
  • 8. Language A Parse Tree – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Parse tree vp Language Models N-gram LMs Charniak’s LM Determiner np pp Selection Method Results Reasons for Success np References verb det noun prep det noun put the ball in the box Event sequence verb, put, M, ∆, M, np, pp, ∆, noun, ball, M, det, ∆, M, ∆, the, prep, in, M, ∆, M, np, ∆, noun, box, M, det, ∆, M, ∆, the
  • 9. Language Digression: Non-head Constituents Modeling in Turner&Charniak (2007) Kilian Evang Language Models Tree fragment N-gram LMs Charniak’s LM l Determiner Selection Method Results Lm ... L1 t R1 ... Rn Reasons for Success References h Event sequence fragment M, L1 , . . ., Lm , ∆, M, R1 , . . ., Rn , ∆
  • 10. Language A Parse Tree – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Parse tree Language Models vp N-gram LMs Charniak’s LM Determiner np pp Selection Method Results Reasons for Success np References verb det noun prep det noun put the ball in the box Conditional probability for a head pre-terminal p(t = noun|l = np, m = vp, u = verb, i = put)
  • 11. Language A Parse Tree – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Parse tree Language Models vp N-gram LMs Charniak’s LM Determiner np pp Selection Method Results Reasons for Success np References verb det noun prep det noun put the ball in the box Conditional probability for a head terminal p(h = ball|t = noun, l = np, m = vp, u = verb, i = put)
  • 12. Language A Parse Tree – a Sequence of Events Modeling in Turner&Charniak (2007) Parse tree Kilian Evang vp Language Models N-gram LMs Charniak’s LM Determiner np pp Selection Method Results Reasons for Success np References verb det noun prep det noun put the ball in the box Conditional probability for a non-head constituent p(Li = det|Li −1 = M, h = ball, t = noun, l = np, m = vp, u = verb)
  • 13. Language Overview: Conditioning Modeling in Turner&Charniak (2007) Kilian Evang event type conditioned on Language Models head pre-terminal t constituent label l, N-gram LMs mother constituent label m, Charniak’s LM mother constituent head pre-terminal u Determiner Selection mother consitutent head terminal i Method head terminal h head pre-terminal t, Results Reasons for Success constituent label l, References mother constituent label m, mother constituent head pre-terminal u mother consitutent head terminal i non-head (part of) L1...i −1 (L1...m , R1...i −1 ), constituent label Li (Ri ), head terminal h, end symbol ∆ head pre-terminal t, constituent label l, mother constituent label m, mother constituent head pre-terminal u
  • 14. Language Determiner Selection Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Charniak’s LM Determiner ◮ for each NP, Selection Method ◮ for each possible determiner (the, a/an, null), Results Reasons for Success ◮ determine probability of NP References ◮ choose determiner resulting in highest probability ◮ note: sufficient to determine probabilities for events that differ
  • 15. Language Determiner Selection – Example Modeling in Turner&Charniak (2007) Kilian Evang ◮ “put [NP the ball] in the box” Language Models ◮ p(L1 = det|m = vp, u = verb, l = np, t = noun, h = N-gram LMs Charniak’s LM ball) × p(L2 = ∆|m = vp, u = verb, l = np, t = Determiner noun, h = ball, L1 = det) × p(det → the|m = vp, u = Selection verb, l = np, t = noun, h = ball, L1 = det) Method Results Reasons for Success ◮ “put [NP a/an ball] in the box” References ◮ p(L1 = det|m = vp, u = verb, l = np, t = noun, h = ball) × p(L2 = ∆|m = vp, u = verb, l = np, t = noun, h = ball, L1 = det) × (p(det → a|m = vp, u = verb, l = np, t = noun, h = ball, L1 = det) + p(det → an|m = vp, u = verb, l = np, t = noun, h = ball, L1 = det)) ◮ “put [NP ball] in the box” ◮ p(L1 = ∆|m = vp, u = verb, l = np, t = noun, h = ball)
  • 16. Language Results Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Charniak’s LM Determiner Selection Method Results Reasons for Success References
  • 17. Language Reasons for Success Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Charniak’s LM Determiner ◮ syntactic structure allows for long-distance conditioning, Selection e.g. Method Results Reasons for Success ◮ he [VP gave [NP the sultan of Brunei] [NP a cactus]] References ◮ constituent head enforces selectional preferences, reflected in head-first strategy ◮ ...
  • 18. Language References Modeling in Turner&Charniak (2007) Kilian Evang Language Models Eugene Charniak (2000) N-gram LMs A Maximum-Entropy-Inspired Parser Charniak’s LM Determiner Proceedings of the First Meeting of the North American Selection Chapter of the Association for Computational Linguistics Method Results Reasons for Success Eugene Charniak (2001) References Immediate-Head Parsing for Language Models Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics Jenine Turner & Eugene Charniak (2007) Language Modeling for Determiner Selection Proceedings of NAACL HLT 2007, Companion Volume