1. An Interpretation-Driven Model of Syntax Richard Caneba canebr@rpi.edu RPI Cognitive Science Department Human-Level Intelligence Laboratory 5/2/2011
4. Introduction We start with a goal: develop a system that can understand natural language. There are (roughly) three sub-goals: syntactic parsing, semantic representation, and pragmatics/discourse. Stage 1 is syntax. Why is syntax important for natural language understanding?
14. Introduction: Syntax Different syntactic interpretations yield very distinct semantic interpretations, and help identify the role each word plays in an utterance.
18. Syntax Current grammar formalisms have a number of shortcomings from the perspective of developing a system that understands natural language. We focus on generative grammar (i.e. Chomskyan, HPSG [11], [13]).
19. Syntax: Theory & Implementation Two classes of shortcomings. Theoretical: a shortcoming in the way a theory of language represents the user's mental knowledge of that language. Implementation: a shortcoming in the way a theory implies or represents language processing, in terms of computability and/or cognitive realism.
20. Syntax: Theory & Implementation We will show that these two classes are so closely related that it does not make sense to draw a strong distinction between them.
29. Nested parse of “the man in boston with the hat is here.” The binary-branching analysis attaches the PP “with the hat” inside the PP “in boston”: [S [NP [DP the] man [PP in [NP boston [PP with [NP [DP the] hat]]]]] [VP is [PP here]]]
31. Flat parse of “the man in boston with the hat is here.” Both PPs attach directly to the NP headed by “man” (4-ary branching): [S [NP [DP the] man [PP in [NP boston]] [PP with [NP [DP the] hat]]] [VP is [PP here]]]
33. Implementation Shortcomings A shortcoming in the way a theory implies or represents language processing, in terms of computability and/or cognitive realism.
53. Syntax: Statistical Approach Most currently successful parsing algorithms rely heavily on statistics. However, inferences that require a notion of semantics are difficult for them.
61. Other Notable Work Other cognitive-architecture-based parsers: [8] R. Lewis et al. developed a parser based on “immediate reasoning” in Soar. [9] R. Lewis et al. developed an activation-based parser model in ACT-R. [1][2][3] J. T. Ball et al. developed a parser based on the Double R Grammar model, for “synthetic teammate” development in ACT-R.
64. Other Notable Work However, each of these theories suffers from the shortcomings we’ve already seen. Both parsers designed by Lewis rely on a CFG formalism. Ball’s ACT-R parser is not deeply integrated with reasoning. These approaches are not well integrated with reasoning overall.
66. Motivating Principles To address these shortcomings from an interpretative perspective, four principles are motivated: a) the existence of satellite structures, b) feature structure unification, c) feature structure aggregation, d) incrementality.
67. Satellite Structures When we hear a word, we infer the existence of other structures related to that word.
68. Satellite Structures When we hear a word, we infer the existence of other structures related to that word. E.g. hearing “bit” posits two satellite NPs: [NP Subj] ← bit → [NP Obj].
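As a sketch of this idea (the lexicon format and function names below are hypothetical Python illustrations, not the talk’s actual Polyscheme encoding), a lexical entry can record the satellite structures a word projects, so that hearing the word immediately posits them:

```python
# Hypothetical lexicon: each verb projects "satellite" structures
# whose existence is inferred as soon as the word is heard.
SATELLITES = {
    "bit": [{"cat": "NP", "role": "Subj", "pos": "before"},
            {"cat": "NP", "role": "Obj",  "pos": "after"}],
}

def posit_satellites(word):
    """Return the structures inferred to exist when `word` is heard."""
    return [dict(s, head=word) for s in SATELLITES.get(word, [])]

for s in posit_satellites("bit"):
    print(s["role"], s["cat"])   # Subj NP / Obj NP
```

Hearing “bit” thus yields two as-yet-unfilled NP structures, one expected before the verb and one after.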
70. Feature Structure Unification With the existence of satellite structures, we can unify observed structures together.
71. Feature Structure Unification With the existence of satellite structures, we can unify observed structures together. For “john bit fido”: [NP john] unifies with the Subj satellite of “bit”, and [NP fido] with its Obj satellite.
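A minimal sketch of this unification step, under the assumption that plain Python dicts stand in for feature structures and None marks an underspecified value (illustrative only, not the Polyscheme implementation):

```python
def unify(a, b):
    """Merge two feature structures; None (underspecified) unifies
    with anything. Returns None on a feature clash."""
    out = dict(a)
    for k, v in b.items():
        if v is None:
            out.setdefault(k, None)       # keep slot underspecified
        elif k in out and out[k] is not None and out[k] != v:
            return None                   # clash: unification fails
        else:
            out[k] = v                    # fill in or confirm the value
    return out

# Satellites posited on hearing "bit" (head not yet specified):
subj_sat = {"cat": "NP", "role": "Subj", "head": None}
obj_sat  = {"cat": "NP", "role": "Obj",  "head": None}

# Observed NPs before and after the verb in "john bit fido":
np1 = {"cat": "NP", "head": "john"}
np2 = {"cat": "NP", "head": "fido"}

print(unify(subj_sat, np1))   # {'cat': 'NP', 'role': 'Subj', 'head': 'john'}
print(unify(obj_sat, np2))    # {'cat': 'NP', 'role': 'Obj', 'head': 'fido'}
```

The observed NPs do not create new structure; they fill in structures already inferred to exist.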
83. Architectural Implementation A parser prototype was implemented in the Polyscheme cognitive architecture. We use the model to process and interpret natural language input.
99. Comparative Results We compare against the previous incarnation of the syntactic parser in Polyscheme, which is loyal to the CFG formalism and relies very heavily on search.
100. Comparative Results The new model has significant benefits: it is orders of magnitude faster (seconds vs. tens of minutes) and covers a wider range of sentences.
101. Conclusions We introduced a syntactic theory based on the principles of satellite structure positing, feature structure unification, feature structure aggregation, and incrementality.
102. Conclusions We have a working implementation in a cognitive architecture that is structurally efficient, computationally fast, and cognitively plausible.
103. Contribution We have presented a new grammatical formalism, implemented in a cognitive architecture, integrated with reasoning capabilities, and both computationally efficient and cognitively plausible. This will lead towards a system that can understand natural language.
104. Future Directions Many linguistic phenomena were not mentioned here (not necessarily syntactic ones). Developing a new lexical representation theory to support interpretive grammar. Integration with notions of pragmatics/discourse. Integrating the theory into working applications.
106. Special thanks to (in no particular order): Dr. Nick Cassimatis, Perrin Bignoli, JR Scally, Soledad Vedovato, John Borland, Hiroyuki Uchida.
107. References
[1] Ball, J. T. (2004). A Cognitively Plausible Model of Language Comprehension. Proceedings of the 13th Conference on Behavior Representation in Modeling and Simulation.
[2] Ball, J., Rodgers, S., & Gluck, K. (2001). Integrating ACT-R and Cyc in a large-scale model of language comprehension for use in intelligent agents. Artificial Intelligence.
[3] Ball, J., Heiberg, A., & Silber, R. (2007). Toward a large-scale model of language comprehension in ACT-R 6. In R. L. Lewis, T. A. Polk, & J. E. Laird (Eds.), Proceedings of the 8th International Conference on Cognitive Modeling (pp. 163-168).
[4] Ball, J. T., Heiberg, A., & Silber, R. (2005). Toward a large-scale model of language comprehension in ACT-R 6: construction-driven language processing.
[5] Ball, J. T. (2004). A Cognitively Plausible Model of Language Comprehension. Proceedings of the 13th Conference on Behavior Representation in Modeling and Simulation.
[6] Culicover, P. W., & Jackendoff, R. (2006). The simpler syntax hypothesis. Trends in Cognitive Sciences, 10(9), 413-418. doi: 10.1016/j.tics.2006.07.007.
[7] Lewis, R. L. (1993). An architecturally-based theory of human sentence comprehension. Proceedings of the Fifteenth Annual Conference of the Cognitive Science Society (p. 108). Lawrence Erlbaum.
[8] Lewis, R. L., Newell, A., & Polk, T. A. (1989). Toward a Soar theory of taking instructions for immediate reasoning. Proceedings of the Eleventh Annual Conference of the Cognitive Science Society (pp. 514-521). Erlbaum.
[9] Lewis, R. L., & Vasishth, S. (2005). An activation-based model of sentence processing as skilled memory retrieval. Cognitive Science, 29(3), 375-419. doi: 10.1207/s15516709cog0000_25.
[10] Nivre, J. (2005). Dependency grammar and dependency parsing. MSI report 5133.
[11] Pollard, C., & Sag, I. (1994). Head-Driven Phrase Structure Grammar. Studies in Contemporary Linguistics. University of Chicago Press.
[12] Pulman, S. G. (1991). Basic Parsing Techniques: an introductory survey.
[13] Sag, I. A., Wasow, T., & Bender, E. (2003). Syntactic Theory: A Formal Introduction (second edition). CSLI Publications.
Editor's Notes
Consider this sentence “the dog bit the man”
Our interpretation of the described event is like this.
Now consider a rearrangement of the words: “The man bit the dog.”
We get a completely different interpretation.
Now it may seem as though word order is all there is to syntax. However, consider this example: “I hit the man with my car.” In particular, we can focus on the prepositional phrase at the end, “with my car.”
We have the first reading here, where the preposition modifies the VP headed by “hit”. In this instance, we get the interpretation “I hit the man WHILE DRIVING my car.” However, a second reading exists…
If “with my car” modifies the NP “the man,” we get the interpretation “I hit the man WHO HAD my car.” Thus, for these two reasons, syntactic interpretation is a lot more than simply word order. On top of that, it matters to us as human beings: the syntactic interpretation of “I hit the man with my car” has very important legal ramifications.
Moving towards a theory of grammar that is useful for interpreting natural language, we realized that current grammar formalisms have a number of shortcomings.
We will show that, in fact, theoretical and implementation considerations are so closely related that it doesn’t make sense to make a strong distinction between them. That is to say, we can deal with many important shortcomings both theoretically and implementation-wise in a fairly elegant manner, using a unified framework that addresses both.
Generative grammar is not concerned with interpretability; it is concerned with over-generation. Interpretability extends beyond simply grammatical generation.
Generative grammar formalisms do not give us an account of how we can interpret sentences. Generative grammar, in this case, only goes so far as saying that this sentence is ungrammatical, but has no concern for the fact that it is still interpretable.
We use our knowledge of grammaticality to interpret an utterance, even if it is ungrammatical. So there is a similarity between “Fido bit dog” and “Fido bit the dog”: our knowledge of grammar allows us to get this meaning. Thus, traditional grammar formalisms are not concerned with how grammatical knowledge is actually used, but only with the nature of grammatical knowledge itself.
Many grammar formalisms require binary branching in order to preserve the recursive aspect of their rule applications. However, this leads to spurious parse trees that are an artifact of the phrasal nesting; in many cases it is more accurate to deliver a flat parse tree. Consider the sentence “the man in boston with the hat is here,” and in particular the prepositional phrasal nesting that occurs here. The nested parse seems to imply that “with the hat” modifies the NP headed by “boston,” as if the city of Boston has a hat. In fact, it is “the man” who has the hat. A more correct parse is the flat one: notice the 4-ary branching at the NP. This is more realistic and closer to what is actually intended by the utterance. Hence, syntactic interpretation is more than just a question of word order; it is highly integrated with the reasoning that yields this interpretation rather than the previous one.
Many theories of grammar are not concerned with the dynamic nature of syntactic parsing, or with how syntax is generated on-line and incrementally in a left-to-right fashion (e.g. complement selection before specifier selection). You can see in this case that, before we could identify the subject of the sentence (“john”), we had to identify the object (“mary”). This misses the crucial intuition that, when we hear “john saw…”, we have already identified “john” as the subject of the sentence.
Many theories of grammar posit an unnecessary amount of structure in order to preserve the recursiveness of their rule applications. Consider “the tall strong angry man.” As the parse develops, we see the needless addition of NP constituency structures. Ideally we would obviate this unnecessary structure, as it is computationally inefficient and increases the load on working memory.
Another problem is that a number of nodes here are potentially open for modificational attachment. In particular, consider the possibility of a following prepositional phrase, like “in boston.” Almost immediately there is a high degree of attachment ambiguity, for what should be an unproblematic parse.
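One standard way to quantify this: under strictly binary branching, the number of distinct bracketings available for a head followed by n attachable modifiers grows as the Catalan numbers. A quick illustrative computation (not from the talk itself):

```python
from math import comb

def catalan(n):
    """Number of distinct binary trees over n+1 leaves."""
    return comb(2 * n, n) // (n + 1)

# Binary bracketings available as modifiers accumulate:
for n in range(1, 6):
    print(n, catalan(n))   # 1, 2, 5, 14, 42
```

Even a handful of trailing modifiers opens dozens of candidate attachment structures, which is exactly the explosion a flatter representation avoids.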
Thus, if you start with the goal of natural language “understanding,” in any deep sense of the word, statistics falters where representation beyond the words themselves is required. In this case, there has to be a notion of what we mean by “the couple” in order to draw the link between the couple and the two pronouns in the second sentence.
“joe has put those raw potatoes in the pot.”
The way a theory is represented has very real consequences for computability. Simply put, generating a parse for what could potentially be a simple procedure (which is to some extent empirically supported by the fact that we do not struggle heavily to interpret these sentences) becomes a very expensive procedure, perhaps prohibitively so.
However, each of these theories suffers from the shortcomings we’ve already seen. Both of Lewis' parsers rely on a context-free grammar formalism, and so are subject to those limitations. Ball's parser is non-monotonic and does not backtrack; ours is monotonic and uses backtracking, which corresponds more strongly to human sentence comprehension. We disagree on this particular theoretical point: we believe that human cognition utilizes monotonicity and backtracking, while they implicitly assume it does not. Finally, these approaches are not well integrated with reasoning, which can be seen as closely related to the shortcomings we identified for CFGs.
I know that there are satellite structures for the verb “bit”: it takes an NP Subject and an NP Object.
Suppose we observe the existence of two NP’s, one that precedes the verb, and one that follows the verb.
Thus, syntactic interpretation can be thought of as the UNIFICATION of feature structures that already exist. This kind of unification is driven by two ideas: that words have satellite structures (in this case, “bit” has a subject NP and an object NP), and that many words and phrases are also parts of other satellite centers (in this case, “fido” and “john” are taken by “bit”, treating it as a satellite center). With these ideas, we can unify these structures. So far, then, we have observed that we do not NEED to posit additional structures during a parse, so long as we know they exist in some form. With this knowledge, unification of feature structures allows us to avoid the excessive structure positing that occurs in many CFG formalisms.
Sequential unification of multiple feature structures yields an “aggregating” feature structure, which builds up its internal representation through aggregation across multiple posited structures.
Computationally, once again, we avoid positing excessive structure through the sequential application of unification across multiple feature structures. Thus, reasoned unification applied in sequence generates “aggregate” structures composed of the internal features of a number of different structures.
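The sequential character of aggregation can be sketched as a fold: each new word contributes a partial feature structure, and unifying them in order builds up one aggregate. This is a simplified Python illustration (the merge ignores clashes for brevity; it is not the Polyscheme mechanism):

```python
from functools import reduce

def unify(a, b):
    """Merge feature structures; later non-None values fill in gaps."""
    out = dict(a)
    for k, v in b.items():
        if v is not None:
            out[k] = v
    return out

# Each incoming word contributes a partial structure; folding unify
# over them yields the single aggregating NP.
increments = [
    {"cat": "NP", "det": "the"},
    {"mod": ["tall"]},
    {"head": "man"},
]
aggregate = reduce(unify, increments, {})
print(aggregate)   # {'cat': 'NP', 'det': 'the', 'mod': ['tall'], 'head': 'man'}
```

No intermediate NP nodes are created: the same structure simply accumulates internal features as the parse proceeds.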
It is fairly clear that humans process sentences incrementally, (mostly) one word at a time, as a sequential process.
Using those points as motivations, we’ve developed a parser prototype that has been implemented in the Polyscheme cognitive architecture.
The intention of the model is to process and interpret natural language input.
Pair-wise pattern matching: we have a native set of pair-wise matching constraints that generate a parse incrementally, from left to right. For instance, if a Determiner appears before a Noun, a rule can unify the structures and aggregate the NP. Pair-wise pattern matching is very fast, as it limits the details relevant to a decision point to a very small number of identifiable features of the current state of the parse.
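A single pair-wise constraint might look like this in sketch form (hypothetical rule format, not the actual Polyscheme constraint syntax):

```python
# One hypothetical pair-wise constraint: a Determiner immediately
# before a Noun triggers unification into an aggregated NP.
def det_noun_rule(left, right):
    if left["cat"] == "Det" and right["cat"] == "N":
        return {"cat": "NP", "det": left["word"], "head": right["word"]}
    return None   # rule does not apply to this pair

np = det_noun_rule({"cat": "Det", "word": "the"},
                   {"cat": "N",   "word": "man"})
print(np)   # {'cat': 'NP', 'det': 'the', 'head': 'man'}
```

Because each rule inspects only an adjacent pair, the decision at each step is cheap regardless of how long the sentence is.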
As you can see, the unification mechanism is actually very simple from this high-level view. The backbone of feature structure unification is the Same() atom, which simply expresses that two objects are the same, and thus inherit each other's attributes, similar to Leibniz's law.
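One plausible way to realize the Same() idea is union-find with merged attribute tables, so that objects asserted to be the same come to share a single attribute set. This is an illustrative sketch under that assumption; Polyscheme's actual mechanism may differ:

```python
# Sketch of Same(): asserting Same(x, y) makes the two objects
# share attributes (union-find with merged attribute dictionaries).
parent, attrs = {}, {}

def find(x):
    """Return the canonical representative of x's equivalence class."""
    parent.setdefault(x, x)
    while parent[x] != x:
        parent[x] = parent[parent[x]]   # path compression
        x = parent[x]
    return x

def same(x, y):
    """Assert Same(x, y): merge classes and pool their attributes."""
    rx, ry = find(x), find(y)
    if rx != ry:
        attrs[rx] = {**attrs.get(rx, {}), **attrs.get(ry, {})}
        parent[ry] = rx

attrs["np1"] = {"cat": "NP", "role": "Subj"}
attrs["x2"]  = {"head": "man"}
same("np1", "x2")
print(attrs[find("np1")])   # {'cat': 'NP', 'role': 'Subj', 'head': 'man'}
```

After the Same() assertion, querying either object through its representative yields the pooled attributes, which is the inheritance behavior the slide describes.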
Here is just a small subset of the pair-wise pattern matching rules we can use. Focus on the two ways of attaching the complements; I'm still on the fence about which one I prefer, but I'm leaning towards the first.
At time step 1, we hear “the”.
At this point, we've heard two words, “the” and “tall”, so we've incrementally added the next word in the utterance. We have an underspecified node at the top of the phrasal chain of “tall”, and we can unify these structures with a rule. What has happened is that the NP has been aggregated to the right, by positing that the NP inferred to exist at time step 1 is the SAME as the underspecified node inferred to exist at time step 2. What this amounts to is that we have generated an aggregate grouping that, at this point, contains the first NP and the next XP.
Now add the next word: “man”. Once again, we apply a rule to incrementally grow the parse, incorporating “man” into the growing syntactic parse tree. We have once again aggregated the NP structure, inheriting more information that defines in greater detail the internal structure of the aggregating NP. Thus, we have shown a method of parsing that grows a parse incrementally, left to right, and that does not posit excessive structure, through the notions of unification and aggregation. We have also identified the satellite structure that satisfies the obligatory specifier argument in the internal structure of “man”.
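The whole incremental walk-through can be condensed into a sketch that consumes the words one at a time, growing a single aggregate NP rather than positing a new node per word (hypothetical lexicon and category labels, not the talk's actual rule set):

```python
# Incremental sketch: consume an NP one word at a time,
# aggregating one structure instead of nesting a node per word.
LEX = {
    "the":    {"cat": "Det"},
    "tall":   {"cat": "Adj"},
    "strong": {"cat": "Adj"},
    "angry":  {"cat": "Adj"},
    "man":    {"cat": "N"},
}

def parse_np(words):
    np = {"cat": "NP", "det": None, "mods": [], "head": None}
    for w in words:                      # strict left-to-right order
        c = LEX[w]["cat"]
        if c == "Det":
            np["det"] = w                # specifier unified in
        elif c == "Adj":
            np["mods"].append(w)         # modifier aggregated
        elif c == "N":
            np["head"] = w               # head fills the satellite slot
    return np

print(parse_np(["the", "tall", "man"]))
```

Note that “the tall strong angry man” aggregates into the same single NP, with three modifiers, rather than the nested NP chain a binary-branching grammar would build.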
In a sense, it is also more cognitively plausible, in that it reflects two key facts about human performance. Human sentence comprehension occurs in finite time: we cannot take tens of minutes to converge on an interpretation, so quicker sentence processing is closer to human-level performance. Wider coverage of sentence phenomena also moves us closer to human-level intelligence: the more sentences we can cover, the more of human performance we have captured. So high performance is, to a large degree, a move towards cognitive plausibility.