SlideShare uma empresa Scribd logo
1 de 37
Baixar para ler offline
+




    A Bridge not too far

    Valeria de Paiva
    Cuil, Inc.
+
              A Bridge?




Translation
+
    Why do we want this bridge?

        Want our computer to understand us

        Want to to ask questions
             large-scale intelligent information access

        Want to search for content on the web
             exploiting the large amounts of textual
              data on the web

        Want automatic, high quality translation
         into other languages

        Etc…
+
    A bridge between language and logic
    Wish List:

        translation compositional and principled,

        meaning preserving, at least truth value preserving…

        a reasonable fragment of all language

        generic texts

        “logical forms” obtained are useful for reasoning.

    Questions:

        which kind of logic on the target?

        how do we know when we’re done?

        how do we measure quality of results?
+
    Dead simple?
        Isn't this what every
         undergraduate logic text deals
         with in its first chapter?...

        It was Russell's belief that by
         using the new logic of his day,
         philosophers would be able to
         exhibit the underlying “logical
         form" of natural language
         statements. SEP

        Long before that, it was Leibniz’s
         idea Calculemus

        …problems with ‘the present
         king of France’ and his hair.
         “On Denoting”, 1905
+
      Not so simple?
     A natural reaction:

     “Natural language, states Carnap, is
      misleading because it is not sharply
      defined and because it is not
      systematic enough. The syntactic
      rules governing natural language
      sentences do not enable us to
      distinguish between assertions and
      pseudo-assertions.”

     Gamut on R. Carnap’s “The
      elimination of metaphysics through
      logical analysis of language”, 1932.

     Carnap suggested: Instead do
      artificial languages.

     And logicians did: philosophical and
      mathematical logicians studied
      artificial languages
+
    Is it too hard?
        “The skepticism in logical circles
         about the possibility of describing
         natural language with logical
         methods is similar to that in logical
         positivism.”

        Tarski thought that applying the
         semantics he developed for logical
         languages to natural languages
         would be problematic[…] any such
         semantics assumes a precisely
         formulated syntax,[..] out of the
         question that such could be found
         for natural languages.

        “The conviction that there is no
         exact logic of natural language,
         and that language therefore does
         not lend itself to an analysis in
         terms of precise logical notions
         and rules, is common to just about
         all analytic philosophers, even
         those that were most interested in
         language”           Gamuth, 1990
+
    Not for every one...
        Davidson “Recent work by Chomsky and
         others is doing much to bring the
         complexities of natural language within the
         scope of serious semantic theory.” 1967

        Montague “I reject the contention that an
         important theoretical difference exists
         between formal and natural languages” -
         English as a formal language, 1970

        Both a translation from English to a formal
         system and a model theory for the system

        Since then a body of work on Formal
         Semantics of natural language
         concentrating on model theory

         fragments, pen and paper calculations,
         interesting phenomena. ..But also lots of
         computer systems, eg GUS
+



HANG ON A MINUTE…
30-odd years of solid work on NL semantics doesn’t solve my BRIDGE
problem…

Need to deal with huge amounts of text

Need to be robust

Need to move easily between genres, subject domains
+
    The problem:
    between a rock and a hard place
         Knowledge-based                    Shallow, open-domain
          representations of                  representations
          meaning                            Broad-coverage
         Deep/logical representations       Fails “gracefully”
          allow high precision and           Lower precision and recall
          recall, but
                                             Hard to compare systems
         Typically on restricted
          domains                            Too sensitive to form of
                                              sentences
         Hard for users to read/
          interpret output of the
          system
         Very hard for system to
          build up knowledge
+
    Which hard place?

        Whether you’re fond of symbolic         PASCAL (an EU-funded network
                                                  of excellence), decided to
         methods or of machine learning,          organize a series of challenges
         the following are contradictory:         benchmarking the task of
                                                  Recognizing Textural Entailment
        No civilians were killed in the          (RTE) in 2004. This has been
                                                  continued by TAC (Text analysis
         Najaf suicide bombing.                   conference) supported by NIST.

         Two civilians died in the Najaf         RTE Challenges promoted
          suicide bombing.                        textual entailment recognition as
                                                  a generic task capturing
                                                  semantic inference needs across
        Detecting whether one piece of           applications such as QA, Info
         text entails another or                  Retrieval, Info Extraction and
                                                  doc summarization,
         contradicts it is a necessary
         condition for text understanding.       RTE-5 last year.
+
    What makes language hard?
     Not only semanticists seeking
       unicorns or non-existing kings…

     “Negotiations prevented a strike”

         Claims that no strike came into
          being, but true and false assertions
          can be made about the non-
          existent strike.

         Bad idea:

            x y Strike(x) Negotiations (y)
          prevented(x,y)

         Could do existence predicates, but
          entailments don’t work.

         Instead do concepts see
          “Preventing Existence”, Crouch et
          al, FOIS2001
+
    What is Hard?
        Many problems, besides

            (non-)existence

        intensionality

        ambiguity

        reference resolution,

        conditionals,

        presuppositions, etc…

        See “Entailment, Intensionality
         and Text Understanding",
         Condoravdi et al, 2003
+
    Pervasive Intensionality

        Intensionality is widespread in        Anecdotal evidence at least 453
         natural language
                                                 sentences out of 1586 on 100 tips
        Raises detection issues that            photocopier repair will have a
         cannot be brushed aside:                prop complement
          H: The US sought the release of       Clearly it would be desirable to
          hostages.
                                                 cope with intensionality.
          T: Hostages were released.
                                                Proposal underspecified
          HT?                                   CONTEXTS
          People know that propositional        Feasible?...
          attitudes (knowing, believing,
          seeking, etc..) introduce
          intensionality. Can we avoid
          them?
+
    Scalability of Symbolic Methods
        From “Scalability of redundancy
         detection in Focused Document
         collections”, Crouch et al, 2002
                                                      (More problematic is the series
     “Symbolic parsing scales well, both in             of canonicalizations of
      coverage and in dealing with the                  meaning necessary to take
      proliferation of ambiguity that arises            language-based
      out of broad coverage.” This is the XLE           representations to tractable
      system, based on LFG grammar.                     conceptual representations.}

        Scalability/feasibility proven by doing
         it. It’s relative to what others are doing
         and how fast. Claim above based on
         Riezler et al “Parsing the Wall Street
         Journal using a lexical functional
         grammar and discriminative estimation
         techniques”, 2002. Parsing
         performance comparable with Collins
+
    Scalability of Symbolic Methods?
        Measuring different parsers is not easy.

        But has been done for quite a while.

        Translating between outputs by
         different parsers is hard, lots of scope
         for messing the results.

        But there are gold standards, e.g. Penn
         TreeBank

        Semantic representations are harder to
         compare. No gold standard, no vast
         collection of annotated data. Not even a
         consensus on what annotations should
         look like

        Still the case in 2010…
+
    Is it too hard?

        By late 2005 I was ready to throw in the towel.
        We had a beautiful layered architecture
        Processing sentences was fast and high-quality

        But big bottleneck when mapping to KR.
        Then TWO big changes:

        Instead of the ideal golden standard, concentrate on how
         representations relate to each other, entailment and
         contradiction detection ECD

        Instead of KR concepts from Cyc, fluid, wordy concepts from
         WordNet/VerbNet
+

    We can do
       it!


                How much of it?
                How do we improve?
                Architectures for Text Understanding
CycL
      FST
                           UL

    •  Tok
                           XL         Se    Tran   AK
Pre •  Mor                      LFG
                           E          m     sfer    R
       ph,..
                                                        ECD
                                      WN/
                                      VN


+
    Kaplan, Bobrow et al


    PARC Pipeline
+              PARC Layered Architecture
                XLE/LFG P
                         arsing KR M
                                                             Inference
    Text                             apping                   Engine
                       F-structure       Transfer
                                       semantics
                                                      AKR
     Sources
                                                                   Assertions



    Question                                                       Query




                                     Unified Lexicon
               LFG                   Term rewriting         ECD
               XLE                   KR mapping rules       Textual Inference
               MaxEnt models         Factives,Deverbals     logics



Basic idea: canonicalization of meanings
+ Key Process: Canonicalization of
  representations
      Sentences are parsed into f(unctional)-structures
       using XLE
      F-structures are (somewhat) semantic
       representations
      Transform f-structures into (flat and contexted)
       transfer semantic structures
       (inspired by Glue and need to ‘pack’ semantics )
      Transform transfer sem-structures into (flat and
       contexted) AKR structures

      what do these layers of representation buy you?
+Language/KR misalignments:

   Language
    Generalizations   come from the structure of the language
    Representations   compositionally derived from sentence
    structure

   Knowledge    representation
    Generalizations come from the structure of the world
    Representations to support reasoning
    Maintain multiple interpretations


   Layered   bridge helps with the different constraints
+ Semantics and AKR?
  Conceptual         structure
   concepts and roles as in a Description Logic
   concepts as WordNet synsets
   deverbal normalization
          example: destruction of the building -> destroy the building

  Temporal         structure (extracted from linguistic analysis)
     relate time periods, events, states (a hook)

  Contextual        structure
   for negation, propositional attitudes, modals, …
   context-lifting rules



  Packed        ambiguity encoding
+                                           PRED fire<Ed, boy>
                                            TENSE past
 Abstract KR: “Ed fired the boy.”           SUBJ [ PRED Ed ]

(Cyc version)                               OBJ
                                                    PRED boy
                                                    DEF +



    (subconcept Ed3 Person)
    (subconcept boy2 MaleChild)
    (subconcept fire_ev1 DischargeWithPrejudice)          Conceptual
    (role fire_ev1 performedBy Ed3)
    (role fire_ev1 objectActedOn boy2)

    (context t)
    (instantiable Ed3 t)
    (instantiable boy2 t)                                 Contextual
    (instantiable fire_ev1 t)

    (temporalRel startsAfterEndingOf Now
     fire_ev1)                                             Temporal


     Textual Inference Logic, 2005
+ Abstract KR: “Ed fired the boy.”
  (WN/ VN version)
     cf(1, context(t)),
    cf(1, instantiable(‘Ed0',t)),
    cf(1, instantiable('boy3',t)),
    cf(1, instantiable('fire1',t)),
    cf(1, role('Agent','fire1',‘Ed0')),
    cf(1, role('Theme','fire1','boy3')),
    cf(1,subconcept(‘Ed0',[[7626,4576]])),
    cf(1,subconcept('boy3',[[10131706],[9725282],[10464570],
     [9500236] ),),
    cf(A1, subconcept('fire1',[[1124984],[1123061],[1123474]])),
    cf(A2, subconcept('fire1',[[2379472]])),
    cf(1, temporalRel(startsAfterEndingOf,'Now','fire1'))



  Textual Inference Logic: Take Two, 2007
+
    Are we there yet?

        NO! Challenges abound.

        Make translation better (“Deverbal Nouns in Knowledge
         Representation”, “Context Inducing nouns”, 2008)

        Named Entities, measures phrases, gazetteers

        Deal with anaphora and discourse (DRS?)

        Improve sense clustering

        Treatment of copula
+                                    At least 8 systems doing similar work:
                                               Boing BLUE
                                               BOXER
                                               Ontosem
                                               Open Knowledge, Schubert
                                               Trips, Allan
                                               Getaruns
                                               TextCap
A generic                                      LXGram
Architecture?
Semantics in Text Processing, 2008

Eds. Johan Bos, Rodolfo Demonte
+

                                                            Semantics
                                     •  LFG                             •  AKR
                                     •  CCG                             •  Episodic Log
A Generic                            •  HPSG
                                                     •  Transfer
                                                     •  MRS             •  Triples
                                     •  …                               •  …
Architecture
                                                     •  DRS
                                                     •  …                    Knowledge
                                           Grammar
 All require a host of pre & post-                                          Representation
processing: text segmenters, POS
taggers, Lexica, Named Entity
Recognizers, Gazetteers, Temporal
Modules, Coreference Resolution,
WSD, etc
Theorem Prover


          • Taggers             • Parser            • DRT strux
PreProc                   CCG               Boxer
          • NER, …              • Grammar           • De Sandt




                                                                    Nutcracker




+
    C&C Tools
    Clark, Curran & Bos
RTE Stanford
          •  NE
          •  POS       Stanford
preproc      tagger               GR output   ???
                       parser
          •  Chinese
             seg



                                                    Nat Log




+
    Stanford NLP
    Manning et al
•  NE
preprocessing
                •  POS                parsing
                                                •  Stanford
                   tagging,
                   etc




+
    GATE, EDITS
    Cunningham et al, Nagnani et al
+                            The future seems easier if it’s Open
                              Source (see Ann Copestake’s page)

                             And collaborative (that too!)

                             Translation and comparison of results is
                              necessary

                             Many more lexical resources need to
                              be created and shared
How do we got
                              Machine learning of semantics/kr is
about it?                 
                              required
Totally unbaked ideas…
                             Logics, building up from ECD, using
                              probabilistic component need to be in
                              place

                             Looking on the bright side… LOTS of
                              FUN WORK!
+                   Borrowing from Ann Copestake Slacker
                     Semantics:

                    High-throughput parser with semantic
                     output

                    Effective statistical technique for
                     syntactic parse ranking

                    No underlying knowledge base for
                     disambigution
Elaborating a
little…             Underspecification is good

                    Support inter-sentential anaphora, text
                     structure

                    Robust inference and semantic pattern
                     matching

                    (pay attention to ease of use, notations
                     that help..)
+
    Is it too hard?
    “Our language shows a tiresome
     bias in its treatment of time.
     Relations of date are exalted
     grammatically as relations of
     position, weight and color are
     not. This bias is of itself an
     inelegance, or breach of
     theoretical simplicity. Moreover,
     the form that it takes – that of
     requiring every verb form to
     show a tense – is peculiarly
     productive of needless
     complications, since it demands
     lip service to be paid to time
     even when time is farthest from
     our thoughts.”

    Quine 1960
+

    Thanks!
+
    References
    ``Preventing Existence", FOIS 2001.
    ``Making Ontologies Work for Resolving Redundancies Across
    documents", Communications of the ACM, 2002.
    ``Scalability of redundancy detection in focused document
    collections", ScaNaLU, Heidelberg, 2002
    ``Entailment, Intensionality and Text Understanding", Workshop on
    Text Meaning, Edmonton, Canada, May 2003.
    ``Natural Deduction and Context as (Constructive) Modality",
    CONTEXT 2003
    ``A Basic Logic for Textual inference", AAAI Workshop on Inference for
    Textual Question Answering, 2005.
     ``Textual Inference Logic: Take Two", CONTEXT 2007.
     ``Precision-focused Textual Inference", Workshop on Textual
    Entailment and Paraphrasing, 2007
    “Deverbal Nouns in Knowledge Representation”, 2008
    “Context Inducing Nouns”, 2008
+
    Until
    The winter of our discontent
        The Winter of semantics

        As told by Hall, Juravsky and
         Manning in “Studying the
         History of Ideas Using Topic
         Models”, 2008

        Jokes apart, computational
         semantics seemed a thriving
         subject (to me) until around
         1995, when it fell out of favor.

        Completely

        During the same time huge
         progress on statistics based
         methods for language

Mais conteúdo relacionado

Mais procurados

Constructive Description Logics 2006
Constructive Description Logics 2006Constructive Description Logics 2006
Constructive Description Logics 2006Valeria de Paiva
 
KR Workshop 1 - Ontologies
KR Workshop 1 - OntologiesKR Workshop 1 - Ontologies
KR Workshop 1 - OntologiesMichele Pasin
 
FUZZY LOGIC IN NARROW SENSE WITH HEDGES
FUZZY LOGIC IN NARROW SENSE WITH HEDGESFUZZY LOGIC IN NARROW SENSE WITH HEDGES
FUZZY LOGIC IN NARROW SENSE WITH HEDGESijcsit
 
Word sense disambiguation a survey
Word sense disambiguation a surveyWord sense disambiguation a survey
Word sense disambiguation a surveyunyil96
 
Word Sense Disambiguation and Intelligent Information Access
Word Sense Disambiguation and Intelligent Information AccessWord Sense Disambiguation and Intelligent Information Access
Word Sense Disambiguation and Intelligent Information AccessPierpaolo Basile
 
A Constructive Mathematics approach for NL formal grammars
A Constructive Mathematics approach for NL formal grammarsA Constructive Mathematics approach for NL formal grammars
A Constructive Mathematics approach for NL formal grammarsFederico Gobbo
 
Lecture 2: From Semantics To Semantic-Oriented Applications
Lecture 2: From Semantics To Semantic-Oriented ApplicationsLecture 2: From Semantics To Semantic-Oriented Applications
Lecture 2: From Semantics To Semantic-Oriented ApplicationsMarina Santini
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI) International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI) inventionjournals
 
AMBIGUITY-AWARE DOCUMENT SIMILARITY
AMBIGUITY-AWARE DOCUMENT SIMILARITYAMBIGUITY-AWARE DOCUMENT SIMILARITY
AMBIGUITY-AWARE DOCUMENT SIMILARITYijnlc
 
Metaphors of Code
Metaphors of CodeMetaphors of Code
Metaphors of CodeTomi Dufva
 
Temporal Semantic Techniques for Text Analysis and Applications
Temporal Semantic Techniques for Text Analysis and ApplicationsTemporal Semantic Techniques for Text Analysis and Applications
Temporal Semantic Techniques for Text Analysis and ApplicationsFedelucio Narducci
 
Lecture 2: Computational Semantics
Lecture 2: Computational SemanticsLecture 2: Computational Semantics
Lecture 2: Computational SemanticsMarina Santini
 
Learning to understand phrases by embedding the dictionary
Learning to understand phrases by embedding the dictionaryLearning to understand phrases by embedding the dictionary
Learning to understand phrases by embedding the dictionaryRoelof Pieters
 

Mais procurados (13)

Constructive Description Logics 2006
Constructive Description Logics 2006Constructive Description Logics 2006
Constructive Description Logics 2006
 
KR Workshop 1 - Ontologies
KR Workshop 1 - OntologiesKR Workshop 1 - Ontologies
KR Workshop 1 - Ontologies
 
FUZZY LOGIC IN NARROW SENSE WITH HEDGES
FUZZY LOGIC IN NARROW SENSE WITH HEDGESFUZZY LOGIC IN NARROW SENSE WITH HEDGES
FUZZY LOGIC IN NARROW SENSE WITH HEDGES
 
Word sense disambiguation a survey
Word sense disambiguation a surveyWord sense disambiguation a survey
Word sense disambiguation a survey
 
Word Sense Disambiguation and Intelligent Information Access
Word Sense Disambiguation and Intelligent Information AccessWord Sense Disambiguation and Intelligent Information Access
Word Sense Disambiguation and Intelligent Information Access
 
A Constructive Mathematics approach for NL formal grammars
A Constructive Mathematics approach for NL formal grammarsA Constructive Mathematics approach for NL formal grammars
A Constructive Mathematics approach for NL formal grammars
 
Lecture 2: From Semantics To Semantic-Oriented Applications
Lecture 2: From Semantics To Semantic-Oriented ApplicationsLecture 2: From Semantics To Semantic-Oriented Applications
Lecture 2: From Semantics To Semantic-Oriented Applications
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI) International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 
AMBIGUITY-AWARE DOCUMENT SIMILARITY
AMBIGUITY-AWARE DOCUMENT SIMILARITYAMBIGUITY-AWARE DOCUMENT SIMILARITY
AMBIGUITY-AWARE DOCUMENT SIMILARITY
 
Metaphors of Code
Metaphors of CodeMetaphors of Code
Metaphors of Code
 
Temporal Semantic Techniques for Text Analysis and Applications
Temporal Semantic Techniques for Text Analysis and ApplicationsTemporal Semantic Techniques for Text Analysis and Applications
Temporal Semantic Techniques for Text Analysis and Applications
 
Lecture 2: Computational Semantics
Lecture 2: Computational SemanticsLecture 2: Computational Semantics
Lecture 2: Computational Semantics
 
Learning to understand phrases by embedding the dictionary
Learning to understand phrases by embedding the dictionaryLearning to understand phrases by embedding the dictionary
Learning to understand phrases by embedding the dictionary
 

Semelhante a A Bridge Not too Far

A NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITS
A NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITSA NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITS
A NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITSijasuc
 
A NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITS
A NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITSA NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITS
A NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITSijwscjournal
 
Lean Logic for Lean Times: Entailment and Contradiction Revisited
Lean Logic for Lean Times: Entailment and Contradiction RevisitedLean Logic for Lean Times: Entailment and Contradiction Revisited
Lean Logic for Lean Times: Entailment and Contradiction RevisitedValeria de Paiva
 
Corpora, Blogs and Linguistic Variation (Paderborn)
Corpora, Blogs and Linguistic Variation (Paderborn)Corpora, Blogs and Linguistic Variation (Paderborn)
Corpora, Blogs and Linguistic Variation (Paderborn)Cornelius Puschmann
 
NLP_guest_lecture.pdf
NLP_guest_lecture.pdfNLP_guest_lecture.pdf
NLP_guest_lecture.pdfSoha82
 
Effective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsEffective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsAndre Freitas
 
Lean Logic for Lean Times: Varieties of Natural Logic
Lean Logic for Lean Times: Varieties of Natural LogicLean Logic for Lean Times: Varieties of Natural Logic
Lean Logic for Lean Times: Varieties of Natural LogicValeria de Paiva
 
AI Beyond Deep Learning
AI Beyond Deep LearningAI Beyond Deep Learning
AI Beyond Deep LearningAndre Freitas
 
The Role Of Ontology In Modern Expert Systems Dallas 2008
The Role Of Ontology In Modern Expert Systems   Dallas   2008The Role Of Ontology In Modern Expert Systems   Dallas   2008
The Role Of Ontology In Modern Expert Systems Dallas 2008Jason Morris
 
Semiotics and conceptual modeling gv 2015
Semiotics and conceptual modeling   gv 2015Semiotics and conceptual modeling   gv 2015
Semiotics and conceptual modeling gv 2015Guido Vetere
 
Open IE tutorial 2018
Open IE tutorial 2018Open IE tutorial 2018
Open IE tutorial 2018Andre Freitas
 
NLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptNLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptOlusolaTop
 
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...Maryam Farooq
 
Bosch1991a bermuda
Bosch1991a bermudaBosch1991a bermuda
Bosch1991a bermudagorin2008
 
PHPnw (England) User Group - Concepts, Spaces and Thresholds and why they matter
PHPnw (England) User Group - Concepts, Spaces and Thresholds and why they matterPHPnw (England) User Group - Concepts, Spaces and Thresholds and why they matter
PHPnw (England) User Group - Concepts, Spaces and Thresholds and why they matterPeter Jones
 
Objectification Is A Word That Has Many Negative Connotations
Objectification Is A Word That Has Many Negative ConnotationsObjectification Is A Word That Has Many Negative Connotations
Objectification Is A Word That Has Many Negative ConnotationsBeth Johnson
 

Semelhante a A Bridge Not too Far (20)

A NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITS
A NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITSA NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITS
A NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITS
 
A NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITS
A NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITSA NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITS
A NATURAL LOGIC FOR ARTIFICIAL INTELLIGENCE, AND ITS RISKS AND BENEFITS
 
Lean Logic for Lean Times: Entailment and Contradiction Revisited
Lean Logic for Lean Times: Entailment and Contradiction RevisitedLean Logic for Lean Times: Entailment and Contradiction Revisited
Lean Logic for Lean Times: Entailment and Contradiction Revisited
 
Corpora, Blogs and Linguistic Variation (Paderborn)
Corpora, Blogs and Linguistic Variation (Paderborn)Corpora, Blogs and Linguistic Variation (Paderborn)
Corpora, Blogs and Linguistic Variation (Paderborn)
 
NLP_guest_lecture.pdf
NLP_guest_lecture.pdfNLP_guest_lecture.pdf
NLP_guest_lecture.pdf
 
Effective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsEffective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP Systems
 
Lean Logic for Lean Times: Varieties of Natural Logic
Lean Logic for Lean Times: Varieties of Natural LogicLean Logic for Lean Times: Varieties of Natural Logic
Lean Logic for Lean Times: Varieties of Natural Logic
 
AI Beyond Deep Learning
AI Beyond Deep LearningAI Beyond Deep Learning
AI Beyond Deep Learning
 
Bird05 nltk-intro
Bird05 nltk-introBird05 nltk-intro
Bird05 nltk-intro
 
The Role Of Ontology In Modern Expert Systems Dallas 2008
The Role Of Ontology In Modern Expert Systems   Dallas   2008The Role Of Ontology In Modern Expert Systems   Dallas   2008
The Role Of Ontology In Modern Expert Systems Dallas 2008
 
FinalReport
FinalReportFinalReport
FinalReport
 
Semiotics and conceptual modeling gv 2015
Semiotics and conceptual modeling   gv 2015Semiotics and conceptual modeling   gv 2015
Semiotics and conceptual modeling gv 2015
 
Open IE tutorial 2018
Open IE tutorial 2018Open IE tutorial 2018
Open IE tutorial 2018
 
NLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptNLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.ppt
 
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
NYAI #27: Cognitive Architecture & Natural Language Processing w/ Dr. Catheri...
 
PowerMagpie
PowerMagpiePowerMagpie
PowerMagpie
 
Bosch1991a bermuda
Bosch1991a bermudaBosch1991a bermuda
Bosch1991a bermuda
 
The basics of ontologies
The basics of ontologiesThe basics of ontologies
The basics of ontologies
 
PHPnw (England) User Group - Concepts, Spaces and Thresholds and why they matter
PHPnw (England) User Group - Concepts, Spaces and Thresholds and why they matterPHPnw (England) User Group - Concepts, Spaces and Thresholds and why they matter
PHPnw (England) User Group - Concepts, Spaces and Thresholds and why they matter
 
Objectification Is A Word That Has Many Negative Connotations
Objectification Is A Word That Has Many Negative ConnotationsObjectification Is A Word That Has Many Negative Connotations
Objectification Is A Word That Has Many Negative Connotations
 

Mais de Valeria de Paiva

Dialectica Categorical Constructions
Dialectica Categorical ConstructionsDialectica Categorical Constructions
Dialectica Categorical ConstructionsValeria de Paiva
 
Logic & Representation 2021
Logic & Representation 2021Logic & Representation 2021
Logic & Representation 2021Valeria de Paiva
 
Constructive Modal and Linear Logics
Constructive Modal and Linear LogicsConstructive Modal and Linear Logics
Constructive Modal and Linear LogicsValeria de Paiva
 
Dialectica Categories Revisited
Dialectica Categories RevisitedDialectica Categories Revisited
Dialectica Categories RevisitedValeria de Paiva
 
Networked Mathematics: NLP tools for Better Science
Networked Mathematics: NLP tools for Better ScienceNetworked Mathematics: NLP tools for Better Science
Networked Mathematics: NLP tools for Better ScienceValeria de Paiva
 
Going Without: a modality and its role
Going Without: a modality and its roleGoing Without: a modality and its role
Going Without: a modality and its roleValeria de Paiva
 
Problemas de Kolmogorov-Veloso
Problemas de Kolmogorov-VelosoProblemas de Kolmogorov-Veloso
Problemas de Kolmogorov-VelosoValeria de Paiva
 
Natural Language Inference: for Humans and Machines
Natural Language Inference: for Humans and MachinesNatural Language Inference: for Humans and Machines
Natural Language Inference: for Humans and MachinesValeria de Paiva
 
The importance of Being Erneast: Open datasets in Portuguese
The importance of Being Erneast: Open datasets in PortugueseThe importance of Being Erneast: Open datasets in Portuguese
The importance of Being Erneast: Open datasets in PortugueseValeria de Paiva
 
Negation in the Ecumenical System
Negation in the Ecumenical SystemNegation in the Ecumenical System
Negation in the Ecumenical SystemValeria de Paiva
 
Constructive Modal and Linear Logics
Constructive Modal and Linear LogicsConstructive Modal and Linear Logics
Constructive Modal and Linear LogicsValeria de Paiva
 
Semantics and Reasoning for NLP, AI and ACT
Semantics and Reasoning for NLP, AI and ACTSemantics and Reasoning for NLP, AI and ACT
Semantics and Reasoning for NLP, AI and ACTValeria de Paiva
 
Categorical Explicit Substitutions
Categorical Explicit SubstitutionsCategorical Explicit Substitutions
Categorical Explicit SubstitutionsValeria de Paiva
 
Logic and Probabilistic Methods for Dialog
Logic and Probabilistic Methods for DialogLogic and Probabilistic Methods for Dialog
Logic and Probabilistic Methods for DialogValeria de Paiva
 
Intuitive Semantics for Full Intuitionistic Linear Logic (2014)
Intuitive Semantics for Full Intuitionistic Linear Logic (2014)Intuitive Semantics for Full Intuitionistic Linear Logic (2014)
Intuitive Semantics for Full Intuitionistic Linear Logic (2014)Valeria de Paiva
 

Mais de Valeria de Paiva (20)

Dialectica Comonoids
Dialectica ComonoidsDialectica Comonoids
Dialectica Comonoids
 
Dialectica Categorical Constructions
Dialectica Categorical ConstructionsDialectica Categorical Constructions
Dialectica Categorical Constructions
 
Logic & Representation 2021
Logic & Representation 2021Logic & Representation 2021
Logic & Representation 2021
 
Constructive Modal and Linear Logics
Constructive Modal and Linear LogicsConstructive Modal and Linear Logics
Constructive Modal and Linear Logics
 
Dialectica Categories Revisited
Dialectica Categories RevisitedDialectica Categories Revisited
Dialectica Categories Revisited
 
PLN para Tod@s
PLN para Tod@sPLN para Tod@s
PLN para Tod@s
 
Networked Mathematics: NLP tools for Better Science
Networked Mathematics: NLP tools for Better ScienceNetworked Mathematics: NLP tools for Better Science
Networked Mathematics: NLP tools for Better Science
 
Going Without: a modality and its role
Going Without: a modality and its roleGoing Without: a modality and its role
Going Without: a modality and its role
 
Problemas de Kolmogorov-Veloso
Problemas de Kolmogorov-VelosoProblemas de Kolmogorov-Veloso
Problemas de Kolmogorov-Veloso
 
Natural Language Inference: for Humans and Machines
Natural Language Inference: for Humans and MachinesNatural Language Inference: for Humans and Machines
Natural Language Inference: for Humans and Machines
 
Dialectica Petri Nets
Dialectica Petri NetsDialectica Petri Nets
Dialectica Petri Nets
 
The importance of Being Erneast: Open datasets in Portuguese
The importance of Being Erneast: Open datasets in PortugueseThe importance of Being Erneast: Open datasets in Portuguese
The importance of Being Erneast: Open datasets in Portuguese
 
Negation in the Ecumenical System
Negation in the Ecumenical SystemNegation in the Ecumenical System
Negation in the Ecumenical System
 
Constructive Modal and Linear Logics
Constructive Modal and Linear LogicsConstructive Modal and Linear Logics
Constructive Modal and Linear Logics
 
Semantics and Reasoning for NLP, AI and ACT
Semantics and Reasoning for NLP, AI and ACTSemantics and Reasoning for NLP, AI and ACT
Semantics and Reasoning for NLP, AI and ACT
 
NLCS 2013 opening slides
NLCS 2013 opening slidesNLCS 2013 opening slides
NLCS 2013 opening slides
 
Dialectica Comonads
Dialectica ComonadsDialectica Comonads
Dialectica Comonads
 
Categorical Explicit Substitutions
Categorical Explicit SubstitutionsCategorical Explicit Substitutions
Categorical Explicit Substitutions
 
Logic and Probabilistic Methods for Dialog
Logic and Probabilistic Methods for DialogLogic and Probabilistic Methods for Dialog
Logic and Probabilistic Methods for Dialog
 
Intuitive Semantics for Full Intuitionistic Linear Logic (2014)
Intuitive Semantics for Full Intuitionistic Linear Logic (2014)Intuitive Semantics for Full Intuitionistic Linear Logic (2014)
Intuitive Semantics for Full Intuitionistic Linear Logic (2014)
 

Último

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 

Último (20)

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 

A Bridge Not too Far

  • 1. + A Bridge not too far Valeria de Paiva Cuil, Inc.
  • 2. + A Bridge? Translation
  • 3. + Why do we want this bridge?   Want our computer to understand us   Want to to ask questions   large-scale intelligent information access   Want to search for content on the web   exploiting the large amounts of textual data on the web   Want automatic, high quality translation into other languages   Etc…
  • 4. + A bridge between language and logic Wish List:   translation compositional and principled,   meaning preserving, at least truth value preserving…   a reasonable fragment of all language   generic texts   “logical forms” obtained are useful for reasoning. Questions:   which kind of logic on the target?   how do we know when we’re done?   how do we measure quality of results?
  • 5. + Dead simple?   Isn't this what every undergraduate logic text deals with in its first chapter?...   It was Russell's belief that by using the new logic of his day, philosophers would be able to exhibit the underlying “logical form" of natural language statements. SEP   Long before that, it was Leibniz’s idea Calculemus   …problems with ‘the present king of France’ and his hair. “On Denoting”, 1905
  • 6. + Not so simple?   A natural reaction:   “Natural language, states Carnap, is misleading because it is not sharply defined and because it is not systematic enough. The syntactic rules governing natural language sentences do not enable us to distinguish between assertions and pseudo-assertions.” Gamut on R. Carnap’s “The elimination of metaphysics through logical analysis of language”, 1932.   Carnap suggested: Instead do artificial languages.   And logicians did: philosophical and mathematical logicians studied artificial languages
  • 7. + Is it too hard?   “The skepticism in logical circles about the possibility of describing natural language with logical methods is similar to that in logical positivism.”   Tarski thought that applying the semantics he developed for logical languages to natural languages would be problematic[…] any such semantics assumes a precisely formulated syntax,[..] out of the question that such could be found for natural languages.   “The conviction that there is no exact logic of natural language, and that language therefore does not lend itself to an analysis in terms of precise logical notions and rules, is common to just about all analytic philosophers, even those that were most interested in language” Gamuth, 1990
  • 8. + Not for every one...   Davidson “Recent work by Chomsky and others is doing much to bring the complexities of natural language within the scope of serious semantic theory.” 1967   Montague “I reject the contention that an important theoretical difference exists between formal and natural languages” - English as a formal language, 1970   Both a translation from English to a formal system and a model theory for the system   Since then a body of work on Formal Semantics of natural language concentrating on model theory   fragments, pen and paper calculations, interesting phenomena. ..But also lots of computer systems, eg GUS
  • 9. + HANG ON A MINUTE… 30-odd years of solid work on NL semantics doesn’t solve my BRIDGE problem… Need to deal with huge amounts of text Need to be robust Need to move easily between genres, subject domains
  • 10. + The problem: between a rock and a hard place   Knowledge-based   Shallow, open-domain representations of representations meaning   Broad-coverage   Deep/logical representations   Fails “gracefully” allow high precision and   Lower precision and recall recall, but   Hard to compare systems   Typically on restricted domains   Too sensitive to form of sentences   Hard for users to read/ interpret output of the system   Very hard for system to build up knowledge
  • 11. + Which hard place?   Whether you’re fond of symbolic   PASCAL (an EU-funded network of excellence), decided to methods or of machine learning, organize a series of challenges the following are contradictory: benchmarking the task of Recognizing Textural Entailment   No civilians were killed in the (RTE) in 2004. This has been continued by TAC (Text analysis Najaf suicide bombing. conference) supported by NIST. Two civilians died in the Najaf   RTE Challenges promoted suicide bombing. textual entailment recognition as a generic task capturing semantic inference needs across   Detecting whether one piece of applications such as QA, Info text entails another or Retrieval, Info Extraction and doc summarization, contradicts it is a necessary condition for text understanding.   RTE-5 last year.
  • 12. + What makes language hard? Not only semanticists seeking unicorns or non-existing kings… “Negotiations prevented a strike”   Claims that no strike came into being, but true and false assertions can be made about the non- existent strike.   Bad idea: x y Strike(x) Negotiations (y) prevented(x,y)   Could do existence predicates, but entailments don’t work.   Instead do concepts see “Preventing Existence”, Crouch et al, FOIS2001
  • 13. + What is Hard?   Many problems, besides (non-)existence   intensionality   ambiguity   reference resolution,   conditionals,   presuppositions, etc…   See “Entailment, Intensionality and Text Understanding", Condoravdi et al, 2003
  • 14. + Pervasive Intensionality   Intensionality is widespread in   Anecdotal evidence at least 453 natural language sentences out of 1586 on 100 tips   Raises detection issues that photocopier repair will have a cannot be brushed aside: prop complement H: The US sought the release of   Clearly it would be desirable to hostages. cope with intensionality. T: Hostages were released.   Proposal underspecified HT? CONTEXTS People know that propositional   Feasible?... attitudes (knowing, believing, seeking, etc..) introduce intensionality. Can we avoid them?
  • 15. + Scalability of Symbolic Methods   From “Scalability of redundancy detection in Focused Document collections”, Crouch et al, 2002 (More problematic is the series “Symbolic parsing scales well, both in of canonicalizations of coverage and in dealing with the meaning necessary to take proliferation of ambiguity that arises language-based out of broad coverage.” This is the XLE representations to tractable system, based on LFG grammar. conceptual representations.}   Scalability/feasibility proven by doing it. It’s relative to what others are doing and how fast. Claim above based on Riezler et al “Parsing the Wall Street Journal using a lexical functional grammar and discriminative estimation techniques”, 2002. Parsing performance comparable with Collins
  • 16. + Scalability of Symbolic Methods?   Measuring different parsers is not easy.   But has been done for quite a while.   Translating between outputs by different parsers is hard, lots of scope for messing the results.   But there are gold standards, e.g. Penn TreeBank   Semantic representations are harder to compare. No gold standard, no vast collection of annotated data. Not even a consensus on what annotations should look like   Still the case in 2010…
  • 17. + Is it too hard?   By late 2005 I was ready to throw in the towel.   We had a beautiful layered architecture   Processing sentences was fast and high-quality   But big bottleneck when mapping to KR.   Then TWO big changes:   Instead of the ideal golden standard, concentrate on how representations relate to each other, entailment and contradiction detection ECD   Instead of KR concepts from Cyc, fluid, wordy concepts from WordNet/VerbNet
  • 18. + We can do it! How much of it? How do we improve? Architectures for Text Understanding
  • 19. CycL FST UL •  Tok XL Se Tran AK Pre •  Mor LFG E m sfer R ph,.. ECD WN/ VN + Kaplan, Bobrow et al PARC Pipeline
  • 20. + PARC Layered Architecture XLE/LFG P arsing KR M Inference Text apping Engine F-structure Transfer semantics AKR Sources Assertions Question Query Unified Lexicon LFG Term rewriting ECD XLE KR mapping rules Textual Inference MaxEnt models Factives,Deverbals logics Basic idea: canonicalization of meanings
  • 21. + Key Process: Canonicalization of representations   Sentences are parsed into f(unctional)-structures using XLE   F-structures are (somewhat) semantic representations   Transform f-structures into (flat and contexted) transfer semantic structures (inspired by Glue and need to ‘pack’ semantics )   Transform transfer sem-structures into (flat and contexted) AKR structures   what do these layers of representation buy you?
  • 22. +Language/KR misalignments:   Language   Generalizations come from the structure of the language   Representations compositionally derived from sentence structure   Knowledge representation   Generalizations come from the structure of the world   Representations to support reasoning   Maintain multiple interpretations   Layered bridge helps with the different constraints
  • 23. + Semantics and AKR?   Conceptual structure   concepts and roles as in a Description Logic   concepts as WordNet synsets   deverbal normalization   example: destruction of the building -> destroy the building   Temporal structure (extracted from linguistic analysis)   relate time periods, events, states (a hook)   Contextual structure   for negation, propositional attitudes, modals, …   context-lifting rules   Packed ambiguity encoding
  • 24. + PRED fire<Ed, boy> TENSE past Abstract KR: “Ed fired the boy.” SUBJ [ PRED Ed ] (Cyc version) OBJ PRED boy DEF +   (subconcept Ed3 Person)   (subconcept boy2 MaleChild)   (subconcept fire_ev1 DischargeWithPrejudice) Conceptual   (role fire_ev1 performedBy Ed3)   (role fire_ev1 objectActedOn boy2)   (context t)   (instantiable Ed3 t)   (instantiable boy2 t) Contextual   (instantiable fire_ev1 t)   (temporalRel startsAfterEndingOf Now fire_ev1) Temporal Textual Inference Logic, 2005
  • 25. + Abstract KR: “Ed fired the boy.” (WN/ VN version)   cf(1, context(t)),   cf(1, instantiable(‘Ed0',t)),   cf(1, instantiable('boy3',t)),   cf(1, instantiable('fire1',t)),   cf(1, role('Agent','fire1',‘Ed0')),   cf(1, role('Theme','fire1','boy3')),   cf(1,subconcept(‘Ed0',[[7626,4576]])),   cf(1,subconcept('boy3',[[10131706],[9725282],[10464570], [9500236] ),),   cf(A1, subconcept('fire1',[[1124984],[1123061],[1123474]])),   cf(A2, subconcept('fire1',[[2379472]])),   cf(1, temporalRel(startsAfterEndingOf,'Now','fire1')) Textual Inference Logic: Take Two, 2007
  • 26. + Are we there yet?   NO! Challenges abound.   Make translation better (“Deverbal Nouns in Knowledge Representation”, “Context Inducing nouns”, 2008)   Named Entities, measures phrases, gazetteers   Deal with anaphora and discourse (DRS?)   Improve sense clustering   Treatment of copula
  • 27. + At least 8 systems doing similar work:   Boing BLUE   BOXER   Ontosem   Open Knowledge, Schubert   Trips, Allan   Getaruns   TextCap A generic   LXGram Architecture? Semantics in Text Processing, 2008 Eds. Johan Bos, Rodolfo Demonte
  • 28. + Semantics •  LFG •  AKR •  CCG •  Episodic Log A Generic •  HPSG •  Transfer •  MRS •  Triples •  … •  … Architecture •  DRS •  … Knowledge Grammar All require a host of pre & post- Representation processing: text segmenters, POS taggers, Lexica, Named Entity Recognizers, Gazetteers, Temporal Modules, Coreference Resolution, WSD, etc
  • 29. Theorem Prover • Taggers • Parser • DRT strux PreProc CCG Boxer • NER, … • Grammar • De Sandt Nutcracker + C&C Tools Clark, Curran & Bos
  • 30. RTE Stanford •  NE •  POS Stanford preproc tagger GR output ??? parser •  Chinese seg Nat Log + Stanford NLP Manning et al
  • 31. •  NE preprocessing •  POS parsing •  Stanford tagging, etc + GATE, EDITS Cunningham et al, Nagnani et al
  • 32. +   The future seems easier if it’s Open Source (see Ann Copestake’s page)   And collaborative (that too!)   Translation and comparison of results is necessary   Many more lexical resources need to be created and shared How do we got Machine learning of semantics/kr is about it?   required Totally unbaked ideas…   Logics, building up from ECD, using probabilistic component need to be in place   Looking on the bright side… LOTS of FUN WORK!
  • 33. +   Borrowing from Ann Copestake Slacker Semantics:   High-throughput parser with semantic output   Effective statistical technique for syntactic parse ranking   No underlying knowledge base for disambigution Elaborating a little…   Underspecification is good   Support inter-sentential anaphora, text structure   Robust inference and semantic pattern matching   (pay attention to ease of use, notations that help..)
  • 34. + Is it too hard? “Our language shows a tiresome bias in its treatment of time. Relations of date are exalted grammatically as relations of position, weight and color are not. This bias is of itself an inelegance, or breach of theoretical simplicity. Moreover, the form that it takes – that of requiring every verb form to show a tense – is peculiarly productive of needless complications, since it demands lip service to be paid to time even when time is farthest from our thoughts.” Quine 1960
  • 35. + Thanks!
  • 36. + References ``Preventing Existence", FOIS 2001. ``Making Ontologies Work for Resolving Redundancies Across documents", Communications of the ACM, 2002. ``Scalability of redundancy detection in focused document collections", ScaNaLU, Heidelberg, 2002 ``Entailment, Intensionality and Text Understanding", Workshop on Text Meaning, Edmonton, Canada, May 2003. ``Natural Deduction and Context as (Constructive) Modality", CONTEXT 2003 ``A Basic Logic for Textual inference", AAAI Workshop on Inference for Textual Question Answering, 2005. ``Textual Inference Logic: Take Two", CONTEXT 2007. ``Precision-focused Textual Inference", Workshop on Textual Entailment and Paraphrasing, 2007 “Deverbal Nouns in Knowledge Representation”, 2008 “Context Inducing Nouns”, 2008
  • 37. + Until The winter of our discontent   The Winter of semantics   As told by Hall, Juravsky and Manning in “Studying the History of Ideas Using Topic Models”, 2008   Jokes apart, computational semantics seemed a thriving subject (to me) until around 1995, when it fell out of favor.   Completely   During the same time huge progress on statistics based methods for language