Reference Scope Identification in Citing Sentences
Authors: Amjad Abu-Jbara, Dragomir Radev (University of Michigan)
Conference: NAACL 2012
Presenter: Akihiro Kameda (Aizawa Lab., The University of Tokyo)
Abstract
●   Problem:
    ●   Multiple citations in one sentence
    ●   There are many POS taggers developed using
        different techniques for many major languages such
        as transformation-based error-driven learning (Brill,
        1995), decision trees (Black et al., 1992), Markov
        model (Cutting et al., 1992), maximum entropy
        methods (Ratnaparkhi, 1996) etc for English.
●   Approach: preprocessing and 2+1+2*3+1=10 methods
Preprocessing & Methods
Reference Preprocessing
    (tagging, grouping, non-syntactical element removal)
●   These constraints can be lexicalized (REF.1; REF.2),
    unlexicalized (REF.3; TREF.4) or automatically learned
    (REF.5; REF.6).

●   These constraints can be lexicalized (GREF.1), unlexicalized
    (GTREF.2) or automatically learned (GREF.3).

●   (GTREF.1) apply fuzzy techniques for integrating source
    syntax into hierarchical phrase-based systems (REF.2).
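The grouping step shown in the first two examples can be sketched in a few lines of Python. This is only an illustration, assuming references have already been replaced by REF.n / TREF.n placeholders and that a group is a parenthesized, semicolon-separated list. The function name `group_references` and the regular expression are hypothetical, not the authors' implementation.

```python
import re

# A parenthesized list of two or more REF.n / TREF.n placeholders.
GROUP = re.compile(r"\(((?:T?REF\.\d+)(?:;\s*T?REF\.\d+)+)\)")

def group_references(sentence):
    """Collapse each reference group into GREF.k, or GTREF.k if it holds the target."""
    counter = 0

    def replace(match):
        nonlocal counter
        counter += 1
        # A group that contains the target reference becomes a target group.
        label = "GTREF" if "TREF" in match.group(1) else "GREF"
        return f"({label}.{counter})"

    return GROUP.sub(replace, sentence)

example = ("These constraints can be lexicalized (REF.1; REF.2), "
           "unlexicalized (REF.3; TREF.4) or automatically learned "
           "(REF.5; REF.6).")
print(group_references(example))
```

Single references such as (REF.2) are left untouched by this sketch; deciding whether a reference is syntactic or non-syntactic is a separate step that is not modeled here.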
Approach 1 (SVM, LR)
●   Word classification
    ●   with an SVM or a logistic regression classifier
●   Features: Distance, Position (Before/After), In Segment (, . ;
    and, but, for, nor, or, so, yet), POS tag, Dependency
    Distance, Dependency Relations, Common Ancestor Node,
    Syntactic Distance
●   Problem Example:
    ●   There are many POS taggers developed using different
        techniques for many major languages such as transformation-
        based error-driven learning (Brill, 1995), decision trees (Black et
        al., 1992), Markov model (Cutting et al., 1992), maximum entropy
        methods (Ratnaparkhi, 1996) etc for English.
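As a rough illustration of the word-level features, the sketch below computes only the three surface features (Distance, Position, In Segment) for every token relative to the target reference. The POS, dependency, and syntactic features need a tagger and a parser and are left out. The function name `surface_features` and the choice to start a new segment at each delimiter token are assumptions made for this example.

```python
# Segment delimiters listed on the slide: punctuation and coordinating conjunctions.
DELIMITERS = {",", ".", ";", "and", "but", "for", "nor", "or", "so", "yet"}

def surface_features(tokens, target_index):
    """Return one feature dict per token, relative to the target reference."""
    # Assign a segment id to every token; a delimiter opens a new segment.
    segment_ids, current = [], 0
    for token in tokens:
        if token.lower() in DELIMITERS:
            current += 1
        segment_ids.append(current)

    features = []
    for i, token in enumerate(tokens):
        if i < target_index:
            position = "before"
        elif i > target_index:
            position = "after"
        else:
            position = "target"
        features.append({
            "word": token,
            "distance": abs(i - target_index),
            "position": position,
            "same_segment": segment_ids[i] == segment_ids[target_index],
        })
    return features

tokens = ["parse", "selection", "TREF", ",", "parse", "reranking", "REF"]
for row in surface_features(tokens, target_index=2):
    print(row)
```

Each dict would then be turned into a feature vector for the classifier, one training instance per word.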
Approach 2 (CRF)
●   Sequence labeling with CRF
    ●   features are the same as in Approach 1
Approach 3-S1-* (CRF/segment)
●   segmentation (1)
    ●   punctuation marks
    ●   coordinating conjunctions
        –   and, but, for, nor, or, so, yet
    ●   a set of special expressions
        –   "for example", "for instance", "including", "includes",
            "such as", "like", etc.
●   [Rerankers have been successfully applied to numerous
    NLP tasks such as] [parse selection (GTREF)], [parse
    reranking (GREF)], [question-answering (REF)].
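A minimal sketch of segmentation (1) over a pre-tokenized sentence follows. It assumes that each delimiter closes the current segment and stays attached to it, which is one possible reading of the bracketed example. The names `segment`, `CONJUNCTIONS`, `PUNCTUATION`, and `EXPRESSIONS` are hypothetical, and the expression list covers only the items named on the slide.

```python
CONJUNCTIONS = {"and", "but", "for", "nor", "or", "so", "yet"}
PUNCTUATION = {",", ";", ".", ":"}
# Special expressions, checked before single-word delimiters so that
# "for example" is not split at the conjunction "for".
EXPRESSIONS = [("for", "example"), ("for", "instance"), ("such", "as"),
               ("including",), ("includes",), ("like",)]

def segment(tokens):
    """Split tokens into segments; each delimiter ends the segment it belongs to."""
    segments, current = [], []
    i = 0
    while i < len(tokens):
        window = tuple(t.lower() for t in tokens[i:i + 2])
        matched = next((e for e in EXPRESSIONS if window[:len(e)] == e), None)
        if matched is not None:
            step = len(matched)
        elif tokens[i].lower() in CONJUNCTIONS or tokens[i] in PUNCTUATION:
            step = 1
        else:
            # Ordinary token: keep accumulating the current segment.
            current.append(tokens[i])
            i += 1
            continue
        current.extend(tokens[i:i + step])
        segments.append(current)
        current = []
        i += step
    if current:
        segments.append(current)
    return segments

sentence = ("Rerankers have been successfully applied to numerous NLP tasks "
            "such as parse selection (GTREF) , parse reranking (GREF) , "
            "question-answering (REF) .")
for seg in segment(sentence.split()):
    print(seg)
```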
Approach 3-S2-* (CRF/segment)
●   segmentation (2)
    ●   chunking tool
        –   noun groups
        –   verb groups
        –   preposition groups
        –   adjective groups
        –   adverb groups
        –   other parts form segments by themselves
●   [To] [score] [the output] [of] [the coreference models], [we]
    [employ] [the commonly-used MUC scoring program (REF)]
    [and] [the recently-developed CEAF scoring program (TREF)].
Approach 3-*-R1,2,3 (CRF/segment)
●   R1: majority label of the words it contains
●   R2: inside if any word is inside
●   R3: outside if any word is outside
    ●   [I O O O O] [I I I] [O O]
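The three rules can be written as one small function, shown here on the example segments. This is a sketch: the labels "I" (inside) and "O" (outside) follow the slide, while the function name `segment_label` and the tie-breaking choice for R1 (a tie counts as outside) are assumptions.

```python
from collections import Counter

def segment_label(word_labels, rule):
    """Derive one segment label from its word labels ("I" or "O")."""
    if rule == "R1":
        # Majority label of the words; ties fall back to "O" (assumption).
        counts = Counter(word_labels)
        return "I" if counts["I"] > counts["O"] else "O"
    if rule == "R2":
        # Inside if any word is inside.
        return "I" if "I" in word_labels else "O"
    if rule == "R3":
        # Outside if any word is outside.
        return "O" if "O" in word_labels else "I"
    raise ValueError(f"unknown rule: {rule}")

segments = [["I", "O", "O", "O", "O"], ["I", "I", "I"], ["O", "O"]]
for rule in ("R1", "R2", "R3"):
    print(rule, [segment_label(seg, rule) for seg in segments])
```

On these segments R1 and R3 agree, while R2 also marks the first segment as inside because it contains a single inside word.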
AR2011
●   the link grammar parser (Sleator and Temperley, 1991)
Experiment
Data
●   ACL Anthology Network Corpus
●   3300 sentences, each containing at least 2 citations


Annotation agreement
●   500 of 3300,
    ●   Preprocessing is perfect
    ●   Kappa coefficient of scope is
        $K = \frac{P(A) - P(E)}{1 - P(E)} = 2P(A) - 1 = 0.61$
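The simplification to 2P(A) − 1 holds when the chance agreement P(E) equals 0.5, which is presumably what is assumed for two labels (inside/outside). Under that assumption, K = 0.61 corresponds to an observed agreement of P(A) = (0.61 + 1) / 2 = 0.805. That value is derived here from the formula and is not stated on the slide. A small check, with `kappa` as a hypothetical helper name:

```python
def kappa(p_agree, p_chance):
    """Kappa coefficient: agreement above chance, normalized by its maximum."""
    return (p_agree - p_chance) / (1.0 - p_chance)

# With P(E) = 0.5 the general formula reduces to 2 * P(A) - 1.
p_agree = 0.805  # derived from K = 0.61, not reported on the slide
print(round(kappa(p_agree, 0.5), 2))
print(round(2 * p_agree - 1, 2))
```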
Tools
●   Edinburgh Language Technology Text
    Tokenization Toolkit (LT-TTT)
    ●   text tokenization, part-of-speech tagging, chunking,
        and noun phrase head identification.
●   Stanford parser
    ●   syntactic and dependency parsing
●   LibSVM with linear kernel
●   Weka
    ●   logistic regression classification
Tools
●   Machine Learning for Language Toolkit
    (MALLET)
    ●   CRF

Validation
●   10-fold cross validation
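For reference, 10-fold cross validation over the 3300 sentences can be sketched as below. This is an assumption about the setup: the slides do not say whether the data was shuffled or stratified, so the sketch uses plain contiguous folds, and `k_fold_indices` is a hypothetical helper.

```python
def k_fold_indices(n_items, k=10):
    """Return k (train_indices, test_indices) pairs over range(n_items)."""
    # Spread any remainder over the first folds so sizes differ by at most 1.
    sizes = [n_items // k + (1 if i < n_items % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_items))
        folds.append((train, test))
        start += size
    return folds

folds = k_fold_indices(3300, k=10)
print(len(folds), len(folds[0][0]), len(folds[0][1]))
```

Each sentence is used for testing exactly once, and the reported score would be the average over the 10 folds.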
Experiment (Preprocessing)
●   These constraints can be lexicalized (REF.1; REF.2),
    unlexicalized (REF.3; TREF.4) or automatically learned
    (REF.5; REF.6).
    ●   Tagging: 98.3% precision and 93.1% recall
●   These constraints can be lexicalized (GREF.1), unlexicalized
    (GTREF.2) or automatically learned (GREF.3).
    ●   Grouping: Perfect!
●   (GTREF.1) apply fuzzy techniques for integrating source
    syntax into hierarchical phrase-based systems (REF.2).
    ●   Non-syntactic reference removal: 90.08% precision and 90.1% recall
Experiment (Main)
●   CRF
●   Chunking
●   Majority
Feature Analysis
●   Feature: Distance, Position(Before/After), Same
    segment(,.; and, but, for, nor, or, so, yet), POS
    tag, Dependency Distance, Dependency
    Relations, Common Ancestor Node, Syntactic
    Distance
Summary
●   Identified the reference scope in sentences that
    contain multiple citations
●   CRF
●   Chunking
●   Majority
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 

Reference Scope Identification in Citing Sentences

  • 1. Reference Scope Identification in Citing Sentences
      Authors: Amjad Abu-Jbara, Dragomir Radev (University of Michigan)
      Conference: NAACL 2012
      Expositor: Akihiro Kameda (Aizawa Lab., The University of Tokyo)
  • 2. Abstract
      ● Problem:
          ● Multiple citations in one sentence
          ● There are many POS taggers developed using different techniques for many major languages such as transformation-based error-driven learning (Brill, 1995), decision trees (Black et al., 1992), Markov model (Cutting et al., 1992), maximum entropy methods (Ratnaparkhi, 1996) etc for English.
      ● Approach: Preprocessing and 2+1+2*3+1=10 methods
  • 4. Reference Preprocessing (tagging, grouping, non-syntactical element removal)
      ● These constraints can be lexicalized (REF.1; REF.2), unlexicalized (REF.3; TREF.4) or automatically learned (REF.5; REF.6).
      ● These constraints can be lexicalized (GREF.1), unlexicalized (GTREF.2) or automatically learned (GREF.3).
      ● (GTREF.1) apply fuzzy techniques for integrating source syntax into hierarchical phrase-based systems (REF.2).
  • 5. Approach 1 (SVM, LR)
      ● Word classification
          ● with an SVM or a logistic regression classifier
      ● Features: Distance, Position (Before/After), Same segment (, . ; and, but, for, nor, or, so, yet), POS tag, Dependency Distance, Dependency Relations, Common Ancestor Node, Syntactic Distance
      ● Problem Example:
          ● There are many POS taggers developed using different techniques for many major languages such as transformation-based error-driven learning (Brill, 1995), decision trees (Black et al., 1992), Markov model (Cutting et al., 1992), maximum entropy methods (Ratnaparkhi, 1996) etc for English.
  • 6. Approach 2 (CRF) ● Sequence labeling with CRF ● Features are the same as in Approach 1
  • 7. Approach 3-S1-* (CRF/segment)
      ● Segmentation (1)
          ● punctuation marks
          ● coordination conjunctions: and, but, for, nor, or, so, yet
          ● a set of special expressions: "for example", "for instance", "including", "includes", "such as", "like", etc.
      ● [Rerankers have been successfully applied to numerous NLP tasks such as] [parse selection (GTREF)], [parse reranking (GREF)], [question-answering (REF)].
  • 8. Approach 3-S2-* (CRF/segment)
      ● Segmentation (2)
          ● chunking tool: noun groups, verb groups, preposition groups, adjective groups, adverb groups
          ● other parts form segments by themselves
      ● [To] [score] [the output] [of] [the coreference models], [we] [employ] [the commonly-used MUC scoring program (REF)] [and] [the recently-developed CEAF scoring program (TREF)].
  • 9. Approach 3-*-R1,2,3 (CRF/segment)
      ● R1: majority label of the words it contains
      ● R2: inside if any word is inside
      ● R3: outside if any word is outside
      ● [I O O O O] [I I I] [O O]
  • 10. AR2011: the link grammar parser (Sleator and Temperley, 1991)
  • 12. Data
      ● ACL Anthology Network Corpus
      ● 3300 sentences, citations in each ≧ 2
      Annotation agreement
      ● 500 of 3300
      ● Preprocessing is perfect
      ● Kappa coefficient of scope is $K = \frac{P(A) - P(E)}{1 - P(E)} = 2P(A) - 1 = 0.61$
  • 13. Tools
      ● Edinburgh Language Technology Text Tokenization Toolkit (LT-TTT)
          ● text tokenization, part-of-speech tagging, chunking, and noun phrase head identification
      ● Stanford parser
          ● syntactic and dependency parsing
      ● LibSVM with linear kernel
      ● Weka
          ● logistic regression classification
  • 14. Tools
      ● Machine Learning for Language Toolkit (MALLET)
          ● CRF
      Validation
      ● 10-fold cross validation
  • 15. Experiment (Preprocessing)
      ● Tagging: 98.3% precision and 93.1% recall
          ● These constraints can be lexicalized (REF.1; REF.2), unlexicalized (REF.3; TREF.4) or automatically learned (REF.5; REF.6).
      ● Grouping: Perfect!
          ● These constraints can be lexicalized (GREF.1), unlexicalized (GTREF.2) or automatically learned (GREF.3).
      ● Non-syntactic reference removal: 90.08% precision and 90.1% recall
          ● (GTREF.1) apply fuzzy techniques for integrating source syntax into hierarchical phrase-based systems (REF.2).
  • 16. Experiment (Main) ● CRF ● Chunking ● Majority
  • 17. Feature Analysis ● Feature: Distance, Position(Before/After), Same segment(,.; and, but, for, nor, or, so, yet), POS tag, Dependency Distance, Dependency Relations, Common Ancestor Node, Syntactic Distance
  • 18. Summary ● Identified the reference scope in sentences that contain multiple citations ● CRF ● Chunking ● Majority
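
The three segment-labeling rules on slide 9 (R1, R2, R3) can be sketched in a few lines of Python. This is only an illustration, not the authors' implementation: the function name `label_segment` is hypothetical, the labels "I" (inside the scope) and "O" (outside) follow the slide's example, and the tie-breaking choice for R1 is an assumption, since the slide does not say what happens when a segment is evenly split.

```python
from collections import Counter


def label_segment(word_labels, rule):
    """Collapse per-word labels ("I" or "O") into a single segment label.

    `rule` is one of "R1", "R2", "R3" as named on slide 9.
    """
    if not word_labels:
        raise ValueError("a segment must contain at least one word label")
    if rule == "R1":
        # Majority label of the words in the segment.
        # Assumption: ties fall back to "O"; the slide does not specify this.
        counts = Counter(word_labels)
        return "I" if counts["I"] > counts["O"] else "O"
    if rule == "R2":
        # Inside if any word is inside.
        return "I" if "I" in word_labels else "O"
    if rule == "R3":
        # Outside if any word is outside.
        return "O" if "O" in word_labels else "I"
    raise ValueError(f"unknown rule: {rule}")


# The example from slide 9: [I O O O O] [I I I] [O O]
segments = [["I", "O", "O", "O", "O"], ["I", "I", "I"], ["O", "O"]]
results = {
    rule: [label_segment(seg, rule) for seg in segments]
    for rule in ("R1", "R2", "R3")
}
for rule, labels in results.items():
    print(rule, labels)
```

On this example only the first segment is mixed, so it is the only one where the rules can disagree: R2 marks it inside, while R1 and R3 mark it outside.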

Editor's Notes

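  1. The related work section notes that Prof. Nanba and the authors themselves have addressed scope identification for cases where the description of a citation spans multiple sentences. Applications include summarization.
  2. Because there are two annotators, the probability of chance agreement P(E) is 1/2. P(A) is a little over 80%.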
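
The kappa value on slide 12 can be checked against the quantities in the note above. The sketch below assumes P(E) = 0.5, as the note states, and P(A) = 0.805; the latter is back-solved from K = 0.61 and is an assumption, since the note only says P(A) is a little over 80%. The function name `kappa` is hypothetical.

```python
def kappa(p_agree, p_chance):
    """Kappa coefficient: K = (P(A) - P(E)) / (1 - P(E))."""
    if p_chance >= 1.0:
        raise ValueError("P(E) must be below 1")
    return (p_agree - p_chance) / (1.0 - p_chance)


p_chance = 0.5   # P(E), taken from the note
p_agree = 0.805  # P(A), assumed: back-solved from K = 0.61

k = kappa(p_agree, p_chance)
# With P(E) = 1/2 the general formula reduces to 2 * P(A) - 1.
k_shortcut = 2 * p_agree - 1

print(round(k, 2), round(k_shortcut, 2))
```

The shortcut form 2P(A) - 1 holds only because P(E) is exactly 1/2 here; with any other chance-agreement rate the general formula has to be used.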