SlideShare a Scribd company logo
1 of 1
Download to read offline
Isabelle Augenstein, Tim Rocktäschel, Andreas Vlachos,
Kalina Bontcheva
{i.augenstein | t.rocktaschel}@cs.ucl.ac.uk, {a.vlachos | k.bontcheva}@sheffield.ac.uk
Twitter Stance Detection with Bidirectional Conditional Encoding
Bidirectional Conditional Encoding
Bidirectional encoding of tweet conditioned on
bidirectional encoding of target
The stance is predicted using the last forward and
reversed output representations
Stance Detection
Classify attitude of tweet towards target as “favor”, “against”, “none”
Tweet: “No more Hillary Clinton” Target: Donald Trump Stance: FAVOR
Training targets: Climate Change is a Real Concern, Feminist Movement,
Atheism, Legalization of Abortion, Hillary Clinton
Testing target: Donald Trump
Challenges
•  Tweet interpretation depends on target
•  Solution: bidirectional conditional model
•  Labelled data not available for the test target
•  Solution: 1) domain adaptation; 2) weak labelling Results
Unseen Target Models
Weakly Supervised Models
Comparison Against SOA
0.3	
0.35	
0.4	
0.45	
0.5	
0.55	
0.6	
Macro	F1	
BoW	
TweetOnly	
Concat	
TarCondTweet	
TweetCondTar	
BiCond	
Conclusions
•  Learning conditional sequence representation of targets and
tweets better than representations without conditioning
•  State of the art performance with weakly supervised model
•  Good target representations with unsupervised pre-training
•  Successful sequence representation learning with small data
Acknowledgements
This work was partially supported by the European Union, grant No. 611233 PHEME
(http://www.pheme.eu) and by the Microsoft Research PhD Scholarship Programme.
Data
•  SemEval 2016 Task 6 Twitter Stance Detection corpus
•  5 628 labelled training tweets (1 278 about Hillary Clinton,
used for dev)
•  278 013 unlabelled Donald Trump tweets
•  395 212 additional unlabelled tweets about all targets
•  707 Donald Trump testing tweets
Experiments
Stance Detection Models
•  Sequence-agnostic: BoWV
•  Target-agnostic: Tweet-only LSTM encoding (TweetOnly)
•  Target+Tweet, no conditioning: Concatenated target and
tweet LSTM encodings (Concat)
•  Target+Tweet, conditioning: Target conditioned on tweet
(TarCondTweet); tweet conditioned on target
(TweetCondTar); bidirectional conditioning (BiCond)
Word Embedding Training
•  Random initialisation
•  Trained fixed word embeddings on tweets
•  Pre-trained word embeddings on tweets, continue training
with supervised objective
Model	 Macro	F1	
SVM-ngrams-comb	(official)	 0.2843	
BiCond	(Unseen	Target)	 0.4901	
pkudblab	(Weakly	Supervised*)	 0.5628	
BiCond	(Weakly	Supervised)	 0.5803	
References
SemEval 2016 Stance Detection: http://alt.qcri.org/semeval2016/task6/
Code: https://github.com/sheffieldnlp/stance-conditional
x1
c!
1
c1
h!
1
h1
x2
c!
2
c2
h!
2
h2
x3
c!
3
c3
h!
3
h3
x4
c!
4
c4
h!
4
h4
x5
c!
5
c5
h!
5
h5
x6
c!
6
c6
h!
6
h6
x7
c!
7
c7
h!
7
h7
x8
c!
8
c8
h!
8
h8
x9
c!
9
c9
h!
9
h9
Legalization of Abortion A foetus has rights too !
Target Tweet
Figure 1: Bidirectional encoding of tweet conditioned on bidirectional encoding of target ([c!
3 c1 ]). The stance is predicted using
the last forward and reversed output representations ([h!
9 h4 ]).
Here, xt is an input vector at time step t, ct denotes
the LSTM memory, ht 2 Rk is an output vector and
the remaining weight matrices and biases are train-
able parameters. We concatenate the two output vec-
tor representations and classify the stance using the
softmax over a non-linear projection
softmax(tanh(Wta
htarget + Wtw
htweet + b))
into the space of the three classes for stance detec-
tion where Wta, Wtw 2 R3⇥k are trainable weight
matrices and b 2 R3 is a trainable class bias. This
model learns target-independent distributed repre-
sentations for the tweets and relies on the non-
linear projection layer to incorporate the target in the
stance prediction.
3.2 Conditional Encoding
In order to learn target-dependent tweet representa-
tions, we use conditional encoding as previously ap-
plied to the task of recognising textual entailment
(Rockt¨aschel et al., 2016). We use one LSTM to en-
code the target as a fixed-length vector. Then, we
encode the tweet with another LSTM, whose state
is initialised with the representation of the target.
Finally, we use the last output vector of the tweet
LSTM to predict the stance of the target-tweet pair.
Formally, let (x1, . . . , xT ) be a sequence of tar-
get word vectors, (xT+1, . . . , xN ) be a sequence of
tweet word vectors and [h0 c0] be a start state of
zeros. The two LSTMs map input vectors and a pre-
vious state to a next state as follows:
[h1 c1] = LSTMtarget
(x1, h0, c0)
. . .
[hT cT ] = LSTMtarget
(xT , hT 1, cT 1)
[hT+1 cT+1] = LSTMtweet
(xT+1, h0, cT )
. . .
[hN cN ] = LSTMtweet
(xN , hN 1, cN 1)
Finally, the stance of the tweet w.r.t. the target is
classified using a non-linear projection
c = tanh(WhN )
where W 2 R3⇥k is a trainable weight matrix.
This effectively allows the second LSTM to read the
tweet in a target-specific manner, which is crucial
since the stance of the tweet depends on the target
(recall the Donald Trump example above).
3.3 Bidirectional Conditional Encoding
Bidirectional LSTMs (Graves and Schmidhuber,
2005) have been shown to learn improved represen-
tations of sequences by encoding a sequence from
left to right and from right to left. Therefore, we
adapt the conditional encoding model from Sec-
tion 3.2 to use bidirectional LSTMs, which repre-
sent the target and the tweet using two vectors for
each of them, one obtained by reading the target
[c3
!
c1
!
]
[h9
!
h4
!
]
Setting
Unseen Target Setting
•  Train on Climate Change Is A
Real Concern, Feminist
Movement, Atheism, Legalization
of Abortion, Hillary Clinton tweets
•  Test on Donald Trump tweets
Weakly Supervised Setting
•  Weakly label Donald tweets
using hashtags for training
•  Test on Donald Trump tweets
0.3	
0.35	
0.4	
0.45	
0.5	
0.55	
0.6	
Macro	F1	
BoW	
TweetOnly	
Concat	
TarCondTweet	
TweetCondTar	
BiCond

More Related Content

Viewers also liked

Question Answering over Linked Data (Reasoning Web Summer School)
Question Answering over Linked Data (Reasoning Web Summer School)Question Answering over Linked Data (Reasoning Web Summer School)
Question Answering over Linked Data (Reasoning Web Summer School)Andre Freitas
 
Information Extraction with Linked Data
Information Extraction with Linked DataInformation Extraction with Linked Data
Information Extraction with Linked DataIsabelle Augenstein
 
Natural Language Processing for the Semantic Web
Natural Language Processing for the Semantic WebNatural Language Processing for the Semantic Web
Natural Language Processing for the Semantic WebIsabelle Augenstein
 
Lecture: Question Answering
Lecture: Question AnsweringLecture: Question Answering
Lecture: Question AnsweringMarina Santini
 
Semantic Search Over The Web
Semantic Search Over The WebSemantic Search Over The Web
Semantic Search Over The Webalierkan
 
Management information system question and answers
Management information system question and answersManagement information system question and answers
Management information system question and answerspradeep acharya
 
Deep Learning Models for Question Answering
Deep Learning Models for Question AnsweringDeep Learning Models for Question Answering
Deep Learning Models for Question AnsweringSujit Pal
 
Intro to Deep Learning for Question Answering
Intro to Deep Learning for Question AnsweringIntro to Deep Learning for Question Answering
Intro to Deep Learning for Question AnsweringTraian Rebedea
 
Talent Sourcing and Matching - Artificial Intelligence and Black Box Semantic...
Talent Sourcing and Matching - Artificial Intelligence and Black Box Semantic...Talent Sourcing and Matching - Artificial Intelligence and Black Box Semantic...
Talent Sourcing and Matching - Artificial Intelligence and Black Box Semantic...Glen Cathey
 
Web 3.0 The Semantic Web
Web 3.0 The Semantic WebWeb 3.0 The Semantic Web
Web 3.0 The Semantic WebHatem Mahmoud
 

Viewers also liked (11)

Human Neural Machine
Human Neural MachineHuman Neural Machine
Human Neural Machine
 
Question Answering over Linked Data (Reasoning Web Summer School)
Question Answering over Linked Data (Reasoning Web Summer School)Question Answering over Linked Data (Reasoning Web Summer School)
Question Answering over Linked Data (Reasoning Web Summer School)
 
Information Extraction with Linked Data
Information Extraction with Linked DataInformation Extraction with Linked Data
Information Extraction with Linked Data
 
Natural Language Processing for the Semantic Web
Natural Language Processing for the Semantic WebNatural Language Processing for the Semantic Web
Natural Language Processing for the Semantic Web
 
Lecture: Question Answering
Lecture: Question AnsweringLecture: Question Answering
Lecture: Question Answering
 
Semantic Search Over The Web
Semantic Search Over The WebSemantic Search Over The Web
Semantic Search Over The Web
 
Management information system question and answers
Management information system question and answersManagement information system question and answers
Management information system question and answers
 
Deep Learning Models for Question Answering
Deep Learning Models for Question AnsweringDeep Learning Models for Question Answering
Deep Learning Models for Question Answering
 
Intro to Deep Learning for Question Answering
Intro to Deep Learning for Question AnsweringIntro to Deep Learning for Question Answering
Intro to Deep Learning for Question Answering
 
Talent Sourcing and Matching - Artificial Intelligence and Black Box Semantic...
Talent Sourcing and Matching - Artificial Intelligence and Black Box Semantic...Talent Sourcing and Matching - Artificial Intelligence and Black Box Semantic...
Talent Sourcing and Matching - Artificial Intelligence and Black Box Semantic...
 
Web 3.0 The Semantic Web
Web 3.0 The Semantic WebWeb 3.0 The Semantic Web
Web 3.0 The Semantic Web
 

More from Isabelle Augenstein

Beyond Fact Checking — Modelling Information Change in Scientific Communication
Beyond Fact Checking — Modelling Information Change in Scientific CommunicationBeyond Fact Checking — Modelling Information Change in Scientific Communication
Beyond Fact Checking — Modelling Information Change in Scientific CommunicationIsabelle Augenstein
 
Automatically Detecting Scientific Misinformation
Automatically Detecting Scientific MisinformationAutomatically Detecting Scientific Misinformation
Automatically Detecting Scientific MisinformationIsabelle Augenstein
 
Accountable and Robust Automatic Fact Checking
Accountable and Robust Automatic Fact CheckingAccountable and Robust Automatic Fact Checking
Accountable and Robust Automatic Fact CheckingIsabelle Augenstein
 
Determining the Credibility of Science Communication
Determining the Credibility of Science CommunicationDetermining the Credibility of Science Communication
Determining the Credibility of Science CommunicationIsabelle Augenstein
 
Towards Explainable Fact Checking (DIKU Business Club presentation)
Towards Explainable Fact Checking (DIKU Business Club presentation)Towards Explainable Fact Checking (DIKU Business Club presentation)
Towards Explainable Fact Checking (DIKU Business Club presentation)Isabelle Augenstein
 
Towards Explainable Fact Checking
Towards Explainable Fact CheckingTowards Explainable Fact Checking
Towards Explainable Fact CheckingIsabelle Augenstein
 
Tracking False Information Online
Tracking False Information OnlineTracking False Information Online
Tracking False Information OnlineIsabelle Augenstein
 
What can typological knowledge bases and language representations tell us abo...
What can typological knowledge bases and language representations tell us abo...What can typological knowledge bases and language representations tell us abo...
What can typological knowledge bases and language representations tell us abo...Isabelle Augenstein
 
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...Isabelle Augenstein
 
Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondIsabelle Augenstein
 
Learning to read for automated fact checking
Learning to read for automated fact checkingLearning to read for automated fact checking
Learning to read for automated fact checkingIsabelle Augenstein
 
SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...
SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...
SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...Isabelle Augenstein
 
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...Isabelle Augenstein
 
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...Isabelle Augenstein
 
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...Isabelle Augenstein
 
Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...Isabelle Augenstein
 

More from Isabelle Augenstein (17)

Beyond Fact Checking — Modelling Information Change in Scientific Communication
Beyond Fact Checking — Modelling Information Change in Scientific CommunicationBeyond Fact Checking — Modelling Information Change in Scientific Communication
Beyond Fact Checking — Modelling Information Change in Scientific Communication
 
Automatically Detecting Scientific Misinformation
Automatically Detecting Scientific MisinformationAutomatically Detecting Scientific Misinformation
Automatically Detecting Scientific Misinformation
 
Accountable and Robust Automatic Fact Checking
Accountable and Robust Automatic Fact CheckingAccountable and Robust Automatic Fact Checking
Accountable and Robust Automatic Fact Checking
 
Determining the Credibility of Science Communication
Determining the Credibility of Science CommunicationDetermining the Credibility of Science Communication
Determining the Credibility of Science Communication
 
Towards Explainable Fact Checking (DIKU Business Club presentation)
Towards Explainable Fact Checking (DIKU Business Club presentation)Towards Explainable Fact Checking (DIKU Business Club presentation)
Towards Explainable Fact Checking (DIKU Business Club presentation)
 
Explainability for NLP
Explainability for NLPExplainability for NLP
Explainability for NLP
 
Towards Explainable Fact Checking
Towards Explainable Fact CheckingTowards Explainable Fact Checking
Towards Explainable Fact Checking
 
Tracking False Information Online
Tracking False Information OnlineTracking False Information Online
Tracking False Information Online
 
What can typological knowledge bases and language representations tell us abo...
What can typological knowledge bases and language representations tell us abo...What can typological knowledge bases and language representations tell us abo...
What can typological knowledge bases and language representations tell us abo...
 
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
 
Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyond
 
Learning to read for automated fact checking
Learning to read for automated fact checkingLearning to read for automated fact checking
Learning to read for automated fact checking
 
SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...
SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...
SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...
 
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
 
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
 
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
 
Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...
 

Recently uploaded

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 

Recently uploaded (20)

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 

Stance Detection with Bidirectional Conditional Encoding

  • 1. Isabelle Augenstein, Tim Rocktäschel, Andreas Vlachos, Kalina Bontcheva {i.augenstein | t.rocktaschel}@cs.ucl.ac.uk, {a.vlachos | k.bontcheva}@sheffield.ac.uk Twitter Stance Detection with Bidirectional Conditional Encoding Bidirectional Conditional Encoding Bidirectional encoding of tweet conditioned on bidirectional encoding of target The stance is predicted using the last forward and reversed output representations Stance Detection Classify attitude of tweet towards target as “favor”, “against”, “none” Tweet: “No more Hillary Clinton” Target: Donald Trump Stance: FAVOR Training targets: Climate Change is a Real Concern, Feminist Movement, Atheism, Legalization of Abortion, Hillary Clinton Testing target: Donald Trump Challenges •  Tweet interpretation depends on target •  Solution: bidirectional conditional model •  Labelled data not available for the test target •  Solution: 1) domain adaptation; 2) weak labelling Results Unseen Target Models Weakly Supervised Models Comparison Against SOA 0.3 0.35 0.4 0.45 0.5 0.55 0.6 Macro F1 BoW TweetOnly Concat TarCondTweet TweetCondTar BiCond Conclusions •  Learning conditional sequence representation of targets and tweets better than representations without conditioning •  State of the art performance with weakly supervised model •  Good target representations with unsupervised pre-training •  Successful sequence representation learning with small data Acknowledgements This work was partially supported by the European Union, grant No. 611233 PHEME (http://www.pheme.eu) and by the Microsoft Research PhD Scholarship Programme. Data •  SemEval 2016 Task 6 Twitter Stance Detection corpus •  5 628 labelled training tweets (1 278 about Hillary Clinton, used for dev) •  278 013 unlabelled Donald Trump tweets •  395 212 additional unlabelled tweets about all targets •  707 Donald Trump testing tweets Experiments Stance Detection Models •  Sequence-agnostic: BoWV •  Target-agnostic: Tweet-only LSTM encoding (TweetOnly) •  Target+Tweet, no conditioning: Concatenated target and tweet LSTM encodings (Concat) •  Target+Tweet, conditioning: Target conditioned on tweet (TarCondTweet); tweet conditioned on target (TweetCondTar); bidirectional conditioning (BiCond) Word Embedding Training •  Random initialisation •  Trained fixed word embeddings on tweets •  Pre-trained word embeddings on tweets, continue training with supervised objective Model Macro F1 SVM-ngrams-comb (official) 0.2843 BiCond (Unseen Target) 0.4901 pkudblab (Weakly Supervised*) 0.5628 BiCond (Weakly Supervised) 0.5803 References SemEval 2016 Stance Detection: http://alt.qcri.org/semeval2016/task6/ Code: https://github.com/sheffieldnlp/stance-conditional x1 c! 1 c1 h! 1 h1 x2 c! 2 c2 h! 2 h2 x3 c! 3 c3 h! 3 h3 x4 c! 4 c4 h! 4 h4 x5 c! 5 c5 h! 5 h5 x6 c! 6 c6 h! 6 h6 x7 c! 7 c7 h! 7 h7 x8 c! 8 c8 h! 8 h8 x9 c! 9 c9 h! 9 h9 Legalization of Abortion A foetus has rights too ! Target Tweet Figure 1: Bidirectional encoding of tweet conditioned on bidirectional encoding of target ([c! 3 c1 ]). The stance is predicted using the last forward and reversed output representations ([h! 9 h4 ]). Here, xt is an input vector at time step t, ct denotes the LSTM memory, ht 2 Rk is an output vector and the remaining weight matrices and biases are train- able parameters. We concatenate the two output vec- tor representations and classify the stance using the softmax over a non-linear projection softmax(tanh(Wta htarget + Wtw htweet + b)) into the space of the three classes for stance detec- tion where Wta, Wtw 2 R3⇥k are trainable weight matrices and b 2 R3 is a trainable class bias. This model learns target-independent distributed repre- sentations for the tweets and relies on the non- linear projection layer to incorporate the target in the stance prediction. 3.2 Conditional Encoding In order to learn target-dependent tweet representa- tions, we use conditional encoding as previously ap- plied to the task of recognising textual entailment (Rockt¨aschel et al., 2016). We use one LSTM to en- code the target as a fixed-length vector. Then, we encode the tweet with another LSTM, whose state is initialised with the representation of the target. Finally, we use the last output vector of the tweet LSTM to predict the stance of the target-tweet pair. Formally, let (x1, . . . , xT ) be a sequence of tar- get word vectors, (xT+1, . . . , xN ) be a sequence of tweet word vectors and [h0 c0] be a start state of zeros. The two LSTMs map input vectors and a pre- vious state to a next state as follows: [h1 c1] = LSTMtarget (x1, h0, c0) . . . [hT cT ] = LSTMtarget (xT , hT 1, cT 1) [hT+1 cT+1] = LSTMtweet (xT+1, h0, cT ) . . . [hN cN ] = LSTMtweet (xN , hN 1, cN 1) Finally, the stance of the tweet w.r.t. the target is classified using a non-linear projection c = tanh(WhN ) where W 2 R3⇥k is a trainable weight matrix. This effectively allows the second LSTM to read the tweet in a target-specific manner, which is crucial since the stance of the tweet depends on the target (recall the Donald Trump example above). 3.3 Bidirectional Conditional Encoding Bidirectional LSTMs (Graves and Schmidhuber, 2005) have been shown to learn improved represen- tations of sequences by encoding a sequence from left to right and from right to left. Therefore, we adapt the conditional encoding model from Sec- tion 3.2 to use bidirectional LSTMs, which repre- sent the target and the tweet using two vectors for each of them, one obtained by reading the target [c3 ! c1 ! ] [h9 ! h4 ! ] Setting Unseen Target Setting •  Train on Climate Change Is A Real Concern, Feminist Movement, Atheism, Legalization of Abortion, Hillary Clinton tweets •  Test on Donald Trump tweets Weakly Supervised Setting •  Weakly label Donald tweets using hashtags for training •  Test on Donald Trump tweets 0.3 0.35 0.4 0.45 0.5 0.55 0.6 Macro F1 BoW TweetOnly Concat TarCondTweet TweetCondTar BiCond