1. DETECTION AND
CLASSIFICATION OF
FAKE NEWS USING CNN
BY VENKATRAMAN J, SENIOR SOFTWARE ENGINEER, METAPACK GROUP
MASTER'S STUDENT IN NLP AND MACHINE LEARNING, UNIVERSITY OF LONDON
AUGUST 31, 2018
2. OUTLINE
• Section 1
Spread of fake news through social media and its implications on society
Why is fake news identification such a hot topic now?
• Section 2
Data set/Corpus exploration
Deep learning approaches to combat fake news
Convolutional neural networks in text classification, Network Architecture
• Section 3
Results summary
Questions
3. WHAT, WHY, AND ITS IMPLICATIONS
• A short definition of fake news: an article or piece of content created with false information to mislead
readers and spread false claims. Fake news is created by different people for various reasons, including
financial and political gain.
• Is social media to blame for the spread of fake news?
Users on social media trust, like, and share articles posted by their friends
Humans can't fact-check every article or blog they read on the internet
• Implications
https://www.snopes.com/fact-check/morgan-freeman-death-hoax/
According to some tweets, the CNN news channel had declared Morgan Freeman dead in 2010.
CNN later reported that this was not true.
4. DATA SET/CORPUS EXPLORATION
• Collecting suitable datasets is a challenging problem.
• Authenticated fact checked data sources
• Datasets published for research purposes
Kaggle, GitHub – how much can we trust?
• Dataset cleaning and preparation for learning algorithm
NLP techniques: stop-word removal, padding documents to the same length, dropping missing columns
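The cleaning steps above can be sketched in plain Python. The stop-word list and the fixed length `MAX_LEN` are illustrative placeholders, not the actual values used in the project:

```python
# Minimal preprocessing sketch: stop-word removal and padding documents
# to the same length. STOP_WORDS and MAX_LEN are toy values for
# illustration; a real pipeline would use a full stop-word list and a
# length chosen from the corpus statistics.
STOP_WORDS = {"the", "a", "an", "is", "was", "to", "of", "and", "in"}
MAX_LEN = 8          # hypothetical fixed document length
PAD_TOKEN = "<pad>"

def preprocess(text):
    # lowercase, tokenize on whitespace, and drop stop words
    tokens = [t for t in text.lower().split() if t not in STOP_WORDS]
    tokens = tokens[:MAX_LEN]                        # truncate long documents
    tokens += [PAD_TOKEN] * (MAX_LEN - len(tokens))  # pad short ones
    return tokens

print(preprocess("Morgan Freeman was declared dead in 2010 by CNN"))
# ['morgan', 'freeman', 'declared', 'dead', '2010', 'by', 'cnn', '<pad>']
```

In the actual project the same effect is achieved with Keras utilities such as `pad_sequences`, applied after the text has been mapped to integer token ids.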
5. DEEP LEARNING APPROACHES
• Binary classification problem F : E → {0, 1} such that
F(a) = 1 if a is a piece of fake news, 0 otherwise
Baseline classifiers – Naive Bayes and support vector machines (SVM)
Bag-of-words model with TF-IDF weighting; dimensionality reduction and feature extraction
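The bag-of-words features with TF-IDF weighting fed to the baseline classifiers can be computed directly; the three-document corpus below is a toy example for illustration:

```python
import math
from collections import Counter

# Toy corpus of pre-tokenized documents (illustrative only).
docs = [
    ["fake", "news", "spreads", "fast"],
    ["real", "news", "reports", "facts"],
    ["fake", "claims", "mislead", "readers"],
]

def tfidf(doc, corpus):
    """TF-IDF weights for one document: term frequency times
    log(N / document frequency)."""
    tf = Counter(doc)
    n = len(corpus)
    return {
        term: (count / len(doc)) *
              math.log(n / sum(1 for d in corpus if term in d))
        for term, count in tf.items()
    }

weights = tfidf(docs[0], docs)
# "news" occurs in 2 of the 3 documents, so its IDF (and weight) is low;
# "spreads" occurs in only 1, so it is weighted more heavily.
```

In practice a library implementation (e.g. scikit-learn's `TfidfVectorizer`) would be used, but the weighting scheme is exactly this.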
• Why deep learning? Traditional models do not capture the semantics of text.
"Words with similar meanings appear in similar contexts and should have similar representations"
• Word embeddings and n-gram models to the rescue
n-gram (bigram, trigram) and skip-gram models – probabilistic language models
• Vectorization – word2vec, fastText, GloVe vectors
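A common way to use such pre-trained vectors is to build an embedding matrix indexed by the vocabulary, with zero vectors for padding and out-of-vocabulary words. The 4-dimensional toy vectors below stand in for real 100/300-dimensional GloVe or word2vec embeddings:

```python
import numpy as np

# Toy pre-trained vectors standing in for GloVe/word2vec embeddings.
pretrained = {
    "fake": np.array([0.1, 0.9, 0.3, 0.2]),
    "news": np.array([0.4, 0.8, 0.1, 0.5]),
}
vocab = {"<pad>": 0, "fake": 1, "news": 2, "unseen": 3}
dim = 4

embedding_matrix = np.zeros((len(vocab), dim))
for word, idx in vocab.items():
    vec = pretrained.get(word)
    if vec is not None:
        embedding_matrix[idx] = vec  # known word: copy its vector
    # padding and unknown words stay as zero vectors

print(embedding_matrix.shape)  # (4, 4)
```

This matrix becomes the weight matrix of the network's embedding layer, so each token id is looked up as a dense vector.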
6. CONVOLUTIONAL NEURAL NETWORKS IN TEXT
CLASSIFICATION
• Convolutional neural networks
State-of-the-art in computer vision and sentence classification
Convolutional layers, pooling layers, and fully connected layers
• How does a CNN fit text and NLP?
CNNs preserve the spatial structure of text, which is one-dimensional
Convolutional layers extract local features from text effectively
GlobalMaxPooling1D extracts global features from the feature vectors
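The mechanics of a 1D convolution followed by global max pooling can be shown on a tiny example: a filter of width 3 slides over a sequence of word vectors, and global max pooling keeps the strongest filter response found anywhere in the document. The sequence and filter weights below are toy values:

```python
import numpy as np

# One 2-dimensional word vector per token (toy values).
seq = np.array([[0.1, 0.2],
                [0.9, 0.8],
                [0.3, 0.1],
                [0.7, 0.6]])
filt = np.ones((3, 2))  # a single width-3 convolution filter (toy weights)

# Slide the filter over every window of 3 consecutive tokens
# (a Conv1D layer does this for many filters in parallel).
responses = np.array([
    np.sum(seq[i:i + 3] * filt) for i in range(len(seq) - 2)
])
pooled = responses.max()  # GlobalMaxPooling1D: max response over positions
```

The pooled value is position-independent, which is why this combination detects an informative n-gram wherever it appears in the document.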
7. NETWORK ARCHITECTURE
• P(article = fake | new input document)
• Pipeline (recovered from the architecture diagram):
Document vector → Word Embedding layer (pre-trained embeddings) → Conv1D layer (windows and filters) → Max Pooling layer → Conv1D layer (windows and filters) → Max Pooling layer → Conv1D layer (windows and filters) → Global Max Pooling layer → Fully Connected layer → Softmax probabilistic output layer (log-likelihood)
8. RESULTS SUMMARY
• Model was trained using Keras with a TensorFlow backend
• Training data size: 2.5 GB
• Epochs: 150, batch size: 256, trained on CPU
• Comparison of results with baseline classifiers
Model accuracy: CNN 99.8%, SVM 90%, Naïve Bayes 85%
9. QUESTIONS?
Reach me on Twitter @venar82 or LinkedIn: Venkatraman Jeyaraman
Blogs https://dzone.com/articles/demystifying-ai-and-machine-learning-part-2