SlideShare uma empresa Scribd logo
1 de 46
Baixar para ler offline
Deep Learning
for Natural Language Processing
  
   
Bargava Subramanian @bargava
Amit Kapoor @amitkaps
1
Language Challenge
2
Put these adjectives in order: [adj.] + [Knife]
— old
— French
— lovely
— green
— rectangular
— whittling
— silver
— little
3
Which order is correct?
lovely old silver rectangular green little French
whittling knife
old lovely French rectangular green little
whittling silver knife
lovely little old rectangular green French silver
whittling knife
4
Grammar has rules
opinion - size - age - shape - colour -
origin -material - purpose [Noun]
The right version:
lovely little old rectangular green French silver
whittling knife
5
We speak the grammar, yet we don't know it
6
Natural Language
Problems are hard
7
Natural Language Programming Problems
— Summarization
— Text Classification (e.g. spam)
— Sentiment / Emotion Analysis
— Topic Modelling
— Recommendations
— Text Evaluation (e.g. grading)
8
Plan for this Session
— Moving beyond Statistical Learning
— Take first steps in NLP with Deep
Learning
— Showcase an example
— Practical challenges to overcome
9
NLP Learning Process
___
[1] Frame: Problem definition
[2] Acquire: Text ingestion
[3] Refine: Text wrangling
[4] Transform: Feature creation
[5] Explore: Feature selection
[6] Model: Model selection
[7] Insight: Solution communication
10
Simple Case
Demonetisation in India
11
Demonetisation in India
On Nov 8th, 2016, the National Government
announced that existing INR 1000 and INR
500 notes are no longer legal.
12
13
Reactions on Twitter
People started tweeting with the tag:
#demonetisation
14
[1] Frame
Create a viral tweet on
#demonetisation
15
Traditional way of framing
1. Someone has to write a tweet.
2. Run it on the classifier
3. If probability is high, post it.
4. Else, goto step 1
The prediction will be a probability of a new
tweet to go viral or not?
16
Generating tweets
— Can we learn from
historical tweets
algorithmically to
generate a viral tweet?
— Not possible to do using
traditional methods
17
Revised framing for Text Generation
Generate a tweet algorithmically, that is likely
to go viral
18
[2] Acquire
Get the raw tweets data
19
Get Tweets on
#demonetisation
Write your own twitter api
client to get json file or
use a python package like
Tweepy, but need to
manage rate limiting etc.
We used tweezer - an
open source project to get
twitter data
Raw dataset - 30,000+
tweets from past 1 week.
20
[3] Refine
How to categorise a tweet
as viral or not?
21
Simple Approach for Labelling
IF
retweets + favourites > = 100
THEN
Label = viral
ELSE
Label = normal
22
Sanitizing Tweets
— Stopword
— Stemming
— Remove urls
— Remove 'RT'
— Remove 'n'
23
[4] Transform
Creating Features from
Text
24
Traditional methods to covert text to numeric
— TF-IDF: Measures importance of a word
in a document relative to the corpus
— Bag-of-Word: Count of occurrences of a
word in a document
— n-grams: Count of every 1-word, 2-word,
etc combinations in a document
— entity & POS tagging: Transform
sentence to parts-of-speech, extract
entities and encode
25
Challenges in traditional methods of encoding
— Sparse inputs
— Input data space explodes
— Context lost in encoding
A quiet crowd entered the historic church
!=
A historic crowd entered the quiet church
26
Deep Learning Approach
Low-dimensional dense vectors for
representation.
— Tokenise characters (Faster)
— Tokenise words (More accurate, but needs
more memory)
27
Word Embedding
— Learn high-quality word vectors
— Similar words needs to be close to each
other
— Words can have multiple degrees of
similarity
28
Word Embedding using word2vec
Combines two approaches
— skip-gram: Predicting word given its
context
— continuous bag-of-words: Predicting
context given a word
29
word2vec: Example
vec[queen] − vec[king] = vec[woman] − vec[man]
1
1
https://www.tensorflow.org/versions/r0.12/tutorials/word2vec/index.html
30
[5] Explore
Features Selection
31
Feature Selection
— Manual process in Traditional Approach
— Feature selection happens automatically
in Deep Learning
32
[6] Model
Model Selection
33
Recurrent Neural Network (RNN)
— Network with loops
— Allows information to persist
— Enables connecting previous information
to present task
— Context preserved
I grew up in Brazil and I speak ______________.
                                                        portuguese
34
Unrolling over Time
____
[1] Think sequences - in input & output
    
- Recognize Image -> Explain in words
- Sentence(s) -> Sentiment Analysis
- English - Spanish Translation
- Video - task classification
35
Unrolled RNN
[2] Multiple copies of the same network
[3] Each pass message to its successor
2
2
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
36
Architecture Overview
37
[7] Model
Solution Communication
38
Generated Tweets
39
Deep Learning Challenges
— Data Size: RNN doesn't generalize well
on small datasets
— Relevant Corpus: Required to create
domain specific word embedding
— Deeper Networks: Empirically deeper
networks have better accuracy
— Training Time: RNNs take a long time to
learn.
40
Use case: Chat Bots
— Bookings
— Customer Support
— Help Desk Automation
— ...
41
Tools to get started: Software
Python Stack
- Use spacy for NLP preprocessing
- Use gensim for word2vec training
- Start with keras
- Have tensorflow as backend
Use pre-trained models like word2vec for
word embedding and similarly for RNNs
42
Tools to get started: Hardware
Work on GPUs
- Nvidia TitanX (suitable for consumers)
- Tesla K80 (suitable for professionals)
For detailed hardware choices:
http://timdettmers.com/2015/03/09/deep-
learning-hardware-guide/
43
Closing thoughts
44
Reference: Deep Learning for NLP
Notebooks and Material @
https://github.com/rouseguy/
DeepLearningNLP_Py
- What is deep learning?
- Motivation: Some use cases
- Building blocks of Neural Networks (Neuron, Activation Function)
- Backpropagation Algorithm
- Word Embedding
- word2vec
- Introduction to keras
- Multi-layer perceptron
- Convolutional Neural Network
- Recurrent Neural Network
- Challenges in Deep Learning
45
Contact
___
Bargava Subramanian
@bargava
Amit Kapoor
@amitkaps
amitkaps.com
46

Mais conteúdo relacionado

Mais procurados

Deep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ersDeep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ersRoelof Pieters
 
Abigail See - 2017 - Get To The Point: Summarization with Pointer-Generator N...
Abigail See - 2017 - Get To The Point: Summarization with Pointer-Generator N...Abigail See - 2017 - Get To The Point: Summarization with Pointer-Generator N...
Abigail See - 2017 - Get To The Point: Summarization with Pointer-Generator N...Association for Computational Linguistics
 
(Deep) Neural Networks在 NLP 和 Text Mining 总结
(Deep) Neural Networks在 NLP 和 Text Mining 总结(Deep) Neural Networks在 NLP 和 Text Mining 总结
(Deep) Neural Networks在 NLP 和 Text Mining 总结君 廖
 
CMSC 723: Computational Linguistics I
CMSC 723: Computational Linguistics ICMSC 723: Computational Linguistics I
CMSC 723: Computational Linguistics Ibutest
 
Deep Learning Enabled Question Answering System to Automate Corporate Helpdesk
Deep Learning Enabled Question Answering System to Automate Corporate HelpdeskDeep Learning Enabled Question Answering System to Automate Corporate Helpdesk
Deep Learning Enabled Question Answering System to Automate Corporate HelpdeskSaurabh Saxena
 
ADVANCED LSB TECHNIQUE FOR AUDIO STENOGRAPHY
ADVANCED LSB TECHNIQUE FOR AUDIO STENOGRAPHYADVANCED LSB TECHNIQUE FOR AUDIO STENOGRAPHY
ADVANCED LSB TECHNIQUE FOR AUDIO STENOGRAPHYcsandit
 
NLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLPNLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLPAnuj Gupta
 
Talk from NVidia Developer Connect
Talk from NVidia Developer ConnectTalk from NVidia Developer Connect
Talk from NVidia Developer ConnectAnuj Gupta
 
Using Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalUsing Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalBhaskar Mitra
 
Classifying Text using CNN
Classifying Text using CNNClassifying Text using CNN
Classifying Text using CNNSomnath Banerjee
 
Convolutional neural networks for sentiment classification
Convolutional neural networks for sentiment classificationConvolutional neural networks for sentiment classification
Convolutional neural networks for sentiment classificationYunchao He
 
BIng NLP Expert - Dl summer-school-2017.-jianfeng-gao.v2
BIng NLP Expert - Dl summer-school-2017.-jianfeng-gao.v2BIng NLP Expert - Dl summer-school-2017.-jianfeng-gao.v2
BIng NLP Expert - Dl summer-school-2017.-jianfeng-gao.v2Karthik Murugesan
 
Information Retrieval with Deep Learning
Information Retrieval with Deep LearningInformation Retrieval with Deep Learning
Information Retrieval with Deep LearningAdam Gibson
 

Mais procurados (17)

Deep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ersDeep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ers
 
Abigail See - 2017 - Get To The Point: Summarization with Pointer-Generator N...
Abigail See - 2017 - Get To The Point: Summarization with Pointer-Generator N...Abigail See - 2017 - Get To The Point: Summarization with Pointer-Generator N...
Abigail See - 2017 - Get To The Point: Summarization with Pointer-Generator N...
 
[PDF]
[PDF][PDF]
[PDF]
 
(Deep) Neural Networks在 NLP 和 Text Mining 总结
(Deep) Neural Networks在 NLP 和 Text Mining 总结(Deep) Neural Networks在 NLP 和 Text Mining 总结
(Deep) Neural Networks在 NLP 和 Text Mining 总结
 
CMSC 723: Computational Linguistics I
CMSC 723: Computational Linguistics ICMSC 723: Computational Linguistics I
CMSC 723: Computational Linguistics I
 
Deep Learning Enabled Question Answering System to Automate Corporate Helpdesk
Deep Learning Enabled Question Answering System to Automate Corporate HelpdeskDeep Learning Enabled Question Answering System to Automate Corporate Helpdesk
Deep Learning Enabled Question Answering System to Automate Corporate Helpdesk
 
[ppt]
[ppt][ppt]
[ppt]
 
ADVANCED LSB TECHNIQUE FOR AUDIO STENOGRAPHY
ADVANCED LSB TECHNIQUE FOR AUDIO STENOGRAPHYADVANCED LSB TECHNIQUE FOR AUDIO STENOGRAPHY
ADVANCED LSB TECHNIQUE FOR AUDIO STENOGRAPHY
 
NLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLPNLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLP
 
Talk from NVidia Developer Connect
Talk from NVidia Developer ConnectTalk from NVidia Developer Connect
Talk from NVidia Developer Connect
 
Using Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalUsing Text Embeddings for Information Retrieval
Using Text Embeddings for Information Retrieval
 
Audio steganography using r prime rsa and ga based lsb algorithm to enhance s...
Audio steganography using r prime rsa and ga based lsb algorithm to enhance s...Audio steganography using r prime rsa and ga based lsb algorithm to enhance s...
Audio steganography using r prime rsa and ga based lsb algorithm to enhance s...
 
Language models
Language modelsLanguage models
Language models
 
Classifying Text using CNN
Classifying Text using CNNClassifying Text using CNN
Classifying Text using CNN
 
Convolutional neural networks for sentiment classification
Convolutional neural networks for sentiment classificationConvolutional neural networks for sentiment classification
Convolutional neural networks for sentiment classification
 
BIng NLP Expert - Dl summer-school-2017.-jianfeng-gao.v2
BIng NLP Expert - Dl summer-school-2017.-jianfeng-gao.v2BIng NLP Expert - Dl summer-school-2017.-jianfeng-gao.v2
BIng NLP Expert - Dl summer-school-2017.-jianfeng-gao.v2
 
Information Retrieval with Deep Learning
Information Retrieval with Deep LearningInformation Retrieval with Deep Learning
Information Retrieval with Deep Learning
 

Destaque

Python Visualisation for Data Science
Python Visualisation for Data SciencePython Visualisation for Data Science
Python Visualisation for Data ScienceAmit Kapoor
 
Visualising Multi Dimensional Data
Visualising Multi Dimensional DataVisualising Multi Dimensional Data
Visualising Multi Dimensional DataAmit Kapoor
 
Model Visualisation
Model VisualisationModel Visualisation
Model VisualisationAmit Kapoor
 
Learning the Craft of Data Visualisation
Learning the Craft of Data VisualisationLearning the Craft of Data Visualisation
Learning the Craft of Data VisualisationAmit Kapoor
 
Drawing word2vec
Drawing word2vecDrawing word2vec
Drawing word2vecKai Sasaki
 
Tools & Resources for Data Visualisation
Tools & Resources for Data VisualisationTools & Resources for Data Visualisation
Tools & Resources for Data VisualisationAmit Kapoor
 
Word2vec algorithm
Word2vec algorithmWord2vec algorithm
Word2vec algorithmAndrew Koo
 
Data Storytelling: The only way to unlock true insight from your data
Data Storytelling: The only way to unlock true insight from your dataData Storytelling: The only way to unlock true insight from your data
Data Storytelling: The only way to unlock true insight from your dataBright North
 
Storytelling with Data - Approach | Skills
Storytelling with Data - Approach | SkillsStorytelling with Data - Approach | Skills
Storytelling with Data - Approach | SkillsAmit Kapoor
 
Data stories - how to combine the power storytelling with effective data visu...
Data stories - how to combine the power storytelling with effective data visu...Data stories - how to combine the power storytelling with effective data visu...
Data stories - how to combine the power storytelling with effective data visu...Coincidencity
 
The 8 Hats of Data Visualisation
The 8 Hats of Data VisualisationThe 8 Hats of Data Visualisation
The 8 Hats of Data VisualisationAndy Kirk
 
Algorithmic music generation
Algorithmic music generationAlgorithmic music generation
Algorithmic music generationPadmaja Bhagwat
 
Veda Semantics - introduction document
Veda Semantics - introduction documentVeda Semantics - introduction document
Veda Semantics - introduction documentrajatkr
 
Yarlin pilataxi practica hoja de calculo
Yarlin pilataxi practica hoja de calculoYarlin pilataxi practica hoja de calculo
Yarlin pilataxi practica hoja de calculoyarlin04
 
Viktor Sdobnikov - Computer Vision for Advanced Driver Assistance Systems (AD...
Viktor Sdobnikov - Computer Vision for Advanced Driver Assistance Systems (AD...Viktor Sdobnikov - Computer Vision for Advanced Driver Assistance Systems (AD...
Viktor Sdobnikov - Computer Vision for Advanced Driver Assistance Systems (AD...Eastern European Computer Vision Conference
 
【來崇明學聰明】藝文老師怎麼教
 【來崇明學聰明】藝文老師怎麼教 【來崇明學聰明】藝文老師怎麼教
【來崇明學聰明】藝文老師怎麼教Yu Ru Huang
 
Deep Learning in Natural Language Processing
Deep Learning in Natural Language ProcessingDeep Learning in Natural Language Processing
Deep Learning in Natural Language ProcessingDavid Dao
 
W jaki sposób myślimy o świecie społecznym
W jaki sposób myślimy o świecie społecznymW jaki sposób myślimy o świecie społecznym
W jaki sposób myślimy o świecie społecznymPaula Pilarska
 

Destaque (20)

Python Visualisation for Data Science
Python Visualisation for Data SciencePython Visualisation for Data Science
Python Visualisation for Data Science
 
Visualising Multi Dimensional Data
Visualising Multi Dimensional DataVisualising Multi Dimensional Data
Visualising Multi Dimensional Data
 
Model Visualisation
Model VisualisationModel Visualisation
Model Visualisation
 
Learning the Craft of Data Visualisation
Learning the Craft of Data VisualisationLearning the Craft of Data Visualisation
Learning the Craft of Data Visualisation
 
Drawing word2vec
Drawing word2vecDrawing word2vec
Drawing word2vec
 
Tools & Resources for Data Visualisation
Tools & Resources for Data VisualisationTools & Resources for Data Visualisation
Tools & Resources for Data Visualisation
 
Word2vec algorithm
Word2vec algorithmWord2vec algorithm
Word2vec algorithm
 
Data Storytelling: The only way to unlock true insight from your data
Data Storytelling: The only way to unlock true insight from your dataData Storytelling: The only way to unlock true insight from your data
Data Storytelling: The only way to unlock true insight from your data
 
Storytelling with Data - Approach | Skills
Storytelling with Data - Approach | SkillsStorytelling with Data - Approach | Skills
Storytelling with Data - Approach | Skills
 
Data stories - how to combine the power storytelling with effective data visu...
Data stories - how to combine the power storytelling with effective data visu...Data stories - how to combine the power storytelling with effective data visu...
Data stories - how to combine the power storytelling with effective data visu...
 
The 8 Hats of Data Visualisation
The 8 Hats of Data VisualisationThe 8 Hats of Data Visualisation
The 8 Hats of Data Visualisation
 
Algorithmic music generation
Algorithmic music generationAlgorithmic music generation
Algorithmic music generation
 
Veda Semantics - introduction document
Veda Semantics - introduction documentVeda Semantics - introduction document
Veda Semantics - introduction document
 
Ejercicio3
Ejercicio3Ejercicio3
Ejercicio3
 
Yarlin pilataxi practica hoja de calculo
Yarlin pilataxi practica hoja de calculoYarlin pilataxi practica hoja de calculo
Yarlin pilataxi practica hoja de calculo
 
Viktor Sdobnikov - Computer Vision for Advanced Driver Assistance Systems (AD...
Viktor Sdobnikov - Computer Vision for Advanced Driver Assistance Systems (AD...Viktor Sdobnikov - Computer Vision for Advanced Driver Assistance Systems (AD...
Viktor Sdobnikov - Computer Vision for Advanced Driver Assistance Systems (AD...
 
【來崇明學聰明】藝文老師怎麼教
 【來崇明學聰明】藝文老師怎麼教 【來崇明學聰明】藝文老師怎麼教
【來崇明學聰明】藝文老師怎麼教
 
Deep Learning in Natural Language Processing
Deep Learning in Natural Language ProcessingDeep Learning in Natural Language Processing
Deep Learning in Natural Language Processing
 
cv
cvcv
cv
 
W jaki sposób myślimy o świecie społecznym
W jaki sposób myślimy o świecie społecznymW jaki sposób myślimy o świecie społecznym
W jaki sposób myślimy o świecie społecznym
 

Semelhante a Deep Learning for NLP

Text classification in scikit-learn
Text classification in scikit-learnText classification in scikit-learn
Text classification in scikit-learnJimmy Lai
 
Representation Learning of Text for NLP
Representation Learning of Text for NLPRepresentation Learning of Text for NLP
Representation Learning of Text for NLPAnuj Gupta
 
Get Your Hands Dirty with Intel® Distribution for Python*
Get Your Hands Dirty with Intel® Distribution for Python*Get Your Hands Dirty with Intel® Distribution for Python*
Get Your Hands Dirty with Intel® Distribution for Python*Intel® Software
 
Introduction to Text Mining
Introduction to Text MiningIntroduction to Text Mining
Introduction to Text MiningMinha Hwang
 
Big Data Analytics (ML, DL, AI) hands-on
Big Data Analytics (ML, DL, AI) hands-onBig Data Analytics (ML, DL, AI) hands-on
Big Data Analytics (ML, DL, AI) hands-onDony Riyanto
 
CS3114_09212011.ppt
CS3114_09212011.pptCS3114_09212011.ppt
CS3114_09212011.pptArumugam90
 
Dataiku hadoop summit - semi-supervised learning with hadoop for understand...
Dataiku   hadoop summit - semi-supervised learning with hadoop for understand...Dataiku   hadoop summit - semi-supervised learning with hadoop for understand...
Dataiku hadoop summit - semi-supervised learning with hadoop for understand...Dataiku
 
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...Data Works MD
 
Data Science Accelerator Program
Data Science Accelerator ProgramData Science Accelerator Program
Data Science Accelerator ProgramGoDataDriven
 
Dice.com Bay Area Search - Beyond Learning to Rank Talk
Dice.com Bay Area Search - Beyond Learning to Rank TalkDice.com Bay Area Search - Beyond Learning to Rank Talk
Dice.com Bay Area Search - Beyond Learning to Rank TalkSimon Hughes
 
RDataMining slides-text-mining-with-r
RDataMining slides-text-mining-with-rRDataMining slides-text-mining-with-r
RDataMining slides-text-mining-with-rYanchang Zhao
 
Section1 compound data class
Section1 compound data classSection1 compound data class
Section1 compound data classDương Tùng
 
Cassandra Summit 2014: Social Media Security Company Nexgate Relies on Cassan...
Cassandra Summit 2014: Social Media Security Company Nexgate Relies on Cassan...Cassandra Summit 2014: Social Media Security Company Nexgate Relies on Cassan...
Cassandra Summit 2014: Social Media Security Company Nexgate Relies on Cassan...DataStax Academy
 
Attention-based Models (DLAI D8L 2017 UPC Deep Learning for Artificial Intell...
Attention-based Models (DLAI D8L 2017 UPC Deep Learning for Artificial Intell...Attention-based Models (DLAI D8L 2017 UPC Deep Learning for Artificial Intell...
Attention-based Models (DLAI D8L 2017 UPC Deep Learning for Artificial Intell...Universitat Politècnica de Catalunya
 
""Into the Wild" ... with Natural Language Processing and Text Classification...
""Into the Wild" ... with Natural Language Processing and Text Classification...""Into the Wild" ... with Natural Language Processing and Text Classification...
""Into the Wild" ... with Natural Language Processing and Text Classification...Dataconomy Media
 
Into the Wild - wilth Natural Language Processing and Text Classification - D...
Into the Wild - wilth Natural Language Processing and Text Classification - D...Into the Wild - wilth Natural Language Processing and Text Classification - D...
Into the Wild - wilth Natural Language Processing and Text Classification - D...Peter Grosskopf
 

Semelhante a Deep Learning for NLP (20)

Text classification in scikit-learn
Text classification in scikit-learnText classification in scikit-learn
Text classification in scikit-learn
 
Representation Learning of Text for NLP
Representation Learning of Text for NLPRepresentation Learning of Text for NLP
Representation Learning of Text for NLP
 
Get Your Hands Dirty with Intel® Distribution for Python*
Get Your Hands Dirty with Intel® Distribution for Python*Get Your Hands Dirty with Intel® Distribution for Python*
Get Your Hands Dirty with Intel® Distribution for Python*
 
Introduction to Text Mining
Introduction to Text MiningIntroduction to Text Mining
Introduction to Text Mining
 
Deep learning for NLP
Deep learning for NLPDeep learning for NLP
Deep learning for NLP
 
Big Data Analytics (ML, DL, AI) hands-on
Big Data Analytics (ML, DL, AI) hands-onBig Data Analytics (ML, DL, AI) hands-on
Big Data Analytics (ML, DL, AI) hands-on
 
Ranking Twitter Conversations
Ranking Twitter ConversationsRanking Twitter Conversations
Ranking Twitter Conversations
 
CS3114_09212011.ppt
CS3114_09212011.pptCS3114_09212011.ppt
CS3114_09212011.ppt
 
Dataiku hadoop summit - semi-supervised learning with hadoop for understand...
Dataiku   hadoop summit - semi-supervised learning with hadoop for understand...Dataiku   hadoop summit - semi-supervised learning with hadoop for understand...
Dataiku hadoop summit - semi-supervised learning with hadoop for understand...
 
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
 
Data Science Accelerator Program
Data Science Accelerator ProgramData Science Accelerator Program
Data Science Accelerator Program
 
Dice.com Bay Area Search - Beyond Learning to Rank Talk
Dice.com Bay Area Search - Beyond Learning to Rank TalkDice.com Bay Area Search - Beyond Learning to Rank Talk
Dice.com Bay Area Search - Beyond Learning to Rank Talk
 
RDataMining slides-text-mining-with-r
RDataMining slides-text-mining-with-rRDataMining slides-text-mining-with-r
RDataMining slides-text-mining-with-r
 
Section1 compound data class
Section1 compound data classSection1 compound data class
Section1 compound data class
 
Ire major project
Ire major projectIre major project
Ire major project
 
Cassandra Summit 2014: Social Media Security Company Nexgate Relies on Cassan...
Cassandra Summit 2014: Social Media Security Company Nexgate Relies on Cassan...Cassandra Summit 2014: Social Media Security Company Nexgate Relies on Cassan...
Cassandra Summit 2014: Social Media Security Company Nexgate Relies on Cassan...
 
Attention-based Models (DLAI D8L 2017 UPC Deep Learning for Artificial Intell...
Attention-based Models (DLAI D8L 2017 UPC Deep Learning for Artificial Intell...Attention-based Models (DLAI D8L 2017 UPC Deep Learning for Artificial Intell...
Attention-based Models (DLAI D8L 2017 UPC Deep Learning for Artificial Intell...
 
Raising the Bar
Raising the BarRaising the Bar
Raising the Bar
 
""Into the Wild" ... with Natural Language Processing and Text Classification...
""Into the Wild" ... with Natural Language Processing and Text Classification...""Into the Wild" ... with Natural Language Processing and Text Classification...
""Into the Wild" ... with Natural Language Processing and Text Classification...
 
Into the Wild - wilth Natural Language Processing and Text Classification - D...
Into the Wild - wilth Natural Language Processing and Text Classification - D...Into the Wild - wilth Natural Language Processing and Text Classification - D...
Into the Wild - wilth Natural Language Processing and Text Classification - D...
 

Mais de Amit Kapoor

The Power of Ensembles in Machine Learning
The Power of Ensembles in Machine LearningThe Power of Ensembles in Machine Learning
The Power of Ensembles in Machine LearningAmit Kapoor
 
Visualising Big Data
Visualising Big DataVisualising Big Data
Visualising Big DataAmit Kapoor
 
Fifth Elephant 2014 talk - Crafting Visual Stories with Data
Fifth Elephant 2014 talk - Crafting Visual Stories with DataFifth Elephant 2014 talk - Crafting Visual Stories with Data
Fifth Elephant 2014 talk - Crafting Visual Stories with DataAmit Kapoor
 
Storytelling with Data - See | Show | Tell | Engage
Storytelling with Data - See | Show | Tell | EngageStorytelling with Data - See | Show | Tell | Engage
Storytelling with Data - See | Show | Tell | EngageAmit Kapoor
 
Crafting Visual Stories with Data
Crafting Visual Stories with DataCrafting Visual Stories with Data
Crafting Visual Stories with DataAmit Kapoor
 
Business Process Improvement - A Strategic and Supply Chain Perspective
Business Process Improvement - A Strategic and Supply Chain Perspective Business Process Improvement - A Strategic and Supply Chain Perspective
Business Process Improvement - A Strategic and Supply Chain Perspective Amit Kapoor
 
What makes a data-story work?
What makes a data-story work?What makes a data-story work?
What makes a data-story work?Amit Kapoor
 
What is Strategy - Thinking like a Strategist
What is Strategy - Thinking like a StrategistWhat is Strategy - Thinking like a Strategist
What is Strategy - Thinking like a StrategistAmit Kapoor
 
Telling Stories with Data - Using Story Spine
Telling Stories with Data - Using Story SpineTelling Stories with Data - Using Story Spine
Telling Stories with Data - Using Story SpineAmit Kapoor
 
Story Structure and Modern Storytelling
Story Structure and Modern StorytellingStory Structure and Modern Storytelling
Story Structure and Modern StorytellingAmit Kapoor
 
Targeting the Moment of Truth - Using Big Data in Retail
Targeting the Moment of Truth - Using Big Data in RetailTargeting the Moment of Truth - Using Big Data in Retail
Targeting the Moment of Truth - Using Big Data in RetailAmit Kapoor
 
Storytelling - Gutenberg
Storytelling - GutenbergStorytelling - Gutenberg
Storytelling - GutenbergAmit Kapoor
 
Analytics in Consulting
Analytics in ConsultingAnalytics in Consulting
Analytics in ConsultingAmit Kapoor
 
Retail Pricing Perspective
Retail Pricing PerspectiveRetail Pricing Perspective
Retail Pricing PerspectiveAmit Kapoor
 

Mais de Amit Kapoor (14)

The Power of Ensembles in Machine Learning
The Power of Ensembles in Machine LearningThe Power of Ensembles in Machine Learning
The Power of Ensembles in Machine Learning
 
Visualising Big Data
Visualising Big DataVisualising Big Data
Visualising Big Data
 
Fifth Elephant 2014 talk - Crafting Visual Stories with Data
Fifth Elephant 2014 talk - Crafting Visual Stories with DataFifth Elephant 2014 talk - Crafting Visual Stories with Data
Fifth Elephant 2014 talk - Crafting Visual Stories with Data
 
Storytelling with Data - See | Show | Tell | Engage
Storytelling with Data - See | Show | Tell | EngageStorytelling with Data - See | Show | Tell | Engage
Storytelling with Data - See | Show | Tell | Engage
 
Crafting Visual Stories with Data
Crafting Visual Stories with DataCrafting Visual Stories with Data
Crafting Visual Stories with Data
 
Business Process Improvement - A Strategic and Supply Chain Perspective
Business Process Improvement - A Strategic and Supply Chain Perspective Business Process Improvement - A Strategic and Supply Chain Perspective
Business Process Improvement - A Strategic and Supply Chain Perspective
 
What makes a data-story work?
What makes a data-story work?What makes a data-story work?
What makes a data-story work?
 
What is Strategy - Thinking like a Strategist
What is Strategy - Thinking like a StrategistWhat is Strategy - Thinking like a Strategist
What is Strategy - Thinking like a Strategist
 
Telling Stories with Data - Using Story Spine
Telling Stories with Data - Using Story SpineTelling Stories with Data - Using Story Spine
Telling Stories with Data - Using Story Spine
 
Story Structure and Modern Storytelling
Story Structure and Modern StorytellingStory Structure and Modern Storytelling
Story Structure and Modern Storytelling
 
Targeting the Moment of Truth - Using Big Data in Retail
Targeting the Moment of Truth - Using Big Data in RetailTargeting the Moment of Truth - Using Big Data in Retail
Targeting the Moment of Truth - Using Big Data in Retail
 
Storytelling - Gutenberg
Storytelling - GutenbergStorytelling - Gutenberg
Storytelling - Gutenberg
 
Analytics in Consulting
Analytics in ConsultingAnalytics in Consulting
Analytics in Consulting
 
Retail Pricing Perspective
Retail Pricing PerspectiveRetail Pricing Perspective
Retail Pricing Perspective
 

Último

Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...amitlee9823
 
Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...gajnagarg
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...Elaine Werffeli
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...gajnagarg
 
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...gajnagarg
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann
 
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...gajnagarg
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Pooja Nehwal
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night StandCall Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 

Último (20)

Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
 
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night StandCall Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
 

Deep Learning for NLP

  • 1. Deep Learning for Natural Language Processing        Bargava Subramanian @bargava Amit Kapoor @amitkaps 1
  • 3. Put these adjectives in order: [adj.] + [Knife] — old — French — lovely — green — rectangular — whittling — silver — little 3
  • 4. Which order is correct? lovely old silver rectangular green little French whittling knife old lovely French rectangular green little whittling silver knife lovely little old rectangular green French silver whittling knife 4
  • 5. Grammar has rules opinion - size - age - shape - colour - origin -material - purpose [Noun] The right version: lovely little old rectangular green French silver whittling knife 5
  • 6. We speak the grammar, yet we don't know it 6
  • 8. Natural Language Programming Problems — Summarization — Text Classification (e.g. spam) — Sentiment / Emotion Analysis — Topic Modelling — Recommendations — Text Evaluation (e.g. grading) 8
  • 9. Plan for this Session — Moving beyond Statistical Learning — Take first steps in NLP with Deep Learning — Showcase an example — Practical challenges to overcome 9
  • 10. NLP Learning Process ___ [1] Frame: Problem definition [2] Acquire: Text ingestion [3] Refine: Text wrangling [4] Transform: Feature creation [5] Explore: Feature selection [6] Model: Model selection [7] Insight: Solution communication 10
  • 12. Demonetisation in India On Nov 8th, 2016, the National Government announced that existing INR 1000 and INR 500 notes are no longer legal. 12
  • 13. 13
  • 14. Reactions on Twitter People started tweeting with the tag: #demonetisation 14
  • 15. [1] Frame Create a viral tweet on #demonetisation 15
  • 16. Traditional way of framing 1. Someone has to write a tweet. 2. Run it on the classifier 3. If probability is high, post it. 4. Else, goto step 1 The prediction will be a probability of a new tweet to go viral or not? 16
  • 17. Generating tweets — Can we learn from historical tweets algorithmically to generate a viral tweet? — Not possible to do using traditional methods 17
  • 18. Revised framing for Text Generation Generate a tweet algorithmically, that is likely to go viral 18
  • 19. [2] Acquire Get the raw tweets data 19
  • 20. Get Tweets on #demonetisation Write your own twitter api client to get json file or use a python package like Tweepy, but need to manage rate limiting etc. We used tweezer - an open source project to get twitter data Raw dataset - 30,000+ tweets from past 1 week. 20
  • 21. [3] Refine How to categorise a tweet as viral or not? 21
  • 22. Simple Approach for Labelling IF retweets + favourites > = 100 THEN Label = viral ELSE Label = normal 22
  • 23. Sanitizing Tweets — Stopword — Stemming — Remove urls — Remove 'RT' — Remove 'n' 23
  • 25. Traditional methods to covert text to numeric — TF-IDF: Measures importance of a word in a document relative to the corpus — Bag-of-Word: Count of occurrences of a word in a document — n-grams: Count of every 1-word, 2-word, etc combinations in a document — entity & POS tagging: Transform sentence to parts-of-speech, extract entities and encode 25
  • 26. Challenges in traditional methods of encoding — Sparse inputs — Input data space explodes — Context lost in encoding A quiet crowd entered the historic church != A historic crowd entered the quiet church 26
  • 27. Deep Learning Approach Low-dimensional dense vectors for representation. — Tokenise characters (Faster) — Tokenise words (More accurate, but needs more memory) 27
  • 28. Word Embedding — Learn high-quality word vectors — Similar words needs to be close to each other — Words can have multiple degrees of similarity 28
  • 29. Word Embedding using word2vec Combines two approaches — skip-gram: Predicting word given its context — continuous bag-of-words: Predicting context given a word 29
  • 30. word2vec: Example vec[queen] − vec[king] = vec[woman] − vec[man] 1 1 https://www.tensorflow.org/versions/r0.12/tutorials/word2vec/index.html 30
  • 32. Feature Selection — Manual process in Traditional Approach — Feature selection happens automatically in Deep Learning 32
  • 34. Recurrent Neural Network (RNN) — Network with loops — Allows information to persist — Enables connecting previous information to present task — Context preserved I grew up in Brazil and I speak ______________.                                                         portuguese 34
  • 35. Unrolling over Time ____ [1] Think sequences - in input & output      - Recognize Image -> Explain in words - Sentence(s) -> Sentiment Analysis - English - Spanish Translation - Video - task classification 35
  • 36. Unrolled RNN [2] Multiple copies of the same network [3] Each pass message to its successor 2 2 http://colah.github.io/posts/2015-08-Understanding-LSTMs/ 36
  • 40. Deep Learning Challenges — Data Size: RNN doesn't generalize well on small datasets — Relevant Corpus: Required to create domain specific word embedding — Deeper Networks: Empirically deeper networks have better accuracy — Training Time: RNNs take a long time to learn. 40
  • 41. Use case: Chat Bots — Bookings — Customer Support — Help Desk Automation — ... 41
  • 42. Tools to get started: Software Python Stack - Use spacy for NLP preprocessing - Use gensim for word2vec training - Start with keras - Have tensorflow as backend Use pre-trained models like word2vec for word embedding and similarly for RNNs 42
  • 43. Tools to get started: Hardware Work on GPUs - Nvidia TitanX (suitable for consumers) - Tesla K80 (suitable for professionals) For detailed hardware choices: http://timdettmers.com/2015/03/09/deep- learning-hardware-guide/ 43
  • 45. Reference: Deep Learning for NLP Notebooks and Material @ https://github.com/rouseguy/ DeepLearningNLP_Py - What is deep learning? - Motivation: Some use cases - Building blocks of Neural Networks (Neuron, Activation Function) - Backpropagation Algorithm - Word Embedding - word2vec - Introduction to keras - Multi-layer perceptron - Convolutional Neural Network - Recurrent Neural Network - Challenges in Deep Learning 45