NLP with Deep Learning Guest Lecture slides by Fatih Mehmet Güler, PragmaCraft. Covers my background on the subject, our projects, the stages of the NLP pipeline, and the latest developments.
22. SRL with LSTM Paper
• End-to-end Learning of Semantic Role Labeling Using Recurrent Neural Networks
• Jie Zhou and Wei Xu, 2015 (Baidu Research)
23. Word Vectors
• Word2Vec (see the sketch after this list)
• CBOW: predicts the word from its context
• several times faster to train than skip-gram; slightly better accuracy for frequent words
• Skip-Gram: predicts the context from the word
• works well with small amounts of training data; represents even rare words and phrases well
• GloVe: a count-based model that learns vectors by essentially doing dimensionality reduction on the co-occurrence count matrix
• ELMo
• BERT
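The CBOW/skip-gram split above is just one flag in common toolkits. A minimal sketch using the gensim library (not mentioned on the slides; the toy corpus and hyperparameters are purely illustrative):

```python
from gensim.models import Word2Vec

# Tiny illustrative corpus; real training needs millions of sentences.
sentences = [
    ["the", "cat", "sat", "on", "the", "mat"],
    ["the", "dog", "lay", "on", "the", "rug"],
]

# sg=0 selects CBOW (predict the word from its context);
# sg=1 selects skip-gram (predict the context from the word).
cbow = Word2Vec(sentences, vector_size=50, window=2, min_count=1, sg=0)
skipgram = Word2Vec(sentences, vector_size=50, window=2, min_count=1, sg=1)

print(cbow.wv["cat"][:5])               # a static 50-dim vector for "cat"
print(skipgram.wv.most_similar("cat"))  # nearest neighbors in vector space
```

Note that both variants produce one fixed vector per word; that limitation is exactly what ELMo and BERT address below.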
26. ELMo
• Bidirectional LSTM language model
• Dynamic word embeddings
• The embedding changes according to the context (illustrated in the sketch below)
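To make "the embedding changes according to the context" concrete, here is a toy PyTorch sketch. This is not ELMo's real architecture (which uses character convolutions and two large biLSTM layers trained as a language model); it only shows how a bidirectional LSTM gives the same word a different vector in different sentences:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical toy vocabulary, for illustration only.
vocab = {"<pad>": 0, "the": 1, "bank": 2, "river": 3, "money": 4}
embed = nn.Embedding(len(vocab), 8)
bilstm = nn.LSTM(input_size=8, hidden_size=8, bidirectional=True, batch_first=True)

def contextual(tokens):
    ids = torch.tensor([[vocab[t] for t in tokens]])
    out, _ = bilstm(embed(ids))   # shape (1, seq_len, 16): context-dependent vectors
    return out[0]

a = contextual(["the", "river", "bank"])[2]   # "bank" next to "river"
b = contextual(["the", "money", "bank"])[2]   # "bank" next to "money"
print(torch.cosine_similarity(a, b, dim=0))   # below 1.0: same word, different vectors
```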
27. BERT
• Replaces language modeling with “masked language modeling”
• Words in a sentence are randomly erased and replaced with a special [MASK] token with a small probability (15%)
• A Transformer then predicts each masked word from the unmasked words surrounding it, both to the left and to the right (a sketch of the masking step follows)
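A sketch of the masking step in plain Python (BERT's full recipe also leaves 10% of the selected tokens unchanged and swaps another 10% for random words, omitted here for brevity):

```python
import random

random.seed(0)

def mask_tokens(tokens, mask_prob=0.15, mask_token="[MASK]"):
    """Replace roughly 15% of tokens with [MASK]; the model must
    recover the originals using context from both directions."""
    masked, targets = [], []
    for tok in tokens:
        if random.random() < mask_prob:
            masked.append(mask_token)
            targets.append(tok)      # prediction target at this position
        else:
            masked.append(tok)
            targets.append(None)     # no loss computed at this position
    return masked, targets

print(mask_tokens("the quick brown fox jumps over the lazy dog".split()))
```

With the Hugging Face transformers library, a pre-trained BERT can be queried the same way via pipeline("fill-mask", model="bert-base-uncased").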
30. Practical Applications
• Frameworks
• PyTorch
• TensorFlow
• Keras
• Higher level
• AllenNLP
• spaCy (quick example after this list)
• Flair, PyText, torchtext
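As a taste of the higher-level libraries, a short spaCy sketch (assumes the small English model has been installed with python -m spacy download en_core_web_sm):

```python
import spacy

nlp = spacy.load("en_core_web_sm")   # tokenizer, tagger, parser, NER in one pipeline
doc = nlp("Baidu Research published an SRL paper in 2015.")

for token in doc:
    print(token.text, token.pos_, token.dep_)   # part-of-speech tags and dependency labels
for ent in doc.ents:
    print(ent.text, ent.label_)                 # named entities, e.g. ORG and DATE
```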
• Problems
• Unknown words: subword units such as Byte Pair Encoding and SentencePiece (sketch below)
• LSTMs degrade on long sequences (long-range dependencies are hard to carry)
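Byte Pair Encoding builds a subword vocabulary by repeatedly merging the most frequent adjacent symbol pair, so an unknown word decomposes into known pieces. A sketch following the classic Sennrich et al. (2016) procedure (the word counts are illustrative):

```python
import re
from collections import Counter

def get_stats(vocab):
    """Count how often each adjacent symbol pair occurs."""
    pairs = Counter()
    for word, freq in vocab.items():
        symbols = word.split()
        for a, b in zip(symbols, symbols[1:]):
            pairs[a, b] += freq
    return pairs

def merge_vocab(pair, vocab):
    """Fuse the chosen pair into one symbol throughout the vocabulary."""
    bigram = re.escape(" ".join(pair))
    pattern = re.compile(r"(?<!\S)" + bigram + r"(?!\S)")
    return {pattern.sub("".join(pair), word): freq for word, freq in vocab.items()}

# Words pre-split into characters, with an end-of-word marker.
vocab = {"l o w </w>": 5, "l o w e r </w>": 2,
         "n e w e s t </w>": 6, "w i d e s t </w>": 3}
for _ in range(5):
    pairs = get_stats(vocab)
    best = max(pairs, key=pairs.get)
    vocab = merge_vocab(best, vocab)
    print(best)   # learned merges, e.g. ('e', 's'), then ('es', 't'), ...
```

SentencePiece packages the same idea (plus a unigram LM variant) as a library that works directly on raw text without pre-tokenization.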
31. What’s Next?
• More Variants of ELMo/BERT - Transfer Learning
• More NLP Applications - Embeddings all the way
• My Unsolicited Advice :)
• deeplearning.ai (course 5 - sequence models)
• read lots of papers (http://arxiv-sanity.com)
• Twitter & Facebook (!)
• Andrew Ng, Yann LeCun, Andrej Karpathy