SlideShare uma empresa Scribd logo
1 de 78
Lecture 1:
Introduction
Kai-Wei Chang
CS @ UCLA
kw@kwchang.net
Couse webpage: https://uclanlp.github.io/CS269-17/
1
ML in NLP
Announcements
 Waiting list: Start attending the first few lectures
as if you are registered. Given that some students
will drop the class, some space will free up.
 We will use Piazza as an online discussion
platform. Please sign up here:
piazza.com/ucla/fall2017/cs269
ML in NLP 2
Staff
 Instructor: Kai-Wei Chang
 Email: ml17@kwchang.net
 Office: BH 3732J
 Office hour: 4:00 – 5:00, Tue (after class).
 TA: Md Rizwan Parvez
 Email: wua4nw@virginia.edu
 Office: BH 3809
 Office hour: 12:00 – 2:00, Wed
3
ML in NLP
This lecture
 Course Overview
 What is NLP? Why it is important?
 What types of ML methods used in NLP?
 What will you learn from this course?
 Course Information
 What are the challenges?
 Key NLP components
 Key ML ideas in NLP
ML in NLP 4
What is NLP
 Wiki: Natural language processing (NLP) is
a field of computer science, artificial
intelligence, and computational linguistics
concerned with the interactions between
computers and human (natural) languages.
ML in NLP 5
Go beyond the keyword matching
 Identify the structure and meaning of
words, sentences, texts and conversations
 Deep understanding of broad language
 NLP is all around us
ML in NLP 6
Machine translation
ML in NLP 7
Facebook translation, image credit: Meedan.org
Statistical machine translation
ML in NLP 8
Image credit: Julia Hockenmaier, Intro to NLP
Dialog Systems
ML in NLP 9
Sentiment/Opinion Analysis
ML in NLP 10
Text Classification
 Other applications?
ML in NLP 11
www.wired.com
Question answering
ML in NLP 12
credit: ifunny.com
'Watson' computer wins at 'Jeopardy'
Question answering
 Go beyond search
ML in NLP 13
Natural language instruction
ML in NLP 14
https://youtu.be/KkOCeAtKHIc?t=1m28s
Digital personal assistant
 Semantic parsing – understand tasks
 Entity linking – “my wife” = “Kellie” in the phone
book
ML in NLP 15
credit: techspot.com
More on natural language instruction
Information Extraction
 Unstructured text to database entries
ML in NLP 16
Yoav Artzi: Natural language processing
Language Comprehension
 Q: who wrote Winnie the Pooh?
 Q: where is Chris lived?
ML in NLP 17
Christopher Robin is alive and well. He is the same
person that you read about in the book, Winnie the Pooh.
As a boy, Chris lived in a pretty home called Cotchfield
Farm. When Chris was three years old, his father wrote
a poem about him. The poem was printed in a magazine
for others to read. Mr. Robin then wrote a book
What will you learn from this course
 The NLP Pipeline
 Key components for
understanding text
 NLP systems/applications
 Current techniques & limitation
 Build realistic NLP tools
ML in NLP 18
What’s not covered by this course
 Speech recognition – no signal processing
 Natural language generation
 Details of ML algorithms / theory
 Text mining / information retrieval
ML in NLP 19
This lecture
 Course Overview
 What is NLP? Why it is important?
 What will you learn from this course?
 Course Information
 What are the challenges?
 Key NLP components
ML in NLP 20
Overview
 New course, first time being offered
 Comments are welcomed
 target at first- or second- year PhD students
 Lecture + Seminar
 No course prerequisites, but I assume
 programming experience (for the final project)
 basic ML/AI background
 basics of probability calculus, and linear
algebra (HW0)
ML in NLP 21
Grading
 Attendance & participations (10%)
 Participate in discussion
 Paper summarization report (20%)
 Paper presentation (30%)
 Final project (40%)
 Proposal (5%)
 Final Paper (25%)
 Presentation (10%)
ML in NLP 22
Paper summarization
 1 page maximum
 Pick one paper from recent
ACL/NAACL/EMNLP/EACL
 Summarize the paper (use you own words)
 Write a blog post using markdown or jupyter
notebook:
 https://einstein.ai/research/learned-in-
translation-contextualized-word-vectors
 https://github.com/uclanlp/reducingbias/blob/ma
ster/src/fairCRF_gender_ratio.ipynb
ML in NLP 23
ML in NLP 24
ML in NLP 25
Paper presentation
 Each group has 2~3 students
 Read and understand 2~3 related papers
 Cannot be the same as your paper summary
 Can be related to your final project
 Register your choice next week
 30 min presentation/ Q&A
 Grading Rubric:
40% technical understanding, 40%
presentation, 20% interaction
ML in NLP 26
Final Project
 Work in groups (3 students)
 Project proposal
 1 page maximum (template)
 Project report
 Similar to the paper summary
 Due before the final presentation
 Project presentation
 in-class presentation (tentative)
ML in NLP 27
Late Policy
 Submission site will be closed 1hr after the
deadline.
 No late submission
 unless under emergency situation
ML in NLP 28
Cheating/Plagiarism
 No. Ask if you have concerns
 Rules of thumb:
 Cite your references
 Clearly state what are your contributions
ML in NLP 29
Lectures and office hours
 Participation is highly appreciated!
 Ask questions if you are still confusing
 Feedbacks are welcomed
 Lead the discussion in this class
 Enroll Piazza
ML in NLP 30
Topics of this class
 Fundamental NLP problems
 Machine learning & statistical approaches
for NLP
 NLP applications
 Recent trends in NLP
ML in NLP 31
What to Read?
 Natural Language Processing
ACL, NAACL, EACL, EMNLP, CoNLL, Coling, TACL
aclweb.org/anthology
 Machine learning
ICML, NIPS, ECML, AISTATS, ICLR, JMLR, MLJ
 Artificial Intelligence
AAAI, IJCAI, UAI, JAIR
ML in NLP 32
Questions?
ML in NLP 33
This lecture
 Course Overview
 What is NLP? Why it is important?
 What will you learn from this course?
 Course Information
 What are the challenges?
 Key NLP components
 Key ML ideas in NLP
ML in NLP 34
Challenges – ambiguity
 Word sense ambiguity
ML in NLP 35
Challenges – ambiguity
 Word sense / meaning ambiguity
ML in NLP 36
Credit: http://stuffsirisaid.com
Challenges – ambiguity
 PP attachment ambiguity
ML in NLP 37
Credit: Mark Liberman, http://languagelog.ldc.upenn.edu/nll/?p=17711
Challenges -- ambiguity
 Ambiguous headlines:
 Include your children when baking cookies
 Local High School Dropouts Cut in Half
 Hospitals are Sued by 7 Foot Doctors
 Iraqi Head Seeks Arms
 Safety Experts Say School Bus Passengers
Should Be Belted
 Teacher Strikes Idle Kids
ML in NLP 38
Challenges – ambiguity
 Pronoun reference ambiguity
ML in NLP 39
Credit: http://www.printwand.com/blog/8-catastrophic-examples-of-word-choice-mistakes
Challenges – language is not static
 Language grows and changes
 e.g., cyber lingo
ML in NLP 40
LOL Laugh out loud
G2G Got to go
BFN Bye for now
B4N Bye for now
Idk I don’t know
FWIW For what it’s worth
LUWAMH Love you with all my heart
Challenges--language is compositional
ML in NLP 41
Carefully
Slide
Challenges--language is compositional
ML in NLP 42
小心:
Carefully
Careful
Take
Care
Caution
地滑:
Slide
Landslip
Wet Floor
Smooth
Challenges – scale
 Examples:
 Bible (King James version): ~700K
 Penn Tree bank ~1M from Wall street journal
 Newswire collection: 500M+
 Wikipedia: 2.9 billion word (English)
 Web: several billions of words
ML in NLP 43
This lecture
 Course Overview
 What is NLP? Why it is important?
 What will you learn from this course?
 Course Information
 What are the challenges?
 Key NLP components
 Key ML ideas in NLP
ML in NLP 44
Part of speech tagging
ML in NLP 45
Syntactic (Constituency) parsing
ML in NLP 46
Syntactic structure => meaning
ML in NLP 47
Image credit: Julia Hockenmaier, Intro to NLP
Dependency Parsing
ML in NLP 48
Semantic analysis
 Word sense disambiguation
 Semantic role labeling
ML in NLP 49
Credit: Ivan Titov
Christopher Robin is alive and well. He is the
same person that you read about in the book,
Winnie the Pooh. As a boy, Chris lived in a
pretty home called Cotchfield Farm. When
Chris was three years old, his father wrote a
poem about him. The poem was printed in a
magazine for others to read. Mr. Robin then
wrote a book
50
Q: [Chris] = [Mr. Robin] ?
Slide modified from Dan Roth
ML in NLP
Christopher Robin is alive and well. He is the
same person that you read about in the book,
Winnie the Pooh. As a boy, Chris lived in a
pretty home called Cotchfield Farm. When
Chris was three years old, his father wrote a
poem about him. The poem was printed in a
magazine for others to read. Mr. Robin then
wrote a book
51
Co-reference Resolution
ML in NLP
This lecture
 Course Overview
 What is NLP? Why it is important?
 What will you learn from this course?
 Course Information
 What are the challenges?
 Key NLP components
 Key ML ideas in NLP
ML in NLP 52
Machine learning 101
CS6501- Advanced Machine Learning 53
CS6501- Advanced Machine Learning 54
CS6501- Advanced Machine Learning 55
Perceptron, decision tree, support vector machine
K-NN, Naïve Bayes, logistic regression….
Classification is generally well-understood
 Theoretically: generalization bound
 # examples to train a good model
 Algorithmically:
 Efficient algorithm for large data set
 E.g., take a few second to train a linear SVM on
data with millions instances and features
 Algorithms for non-linear model
 E.g., Kernel methods
CS6501- Advanced Machine Learning 56
Is this enough to solve all real-world problems?
Reading Comprehension
CS6501- Advanced Machine Learning 57
Challenges
Modeling challenges
How to model a complex decision?
Representation challenges
How to extract features?
Algorithmic challenges
 Large amount of data and complex decision structure
58
Bill Clinton, recently elected as the President of
the USA, has been invited by the Russian
President], [Vladimir Putin, to visit Russia.
President Clinton said that he looks forward to
strengthening ties between USA and Russia
Algorithm 2 is shown to perform
better Berg-Kirkpatrick, ACL
2010. It can also be expected to
converge faster -- anyway, the E-
step changes the auxiliary
function by changing the
expected counts, so there's no
point in finding a local maximum
of the auxiliary
function in each iteration
a local-optimality guarantee.
Consequently, LOLS can
improve upon the reference
policy, unlike previous
algorithms. This enables us to
develop structured contextual
bandits, a partial information
structured prediction setting with
many potential applications.
Can learning to search work even
when the reference is poor?
We provide a new learning to
search algorithm, LOLS, which
does well relative to the
reference policy, but additionally
guarantees low regret compared
to deviations from the learned
policy.
Methods for learning to search
for structured prediction typically
imitate a reference policy, with
existing theoretical guarantees
demonstrating low regret
compared to that reference. This
is unsatisfactory in many
applications where the reference
policy is suboptimal and the goal
of learning is to
Robin is alive and well. He is the
same person that you read about
in the book, Winnie the Pooh. As
a boy, Chris lived in a pretty
home called Cotchfield Farm.
When Chris was three years old,
his father wrote a poem about
him. The poem was printed in a
magazine for others to read. Mr.
Robin then wrote a book
Robin is alive and well. He is the same
person that you read about in the book,
Winnie the Pooh. As a boy, Chris lived
in a pretty home called Cotchfield
Farm. When Chris was three years old,
his father wrote a poem about him. The
poem was printed in a magazine for
others to read. Mr. Robin then wrote a
book
Structured prediction models
Deep learning models
Inference / learning algorithms
Modeling Challenges
 How to model a complex decision?
 Why this is important?
CS6501- Advanced Machine Learning 59
Robin is alive and well. He is the same
person that you read about in the book,
Winnie the Pooh. As a boy, Chris lived
in a pretty home called Cotchfield
Farm. When Chris was three years old,
his father wrote a poem about him. The
poem was printed in a magazine for
others to read. Mr. Robin then wrote a
book
Language is structural
CS6501- Advanced Machine Learning 60
Hand written recognition
 What is this letter?
CS6501- Advanced Machine Learning 61
Hand written recognition
 What is this letter?
CS6501- Advanced Machine Learning 62
Visual recognition
CS6501- Advanced Machine Learning 63
Human body recognition
CS6501- Advanced Machine Learning 64
Bridge the gap
 Simple classifiers are not designed for handle
complex output
 Need to make multiple decisions jointly
 Example: POS tagging:
CS6501- Advanced Machine Learning 65
Example from Vivek Srikumar
can you can a can as a canner can can a can
Make multiple decisions jointly
 Example: POS tagging:
 Each part needs a label
 Assign tag (V., N., A., …) to each word in the sentence
 The decisions are mutually dependent
 Cannot have verb followed by a verb
 Results are evaluated jointly
CS6501- Advanced Machine Learning 66
can you can a can as a canner can can a can
Structured prediction problems
 Problems that
 have multiple interdependent output variables
 and the output assignments are evaluated jointly
 Need a joint assignment to all the output variables
 We called it joint inference, global infernece or
simply inference
CS6501- Advanced Machine Learning 67
 Input: 𝑥 ∈ 𝑋
 Truth: y∗
∈ 𝑌(𝑥)
 Predicted: ℎ(𝑥) ∈ 𝑌(𝑥)
 Loss: 𝑙𝑜𝑠𝑠 𝑦, 𝑦∗
68
I can can a can
Pro Md Vb Dt Nn
Pro Md Nn Dt Vb
Pro Md Nn Dt Md
Pro Md Md Dt Nn
Pro Md Md Dt Vb
Goal: make joint prediction to minimize a joint loss
find ℎ ∈ 𝐻 such that ℎ x ∈ 𝑌(𝑋)
minimizing 𝐸 𝑥,𝑦 ~𝐷 𝑙𝑜𝑠𝑠 𝑦, ℎ 𝑥 based on 𝑁
samples 𝑥𝑛, 𝑦𝑛 ~𝐷
Kai-Wei Chang (University of Virginia)
A General learning setting
 Input: 𝑥 ∈ 𝑋
 Truth: y∗
∈ 𝑌(𝑥)
 Predicted: ℎ(𝑥) ∈ 𝑌(𝑥)
 Loss: 𝑙𝑜𝑠𝑠 𝑦, 𝑦∗
69
I can can a can
Pro Md Vb Dt Nn
Pro Md Nn Dt Vb
Pro Md Nn Dt Md
Pro Md Md Dt Nn
Pro Md Md Dt Vb
# POS tags: 45
How many possible outputs for sentence with 10
words?
Kai-Wei Chang (University of Virginia)
Combinatorial output space
4510
= 3.4 × 1016
Observation: Not all sequences are valid,
and we don’t need to consider all of them
Representation of interdependent output variables
 A compact way to represent output
combinations
 Abstract away unnecessary complexities
 We know how to process them
 Graph algorithms for linear chain, tree, etc.
CS6501- Advanced Machine Learning 70
Pronoun Verb Noun And Noun
Root They operate ships and banks .
Algorithms/models for structured prediction
 Many learning algorithms can be
generalized to the structured case
 Perceptron → Structured perceptron
 SVM → Structured SVM
 Logistic regression → Conditional random field
(a.k.a. log-linear models)
 Can be solved by a reduction stack
 Structured prediction → multi-class → binary
CS6501- Advanced Machine Learning 71
Representation Challenges
 How to obtain features?
CS6501- Advanced Machine Learning 72
Robin is alive and well. He is the same person that
you read about in the book, Winnie the Pooh. As a
boy, Chris lived in a pretty home called Cotchfield
Farm. When Chris was three years old, his father
wrote a poem about him. The poem was printed in a
magazine for others to read. Mr. Robin then wrote a
book
Representation Challenges
 How to obtain features?
1. Design features based on domain knowledge
 E.g., by patterns in parse trees
 By nicknames
 Need human experts/knowledge
CS6501- Advanced Machine Learning 73
When Chris was three years old, his father wrote a poem about him.
Christopher Robin is alive and well. He is the same person that you read
about in the book, Winnie the Pooh. As a boy, Chris lived in a pretty home
called Cotchfield Farm.
Representation Challenges
 How to obtain features?
1. Design features based on domain knowledge
2. Design feature templates and then let machine
find the right ones
 E.g., use all words, pairs of words, …
CS6501- Advanced Machine Learning 74
Robin is alive and well. He is the same person that you read about
in the book, Winnie the Pooh. As a boy, Chris lived in a pretty
home called Cotchfield Farm. When Chris was three years old, his
father wrote a poem about him. The poem was printed in a
magazine for others to read. Mr. Robin then wrote a book
Representation Challenges
 How to obtain features?
1. Design features based on domain knowledge
2. Design feature templates and then let machine
find the right ones
 Challenges:
 # featuers can be very large
 # English words: 171K (Oxford)
 # Bigram: 171𝐾 2~3 × 1010, # trigram?
 For some domains, it is hard to design features
CS6501- Advanced Machine Learning 75
Representation learning
 Learn compact representations of features
 Combinatorial (continuous representation)
CS6501- Advanced Machine Learning 76
Representation learning
 Learn compact representations of features
 Combinatorial (continuous representation)
 Hieratical/compositional
CS6501- Advanced Machine Learning 77
What will learn from this course
 Structured prediction
 Models / inference/ learning
 Representation (deep) learning
 Input/output representations
 Combining structured models and deep
learning
CS6501- Advanced Machine Learning 78

Mais conteúdo relacionado

Semelhante a ML in NLP Lecture 1 Introduction

[KDD 2018 tutorial] End to-end goal-oriented question answering systems
[KDD 2018 tutorial] End to-end goal-oriented question answering systems[KDD 2018 tutorial] End to-end goal-oriented question answering systems
[KDD 2018 tutorial] End to-end goal-oriented question answering systemsQi He
 
AIS Technical Development Workshop 2: Text Analytics with Python
AIS Technical Development Workshop 2: Text Analytics with PythonAIS Technical Development Workshop 2: Text Analytics with Python
AIS Technical Development Workshop 2: Text Analytics with PythonNhi Nguyen
 
Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondIsabelle Augenstein
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 2
Engineering Intelligent NLP Applications Using Deep Learning – Part 2 Engineering Intelligent NLP Applications Using Deep Learning – Part 2
Engineering Intelligent NLP Applications Using Deep Learning – Part 2 Saurabh Kaushik
 
Introduction to natural language processing
Introduction to natural language processingIntroduction to natural language processing
Introduction to natural language processingMinh Pham
 
Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...alessio_ferrari
 
CSTalks-Natural Language Processing-2 Nov
CSTalks-Natural Language Processing-2 NovCSTalks-Natural Language Processing-2 Nov
CSTalks-Natural Language Processing-2 Novcstalks
 
CSTalks-Natural Language Processing-17Aug
CSTalks-Natural Language Processing-17AugCSTalks-Natural Language Processing-17Aug
CSTalks-Natural Language Processing-17Augcstalks
 
Anthiil Inside workshop on NLP
Anthiil Inside workshop on NLPAnthiil Inside workshop on NLP
Anthiil Inside workshop on NLPSatyam Saxena
 
Representation Learning of Text for NLP
Representation Learning of Text for NLPRepresentation Learning of Text for NLP
Representation Learning of Text for NLPAnuj Gupta
 
Query Linguistic Intent Detection
Query Linguistic Intent DetectionQuery Linguistic Intent Detection
Query Linguistic Intent Detectionbutest
 
Thomas Wolf "Transfer learning in NLP"
Thomas Wolf "Transfer learning in NLP"Thomas Wolf "Transfer learning in NLP"
Thomas Wolf "Transfer learning in NLP"Fwdays
 
Technical Development Workshop - Text Analytics with Python
Technical Development Workshop - Text Analytics with PythonTechnical Development Workshop - Text Analytics with Python
Technical Development Workshop - Text Analytics with PythonMichelle Purnama
 
Gadgets pwn us? A pattern language for CALL
Gadgets pwn us? A pattern language for CALLGadgets pwn us? A pattern language for CALL
Gadgets pwn us? A pattern language for CALLLawrie Hunter
 
how to program.ppt
how to program.ppthow to program.ppt
how to program.pptTalhaGoraya4
 
Visual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageVisual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageRoelof Pieters
 
MixedLanguageProcessingTutorialEMNLP2019.pptx
MixedLanguageProcessingTutorialEMNLP2019.pptxMixedLanguageProcessingTutorialEMNLP2019.pptx
MixedLanguageProcessingTutorialEMNLP2019.pptxMariYam371004
 

Semelhante a ML in NLP Lecture 1 Introduction (20)

Gate
GateGate
Gate
 
[KDD 2018 tutorial] End to-end goal-oriented question answering systems
[KDD 2018 tutorial] End to-end goal-oriented question answering systems[KDD 2018 tutorial] End to-end goal-oriented question answering systems
[KDD 2018 tutorial] End to-end goal-oriented question answering systems
 
AIS Technical Development Workshop 2: Text Analytics with Python
AIS Technical Development Workshop 2: Text Analytics with PythonAIS Technical Development Workshop 2: Text Analytics with Python
AIS Technical Development Workshop 2: Text Analytics with Python
 
Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyond
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 2
Engineering Intelligent NLP Applications Using Deep Learning – Part 2 Engineering Intelligent NLP Applications Using Deep Learning – Part 2
Engineering Intelligent NLP Applications Using Deep Learning – Part 2
 
Introduction to natural language processing
Introduction to natural language processingIntroduction to natural language processing
Introduction to natural language processing
 
Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...
 
ACL 2018 Recap
ACL 2018 RecapACL 2018 Recap
ACL 2018 Recap
 
CSTalks-Natural Language Processing-2 Nov
CSTalks-Natural Language Processing-2 NovCSTalks-Natural Language Processing-2 Nov
CSTalks-Natural Language Processing-2 Nov
 
CSTalks-Natural Language Processing-17Aug
CSTalks-Natural Language Processing-17AugCSTalks-Natural Language Processing-17Aug
CSTalks-Natural Language Processing-17Aug
 
Anthiil Inside workshop on NLP
Anthiil Inside workshop on NLPAnthiil Inside workshop on NLP
Anthiil Inside workshop on NLP
 
Representation Learning of Text for NLP
Representation Learning of Text for NLPRepresentation Learning of Text for NLP
Representation Learning of Text for NLP
 
Query Linguistic Intent Detection
Query Linguistic Intent DetectionQuery Linguistic Intent Detection
Query Linguistic Intent Detection
 
Thomas Wolf "Transfer learning in NLP"
Thomas Wolf "Transfer learning in NLP"Thomas Wolf "Transfer learning in NLP"
Thomas Wolf "Transfer learning in NLP"
 
Technical Development Workshop - Text Analytics with Python
Technical Development Workshop - Text Analytics with PythonTechnical Development Workshop - Text Analytics with Python
Technical Development Workshop - Text Analytics with Python
 
Gadgets pwn us? A pattern language for CALL
Gadgets pwn us? A pattern language for CALLGadgets pwn us? A pattern language for CALL
Gadgets pwn us? A pattern language for CALL
 
how to program.ppt
how to program.ppthow to program.ppt
how to program.ppt
 
Visual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageVisual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on Language
 
FinalReport
FinalReportFinalReport
FinalReport
 
MixedLanguageProcessingTutorialEMNLP2019.pptx
MixedLanguageProcessingTutorialEMNLP2019.pptxMixedLanguageProcessingTutorialEMNLP2019.pptx
MixedLanguageProcessingTutorialEMNLP2019.pptx
 

Mais de INyomanSwitrayana

Mais de INyomanSwitrayana (8)

sa.ppt
sa.pptsa.ppt
sa.ppt
 
ch8Bayes.ppt
ch8Bayes.pptch8Bayes.ppt
ch8Bayes.ppt
 
IS
ISIS
IS
 
Handout-INF106-SBD-3.pptx
Handout-INF106-SBD-3.pptxHandout-INF106-SBD-3.pptx
Handout-INF106-SBD-3.pptx
 
05_Decision Support and OLAP.pdf
05_Decision Support and OLAP.pdf05_Decision Support and OLAP.pdf
05_Decision Support and OLAP.pdf
 
05_Skema Database.ppt
05_Skema Database.ppt05_Skema Database.ppt
05_Skema Database.ppt
 
06_The ETL (Extract-Transform-Load) Process.ppt
06_The ETL (Extract-Transform-Load) Process.ppt06_The ETL (Extract-Transform-Load) Process.ppt
06_The ETL (Extract-Transform-Load) Process.ppt
 
Lecture2-DT.pptx
Lecture2-DT.pptxLecture2-DT.pptx
Lecture2-DT.pptx
 

Último

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlkumarajju5765
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 

Último (20)

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 

ML in NLP Lecture 1 Introduction

  • 1. Lecture 1: Introduction Kai-Wei Chang CS @ UCLA kw@kwchang.net Couse webpage: https://uclanlp.github.io/CS269-17/ 1 ML in NLP
  • 2. Announcements  Waiting list: Start attending the first few lectures as if you are registered. Given that some students will drop the class, some space will free up.  We will use Piazza as an online discussion platform. Please sign up here: piazza.com/ucla/fall2017/cs269 ML in NLP 2
  • 3. Staff  Instructor: Kai-Wei Chang  Email: ml17@kwchang.net  Office: BH 3732J  Office hour: 4:00 – 5:00, Tue (after class).  TA: Md Rizwan Parvez  Email: wua4nw@virginia.edu  Office: BH 3809  Office hour: 12:00 – 2:00, Wed 3 ML in NLP
  • 4. This lecture  Course Overview  What is NLP? Why it is important?  What types of ML methods used in NLP?  What will you learn from this course?  Course Information  What are the challenges?  Key NLP components  Key ML ideas in NLP ML in NLP 4
  • 5. What is NLP  Wiki: Natural language processing (NLP) is a field of computer science, artificial intelligence, and computational linguistics concerned with the interactions between computers and human (natural) languages. ML in NLP 5
  • 6. Go beyond the keyword matching  Identify the structure and meaning of words, sentences, texts and conversations  Deep understanding of broad language  NLP is all around us ML in NLP 6
  • 7. Machine translation ML in NLP 7 Facebook translation, image credit: Meedan.org
  • 8. Statistical machine translation ML in NLP 8 Image credit: Julia Hockenmaier, Intro to NLP
  • 11. Text Classification  Other applications? ML in NLP 11 www.wired.com
  • 12. Question answering ML in NLP 12 credit: ifunny.com 'Watson' computer wins at 'Jeopardy'
  • 13. Question answering  Go beyond search ML in NLP 13
  • 14. Natural language instruction ML in NLP 14 https://youtu.be/KkOCeAtKHIc?t=1m28s
  • 15. Digital personal assistant  Semantic parsing – understand tasks  Entity linking – “my wife” = “Kellie” in the phone book ML in NLP 15 credit: techspot.com More on natural language instruction
  • 16. Information Extraction  Unstructured text to database entries ML in NLP 16 Yoav Artzi: Natural language processing
  • 17. Language Comprehension  Q: who wrote Winnie the Pooh?  Q: where is Chris lived? ML in NLP 17 Christopher Robin is alive and well. He is the same person that you read about in the book, Winnie the Pooh. As a boy, Chris lived in a pretty home called Cotchfield Farm. When Chris was three years old, his father wrote a poem about him. The poem was printed in a magazine for others to read. Mr. Robin then wrote a book
  • 18. What will you learn from this course  The NLP Pipeline  Key components for understanding text  NLP systems/applications  Current techniques & limitation  Build realistic NLP tools ML in NLP 18
  • 19. What’s not covered by this course  Speech recognition – no signal processing  Natural language generation  Details of ML algorithms / theory  Text mining / information retrieval ML in NLP 19
  • 20. This lecture  Course Overview  What is NLP? Why it is important?  What will you learn from this course?  Course Information  What are the challenges?  Key NLP components ML in NLP 20
  • 21. Overview  New course, first time being offered  Comments are welcomed  target at first- or second- year PhD students  Lecture + Seminar  No course prerequisites, but I assume  programming experience (for the final project)  basic ML/AI background  basics of probability calculus, and linear algebra (HW0) ML in NLP 21
  • 22. Grading  Attendance & participations (10%)  Participate in discussion  Paper summarization report (20%)  Paper presentation (30%)  Final project (40%)  Proposal (5%)  Final Paper (25%)  Presentation (10%) ML in NLP 22
  • 23. Paper summarization  1 page maximum  Pick one paper from recent ACL/NAACL/EMNLP/EACL  Summarize the paper (use you own words)  Write a blog post using markdown or jupyter notebook:  https://einstein.ai/research/learned-in- translation-contextualized-word-vectors  https://github.com/uclanlp/reducingbias/blob/ma ster/src/fairCRF_gender_ratio.ipynb ML in NLP 23
  • 24. ML in NLP 24
  • 25. ML in NLP 25
  • 26. Paper presentation  Each group has 2~3 students  Read and understand 2~3 related papers  Cannot be the same as your paper summary  Can be related to your final project  Register your choice next week  30 min presentation/ Q&A  Grading Rubric: 40% technical understanding, 40% presentation, 20% interaction ML in NLP 26
  • 27. Final Project  Work in groups (3 students)  Project proposal  1 page maximum (template)  Project report  Similar to the paper summary  Due before the final presentation  Project presentation  in-class presentation (tentative) ML in NLP 27
  • 28. Late Policy  Submission site will be closed 1hr after the deadline.  No late submission  unless under emergency situation ML in NLP 28
  • 29. Cheating/Plagiarism  No. Ask if you have concerns  Rules of thumb:  Cite your references  Clearly state what are your contributions ML in NLP 29
  • 30. Lectures and office hours  Participation is highly appreciated!  Ask questions if you are still confusing  Feedbacks are welcomed  Lead the discussion in this class  Enroll Piazza ML in NLP 30
  • 31. Topics of this class  Fundamental NLP problems  Machine learning & statistical approaches for NLP  NLP applications  Recent trends in NLP ML in NLP 31
  • 32. What to Read?  Natural Language Processing ACL, NAACL, EACL, EMNLP, CoNLL, Coling, TACL aclweb.org/anthology  Machine learning ICML, NIPS, ECML, AISTATS, ICLR, JMLR, MLJ  Artificial Intelligence AAAI, IJCAI, UAI, JAIR ML in NLP 32
  • 34. This lecture  Course Overview  What is NLP? Why it is important?  What will you learn from this course?  Course Information  What are the challenges?  Key NLP components  Key ML ideas in NLP ML in NLP 34
  • 35. Challenges – ambiguity  Word sense ambiguity ML in NLP 35
  • 36. Challenges – ambiguity  Word sense / meaning ambiguity ML in NLP 36 Credit: http://stuffsirisaid.com
  • 37. Challenges – ambiguity  PP attachment ambiguity ML in NLP 37 Credit: Mark Liberman, http://languagelog.ldc.upenn.edu/nll/?p=17711
  • 38. Challenges -- ambiguity  Ambiguous headlines:  Include your children when baking cookies  Local High School Dropouts Cut in Half  Hospitals are Sued by 7 Foot Doctors  Iraqi Head Seeks Arms  Safety Experts Say School Bus Passengers Should Be Belted  Teacher Strikes Idle Kids ML in NLP 38
  • 39. Challenges – ambiguity  Pronoun reference ambiguity ML in NLP 39 Credit: http://www.printwand.com/blog/8-catastrophic-examples-of-word-choice-mistakes
  • 40. Challenges – language is not static  Language grows and changes  e.g., cyber lingo ML in NLP 40 LOL Laugh out loud G2G Got to go BFN Bye for now B4N Bye for now Idk I don’t know FWIW For what it’s worth LUWAMH Love you with all my heart
  • 41. Challenges--language is compositional ML in NLP 41 Carefully Slide
  • 42. Challenges--language is compositional ML in NLP 42 小心: Carefully Careful Take Care Caution 地滑: Slide Landslip Wet Floor Smooth
  • 43. Challenges – scale  Examples:  Bible (King James version): ~700K  Penn Tree bank ~1M from Wall street journal  Newswire collection: 500M+  Wikipedia: 2.9 billion word (English)  Web: several billions of words ML in NLP 43
  • 44. This lecture  Course Overview  What is NLP? Why it is important?  What will you learn from this course?  Course Information  What are the challenges?  Key NLP components  Key ML ideas in NLP ML in NLP 44
  • 45. Part of speech tagging ML in NLP 45
  • 47. Syntactic structure => meaning ML in NLP 47 Image credit: Julia Hockenmaier, Intro to NLP
  • 49. Semantic analysis  Word sense disambiguation  Semantic role labeling ML in NLP 49 Credit: Ivan Titov
  • 50. Christopher Robin is alive and well. He is the same person that you read about in the book, Winnie the Pooh. As a boy, Chris lived in a pretty home called Cotchfield Farm. When Chris was three years old, his father wrote a poem about him. The poem was printed in a magazine for others to read. Mr. Robin then wrote a book 50 Q: [Chris] = [Mr. Robin] ? Slide modified from Dan Roth ML in NLP
  • 51. Christopher Robin is alive and well. He is the same person that you read about in the book, Winnie the Pooh. As a boy, Chris lived in a pretty home called Cotchfield Farm. When Chris was three years old, his father wrote a poem about him. The poem was printed in a magazine for others to read. Mr. Robin then wrote a book 51 Co-reference Resolution ML in NLP
  • 52. This lecture  Course Overview  What is NLP? Why it is important?  What will you learn from this course?  Course Information  What are the challenges?  Key NLP components  Key ML ideas in NLP ML in NLP 52
  • 53. Machine learning 101 CS6501- Advanced Machine Learning 53
  • 55. CS6501- Advanced Machine Learning 55 Perceptron, decision tree, support vector machine K-NN, Naïve Bayes, logistic regression….
  • 56. Classification is generally well-understood  Theoretically: generalization bound  # examples to train a good model  Algorithmically:  Efficient algorithm for large data set  E.g., take a few second to train a linear SVM on data with millions instances and features  Algorithms for non-linear model  E.g., Kernel methods CS6501- Advanced Machine Learning 56 Is this enough to solve all real-world problems?
  • 58. Challenges Modeling challenges How to model a complex decision? Representation challenges How to extract features? Algorithmic challenges  Large amount of data and complex decision structure 58 Bill Clinton, recently elected as the President of the USA, has been invited by the Russian President], [Vladimir Putin, to visit Russia. President Clinton said that he looks forward to strengthening ties between USA and Russia Algorithm 2 is shown to perform better Berg-Kirkpatrick, ACL 2010. It can also be expected to converge faster -- anyway, the E- step changes the auxiliary function by changing the expected counts, so there's no point in finding a local maximum of the auxiliary function in each iteration a local-optimality guarantee. Consequently, LOLS can improve upon the reference policy, unlike previous algorithms. This enables us to develop structured contextual bandits, a partial information structured prediction setting with many potential applications. Can learning to search work even when the reference is poor? We provide a new learning to search algorithm, LOLS, which does well relative to the reference policy, but additionally guarantees low regret compared to deviations from the learned policy. Methods for learning to search for structured prediction typically imitate a reference policy, with existing theoretical guarantees demonstrating low regret compared to that reference. This is unsatisfactory in many applications where the reference policy is suboptimal and the goal of learning is to Robin is alive and well. He is the same person that you read about in the book, Winnie the Pooh. As a boy, Chris lived in a pretty home called Cotchfield Farm. When Chris was three years old, his father wrote a poem about him. The poem was printed in a magazine for others to read. Mr. Robin then wrote a book Robin is alive and well. He is the same person that you read about in the book, Winnie the Pooh. As a boy, Chris lived in a pretty home called Cotchfield Farm. When Chris was three years old, his father wrote a poem about him. The poem was printed in a magazine for others to read. Mr. Robin then wrote a book Structured prediction models Deep learning models Inference / learning algorithms
  • 59. Modeling Challenges  How to model a complex decision?  Why this is important? CS6501- Advanced Machine Learning 59 Robin is alive and well. He is the same person that you read about in the book, Winnie the Pooh. As a boy, Chris lived in a pretty home called Cotchfield Farm. When Chris was three years old, his father wrote a poem about him. The poem was printed in a magazine for others to read. Mr. Robin then wrote a book
  • 60. Language is structural CS6501- Advanced Machine Learning 60
  • 61. Hand written recognition  What is this letter? CS6501- Advanced Machine Learning 61
  • 62. Hand written recognition  What is this letter? CS6501- Advanced Machine Learning 62
  • 63. Visual recognition CS6501- Advanced Machine Learning 63
  • 64. Human body recognition CS6501- Advanced Machine Learning 64
  • 65. Bridge the gap  Simple classifiers are not designed for handle complex output  Need to make multiple decisions jointly  Example: POS tagging: CS6501- Advanced Machine Learning 65 Example from Vivek Srikumar can you can a can as a canner can can a can
  • 66. Make multiple decisions jointly  Example: POS tagging:  Each part needs a label  Assign tag (V., N., A., …) to each word in the sentence  The decisions are mutually dependent  Cannot have verb followed by a verb  Results are evaluated jointly CS6501- Advanced Machine Learning 66 can you can a can as a canner can can a can
  • 67. Structured prediction problems  Problems that  have multiple interdependent output variables  and the output assignments are evaluated jointly  Need a joint assignment to all the output variables  We called it joint inference, global infernece or simply inference CS6501- Advanced Machine Learning 67
  • 68.  Input: 𝑥 ∈ 𝑋  Truth: y∗ ∈ 𝑌(𝑥)  Predicted: ℎ(𝑥) ∈ 𝑌(𝑥)  Loss: 𝑙𝑜𝑠𝑠 𝑦, 𝑦∗ 68 I can can a can Pro Md Vb Dt Nn Pro Md Nn Dt Vb Pro Md Nn Dt Md Pro Md Md Dt Nn Pro Md Md Dt Vb Goal: make joint prediction to minimize a joint loss find ℎ ∈ 𝐻 such that ℎ x ∈ 𝑌(𝑋) minimizing 𝐸 𝑥,𝑦 ~𝐷 𝑙𝑜𝑠𝑠 𝑦, ℎ 𝑥 based on 𝑁 samples 𝑥𝑛, 𝑦𝑛 ~𝐷 Kai-Wei Chang (University of Virginia) A General learning setting
  • 69.  Input: 𝑥 ∈ 𝑋  Truth: y∗ ∈ 𝑌(𝑥)  Predicted: ℎ(𝑥) ∈ 𝑌(𝑥)  Loss: 𝑙𝑜𝑠𝑠 𝑦, 𝑦∗ 69 I can can a can Pro Md Vb Dt Nn Pro Md Nn Dt Vb Pro Md Nn Dt Md Pro Md Md Dt Nn Pro Md Md Dt Vb # POS tags: 45 How many possible outputs for sentence with 10 words? Kai-Wei Chang (University of Virginia) Combinatorial output space 4510 = 3.4 × 1016 Observation: Not all sequences are valid, and we don’t need to consider all of them
  • 70. Representation of interdependent output variables  A compact way to represent output combinations  Abstract away unnecessary complexities  We know how to process them  Graph algorithms for linear chain, tree, etc. CS6501- Advanced Machine Learning 70 Pronoun Verb Noun And Noun Root They operate ships and banks .
  • 71. Algorithms/models for structured prediction  Many learning algorithms can be generalized to the structured case  Perceptron → Structured perceptron  SVM → Structured SVM  Logistic regression → Conditional random field (a.k.a. log-linear models)  Can be solved by a reduction stack  Structured prediction → multi-class → binary CS6501- Advanced Machine Learning 71
  • 72. Representation Challenges  How to obtain features? CS6501- Advanced Machine Learning 72 Robin is alive and well. He is the same person that you read about in the book, Winnie the Pooh. As a boy, Chris lived in a pretty home called Cotchfield Farm. When Chris was three years old, his father wrote a poem about him. The poem was printed in a magazine for others to read. Mr. Robin then wrote a book
  • 73. Representation Challenges  How to obtain features? 1. Design features based on domain knowledge  E.g., by patterns in parse trees  By nicknames  Need human experts/knowledge CS6501- Advanced Machine Learning 73 When Chris was three years old, his father wrote a poem about him. Christopher Robin is alive and well. He is the same person that you read about in the book, Winnie the Pooh. As a boy, Chris lived in a pretty home called Cotchfield Farm.
  • 74. Representation Challenges  How to obtain features? 1. Design features based on domain knowledge 2. Design feature templates and then let machine find the right ones  E.g., use all words, pairs of words, … CS6501- Advanced Machine Learning 74 Robin is alive and well. He is the same person that you read about in the book, Winnie the Pooh. As a boy, Chris lived in a pretty home called Cotchfield Farm. When Chris was three years old, his father wrote a poem about him. The poem was printed in a magazine for others to read. Mr. Robin then wrote a book
  • 75. Representation Challenges  How to obtain features? 1. Design features based on domain knowledge 2. Design feature templates and then let machine find the right ones  Challenges:  # featuers can be very large  # English words: 171K (Oxford)  # Bigram: 171𝐾 2~3 × 1010, # trigram?  For some domains, it is hard to design features CS6501- Advanced Machine Learning 75
  • 76. Representation learning  Learn compact representations of features  Combinatorial (continuous representation) CS6501- Advanced Machine Learning 76
  • 77. Representation learning  Learn compact representations of features  Combinatorial (continuous representation)  Hieratical/compositional CS6501- Advanced Machine Learning 77
  • 78. What will learn from this course  Structured prediction  Models / inference/ learning  Representation (deep) learning  Input/output representations  Combining structured models and deep learning CS6501- Advanced Machine Learning 78