An introduction to lexical
semantic change
Nina Tahmasebi, Associate Professor
University of Gothenburg
October 2020, Stuttgart
awesome
He was an
awesome leader!
He was an
awesome leader!
time
Nina Tahmasebi, An introduction to lexical semantic change,
Stuttgart, Oct 2020
Aims: semantic change
Find word sense changes automatically,
to find what changed, how it changed and
when it changed
Stone
Music
Lifestyle
Rock
Vision
Given a word in a document at time t:
1. Mark words that are likely to have a changed meaning
2. Find all changes to the word
LSC
Methods for
computational
semantic change
Pipeline
collective text
individual text
signal (topic, cluster, vector…)
signal change
Nina Tahmasebi, On Lexical Semantic Change and Evaluation,
London, November 2019
[Figure labels: word, word; Single-sense, Sense-differentiated]
[Figure: timeline of methods, 2008 to 2020]
Single-sense: embeddings, dynamic embeddings, neural embeddings
Sense-differentiated: topic models, word sense induction, contextual embeddings
Tahmasebi et al. 2008
Sagi et al. 2009
Wijaya & Yeniterzi 2011
Lau et al. 2012
Kim et al. 2014
Kulkarni et al. 2015
Mitra et al. 2015
Basile et al. 2016
Frermann & Lapata 2016
Hamilton et al. 2016
Tahmasebi & Risse 2017
Bamler & Mandt 2018
Hu et al. 2019
Giulianelli et al. 2020
Word-level
semantic change
embeddings / context-based methods
dynamic embeddings
neural embeddings
Context-based method
Sagi et al.
GEMS 2009
context vectors
w
ti
tj
Broadening of sense
Narrowing of sense
With grouping:
Added/removed sense
Data set split into appropriate sets
BUT:
1. No alignment of senses over time!
2. No discrimination between senses
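A minimal sketch of how broadening or narrowing might be read off context vectors. It assumes (this is not stated on the slide) that the spread of a word's context vectors within a period is summarised by their average pairwise cosine similarity: a lower value at tj than at ti would suggest broadening, a higher value narrowing. The function names and toy vectors are hypothetical.

```python
import math
from itertools import combinations

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def density(context_vectors):
    """Average pairwise cosine similarity (needs at least two vectors)."""
    pairs = list(combinations(context_vectors, 2))
    return sum(cosine(u, v) for u, v in pairs) / len(pairs)

# Toy context vectors for a word w in two periods (illustrative values only).
contexts_ti = [[1.0, 0.0], [0.9, 0.1], [1.0, 0.1]]   # similar contexts
contexts_tj = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]   # more varied contexts

d_ti, d_tj = density(contexts_ti), density(contexts_tj)
trend = "broadening" if d_tj < d_ti else "narrowing or stable"
print(f"density ti={d_ti:.2f}, tj={d_tj:.2f} -> {trend}")
```

Because all contexts of w are pooled into a single number per period, this sketch shares the limitation noted on the slide: it cannot discriminate between senses.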
Difficulty:
What does a word mean?
When are two meanings the same?
[Figure labels: word, word; Single-sense]
Word embedding-based models
Kulkarni et al. WWW’15
Project a word onto a vector/point
(POS, frequency and embeddings)
Track vectors over time
Kim et al. LACSS 2014
Basile et al. CLiC-it 2016
Hamilton et al. ACL 2016
Image: Kulkarni et al. WWW’15
LSC – individually trained embedding spaces
Single-point
embedding space
ti
multiple
time points
Track an individual
word w over time
Change
point/degree
detection
align
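The align-then-track idea can be sketched as follows. This is an illustration under assumptions, not the method of any one cited paper: it uses orthogonal Procrustes for the alignment and cosine distance as the degree of change, on synthetic data where one word is deliberately moved. All names (`procrustes_align`, `space_t1`, `space_t2`) are hypothetical.

```python
import numpy as np

def procrustes_align(source, target):
    """Rotate `source` onto `target` (rows = shared vocabulary, same order)."""
    u, _, vt = np.linalg.svd(source.T @ target)
    return source @ (u @ vt)

def cosine_distance(u, v):
    return 1.0 - float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

rng = np.random.default_rng(0)
space_t1 = rng.normal(size=(200, 8))             # toy embeddings at time t1
rotation, _ = np.linalg.qr(rng.normal(size=(8, 8)))
space_t2 = space_t1 @ rotation                   # same space, arbitrarily rotated
space_t2[0] = -space_t2[0]                       # word 0 is moved to simulate a change

aligned_t1 = procrustes_align(space_t1, space_t2)
change = [cosine_distance(a, b) for a, b in zip(aligned_t1, space_t2)]
print(int(np.argmax(change)))                    # index of the most changed word
```

Without the alignment step the two spaces are not comparable, since independently trained embedding spaces can differ by an arbitrary rotation even when no word has changed.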
Dynamic Embeddings
Sharing data is highly beneficial!
Bamler & Mandt:
• Bayesian Skip-gram
Yao et al:
• PPMI embeddings
Rudolph & Blei:
• Exponential family embeddings
(Bernoulli embeddings)
Share data across all time points
Avoids aligning
Temporal Referencing
Sharing data is highly beneficial!
Share contexts across all time points
Individual vectors for words in each time bin
Avoids aligning
Dubossarsky et al
• SGNS
• PPMI embeddings
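A minimal sketch of the preprocessing idea behind temporal referencing, assuming a skip-gram style setup: target words are tagged with their time bin when they are the center word, while context words stay untagged so that contexts are shared across all bins. The function name, the window size and the toy corpus are hypothetical.

```python
def temporal_reference_pairs(sentences_by_bin, targets, window=2):
    """Yield (center, context) training pairs.

    Target words get a time-bin suffix when they are the center word;
    context words are never tagged, so contexts are shared across bins.
    """
    for time_bin, sentences in sentences_by_bin.items():
        for tokens in sentences:
            for i, center in enumerate(tokens):
                if center in targets:
                    center = f"{center}_{time_bin}"
                start, end = max(0, i - window), min(len(tokens), i + window + 1)
                for j in range(start, end):
                    if j != i:
                        yield center, tokens[j]

corpus = {
    "1950": [["the", "rock", "fell", "down"]],
    "2000": [["loud", "rock", "music", "played"]],
}
pairs = list(temporal_reference_pairs(corpus, targets={"rock"}))
print(pairs[:5])
```

Since every bin is trained in one space, the vectors for `rock_1950` and `rock_2000` can be compared directly, which is why no alignment step is needed.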
Sense-differentiated
semantic change
topic models
word sense induction
contextual embeddings
Wijaya & Yeniterzi
DETECT '11
Cook et al.
Coling 2014
Frermann & Lapata
TACL 2016
Topic-based methods
[Figure: text excerpts from two collections, BNC and ukWaC: "Finally, we conduct a preliminary evaluation in which we apply our methods to the task of" and "The meanings of words are not fixed but in fact undergo change"]
1. Topic model (HDP)
2. Assign topics to all instances of a word.
3. If a word sense WSi is assigned to collection 2 but not collection 1, then WSi is a novel word sense.
BUT:
Only two time points (typically there is much noise!)
No alignment of senses over time!
A
B
Lau et al.
EACL 2014
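The novelty rule in step 3 can be sketched as a set comparison. The sketch assumes the topic model has already assigned a sense id to each instance of the word; the `min_count` threshold is an added assumption (not on the slide) meant to dampen the noise mentioned above, and the sense ids are hypothetical.

```python
from collections import Counter

def novel_senses(assignments_1, assignments_2, min_count=1):
    """Senses seen at least `min_count` times in collection 2 and never in collection 1."""
    seen_before = set(assignments_1)
    counts_2 = Counter(assignments_2)
    return {s for s, n in counts_2.items() if n >= min_count and s not in seen_before}

# Hypothetical sense ids assigned to each instance of one word.
older = ["WS1", "WS1", "WS2", "WS1"]
newer = ["WS1", "WS3", "WS3", "WS2", "WS4"]

print(novel_senses(older, newer))               # senses absent from the older collection
print(novel_senses(older, newer, min_count=2))  # ignore senses seen only once
```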
Downsides topic models
Topic
change
Sense
change
Word sense induction
Word sense induction
(curvature clustering)
individual time slices
Tahmasebi & Risse, RANLP2017
Stone
Music
Lifestyle
Rock
Step 1: Detecting stable senses (units)
Step 2: Relating units
Step 3: Paths
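One way to picture the "relating units" step is to link word clusters from consecutive time slices whenever they overlap enough. The sketch below is an assumption for illustration only: it uses Jaccard overlap with a fixed threshold, which is not necessarily the measure used by Tahmasebi & Risse, and the units are invented.

```python
def jaccard(a, b):
    """Jaccard overlap of two non-empty word sets."""
    return len(a & b) / len(a | b)

def relate_units(units_by_slice, threshold=0.3):
    """Link units (word clusters) in consecutive time slices by Jaccard overlap."""
    links = []
    slices = sorted(units_by_slice)
    for earlier, later in zip(slices, slices[1:]):
        for i, u in enumerate(units_by_slice[earlier]):
            for j, v in enumerate(units_by_slice[later]):
                if jaccard(u, v) >= threshold:
                    links.append(((earlier, i), (later, j)))
    return links

# Hypothetical units for the word "rock" in three time slices.
units = {
    1950: [{"stone", "boulder", "cliff"}],
    1970: [{"stone", "boulder", "pebble"}, {"music", "band", "roll"}],
    1990: [{"stone", "pebble", "cliff"}, {"music", "band", "guitar"}],
}
links = relate_units(units)
print(links)
```

Chaining the links gives paths: here the "stone" unit is followed from 1950 to 1990, while the "music" unit first appears in 1970, which would correspond to an added sense.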
Complexity
O(|S|T)
Type-based embedding methods
w
Sentence with w and more
Different sentence with w and more
Last sentence with w and more
Token-based embedding methods
w
w
w
Sentence with w and more
Different sentence with w and more
Last sentence with w and more
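The difference between the two families can be sketched with toy vectors: a token-based method keeps one vector per occurrence of w, while a type-based method ends up with a single vector for w. The sketch assumes, purely for illustration, that an occurrence is represented by the mean of the other words in its sentence; real token-based methods would use a contextual language model instead. All vectors and names are hypothetical.

```python
def mean(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]

# Hypothetical 2-d vectors for context words (illustrative values only).
word_vectors = {
    "stone": [1.0, 0.0], "hard": [0.8, 0.2],
    "music": [0.0, 1.0], "loud": [0.2, 0.8],
}
sentences = [["hard", "rock", "stone"], ["loud", "rock", "music"], ["rock", "music"]]

def token_vector(tokens, target):
    """One vector per occurrence: the mean of the other words in the sentence."""
    return mean([word_vectors[t] for t in tokens if t != target])

token_vectors = [token_vector(s, "rock") for s in sentences]  # token-based: 3 vectors
type_vector = mean(token_vectors)                             # type-based: 1 vector
print(token_vectors, type_vector)
```

The single type vector lands between the "stone" and "music" usages, which illustrates why type-based representations blur senses, whereas the token vectors still need to be grouped before they say anything about senses.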
Evaluation: 3 ways
Top/bottom results
Pre-determined list of:
• Positive examples
• Negative examples
• Pairs
Controlled data
Nina Tahmasebi, Digital Literacy, Sept. 2020
Summary of methods
• Most co-occurrence methods
  • are outperformed by type-embeddings
• Type-embeddings
  • average embeddings
  • need alignment across corpora
  • need large amounts of data
• Dynamic embeddings
  • ‘remember’ too much historical
• Topic-based methods
  • have little correspondence to senses
  • (and run badly on very large datasets)
• WSI-based methods
  • typically have too low coverage
• Contextual embeddings
  • need to be clustered into senses
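To illustrate the last point: once token vectors have been clustered into senses, the sense usage of a word in two periods can be compared as two distributions. The sketch assumes the clustering has already been done (the labels below are invented) and uses Jensen-Shannon divergence as the comparison, which is one possible choice rather than a prescribed one.

```python
import math
from collections import Counter

def distribution(labels, senses):
    """Relative frequency of each sense among the given cluster labels."""
    counts = Counter(labels)
    return [counts[s] / len(labels) for s in senses]

def jensen_shannon(p, q):
    """Jensen-Shannon divergence (base 2), between 0 and 1."""
    m = [(a + b) / 2 for a, b in zip(p, q)]
    def kl(x, y):
        return sum(a * math.log2(a / b) for a, b in zip(x, y) if a > 0)
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

# Hypothetical cluster labels for occurrences of "rock" in two periods.
labels_t1 = ["stone", "stone", "stone", "music"]
labels_t2 = ["music", "music", "music", "stone"]
senses = sorted(set(labels_t1) | set(labels_t2))

p, q = distribution(labels_t1, senses), distribution(labels_t2, senses)
print(f"JSD = {jensen_shannon(p, q):.3f}")
```

A value near 0 means the sense usage is unchanged and a value near 1 means the two periods use entirely different senses.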
Requirements
• Large & small data
• Separate senses
• Disambiguate instances
Thank you!
Nina.tahmasebi@gu.se
nina@tahmasebi.se
