SlideShare uma empresa Scribd logo
1 de 12
Does Personalisation Benefit
Everyone in the Same Way?
M. Rami Ghorab
Postdoc, School of Computer Science & Statistics,
Trinity College Dublin
Today’s Web
Monolingual & Multilingual
Users
Searching across
Multilingual Content
• Diverse linguistic backgrounds
• Different language capabilities
• Different language preferences
We want to personalise search, given these characteristics
• Various languages.
• Relevant content – which lang?
• User Modelling
– Search interests (keywords) that span across multiple languages.
– Grouped into language fragments.
• Adapting Results in Multilingual Web Search
– Merging and Re-ranking the results.
– Translating where necessary.
Extending Personalisation
into the Multilingual Dimension
Personalised Multilingual Information Retrieval (PMIR)
User Modelling
Native Language
Familiar Languages
Preferred Language
Attributes
Structure
Result Lists
(English, French, German)
Ranked separately
against keywords
in User Model fragment
(textual similarity)
Re-ranked Result Lists
(English, French, German)
Merged & Translated List
Research Question - Revisited
Would multilingual search personalisation algorithms
achieve the same degree of improvements
for all search queries, regardless of query language?
• Evaluate the retrieval effectiveness of the multilingual
search personalisation algorithms (User Modelling
and Result Adaptation).
• Determine whether the algorithms achieve the same
degree of effectiveness for users who have different
language preferences (examine English vs. Non-
English users).
Experiment - Objectives
Experiment - Setup
Phase 2: Result Pooling
• Last query reserved for testing.
• Construct the user models.
• Generate various result lists.
Phase 3: Relevance Judgments
• 4-point scale of relevance
(not relevant / somewhat relevant /
relevant / very relevant)
Phase 4: Evaluation
• Metric: Mean Average Precision (MAP).
• Measures effectiveness of each
algorithm across all test queries
Phase 1: User Participation
• Sign up – language preferences.
• Two search topics.
• Use baseline multilingual Web search.
• Submit findings about topic.
Experiment - Results
MAP Improvements over Baseline
for various result list positions (cut-off points @5..@20)
Understanding the Results
List
Position
English
Non-
English
%
English over Non-
English
P@5 0.58 0.45 29.15%
P@10 0.55 0.49 11.54%
P@15 0.51 0.45 14.46%
P@20 0.50 0.48 3.71%
Baseline (non-personalised) Precision Scores
• Does personalisation benefit everyone in the same way?
– No.
– Multilingual search adaptation algorithms work differently with users of
different language preferences/capabilities.
• Recommendation
– Personalised Search systems should adopt different personalisation
strategies for certain languages or groups of languages.
• Future Work
– Concept-based user models (multilingual ontology or web taxonomy).
Conclusion & Future Work
Thank You
This research is supported by
the Science Foundation Ireland (Grant 12/CE/I2267)
as part of the Centre for Next Generation Localisation
(www.cngl.ie) at Trinity College, Dublin.

Mais conteúdo relacionado

Semelhante a Presentation at joint PIA workshop at UMAP 2014

Gpt1 and 2 model review
Gpt1 and 2 model reviewGpt1 and 2 model review
Gpt1 and 2 model reviewSeoung-Ho Choi
 
Vectorized Intent of Multilingual Large Language Models.pptx
Vectorized Intent of Multilingual Large Language Models.pptxVectorized Intent of Multilingual Large Language Models.pptx
Vectorized Intent of Multilingual Large Language Models.pptxSachinAngre3
 
Tech capabilities with_sa
Tech capabilities with_saTech capabilities with_sa
Tech capabilities with_saRobert Martin
 
An introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolboxAn introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolboxElasticsearch
 
Ch1 language design issue
Ch1 language design issueCh1 language design issue
Ch1 language design issueJigisha Pandya
 
Math-Bridge Student Interface
Math-Bridge Student InterfaceMath-Bridge Student Interface
Math-Bridge Student Interfacemetamath
 
Learning Activity 1_ Viteri Flores_Arlyn Johanna
Learning Activity 1_ Viteri Flores_Arlyn JohannaLearning Activity 1_ Viteri Flores_Arlyn Johanna
Learning Activity 1_ Viteri Flores_Arlyn Johannaajviteri1
 
UX Research Proposal - Microsoft Word - Regional differences among users
UX Research Proposal - Microsoft Word - Regional differences among usersUX Research Proposal - Microsoft Word - Regional differences among users
UX Research Proposal - Microsoft Word - Regional differences among usersAnusha Radhakrishnan
 
Improving evaluations and utilization with statistical edge nested data desi...
Improving evaluations and utilization with statistical edge  nested data desi...Improving evaluations and utilization with statistical edge  nested data desi...
Improving evaluations and utilization with statistical edge nested data desi...CesToronto
 
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...TAUS - The Language Data Network
 
Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Lifeng (Aaron) Han
 
Successful Single-Source Content Development
Successful Single-Source Content Development Successful Single-Source Content Development
Successful Single-Source Content Development Xyleme
 
Global SEO: How to Enhance Your Website's International User Experience with ...
Global SEO: How to Enhance Your Website's International User Experience with ...Global SEO: How to Enhance Your Website's International User Experience with ...
Global SEO: How to Enhance Your Website's International User Experience with ...Lionbridge
 
Translation and localization process optimization - www.konsul.info
Translation and localization process optimization - www.konsul.infoTranslation and localization process optimization - www.konsul.info
Translation and localization process optimization - www.konsul.infoDamian Pajnkiher
 
Single-Sourcing and Localization stc16
Single-Sourcing and Localization stc16Single-Sourcing and Localization stc16
Single-Sourcing and Localization stc16Laura Dent
 
Meta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methodsMeta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methodsLifeng (Aaron) Han
 
Improving Translator Productivity with MT: A Patent Translation Case Study
Improving Translator Productivity with MT: A Patent Translation Case StudyImproving Translator Productivity with MT: A Patent Translation Case Study
Improving Translator Productivity with MT: A Patent Translation Case StudyIconic Translation Machines
 
2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education
2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education 2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education
2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education Carmen Lizy Lamboy-Naughton, Ed.D.
 

Semelhante a Presentation at joint PIA workshop at UMAP 2014 (20)

Gpt1 and 2 model review
Gpt1 and 2 model reviewGpt1 and 2 model review
Gpt1 and 2 model review
 
Vectorized Intent of Multilingual Large Language Models.pptx
Vectorized Intent of Multilingual Large Language Models.pptxVectorized Intent of Multilingual Large Language Models.pptx
Vectorized Intent of Multilingual Large Language Models.pptx
 
Content Profiling - Sharon O'Brien (DCU)
Content Profiling - Sharon O'Brien (DCU)Content Profiling - Sharon O'Brien (DCU)
Content Profiling - Sharon O'Brien (DCU)
 
Tech capabilities with_sa
Tech capabilities with_saTech capabilities with_sa
Tech capabilities with_sa
 
Unit 5f.pptx
Unit 5f.pptxUnit 5f.pptx
Unit 5f.pptx
 
An introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolboxAn introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolbox
 
Ch1 language design issue
Ch1 language design issueCh1 language design issue
Ch1 language design issue
 
Math-Bridge Student Interface
Math-Bridge Student InterfaceMath-Bridge Student Interface
Math-Bridge Student Interface
 
Learning Activity 1_ Viteri Flores_Arlyn Johanna
Learning Activity 1_ Viteri Flores_Arlyn JohannaLearning Activity 1_ Viteri Flores_Arlyn Johanna
Learning Activity 1_ Viteri Flores_Arlyn Johanna
 
UX Research Proposal - Microsoft Word - Regional differences among users
UX Research Proposal - Microsoft Word - Regional differences among usersUX Research Proposal - Microsoft Word - Regional differences among users
UX Research Proposal - Microsoft Word - Regional differences among users
 
Improving evaluations and utilization with statistical edge nested data desi...
Improving evaluations and utilization with statistical edge  nested data desi...Improving evaluations and utilization with statistical edge  nested data desi...
Improving evaluations and utilization with statistical edge nested data desi...
 
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
 
Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...
 
Successful Single-Source Content Development
Successful Single-Source Content Development Successful Single-Source Content Development
Successful Single-Source Content Development
 
Global SEO: How to Enhance Your Website's International User Experience with ...
Global SEO: How to Enhance Your Website's International User Experience with ...Global SEO: How to Enhance Your Website's International User Experience with ...
Global SEO: How to Enhance Your Website's International User Experience with ...
 
Translation and localization process optimization - www.konsul.info
Translation and localization process optimization - www.konsul.infoTranslation and localization process optimization - www.konsul.info
Translation and localization process optimization - www.konsul.info
 
Single-Sourcing and Localization stc16
Single-Sourcing and Localization stc16Single-Sourcing and Localization stc16
Single-Sourcing and Localization stc16
 
Meta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methodsMeta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methods
 
Improving Translator Productivity with MT: A Patent Translation Case Study
Improving Translator Productivity with MT: A Patent Translation Case StudyImproving Translator Productivity with MT: A Patent Translation Case Study
Improving Translator Productivity with MT: A Patent Translation Case Study
 
2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education
2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education 2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education
2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education
 

Último

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 

Último (20)

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 

Presentation at joint PIA workshop at UMAP 2014

  • 1. Does Personalisation Benefit Everyone in the Same Way? M. Rami Ghorab Postdoc, School of Computer Science & Statistics, Trinity College Dublin
  • 2. Today’s Web Monolingual & Multilingual Users Searching across Multilingual Content • Diverse linguistic backgrounds • Different language capabilities • Different language preferences We want to personalise search, given these characteristics • Various languages. • Relevant content – which lang?
  • 3. • User Modelling – Search interests (keywords) that span across multiple languages. – Grouped into language fragments. • Adapting Results in Multilingual Web Search – Merging and Re-ranking the results. – Translating where necessary. Extending Personalisation into the Multilingual Dimension Personalised Multilingual Information Retrieval (PMIR)
  • 4. User Modelling Native Language Familiar Languages Preferred Language Attributes Structure
  • 5. Result Lists (English, French, German) Ranked separately against keywords in User Model fragment (textual similarity) Re-ranked Result Lists (English, French, German) Merged & Translated List
  • 6. Research Question - Revisited Would multilingual search personalisation algorithms achieve the same degree of improvements for all search queries, regardless of query language?
  • 7. • Evaluate the retrieval effectiveness of the multilingual search personalisation algorithms (User Modelling and Result Adaptation). • Determine whether the algorithms achieve the same degree of effectiveness for users who have different language preferences (examine English vs. Non- English users). Experiment - Objectives
  • 8. Experiment - Setup Phase 2: Result Pooling • Last query reserved for testing. • Construct the user models. • Generate various result lists. Phase 3: Relevance Judgments • 4-point scale of relevance (not relevant / somewhat relevant / relevant / very relevant) Phase 4: Evaluation • Metric: Mean Average Precision (MAP). • Measures effectiveness of each algorithm across all test queries Phase 1: User Participation • Sign up – language preferences. • Two search topics. • Use baseline multilingual Web search. • Submit findings about topic.
  • 9. Experiment - Results MAP Improvements over Baseline for various result list positions (cut-off points @5..@20)
  • 10. Understanding the Results List Position English Non- English % English over Non- English P@5 0.58 0.45 29.15% P@10 0.55 0.49 11.54% P@15 0.51 0.45 14.46% P@20 0.50 0.48 3.71% Baseline (non-personalised) Precision Scores
  • 11. • Does personalisation benefit everyone in the same way? – No. – Multilingual search adaptation algorithms work differently with users of different language preferences/capabilities. • Recommendation – Personalised Search systems should adopt different personalisation strategies for certain languages or groups of languages. • Future Work – Concept-based user models (multilingual ontology or web taxonomy). Conclusion & Future Work
  • 12. Thank You This research is supported by the Science Foundation Ireland (Grant 12/CE/I2267) as part of the Centre for Next Generation Localisation (www.cngl.ie) at Trinity College, Dublin.