SlideShare uma empresa Scribd logo
1 de 12
Does Personalisation Benefit
Everyone in the Same Way?
M. Rami Ghorab
Postdoc, School of Computer Science & Statistics,
Trinity College Dublin
Today’s Web
Monolingual & Multilingual
Users
Searching across
Multilingual Content
• Diverse linguistic backgrounds
• Different language capabilities
• Different language preferences
We want to personalise search, given these characteristics
• Various languages.
• Relevant content – which lang?
• User Modelling
– Search interests (keywords) that span across multiple languages.
– Grouped into language fragments.
• Adapting Results in Multilingual Web Search
– Merging and Re-ranking the results.
– Translating where necessary.
Extending Personalisation
into the Multilingual Dimension
Personalised Multilingual Information Retrieval (PMIR)
User Modelling
Native Language
Familiar Languages
Preferred Language
Attributes
Structure
Result Lists
(English, French, German)
Ranked separately
against keywords
in User Model fragment
(textual similarity)
Re-ranked Result Lists
(English, French, German)
Merged & Translated List
Research Question - Revisited
Would multilingual search personalisation algorithms
achieve the same degree of improvements
for all search queries, regardless of query language?
• Evaluate the retrieval effectiveness of the multilingual
search personalisation algorithms (User Modelling
and Result Adaptation).
• Determine whether the algorithms achieve the same
degree of effectiveness for users who have different
language preferences (examine English vs. Non-
English users).
Experiment - Objectives
Experiment - Setup
Phase 2: Result Pooling
• Last query reserved for testing.
• Construct the user models.
• Generate various result lists.
Phase 3: Relevance Judgments
• 4-point scale of relevance
(not relevant / somewhat relevant /
relevant / very relevant)
Phase 4: Evaluation
• Metric: Mean Average Precision (MAP).
• Measures effectiveness of each
algorithm across all test queries
Phase 1: User Participation
• Sign up – language preferences.
• Two search topics.
• Use baseline multilingual Web search.
• Submit findings about topic.
Experiment - Results
MAP Improvements over Baseline
for various result list positions (cut-off points @5..@20)
Understanding the Results
List
Position
English
Non-
English
%
English over Non-
English
P@5 0.58 0.45 29.15%
P@10 0.55 0.49 11.54%
P@15 0.51 0.45 14.46%
P@20 0.50 0.48 3.71%
Baseline (non-personalised) Precision Scores
• Does personalisation benefit everyone in the same way?
– No.
– Multilingual search adaptation algorithms work differently with users of
different language preferences/capabilities.
• Recommendation
– Personalised Search systems should adopt different personalisation
strategies for certain languages or groups of languages.
• Future Work
– Concept-based user models (multilingual ontology or web taxonomy).
Conclusion & Future Work
Thank You
This research is supported by
the Science Foundation Ireland (Grant 12/CE/I2267)
as part of the Centre for Next Generation Localisation
(www.cngl.ie) at Trinity College, Dublin.

Mais conteúdo relacionado

Semelhante a Presentation at joint PIA workshop at UMAP 2014

Gpt1 and 2 model review
Gpt1 and 2 model reviewGpt1 and 2 model review
Gpt1 and 2 model reviewSeoung-Ho Choi
 
Vectorized Intent of Multilingual Large Language Models.pptx
Vectorized Intent of Multilingual Large Language Models.pptxVectorized Intent of Multilingual Large Language Models.pptx
Vectorized Intent of Multilingual Large Language Models.pptxSachinAngre3
 
Tech capabilities with_sa
Tech capabilities with_saTech capabilities with_sa
Tech capabilities with_saRobert Martin
 
An introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolboxAn introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolboxElasticsearch
 
Ch1 language design issue
Ch1 language design issueCh1 language design issue
Ch1 language design issueJigisha Pandya
 
Math-Bridge Student Interface
Math-Bridge Student InterfaceMath-Bridge Student Interface
Math-Bridge Student Interfacemetamath
 
Learning Activity 1_ Viteri Flores_Arlyn Johanna
Learning Activity 1_ Viteri Flores_Arlyn JohannaLearning Activity 1_ Viteri Flores_Arlyn Johanna
Learning Activity 1_ Viteri Flores_Arlyn Johannaajviteri1
 
UX Research Proposal - Microsoft Word - Regional differences among users
UX Research Proposal - Microsoft Word - Regional differences among usersUX Research Proposal - Microsoft Word - Regional differences among users
UX Research Proposal - Microsoft Word - Regional differences among usersAnusha Radhakrishnan
 
Improving evaluations and utilization with statistical edge nested data desi...
Improving evaluations and utilization with statistical edge  nested data desi...Improving evaluations and utilization with statistical edge  nested data desi...
Improving evaluations and utilization with statistical edge nested data desi...CesToronto
 
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...TAUS - The Language Data Network
 
Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Lifeng (Aaron) Han
 
Successful Single-Source Content Development
Successful Single-Source Content Development Successful Single-Source Content Development
Successful Single-Source Content Development Xyleme
 
Global SEO: How to Enhance Your Website's International User Experience with ...
Global SEO: How to Enhance Your Website's International User Experience with ...Global SEO: How to Enhance Your Website's International User Experience with ...
Global SEO: How to Enhance Your Website's International User Experience with ...Lionbridge
 
Translation and localization process optimization - www.konsul.info
Translation and localization process optimization - www.konsul.infoTranslation and localization process optimization - www.konsul.info
Translation and localization process optimization - www.konsul.infoDamian Pajnkiher
 
Single-Sourcing and Localization stc16
Single-Sourcing and Localization stc16Single-Sourcing and Localization stc16
Single-Sourcing and Localization stc16Laura Dent
 
Meta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methodsMeta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methodsLifeng (Aaron) Han
 
Improving Translator Productivity with MT: A Patent Translation Case Study
Improving Translator Productivity with MT: A Patent Translation Case StudyImproving Translator Productivity with MT: A Patent Translation Case Study
Improving Translator Productivity with MT: A Patent Translation Case StudyIconic Translation Machines
 
2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education
2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education 2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education
2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education Carmen Lizy Lamboy-Naughton, Ed.D.
 

Semelhante a Presentation at joint PIA workshop at UMAP 2014 (20)

Gpt1 and 2 model review
Gpt1 and 2 model reviewGpt1 and 2 model review
Gpt1 and 2 model review
 
Vectorized Intent of Multilingual Large Language Models.pptx
Vectorized Intent of Multilingual Large Language Models.pptxVectorized Intent of Multilingual Large Language Models.pptx
Vectorized Intent of Multilingual Large Language Models.pptx
 
Content Profiling - Sharon O'Brien (DCU)
Content Profiling - Sharon O'Brien (DCU)Content Profiling - Sharon O'Brien (DCU)
Content Profiling - Sharon O'Brien (DCU)
 
Tech capabilities with_sa
Tech capabilities with_saTech capabilities with_sa
Tech capabilities with_sa
 
Unit 5f.pptx
Unit 5f.pptxUnit 5f.pptx
Unit 5f.pptx
 
An introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolboxAn introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolbox
 
Ch1 language design issue
Ch1 language design issueCh1 language design issue
Ch1 language design issue
 
Math-Bridge Student Interface
Math-Bridge Student InterfaceMath-Bridge Student Interface
Math-Bridge Student Interface
 
Learning Activity 1_ Viteri Flores_Arlyn Johanna
Learning Activity 1_ Viteri Flores_Arlyn JohannaLearning Activity 1_ Viteri Flores_Arlyn Johanna
Learning Activity 1_ Viteri Flores_Arlyn Johanna
 
UX Research Proposal - Microsoft Word - Regional differences among users
UX Research Proposal - Microsoft Word - Regional differences among usersUX Research Proposal - Microsoft Word - Regional differences among users
UX Research Proposal - Microsoft Word - Regional differences among users
 
Improving evaluations and utilization with statistical edge nested data desi...
Improving evaluations and utilization with statistical edge  nested data desi...Improving evaluations and utilization with statistical edge  nested data desi...
Improving evaluations and utilization with statistical edge nested data desi...
 
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
 
Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...
 
Successful Single-Source Content Development
Successful Single-Source Content Development Successful Single-Source Content Development
Successful Single-Source Content Development
 
Global SEO: How to Enhance Your Website's International User Experience with ...
Global SEO: How to Enhance Your Website's International User Experience with ...Global SEO: How to Enhance Your Website's International User Experience with ...
Global SEO: How to Enhance Your Website's International User Experience with ...
 
Translation and localization process optimization - www.konsul.info
Translation and localization process optimization - www.konsul.infoTranslation and localization process optimization - www.konsul.info
Translation and localization process optimization - www.konsul.info
 
Single-Sourcing and Localization stc16
Single-Sourcing and Localization stc16Single-Sourcing and Localization stc16
Single-Sourcing and Localization stc16
 
Meta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methodsMeta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methods
 
Improving Translator Productivity with MT: A Patent Translation Case Study
Improving Translator Productivity with MT: A Patent Translation Case StudyImproving Translator Productivity with MT: A Patent Translation Case Study
Improving Translator Productivity with MT: A Patent Translation Case Study
 
2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education
2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education 2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education
2004 ISETL Speak English? ¿Habla Español? A Bilingual Model in Higher Education
 

Último

The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 

Último (20)

The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 

Presentation at joint PIA workshop at UMAP 2014

  • 1. Does Personalisation Benefit Everyone in the Same Way? M. Rami Ghorab Postdoc, School of Computer Science & Statistics, Trinity College Dublin
  • 2. Today’s Web Monolingual & Multilingual Users Searching across Multilingual Content • Diverse linguistic backgrounds • Different language capabilities • Different language preferences We want to personalise search, given these characteristics • Various languages. • Relevant content – which lang?
  • 3. • User Modelling – Search interests (keywords) that span across multiple languages. – Grouped into language fragments. • Adapting Results in Multilingual Web Search – Merging and Re-ranking the results. – Translating where necessary. Extending Personalisation into the Multilingual Dimension Personalised Multilingual Information Retrieval (PMIR)
  • 4. User Modelling Native Language Familiar Languages Preferred Language Attributes Structure
  • 5. Result Lists (English, French, German) Ranked separately against keywords in User Model fragment (textual similarity) Re-ranked Result Lists (English, French, German) Merged & Translated List
  • 6. Research Question - Revisited Would multilingual search personalisation algorithms achieve the same degree of improvements for all search queries, regardless of query language?
  • 7. • Evaluate the retrieval effectiveness of the multilingual search personalisation algorithms (User Modelling and Result Adaptation). • Determine whether the algorithms achieve the same degree of effectiveness for users who have different language preferences (examine English vs. Non- English users). Experiment - Objectives
  • 8. Experiment - Setup Phase 2: Result Pooling • Last query reserved for testing. • Construct the user models. • Generate various result lists. Phase 3: Relevance Judgments • 4-point scale of relevance (not relevant / somewhat relevant / relevant / very relevant) Phase 4: Evaluation • Metric: Mean Average Precision (MAP). • Measures effectiveness of each algorithm across all test queries Phase 1: User Participation • Sign up – language preferences. • Two search topics. • Use baseline multilingual Web search. • Submit findings about topic.
  • 9. Experiment - Results MAP Improvements over Baseline for various result list positions (cut-off points @5..@20)
  • 10. Understanding the Results List Position English Non- English % English over Non- English P@5 0.58 0.45 29.15% P@10 0.55 0.49 11.54% P@15 0.51 0.45 14.46% P@20 0.50 0.48 3.71% Baseline (non-personalised) Precision Scores
  • 11. • Does personalisation benefit everyone in the same way? – No. – Multilingual search adaptation algorithms work differently with users of different language preferences/capabilities. • Recommendation – Personalised Search systems should adopt different personalisation strategies for certain languages or groups of languages. • Future Work – Concept-based user models (multilingual ontology or web taxonomy). Conclusion & Future Work
  • 12. Thank You This research is supported by the Science Foundation Ireland (Grant 12/CE/I2267) as part of the Centre for Next Generation Localisation (www.cngl.ie) at Trinity College, Dublin.