SlideShare uma empresa Scribd logo
1 de 12
Athena Research and Innovation Center
Yahoo! Research
Diversifying User Comments
on News Articles
2
Problem
Problem description:
Given a news article and the respective set of user comments,
return a subset of the most diverse comments
Perception of a diverse set of comments:
A set of comments that represents different opinions and
sentiments,
…expressed by users with different demographic
characteristics,
…covering different aspects of the news article.
Motivation
Article’s content itself is not always enough to form a complete
view over a topic
The public opinion complements the article and represents
the “wisdom of the crowds”
3
Example
Given a political article:
Find all the subtopics handled
 Persons related
 Events (election, bill voting)
Find all opinions and
sentiments expressed
 Positive/negative/neutral
 On the whole article/on specific
subtopics
Find different kinds of users
commenting
 Different demographics
 Different commenting history on
previous articles
Present a set of comments that
better represents the diversity
of the above dimensions
4
Motivation
Several articles are very
popular (>10000 comments)
Articles get aggregated 
even more comments
Impossible for the reader to
review
Current comment sorting
options are based on more
simple criteria
Date
Votes
Replies
5
Method outline
Define diversification criteria
Dimensions
 Content, Sentiment, Named Entities, User co-commenting
behavior
Define a (dis)similarity function that produces a diversity
score based on the criteria
Quantify the dissimilarity of comments
Weighted sum of cosine similarities on diversity feature
vectors
Apply and iterative heuristic algorithm that, at each step,
selects the candidate comment that maximizes a diversity
objective
6
Method description - Criteria
Content
Baseline diversity criterion
Used in the rest of the literature to diversify search results.
Objective  obtain comments with diverse content.
Processing
 Comments’ text  term vectors
 Document length-normalized tf values
7
Method description - Criteria
Named Entities (Nes)
Person, Organizations, Locations
Many times news articles revolve around Nes
 Even when an article talks about events or situations, usually one
or more Persons or Locations are involved
Objective  obtain comments that cover (uniformly) as many
different NEs as possible
Processing
 Extraction of Nes in comments (Stanford NER)
 Comments’ Nes  term vectors
 Document length-normalized tf values
8
Method description - Criteria
Sentiment
9 classes of sentiment within the interval [-4, 4]
 -4  very negative
 4  very positive
 0  neutral
Expresses users’ opinions on the news articles’ topics.
Objective  obtain comments that cover (uniformly) different
classes of sentiment
Processing
 Sentiment analysis of the comment’s text (SentiStrength)
 Construct sentiment vectors
 Each vector value represents a sentiment class
9
Comment scoring
Cosine similarity function between
A pair of comments
A comment and a set of comments
Apply the similarity function for each criterion separately
Produce a final diversity score as a weighted sum of all
criterion scores
Produce a final score that incorporates comment-to-article
similarity
10
Algorithm (MAXSUM)
Initially
Empty diverse result set  all comments belong to the
candidate set
Arbitrary insertion of a candidate comment into the result set
Greedy construction heuristic
Compare each candidate comment with the centroid
(average) of the current result set
Finish after (k-1) iterations  k comments are inserted
11
Evaluation
Comparison of methods’ coverage on different information
nuggets they contain
Baseline diversification based only on content
Proposed method (combination of multiple criteria)
Proposed methods outperform the baseline
12
Framework - Implementations
A desktop java application retrieving news articles and
comments comments stored in a MySQL database
News and comments obtained by the NY Times API
Arcomem offline module for calculating diverse
WebObjects of WebResources

Mais conteúdo relacionado

Destaque

Programmed instruction my presentation
Programmed instruction my presentationProgrammed instruction my presentation
Programmed instruction my presentationEko Priyanto
 
E Recruitments
E RecruitmentsE Recruitments
E Recruitmentsmaayaa86
 
Programmed Instruction
Programmed InstructionProgrammed Instruction
Programmed Instructiongangothri90
 
Psychological Testing Techniques
Psychological Testing TechniquesPsychological Testing Techniques
Psychological Testing Techniquespsychegames2
 
Programmed learning
Programmed learningProgrammed learning
Programmed learningShyba Salam
 
Programmed instruction
Programmed instructionProgrammed instruction
Programmed instructiondayawana
 
Psychometrics ppt
Psychometrics pptPsychometrics ppt
Psychometrics pptsanthosh357
 
Types of psychological test
Types of psychological testTypes of psychological test
Types of psychological testAbigail Gamboa
 
Basic concepts in psychological testing
Basic concepts in psychological testingBasic concepts in psychological testing
Basic concepts in psychological testingRoi Xcel
 
Nature and use of Psychological Tests
Nature and use of Psychological TestsNature and use of Psychological Tests
Nature and use of Psychological TestsLenie Rose Julia
 
Psychological test meaning, concept, need & importance
Psychological test   meaning, concept, need & importancePsychological test   meaning, concept, need & importance
Psychological test meaning, concept, need & importancejd singh
 
Computer Based Training Methods
Computer Based Training MethodsComputer Based Training Methods
Computer Based Training MethodsDejiaofOrlando
 

Destaque (16)

Psychometric tests
Psychometric testsPsychometric tests
Psychometric tests
 
Programmed instruction my presentation
Programmed instruction my presentationProgrammed instruction my presentation
Programmed instruction my presentation
 
E Recruitments
E RecruitmentsE Recruitments
E Recruitments
 
Programmed Instruction
Programmed InstructionProgrammed Instruction
Programmed Instruction
 
Psychological Testing Techniques
Psychological Testing TechniquesPsychological Testing Techniques
Psychological Testing Techniques
 
e-recruitment
e-recruitmente-recruitment
e-recruitment
 
Programmed learning
Programmed learningProgrammed learning
Programmed learning
 
Programmed instruction
Programmed instructionProgrammed instruction
Programmed instruction
 
Psychometrics ppt
Psychometrics pptPsychometrics ppt
Psychometrics ppt
 
Psychometric Tests
Psychometric TestsPsychometric Tests
Psychometric Tests
 
Types of psychological test
Types of psychological testTypes of psychological test
Types of psychological test
 
Basic concepts in psychological testing
Basic concepts in psychological testingBasic concepts in psychological testing
Basic concepts in psychological testing
 
Nature and use of Psychological Tests
Nature and use of Psychological TestsNature and use of Psychological Tests
Nature and use of Psychological Tests
 
Psychological test meaning, concept, need & importance
Psychological test   meaning, concept, need & importancePsychological test   meaning, concept, need & importance
Psychological test meaning, concept, need & importance
 
Psychometric Assessment
Psychometric Assessment Psychometric Assessment
Psychometric Assessment
 
Computer Based Training Methods
Computer Based Training MethodsComputer Based Training Methods
Computer Based Training Methods
 

Semelhante a Arcomem training diversification

Text Analysis – Current Educational Philosophy Issue Your Text a.docx
Text Analysis – Current Educational Philosophy Issue Your Text a.docxText Analysis – Current Educational Philosophy Issue Your Text a.docx
Text Analysis – Current Educational Philosophy Issue Your Text a.docxmehek4
 
Enc 1102 preliminary genre analysis workshop
Enc 1102 preliminary genre analysis workshopEnc 1102 preliminary genre analysis workshop
Enc 1102 preliminary genre analysis workshopLaura Martinez
 
What is an Exploratory PaperExploratory Argument In explorat.docx
What is an Exploratory PaperExploratory Argument In explorat.docxWhat is an Exploratory PaperExploratory Argument In explorat.docx
What is an Exploratory PaperExploratory Argument In explorat.docxalanfhall8953
 
Content Analysis Overview for Persona Development
Content Analysis Overview for Persona DevelopmentContent Analysis Overview for Persona Development
Content Analysis Overview for Persona DevelopmentPamela Rutledge
 
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEWSENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEWJournal For Research
 
IMPORTANCE OF THE MEDIA / TUTORIALOUTLET DOT COM
IMPORTANCE OF THE MEDIA / TUTORIALOUTLET DOT COMIMPORTANCE OF THE MEDIA / TUTORIALOUTLET DOT COM
IMPORTANCE OF THE MEDIA / TUTORIALOUTLET DOT COMalbert0068
 
SOC110HM Course Paper Guidelines Option 1 Journal A.docx
SOC110HM  Course Paper Guidelines  Option 1 Journal A.docxSOC110HM  Course Paper Guidelines  Option 1 Journal A.docx
SOC110HM Course Paper Guidelines Option 1 Journal A.docxwhitneyleman54422
 
Sentiment Analysis Using Hybrid Approach: A Survey
Sentiment Analysis Using Hybrid Approach: A SurveySentiment Analysis Using Hybrid Approach: A Survey
Sentiment Analysis Using Hybrid Approach: A SurveyIJERA Editor
 
9Developing a Critical Analysis of a TextKeywordsCriti
9Developing a Critical Analysis of a TextKeywordsCriti9Developing a Critical Analysis of a TextKeywordsCriti
9Developing a Critical Analysis of a TextKeywordsCritiromeliadoan
 
Enc 1102 summer b july 2nd
Enc 1102 summer b july 2ndEnc 1102 summer b july 2nd
Enc 1102 summer b july 2ndLaura Martinez
 
Dr.saleem gul assignment summary
Dr.saleem gul assignment summaryDr.saleem gul assignment summary
Dr.saleem gul assignment summaryJaved Riza
 
The Open Debate Engine
The Open Debate EngineThe Open Debate Engine
The Open Debate EngineGreg Schnippel
 
As we move into more in-depth discussion of media literacy, we wil.docx
As we move into more in-depth discussion of media literacy, we wil.docxAs we move into more in-depth discussion of media literacy, we wil.docx
As we move into more in-depth discussion of media literacy, we wil.docxwildmandelorse
 
OverviewWrite a proposal that describes the text you will an.docx
OverviewWrite a proposal that describes the text you will an.docxOverviewWrite a proposal that describes the text you will an.docx
OverviewWrite a proposal that describes the text you will an.docxhoney690131
 
TopicAs we move into more in-depth discussion of media literacy.docx
TopicAs we move into more in-depth discussion of media literacy.docxTopicAs we move into more in-depth discussion of media literacy.docx
TopicAs we move into more in-depth discussion of media literacy.docxTakishaPeck109
 

Semelhante a Arcomem training diversification (20)

Anu paper(IJARCCE)
Anu paper(IJARCCE)Anu paper(IJARCCE)
Anu paper(IJARCCE)
 
Synthesis part1
Synthesis part1Synthesis part1
Synthesis part1
 
Text Analysis – Current Educational Philosophy Issue Your Text a.docx
Text Analysis – Current Educational Philosophy Issue Your Text a.docxText Analysis – Current Educational Philosophy Issue Your Text a.docx
Text Analysis – Current Educational Philosophy Issue Your Text a.docx
 
Enc 1102 september 11
Enc 1102 september 11Enc 1102 september 11
Enc 1102 september 11
 
Enc 1102 preliminary genre analysis workshop
Enc 1102 preliminary genre analysis workshopEnc 1102 preliminary genre analysis workshop
Enc 1102 preliminary genre analysis workshop
 
What is an Exploratory PaperExploratory Argument In explorat.docx
What is an Exploratory PaperExploratory Argument In explorat.docxWhat is an Exploratory PaperExploratory Argument In explorat.docx
What is an Exploratory PaperExploratory Argument In explorat.docx
 
Content Analysis Overview for Persona Development
Content Analysis Overview for Persona DevelopmentContent Analysis Overview for Persona Development
Content Analysis Overview for Persona Development
 
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEWSENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
 
IMPORTANCE OF THE MEDIA / TUTORIALOUTLET DOT COM
IMPORTANCE OF THE MEDIA / TUTORIALOUTLET DOT COMIMPORTANCE OF THE MEDIA / TUTORIALOUTLET DOT COM
IMPORTANCE OF THE MEDIA / TUTORIALOUTLET DOT COM
 
SOC110HM Course Paper Guidelines Option 1 Journal A.docx
SOC110HM  Course Paper Guidelines  Option 1 Journal A.docxSOC110HM  Course Paper Guidelines  Option 1 Journal A.docx
SOC110HM Course Paper Guidelines Option 1 Journal A.docx
 
Sentiment Analysis Using Hybrid Approach: A Survey
Sentiment Analysis Using Hybrid Approach: A SurveySentiment Analysis Using Hybrid Approach: A Survey
Sentiment Analysis Using Hybrid Approach: A Survey
 
9Developing a Critical Analysis of a TextKeywordsCriti
9Developing a Critical Analysis of a TextKeywordsCriti9Developing a Critical Analysis of a TextKeywordsCriti
9Developing a Critical Analysis of a TextKeywordsCriti
 
Writing a Research - Part 1
Writing a Research - Part 1Writing a Research - Part 1
Writing a Research - Part 1
 
Enc 1102 summer b july 2nd
Enc 1102 summer b july 2ndEnc 1102 summer b july 2nd
Enc 1102 summer b july 2nd
 
P1803018289
P1803018289P1803018289
P1803018289
 
Dr.saleem gul assignment summary
Dr.saleem gul assignment summaryDr.saleem gul assignment summary
Dr.saleem gul assignment summary
 
The Open Debate Engine
The Open Debate EngineThe Open Debate Engine
The Open Debate Engine
 
As we move into more in-depth discussion of media literacy, we wil.docx
As we move into more in-depth discussion of media literacy, we wil.docxAs we move into more in-depth discussion of media literacy, we wil.docx
As we move into more in-depth discussion of media literacy, we wil.docx
 
OverviewWrite a proposal that describes the text you will an.docx
OverviewWrite a proposal that describes the text you will an.docxOverviewWrite a proposal that describes the text you will an.docx
OverviewWrite a proposal that describes the text you will an.docx
 
TopicAs we move into more in-depth discussion of media literacy.docx
TopicAs we move into more in-depth discussion of media literacy.docxTopicAs we move into more in-depth discussion of media literacy.docx
TopicAs we move into more in-depth discussion of media literacy.docx
 

Mais de arcomem

Arcomem training – Enrichment Advanced (update)
Arcomem training – Enrichment Advanced (update)Arcomem training – Enrichment Advanced (update)
Arcomem training – Enrichment Advanced (update)arcomem
 
Arcomem training – Enrichment Beginner (update)
Arcomem training – Enrichment Beginner (update)Arcomem training – Enrichment Beginner (update)
Arcomem training – Enrichment Beginner (update)arcomem
 
Arcomem training Specifying Crawls Advanced
Arcomem training Specifying Crawls AdvancedArcomem training Specifying Crawls Advanced
Arcomem training Specifying Crawls Advancedarcomem
 
Arcomem training Specifying Crawls Beginners
Arcomem training Specifying Crawls BeginnersArcomem training Specifying Crawls Beginners
Arcomem training Specifying Crawls Beginnersarcomem
 
Arcomem training Topic Analysis Models advanced
Arcomem training Topic Analysis Models advancedArcomem training Topic Analysis Models advanced
Arcomem training Topic Analysis Models advancedarcomem
 
Arcomem training Topic Analysis Models beginners
Arcomem training Topic Analysis Models beginnersArcomem training Topic Analysis Models beginners
Arcomem training Topic Analysis Models beginnersarcomem
 
Arcomem training Twitter Domain Experts advanced
Arcomem training Twitter Domain Experts advancedArcomem training Twitter Domain Experts advanced
Arcomem training Twitter Domain Experts advancedarcomem
 
Arcomem training Cultural Analysis Advanced
Arcomem training Cultural Analysis AdvancedArcomem training Cultural Analysis Advanced
Arcomem training Cultural Analysis Advancedarcomem
 
Arcomem training Cultural Analysis Beginner
Arcomem training Cultural Analysis BeginnerArcomem training Cultural Analysis Beginner
Arcomem training Cultural Analysis Beginnerarcomem
 
Arcomem training twitter-dynamics_advanced
Arcomem training twitter-dynamics_advancedArcomem training twitter-dynamics_advanced
Arcomem training twitter-dynamics_advancedarcomem
 
Arcomem training system-overview_advanced
Arcomem training system-overview_advancedArcomem training system-overview_advanced
Arcomem training system-overview_advancedarcomem
 
Arcomem training specifying-crawls
Arcomem training specifying-crawlsArcomem training specifying-crawls
Arcomem training specifying-crawlsarcomem
 
Arcomem training simple-text-mining_beginner
Arcomem training simple-text-mining_beginnerArcomem training simple-text-mining_beginner
Arcomem training simple-text-mining_beginnerarcomem
 
Arcomem training opinions_advanced
Arcomem training opinions_advancedArcomem training opinions_advanced
Arcomem training opinions_advancedarcomem
 
Arcomem training neer_beginner
Arcomem training neer_beginnerArcomem training neer_beginner
Arcomem training neer_beginnerarcomem
 
Arcomem training neer_advanced
Arcomem training neer_advancedArcomem training neer_advanced
Arcomem training neer_advancedarcomem
 
Arcomem training heritrix_beginner
Arcomem training heritrix_beginnerArcomem training heritrix_beginner
Arcomem training heritrix_beginnerarcomem
 
Arcomem training heritrix_advanced
Arcomem training heritrix_advancedArcomem training heritrix_advanced
Arcomem training heritrix_advancedarcomem
 
Arcomem training entities-and-events_advanced
Arcomem training entities-and-events_advancedArcomem training entities-and-events_advanced
Arcomem training entities-and-events_advancedarcomem
 
Arcomem training enrichment_beginner
Arcomem training enrichment_beginnerArcomem training enrichment_beginner
Arcomem training enrichment_beginnerarcomem
 

Mais de arcomem (20)

Arcomem training – Enrichment Advanced (update)
Arcomem training – Enrichment Advanced (update)Arcomem training – Enrichment Advanced (update)
Arcomem training – Enrichment Advanced (update)
 
Arcomem training – Enrichment Beginner (update)
Arcomem training – Enrichment Beginner (update)Arcomem training – Enrichment Beginner (update)
Arcomem training – Enrichment Beginner (update)
 
Arcomem training Specifying Crawls Advanced
Arcomem training Specifying Crawls AdvancedArcomem training Specifying Crawls Advanced
Arcomem training Specifying Crawls Advanced
 
Arcomem training Specifying Crawls Beginners
Arcomem training Specifying Crawls BeginnersArcomem training Specifying Crawls Beginners
Arcomem training Specifying Crawls Beginners
 
Arcomem training Topic Analysis Models advanced
Arcomem training Topic Analysis Models advancedArcomem training Topic Analysis Models advanced
Arcomem training Topic Analysis Models advanced
 
Arcomem training Topic Analysis Models beginners
Arcomem training Topic Analysis Models beginnersArcomem training Topic Analysis Models beginners
Arcomem training Topic Analysis Models beginners
 
Arcomem training Twitter Domain Experts advanced
Arcomem training Twitter Domain Experts advancedArcomem training Twitter Domain Experts advanced
Arcomem training Twitter Domain Experts advanced
 
Arcomem training Cultural Analysis Advanced
Arcomem training Cultural Analysis AdvancedArcomem training Cultural Analysis Advanced
Arcomem training Cultural Analysis Advanced
 
Arcomem training Cultural Analysis Beginner
Arcomem training Cultural Analysis BeginnerArcomem training Cultural Analysis Beginner
Arcomem training Cultural Analysis Beginner
 
Arcomem training twitter-dynamics_advanced
Arcomem training twitter-dynamics_advancedArcomem training twitter-dynamics_advanced
Arcomem training twitter-dynamics_advanced
 
Arcomem training system-overview_advanced
Arcomem training system-overview_advancedArcomem training system-overview_advanced
Arcomem training system-overview_advanced
 
Arcomem training specifying-crawls
Arcomem training specifying-crawlsArcomem training specifying-crawls
Arcomem training specifying-crawls
 
Arcomem training simple-text-mining_beginner
Arcomem training simple-text-mining_beginnerArcomem training simple-text-mining_beginner
Arcomem training simple-text-mining_beginner
 
Arcomem training opinions_advanced
Arcomem training opinions_advancedArcomem training opinions_advanced
Arcomem training opinions_advanced
 
Arcomem training neer_beginner
Arcomem training neer_beginnerArcomem training neer_beginner
Arcomem training neer_beginner
 
Arcomem training neer_advanced
Arcomem training neer_advancedArcomem training neer_advanced
Arcomem training neer_advanced
 
Arcomem training heritrix_beginner
Arcomem training heritrix_beginnerArcomem training heritrix_beginner
Arcomem training heritrix_beginner
 
Arcomem training heritrix_advanced
Arcomem training heritrix_advancedArcomem training heritrix_advanced
Arcomem training heritrix_advanced
 
Arcomem training entities-and-events_advanced
Arcomem training entities-and-events_advancedArcomem training entities-and-events_advanced
Arcomem training entities-and-events_advanced
 
Arcomem training enrichment_beginner
Arcomem training enrichment_beginnerArcomem training enrichment_beginner
Arcomem training enrichment_beginner
 

Último

How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 

Último (20)

How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 

Arcomem training diversification

  • 1. Athena Research and Innovation Center Yahoo! Research Diversifying User Comments on News Articles
  • 2. 2 Problem Problem description: Given a news article and the respective set of user comments, return a subset of the most diverse comments Perception of a diverse set of comments: A set of comments that represents different opinions and sentiments, …expressed by users with different demographic characteristics, …covering different aspects of the news article. Motivation Article’s content itself is not always enough to form a complete view over a topic The public opinion complements the article and represents the “wisdom of the crowds”
  • 3. 3 Example Given a political article: Find all the subtopics handled  Persons related  Events (election, bill voting) Find all opinions and sentiments expressed  Positive/negative/neutral  On the whole article/on specific subtopics Find different kinds of users commenting  Different demographics  Different commenting history on previous articles Present a set of comments that better represents the diversity of the above dimensions
  • 4. 4 Motivation Several articles are very popular (>10000 comments) Articles get aggregated  even more comments Impossible for the reader to review Current comment sorting options are based on more simple criteria Date Votes Replies
  • 5. 5 Method outline Define diversification criteria Dimensions  Content, Sentiment, Named Entities, User co-commenting behavior Define a (dis)similarity function that produces a diversity score based on the criteria Quantify the dissimilarity of comments Weighted sum of cosine similarities on diversity feature vectors Apply and iterative heuristic algorithm that, at each step, selects the candidate comment that maximizes a diversity objective
  • 6. 6 Method description - Criteria Content Baseline diversity criterion Used in the rest of the literature to diversify search results. Objective  obtain comments with diverse content. Processing  Comments’ text  term vectors  Document length-normalized tf values
  • 7. 7 Method description - Criteria Named Entities (Nes) Person, Organizations, Locations Many times news articles revolve around Nes  Even when an article talks about events or situations, usually one or more Persons or Locations are involved Objective  obtain comments that cover (uniformly) as many different NEs as possible Processing  Extraction of Nes in comments (Stanford NER)  Comments’ Nes  term vectors  Document length-normalized tf values
  • 8. 8 Method description - Criteria Sentiment 9 classes of sentiment within the interval [-4, 4]  -4  very negative  4  very positive  0  neutral Expresses users’ opinions on the news articles’ topics. Objective  obtain comments that cover (uniformly) different classes of sentiment Processing  Sentiment analysis of the comment’s text (SentiStrength)  Construct sentiment vectors  Each vector value represents a sentiment class
  • 9. 9 Comment scoring Cosine similarity function between A pair of comments A comment and a set of comments Apply the similarity function for each criterion separately Produce a final diversity score as a weighted sum of all criterion scores Produce a final score that incorporates comment-to-article similarity
  • 10. 10 Algorithm (MAXSUM) Initially Empty diverse result set  all comments belong to the candidate set Arbitrary insertion of a candidate comment into the result set Greedy construction heuristic Compare each candidate comment with the centroid (average) of the current result set Finish after (k-1) iterations  k comments are inserted
  • 11. 11 Evaluation Comparison of methods’ coverage on different information nuggets they contain Baseline diversification based only on content Proposed method (combination of multiple criteria) Proposed methods outperform the baseline
  • 12. 12 Framework - Implementations A desktop java application retrieving news articles and comments comments stored in a MySQL database News and comments obtained by the NY Times API Arcomem offline module for calculating diverse WebObjects of WebResources