A Query Routing Model to Rank
Expert Candidates on Twitter
Cleyton Souza, Jonathas Magalhães, Evandro Costa and
Joseana Fechine
LIA - Laboratory of Artificial Intelligence
UFCG - Federal University of Campina Grande
Campina Grande - Brazil
Introduction
• What is Social Query?
– It is the process of asking questions through social
media (e.g., Twitter, Facebook, etc.) [Morris et al.]
– The common strategy is sharing the question with
everyone, but this way there is no guarantee that you
will receive a good and quick answer
• Directing questions to someone is more efficient.

• What is Query Routing?
– It is the process of directing questions to appropriate
answerers (people able to help)!
cleyton.caetano.souza@copin.ufcg.edu.br

Introduction
• What are we proposing?
– A Query Routing Model: a technique that finds
the most suitable person to help you based on
knowledge, trust and activity.
– We are focusing on the Twitter context!

Agenda
• Introduction
• Related Work
• Proposal
• Evaluation
– Methodology
– Results
– Threats to Validity
• Conclusion & Future Work
Related Work (1/2)
• How does our proposal differ from previous work?
– Context – We focus on a Social Network context;
• While previous work focused on the Community Question
Answering context…
• Why did we choose Twitter?
– It is one of the most popular Online Social Networks;
– Fewer than 18% of questions asked on Twitter are
answered [Paul et al.];
– [Nichols and Kang] confirmed that directing questions
significantly improves the response rate;
Related Work (2/2)
• How does our proposal differ from previous work?
– Problem – We treat the Query Routing problem as a
Multi-Criteria Decision Making problem
(Weighted Product Model – WPM);
• While previous work applied mainly probabilistic
models…
• Why did we choose WPM?
– [Triantaphyllou and Mann] confirmed that, for problems that
depend on up to three variables, WPM achieves the best
performance

Proposal
• A user on Twitter has a question
• Our model analyzes the question and ranks the user's
followers based on three criteria (further details in
[Souza et al.])
– Knowledge (K) – using a bag-of-words strategy;
– Trust (T) – a combination of similarity and
conversation rate;
– Activity (A) – mean latency time between
consecutive messages;

• What do we want?
– We want to find the best combination of K, T and A!
Knowledge
• We want to ask someone who knows about the
topic of the question
• We use the Vector Space Model
– Users and questions are each represented by a vector
of terms
– We match users and questions using cosine
similarity between these vectors
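
A minimal sketch of the matching step described above, assuming a plain whitespace tokenizer and raw term counts (the slides do not specify tokenization or term weighting). The names `bag_of_words`, `cosine_similarity` and `knowledge` are hypothetical and only illustrate the idea.

```python
import math
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Lowercase, split on whitespace, and count term occurrences (assumed tokenizer)."""
    return Counter(text.lower().split())


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector matches nothing
    return dot / (norm_a * norm_b)


def knowledge(question: str, user_tweets: list[str]) -> float:
    """Hypothetical K score: similarity between a question and a user's tweets."""
    user_vector = Counter()
    for tweet in user_tweets:
        user_vector.update(bag_of_words(tweet))
    return cosine_similarity(bag_of_words(question), user_vector)
```

A user whose tweets share no term with the question gets a score of 0, and the score grows as the overlap grows.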

Trust/Closeness
• Sometimes, we want to receive answers from
people close to us
• How do we automatically discover these people?
– We analyze the conversation rate between the
questioner and each follower
– We analyze the followers set similarity between
the questioner and each follower
– We define trust as the product of the
conversation rate and the followers set similarity
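
A minimal sketch of the trust criterion, under two assumptions that the slides do not confirm: the conversation rate is taken as the fraction of the questioner's messages addressed to the follower, and the followers set similarity is measured with the Jaccard index. All function names are hypothetical.

```python
def jaccard(a: set, b: set) -> float:
    """Jaccard index, an assumed choice of set similarity."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def conversation_rate(messages_to_follower: int, total_messages: int) -> float:
    """Assumed definition: share of the questioner's messages sent to this follower."""
    return messages_to_follower / total_messages if total_messages else 0.0


def trust(messages_to_follower: int, total_messages: int,
          questioner_followers: set, follower_followers: set) -> float:
    """Trust as the product of conversation rate and followers set similarity."""
    rate = conversation_rate(messages_to_follower, total_messages)
    return rate * jaccard(questioner_followers, follower_followers)
```

Because trust is a product, a follower with no conversation history or no followers in common scores 0, whatever the other factor is.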
Activity
• Sometimes, we prefer a quick, lower-quality answer
to a slow, high-quality one
• Our assumption is that people who produce
a lot of content in a short time will provide
quick answers
• Activity is the mean latency time between
consecutive posts
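
A minimal sketch of the activity criterion, assuming each post carries a timestamp and that latency is measured in seconds. The function name `mean_latency_seconds` is hypothetical. A lower value means a more active user.

```python
from datetime import datetime


def mean_latency_seconds(timestamps: list[datetime]) -> float:
    """Mean time gap, in seconds, between consecutive posts."""
    if len(timestamps) < 2:
        return float("inf")  # assumption: too few posts to measure any latency
    ordered = sorted(timestamps)
    gaps = [
        (later - earlier).total_seconds()
        for earlier, later in zip(ordered, ordered[1:])
    ]
    return sum(gaps) / len(gaps)


posts = [
    datetime(2013, 1, 1, 10, 0),
    datetime(2013, 1, 1, 10, 10),
    datetime(2013, 1, 1, 10, 30),
]
latency = mean_latency_seconds(posts)  # gaps of 600 s and 1200 s
```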

Proposal
• How do we compare the criteria values of
the followers?
– We use the Weighted Product Model: we compare two
users using the following function:
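\[ \mathrm{comp}(u, v) = \left(\frac{\mathrm{map}(K_u)}{\mathrm{map}(K_v)}\right)^{w_k} \cdot \left(\frac{\mathrm{map}(T_u)}{\mathrm{map}(T_v)}\right)^{w_t} \cdot \left(\frac{\mathrm{map}(A_u)}{\mathrm{map}(A_v)}\right)^{w_a} \]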

– The result of the comparison tells us which of the
two users is better!
– We sum the victories of each user and rank them
by their total number of victories!
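
A minimal sketch of the pairwise comparison and victory count. It assumes the criteria values have already been passed through the mapping function, are strictly positive, and are oriented so that larger is better. Following the usual Weighted Product Model convention, a ratio above 1 is counted as a victory for the first user. The candidate names, values and equal weights are made up for illustration.

```python
from itertools import permutations


def comp(u: dict, v: dict, weights: dict) -> float:
    """Weighted product of the per-criterion ratios of user u over user v."""
    result = 1.0
    for criterion, weight in weights.items():
        result *= (u[criterion] / v[criterion]) ** weight
    return result


def rank_by_victories(candidates: dict, weights: dict) -> list:
    """Order candidates by how many pairwise comparisons they win."""
    victories = {name: 0 for name in candidates}
    for a, b in permutations(candidates, 2):
        if comp(candidates[a], candidates[b], weights) > 1.0:
            victories[a] += 1
    return sorted(candidates, key=lambda name: victories[name], reverse=True)


# Hypothetical mapped values for three followers.
candidates = {
    "alice": {"K": 0.8, "T": 0.5, "A": 0.9},
    "bob": {"K": 0.4, "T": 0.5, "A": 0.3},
    "carol": {"K": 0.6, "T": 0.9, "A": 0.6},
}
weights = {"K": 1 / 3, "T": 1 / 3, "A": 1 / 3}
ranking = rank_by_victories(candidates, weights)
```

Since only ratios enter the product, the criteria do not need a common unit, which is one reason the Weighted Product Model is convenient for mixing scores such as similarity and latency.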
Evaluation
• We used a Quantitative Approach!
• Methodology
1. We selected 160 questions and their answers
published on Twitter
2. We manually ranked the answers of each
question based on their utility

Evaluation
• We manually ranked the answers of each
question based on their utility
Question: How much does it cost to go to Disneyland?

Answer | Answer Type | Utility
I don’t know | An unhelpful answer | 1
I think @someone knows | Indicating someone or some source | 2
Between $1000 and $2000 | An uncertain answer | 3
I went last year and I spent $700 | A direct answer | 4

• We used the order in which the answers were
given as a tie-breaker
Evaluation
• Methodology
4. We crawled information about their questioners and
answerers (user profile, followers set, following set, tweets);
5. We ranked the answerers using our proposal
6. We compared both ranks using nDCG
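
A minimal sketch of the nDCG comparison in step 6. It assumes the common formulation in which the gain at position i is the utility divided by log2(i + 1), since the slides do not state which variant was used. The input is the list of manually assigned utilities, arranged in the order produced by the model.

```python
import math


def dcg(utilities: list) -> float:
    """Discounted cumulative gain of utilities listed in ranked order."""
    return sum(
        utility / math.log2(position + 1)
        for position, utility in enumerate(utilities, start=1)
    )


def ndcg(utilities_in_predicted_order: list) -> float:
    """DCG of the predicted order divided by the DCG of the ideal order."""
    ideal = dcg(sorted(utilities_in_predicted_order, reverse=True))
    return dcg(utilities_in_predicted_order) / ideal if ideal > 0 else 0.0


perfect = ndcg([4, 3, 2, 1])   # model order matches the manual order
reversed_ = ndcg([1, 2, 3, 4])  # model order is the exact opposite
```

A value of 1 means the model reproduced the manual ranking. Any other order scores lower, with mistakes near the top of the list penalized the most.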

• Our aim is to answer the following questions
– Does our Model perform well to predict the utility of
the answers?
– Does WPM reach better performance than the use of
criteria individually?
Results
Question Type [Morris et al.] | Amount of Questions | Mean of nDCG
Recommendation | 56 | 0.92 ± 0.23
Opinion | 17 | 0.83 ± 0.31
Factual Knowledge | 40 | 0.91 ± 0.26
Rhetorical | 15 | 0.90 ± 0.25
Invitation | 3 | 0.99 ± 0.01
Favor | 8 | 1.00 ± 0.00
Social connection | 12 | 0.87 ± 0.28
Offer | 9 | 0.84 ± 0.31
Total / Mean | 160 | 0.90

Does our Model perform well to predict the
aptitude of the expert candidates?
• Promising results
– We reach a mean nDCG greater than 0.9;
– A one-tailed binomial test statistically confirmed that
the QR model predicted the ideal rank in more than
64% of cases (p-value = 0.03219 and α = 5%);
• An improvement in comparison with [Souza et al. 2012]
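
A minimal sketch of an exact one-tailed binomial test of the kind mentioned above. The null hypothesis is that the ideal rank is predicted with probability p0, and the p-value is the probability of seeing at least the observed number of successes. The slides do not report the success count, so the count in the usage line is hypothetical and is not expected to reproduce the reported p-value.

```python
from math import comb


def binomial_p_value_greater(successes: int, trials: int, p0: float) -> float:
    """P(X >= successes) for X ~ Binomial(trials, p0)."""
    return sum(
        comb(trials, k) * p0**k * (1 - p0) ** (trials - k)
        for k in range(successes, trials + 1)
    )


# Hypothetical count: 115 ideal ranks out of 160 questions, tested against 64%.
p_value = binomial_p_value_greater(115, 160, 0.64)
reject_null = p_value < 0.05
```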

Does WPM reach better performance than
using each criterion individually?

Figure 1: Boxplot comparing WPM with each individual criterion
Does WPM reach better performance than
using each criterion individually?
• We performed a pairwise comparison using the
Wilcoxon Signed Rank Test (α=5%)

Hypothesis | P-value | Conclusion
WPM has a better nDCG distribution than Knowledge | 1.357e-15 | True
WPM has a better nDCG distribution than Activity | 6.701e-16 | True
WPM has a better nDCG distribution than Trust | 4.025e-16 | True
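
A minimal sketch of a one-sided Wilcoxon signed-rank test on paired nDCG scores. It uses the normal approximation with a tie correction and discards zero differences. The slides do not say which variant or software produced the reported p-values, so this only illustrates the technique. The function name `wilcoxon_greater` is hypothetical.

```python
import math


def wilcoxon_greater(x: list, y: list) -> tuple:
    """Return (W+, p) for the alternative that x tends to exceed y (paired)."""
    diffs = [a - b for a, b in zip(x, y) if a != b]  # drop zero differences
    n = len(diffs)
    if n == 0:
        return 0.0, 1.0
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        # Extend j over the run of equal absolute differences.
        while j + 1 < n and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
            j += 1
        average_rank = (i + j) / 2 + 1  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        size = j - i + 1
        tie_term += size**3 - size
        i = j + 1
    w_plus = sum(rank for rank, d in zip(ranks, diffs) if d > 0)
    mean = n * (n + 1) / 4
    variance = n * (n + 1) * (2 * n + 1) / 24 - tie_term / 48
    z = (w_plus - mean) / math.sqrt(variance)
    p_value = 0.5 * math.erfc(z / math.sqrt(2))  # upper-tail normal probability
    return w_plus, p_value
```

For small samples an exact test is preferable to the normal approximation. With 160 paired observations, as in this evaluation, the approximation is usually considered adequate.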

Threats to Validity
• Evaluation Methodology
• Few Questions
• Manually ordered answers

Conclusion & Future Work
• We proposed a QR Model for Twitter
– We achieved promising results in a young field
– We confirmed the superiority of WPM use
– We created a public dataset for future research in the
area

• Future Work
– Is directing questions to experts more effective than
sharing questions?
– What is the relationship between the weights given to
the criteria and the qualities (truth, intimacy, speed) of
the received answers?
References
• M. Morris, J. Teevan, and K. Panovich, “What do people ask their social networks, and why?: a survey study of status message q&a behavior”, Proceedings of the 28th ACM International Conference on Human Factors in Computing Systems, 2010, pp. 1739–1748.
• J. Nichols and J. Kang, “Asking questions of targeted strangers on social networks”, Proceedings of the ACM Conference on Computer Supported Cooperative Work, 2012, pp. 999–1002.
• S. Paul, L. Hong, and E. Chi, “Is Twitter a good place for asking questions? a characterization study”, Proceedings of the 5th International AAAI Conference on Weblogs and Social Media, 2011, pp. 578–581.
• C. Souza, J. Magalhães, and E. Costa, “A Formal Model to the Routing Questions Problem in the Context of Twitter”, Proceedings of the IADIS International Conference WWW/Internet, 2011.
• C. Souza, J. Magalhães, E. Costa, and J. Fechine, “Predicting Potential Responders in Twitter: A Query Routing Algorithm”, Proceedings of the 12th International Conference on Computational Science and Its Applications, 2012, pp. 714–729.
• E. Triantaphyllou and S. Mann, “An examination of the effectiveness of multi-dimensional decision-making methods: A decision-making paradox”, Decision Support Systems, vol. 5, 1989, pp. 303–312.
Questions?

 

Último (20)

Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 

A Query Routing Model to Rank Expert Candidates on Twitter

  • 1. A Query Routing Model to Rank Expert Candidates on Twitter
      Cleyton Souza, Jonathas Magalhães, Evandro Costa and Joseana Fechine
      LIA - Laboratory of Artificial Intelligence
      UFCG - Federal University of Campina Grande
      Campina Grande - Brazil
  • 2. Introduction
      • What is Social Query?
          – It is the process of asking questions through social media (e.g., Twitter or Facebook) [Morris et al.]
          – The common strategy is to share the question with everyone, but this gives no guarantee of a good, quick answer
              • Directing the question to a specific person is more efficient
      • What is Query Routing?
          – It is the process of directing questions to appropriate answerers (people able to help)
      cleyton.caetano.souza@copin.ufcg.edu.br
  • 3. Introduction
      • What are we proposing?
          – A Query Routing Model: a technique that finds the most suitable person to help you, based on knowledge, trust and activity
          – We focus on the Twitter context
  • 4. Agenda
      • Introduction
      • Related Work
      • Proposal
      • Evaluation
          – Methodology
          – Results
          – Threats to Validity
      • Conclusion & Future Work
  • 5. Related Work (1/2)
      • How does our proposal differ from previous work?
          – Context: we focus on a social network context
              • Previous work focused on the Community Question Answering context
      • Why did we choose Twitter?
          – It is one of the most popular online social networks
          – Fewer than 18% of questions asked on Twitter are answered [Paul et al.]
          – [Nichols and Kang] confirmed that directing questions significantly improves the response rate
  • 6. Related Work (2/2)
      • How does our proposal differ from previous work?
          – Problem: we treat Query Routing as a Multi-criteria Decision Making problem, using the Weighted Product Model (WPM)
              • Previous work mainly applied probabilistic models
      • Why did we choose WPM?
          – [Triantaphyllou and Mann] confirmed that, for problems that depend on up to three variables, WPM achieves the best performance
  • 7. Proposal
      • A user on Twitter has a question
      • Our model analyzes the question and ranks the user's followers based on three criteria (further details in [Souza et al.])
          – Knowledge (K): computed with a bag-of-words strategy
          – Trust (T): a combination of similarity and conversation rate
          – Activity (A): mean latency time between consecutive messages
      • What do we want?
          – We want to find the best combination of K, T and A
  • 8. Knowledge
      • We want to ask someone who knows about the topic of the question
      • We use the Vector Space Model
          – Users and questions are each represented by a vector of terms
          – We match users to questions using the cosine similarity between these vectors
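The Knowledge criterion can be illustrated with a minimal sketch. It assumes raw term counts and whitespace tokenization; the slides do not say how terms are weighted or extracted, so `term_vector` and the sample texts are hypothetical.

```python
import math
from collections import Counter


def term_vector(text):
    """Bag-of-words vector: lowercase whitespace tokens mapped to raw counts.

    Assumption: raw counts; the slides do not specify the term weighting.
    """
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical question and hypothetical text produced by one follower.
question = term_vector("how much does a trip to disneyland cost")
follower = term_vector("just booked our disneyland trip the tickets cost a lot")
knowledge = cosine_similarity(question, follower)
```

A higher `knowledge` value means the follower's vocabulary overlaps more with the question.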
  • 9. Trust/Closeness
      • Sometimes we want to receive answers from people close to us
      • How do we automatically discover these people?
          – We analyze the conversation rate between the questioner and each follower
          – We analyze the similarity between the follower sets of the questioner and of each follower
          – We define trust as the product of the conversation rate and the follower-set similarity
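A rough sketch of the Trust criterion follows. The slide states only that trust is the product of the two quantities; the definitions used here (conversation rate as a fraction of the questioner's messages, Jaccard index for follower-set similarity) are assumptions made for illustration.

```python
def jaccard(a, b):
    """Jaccard index of two collections (assumed similarity measure)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def conversation_rate(messages_to_candidate, total_messages):
    """Share of the questioner's messages addressed to the candidate.

    Assumption: the slides do not define the rate; this is one plausible form.
    """
    return messages_to_candidate / total_messages if total_messages else 0.0


def trust(messages_to_candidate, total_messages,
          questioner_followers, candidate_followers):
    """Trust as the product of conversation rate and follower-set similarity."""
    return (conversation_rate(messages_to_candidate, total_messages)
            * jaccard(questioner_followers, candidate_followers))


# Hypothetical data: 5 of 20 messages went to the candidate,
# and the two follower sets share 2 of 4 distinct accounts.
example_trust = trust(5, 20, {"a", "b", "c"}, {"b", "c", "d"})
```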
  • 10. Activity
      • Sometimes we prefer a quick, lower-quality answer to a slow, high-quality one
      • Our assumption is that people who produce a lot of content in a short time will provide quick answers
      • Activity is the mean latency time between consecutive posts
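The Activity measure can be sketched as below, assuming post times are available as numeric timestamps in seconds (a hypothetical input format). Note that a lower mean latency indicates a more active user; how this value is turned into a "higher is better" score is not described on the slide.

```python
def mean_latency(timestamps):
    """Mean gap between consecutive posts, in the unit of the timestamps.

    Returns None when fewer than two posts are available.
    """
    ordered = sorted(timestamps)
    if len(ordered) < 2:
        return None
    gaps = [later - earlier for earlier, later in zip(ordered, ordered[1:])]
    return sum(gaps) / len(gaps)


# Hypothetical posting times (seconds): gaps of 60, 120 and 60 seconds.
activity = mean_latency([0, 60, 180, 240])
```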
  • 11. Proposal
      • How do we compare the criteria values of the followers?
          – We use the Weighted Product Model; two users u and v are compared with the following function:
            \[ comp(u, v) = \left(\frac{map(K_u)}{map(K_v)}\right)^{w_k} \cdot \left(\frac{map(T_u)}{map(T_v)}\right)^{w_t} \cdot \left(\frac{map(A_u)}{map(A_v)}\right)^{w_a} \]
          – The result of the comparison tells us which of the two users is better
          – We sum the victories of each user and rank the users by their total number of victories
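The pairwise comparison and victory count can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the scores are taken to be already mapped to strictly positive "higher is better" values, a result greater than 1 is read as a win for u (the usual Weighted Product Model convention, not stated on the slide), and the candidate names, scores and equal weights are hypothetical.

```python
def comp(u, v, weights):
    """Weighted product of per-criterion ratios between candidates u and v."""
    result = 1.0
    for criterion, weight in weights.items():
        result *= (u[criterion] / v[criterion]) ** weight
    return result


def rank_by_victories(candidates, weights):
    """Rank candidate names by how many pairwise comparisons they win."""
    wins = {name: 0 for name in candidates}
    for u_name, u in candidates.items():
        for v_name, v in candidates.items():
            # Assumed convention: comp > 1 means u beats v.
            if u_name != v_name and comp(u, v, weights) > 1:
                wins[u_name] += 1
    return sorted(wins, key=wins.get, reverse=True)


# Hypothetical mapped scores for three followers and equal weights.
candidates = {
    "ana": {"K": 0.8, "T": 0.5, "A": 0.6},
    "bob": {"K": 0.4, "T": 0.9, "A": 0.3},
    "cid": {"K": 0.2, "T": 0.2, "A": 0.9},
}
weights = {"K": 1 / 3, "T": 1 / 3, "A": 1 / 3}
ranking = rank_by_victories(candidates, weights)
```

Because the model multiplies ratios, a score of zero on any criterion would break the comparison, which is one reason a mapping step to positive values is assumed here.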
  • 12. Evaluation
      • We used a quantitative approach
      • Methodology
          1. We selected 160 questions published on Twitter, together with their answers
          2. We manually ranked the answers to each question based on their utility
  • 13. Evaluation
      • We manually ranked the answers to each question based on their utility
        Question: How much does it cost to go to Disneyland?
        Answer                                   Answer type                         Utility
        I don't know                             An unhelpful answer                 1
        I think @someone knows                   Indicating someone or some source   2
        Between $1000 and $2000                  An uncertain answer                 3
        I was there last year and I spent $700   A direct answer                     4
      • We used the order in which the answers were given as a tie-breaker
  • 14. Evaluation
      • Methodology
          4. We crawled information about the questioners and answerers (user profile, followers set, following set, tweets)
          5. We ranked the answerers using our proposal
          6. We compared both rankings using nDCG
      • Our aim is to answer the following questions
          – Does our model perform well at predicting the utility of the answers?
          – Does WPM reach better performance than using each criterion individually?
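The ranking comparison in step 6 can be sketched with a small nDCG function. The slides do not say which DCG variant was used; this sketch assumes the linear-gain form with a log2 position discount, and the utility values are hypothetical (on the 1 to 4 scale of the manual ranking).

```python
import math


def dcg(utilities):
    """Discounted cumulative gain: utility at position i divided by log2(i + 1)."""
    return sum(u / math.log2(pos + 1)
               for pos, u in enumerate(utilities, start=1))


def ndcg(predicted_utilities):
    """DCG of the predicted order divided by the DCG of the ideal order."""
    ideal = dcg(sorted(predicted_utilities, reverse=True))
    return dcg(predicted_utilities) / ideal if ideal > 0 else 0.0


# Manual utilities of the answers, listed in the order the model ranked
# their authors (hypothetical values).
perfect = ndcg([4, 3, 2, 1])   # model order equals the manual order
reversed_order = ndcg([1, 2, 3, 4])
```

A value of 1 means the model's order matches the manual ranking exactly; lower values penalize useful answers placed further down.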
  • 15. Results
        Question type [Morris et al.]   Number of questions   Mean nDCG
        Recommendation                  56                    0.92 ± 0.23
        Opinion                         17                    0.83 ± 0.31
        Factual Knowledge               40                    0.91 ± 0.26
        Rhetorical                      15                    0.90 ± 0.25
        Invitation                      3                     0.99 ± 0.01
        Favor                           8                     1.00 ± 0.00
        Social connection               12                    0.87 ± 0.28
        Offer                           9                     0.84 ± 0.31
        Total / Mean                    160                   0.90
  • 16. Does our model perform well at predicting the aptitude of the expert candidates?
      • Promising results
          – We reached a mean nDCG greater than 0.9
          – A one-tailed binomial test statistically confirmed that the QR model predicted the ideal ranking in more than 64% of cases (p-value = 0.03219, α = 5%)
      • An improvement in comparison with [Souza et al. 2012]
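For readers unfamiliar with the test, a one-tailed binomial p-value can be computed as sketched below. The counts and the null proportion in the example are hypothetical and are not taken from the slides, which report only the resulting p-value.

```python
from math import comb


def binomial_upper_tail(successes, trials, p0):
    """P(X >= successes) for X ~ Binomial(trials, p0): the one-tailed p-value."""
    return sum(
        comb(trials, k) * p0 ** k * (1 - p0) ** (trials - k)
        for k in range(successes, trials + 1)
    )


# Hypothetical counts, NOT the study's data: 8 ideal rankings in 10 trials,
# tested against a null proportion of 0.5 at a 5% significance level.
p_value = binomial_upper_tail(successes=8, trials=10, p0=0.5)
reject_null = p_value < 0.05
```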
  • 17. Does WPM reach better performance than using each criterion individually?
      Figure 1: Boxplot comparing WPM with each individual criterion
  • 18. Does WPM reach better performance than using each criterion individually?
      • We performed pairwise comparisons using the Wilcoxon Signed Rank Test (α = 5%)
        Hypothesis                                                P-value     Conclusion
        WPM has a better nDCG distribution than Knowledge         1.357e-15   True
        WPM has a better nDCG distribution than Activity          6.701e-16   True
        WPM has a better nDCG distribution than Trust             4.025e-16   True
  • 19. Threats to Validity
      • Evaluation methodology
      • Few questions
      • Answers were ordered manually
  • 20. Conclusion & Future Work
      • We proposed a QR Model for Twitter
          – We achieved promising results in a young field
          – We confirmed the superiority of WPM over the individual criteria
          – We created a public dataset for future research in the area
      • Future Work
          – Is directing questions to experts more effective than sharing them with everyone?
          – What is the relationship between the weights given to the criteria and the qualities (truth, intimacy, speed) of the received answer?
  • 21. References
      • M. Morris, J. Teevan, and K. Panovich, "What do people ask their social networks, and why?: a survey study of status message Q&A behavior", Proceedings of the 28th ACM International Conference on Human Factors in Computing Systems, 2010, pp. 1739–1748.
      • J. Nichols and J. Kang, "Asking questions of targeted strangers on social networks", Proceedings of the ACM Conference on Computer Supported Cooperative Work, 2012, pp. 999–1002.
      • S. Paul, L. Hong, and E. Chi, "Is Twitter a good place for asking questions? A characterization study", Proceedings of the 5th International AAAI Conference on Weblogs and Social Media, 2011, pp. 578–581.
      • C. Souza, J. Magalhães, and E. Costa, "A Formal Model to the Routing Questions Problem in the Context of Twitter", Proceedings of the IADIS International Conference WWW/Internet, 2011.
      • C. Souza, J. Magalhães, E. Costa, and J. Fechine, "Predicting Potential Responders in Twitter: A Query Routing Algorithm", Proceedings of the 12th International Conference on Computational Science and Its Applications, 2012, pp. 714–729.
      • E. Triantaphyllou and S. Mann, "An examination of the effectiveness of multidimensional decision-making methods: A decision-making paradox", Decision Support Systems, vol. 5, 1989, pp. 303–312.