SlideShare uma empresa Scribd logo
1 de 25
Digital Enterprise Research Institute

www.deri.ie

A CAPABILITY REQUIREMENTS APPROACH FOR
PREDICTING WORKER PERFORMANCE IN
CROWDSOURCING
Umair ul Hassan, Edward Curry
Digital Enterprise Research Institute
National University of Ireland, Galway

9th IEEE International Conference on Collaborative Computing:
Networking, Applications and Worksharing
Austin, Texas, United States
October 20–23, 2013
Copyright 2010 Digital Enterprise Research Institute. All rights reserved.
Agenda
Digital Enterprise Research Institute





Motivation
Background
Task Modelling





Capability Requirements





www.deri.ie

Capabilities Taxonomy

Capability Tracing
Experiment
Summary

2
Motivation: Heterogeneity
Digital Enterprise Research Institute

www.deri.ie

3
Motivation: Task Routing
Digital Enterprise Research Institute



www.deri.ie

Assigning heterogeneous tasks to heterogeneous workers

TASK MODELLING
Models
Models
Models

TASK ROUTING

WORKER PROFILING

Matching

Profiles
Profiles
Profiles

Task↔Worker

4
Proposal: Performance Prediction
Digital Enterprise Research Institute



www.deri.ie

Predict performance of workers on new tasks based on the
capabilities required for tasks and assign tasks accordingly
TASK MODELLING
Models
Models
Models
Capability
Requirements
Approach

TASK ROUTING

WORKER PROFILING

Matching

Profiles
Profiles
Profiles

Task↔Worker
Performance
Prediction

Capability Tracing
Model

5
Background: Micro tasks
Digital Enterprise Research Institute



www.deri.ie

When micro tasks are crowd sourced



Single person cannot do the task





Computers cannot do the task
Work can be split into smaller tasks

Some online microtask platforms

6
Background: Micro tasks
Digital Enterprise Research Institute



www.deri.ie

Most common tasks in Amazon Mechanical Turk (AMT)
and CrowdFlower (CFL)

7
Background: Micro tasks
Digital Enterprise Research Institute



www.deri.ie

Example of information extraction task in AMT

8
Background: Micro tasks
Digital Enterprise Research Institute



www.deri.ie

Example of video transcription task in AMT

9
Task Modelling
Digital Enterprise Research Institute

www.deri.ie



Appropriate models are needed to compare and contrast
micro tasks.



Capability Requirements approach


Capability is defined as the ability of humans to do things in
terms of both the capacity and the opportunity.



Four types of capabilities
–
–
–
–

Knowledge,
Skill,
Ability,
Other characteristics (e.g. motivation, price, etc)

10
Capability Requirements
Digital Enterprise Research Institute



www.deri.ie

Taxonomies have be used to study human task
performance, e.g.



Bloom’s taxonomy of classification of learning objectives





Fleishman’s taxonomy of human abilities
O*NET-SOC taxonomy of occupational classification

We are interested in taxonomy that


Describes tasks in terms of human capabilities



Helps in comparing tasks in terms of differences and similarities
of capabilities

11
Capabilities Taxonomy
Digital Enterprise Research Institute




www.deri.ie

Based on Fleishman’s abilities taxonomy
Selected abilities relevant to micro tasks


Comprehension (C): The ability to understand the meaning or importance of
something



Bilingualism (B): The ability to speak and understand two languages



Writing (W): The ability or capacity to write text in a given language



Comparison (M): The ability or capacity to compare things based on some
criteria



Judgment (J): The act or process of judging; the formation of an opinion after
consideration



Perception (P): The ability or capacity to perceive items visually or phonetically



Identification (I): The process of recognizing something



Reasoning (R): The ability to draw conclusions from
facts, evidence, relationships, etc.

12
Requirements of Micro Tasks
Digital Enterprise Research Institute

www.deri.ie

13
Capability Tracing
Digital Enterprise Research Institute




www.deri.ie

How to model worker’s capabilities?
Capability tracing





Inspired by Knowledge Tracing*
Estimates probability of a worker knowing a capability given
worker’s responses to test tasks

Worker Profile constrains


Set of binary variables representing capabilities



Probability estimates of each variable being in a state

* A. T. Corbett and J. R. Anderson, “Knowledge tracing: Modeling the acquisition of procedural knowledge,” User Modeling and User-Adapted
Interaction, vol. 4, no. 4, pp. 253–278, 1994.

14
Capability Tracing
Digital Enterprise Research Institute



www.deri.ie

Probabilistic network of a capability and four parameters of
capability tracing model

States of
Capability
Variable

Not
Learned

p(T): Probability of
transition between states

p(T)
Learned

p(L)

p(G)
Values of
Response
Variable

p(L): Probability of a
worker learning to
employ the capability

p(S)
p(G): Probability of
guess

Correct

Incorrect

15

p(S): Probability of slip
Experiment
Digital Enterprise Research Institute



www.deri.ie

Objective





Solicit capability requirements of tasks from crowds
Evaluation of capability tracing for performance prediction

Three types of micro tasks with manually created ground
truth data



Image comparison





Fact verification
Information Extraction

37 crowd workers including


University students



Workers from Shorttask.com

16
Crowdsourcing
Digital Enterprise Research Institute




www.deri.ie

Custom web application for gathering data
Example of fact verification task

17
Capability Requirements of Tasks
Digital Enterprise Research Institute



www.deri.ie

Objective 1: Solicit capability requirements of tasks
from crowds

(a) fact verification

(b) image comparison

18

(c) information extraction
Crowd Performance
Digital Enterprise Research Institute



www.deri.ie

How the crowd performed on each type of task?

Fact Verification task
• 37 workers
• Best workers perform with
both precision and recall
above 0.8
• More variation in recall means
some workers were could not
spot the incorrect facts
• Ideally tasks should be
assigned to workers that lie in
the top-right quadrant of the
plot

19
Crowd Performance
Digital Enterprise Research Institute



www.deri.ie

Image Comparison (20 workers) and Information Extraction (17 workers)

20
Performance Prediction
Digital Enterprise Research Institute




www.deri.ie

Objective 2: Evaluation of capability tracing for
performance prediction
Two phases


Build model with observation tasks



Predict performance on new tasks

AC: Consider previous
Accuracy as prediction
of future performance

CT: Capability Tracing

21
Summary
Digital Enterprise Research Institute




Capabilities taxonomy is first steps towards modelling of
micro tasks based on human factors
Capability tracing is effective in predicting future
performance





www.deri.ie

Even across tasks if there are similar capabilities

Predicted performance can be used to make right task
routing decisions
Future Work


Evaluate on more types of tasks



Evaluate capabilities such as domain knowledge and skills



Define standard tests for measuring worker capabilities

22
Further Reading
Digital Enterprise Research Institute

www.deri.ie

9th IEEE International Conference on Collaborative Computing:
Networking, Applications and Worksharing
Austin, Texas, United States
October 20–23, 2013

U. Ul Hassan and E. Curry, “A Capability Requirements Approach for Predicting
Worker Performance in Crowdsourcing,” in 9th IEEE International Conference on
Collaborative Computing: Networking, Applications and Worksharing, 2013.
http://deri.ie/users/umair-ul-hassan

23
Capability Tracing
Digital Enterprise Research Institute

www.deri.ie



Conditional probability of worker learning to employ
capability p(Ln|On) is calculated



When evidence On is positive



When evidence On is negative

24
Capability Tracing
Digital Enterprise Research Institute

www.deri.ie



Probability of worker learning to employ capability



Performance of worker on next task

25

Mais conteúdo relacionado

Semelhante a A Capability Requirements Approach for Predicting Worker Performance in Crowdsourcing

Effects of Expertise Assessment on the Quality of Task Routing in Human Compu...
Effects of Expertise Assessment on the Quality of Task Routing in Human Compu...Effects of Expertise Assessment on the Quality of Task Routing in Human Compu...
Effects of Expertise Assessment on the Quality of Task Routing in Human Compu...Umair ul Hassan
 
Data-centric AI and the convergence of data and model engineering: opportunit...
Data-centric AI and the convergence of data and model engineering:opportunit...Data-centric AI and the convergence of data and model engineering:opportunit...
Data-centric AI and the convergence of data and model engineering: opportunit...Paolo Missier
 
K anonymity for crowdsourcing database
K anonymity for crowdsourcing databaseK anonymity for crowdsourcing database
K anonymity for crowdsourcing databaseLeMeniz Infotech
 
Dad (Data Analysis And Design)
Dad (Data Analysis And Design)Dad (Data Analysis And Design)
Dad (Data Analysis And Design)Jill Lyons
 
A Collaborative Approach for Metadata Management for Internet of Things
A Collaborative Approach for Metadata Management for Internet of ThingsA Collaborative Approach for Metadata Management for Internet of Things
A Collaborative Approach for Metadata Management for Internet of ThingsUmair ul Hassan
 
Building, Growing and Serving Large Knowledge Graphs with Human-in-the-Loop
Building, Growing and Serving Large Knowledge Graphs with Human-in-the-LoopBuilding, Growing and Serving Large Knowledge Graphs with Human-in-the-Loop
Building, Growing and Serving Large Knowledge Graphs with Human-in-the-LoopYunyao Li
 
Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...
Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...
Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...kcortis
 
lecture-intro-pet-nams-ai-in-toxicology.pptx
lecture-intro-pet-nams-ai-in-toxicology.pptxlecture-intro-pet-nams-ai-in-toxicology.pptx
lecture-intro-pet-nams-ai-in-toxicology.pptxMarc Teunis
 
Keynote@CADE2018_HalukDemirkan
Keynote@CADE2018_HalukDemirkanKeynote@CADE2018_HalukDemirkan
Keynote@CADE2018_HalukDemirkanHaluk Demirkan
 
Querying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data WebQuerying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data WebEdward Curry
 
Mena Salwans - Computer Vision for developers
Mena Salwans - Computer Vision for developersMena Salwans - Computer Vision for developers
Mena Salwans - Computer Vision for developersAWS Chicago
 
Mena Salwans Computer Vision for developers
Mena Salwans Computer Vision for developers  Mena Salwans Computer Vision for developers
Mena Salwans Computer Vision for developers AWS Chicago
 
Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...
Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...
Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...Wim Laurier
 
Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...
Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...
Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...Wim Laurier
 
Collaborative Data Management: How Crowdsourcing Can Help To Manage Data
Collaborative Data Management: How Crowdsourcing Can Help To Manage DataCollaborative Data Management: How Crowdsourcing Can Help To Manage Data
Collaborative Data Management: How Crowdsourcing Can Help To Manage DataEdward Curry
 
Machine Learning for Recommender Systems in the Job Market
Machine Learning for Recommender Systems in the Job MarketMachine Learning for Recommender Systems in the Job Market
Machine Learning for Recommender Systems in the Job MarketFabian Abel
 
Data Science Demystified
Data Science DemystifiedData Science Demystified
Data Science DemystifiedEmily Robinson
 
CS8592_Notes_008_edubuzz360.pdf
CS8592_Notes_008_edubuzz360.pdfCS8592_Notes_008_edubuzz360.pdf
CS8592_Notes_008_edubuzz360.pdfAROCKIAJAYAIECW
 

Semelhante a A Capability Requirements Approach for Predicting Worker Performance in Crowdsourcing (20)

Effects of Expertise Assessment on the Quality of Task Routing in Human Compu...
Effects of Expertise Assessment on the Quality of Task Routing in Human Compu...Effects of Expertise Assessment on the Quality of Task Routing in Human Compu...
Effects of Expertise Assessment on the Quality of Task Routing in Human Compu...
 
Data-centric AI and the convergence of data and model engineering: opportunit...
Data-centric AI and the convergence of data and model engineering:opportunit...Data-centric AI and the convergence of data and model engineering:opportunit...
Data-centric AI and the convergence of data and model engineering: opportunit...
 
K anonymity for crowdsourcing database
K anonymity for crowdsourcing databaseK anonymity for crowdsourcing database
K anonymity for crowdsourcing database
 
Dad (Data Analysis And Design)
Dad (Data Analysis And Design)Dad (Data Analysis And Design)
Dad (Data Analysis And Design)
 
A Collaborative Approach for Metadata Management for Internet of Things
A Collaborative Approach for Metadata Management for Internet of ThingsA Collaborative Approach for Metadata Management for Internet of Things
A Collaborative Approach for Metadata Management for Internet of Things
 
Building, Growing and Serving Large Knowledge Graphs with Human-in-the-Loop
Building, Growing and Serving Large Knowledge Graphs with Human-in-the-LoopBuilding, Growing and Serving Large Knowledge Graphs with Human-in-the-Loop
Building, Growing and Serving Large Knowledge Graphs with Human-in-the-Loop
 
Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...
Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...
Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...
 
lecture-intro-pet-nams-ai-in-toxicology.pptx
lecture-intro-pet-nams-ai-in-toxicology.pptxlecture-intro-pet-nams-ai-in-toxicology.pptx
lecture-intro-pet-nams-ai-in-toxicology.pptx
 
Keynote@CADE2018_HalukDemirkan
Keynote@CADE2018_HalukDemirkanKeynote@CADE2018_HalukDemirkan
Keynote@CADE2018_HalukDemirkan
 
Querying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data WebQuerying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data Web
 
Mena Salwans - Computer Vision for developers
Mena Salwans - Computer Vision for developersMena Salwans - Computer Vision for developers
Mena Salwans - Computer Vision for developers
 
Mena Salwans Computer Vision for developers
Mena Salwans Computer Vision for developers  Mena Salwans Computer Vision for developers
Mena Salwans Computer Vision for developers
 
Sara Nasser
Sara NasserSara Nasser
Sara Nasser
 
Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...
Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...
Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...
 
Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...
Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...
Thought Leadership Session: Enterprise Semantics & Ontology, The Power of Und...
 
Collaborative Data Management: How Crowdsourcing Can Help To Manage Data
Collaborative Data Management: How Crowdsourcing Can Help To Manage DataCollaborative Data Management: How Crowdsourcing Can Help To Manage Data
Collaborative Data Management: How Crowdsourcing Can Help To Manage Data
 
Machine Learning for Recommender Systems in the Job Market
Machine Learning for Recommender Systems in the Job MarketMachine Learning for Recommender Systems in the Job Market
Machine Learning for Recommender Systems in the Job Market
 
Data Science Demystified
Data Science DemystifiedData Science Demystified
Data Science Demystified
 
CS8592_Notes_008_edubuzz360.pdf
CS8592_Notes_008_edubuzz360.pdfCS8592_Notes_008_edubuzz360.pdf
CS8592_Notes_008_edubuzz360.pdf
 
2008.11560v2.pdf
2008.11560v2.pdf2008.11560v2.pdf
2008.11560v2.pdf
 

Mais de Umair ul Hassan

Leveraging DBpedia for Adaptive Crowdsourcing in Linked Data Quality Assessment
Leveraging DBpedia for Adaptive Crowdsourcing in Linked Data Quality AssessmentLeveraging DBpedia for Adaptive Crowdsourcing in Linked Data Quality Assessment
Leveraging DBpedia for Adaptive Crowdsourcing in Linked Data Quality AssessmentUmair ul Hassan
 
A Multi-armed Bandit Approach to Online Spatial Task Assignment
A Multi-armed Bandit Approach to Online Spatial Task AssignmentA Multi-armed Bandit Approach to Online Spatial Task Assignment
A Multi-armed Bandit Approach to Online Spatial Task AssignmentUmair ul Hassan
 
SLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
SLUA: Towards Semantic Linking of Users with Actions in CrowdsourcingSLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
SLUA: Towards Semantic Linking of Users with Actions in CrowdsourcingUmair ul Hassan
 
Researh toolbox - Data analysis with python
Researh toolbox  - Data analysis with pythonResearh toolbox  - Data analysis with python
Researh toolbox - Data analysis with pythonUmair ul Hassan
 
Towards Expertise Modelling for Routing Data Cleaning Tasks within a Communit...
Towards Expertise Modelling for Routing Data Cleaning Tasks within a Communit...Towards Expertise Modelling for Routing Data Cleaning Tasks within a Communit...
Towards Expertise Modelling for Routing Data Cleaning Tasks within a Communit...Umair ul Hassan
 
Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...
Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...
Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...Umair ul Hassan
 

Mais de Umair ul Hassan (6)

Leveraging DBpedia for Adaptive Crowdsourcing in Linked Data Quality Assessment
Leveraging DBpedia for Adaptive Crowdsourcing in Linked Data Quality AssessmentLeveraging DBpedia for Adaptive Crowdsourcing in Linked Data Quality Assessment
Leveraging DBpedia for Adaptive Crowdsourcing in Linked Data Quality Assessment
 
A Multi-armed Bandit Approach to Online Spatial Task Assignment
A Multi-armed Bandit Approach to Online Spatial Task AssignmentA Multi-armed Bandit Approach to Online Spatial Task Assignment
A Multi-armed Bandit Approach to Online Spatial Task Assignment
 
SLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
SLUA: Towards Semantic Linking of Users with Actions in CrowdsourcingSLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
SLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
 
Researh toolbox - Data analysis with python
Researh toolbox  - Data analysis with pythonResearh toolbox  - Data analysis with python
Researh toolbox - Data analysis with python
 
Towards Expertise Modelling for Routing Data Cleaning Tasks within a Communit...
Towards Expertise Modelling for Routing Data Cleaning Tasks within a Communit...Towards Expertise Modelling for Routing Data Cleaning Tasks within a Communit...
Towards Expertise Modelling for Routing Data Cleaning Tasks within a Communit...
 
Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...
Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...
Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...
 

Último

VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Onlineanilsa9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 

Último (20)

VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 

A Capability Requirements Approach for Predicting Worker Performance in Crowdsourcing

  • 1. Digital Enterprise Research Institute www.deri.ie A CAPABILITY REQUIREMENTS APPROACH FOR PREDICTING WORKER PERFORMANCE IN CROWDSOURCING Umair ul Hassan, Edward Curry Digital Enterprise Research Institute National University of Ireland, Galway 9th IEEE International Conference on Collaborative Computing: Networking, Applications and Worksharing Austin, Texas, United States October 20–23, 2013 Copyright 2010 Digital Enterprise Research Institute. All rights reserved.
  • 2. Agenda Digital Enterprise Research Institute    Motivation Background Task Modelling    Capability Requirements   www.deri.ie Capabilities Taxonomy Capability Tracing Experiment Summary 2
  • 3. Motivation: Heterogeneity Digital Enterprise Research Institute www.deri.ie 3
  • 4. Motivation: Task Routing Digital Enterprise Research Institute  www.deri.ie Assigning heterogeneous tasks to heterogeneous workers TASK MODELLING Models Models Models TASK ROUTING WORKER PROFILING Matching Profiles Profiles Profiles Task↔Worker 4
  • 5. Proposal: Performance Prediction Digital Enterprise Research Institute  www.deri.ie Predict performance of workers on new tasks based on the capabilities required for tasks and assign tasks accordingly TASK MODELLING Models Models Models Capability Requirements Approach TASK ROUTING WORKER PROFILING Matching Profiles Profiles Profiles Task↔Worker Performance Prediction Capability Tracing Model 5
  • 6. Background: Micro tasks Digital Enterprise Research Institute  www.deri.ie When micro tasks are crowd sourced   Single person cannot do the task   Computers cannot do the task Work can be split into smaller tasks Some online microtask platforms 6
  • 7. Background: Micro tasks Digital Enterprise Research Institute  www.deri.ie Most common tasks in Amazon Mechanical Turk (AMT) and CrowdFlower (CFL) 7
  • 8. Background: Micro tasks Digital Enterprise Research Institute  www.deri.ie Example of information extraction task in AMT 8
  • 9. Background: Micro tasks Digital Enterprise Research Institute  www.deri.ie Example of video transcription task in AMT 9
  • 10. Task Modelling Digital Enterprise Research Institute www.deri.ie  Appropriate models are needed to compare and contrast micro tasks.  Capability Requirements approach  Capability is defined as the ability of humans to do things in terms of both the capacity and the opportunity.  Four types of capabilities – – – – Knowledge, Skill, Ability, Other characteristics (e.g. motivation, price, etc) 10
  • 11. Capability Requirements Digital Enterprise Research Institute  www.deri.ie Taxonomies have be used to study human task performance, e.g.   Bloom’s taxonomy of classification of learning objectives   Fleishman’s taxonomy of human abilities O*NET-SOC taxonomy of occupational classification We are interested in taxonomy that  Describes tasks in terms of human capabilities  Helps in comparing tasks in terms of differences and similarities of capabilities 11
  • 12. Capabilities Taxonomy Digital Enterprise Research Institute   www.deri.ie Based on Fleishman’s abilities taxonomy Selected abilities relevant to micro tasks  Comprehension (C): The ability to understand the meaning or importance of something  Bilingualism (B): The ability to speak and understand two languages  Writing (W): The ability or capacity to write text in a given language  Comparison (M): The ability or capacity to compare things based on some criteria  Judgment (J): The act or process of judging; the formation of an opinion after consideration  Perception (P): The ability or capacity to perceive items visually or phonetically  Identification (I): The process of recognizing something  Reasoning (R): The ability to draw conclusions from facts, evidence, relationships, etc. 12
  • 13. Requirements of Micro Tasks Digital Enterprise Research Institute www.deri.ie 13
  • 14. Capability Tracing Digital Enterprise Research Institute   www.deri.ie How to model worker’s capabilities? Capability tracing    Inspired by Knowledge Tracing* Estimates probability of a worker knowing a capability given worker’s responses to test tasks Worker Profile constrains  Set of binary variables representing capabilities  Probability estimates of each variable being in a state * A. T. Corbett and J. R. Anderson, “Knowledge tracing: Modeling the acquisition of procedural knowledge,” User Modeling and User-Adapted Interaction, vol. 4, no. 4, pp. 253–278, 1994. 14
  • 15. Capability Tracing Digital Enterprise Research Institute  www.deri.ie Probabilistic network of a capability and four parameters of capability tracing model States of Capability Variable Not Learned p(T): Probability of transition between states p(T) Learned p(L) p(G) Values of Response Variable p(L): Probability of a worker learning to employ the capability p(S) p(G): Probability of guess Correct Incorrect 15 p(S): Probability of slip
  • 16. Experiment Digital Enterprise Research Institute  www.deri.ie Objective    Solicit capability requirements of tasks from crowds Evaluation of capability tracing for performance prediction Three types of micro tasks with manually created ground truth data   Image comparison   Fact verification Information Extraction 37 crowd workers including  University students  Workers from Shorttask.com 16
  • 17. Crowdsourcing Digital Enterprise Research Institute   www.deri.ie Custom web application for gathering data Example of fact verification task 17
  • 18. Capability Requirements of Tasks Digital Enterprise Research Institute  www.deri.ie Objective 1: Solicit capability requirements of tasks from crowds (a) fact verification (b) image comparison 18 (c) information extraction
  • 19. Crowd Performance Digital Enterprise Research Institute  www.deri.ie How the crowd performed on each type of task? Fact Verification task • 37 workers • Best workers perform with both precision and recall above 0.8 • More variation in recall means some workers were could not spot the incorrect facts • Ideally tasks should be assigned to workers that lie in the top-right quadrant of the plot 19
  • 20. Crowd Performance Digital Enterprise Research Institute  www.deri.ie Image Comparison (20 workers) and Information Extraction (17 workers) 20
  • 21. Performance Prediction Digital Enterprise Research Institute   www.deri.ie Objective 2: Evaluation of capability tracing for performance prediction Two phases  Build model with observation tasks  Predict performance on new tasks AC: Consider previous Accuracy as prediction of future performance CT: Capability Tracing 21
  • 22. Summary Digital Enterprise Research Institute   Capabilities taxonomy is first steps towards modelling of micro tasks based on human factors Capability tracing is effective in predicting future performance    www.deri.ie Even across tasks if there are similar capabilities Predicted performance can be used to make right task routing decisions Future Work  Evaluate on more types of tasks  Evaluate capabilities such as domain knowledge and skills  Define standard tests for measuring worker capabilities 22
  • 23. Further Reading Digital Enterprise Research Institute www.deri.ie 9th IEEE International Conference on Collaborative Computing: Networking, Applications and Worksharing Austin, Texas, United States October 20–23, 2013 U. Ul Hassan and E. Curry, “A Capability Requirements Approach for Predicting Worker Performance in Crowdsourcing,” in 9th IEEE International Conference on Collaborative Computing: Networking, Applications and Worksharing, 2013. http://deri.ie/users/umair-ul-hassan 23
  • 24. Capability Tracing Digital Enterprise Research Institute www.deri.ie  Conditional probability of worker learning to employ capability p(Ln|On) is calculated  When evidence On is positive  When evidence On is negative 24
  • 25. Capability Tracing Digital Enterprise Research Institute www.deri.ie  Probability of worker learning to employ capability  Performance of worker on next task 25

Notas do Editor

  1. AMT is amazonmechincalturk
  2. AMT is amazonmechincalturk
  3. AMT is amazonmechincalturk
  4. AMT is amazon mechanical turk
  5. Probability estimates are generated while learning the model through capability tracing
  6. As can be seen, more than majority of workers believed that identification capability is essential for all three types of tasks. Majority of workers also agreed that the judgment and comprehension capabilities are important for the Fact Verification task. In the case of the Image Comparison task most workers specified that comparison and perception are important as well. There is general consensus between workers that comprehension, judgment and reasoning capabilities are also useful for Information Extraction tasks. We selected top-3 capabilities for each type of task for building capability tracing models.
  7. Interestingly, no worker achieved the highest recall for the information extraction task, which highlights the difference between workers and ground truth in terms of the entities extracted from the Wikipedia articles. Nevertheless, these distributions emphasize that in order to achieve high accuracy tasks should be assigned to workers that lie in the top-right quadrant of the plots.
  8. Results show that the capability tracing approach is comparable to the baseline approach in general and achieves better accuracy of prediction between similar tasks. The Fact Verification and Information Extraction tasks have similar capabilities requirements, therefore capability tracingcan better predict the performance of workers between them. The drop in prediction quality of capability tracing for Image Comparison task can be attributed to the little variation is the performance of workers on this task.
  9. Hide these
  10. Hide these