SlideShare uma empresa Scribd logo
1 de 31
Baixar para ler offline
The human factor in big data
BDVe webinar series
November 6th 2018
Elena Simperl, University of Southampton, UK
@esimperl
Volume
Veracity
Velocity
Variety
Big data
• Data value chains as driver for growth and change
• Transformative impact leading to new infrastructure,
businesses, politics and social interactions
• Created, refined, valued and exchanged unlike any other
resources
• Alters the rules for markets and demands new
approaches from regulators
The data economy
Example: Disrupting transport
Smart cities have access to more data than ever to
inform policy and service design
Driverless cars, electrification and connectivity are
transforming the automotive industry
Machine learning and AI can help optimise traffic,
support future planning and improve fuel efficiencies
Challenges
Data
availability
• Collecting missing data
• Labelling data to train and
validate algorithms
• Improving data quality
• Integrating across sources
Data use
• Making decisions inclusively
• Enabling the free flow of data
• Innovating responsibly
Many of these
tasks are
automated, but
technology has
limitations
Legal, economic,
social, ethical
implications
More and better data
Training and validating algorithms
Engaging and empowering citizens,
customers etc.
The human factor in big data
Approaches
Citizen sensing
Urban auditing
Participatory democracy
Open innovation
Crowdsourcing
Human in the loop
Crowdsourcing
Organisations struggle to leverage
the human factor
What form of
crowdsourcing
to choose?
How to engage
with the
crowd?
Why would
the crowd
care?
How do we
control the
quality?
Does it need
to be in real-
time?
Can we afford
it at scale?
Qrowd
Innovation action, part of the Big Data Value PPP
Started in December 2016, 3 years, 3.9M €
8 partners from 5 European countries, coordinated by the
University of Southampton
Smart city solutions
Combining crowd and computational intelligence
Piloted in transportation with
A medium-sized smart city
A leading navigation and traffic management service provider
Enabling data value chains
Standards compliant,
interoperable, open, no
vendor lock-in
Leverages existing
technology stacks
Used by industry partners
Extendable and scalable to
adapt to new urban
contexts
Platform for data and
process (data flow)
integration
The human factor in Qrowd
Mix of open innovation methods to co-design pilots and encourage
stakeholder participation
Value-centric approach to platform design: personal data empowerment,
open source, building upon existing standards
Sustainable urban auditing through online and mobile crowdsourcing
Human-in-the-loop (HIL) architecture to improve the accuracy of
predictions
More than just technology
Supports deployment of
human-machine workflows
throughout
Interfaces to multiple
crowdsourcing services
Complemented by
methodology and
guidelines
Data protection by design
The ‘what, who, how, why’ methodology
14
What
• Tasks you can’t complete in-house or using computers
• A question of time, budget, resources, ethics etc.
Who
• Crowdsourcing ≠‘turkers’
• Open call, biased via choice of platforms and promotion
channels
• No traditional means to manage and incentivize
• Crowd has often little to no context about the project
How
• Macro vs. microtasks
• Complex workflows
• Assessment and aggregation
• Timeliness of results
Why
• Different crowds with different motivations
• Incentives influence motivations
• Aligning incentives
Using the methodology
Who is it for
• Organisations interested in increasing participation via crowdsourcing
• Technology providers implementing HIL architectures
How can it be used
• Provides a process model starting with the What, followed by the Who,
which then determine the How. Every What/Who/How decision impacts
on the Why
• Can be used with or without the Qrowd platform
• Helps specify goals and decide what forms of crowdsourcing to use
• Helps roll out crowdsourcing projects and use their results effectively
• Helps understand motivations and incentives and their role in successful
projects
Examples
Urban auditing: Collect up to date
information about parking spaces in a city
Modal split: Collecting training data to
predict the use of different means of
transport
What
In general
• Something you cannot do using traditional means or that
requires broader engagement
• Something you cannot do (fully) automatically – a data
collection or analysis task
In our examples
• Parking: We need a dataset with all parking spaces in a city
(alternatively: parking availability). Traditional surveys too costly.
• Modal split: We need trips involving different means of
transport and labels for each trip segment. This data is not
available and is needed to train AIs.
8/1/2019 17
What Who
How Why
What task am I trying to
solve?
Can I solve it via other
means: buy the data,
label in house, use
less/noisier data etc.
Who
In general
• An open (‘unknown’) crowd
• Scale helps solve problem faster
• Some tasks will have time, location or skills constraints (hence,
smaller crowd, hence slower or costlier)
In our examples
• Parking
• People who are familiar with an urban area e.g., Open Street Map community, citizens
• Drivers using a SatNav
• Paid crowd workers
• Social media users
• Modal split
• Commuters, tourists, people using transport
8/1/2019 19
What Who
How Why
Who is my crowd?
How do I recruit
participants?
What are my
requirements?
Can I find volunteers?
Shall I use a crowdsourcing
platform?
How: Process
In general
• Many ways to implement tasks: specialized platforms, social media, extension of
existing system etc.
• Tasks broken down into smaller units, undertaken in parallel by different people
• Does not apply to all forms of crowdsourcing – sometimes the breakdown is part of the
solution!
• Does not apply to creative tasks, underexplored problem spaces etc.
• Task assignment to match skills, preferences, and contribution history
• Example: random assignment vs meritocracy vs full autonomy
• Explicit vs. implicit participation
• Affects motivation
• Partial or independent answers consolidated and aggregated into complete
solution
• Example: challenges (e.g., Netflix) vs aggregation (e.g., Wikipedia)
• Real-time answers
• Require alternative models and incentives
8/1/2019 21
What Who
How Why
How: Process
In our example - parking
1. Crowdsourcing platform: Virtual City Explorer tool using virtual
street imagery. Participants are paid.
2. Extension of existing system: SatNav prompting user to answer
questions about parking availability. Contributions could be
incentivised.
3. Data collection app: i-Log app launches challenges to collect
parking pictures in a city. Best pictures receive a prize.
8/1/2019 22
What Who
How Why
Virtual City Explorer
• Crowdsourcing platform for
urban auditing, developed at the
University of Southampton
• People explore a virtual city via
street imagery
• They solve small tasks against
micropayments
• VCE validates answers,
consolidates data and analyses
user behaviour to propose
optimisations
i-Log and QrowdLab
i-Log is an Android application developed at the University of
Trento used for people-centric sensing
QrowdLab is a citizen innovation lab set up in Trento to
engage with citizens on city matters
We need tools to connect with the citizens
We need data to understand patterns of
behaviour and collect missing data
We need feedback on how people interact with
the city and its infrastructure
How: Process
In our example – modal split
• Combination of machine learning classifier, citizen sensing and
labelled data collected via gamified challenges
8/1/2019 25
What Who
How Why
Where do I deploy
crowdsourcing? Do I need a
new system?
How do I allocate tasks to
people? Or do I let them
choose freely how to
contribute?
How do I deal with low quality
solutions? Can I recognise
good solutions easily?
Why: money, love or glory
Love and glory reduce costs
Money and glory make the
crowd move faster
27
Intrinsic vs extrinsic motivation
• Rewards/incentives influence motivation
Successful unpaid crowdsourcing is difficult to
predict or replicate
• Highly context-specific
• Not applicable to arbitrary tasks
Reward models often easier to study and
control (if performance can be reliably
measured)
• Not always easy to abstract from social
aspects (free-riding, social pressure)
• May undermine intrinsic motivation
What Who
How Why
Why
In our examples
Who benefits from the results?
Who owns the results?
How much effort does it require from the crowd?
Money
Different models: pay-per-time, pay-per-unit, winner-
takes-it-all
Define the rewards, analyse trade-offs accuracy vs.
costs, avoid spam
Love
OpenStreetMap, games, citizen panels
Glory
Competitions, awards
Why would anyone care to
contribute?
Is the task intrinsically
rewarding?
What would motivate people
to participate?
How do I sustain participation?
Leveraging the human factor
The most sophisticated AI systems showcase ingenious
combinations of human and machine intelligence
Crowdsourcing can augment any aspect of the data value
chain
Our methodology can help organisations understand how
to use crowdsourcing effectively
Qrowd develops a platform with integrated crowdsourcing
support to deploy hybrid data collection and analysis
workflows
Further reading
• Qrowd project: qrowd-project.eu, @QrowdProject
• Figure Eight: figure-eight.com
• How to use crowdsourcing effectively, Simperl, E. (2015):
https://www.liberquarterly.eu/articles/10.18352/lq.9948/
• When computers were human, David Alan Grier, 2007
• The collective intelligence genome, Malone, T. W., Laubacher, R., &
Dellarocas, C. (2010). MIT Sloan Management Review, 51(3), 21.
• Getting Results from Crowds: The Definitive Guide to Using
Crowdsourcing to Grow, Dawson, R. and Bynghall, S. (2011).
Advanced Human Technologies

Mais conteúdo relacionado

Mais procurados

CIRRE keynote: Stakeholders in Positive Energy District Development – +CityxC...
CIRRE keynote: Stakeholders in Positive Energy District Development – +CityxC...CIRRE keynote: Stakeholders in Positive Energy District Development – +CityxC...
CIRRE keynote: Stakeholders in Positive Energy District Development – +CityxC...Dirk Ahlers
 
Service orientated development in public sector
Service orientated development in public sectorService orientated development in public sector
Service orientated development in public sectorRisto Hinno
 
FIWARE Global Summit - Digital Service Infrastructure for the EU Digital Sing...
FIWARE Global Summit - Digital Service Infrastructure for the EU Digital Sing...FIWARE Global Summit - Digital Service Infrastructure for the EU Digital Sing...
FIWARE Global Summit - Digital Service Infrastructure for the EU Digital Sing...FIWARE
 
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...European Data Forum
 
Putting Cities on the Map: Innovative Geospatial Initiative Win’s Europe’s Di...
Putting Cities on the Map: Innovative Geospatial Initiative Win’s Europe’s Di...Putting Cities on the Map: Innovative Geospatial Initiative Win’s Europe’s Di...
Putting Cities on the Map: Innovative Geospatial Initiative Win’s Europe’s Di...plan4all
 
Collection Methodology for Key Performance Indicators for Smart Sustainable C...
Collection Methodology for Key Performance Indicators for Smart Sustainable C...Collection Methodology for Key Performance Indicators for Smart Sustainable C...
Collection Methodology for Key Performance Indicators for Smart Sustainable C...ITU
 
Open Data Governance as an Integral Part of a Smart City: How to start with?
Open Data Governance as an Integral Part of a Smart City: How to start with?Open Data Governance as an Integral Part of a Smart City: How to start with?
Open Data Governance as an Integral Part of a Smart City: How to start with?Open Knowledge Belgium
 
D-CENT project presentation
D-CENT project presentationD-CENT project presentation
D-CENT project presentationdcentproject
 
Open Belgium 5-star linked open data address registry
Open Belgium 5-star linked open data address registryOpen Belgium 5-star linked open data address registry
Open Belgium 5-star linked open data address registryRaf Buyle
 
Commissione Europea Dg Infso Carmela Asero
Commissione Europea Dg Infso   Carmela AseroCommissione Europea Dg Infso   Carmela Asero
Commissione Europea Dg Infso Carmela AseroMarilina Asero
 
Growing Your Strategic Capability
Growing Your Strategic CapabilityGrowing Your Strategic Capability
Growing Your Strategic CapabilityThe Concept Store
 
EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...
EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...
EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...European Data Forum
 
SC4 Hangout 1: Big data europe transport webinar Philippe Crist
SC4 Hangout 1: Big data europe   transport webinar Philippe CristSC4 Hangout 1: Big data europe   transport webinar Philippe Crist
SC4 Hangout 1: Big data europe transport webinar Philippe CristBigData_Europe
 
EDF2014: Talk of European Data Innovator Award Winner: Johann Mittheisz, form...
EDF2014: Talk of European Data Innovator Award Winner: Johann Mittheisz, form...EDF2014: Talk of European Data Innovator Award Winner: Johann Mittheisz, form...
EDF2014: Talk of European Data Innovator Award Winner: Johann Mittheisz, form...European Data Forum
 
EDF2014: Talk of Ioannis Kotsiopoulos, European Dynamics: Semantics – Interop...
EDF2014: Talk of Ioannis Kotsiopoulos, European Dynamics: Semantics – Interop...EDF2014: Talk of Ioannis Kotsiopoulos, European Dynamics: Semantics – Interop...
EDF2014: Talk of Ioannis Kotsiopoulos, European Dynamics: Semantics – Interop...European Data Forum
 
Co-creating data value chains with the public sector
 Co-creating data value chains with the public sector Co-creating data value chains with the public sector
Co-creating data value chains with the public sectorBig Data Value Association
 

Mais procurados (20)

C³PO Project Leaflet
C³PO Project LeafletC³PO Project Leaflet
C³PO Project Leaflet
 
Barbato leit ict 15-16-17
Barbato leit ict 15-16-17Barbato leit ict 15-16-17
Barbato leit ict 15-16-17
 
CIRRE keynote: Stakeholders in Positive Energy District Development – +CityxC...
CIRRE keynote: Stakeholders in Positive Energy District Development – +CityxC...CIRRE keynote: Stakeholders in Positive Energy District Development – +CityxC...
CIRRE keynote: Stakeholders in Positive Energy District Development – +CityxC...
 
Service orientated development in public sector
Service orientated development in public sectorService orientated development in public sector
Service orientated development in public sector
 
FIWARE Global Summit - Digital Service Infrastructure for the EU Digital Sing...
FIWARE Global Summit - Digital Service Infrastructure for the EU Digital Sing...FIWARE Global Summit - Digital Service Infrastructure for the EU Digital Sing...
FIWARE Global Summit - Digital Service Infrastructure for the EU Digital Sing...
 
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
 
Putting Cities on the Map: Innovative Geospatial Initiative Win’s Europe’s Di...
Putting Cities on the Map: Innovative Geospatial Initiative Win’s Europe’s Di...Putting Cities on the Map: Innovative Geospatial Initiative Win’s Europe’s Di...
Putting Cities on the Map: Innovative Geospatial Initiative Win’s Europe’s Di...
 
Membership Intro Presentation
Membership Intro PresentationMembership Intro Presentation
Membership Intro Presentation
 
Collection Methodology for Key Performance Indicators for Smart Sustainable C...
Collection Methodology for Key Performance Indicators for Smart Sustainable C...Collection Methodology for Key Performance Indicators for Smart Sustainable C...
Collection Methodology for Key Performance Indicators for Smart Sustainable C...
 
Open Data Governance as an Integral Part of a Smart City: How to start with?
Open Data Governance as an Integral Part of a Smart City: How to start with?Open Data Governance as an Integral Part of a Smart City: How to start with?
Open Data Governance as an Integral Part of a Smart City: How to start with?
 
D-CENT project presentation
D-CENT project presentationD-CENT project presentation
D-CENT project presentation
 
Open Belgium 5-star linked open data address registry
Open Belgium 5-star linked open data address registryOpen Belgium 5-star linked open data address registry
Open Belgium 5-star linked open data address registry
 
Commissione Europea Dg Infso Carmela Asero
Commissione Europea Dg Infso   Carmela AseroCommissione Europea Dg Infso   Carmela Asero
Commissione Europea Dg Infso Carmela Asero
 
Growing Your Strategic Capability
Growing Your Strategic CapabilityGrowing Your Strategic Capability
Growing Your Strategic Capability
 
EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...
EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...
EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...
 
Financial Services
Financial ServicesFinancial Services
Financial Services
 
SC4 Hangout 1: Big data europe transport webinar Philippe Crist
SC4 Hangout 1: Big data europe   transport webinar Philippe CristSC4 Hangout 1: Big data europe   transport webinar Philippe Crist
SC4 Hangout 1: Big data europe transport webinar Philippe Crist
 
EDF2014: Talk of European Data Innovator Award Winner: Johann Mittheisz, form...
EDF2014: Talk of European Data Innovator Award Winner: Johann Mittheisz, form...EDF2014: Talk of European Data Innovator Award Winner: Johann Mittheisz, form...
EDF2014: Talk of European Data Innovator Award Winner: Johann Mittheisz, form...
 
EDF2014: Talk of Ioannis Kotsiopoulos, European Dynamics: Semantics – Interop...
EDF2014: Talk of Ioannis Kotsiopoulos, European Dynamics: Semantics – Interop...EDF2014: Talk of Ioannis Kotsiopoulos, European Dynamics: Semantics – Interop...
EDF2014: Talk of Ioannis Kotsiopoulos, European Dynamics: Semantics – Interop...
 
Co-creating data value chains with the public sector
 Co-creating data value chains with the public sector Co-creating data value chains with the public sector
Co-creating data value chains with the public sector
 

Semelhante a Human factor in big data qrowd bdve

BDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big DataBDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big DataBig Data Value Association
 
BDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big DataBDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big DataBig Data Value Association
 
Crowdsourcing and citizen engagement for people-centric smart cities
Crowdsourcing and citizen engagement for people-centric smart citiesCrowdsourcing and citizen engagement for people-centric smart cities
Crowdsourcing and citizen engagement for people-centric smart citiesElena Simperl
 
Smart Cities? Smart Citizens!
Smart Cities? Smart Citizens!Smart Cities? Smart Citizens!
Smart Cities? Smart Citizens!Frank Kresin
 
Data Days: Citadel pilots results
Data Days: Citadel pilots resultsData Days: Citadel pilots results
Data Days: Citadel pilots resultsSarahBuelens
 
Digital Vision for CALP
Digital Vision for CALPDigital Vision for CALP
Digital Vision for CALPtaipida
 
"Developments in Accessibility of Information" - Access Israel 's 6th Annual ...
"Developments in Accessibility of Information" - Access Israel 's 6th Annual ..."Developments in Accessibility of Information" - Access Israel 's 6th Annual ...
"Developments in Accessibility of Information" - Access Israel 's 6th Annual ...Ricardo Garcia Bahamonde
 
Revenue models of personal data platform operators
Revenue models of personal data platform operatorsRevenue models of personal data platform operators
Revenue models of personal data platform operatorsLaura Kemppainen
 
The Purdue IronHacks
The Purdue IronHacksThe Purdue IronHacks
The Purdue IronHacksPurdue RCODI
 
Community solutions lab
Community solutions labCommunity solutions lab
Community solutions labGabe Sawhney
 
Locus Charter Presentation
Locus Charter Presentation Locus Charter Presentation
Locus Charter Presentation Suchith Anand
 
Minne analytics presentation 2018 12 03 final compressed
Minne analytics presentation 2018 12 03 final   compressedMinne analytics presentation 2018 12 03 final   compressed
Minne analytics presentation 2018 12 03 final compressedBonnie Holub
 
Open Mobility - the case for CitySDK
Open Mobility - the case for CitySDKOpen Mobility - the case for CitySDK
Open Mobility - the case for CitySDKFrank Kresin
 
London data and digital masterclass for councillors slides 14-Feb-20
London data and digital masterclass for councillors slides 14-Feb-20London data and digital masterclass for councillors slides 14-Feb-20
London data and digital masterclass for councillors slides 14-Feb-20LG Inform Plus
 
Engaging citizens in the future of mobility
Engaging citizens in the future of mobilityEngaging citizens in the future of mobility
Engaging citizens in the future of mobilityMobility Lab UK
 
Managing Change: Transformation for Productive Public Services 6/12/2016
Managing Change: Transformation for Productive Public Services 6/12/2016Managing Change: Transformation for Productive Public Services 6/12/2016
Managing Change: Transformation for Productive Public Services 6/12/2016mckenln
 
ICT for Local Government - better service delivery
ICT for Local Government - better service deliveryICT for Local Government - better service delivery
ICT for Local Government - better service deliveryAllison Hornery
 

Semelhante a Human factor in big data qrowd bdve (20)

BDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big DataBDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big Data
 
BDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big DataBDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big Data
 
Crowdsourcing and citizen engagement for people-centric smart cities
Crowdsourcing and citizen engagement for people-centric smart citiesCrowdsourcing and citizen engagement for people-centric smart cities
Crowdsourcing and citizen engagement for people-centric smart cities
 
Smart Cities? Smart Citizens!
Smart Cities? Smart Citizens!Smart Cities? Smart Citizens!
Smart Cities? Smart Citizens!
 
Data Days: Citadel pilots results
Data Days: Citadel pilots resultsData Days: Citadel pilots results
Data Days: Citadel pilots results
 
Digital Vision for CALP
Digital Vision for CALPDigital Vision for CALP
Digital Vision for CALP
 
"Developments in Accessibility of Information" - Access Israel 's 6th Annual ...
"Developments in Accessibility of Information" - Access Israel 's 6th Annual ..."Developments in Accessibility of Information" - Access Israel 's 6th Annual ...
"Developments in Accessibility of Information" - Access Israel 's 6th Annual ...
 
Revenue models of personal data platform operators
Revenue models of personal data platform operatorsRevenue models of personal data platform operators
Revenue models of personal data platform operators
 
CTDC Ecosystem Mapping Guide
CTDC Ecosystem Mapping Guide  CTDC Ecosystem Mapping Guide
CTDC Ecosystem Mapping Guide
 
The Purdue IronHacks
The Purdue IronHacksThe Purdue IronHacks
The Purdue IronHacks
 
Community solutions lab
Community solutions labCommunity solutions lab
Community solutions lab
 
Locus Charter Presentation
Locus Charter Presentation Locus Charter Presentation
Locus Charter Presentation
 
Minne analytics presentation 2018 12 03 final compressed
Minne analytics presentation 2018 12 03 final   compressedMinne analytics presentation 2018 12 03 final   compressed
Minne analytics presentation 2018 12 03 final compressed
 
Open Mobility - the case for CitySDK
Open Mobility - the case for CitySDKOpen Mobility - the case for CitySDK
Open Mobility - the case for CitySDK
 
London data and digital masterclass for councillors slides 14-Feb-20
London data and digital masterclass for councillors slides 14-Feb-20London data and digital masterclass for councillors slides 14-Feb-20
London data and digital masterclass for councillors slides 14-Feb-20
 
Engaging citizens in the future of mobility
Engaging citizens in the future of mobilityEngaging citizens in the future of mobility
Engaging citizens in the future of mobility
 
Managing Change: Transformation for Productive Public Services 6/12/2016
Managing Change: Transformation for Productive Public Services 6/12/2016Managing Change: Transformation for Productive Public Services 6/12/2016
Managing Change: Transformation for Productive Public Services 6/12/2016
 
What is Crowdsourcing - Nicola Osborne
What is Crowdsourcing - Nicola OsborneWhat is Crowdsourcing - Nicola Osborne
What is Crowdsourcing - Nicola Osborne
 
Week2 chapters1 3
Week2 chapters1 3Week2 chapters1 3
Week2 chapters1 3
 
ICT for Local Government - better service delivery
ICT for Local Government - better service deliveryICT for Local Government - better service delivery
ICT for Local Government - better service delivery
 

Mais de Luis Daniel Ibáñez

Qrowd and thecity_connectedsmartcities_2019
Qrowd and thecity_connectedsmartcities_2019Qrowd and thecity_connectedsmartcities_2019
Qrowd and thecity_connectedsmartcities_2019Luis Daniel Ibáñez
 
Simplifying maps tomtomgeomonday_2019
Simplifying maps tomtomgeomonday_2019Simplifying maps tomtomgeomonday_2019
Simplifying maps tomtomgeomonday_2019Luis Daniel Ibáñez
 
Hybrid workflows aw4city-webconf-2019
Hybrid workflows aw4city-webconf-2019Hybrid workflows aw4city-webconf-2019
Hybrid workflows aw4city-webconf-2019Luis Daniel Ibáñez
 
Inclusive cities thecityofthefuture_cardiff_2018
Inclusive cities thecityofthefuture_cardiff_2018Inclusive cities thecityofthefuture_cardiff_2018
Inclusive cities thecityofthefuture_cardiff_2018Luis Daniel Ibáñez
 
Human factorsession introduction-bdv-forum-sofia
Human factorsession introduction-bdv-forum-sofiaHuman factorsession introduction-bdv-forum-sofia
Human factorsession introduction-bdv-forum-sofiaLuis Daniel Ibáñez
 
Qrowd human factorsession-bdv-forum-2018
Qrowd human factorsession-bdv-forum-2018Qrowd human factorsession-bdv-forum-2018
Qrowd human factorsession-bdv-forum-2018Luis Daniel Ibáñez
 
Qrowd transport session-bdv-forum-2018
Qrowd transport session-bdv-forum-2018Qrowd transport session-bdv-forum-2018
Qrowd transport session-bdv-forum-2018Luis Daniel Ibáñez
 
Smart modalsplit smarmobilitycongress_2018
Smart modalsplit smarmobilitycongress_2018Smart modalsplit smarmobilitycongress_2018
Smart modalsplit smarmobilitycongress_2018Luis Daniel Ibáñez
 
LiveLinkedData - TransWebData - Nantes 2013
LiveLinkedData - TransWebData - Nantes 2013LiveLinkedData - TransWebData - Nantes 2013
LiveLinkedData - TransWebData - Nantes 2013Luis Daniel Ibáñez
 

Mais de Luis Daniel Ibáñez (11)

Qrowd and thecity_connectedsmartcities_2019
Qrowd and thecity_connectedsmartcities_2019Qrowd and thecity_connectedsmartcities_2019
Qrowd and thecity_connectedsmartcities_2019
 
Simplifying maps tomtomgeomonday_2019
Simplifying maps tomtomgeomonday_2019Simplifying maps tomtomgeomonday_2019
Simplifying maps tomtomgeomonday_2019
 
Qrowd ppp week-2018
Qrowd ppp week-2018Qrowd ppp week-2018
Qrowd ppp week-2018
 
Hybrid workflows aw4city-webconf-2019
Hybrid workflows aw4city-webconf-2019Hybrid workflows aw4city-webconf-2019
Hybrid workflows aw4city-webconf-2019
 
Inclusive cities thecityofthefuture_cardiff_2018
Inclusive cities thecityofthefuture_cardiff_2018Inclusive cities thecityofthefuture_cardiff_2018
Inclusive cities thecityofthefuture_cardiff_2018
 
Human factorsession introduction-bdv-forum-sofia
Human factorsession introduction-bdv-forum-sofiaHuman factorsession introduction-bdv-forum-sofia
Human factorsession introduction-bdv-forum-sofia
 
Qrowd human factorsession-bdv-forum-2018
Qrowd human factorsession-bdv-forum-2018Qrowd human factorsession-bdv-forum-2018
Qrowd human factorsession-bdv-forum-2018
 
Qrowd transport session-bdv-forum-2018
Qrowd transport session-bdv-forum-2018Qrowd transport session-bdv-forum-2018
Qrowd transport session-bdv-forum-2018
 
Qrowd overview-solothurn-2018
Qrowd overview-solothurn-2018Qrowd overview-solothurn-2018
Qrowd overview-solothurn-2018
 
Smart modalsplit smarmobilitycongress_2018
Smart modalsplit smarmobilitycongress_2018Smart modalsplit smarmobilitycongress_2018
Smart modalsplit smarmobilitycongress_2018
 
LiveLinkedData - TransWebData - Nantes 2013
LiveLinkedData - TransWebData - Nantes 2013LiveLinkedData - TransWebData - Nantes 2013
LiveLinkedData - TransWebData - Nantes 2013
 

Último

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 

Último (20)

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 

Human factor in big data qrowd bdve

  • 1. The human factor in big data BDVe webinar series November 6th 2018 Elena Simperl, University of Southampton, UK @esimperl
  • 2. Volume Veracity Velocity Variety Big data • Data value chains as driver for growth and change • Transformative impact leading to new infrastructure, businesses, politics and social interactions • Created, refined, valued and exchanged unlike any other resources • Alters the rules for markets and demands new approaches from regulators The data economy
  • 3. Example: Disrupting transport Smart cities have access to more data than ever to inform policy and service design Driverless cars, electrification and connectivity are transforming the automotive industry Machine learning and AI can help optimise traffic, support future planning and improve fuel efficiencies
  • 4. Challenges Data availability • Collecting missing data • Labelling data to train and validate algorithms • Improving data quality • Integrating across sources Data use • Making decisions inclusively • Enabling the free flow of data • Innovating responsibly Many of these tasks are automated, but technology has limitations Legal, economic, social, ethical implications
  • 5.
  • 6. More and better data Training and validating algorithms Engaging and empowering citizens, customers etc. The human factor in big data
  • 7. Approaches Citizen sensing Urban auditing Participatory democracy Open innovation Crowdsourcing Human in the loop
  • 9. Organisations struggle to leverage the human factor What form of crowdsourcing to choose? How to engage with the crowd? Why would the crowd care? How do we control the quality? Does it need to be in real- time? Can we afford it at scale?
  • 10. Qrowd Innovation action, part of the Big Data Value PPP Started in December 2016, 3 years, 3.9M € 8 partners from 5 European countries, coordinated by the University of Southampton Smart city solutions Combining crowd and computational intelligence Piloted in transportation with A medium-sized smart city A leading navigation and traffic management service provider
  • 11. Enabling data value chains Standards compliant, interoperable, open, no vendor lock-in Leverages existing technology stacks Used by industry partners Extendable and scalable to adapt to new urban contexts Platform for data and process (data flow) integration
  • 12. The human factor in Qrowd Mix of open innovation methods to co-design pilots and encourage stakeholder participation Value-centric approach to platform design: personal data empowerment, open source, building upon existing standards Sustainable urban auditing through online and mobile crowdsourcing Human-in-the-loop (HIL) architecture to improve the accuracy of predictions
  • 13. More than just technology Supports deployment of human-machine workflows throughout Interfaces to multiple crowdsourcing services Complemented by methodology and guidelines Data protection by design
  • 14. The ‘what, who, how, why’ methodology 14 What • Tasks you can’t complete in-house or using computers • A question of time, budget, resources, ethics etc. Who • Crowdsourcing ≠‘turkers’ • Open call, biased via choice of platforms and promotion channels • No traditional means to manage and incentivize • Crowd has often little to no context about the project How • Macro vs. microtasks • Complex workflows • Assessment and aggregation • Timeliness of results Why • Different crowds with different motivations • Incentives influence motivations • Aligning incentives
  • 15. Using the methodology Who is it for • Organisations interested in increasing participation via crowdsourcing • Technology providers implementing HIL architectures How can it be used • Provides a process model starting with the What, followed by the Who, which then determine the How. Every What/Who/How decision impacts on the Why • Can be used with or without the Qrowd platform • Helps specify goals and decide what forms of crowdsourcing to use • Helps roll out crowdsourcing projects and use their results effectively • Helps understand motivations and incentives and their role in successful projects
  • 16. Examples Urban auditing: Collect up to date information about parking spaces in a city Modal split: Collecting training data to predict the use of different means of transport
  • 17. What In general • Something you cannot do using traditional means or that requires broader engagement • Something you cannot do (fully) automatically – a data collection or analysis task In our examples • Parking: We need a dataset with all parking spaces in a city (alternatively: parking availability). Traditional surveys too costly. • Modal split: We need trips involving different means of transport and labels for each trip segment. This data is not available and is needed to train AIs. 8/1/2019 17 What Who How Why
  • 18. What task am I trying to solve? Can I solve it via other means: buy the data, label in house, use less/noisier data etc.
  • 19. Who In general • An open (‘unknown’) crowd • Scale helps solve problem faster • Some tasks will have time, location or skills constraints (hence, smaller crowd, hence slower or costlier) In our examples • Parking • People who are familiar with an urban area e.g., Open Street Map community, citizens • Drivers using a SatNav • Paid crowd workers • Social media users • Modal split • Commuters, tourists, people using transport 8/1/2019 19 What Who How Why
  • 20. Who is my crowd? How do I recruit participants? What are my requirements? Can I find volunteers? Shall I use a crowdsourcing platform?
  • 21. How: Process In general • Many ways to implement tasks: specialized platforms, social media, extension of existing system etc. • Tasks broken down into smaller units, undertaken in parallel by different people • Does not apply to all forms of crowdsourcing – sometimes the breakdown is part of the solution! • Does not apply to creative tasks, underexplored problem spaces etc. • Task assignment to match skills, preferences, and contribution history • Example: random assignment vs meritocracy vs full autonomy • Explicit vs. implicit participation • Affects motivation • Partial or independent answers consolidated and aggregated into complete solution • Example: challenges (e.g., Netflix) vs aggregation (e.g., Wikipedia) • Real-time answers • Require alternative models and incentives 8/1/2019 21 What Who How Why
  • 22. How: Process In our example - parking 1. Crowdsourcing platform: Virtual City Explorer tool using virtual street imagery. Participants are paid. 2. Extension of existing system: SatNav prompting user to answer questions about parking availability. Contributions could be incentivised. 3. Data collection app: i-Log app launches challenges to collect parking pictures in a city. Best pictures receive a prize. 8/1/2019 22 What Who How Why
  • 23. Virtual City Explorer • Crowdsourcing platform for urban auditing, developed at the University of Southampton • People explore a virtual city via street imagery • They solve small tasks against micropayments • VCE validates answers, consolidates data and analyses user behaviour to propose optimisations
  • 24. i-Log and QrowdLab i-Log is an Android application developed at the University of Trento used for people-centric sensing QrowdLab is a citizen innovation lab set up in Trento to engage with citizens on city matters We need tools to connect with the citizens We need data to understand patterns of behaviour and collect missing data We need feedback on how people interact with the city and its infrastructure
  • 25. How: Process In our example – modal split • Combination of machine learning classifier, citizen sensing and labelled data collected via gamified challenges 8/1/2019 25 What Who How Why
  • 26. Where do I deploy crowdsourcing? Do I need a new system? How do I allocate tasks to people? Or do I let them choose freely how to contribute? How do I deal with low quality solutions? Can I recognise good solutions easily?
  • 27. Why: money, love or glory Love and glory reduce costs Money and glory make the crowd move faster 27 Intrinsic vs extrinsic motivation • Rewards/incentives influence motivation Successful unpaid crowdsourcing is difficult to predict or replicate • Highly context-specific • Not applicable to arbitrary tasks Reward models often easier to study and control (if performance can be reliably measured) • Not always easy to abstract from social aspects (free-riding, social pressure) • May undermine intrinsic motivation What Who How Why
  • 28. Why In our examples Who benefits from the results? Who owns the results? How much effort does it require from the crowd? Money Different models: pay-per-time, pay-per-unit, winner- takes-it-all Define the rewards, analyse trade-offs accuracy vs. costs, avoid spam Love OpenStreetMap, games, citizen panels Glory Competitions, awards
  • 29. Why would anyone care to contribute? Is the task intrinsically rewarding? What would motivate people to participate? How do I sustain participation?
  • 30. Leveraging the human factor The most sophisticated AI systems showcase ingenious combinations of human and machine intelligence Crowdsourcing can augment any aspect of the data value chain Our methodology can help organisations understand how to use crowdsourcing effectively Qrowd develops a platform with integrated crowdsourcing support to deploy hybrid data collection and analysis workflows
  • 31. Further reading • Qrowd project: qrowd-project.eu, @QrowdProject • Figure Eight: figure-eight.com • How to use crowdsourcing effectively, Simperl, E. (2015): https://www.liberquarterly.eu/articles/10.18352/lq.9948/ • When computers were human, David Alan Grier, 2007 • The collective intelligence genome, Malone, T. W., Laubacher, R., & Dellarocas, C. (2010). MIT Sloan Management Review, 51(3), 21. • Getting Results from Crowds: The Definitive Guide to Using Crowdsourcing to Grow, Dawson, R. and Bynghall, S. (2011). Advanced Human Technologies