SlideShare uma empresa Scribd logo
1 de 67
Data Matters
Alan Dix
Talis & University of Birmingham
http://alandix.com/ref2014/
University of
Birmingham
Tiree
Tiree Tech Wave
22-26 October 2015
today I am not talking about …
• intelligent internet interfaces
• visualisation and sampling
• situated displays, eCampus,
small device – large display interactions
• fun and games, virtual crackers,
artistic performance, slow time
• creativity and Bad Ideas
• modelling dreams and regret
and the emergence of self
…
… or even lots of lights
http:/www.hcibook.com/alan/projects/firefly/
I am talking about ...
REF data analysis
long tail of small data
REF
REF 2014
Research Excellence Framework
approx 5 yearly research assessment in the UK
not just about the UK …
lots of countries thinking to do similar
... and looking to REF as example
REF elements
three elements:
outputs (mainly papers)
impact
environment
focus of
this work
REF panels
4 main panels, 36 sub-panels, ~200K outputs
sub-panel 11: computer science and informatics
I was on this panel
but NO confidential data here
everything public domain
REF profiles
every output graded: 4* / 3* / 2* / 1*
individual grades confidential and destroyed
each ‘Unit of Assessment’ (dept) given a profile
http://results.ref.ac.uk/Results/ByUoa/11/Outputs
sub-area profiles
N.B. computing only
each output given ACM code
originally to enable allocation to panelists
… but, also used to create sub-area profiles …
sub-area profiles
From Morris Sloman’s slides & panel report
theoretical areas
30-40% 4*
applied/human areas
10-20% 4*
data not information
sub-panel report warning:
"These data should be treated with circumspection …
however already affecting institutional policy
hiring, internal investment
… and may influence research council policy
possible reasons for variation …
1. best applied work is weak
– including HCI :-/
2. long tail
– weak researchers choose applied areas
3. latent bias
– despite panel’s efforts to be fair
can bibliometrics disentangle these?
metrics and assessment
citation metrics known to be good
post-hoc correlates of sophisticated measures
… but not for individuals and small cohorts
and danger of gaming and policy distortion
suitable for verifying large-scale patterns
(and HEFCE using them for this)
data used for analysis
all in public domain
(virtually) complete list of outputs:
– excluding a few confidential ones
– for each: name, doi, ACM topic area, Scopus citations
Google scholar citations for each
– gathered after REF (not used in assessment)
UoA and sub-area profiles
metrics used
Scopus (late 2013 census )
– with/without 2012/13 as few citations
‘Normalised Scopus’
– using ‘contextual data’, corrects for
different citation patterns between areas
– places output in top 1%, 5%, 10% of its area worldwide
Google Scholar (late 2014 census)
– with/without 2012/13; zero treated as zero/missing
seven variants – all give similar results
results … massive differences
% citations in
top quartile
% REF 4* ratio
winners
losers
‘scatter’ graph
% outputs in top quartile for citations
% outputs
awarded
REF 4*
rank scores
winners
losers
diagram thanks to Andrew Howes
Another way of looking at it …
world ranking within own field
recall REF …
for example,
HCI research (web similar) …
on average …
• HCI/CSCW paper needs to be in top 0.5%
worldwide to get 4*
• logic/algorithms paper just needs to be in top 5%
10 fold difference
and just as you thought it was all over …
… institutional effects
look at +/- 25% REF compared with citations
N.B. use high-end weighted measure as money is
focused (4:1:0:0)
of 35 losers, 25 are post-1992 universities
of 17 winners, 16 are pre-1992 universities
an example …
XXXXXXX – a new university
YYYYYYYY – an old university
World Rankings
REF
and Gender?
Female authors in main panel B were significantly less likely
to achieve a 4* output than male authors with the same
metrics ratings. When considered in the UOA models,
women were significantly less likely to have 4*
outputs than men whilst controlling for metric
scores in the following UOAs: Psychology, Psychiatry
and Neuroscience; Computer Science and
Informatics; Architecture, Built Environment and Planning;
Economics and Econometrics.
The Metric Tide (HEFCE, 2015)
implicit bias?
HEFCE analysis:
male staff in computing is 1/3 more likely to get
a 4* than female
areas and types institutions disadvantaged by REF
often those with more women
… implications for future recruitment?
future for research assessment?
• pure metrics?
• metrics as part (e.g. older outputs)
• metrics as under-girding (burden of proof)
• human process – metrics for in-process feedback
..
long tail of small data
Big Data
everyone is talking about it
Twitter, Google, Facebook, NSA,
universities, … and funding
Big Data does it with MapReduce
Semantic Data does it with RDF
the long tail
size of
data set
a few very large data sets
e.g. Twitter, streams,
Open Govt., OS,
geonames, dbpedia the small data of ordinary life:
from local bus timetables
to squash club league tables
stories of small data …
Walking Wales
Learning analytics
Open Data Islands and Communities
Musicology
Alan Walks Wales
1058 miles (1700km)
3 million footfalls
3 ½ months
April-July 2013
focus on IT at the margins
one thousand miles of poetry, technology and community
vision
personal
encircling, encompassing, pilgrimage, homecoming,
practical
IT for the walker & IT for local communities
philosophical
reflections on walking and space, locality and identity
research
personal agenda and living lab
lots of
data
data
location
GPX ... batteries ... sporadic signals ....
bio-sensing
ECG (heart), EDA (skin) and accelerometers
audio and images
in the moment
text
after the event
implicit
explicit
The largest ECG trace
in the public domain
challenges (1)
location
GPX – merging and mending
bio-sensing
ECG & EDA – special formats & volume
audio and images
volume, transcription and annotation
text
semantic markup, synchronising sources
challenges (2)
documentation
methodology of creation, data formats
for other people to use!
meta-data
for machines to use
PR
telling the world about it!
academic culture
we do not value data!
an offer
multiple synchronisable data streams
largest public domain ECG trace
post-hoc analysis
simulate real use
please use it!
Learning analytics
macro-analytics
university strategy
MOOCs
micro-analytics
individual course,
student,
resource
time frames for learning analytics
days and hours
email, during lectures and labs, stduent meetings, gaps
week
preparing for teaching, exercises
months/mid-semester
reporting points, staff meetings, cohort/student progress
end of semester/term/year
exams, exam boards, course revew,
start of semester/term/year
preparing for new courses or re-runs, rollover!
years
new courses, professional development, appraisal, promotion
Open Data
everyone is doing it
Governments, Cities, local gov.
In C21 Data is Power
why not an island?
island data flows
Community
groups and individuals
rest of
the world
other
communities
1
2
3
4
island data flows
from community to world
Community
groups and individuals
rest of
the world
1
• visibility and
control
• identity and
empowerment
• level of detail
• local knowledge
island data flows
from world to community
Community
groups and individuals
rest of
the world
2 • making the
most
of open data
• local decision
making
• lobbying and
negotiation
island data flows
within the community
Community
groups and individuals
3
• gossip is not enough!
• sparse, dispersed population
• social cohesion and economic benefits
island data flows
between communities
Community
groups and individuals
other
communities
4
• sharing best practice
• brand presence
• interlinked data
benefits to …
the community
empowerment and control
availability of information
communication within and between communities
the world
improved quality of data
level of detail of data
local knowledge and understanding
In Concert
Concert ephemera
1750–1800 Calendar of London Concerts
1815–1895 Concert Life in London
1894–1944 Concert Programme Exchange (BL)
External sources
MusicBrainz
MBz id as connect into Linked Data, BBC, etc.
Authoritative sources (future)
e.g. British Library BNB, Concert Programmes metadata
concert database
classic digital humanities?
original
sources
selected
sources
systematic
sample
transcription
& extraction
(medium expertise)
interpretation
(high expertise)
digitised
sources
authoritative
data
analysis & use
(high expertise)
academic
publication
large digital
archive
(e.g. BBC)
possibly
create
linkage
Barriers to progress
effort and expertise
authority and quality
digital acontextuality
openness
Openness and Reward
Career development
Leverhulme & REF
Building the discipline?
Re-envisioning the Digital Archive:
Curation and Use
big bang to incremental
digitised
sources
authoritative
data
academic
publication
...
big bang to incremental
problem focused augmentation
transform cost-benefit
digitial
archive
academic
publications
...
partial
enhancement
& interpretation
scenario-focused investigations
=> reflection and requirements
digital symbiosis
suggestion and confirmation
provenance and authority
spreadsheet as user interface
semantics through interaction
themes and take-aways ...
data in context
heterogeneity and linking
value and values
ethics and empowerment
…. and please use my data 
Data matters-bournemouth-2015

Mais conteúdo relacionado

Mais procurados

When is a digital link a network edge? Exploring ways to construct social net...
When is a digital link a network edge? Exploring ways to construct social net...When is a digital link a network edge? Exploring ways to construct social net...
When is a digital link a network edge? Exploring ways to construct social net...
Derek Weber
 

Mais procurados (16)

Websci 2018
Websci 2018Websci 2018
Websci 2018
 
The Hidden Stories of Missing Data
The Hidden Stories of Missing DataThe Hidden Stories of Missing Data
The Hidden Stories of Missing Data
 
Advanced analytics for supporting public policy, bracketology, and beyond!
Advanced analytics for supporting public policy, bracketology, and beyond!Advanced analytics for supporting public policy, bracketology, and beyond!
Advanced analytics for supporting public policy, bracketology, and beyond!
 
Reputation Management for Early Career Researchers
Reputation Management for Early Career ResearchersReputation Management for Early Career Researchers
Reputation Management for Early Career Researchers
 
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
 
Gender, Representation and Online Participation: a Quantitative Study
Gender, Representation and Online Participation: a Quantitative StudyGender, Representation and Online Participation: a Quantitative Study
Gender, Representation and Online Participation: a Quantitative Study
 
How to write a CHI paper
How to write a CHI paperHow to write a CHI paper
How to write a CHI paper
 
Watson: An Academic's Perspective
Watson: An Academic's PerspectiveWatson: An Academic's Perspective
Watson: An Academic's Perspective
 
When is a digital link a network edge? Exploring ways to construct social net...
When is a digital link a network edge? Exploring ways to construct social net...When is a digital link a network edge? Exploring ways to construct social net...
When is a digital link a network edge? Exploring ways to construct social net...
 
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
 
Week13 ppt
Week13 pptWeek13 ppt
Week13 ppt
 
2-6-14 ESI Supplemental Webinar: The Data Information Literacy Project
2-6-14 ESI Supplemental Webinar: The Data Information  Literacy Project2-6-14 ESI Supplemental Webinar: The Data Information  Literacy Project
2-6-14 ESI Supplemental Webinar: The Data Information Literacy Project
 
Social barriers to digital scholarship for arts and humanities researchers
Social barriers to digital scholarship for arts and humanities researchersSocial barriers to digital scholarship for arts and humanities researchers
Social barriers to digital scholarship for arts and humanities researchers
 
Simulation in Social Sciences - Lecture 6 in Introduction to Computational S...
Simulation in Social Sciences -  Lecture 6 in Introduction to Computational S...Simulation in Social Sciences -  Lecture 6 in Introduction to Computational S...
Simulation in Social Sciences - Lecture 6 in Introduction to Computational S...
 
Social Media in Selected Australian Federal and State Election Campaigns, 201...
Social Media in Selected Australian Federal and State Election Campaigns, 201...Social Media in Selected Australian Federal and State Election Campaigns, 201...
Social Media in Selected Australian Federal and State Election Campaigns, 201...
 
Week13 ppt
Week13 pptWeek13 ppt
Week13 ppt
 

Destaque

Marketing - Promotion
Marketing - PromotionMarketing - Promotion
Marketing - Promotion
tutor2u
 

Destaque (17)

Ewrt1b class 11 post hw
Ewrt1b class 11  post hwEwrt1b class 11  post hw
Ewrt1b class 11 post hw
 
Circuit Theory 2: Filters Project Report
Circuit Theory 2: Filters Project ReportCircuit Theory 2: Filters Project Report
Circuit Theory 2: Filters Project Report
 
Habitos alimenticios
Habitos alimenticiosHabitos alimenticios
Habitos alimenticios
 
Frasan – Tiree Heritage App
Frasan – Tiree Heritage AppFrasan – Tiree Heritage App
Frasan – Tiree Heritage App
 
Ubs outlook 2016
Ubs outlook 2016Ubs outlook 2016
Ubs outlook 2016
 
Facilitation Skills
Facilitation SkillsFacilitation Skills
Facilitation Skills
 
Continuous delivery for Android
Continuous delivery for AndroidContinuous delivery for Android
Continuous delivery for Android
 
IN DUTCH - strategic benchmarking in the supply chain triangle 201601
IN DUTCH - strategic benchmarking in the supply chain triangle 201601IN DUTCH - strategic benchmarking in the supply chain triangle 201601
IN DUTCH - strategic benchmarking in the supply chain triangle 201601
 
Perakende Matematiği Eğitim Seti
Perakende Matematiği Eğitim Seti Perakende Matematiği Eğitim Seti
Perakende Matematiği Eğitim Seti
 
Havayolu Kargo Taşımacılığı
Havayolu Kargo TaşımacılığıHavayolu Kargo Taşımacılığı
Havayolu Kargo Taşımacılığı
 
Perakende Matematiği Eğitimi
Perakende Matematiği EğitimiPerakende Matematiği Eğitimi
Perakende Matematiği Eğitimi
 
SICSA : Open Data Islands and Communities
SICSA : Open Data Islands and CommunitiesSICSA : Open Data Islands and Communities
SICSA : Open Data Islands and Communities
 
Perakende Matematiği Eğitim Seti
Perakende Matematiği Eğitim Seti Perakende Matematiği Eğitim Seti
Perakende Matematiği Eğitim Seti
 
PunkED ipadpalooza 2016
PunkED  ipadpalooza 2016PunkED  ipadpalooza 2016
PunkED ipadpalooza 2016
 
ElegantJ BI Overview
ElegantJ BI OverviewElegantJ BI Overview
ElegantJ BI Overview
 
Decision trees for machine learning
Decision trees for machine learningDecision trees for machine learning
Decision trees for machine learning
 
Marketing - Promotion
Marketing - PromotionMarketing - Promotion
Marketing - Promotion
 

Semelhante a Data matters-bournemouth-2015

David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...
David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...
David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...
michellep
 
Strategies-Developing-Deploying-FOSS
Strategies-Developing-Deploying-FOSSStrategies-Developing-Deploying-FOSS
Strategies-Developing-Deploying-FOSS
webuploader
 

Semelhante a Data matters-bournemouth-2015 (20)

The big story of small data.
The big story of small data. The big story of small data.
The big story of small data.
 
David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...
David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...
David Nicholas, Ciber: Audience Analysis and Modelling, the case of CIBER and...
 
Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis
 
Opportunity and risk in social computing environments
Opportunity and risk in social computing environmentsOpportunity and risk in social computing environments
Opportunity and risk in social computing environments
 
We Went Mobile! (Or Did We?)
We Went Mobile! (Or Did We?) We Went Mobile! (Or Did We?)
We Went Mobile! (Or Did We?)
 
Macfadyen usc tlt keynote 2015.pptx
Macfadyen usc tlt keynote 2015.pptxMacfadyen usc tlt keynote 2015.pptx
Macfadyen usc tlt keynote 2015.pptx
 
2011.10.10 Multi-Disciplinary Research Themes and Training
2011.10.10 Multi-Disciplinary Research Themes and Training2011.10.10 Multi-Disciplinary Research Themes and Training
2011.10.10 Multi-Disciplinary Research Themes and Training
 
Handling social science data: Challenges and responses
Handling social science data: Challenges and responsesHandling social science data: Challenges and responses
Handling social science data: Challenges and responses
 
Technology and Student Affairs
Technology and Student AffairsTechnology and Student Affairs
Technology and Student Affairs
 
Eurocall2014 SpeakApps Presentation - SpeakApps and Learning Analytics
Eurocall2014 SpeakApps Presentation - SpeakApps and Learning AnalyticsEurocall2014 SpeakApps Presentation - SpeakApps and Learning Analytics
Eurocall2014 SpeakApps Presentation - SpeakApps and Learning Analytics
 
Tutorial Data Management and workflows
Tutorial Data Management and workflowsTutorial Data Management and workflows
Tutorial Data Management and workflows
 
Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...
 
CISER & the Data Reference Interview
CISER & the Data Reference InterviewCISER & the Data Reference Interview
CISER & the Data Reference Interview
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
 
Technology and learning centers: Best and innovative practices
Technology and learning centers: Best and innovative practicesTechnology and learning centers: Best and innovative practices
Technology and learning centers: Best and innovative practices
 
Open Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practicesOpen Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practices
 
Strategies-Developing-Deploying-FOSS
Strategies-Developing-Deploying-FOSSStrategies-Developing-Deploying-FOSS
Strategies-Developing-Deploying-FOSS
 
Qs1 group a
Qs1 group a Qs1 group a
Qs1 group a
 
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
 
What can be done with Open Data?
What can be done with Open Data?What can be done with Open Data?
What can be done with Open Data?
 

Mais de Alan Dix

CDT Away Day Talk: Qualitative–Quantitative reasoning and lightweight numbers
CDT Away Day Talk: Qualitative–Quantitative reasoning and lightweight numbersCDT Away Day Talk: Qualitative–Quantitative reasoning and lightweight numbers
CDT Away Day Talk: Qualitative–Quantitative reasoning and lightweight numbers
Alan Dix
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Alan Dix
 
The future of UX design support tools - talk Paris March 2024
The future of UX design support tools - talk Paris March 2024The future of UX design support tools - talk Paris March 2024
The future of UX design support tools - talk Paris March 2024
Alan Dix
 
Inclusivity and AI: opportunity or threat
Inclusivity and AI: opportunity or threatInclusivity and AI: opportunity or threat
Inclusivity and AI: opportunity or threat
Alan Dix
 
ChatGPT, Culture and Creativity simulacrum and alterity
ChatGPT, Culture and Creativity simulacrum and alterityChatGPT, Culture and Creativity simulacrum and alterity
ChatGPT, Culture and Creativity simulacrum and alterity
Alan Dix
 
Beyond the Wireframe: tools to design, analyse and prototype physical devices
Beyond the Wireframe: tools to design, analyse and prototype physical devicesBeyond the Wireframe: tools to design, analyse and prototype physical devices
Beyond the Wireframe: tools to design, analyse and prototype physical devices
Alan Dix
 
Truth in an Age of Information
Truth in an Age of InformationTruth in an Age of Information
Truth in an Age of Information
Alan Dix
 
Follow your nose: history frames the future
Follow your nose: history frames the futureFollow your nose: history frames the future
Follow your nose: history frames the future
Alan Dix
 

Mais de Alan Dix (20)

CDT Away Day Talk: Qualitative–Quantitative reasoning and lightweight numbers
CDT Away Day Talk: Qualitative–Quantitative reasoning and lightweight numbersCDT Away Day Talk: Qualitative–Quantitative reasoning and lightweight numbers
CDT Away Day Talk: Qualitative–Quantitative reasoning and lightweight numbers
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Human-Centred Artificial Intelligence – Malta 2024
Human-Centred Artificial Intelligence – Malta 2024Human-Centred Artificial Intelligence – Malta 2024
Human-Centred Artificial Intelligence – Malta 2024
 
The future of UX design support tools - talk Paris March 2024
The future of UX design support tools - talk Paris March 2024The future of UX design support tools - talk Paris March 2024
The future of UX design support tools - talk Paris March 2024
 
Qualitative–Quantitative reasoning and lightweight numbers
Qualitative–Quantitative reasoning and lightweight numbersQualitative–Quantitative reasoning and lightweight numbers
Qualitative–Quantitative reasoning and lightweight numbers
 
Invited talk at Diversifying Knowledge Production in HCI
Invited talk at Diversifying Knowledge Production in HCIInvited talk at Diversifying Knowledge Production in HCI
Invited talk at Diversifying Knowledge Production in HCI
 
Exceptional Experiences for Everyone
Exceptional Experiences for EveryoneExceptional Experiences for Everyone
Exceptional Experiences for Everyone
 
Inclusivity and AI: opportunity or threat
Inclusivity and AI: opportunity or threatInclusivity and AI: opportunity or threat
Inclusivity and AI: opportunity or threat
 
Hidden Figures architectural challenges to expose parameters lost in code
Hidden Figures architectural challenges to expose parameters lost in codeHidden Figures architectural challenges to expose parameters lost in code
Hidden Figures architectural challenges to expose parameters lost in code
 
ChatGPT, Culture and Creativity simulacrum and alterity
ChatGPT, Culture and Creativity simulacrum and alterityChatGPT, Culture and Creativity simulacrum and alterity
ChatGPT, Culture and Creativity simulacrum and alterity
 
Why pandemics and climate change are hard to understand and make decision mak...
Why pandemics and climate change are hard to understand and make decision mak...Why pandemics and climate change are hard to understand and make decision mak...
Why pandemics and climate change are hard to understand and make decision mak...
 
Beyond the Wireframe: tools to design, analyse and prototype physical devices
Beyond the Wireframe: tools to design, analyse and prototype physical devicesBeyond the Wireframe: tools to design, analyse and prototype physical devices
Beyond the Wireframe: tools to design, analyse and prototype physical devices
 
Forever Cyborgs – a long view on physical-digital interaction
Forever Cyborgs – a long view on physical-digital interactionForever Cyborgs – a long view on physical-digital interaction
Forever Cyborgs – a long view on physical-digital interaction
 
Truth in an Age of Information
Truth in an Age of InformationTruth in an Age of Information
Truth in an Age of Information
 
Rome Seminar: Designing User Interactions with AI
Rome Seminar: Designing User Interactions with AIRome Seminar: Designing User Interactions with AI
Rome Seminar: Designing User Interactions with AI
 
Tools and technology to support rich community heritage
Tools and technology to support rich community heritageTools and technology to support rich community heritage
Tools and technology to support rich community heritage
 
Maps with Meaning
Maps with MeaningMaps with Meaning
Maps with Meaning
 
Democratising Digitisation Tools to Support Small Community Archives
Democratising Digitisation Tools to Support Small Community ArchivesDemocratising Digitisation Tools to Support Small Community Archives
Democratising Digitisation Tools to Support Small Community Archives
 
Follow your nose: history frames the future
Follow your nose: history frames the futureFollow your nose: history frames the future
Follow your nose: history frames the future
 

Último

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Último (20)

Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 

Data matters-bournemouth-2015

  • 1. Data Matters Alan Dix Talis & University of Birmingham http://alandix.com/ref2014/
  • 3. today I am not talking about … • intelligent internet interfaces • visualisation and sampling • situated displays, eCampus, small device – large display interactions • fun and games, virtual crackers, artistic performance, slow time • creativity and Bad Ideas • modelling dreams and regret and the emergence of self …
  • 4. … or even lots of lights http:/www.hcibook.com/alan/projects/firefly/
  • 5. I am talking about ... REF data analysis long tail of small data
  • 6. REF
  • 7. REF 2014 Research Excellence Framework approx 5 yearly research assessment in the UK not just about the UK … lots of countries thinking to do similar ... and looking to REF as example
  • 8. REF elements three elements: outputs (mainly papers) impact environment focus of this work
  • 9. REF panels 4 main panels, 36 sub-panels, ~200K outputs sub-panel 11: computer science and informatics I was on this panel but NO confidential data here everything public domain
  • 10. REF profiles every output graded: 4* / 3* / 2* / 1* individual grades confidential and destroyed each ‘Unit of Assessment’ (dept) given a profile http://results.ref.ac.uk/Results/ByUoa/11/Outputs
  • 11. sub-area profiles N.B. computing only each output given ACM code originally to enable allocation to panelists … but, also used to create sub-area profiles …
  • 12. sub-area profiles From Morris Sloman’s slides & panel report theoretical areas 30-40% 4* applied/human areas 10-20% 4*
  • 13. data not information sub-panel report warning: "These data should be treated with circumspection … however already affecting institutional policy hiring, internal investment … and may influence research council policy
  • 14. possible reasons for variation … 1. best applied work is weak – including HCI :-/ 2. long tail – weak researchers choose applied areas 3. latent bias – despite panel’s efforts to be fair can bibliometrics disentangle these?
  • 15. metrics and assessment citation metrics known to be good post-hoc correlates of sophisticated measures … but not for individuals and small cohorts and danger of gaming and policy distortion suitable for verifying large-scale patterns (and HEFCE using them for this)
  • 16. data used for analysis all in public domain (virtually) complete list of outputs: – excluding a few confidential ones – for each: name, doi, ACM topic area, Scopus citations Google scholar citations for each – gathered after REF (not used in assessment) UoA and sub-area profiles
  • 17. metrics used Scopus (late 2013 census ) – with/without 2012/13 as few citations ‘Normalised Scopus’ – using ‘contextual data’, corrects for different citation patterns between areas – places output in top 1%, 5%, 10% of its area worldwide Google Scholar (late 2014 census) – with/without 2012/13; zero treated as zero/missing seven variants – all give similar results
  • 18. results … massive differences % citations in top quartile % REF 4* ratio winners losers
  • 19. ‘scatter’ graph % outputs in top quartile for citations % outputs awarded REF 4*
  • 21. Another way of looking at it … world ranking within own field
  • 23. for example, HCI research (web similar) … on average … • HCI/CSCW paper needs to be in top 0.5% worldwide to get 4* • logic/algorithms paper just needs to be in top 5% 10 fold difference
  • 24. and just as you thought it was all over … … institutional effects look at +/- 25% REF compared with citations N.B. use high-end weighted measure as money is focused (4:1:0:0) of 35 losers, 25 are post-1992 universities of 17 winners, 16 are pre-1992 universities
  • 25. an example … XXXXXXX – a new university YYYYYYYY – an old university World Rankings REF
  • 26. and Gender? Female authors in main panel B were significantly less likely to achieve a 4* output than male authors with the same metrics ratings. When considered in the UOA models, women were significantly less likely to have 4* outputs than men whilst controlling for metric scores in the following UOAs: Psychology, Psychiatry and Neuroscience; Computer Science and Informatics; Architecture, Built Environment and Planning; Economics and Econometrics. The Metric Tide (HEFCE, 2015)
  • 27. implicit bias? HEFCE analysis: male staff in computing is 1/3 more likely to get a 4* than female areas and types institutions disadvantaged by REF often those with more women … implications for future recruitment?
  • 28. future for research assessment? • pure metrics? • metrics as part (e.g. older outputs) • metrics as under-girding (burden of proof) • human process – metrics for in-process feedback
  • 29.
  • 30. .. long tail of small data
  • 31. Big Data everyone is talking about it Twitter, Google, Facebook, NSA, universities, … and funding Big Data does it with MapReduce Semantic Data does it with RDF
  • 32. the long tail size of data set a few very large data sets e.g. Twitter, streams, Open Govt., OS, geonames, dbpedia the small data of ordinary life: from local bus timetables to squash club league tables
  • 33. stories of small data … Walking Wales Learning analytics Open Data Islands and Communities Musicology
  • 34.
  • 35. Alan Walks Wales 1058 miles (1700km) 3 million footfalls 3 ½ months April-July 2013 focus on IT at the margins one thousand miles of poetry, technology and community
  • 36. vision personal encircling, encompassing, pilgrimage, homecoming, practical IT for the walker & IT for local communities philosophical reflections on walking and space, locality and identity research personal agenda and living lab lots of data
  • 37. data location GPX ... batteries ... sporadic signals .... bio-sensing ECG (heart), EDA (skin) and accelerometers audio and images in the moment text after the event implicit explicit The largest ECG trace in the public domain
  • 38. challenges (1) location GPX – merging and mending bio-sensing ECG & EDA – special formats & volume audio and images volume, transcription and annotation text semantic markup, synchronising sources
  • 39. challenges (2) documentation methodology of creation, data formats for other people to use! meta-data for machines to use PR telling the world about it! academic culture we do not value data!
  • 40. an offer multiple synchronisable data streams largest public domain ECG trace post-hoc analysis simulate real use please use it!
  • 41.
  • 43. time frames for learning analytics days and hours email, during lectures and labs, stduent meetings, gaps week preparing for teaching, exercises months/mid-semester reporting points, staff meetings, cohort/student progress end of semester/term/year exams, exam boards, course revew, start of semester/term/year preparing for new courses or re-runs, rollover! years new courses, professional development, appraisal, promotion
  • 44.
  • 45. Open Data everyone is doing it Governments, Cities, local gov. In C21 Data is Power
  • 46. why not an island?
  • 47. island data flows Community groups and individuals rest of the world other communities 1 2 3 4
  • 48. island data flows from community to world Community groups and individuals rest of the world 1 • visibility and control • identity and empowerment • level of detail • local knowledge
  • 49. island data flows from world to community Community groups and individuals rest of the world 2 • making the most of open data • local decision making • lobbying and negotiation
  • 50. island data flows within the community Community groups and individuals 3 • gossip is not enough! • sparse, dispersed population • social cohesion and economic benefits
  • 51. island data flows between communities Community groups and individuals other communities 4 • sharing best practice • brand presence • interlinked data
  • 52. benefits to … the community empowerment and control availability of information communication within and between communities the world improved quality of data level of detail of data local knowledge and understanding
  • 53.
  • 54. In Concert Concert ephemera 1750–1800 Calendar of London Concerts 1815–1895 Concert Life in London 1894–1944 Concert Programme Exchange (BL) External sources MusicBrainz MBz id as connect into Linked Data, BBC, etc. Authoritative sources (future) e.g. British Library BNB, Concert Programmes metadata
  • 55.
  • 56.
  • 57. concert database classic digital humanities? original sources selected sources systematic sample transcription & extraction (medium expertise) interpretation (high expertise) digitised sources authoritative data analysis & use (high expertise) academic publication large digital archive (e.g. BBC) possibly create linkage
  • 58. Barriers to progress effort and expertise authority and quality digital acontextuality openness
  • 59. Openness and Reward Career development Leverhulme & REF Building the discipline?
  • 60. Re-envisioning the Digital Archive: Curation and Use
  • 61. big bang to incremental digitised sources authoritative data academic publication ...
  • 62. big bang to incremental problem focused augmentation transform cost-benefit digitial archive academic publications ... partial enhancement & interpretation
  • 64. => reflection and requirements digital symbiosis suggestion and confirmation provenance and authority spreadsheet as user interface semantics through interaction
  • 65.
  • 66. themes and take-aways ... data in context heterogeneity and linking value and values ethics and empowerment …. and please use my data 