LEARN Final Conference: Tutorial Group | Costing RDM
1. How to Cost Data Curation
Paul Stokes, Senior Co-Design Manager
2. Definitions
» Data Curation
the act of managing digital items
held within an archive over the long
term. It is an active process,
implying action on the part of the
curators so that items remain
secure, discoverable and accessible.
‘Digital curation involves
maintaining, preserving and adding
value’ to archived items ‘throughout
their lifecycle.’
» Research Data Management
the storage, access and preservation
of data produced from a given
investigation.
Data management practices cover
the entire lifecycle of the data, from
planning the investigation to
conducting it, and from backing
up data as it is created and used
to long term preservation
of data deliverables after the
research investigation has
concluded.
6. Who Are You?
» You thought it wasn’t going to happen …..
› Name
› Affiliation
› Background/interest
»Where are you up to with
Costs and costing?
That Bit….
7. »Jisc Research Data Shared Service Pilot Core Team
› Rachel Bruce
› John Kaye
› Catherine Grout
› Paul Stokes
› Daniela Duca
› Dom Fripp
› Alan Mackenzie
› Nick Lonergan
Me
Senior co-design manager – research data
Business case, costing, 4C
8. Who We Are - Jisc
Jisc is the UK higher, further education
and skills sectors’ not-for-profit organisation
for digital services and solutions
Operate and develop
shared digital
infrastructure and
services
Provide trusted advice and
practical assistance for
universities, colleges and
learning providers
We…
Negotiate sector-wide deals
and conditions with IT vendors
and commercial publishers
9. Who We Are - Jisc
Mission
To enable people in higher
education, further education and
skills to perform at the forefront of
international practice by exploiting
fully the possibilities of modern
digital empowerment, content
and connectivity
Vision
To make the UK the most
digitally advanced
education and research
nation in the world
10. Who We Are – Jisc – Futures
“We take risks so you don’t
have to…”
13. » We’ve been around in the preservation/curation space for a while…
› CEDARs work
› Emulation via the CAMiLEON / Domesday project
› DPC
› DCC
› DP Handbook
› Journal archiving and the UK LOCKSS pilot
› Establishment of the LOCKSS Alliance
› UK Web Archiving Consortium
› Higher education records retention schedule
› Involvement in Blue Ribbon Task Force, 4C project, and so on
Preservation/Curation
14. »Every HEI has a preservation
need
»We need global community
effort
› Policies
› Planning
› Costing
»Improving existing tools is
better than starting from
scratch
What we’ve learned
15. BCCRDM Project
»Business Case and Costing for Research Data
Management
› A need expressed by the community – R@R
› https://www.jisc.ac.uk/rd/projects/business-case-and-costing-for-research-data-management
› Outputs on http://researchdata.network
16. BCCRDM Conclusions
»Everything has a price
› How do you know what
it is?
»Someone needs to pay
› How do you know how
much and who?
»What are the Benefits to
offset the costs?
17. Pain
»Tracking costs
› How?
› What information needs to be collected?
»Cost recovery mechanisms
› Is it possible?
› Is there an accepted methodology?
› How do you cope equitably with disciplinary
differences?
18. More Pain
»Post project funding
› How can something (preservation) be paid for after a
project has finished?
»Who benefits?
› The direct beneficiaries of reused data may not be those
who paid for it
»How do you evaluate indirect benefits?
24. Costing
Activity
Based
Costing
1. Determine ALL activities
associated with a
product/service (direct
and indirect)
2. Establish costs for those
activities
3. Add it all up and divide by
the number of units to
get the unit cost
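The three steps above can be sketched in a few lines of Python. This is a minimal illustration only: the `Activity` class, the `abc_unit_cost` function, the activity names and every figure are hypothetical and do not come from the slides.

```python
from dataclasses import dataclass


@dataclass
class Activity:
    name: str
    cost: float   # total cost of the activity for the period (hypothetical)
    direct: bool  # True for direct activities, False for indirect ones


def abc_unit_cost(activities: list[Activity], units: float) -> float:
    """Add up the cost of ALL activities (direct and indirect), divide by units."""
    if units <= 0:
        raise ValueError("units must be positive")
    return sum(a.cost for a in activities) / units


# Hypothetical figures for one year of a curation service.
activities = [
    Activity("ingest", 12_000.0, direct=True),
    Activity("storage", 20_000.0, direct=True),
    Activity("helpdesk", 5_000.0, direct=False),
    Activity("management", 3_000.0, direct=False),
]

unit_cost = abc_unit_cost(activities, units=400)  # e.g. 400 TB curated
print(f"Unit cost: {unit_cost:.2f} per TB")  # Unit cost: 100.00 per TB
```

The hard part in practice is step 1, finding every activity; the arithmetic itself is trivial once the activity list is complete.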
25. Costing
Traditional
Costing
1. Determine the direct
costs (e.g. Salary)
2. Add the company
“standard” overhead (e.g.
salary x %)
3. Add it all up and divide by
the number of units to
get the unit cost
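For comparison, a minimal sketch of the traditional approach, assuming the "standard" overhead is a flat percentage applied to direct costs. The function name `traditional_unit_cost` and all figures are hypothetical.

```python
def traditional_unit_cost(direct_costs: float, overhead_rate: float, units: float) -> float:
    """Direct costs plus a flat percentage overhead, divided by the number of units."""
    if units <= 0:
        raise ValueError("units must be positive")
    total = direct_costs * (1 + overhead_rate)  # e.g. salary + salary x %
    return total / units


# Hypothetical figures: 32,000 in direct costs, 25% standard overhead, 400 units.
unit_cost = traditional_unit_cost(direct_costs=32_000.0, overhead_rate=0.25, units=400)
print(f"Unit cost: {unit_cost:.2f}")  # Unit cost: 100.00
```

The overhead rate here is a single assumed number; nothing in the calculation checks whether it reflects the indirect activities the service actually consumes.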
30. Costing
»Identify the people (probably from Finance and/or IT)
whose help you will need to get the information
»Identify the systems / data sources that can provide the
required information.
»Start gathering the costing information and crunching
the numbers
32. Costing
»Variable costs
› Vary in proportion to volumes of data
› Possibly not directly in proportion
› E.g. cloud storage
»Fixed costs
› NOT FIXED!
› Step changes in relation to volumes of
data
› E.g. staff
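The difference between the two kinds of cost can be shown with a small sketch. It assumes a strictly linear variable cost (the slide notes that real costs may not be directly proportional) and a "fixed" cost that jumps each time a capacity threshold is crossed. The function names, the 500 TB threshold and the prices are hypothetical.

```python
import math


def variable_cost(volume_tb: float, price_per_tb: float) -> float:
    """Cost that grows with volume (assumed linear here), e.g. cloud storage."""
    return volume_tb * price_per_tb


def step_fixed_cost(volume_tb: float, capacity_per_step_tb: float, cost_per_step: float) -> float:
    """'Fixed' cost that rises in steps, e.g. one more staff member per capacity block."""
    # Assumption: at least one step (one member of staff) is always needed.
    steps = max(1, math.ceil(volume_tb / capacity_per_step_tb))
    return steps * cost_per_step


# Hypothetical: 50 per TB of storage; one member of staff (40,000) per 500 TB.
for volume in (100, 500, 501):
    total = variable_cost(volume, 50.0) + step_fixed_cost(volume, 500, 40_000.0)
    print(volume, total)
```

Going from 500 TB to 501 TB adds only 50 in storage but 40,000 in staff, which is why a unit cost calculated at one volume can be badly wrong at another.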
40. Steps
Step 0. Decide why you’re costing
Step 1. Define what you’re costing
Step 2. Define the methodology
Step 3. Plug in the numbers
41. Modelling
»What of the future needs?
»Models
»Modelling resources
› 4C / Curation Exchange
http://www.curationexchange.org/understand-your-costs/19-summary-of-cost-models
»Scaling
› E.g. Number of users, volume of data, number of
copies, etc
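As a rough illustration of scaling, the sketch below projects yearly storage cost from two of the drivers listed above: volume of data and number of copies. It is not one of the 4C cost models. The function name, the constant growth rate, the flat price per TB and all figures are assumptions made for the example.

```python
def projected_storage_costs(
    initial_tb: float,
    annual_growth: float,
    copies: int,
    price_per_tb_year: float,
    years: int,
) -> list[float]:
    """Yearly storage cost, assuming constant growth and a flat price per TB per copy."""
    costs = []
    volume = initial_tb
    for _ in range(years):
        costs.append(volume * copies * price_per_tb_year)
        volume *= 1 + annual_growth  # data volume grows before the next year
    return costs


# Hypothetical: 100 TB growing 50% a year, 2 copies, 10 per TB per year.
print(projected_storage_costs(100.0, 0.5, copies=2, price_per_tb_year=10.0, years=3))
# [2000.0, 3000.0, 4500.0]
```

Changing a single driver, such as keeping three copies instead of two, shows quickly how sensitive the projection is to each assumption.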
43. Process
» Register/Login
» Create an Organisation
» Create a cost set (or sets)
–Map Asset types
–Map to activities
–Map to purchases and staff
» Compare
» (Publish)
51. Links
» 4C Project: http://4cproject.eu/
» Curation Costs Exchange: http://www.curationexchange.org/
» Business Case and Costing for RDM: https://www.jisc.ac.uk/rd/projects/business-case-and-costing-for-research-data-management
» Business Case and Costing for RDM outputs: https://research-data-network.readme.io/v2.01/docs/overview
52. »When: 27th to 28th June 2017
»Where: York
»Cost: Free!
Research Data Network – June 2017
Registration now open
http://jisc.ly/UD2if7
More information on the RDN website
http://researchdata.network
Image: Sebastiaan ter Burg
#JiscRDM
55. Scaffold - David Gray - CC BY-NC - https://flic.kr/p/5hmJqA
Shout - Sebastiaan ter Burg - CC BY - https://flic.kr/p/dobs8W
Nails - Mark Hunter - CC BY - https://flic.kr/p/9HYdgo
Question Everything (Nullius in verba) Take nobody's word for it - Duncan Hull - CC BY - https://flic.kr/p/iVLZt
Papa Bruno take out 191109 - Ambernectar 13 - CC BY-ND - https://flic.kr/p/7gMjnV
Historic Preservation Sign - Alan Levine – CC BY – https://flic.kr/p/bRNDtF
Rinse and Repeat - Benson Kua – CC BY-SA - https://flic.kr/p/9pPjDS
oh no!- torbakhopper - CC BY - https://flic.kr/p/bBJwXg
All other images CC0
Acknowledgements
Editor's Notes
Before we get down to the nitty-gritty…
Definitions
Sufficient overlap to be almost interchangeable in the context of costing
More of one being a subset of the other – use the terms interchangeably for the purposes of costing
Thought it wasn’t going to happen …..
Which brings me to …
ME
RDSS – I’ll come back to that
Quick canter through Jisc’s raison d’être
Unofficial motto – “We take risks so you don’t have to”
Currently working on a major project – RDSS
Driven by mandates relating to data management and preservation
e.g. 10 years from last access
Some links relating to the above
http://www.ukoln.ac.uk/metadata/cedars/guidance/metadata.html
http://www.dpconline.org/docman/miscellaneous/events/368-digital-curation-jones/file
http://www.dcc.ac.uk/resources/external/camileon-creative-archiving-michigan-and-leeds-emulating-old-new
http://webarchive.nationalarchives.gov.uk/20140702233839/http://www.jisc.ac.uk/whatwedo/programmes/preservation.aspx
For community use – tools and resources
For Jisc use – making the case for the RDSS
Some of the things we came up against
Submissions from the floor
Submissions from the floor
The number one reason it’s difficult
Two roommates in an apartment will typically split the costs of rent, utilities and groceries, and they have a couple of options for doing so. They could simply total the cost of all of the bills and divide it exactly in two. This would be similar to traditional costing.
The roommates also have the option of determining who uses specific utilities and paying only for what each one uses. They can then create an itemized bill for each roommate. For example, if one roommate doesn’t use the internet and the other doesn’t use cable, they won’t have to pay those parts of the bill. This method is similar to activity-based costing.
http://quickbooks.intuit.com/r/pricing-strategy/activity-based-vs-traditional-costing/
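The roommate analogy can be worked through with numbers. The sketch below is illustrative only: the names, the bill amounts and the two helper functions are hypothetical.

```python
def split_evenly(bills: dict[str, float], people: list[str]) -> dict[str, float]:
    """Traditional-style split: total everything and divide equally."""
    share = sum(bills.values()) / len(people)
    return {person: share for person in people}


def split_by_usage(bills: dict[str, float], usage: dict[str, list[str]]) -> dict[str, float]:
    """Activity-based-style split: each bill is shared only by those who use it."""
    owed: dict[str, float] = {}
    for item, cost in bills.items():
        users = usage[item]
        for person in users:
            owed[person] = owed.get(person, 0.0) + cost / len(users)
    return owed


# Hypothetical monthly bills: Alex does not use the internet, Sam does not use cable.
bills = {"rent": 1000.0, "internet": 40.0, "cable": 60.0}
usage = {"rent": ["Alex", "Sam"], "internet": ["Sam"], "cable": ["Alex"]}

print(split_evenly(bills, ["Alex", "Sam"]))
print(split_by_usage(bills, usage))
```

With these figures the even split charges each roommate 550, while the usage-based split charges Alex 560 and Sam 540. The total is the same; only the allocation changes.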
See http://www.curationexchange.org/understand-your-costs/14-cost-concept-model for more information
OAIS is a good starting point
Often the biggest problem of them all
Submissions from the floor (again)
Submissions from the floor (again)
Current themes include:
Updates on RDSS Pilot
Research Data Discovery
Business case and costing
European Open Science Cloud
RDM Policy
Research Data reuse
DMPs
RDM Maturity models
Sensitive data
Text mining
FAIR
Posters
Birds of a feather sessions