SlideShare uma empresa Scribd logo
1 de 18
CSIRO investing in the future of data
INFORMATION MANAGEMENT & TECHNOLOGY
John Morrissey | eResearch Planner
22 July 2016
CSIRO
CSIRO investing in the future of data | John Morrissey2 |
~5300
talented staff
$1billion+
budget
Working
with over
2800+
industry
partners
55
sites across
Australia
Top 1%
of global
research
agencies
Each year
6 CSIRO
technologies
contribute
$5 billion to
the economy
The ongoing problem….
Science data assets:
• Undescribed …
• Inaccessible …
• Undiscoverable, unusable, uncitable …
• On a really wide range of infrastructure …
• In a really wide range of preservation-unfriendly formats …
• Unconnected …
CSIRO investing in the future of data | John Morrissey3 |
Some elements to connect
4 |
Systems
Infrastructure
Processes
(e.g. Quality Control,
Approval)
Legal
Licensing Intellectual
Property
Culture
Training
Fulfilling
needs
… … …
Policy
CSIRO investing in the future of data | John Morrissey
Data Access Portal
Functions
Self serve Deposit
Describe
Create Citation
Restrict
License
Approve
Store
Publish
Discover
Access
Manage
CSIRO investing in the future of data | John Morrissey5 |
Goals for a data repository
• Persistent access
• Version control
• Self service
• Scalable storage
• Minimal use of expensive spinning disk storage
• Cheaper tape storage added as required – fast throughput when data is
optimally “encapsulated” on tape
• Integration with Bowen Research Cloud storage – used by projects for
working storage
CSIRO investing in the future of data | John Morrissey6 |
Decision workflows for data and software
CSIRO investing in the future of data | John Morrissey7 |
The Data Management Ecosystem …
CSIRO investing in the future of data | John Morrissey8 |
Collaboration:
industry,
universities, other
organisations
Vocab
Service
Like an onion …
CSIRO investing in the future of data | John Morrissey9 |
Data
management
ecosystem
Collaboration:
Industry,
universities, other
organisations
Marine National Facility
CSIRO investing in the future of data | John Morrissey10 |
CSIRO ASKAP Science Data Archive (CASDA)
CSIRO investing in the future of data | John Morrissey11 |
CASDA: Data rate at full operation: 16TB per day, 5PB per year
CSIRO investing in the future of data | John Morrissey12 |
Who’s interested?
CSIRO investing in the future of data | John Morrissey13 |
What’s next?
Policy
• Supported by infrastructure services that make compliance easier
• Data management planning, with tools to support this and return value to
end user
• Management support within research projects required to allocate resources
to data management
Development
• Storage
– Better integration with existing network storage for simpler ingest
– More access options
• Services, vocabularies, semantic web
• Provenance
• Object / file level metadata
CSIRO investing in the future of data | John Morrissey14 |
What’s even more exciting?
• Researchers wanting to add “plug-in” functions to the DAP
• Researchers writing whole-of-program data management
roadmaps for their business units, heavily referencing DAP and
enterprise-developed tools.
• Continuation of the “working with research groups” model to
implement:
• Semantic enablement and vocabularies
• Provenance
• Reuse of DAP metadata in other tools
CSIRO investing in the future of data | John Morrissey15 |
interested [ view, download ]
similar
Data Collection A
likely interested
Similar Data Collections
Data User
In current implementation, similar
datasets are determined based on :
• Title
• Description
• Keyword
• Fields of research
• Data Contributor
• Activity
• Related Collection (specified by data
depositors)
A Recommender System for Research Data
Data Sources
• DAP Web Service
• Offline files (ANZSRC, Activity)
• Server logs (download, views)*
*will be included in future
New development work by Dr Anusuriya Devaraju, Postdoctoral Fellow ,CSIRO Mineral
Resources
You may
also like :
• ..
• ..
• ..
An Overview of the DAP Recommender System
SQL
database
Research Data
Recommender Model
RecommendationService
DAP Metadata
Store
Web Service
Other Data Sources (e.g.,
server logs, auxiliary data)
Data View
Data Download
Data Deposit (Post-Process)
Information Management &
Technology
John Morrissey
eResearch Planner
t +61 2 6124 1411
e john.morrissey@csiro.au
w www.csiro.au
INFORMATION MANAGEMENT & TECHNOLOGY
Thank you

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

RDA UK
RDA UKRDA UK
RDA UK
 
Implementing the Research Data Management Policy: University of Edinburgh Roa...
Implementing the Research Data Management Policy: University of Edinburgh Roa...Implementing the Research Data Management Policy: University of Edinburgh Roa...
Implementing the Research Data Management Policy: University of Edinburgh Roa...
 
Developing Research Data Management Policy and Services
Developing Research Data Management Policy and ServicesDeveloping Research Data Management Policy and Services
Developing Research Data Management Policy and Services
 
Ucla july 2018 natasha simons
Ucla july 2018 natasha simonsUcla july 2018 natasha simons
Ucla july 2018 natasha simons
 
Lightning Talks - Intro
Lightning Talks - IntroLightning Talks - Intro
Lightning Talks - Intro
 
Frances Burton on sensitive data
Frances Burton on sensitive dataFrances Burton on sensitive data
Frances Burton on sensitive data
 
Recognising data sharing
Recognising data sharingRecognising data sharing
Recognising data sharing
 
Open data and research data management at the University of Edinburgh: polici...
Open data and research data management at the University of Edinburgh: polici...Open data and research data management at the University of Edinburgh: polici...
Open data and research data management at the University of Edinburgh: polici...
 
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
 
RDN Lightning talk - Open Research Leeds (@OpenResLeeds): networks, metrics a...
RDN Lightning talk - Open Research Leeds (@OpenResLeeds): networks, metrics a...RDN Lightning talk - Open Research Leeds (@OpenResLeeds): networks, metrics a...
RDN Lightning talk - Open Research Leeds (@OpenResLeeds): networks, metrics a...
 
A discovery service for UK research data
A discovery service for UK research dataA discovery service for UK research data
A discovery service for UK research data
 
European Open Science Cloud
European Open Science CloudEuropean Open Science Cloud
European Open Science Cloud
 
MANTRA for Change
MANTRA for ChangeMANTRA for Change
MANTRA for Change
 
Presenting RISE
Presenting RISEPresenting RISE
Presenting RISE
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 
Supporting Good Practice in Research Data Management: Edinburgh’s Experience
Supporting Good Practice in Research Data Management: Edinburgh’s ExperienceSupporting Good Practice in Research Data Management: Edinburgh’s Experience
Supporting Good Practice in Research Data Management: Edinburgh’s Experience
 
RDAP14: David Van Riper of Terra Populus
RDAP14: David Van Riper of Terra Populus RDAP14: David Van Riper of Terra Populus
RDAP14: David Van Riper of Terra Populus
 
DMPOnline by Sarah Jones
DMPOnline by Sarah JonesDMPOnline by Sarah Jones
DMPOnline by Sarah Jones
 
HESA data, describing research activity and #REF2021
HESA data, describing research activity and #REF2021HESA data, describing research activity and #REF2021
HESA data, describing research activity and #REF2021
 
EOSC pilot STFC
EOSC pilot STFCEOSC pilot STFC
EOSC pilot STFC
 

Destaque

Destaque (16)

Ready and Prepared for Research data
Ready and Prepared for Research dataReady and Prepared for Research data
Ready and Prepared for Research data
 
What are sensitive data and why might they be trickier to publish?
What are sensitive data and why might they be trickier to publish?What are sensitive data and why might they be trickier to publish?
What are sensitive data and why might they be trickier to publish?
 
Managing sensitive data in your repository
Managing sensitive data in your repositoryManaging sensitive data in your repository
Managing sensitive data in your repository
 
Research Integrity Advisors Data Management Workshop: A National Approach to ...
Research Integrity Advisors Data Management Workshop: A National Approach to ...Research Integrity Advisors Data Management Workshop: A National Approach to ...
Research Integrity Advisors Data Management Workshop: A National Approach to ...
 
DMP Tool at Curtin University
DMP Tool at Curtin UniversityDMP Tool at Curtin University
DMP Tool at Curtin University
 
Orcid researchers webinar_aaf-ands-caul_20160518
Orcid researchers webinar_aaf-ands-caul_20160518Orcid researchers webinar_aaf-ands-caul_20160518
Orcid researchers webinar_aaf-ands-caul_20160518
 
Brisbane Health-y Data: Sharing/Accessing Data Through "My Health Record"
Brisbane Health-y Data: Sharing/Accessing Data Through "My Health Record"Brisbane Health-y Data: Sharing/Accessing Data Through "My Health Record"
Brisbane Health-y Data: Sharing/Accessing Data Through "My Health Record"
 
DMP Tool at UNSW
DMP Tool at UNSWDMP Tool at UNSW
DMP Tool at UNSW
 
Observations on a whole lot of Things learned through the 23 (research data) ...
Observations on a whole lot of Things learned through the 23 (research data) ...Observations on a whole lot of Things learned through the 23 (research data) ...
Observations on a whole lot of Things learned through the 23 (research data) ...
 
Brisbane Health-y Data: Legislation, Ethics and Governance
Brisbane Health-y Data: Legislation, Ethics and GovernanceBrisbane Health-y Data: Legislation, Ethics and Governance
Brisbane Health-y Data: Legislation, Ethics and Governance
 
20160623 alia sydney
20160623 alia sydney20160623 alia sydney
20160623 alia sydney
 
Australian ORCID Consortium Update at INORMS
Australian ORCID Consortium Update at INORMSAustralian ORCID Consortium Update at INORMS
Australian ORCID Consortium Update at INORMS
 
Your data future: A perspective - Dr Douglas Robertson
Your data future: A perspective - Dr Douglas RobertsonYour data future: A perspective - Dr Douglas Robertson
Your data future: A perspective - Dr Douglas Robertson
 
Data publishing at the UQ Library
Data publishing at the UQ LibraryData publishing at the UQ Library
Data publishing at the UQ Library
 
Research Vocabularies Australia
Research Vocabularies Australia Research Vocabularies Australia
Research Vocabularies Australia
 
Controlled vocabularies for medical and health research
Controlled vocabularies for medical and health researchControlled vocabularies for medical and health research
Controlled vocabularies for medical and health research
 

Semelhante a CSIRO investing in the future of data - John Morrissey

Semelhante a CSIRO investing in the future of data - John Morrissey (20)

Data Strategy and Services at the British Library: Data, Software and PIDs
Data Strategy and Services at the British Library: Data, Software and PIDsData Strategy and Services at the British Library: Data, Software and PIDs
Data Strategy and Services at the British Library: Data, Software and PIDs
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
RD shared services and research data spring
RD shared services and research data springRD shared services and research data spring
RD shared services and research data spring
 
Guidelines for OSTP Data Access Plans
Guidelines for OSTP Data Access PlansGuidelines for OSTP Data Access Plans
Guidelines for OSTP Data Access Plans
 
From Data Sharing to Data Stewardship
From Data Sharing to Data StewardshipFrom Data Sharing to Data Stewardship
From Data Sharing to Data Stewardship
 
Jisc Research Data Shared Service Open Repositories 2018 Paper
Jisc Research Data Shared Service Open Repositories 2018 PaperJisc Research Data Shared Service Open Repositories 2018 Paper
Jisc Research Data Shared Service Open Repositories 2018 Paper
 
David Reeve - UKAD 2016 forum
David Reeve - UKAD 2016 forumDavid Reeve - UKAD 2016 forum
David Reeve - UKAD 2016 forum
 
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research Data
 
Data Management Planning for Engineers
Data Management Planning for EngineersData Management Planning for Engineers
Data Management Planning for Engineers
 
Research Data Management at the University of Salford
Research Data Management at the University of SalfordResearch Data Management at the University of Salford
Research Data Management at the University of Salford
 
Why Metadata Matters in SharePoint Search and Information Governance Webinar
Why Metadata Matters in SharePoint Search and Information Governance WebinarWhy Metadata Matters in SharePoint Search and Information Governance Webinar
Why Metadata Matters in SharePoint Search and Information Governance Webinar
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
Building blocks for success: criteria for trusted institutional repositories
Building blocks for success: criteria for trusted institutional repositoriesBuilding blocks for success: criteria for trusted institutional repositories
Building blocks for success: criteria for trusted institutional repositories
 
Information Systems
Information SystemsInformation Systems
Information Systems
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with Alation
 
Competency framework: engineers, statisticians, data scientists, librarians, ...
Competency framework: engineers, statisticians, data scientists, librarians, ...Competency framework: engineers, statisticians, data scientists, librarians, ...
Competency framework: engineers, statisticians, data scientists, librarians, ...
 
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012Bg linkedin bigdata_martinschultz_symposium_yale_oct2012
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012
 
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
 
Big Data Management: What's New, What's Different, and What You Need To Know
Big Data Management: What's New, What's Different, and What You Need To KnowBig Data Management: What's New, What's Different, and What You Need To Know
Big Data Management: What's New, What's Different, and What You Need To Know
 

Mais de ARDC

Mais de ARDC (20)

Introduction to ADA
Introduction to ADAIntroduction to ADA
Introduction to ADA
 
Architecture and Standards
Architecture and StandardsArchitecture and Standards
Architecture and Standards
 
Data Sharing and Release Legislation
Data Sharing and Release Legislation   Data Sharing and Release Legislation
Data Sharing and Release Legislation
 
Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)
 
Investigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspectiveInvestigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspective
 
NCRIS and the health domain
NCRIS and the health domainNCRIS and the health domain
NCRIS and the health domain
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research data
 
Clinical trials data sharing
Clinical trials data sharingClinical trials data sharing
Clinical trials data sharing
 
Clinical trials and cohort studies
Clinical trials and cohort studiesClinical trials and cohort studies
Clinical trials and cohort studies
 
Introduction to vision and scope
Introduction to vision and scopeIntroduction to vision and scope
Introduction to vision and scope
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things data
 
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian DuncanARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
 
Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128
 
Research data management and sharing of medical data
Research data management and sharing of medical dataResearch data management and sharing of medical data
Research data management and sharing of medical data
 
Findable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataFindable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) data
 
Applying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and ChallengesApplying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and Challenges
 
How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018
 
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global SprintReady, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
 
How FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of dataHow FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of data
 
Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018
 

Último

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 

Último (20)

Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 

CSIRO investing in the future of data - John Morrissey

  • 1. CSIRO investing in the future of data INFORMATION MANAGEMENT & TECHNOLOGY John Morrissey | eResearch Planner 22 July 2016
  • 2. CSIRO CSIRO investing in the future of data | John Morrissey2 | ~5300 talented staff $1billion+ budget Working with over 2800+ industry partners 55 sites across Australia Top 1% of global research agencies Each year 6 CSIRO technologies contribute $5 billion to the economy
  • 3. The ongoing problem…. Science data assets: • Undescribed … • Inaccessible … • Undiscoverable, unusable, uncitable … • On a really wide range of infrastructure … • In a really wide range of preservation-unfriendly formats … • Unconnected … CSIRO investing in the future of data | John Morrissey3 |
  • 4. Some elements to connect 4 | Systems Infrastructure Processes (e.g. Quality Control, Approval) Legal Licensing Intellectual Property Culture Training Fulfilling needs … … … Policy CSIRO investing in the future of data | John Morrissey
  • 5. Data Access Portal Functions Self serve Deposit Describe Create Citation Restrict License Approve Store Publish Discover Access Manage CSIRO investing in the future of data | John Morrissey5 |
  • 6. Goals for a data repository • Persistent access • Version control • Self service • Scalable storage • Minimal use of expensive spinning disk storage • Cheaper tape storage added as required – fast throughput when data is optimally “encapsulated” on tape • Integration with Bowen Research Cloud storage – used by projects for working storage CSIRO investing in the future of data | John Morrissey6 |
  • 7. Decision workflows for data and software CSIRO investing in the future of data | John Morrissey7 |
  • 8. The Data Management Ecosystem … CSIRO investing in the future of data | John Morrissey8 | Collaboration: industry, universities, other organisations Vocab Service
  • 9. Like an onion … CSIRO investing in the future of data | John Morrissey9 | Data management ecosystem Collaboration: Industry, universities, other organisations
  • 10. Marine National Facility CSIRO investing in the future of data | John Morrissey10 |
  • 11. CSIRO ASKAP Science Data Archive (CASDA) CSIRO investing in the future of data | John Morrissey11 |
  • 12. CASDA: Data rate at full operation: 16TB per day, 5PB per year CSIRO investing in the future of data | John Morrissey12 |
  • 13. Who’s interested? CSIRO investing in the future of data | John Morrissey13 |
  • 14. What’s next? Policy • Supported by infrastructure services that make compliance easier • Data management planning, with tools to support this and return value to end user • Management support within research projects required to allocate resources to data management Development • Storage – Better integration with existing network storage for simpler ingest – More access options • Services, vocabularies, semantic web • Provenance • Object / file level metadata CSIRO investing in the future of data | John Morrissey14 |
  • 15. What’s even more exciting? • Researchers wanting to add “plug-in” functions to the DAP • Researchers writing whole-of-program data management roadmaps for their business units, heavily referencing DAP and enterprise-developed tools. • Continuation of the “working with research groups” model to implement: • Semantic enablement and vocabularies • Provenance • Reuse of DAP metadata in other tools CSIRO investing in the future of data | John Morrissey15 |
  • 16. interested [ view, download ] similar Data Collection A likely interested Similar Data Collections Data User In current implementation, similar datasets are determined based on : • Title • Description • Keyword • Fields of research • Data Contributor • Activity • Related Collection (specified by data depositors) A Recommender System for Research Data Data Sources • DAP Web Service • Offline files (ANZSRC, Activity) • Server logs (download, views)* *will be included in future New development work by Dr Anusuriya Devaraju, Postdoctoral Fellow ,CSIRO Mineral Resources
  • 17. You may also like : • .. • .. • .. An Overview of the DAP Recommender System SQL database Research Data Recommender Model RecommendationService DAP Metadata Store Web Service Other Data Sources (e.g., server logs, auxiliary data) Data View Data Download Data Deposit (Post-Process)
  • 18. Information Management & Technology John Morrissey eResearch Planner t +61 2 6124 1411 e john.morrissey@csiro.au w www.csiro.au INFORMATION MANAGEMENT & TECHNOLOGY Thank you

Notas do Editor

  1. Staff # as at 3 March 2016 = 5319 2014–15 budget = $1.2 billion -------------------- Today we have around 5300 talented people working out of 50-plus centres in Australia and internationally. We are a billion dollar organisation We generate $485+ million in external revenue – essentially nearly 40% per cent of our revenue is externally sourced Our people work closely with industry and communities to leave a lasting legacy. Our ability to achieve results is shown by the quality of our research. We are in the top 1% of global research institutions in 15 of 22 research fields and in the top 0.1% in four research fields. CSIRO is the key connector of institutions in the Australian system for some areas. CSIRO is the most central Australian institution in 6 research fields – Agricultural Sciences, Environment/Ecology, Plant and Animal Sciences, Geosciences, Chemistry and Materials Science. CSIRO works with 1208 SME’s and 2,877 customers each year. We’re always looking for ways we can help business and industry.