Just an idea I have about Research Analytics. So the Dutch government would like to see performance indicators by the Higher Education institutions. This information can be drawn from many silo's and just give the indicator. However, when the information from these silo's can be combined, more advanced analytic's can be drawn from this information by data and text mining techniques.
But the first step is to integrate the information in a scalable data infrastructure. Research information is not big data, but it is wise to think about scalability to overcome performance issues in the future.
Multiple sources provide the data with a daily update, such as repositories'metadata, web log files, altmetrics data, citation references, funding and grand information, etc. The information is calculated and implicit relations are made explicit. In he end it is provided as open data invarious formats, including RESTful API's. Services can draw information from these API's to base their services on. Examples are metrics and analytics services, but also resolution and information portals.
4. Research Analytics Infrastructure
building an Open Data middleware infrastructure
for research information
CC-BY: Maurice Vanderfeesten
5. What should be done with the meaning outcomes? Making policy
wisdom
What do the calculations mean? intelligence Impact interpretation
What can we calculate? analytics Impact factors
Calculations, Correllations & Comparisson
metrics HE performance indicators
What can we measure?
Accumulations, Citations, Downloads, Mentions, Bookmarks, etc.
data
What data do we have?
Publications, Datasets, Tweets, Projects, People, Grants, etc.
sources
What sources are Repositories, Mendeley, CRIS’s, Blogs, NOD, Facebook, SciVerse, Twitter, Web of Science
out there?
CC-BY: Maurice Vanderfeesten
6. Problem statement
Current research policy is made based on a homogeneous-mono-metrics
(eenzijdige metrics), from a single source.
Yet making impact spans a much wider spectrum.
CC-BY: Maurice Vanderfeesten
8. What should be done with the meaning outcomes? Making policy
wisdom
What do the calculations mean? intelligence Impact interpretation
What can we calculate? analytics H-index Impact factors
Leiden index
Shanghai index
metrics HE performance indicators
What can we measure?
Citations
data
What data do we have? A-journal
peer reviewed
Publications
sources
Thompson-Reuters ISI
What sources are Web of Science
out there?
CC-BY: Maurice Vanderfeesten
10. What should be done with the meaning outcomes? Making policy
wisdom
What do the calculations mean? intelligence Impact interpretation
What can we calculate? analytics Impact factors
Open Access
Popularity
index?
metrics Not yet, HE performance indicators
What can we measure?
Downloads
data
What data do we have? Open Access
Publications
sources
What sources are Repositories
out there?
CC-BY: Maurice Vanderfeesten
11. What should be done with the meaning outcomes? Making policy
wisdom
What do the calculations mean? intelligence Impact interpretation
What can we calculate? analytics Impact factors
Like-ability
index?
metrics Not yet, HE performance indicators
What can we measure?
Mentions /
Altmetrics
data
What data do we have? Tweets,
Bookmarks
sources
Twitter,
What sources are Mendeley
out there?
CC-BY: Maurice Vanderfeesten
13. Policy makers must answer:
Only the HE community wisdom
can make a Business case What could be done with the meaning outcomes?
intelligence Impact interpretors must answer:
What should the calculations mean?
analytics Impact factors
Calculations, Correllations & Comparisson
metrics HE performance indicators
SURF can orchestrate
the ICT infrastructure Accumulations, Citations, Downloads, Mentions, Bookmarks, etc.
and licences
data
Publications, Datasets, Tweets, Projects, People, Grants, etc.
sources
Repositories, Mendeley, CRIS’s, Blogs, NOD, Facebook, SciVerse, Twitter, Web of Science
CC-BY: Maurice Vanderfeesten
15. Imagine…
• A single entry where you can find the research information you need
• Where you can combine research information to build rich analytics
• Get’s updated daily from different sources of information
• Scalable and performing fast for tons of services to draw from
CC-BY: Maurice Vanderfeesten
16. Community Cloud
Scalables Research Information storage and restructuring from and by the Dutch
Higher Education sector
CC-BY: Maurice Vanderfeesten
17. Research Analytics middleware
Sources Data types (Information Brokerage and Restructuring) Services Users/Partners
distributed scalable database (community cloud)
institutions
Holding:
CWTS
licences Research
Partners:
CBS
Technology
Partners: Researchers
ministry
Proven Open
Technology:
Libraries
CC-BY: Maurice Vanderfeesten
20. Sources Data types Data warehouses Services Users
metadata
repositories Publications Authors NARCIS
[PID] [DAI]
[metadata] [PID] Researchers
[DAI]
Resolution Resolution
[PID]
[URL]
web usage Repositorymetrics
[http event]
[PID] Libraries
[GEO]
CC-BY: Maurice Vanderfeesten
21. Sources Data types Data warehouses Services Users
Repository metrics
repositories web statistics Web usage
[http event] Libraries
[PID]
[GEO]
publishers Web usage Subscription metrics
web statistics [http event] Libraries
[PID]
[GEO]
Twitter altmetrics
Alt table Social metrics
facebook [social media event]
[PID] Researchers
Mendeley
Citation table institutions
[citation reference] Citation metrics
citation [PID]
Web of CWTS
science
CC-BY: Maurice Vanderfeesten
22. Questions
• All these sources, why does every service need to build a separate
datawarehouse?
• What are the posibilities if we could combine all the information
needed for different services, and put it in one community cloud?
CC-BY: Maurice Vanderfeesten
23. Metrics Analytics
From Metrics to Analytics
Integrating sources of information, making trends across sources visible
CC-BY: Maurice Vanderfeesten
24. Sources Data types Data warehouse Services Users
metadata institutions
repositories
web statistics
CWTS
publishers
web statistics
Usage table
[http event]
CBS
scopus
[PID] Metrics
[GEO]
Web of citation
science Researchers
Alt table Citation table
Twitter altmetrics [social media event] [citation reference]
[PID] [PID]
facebook OCW
Mendeley
Library
CC-BY: Maurice Vanderfeesten
25. Sources Data types Data warehouse Services Users
institutions
repositories
web statistics
CWTS
publishers
web statistics
Usage table
[http event]
CBS
scopus
[PID] Analytics
[GEO]
Web of citation
science Researchers
Alt table Citation table
Twitter altmetrics [social media event] [citation reference]
[PID] [PID]
facebook OCW
Mendeley
Library
CC-BY: Maurice Vanderfeesten
26. Adding sources, adding
services
Potential for a high performing community cloud, as a middelware solution for
shared services in the Research Information and Research Analytics domain.
CC-BY: Maurice Vanderfeesten
27. Community Cloud
as middleware for
Research Information
Services
Resolution NARCIS Analytics Repositorymetrics HBO kennisbank
CC-BY: Maurice Vanderfeesten
28. Research Analytics middleware
Sources Data types (Information Brokerage and Restructuring) Services Users
distributed scalable database (community cloud) Resolution
metadata institutions
repositories
Publications Authors
web statistics [PID] [DAI]
[metadata] [ISNI/ ORCID]
[DAI] [PID] CWTS
NARCIS
publishers
web statistics
Usage events Resolution entries
[http event] [PID]
CBS
scopus
[PID] [URL]
[GEO]
Web of citation Analytics
science data mesh Researchers
Altmetrics Citation references
Twitter altmetrics [social media event] [citation reference]
[PID] [PID]
facebook Repositorymetrics OCW
Mendeley
Projects Etc… scalable
OCLC ISNI/DAI [ProjectID] […] Library
[PID] […] HBO kennisbank
CERIF
CRIS’
CC-BY: Maurice Vanderfeesten