SlideShare uma empresa Scribd logo
1 de 25
Baixar para ler offline
www.tugraz.at n
W I S S E N n T E C H N I K n L E I D E N S C H A F T
u www.tugraz.at
Research Data Explored:
Citations versus Altmetrics
Isabella Peters (ZBW), Peter Kraker (Know-Center), Elisabeth Lex (TU
Graz), Christian Gumpenberger (Uni Wien), Juan Gorraiz (Uni Wien)
32. Austrian Librarian Day, Sept 17th 2015, Vienna
www.tugraz.at n
Motivation
•  Data citations have gained momentum
•  Citations: Publish or Perish
•  Altmetrics: social-media based metrics
•  Societal impact of research data
Our Goal: Investigate research data with respect
bibliometric characteristics - citations as well as
altmetrics
2
www.tugraz.at n
Our study – Dataset
•  Thomson Reuters Data Citation Index (DCI)
•  high-quality research data from various
repositories
•  Enables search, exploration and bibliometric
analysis of research data through Web of
Science
•  We did a basic analysis for all items published in
DCI between 1960 and 2014
•  Plus: altmetrics collected from three big altmetrics
data providers: ImpactStory, Altmetric.com, PlumX
3
www.tugraz.at n
Research Questions
1.  How often are research data cited? Which and how
many of these have a DOI? From which repositories
do research data originate?
2.  What are the characteristics of the most cited
research data? Which data types and disciplines are
the most cited? How does citedness evolve over
time?
3.  To what extent are cited research data visible on
various altmetrics channels? Are there any
differences between the tools used for altmetrics
scores aggregation?
4
www.tugraz.at n
ImpactStory
•  Targeted at individual researcher
•  Works with individually assigned permanent
identifiers (e.g. DOIs, URLs, PubMed IDs) or links to
ORCID, Figshare, Publons, Slideshare, or Github to
auto-import new research outputs like e.g. papers,
data sets, slides
•  Features altmetric scores (Twitter, Facebook,
Mendeley, Figshare, Google+, and Wikipedia
mentions)
5
www.tugraz.at n
Altmetric.com
•  Targeted towards institutions and organizations
•  Provides an altmetrics score + underlying data
•  Search within variety of social media-platforms (e.g.,
Twitter, Facebook, Google+, blogs) for keywords and
for permanent identifiers
•  E.g. DOIs, arXiv IDs, PubMed IDs
6
www.tugraz.at n
PlumX
•  Article-level metrics for “artifacts”
•  articles, audios, videos, book chapters, trials
•  Works with ORCID and other user IDs (e.g., from
YouTube, Slideshare) as well as with DOIs, ISBNs,
PubMed-IDs, patent numbers, and URLs
•  Statistics on usage of articles and artifacts
•  e.g., views to or downloads of html pages or pdfs),
Mendeley readers, GitHub forks, Facebook
comments, YouTube subscribers.
7
www.tugraz.at n
Methodology
•  DCI to retrieve records of cited research data
•  Items published in the last decades (1960-9, 1970-9,
1980-9, 1990-9, 2000-9, 2010-4)
•  Metadata fields: DOI/URL, doc type, source, research
area, publication year, data type, #citations, ORCID
•  Citedness investigated for each decade
•  Distribution of document types, data types, sources,
research area
•  with >=2 citation (Sample 1, n=10,934 records )
•  with >= 2 citations and at least 1 altmetric score
(Sample 2, n= 301)
8
www.tugraz.at n
Results
9
high uncitedness of research data
low percentage of altmetrics scores available for research data with >= 2
citations
www.tugraz.at n
Results for Sample 1
10
Citedness comparatively higher for research data published more recently
! interest in younger research data and increase in social media activity
www.tugraz.at n
Citation Distribution for Sample 1
11
•  Almost half of the data
studies have a DOI
(48.9%) but only few data
sets
•  Data studies on average
more cited than data sets
•  Data studies with DOI
more citations than with
URL
•  Only few repositories
(51), but most citations
www.tugraz.at n
Citation Distribution for Sample 1
12
Half of the research data (4,974 items; 45.5%) à only 2 citations
6 items (2 repos and 4 data studies): > 1000 citations
www.tugraz.at n
Citation Distribution for Sample 1
•  Differences between most cited data types when
considering research data with a DOI or with a URL
13
www.tugraz.at n
Citation Distribution for Sample 1
•  More common to refer to data studies via DOIs in
Social Sciences than in Natural and Life Sciences
14
Disciplinary differences: DOIs vs URLs, document types
www.tugraz.at n
Results for Sample 2
15
•  Total of altmetrics
scores < than
number of citations
for all document
types with or
without DOI
•  Mean altmetrics
score higher for
data studies than
for data sets
www.tugraz.at n
Results for Sample 2
•  Distributions of data types and subject areas
16
www.tugraz.at n
Results for Sample 2
•  Distributions of data types and subject areas
17
www.tugraz.at n
Results for Sample 2
•  Distributions of data types and subject areas
18
www.tugraz.at n
Correlation Analysis
19
No correlation between citations and altmetrics scores in Sample 2
www.tugraz.at n
Details on Altmetrics Analysis in Plum X
20
•  DOIs for data sets
seem to be important
in order to get
captures (Mendeley)
•  URL sufficient for
inclusion in social
media (e.g.
Facebook, Twitter)
www.tugraz.at n
More Altmetrics Results...
•  Top 10 research
data-DOIs with >=2
citations and with at
least 1entry in PlumX
•  Cited research data
attracts more
citations than
altmetrics scores
•  No correlation
between highly cited
and highly scored
research data.
21
www.tugraz.at n
Conclusions
•  Low percentage of altmetrics scores for research
data with two or more citations
•  Research data not so often published/shared?
•  Reliability of altmetrics aggregation tools?
•  We didn‘t observe a correlation between citation and
altmetrics scores
•  Neither most cited research data nor most cited
sources (repositories) received highest scores in
PlumX
•  Interestingly, although “figshare” accounts for almost
25% of the DCI, no item from “figshare” was cited at
least twice in DCI à see our follow-up work
presented at STI 2015!22
www.tugraz.at n
Conclusions
•  Growing trend in citing research data since 2008 –
bias towards more recent research data à in general,
Research data mostly uncited
•  Availability of cited research data with a DOI rather
low in DCI, but increasing
•  Data studies with a DOI attract more citations than
those with a URL
•  DOI in cited research data has so far been more
embraced in the Social Sciences than in the Natural
Sciences
•  DOI/identifiers important to increase altmetrics scores
as well as aggregators rely on it
23
www.tugraz.at n
Future Work
•  Investigate data citations in more detail
•  Different from „paper citations“
•  E.g. we found that entire repositories are
proportionally more often cited than single data
sets
•  Meaning of data citations
•  Influence of structure of underlying data
•  Data curation, identifiers,..
24
www.tugraz.at n
Thank you for your attention!
Elisabeth Lex
elisabeth.lex@tugraz.at
25

Mais conteúdo relacionado

Mais procurados

Stevan Harnad: Open Access - Open Data: similarities and differences
Stevan Harnad: Open Access - Open Data: similarities and differencesStevan Harnad: Open Access - Open Data: similarities and differences
Stevan Harnad: Open Access - Open Data: similarities and differences
"Open Access - Open Data" conference, 13th/14th December, 2010
 
VOSviewer and CitNetExplorer Tutorial
VOSviewer and CitNetExplorer TutorialVOSviewer and CitNetExplorer Tutorial
VOSviewer and CitNetExplorer Tutorial
Nees Jan van Eck
 
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
Susanna-Assunta Sansone
 

Mais procurados (20)

Stevan Harnad: Open Access - Open Data: similarities and differences
Stevan Harnad: Open Access - Open Data: similarities and differencesStevan Harnad: Open Access - Open Data: similarities and differences
Stevan Harnad: Open Access - Open Data: similarities and differences
 
Open Science: Research Data Management
Open Science: Research Data ManagementOpen Science: Research Data Management
Open Science: Research Data Management
 
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
 
Transparency and reproducibility in research
Transparency and reproducibility in researchTransparency and reproducibility in research
Transparency and reproducibility in research
 
THOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing ElsevierTHOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing Elsevier
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things data
 
VOSviewer and CitNetExplorer Tutorial
VOSviewer and CitNetExplorer TutorialVOSviewer and CitNetExplorer Tutorial
VOSviewer and CitNetExplorer Tutorial
 
Research data: publishers, policies and patient privacy
Research data: publishers, policies and patient privacyResearch data: publishers, policies and patient privacy
Research data: publishers, policies and patient privacy
 
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
 
Integrating research indicators for use in the repositories infrastructure
Integrating research indicators for use in the repositories infrastructure Integrating research indicators for use in the repositories infrastructure
Integrating research indicators for use in the repositories infrastructure
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystem
 
ODIN Final Event - The Care and Feeding of Scientific Data
ODIN Final Event - The Care and Feeding of Scientific DataODIN Final Event - The Care and Feeding of Scientific Data
ODIN Final Event - The Care and Feeding of Scientific Data
 
Giving researchers credit for data
Giving researchers credit for dataGiving researchers credit for data
Giving researchers credit for data
 
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
 
Data availability
Data availabilityData availability
Data availability
 
Workflows for Publishing Data; Scientific Data's experience as an early adopter
Workflows for Publishing Data; Scientific Data's experience as an early adopterWorkflows for Publishing Data; Scientific Data's experience as an early adopter
Workflows for Publishing Data; Scientific Data's experience as an early adopter
 
The Fourth Paradigm - Deltares Data Science Day, 31 October 2014
The Fourth Paradigm - Deltares Data Science Day, 31 October 2014The Fourth Paradigm - Deltares Data Science Day, 31 October 2014
The Fourth Paradigm - Deltares Data Science Day, 31 October 2014
 
COAR Next Generation Repositories WG - Text mining and Recommender system sto...
COAR Next Generation Repositories WG - Text mining and Recommender system sto...COAR Next Generation Repositories WG - Text mining and Recommender system sto...
COAR Next Generation Repositories WG - Text mining and Recommender system sto...
 
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
 
Open Science : Democratizing Access to Science
Open Science : Democratizing Access to ScienceOpen Science : Democratizing Access to Science
Open Science : Democratizing Access to Science
 

Semelhante a Research Data Explored: Citations versus Altmetrics

Has anyone seen my data? Incentivising #opendata sharing with altmetrics
Has anyone seen my data? Incentivising #opendata sharing with altmetricsHas anyone seen my data? Incentivising #opendata sharing with altmetrics
Has anyone seen my data? Incentivising #opendata sharing with altmetrics
Nick Sheppard
 
Altmetrics: the movement, the tools, and the implications
Altmetrics: the movement, the tools, and the implicationsAltmetrics: the movement, the tools, and the implications
Altmetrics: the movement, the tools, and the implications
KR_Barker
 
CL8 Scientiometrics Module 6 RPE-Rijo TKMCE.pdf
CL8 Scientiometrics Module 6 RPE-Rijo TKMCE.pdfCL8 Scientiometrics Module 6 RPE-Rijo TKMCE.pdf
CL8 Scientiometrics Module 6 RPE-Rijo TKMCE.pdf
ssuserb76cdd
 

Semelhante a Research Data Explored: Citations versus Altmetrics (20)

Has anyone seen my data? Incentivising #opendata sharing with altmetrics
Has anyone seen my data? Incentivising #opendata sharing with altmetricsHas anyone seen my data? Incentivising #opendata sharing with altmetrics
Has anyone seen my data? Incentivising #opendata sharing with altmetrics
 
Public engagement while you sleep
Public engagement while you sleepPublic engagement while you sleep
Public engagement while you sleep
 
Academic Social Networks and Researcher Ranking
Academic Social Networks and Researcher RankingAcademic Social Networks and Researcher Ranking
Academic Social Networks and Researcher Ranking
 
Why altmetrics?
Why altmetrics?Why altmetrics?
Why altmetrics?
 
Altmetrics: the movement, the tools, and the implications
Altmetrics: the movement, the tools, and the implicationsAltmetrics: the movement, the tools, and the implications
Altmetrics: the movement, the tools, and the implications
 
ALTMETRICS : A HASTY PEEP INTO NEW SCHOLARLY MEASUREMENT
ALTMETRICS : A HASTY PEEP INTO NEW SCHOLARLY MEASUREMENTALTMETRICS : A HASTY PEEP INTO NEW SCHOLARLY MEASUREMENT
ALTMETRICS : A HASTY PEEP INTO NEW SCHOLARLY MEASUREMENT
 
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
 
Falk-Krzesinski, "Administrator (Institutional Use of the Data): Data-informe...
Falk-Krzesinski, "Administrator (Institutional Use of the Data): Data-informe...Falk-Krzesinski, "Administrator (Institutional Use of the Data): Data-informe...
Falk-Krzesinski, "Administrator (Institutional Use of the Data): Data-informe...
 
Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...
 
Public engagement while you sleep
Public engagement while you sleep Public engagement while you sleep
Public engagement while you sleep
 
Exploring Altmetrics with Impactstory
Exploring Altmetrics with ImpactstoryExploring Altmetrics with Impactstory
Exploring Altmetrics with Impactstory
 
Altmetrics: The Movement, The Tools, and the Implications
Altmetrics: The Movement, The Tools, and the ImplicationsAltmetrics: The Movement, The Tools, and the Implications
Altmetrics: The Movement, The Tools, and the Implications
 
Altmetrics: The Movement, The Tools, and the Implications
Altmetrics: The Movement, The Tools, and the ImplicationsAltmetrics: The Movement, The Tools, and the Implications
Altmetrics: The Movement, The Tools, and the Implications
 
Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...
Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...
Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...
 
Assessing research impact mic 1 Sep 2015
Assessing research impact mic 1 Sep 2015Assessing research impact mic 1 Sep 2015
Assessing research impact mic 1 Sep 2015
 
Assessing Research Impact: Bibliometrics, Citations and the H-Index
Assessing Research Impact: Bibliometrics, Citations and the H-IndexAssessing Research Impact: Bibliometrics, Citations and the H-Index
Assessing Research Impact: Bibliometrics, Citations and the H-Index
 
British Library
British LibraryBritish Library
British Library
 
Altmetrics in Practice - wssf -- Montreal - Oct 14, 2013
Altmetrics in Practice  - wssf -- Montreal - Oct 14, 2013Altmetrics in Practice  - wssf -- Montreal - Oct 14, 2013
Altmetrics in Practice - wssf -- Montreal - Oct 14, 2013
 
Introduction to Metrics and Impact Tracking
Introduction to Metrics and Impact TrackingIntroduction to Metrics and Impact Tracking
Introduction to Metrics and Impact Tracking
 
CL8 Scientiometrics Module 6 RPE-Rijo TKMCE.pdf
CL8 Scientiometrics Module 6 RPE-Rijo TKMCE.pdfCL8 Scientiometrics Module 6 RPE-Rijo TKMCE.pdf
CL8 Scientiometrics Module 6 RPE-Rijo TKMCE.pdf
 

Último

Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
ssuser79fe74
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
RizalinePalanog2
 

Último (20)

FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
American Type Culture Collection (ATCC).pptx
American Type Culture Collection (ATCC).pptxAmerican Type Culture Collection (ATCC).pptx
American Type Culture Collection (ATCC).pptx
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 

Research Data Explored: Citations versus Altmetrics

  • 1. www.tugraz.at n W I S S E N n T E C H N I K n L E I D E N S C H A F T u www.tugraz.at Research Data Explored: Citations versus Altmetrics Isabella Peters (ZBW), Peter Kraker (Know-Center), Elisabeth Lex (TU Graz), Christian Gumpenberger (Uni Wien), Juan Gorraiz (Uni Wien) 32. Austrian Librarian Day, Sept 17th 2015, Vienna
  • 2. www.tugraz.at n Motivation •  Data citations have gained momentum •  Citations: Publish or Perish •  Altmetrics: social-media based metrics •  Societal impact of research data Our Goal: Investigate research data with respect bibliometric characteristics - citations as well as altmetrics 2
  • 3. www.tugraz.at n Our study – Dataset •  Thomson Reuters Data Citation Index (DCI) •  high-quality research data from various repositories •  Enables search, exploration and bibliometric analysis of research data through Web of Science •  We did a basic analysis for all items published in DCI between 1960 and 2014 •  Plus: altmetrics collected from three big altmetrics data providers: ImpactStory, Altmetric.com, PlumX 3
  • 4. www.tugraz.at n Research Questions 1.  How often are research data cited? Which and how many of these have a DOI? From which repositories do research data originate? 2.  What are the characteristics of the most cited research data? Which data types and disciplines are the most cited? How does citedness evolve over time? 3.  To what extent are cited research data visible on various altmetrics channels? Are there any differences between the tools used for altmetrics scores aggregation? 4
  • 5. www.tugraz.at n ImpactStory •  Targeted at individual researcher •  Works with individually assigned permanent identifiers (e.g. DOIs, URLs, PubMed IDs) or links to ORCID, Figshare, Publons, Slideshare, or Github to auto-import new research outputs like e.g. papers, data sets, slides •  Features altmetric scores (Twitter, Facebook, Mendeley, Figshare, Google+, and Wikipedia mentions) 5
  • 6. www.tugraz.at n Altmetric.com •  Targeted towards institutions and organizations •  Provides an altmetrics score + underlying data •  Search within variety of social media-platforms (e.g., Twitter, Facebook, Google+, blogs) for keywords and for permanent identifiers •  E.g. DOIs, arXiv IDs, PubMed IDs 6
  • 7. www.tugraz.at n PlumX •  Article-level metrics for “artifacts” •  articles, audios, videos, book chapters, trials •  Works with ORCID and other user IDs (e.g., from YouTube, Slideshare) as well as with DOIs, ISBNs, PubMed-IDs, patent numbers, and URLs •  Statistics on usage of articles and artifacts •  e.g., views to or downloads of html pages or pdfs), Mendeley readers, GitHub forks, Facebook comments, YouTube subscribers. 7
  • 8. www.tugraz.at n Methodology •  DCI to retrieve records of cited research data •  Items published in the last decades (1960-9, 1970-9, 1980-9, 1990-9, 2000-9, 2010-4) •  Metadata fields: DOI/URL, doc type, source, research area, publication year, data type, #citations, ORCID •  Citedness investigated for each decade •  Distribution of document types, data types, sources, research area •  with >=2 citation (Sample 1, n=10,934 records ) •  with >= 2 citations and at least 1 altmetric score (Sample 2, n= 301) 8
  • 9. www.tugraz.at n Results 9 high uncitedness of research data low percentage of altmetrics scores available for research data with >= 2 citations
  • 10. www.tugraz.at n Results for Sample 1 10 Citedness comparatively higher for research data published more recently ! interest in younger research data and increase in social media activity
  • 11. www.tugraz.at n Citation Distribution for Sample 1 11 •  Almost half of the data studies have a DOI (48.9%) but only few data sets •  Data studies on average more cited than data sets •  Data studies with DOI more citations than with URL •  Only few repositories (51), but most citations
  • 12. www.tugraz.at n Citation Distribution for Sample 1 12 Half of the research data (4,974 items; 45.5%) à only 2 citations 6 items (2 repos and 4 data studies): > 1000 citations
  • 13. www.tugraz.at n Citation Distribution for Sample 1 •  Differences between most cited data types when considering research data with a DOI or with a URL 13
  • 14. www.tugraz.at n Citation Distribution for Sample 1 •  More common to refer to data studies via DOIs in Social Sciences than in Natural and Life Sciences 14 Disciplinary differences: DOIs vs URLs, document types
  • 15. www.tugraz.at n Results for Sample 2 15 •  Total of altmetrics scores < than number of citations for all document types with or without DOI •  Mean altmetrics score higher for data studies than for data sets
  • 16. www.tugraz.at n Results for Sample 2 •  Distributions of data types and subject areas 16
  • 17. www.tugraz.at n Results for Sample 2 •  Distributions of data types and subject areas 17
  • 18. www.tugraz.at n Results for Sample 2 •  Distributions of data types and subject areas 18
  • 19. www.tugraz.at n Correlation Analysis 19 No correlation between citations and altmetrics scores in Sample 2
  • 20. www.tugraz.at n Details on Altmetrics Analysis in Plum X 20 •  DOIs for data sets seem to be important in order to get captures (Mendeley) •  URL sufficient for inclusion in social media (e.g. Facebook, Twitter)
  • 21. www.tugraz.at n More Altmetrics Results... •  Top 10 research data-DOIs with >=2 citations and with at least 1entry in PlumX •  Cited research data attracts more citations than altmetrics scores •  No correlation between highly cited and highly scored research data. 21
  • 22. www.tugraz.at n Conclusions •  Low percentage of altmetrics scores for research data with two or more citations •  Research data not so often published/shared? •  Reliability of altmetrics aggregation tools? •  We didn‘t observe a correlation between citation and altmetrics scores •  Neither most cited research data nor most cited sources (repositories) received highest scores in PlumX •  Interestingly, although “figshare” accounts for almost 25% of the DCI, no item from “figshare” was cited at least twice in DCI à see our follow-up work presented at STI 2015!22
  • 23. www.tugraz.at n Conclusions •  Growing trend in citing research data since 2008 – bias towards more recent research data à in general, Research data mostly uncited •  Availability of cited research data with a DOI rather low in DCI, but increasing •  Data studies with a DOI attract more citations than those with a URL •  DOI in cited research data has so far been more embraced in the Social Sciences than in the Natural Sciences •  DOI/identifiers important to increase altmetrics scores as well as aggregators rely on it 23
  • 24. www.tugraz.at n Future Work •  Investigate data citations in more detail •  Different from „paper citations“ •  E.g. we found that entire repositories are proportionally more often cited than single data sets •  Meaning of data citations •  Influence of structure of underlying data •  Data curation, identifiers,.. 24
  • 25. www.tugraz.at n Thank you for your attention! Elisabeth Lex elisabeth.lex@tugraz.at 25