SlideShare uma empresa Scribd logo
1 de 24
Abstract: Connecting locally hosted data repositories to internationally hosted related articles has never
been easier. With APIs and other web services becoming standardized at the same time that new linking
standards, such as Datacite DOIs, are being adopted, new ways to distribute and mashup content are now
possible. This presentation will explore emerging trends in linking scholarly literature to data. Both entity
linking and data linking will be discussed. Examples will be presented demonstrating how these
technologies are being employed by publishers and A&I vendors in cooperation with local data repositories.
__________________________________________
Before I get started, I would like to take a minute to set some expectations for this talk. The examples used
will primarily be about hard sciences, my challenge to you is to figure out how to apply these technologies
and methods to the digital humanities.




                                                                                                            1
This is a theoretical framework for looking at the different ways that publications can be connected
to data.
This is also the agenda for the talk. I will first speak about the top left quadrant and then work my
way to the bottom right. This means starting from the easiest to apply to the humanities and
working through to the hardest.




                                                                                                        2
This quadrant is primarily about publications to supplemental data.




                                                                      3
Supplemental data submitted as a file with an article is the traditional way. It has its place, but that
is not what I am talking about today.




                                                                                                       4
Instead, new tools now enable display and direct manipulation of data in new and interesting ways.
This example is an application that displays KML files on a Google Map:
http://www.applications.sciverse.com/action/appDetail/298231?zone=main&pageOrigin=appGallery
&activity=display




                                                                                                 5
Next on the agenda is automating the connection between publications and whole supplementary
or related datasets.




                                                                                               6
One example of this is the PANGAEA app which searches PANGAEA apis by article DOI and
retrieves the coordinates of where supplementary data was collected and then charts these on a
Google map displayed directly on the ScienceDirect article page.




                                                                                                 7
This also works on Scopus record pages (so for lot’s of publishers and journals). From deciding to
put it on Scopus as well it took less than 24 hours for the PANGAEA developer to implement. This
was enabled by the SciVerse Applications platform.




                                                                                                 8
Users can link through to the main record for the dataset on PANGAEA. One thing I would like to
mention here is that there is also a DOI for the dataset. This was done through DataCite.




                                                                                                  9
So what is DataCite and why is it important? It is also very important for creating links to data in
repositories.




                                                                                                       10
Takeaway points: International DOI Foundation enables CrossRef to give out DOIs. DataCite
roughly equivalent to CrossRef. Learn more at the DataCite website. A central institution in Serbia
might want to become a Member Institute.




                                                                                                 11
So those were examples of linking to whole datasets and displaying them in new and interesting
ways. Next to discuss is linking to entities.




                                                                                                 12
Traditional linking involves an author marking up an entity such as a protein so that it can be easily
linked to additional information about that entity in a different database. While this is useful, it is
not what I wish to share with you today. Why make a user follow a link when…




                                                                                                     13
You can now embed a 3D interactive model of the protein directly in context in the article. In this
example the PDB Protein Viewer is embedded directly in the article.




                                                                                                      14
In this example an author adds key structures to the article and they are then embedded using
Reaxys information and software.




                                                                                                15
16
The last examples still required an Author to manually mark up entities. Through text analysis and
mining, this is no longer always necessary.




                                                                                                17
In this example, our partner NextBio automatically recognizes entities in the text of the
article.

Easily extendable to new / other entities
Works retrospectively on older content
Does create recall / precision errors




                                                                                            18
Not only can it display them in the sidebar, but the application framework enables adding links to
the entities in the text on the fly.




                                                                                                     19
A reader can then click those links for additional information form multiple databases.




                                                                                          20
1.   Colours & tags genes, proteins, molecule names
2.   Clicking shows a summary of features for the term (ie: sequence or 2D structure)
3.   User can click on links in the pop-up leading out to more information




                                                                                        21
22
* To summarize, we started with very traditional linking of datasets where an author submits the dataset with the
article. One example of how this can be improved was the Interactive map viewer that displays supplementary KML
files rather than simple attaching the files to the article.
* Next we discussed automated linking to datasets. This included the example of searching PANGAEA APIs for
related datasets and then displaying the locations the data was collected. This will be driven by new standards such as
DataCite.
* Third, authors manually mark up entities that can be linked to in other databases. Now it is possible to embed
content from other databases using APIs.
* Last, is totally automated entity recognition using text analysis and mining, Again, information from third party
databases can be embedded directly in the article itself.
* While I haven’t spoken too much about the technologies enabling these new ways of linking articles to data, one
example is the SciVerse Application Framework, which now enables all of the examples discussed today.
http://www.applications.sciverse.com/action/userhome




                                                                                                                      23
I would like to close with the same questions I opened with. Thank you.




                                                                          24

Mais conteúdo relacionado

Semelhante a Connecting Publications & Data: Raising visibility of local data collections through linking with international publication databases

Mendeley Open Repositories 2011 Paper
Mendeley Open Repositories 2011 PaperMendeley Open Repositories 2011 Paper
Mendeley Open Repositories 2011 PaperWilliam Gunn
 
X api chinese cop monthly meeting feb.2016
X api chinese cop monthly meeting   feb.2016X api chinese cop monthly meeting   feb.2016
X api chinese cop monthly meeting feb.2016Jessie Chuang
 
Leveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management SystemLeveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management SystemSemantic Web Company
 
Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database  Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database dannyijwest
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchangelagoze
 
EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...
EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...
EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...IJCI JOURNAL
 
Development of a Web based Shopping Cart using the Mongo DB Database for Huma...
Development of a Web based Shopping Cart using the Mongo DB Database for Huma...Development of a Web based Shopping Cart using the Mongo DB Database for Huma...
Development of a Web based Shopping Cart using the Mongo DB Database for Huma...AI Publications
 
PoolParty Thesaurus Management - ISKO UK, London 2010
PoolParty Thesaurus Management - ISKO UK, London 2010PoolParty Thesaurus Management - ISKO UK, London 2010
PoolParty Thesaurus Management - ISKO UK, London 2010Andreas Blumauer
 
Big data-analytics-cpe8035
Big data-analytics-cpe8035Big data-analytics-cpe8035
Big data-analytics-cpe8035Neelam Rawat
 
SKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategiesSKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategiesSemantic Web Company
 
Notes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
Notes for talk on 12th June 2013 to Open Innovation meeting, GlasgowNotes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
Notes for talk on 12th June 2013 to Open Innovation meeting, GlasgowPeterWinstanley1
 
reegle - a new key portal for open energy data
reegle - a new key portal for open energy datareegle - a new key portal for open energy data
reegle - a new key portal for open energy datareeep
 
moving_from_relational_to_nosql_couchbase_2016
moving_from_relational_to_nosql_couchbase_2016moving_from_relational_to_nosql_couchbase_2016
moving_from_relational_to_nosql_couchbase_2016Richard (Rick) Nelson
 
The “Big Data” Ecosystem at LinkedIn
The “Big Data” Ecosystem at LinkedInThe “Big Data” Ecosystem at LinkedIn
The “Big Data” Ecosystem at LinkedInKun Le
 
The "Big Data" Ecosystem at LinkedIn
The "Big Data" Ecosystem at LinkedInThe "Big Data" Ecosystem at LinkedIn
The "Big Data" Ecosystem at LinkedInSam Shah
 
Graph Databases and Graph Data Science in Neo4j
Graph Databases and Graph Data Science in Neo4jGraph Databases and Graph Data Science in Neo4j
Graph Databases and Graph Data Science in Neo4jijtsrd
 
Evaluation criteria for nosql databases
Evaluation criteria for nosql databasesEvaluation criteria for nosql databases
Evaluation criteria for nosql databasesEbenezer Daniel
 
Jarrar: Introduction to Linked Data
Jarrar: Introduction to Linked DataJarrar: Introduction to Linked Data
Jarrar: Introduction to Linked DataMustafa Jarrar
 
PoolParty Semantic Suite - LT-Innovate Industry Summit-2016 - Brussels
PoolParty Semantic Suite -  LT-Innovate Industry Summit-2016 - BrusselsPoolParty Semantic Suite -  LT-Innovate Industry Summit-2016 - Brussels
PoolParty Semantic Suite - LT-Innovate Industry Summit-2016 - BrusselsMartin Kaltenböck
 

Semelhante a Connecting Publications & Data: Raising visibility of local data collections through linking with international publication databases (20)

Mendeley Open Repositories 2011 Paper
Mendeley Open Repositories 2011 PaperMendeley Open Repositories 2011 Paper
Mendeley Open Repositories 2011 Paper
 
X api chinese cop monthly meeting feb.2016
X api chinese cop monthly meeting   feb.2016X api chinese cop monthly meeting   feb.2016
X api chinese cop monthly meeting feb.2016
 
Leveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management SystemLeveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management System
 
Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database  Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchange
 
EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...
EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...
EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...
 
Development of a Web based Shopping Cart using the Mongo DB Database for Huma...
Development of a Web based Shopping Cart using the Mongo DB Database for Huma...Development of a Web based Shopping Cart using the Mongo DB Database for Huma...
Development of a Web based Shopping Cart using the Mongo DB Database for Huma...
 
PoolParty Thesaurus Management - ISKO UK, London 2010
PoolParty Thesaurus Management - ISKO UK, London 2010PoolParty Thesaurus Management - ISKO UK, London 2010
PoolParty Thesaurus Management - ISKO UK, London 2010
 
Big data-analytics-cpe8035
Big data-analytics-cpe8035Big data-analytics-cpe8035
Big data-analytics-cpe8035
 
SKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategiesSKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategies
 
Linked Data to Improve the OER Experience
Linked Data to Improve the OER ExperienceLinked Data to Improve the OER Experience
Linked Data to Improve the OER Experience
 
Notes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
Notes for talk on 12th June 2013 to Open Innovation meeting, GlasgowNotes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
Notes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
 
reegle - a new key portal for open energy data
reegle - a new key portal for open energy datareegle - a new key portal for open energy data
reegle - a new key portal for open energy data
 
moving_from_relational_to_nosql_couchbase_2016
moving_from_relational_to_nosql_couchbase_2016moving_from_relational_to_nosql_couchbase_2016
moving_from_relational_to_nosql_couchbase_2016
 
The “Big Data” Ecosystem at LinkedIn
The “Big Data” Ecosystem at LinkedInThe “Big Data” Ecosystem at LinkedIn
The “Big Data” Ecosystem at LinkedIn
 
The "Big Data" Ecosystem at LinkedIn
The "Big Data" Ecosystem at LinkedInThe "Big Data" Ecosystem at LinkedIn
The "Big Data" Ecosystem at LinkedIn
 
Graph Databases and Graph Data Science in Neo4j
Graph Databases and Graph Data Science in Neo4jGraph Databases and Graph Data Science in Neo4j
Graph Databases and Graph Data Science in Neo4j
 
Evaluation criteria for nosql databases
Evaluation criteria for nosql databasesEvaluation criteria for nosql databases
Evaluation criteria for nosql databases
 
Jarrar: Introduction to Linked Data
Jarrar: Introduction to Linked DataJarrar: Introduction to Linked Data
Jarrar: Introduction to Linked Data
 
PoolParty Semantic Suite - LT-Innovate Industry Summit-2016 - Brussels
PoolParty Semantic Suite -  LT-Innovate Industry Summit-2016 - BrusselsPoolParty Semantic Suite -  LT-Innovate Industry Summit-2016 - Brussels
PoolParty Semantic Suite - LT-Innovate Industry Summit-2016 - Brussels
 

Mais de Michael Habib

Complexities in Open Access Discovery Interfaces
Complexities in Open Access Discovery InterfacesComplexities in Open Access Discovery Interfaces
Complexities in Open Access Discovery InterfacesMichael Habib
 
Ubiquitous Open Access: Changing culture by integrating OA into user workflows
Ubiquitous Open Access: Changing culture by integrating OA into user workflowsUbiquitous Open Access: Changing culture by integrating OA into user workflows
Ubiquitous Open Access: Changing culture by integrating OA into user workflowsMichael Habib
 
Measure for Measure: The role of metrics in assessing research performance - ...
Measure for Measure: The role of metrics in assessing research performance - ...Measure for Measure: The role of metrics in assessing research performance - ...
Measure for Measure: The role of metrics in assessing research performance - ...Michael Habib
 
Application Platforms and Developer Communities - New software tools and app...
Application Platforms and Developer Communities -  New software tools and app...Application Platforms and Developer Communities -  New software tools and app...
Application Platforms and Developer Communities - New software tools and app...Michael Habib
 
"New Technologies: Empowering the Research community for Better Outcomes", L...
"New Technologies:  Empowering the Research community for Better Outcomes", L..."New Technologies:  Empowering the Research community for Better Outcomes", L...
"New Technologies: Empowering the Research community for Better Outcomes", L...Michael Habib
 
Scopus March 2012 release overview: New Document Details Pages, Interoperabil...
Scopus March 2012 release overview: New Document Details Pages, Interoperabil...Scopus March 2012 release overview: New Document Details Pages, Interoperabil...
Scopus March 2012 release overview: New Document Details Pages, Interoperabil...Michael Habib
 
SNEAK PREVIEW Scopus Analyze Results: Overview and use case
SNEAK PREVIEW Scopus Analyze Results: Overview and use caseSNEAK PREVIEW Scopus Analyze Results: Overview and use case
SNEAK PREVIEW Scopus Analyze Results: Overview and use caseMichael Habib
 
From Academic Library 2.0 to (Literature) Research 2.0
From Academic Library 2.0  to (Literature) Research 2.0From Academic Library 2.0  to (Literature) Research 2.0
From Academic Library 2.0 to (Literature) Research 2.0Michael Habib
 
Scholarly Reputation Management Online : The Challenges and Opportunities of ...
Scholarly Reputation Management Online: The Challenges and Opportunities of ...Scholarly Reputation Management Online: The Challenges and Opportunities of ...
Scholarly Reputation Management Online : The Challenges and Opportunities of ...Michael Habib
 
Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0
Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0
Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0Michael Habib
 

Mais de Michael Habib (10)

Complexities in Open Access Discovery Interfaces
Complexities in Open Access Discovery InterfacesComplexities in Open Access Discovery Interfaces
Complexities in Open Access Discovery Interfaces
 
Ubiquitous Open Access: Changing culture by integrating OA into user workflows
Ubiquitous Open Access: Changing culture by integrating OA into user workflowsUbiquitous Open Access: Changing culture by integrating OA into user workflows
Ubiquitous Open Access: Changing culture by integrating OA into user workflows
 
Measure for Measure: The role of metrics in assessing research performance - ...
Measure for Measure: The role of metrics in assessing research performance - ...Measure for Measure: The role of metrics in assessing research performance - ...
Measure for Measure: The role of metrics in assessing research performance - ...
 
Application Platforms and Developer Communities - New software tools and app...
Application Platforms and Developer Communities -  New software tools and app...Application Platforms and Developer Communities -  New software tools and app...
Application Platforms and Developer Communities - New software tools and app...
 
"New Technologies: Empowering the Research community for Better Outcomes", L...
"New Technologies:  Empowering the Research community for Better Outcomes", L..."New Technologies:  Empowering the Research community for Better Outcomes", L...
"New Technologies: Empowering the Research community for Better Outcomes", L...
 
Scopus March 2012 release overview: New Document Details Pages, Interoperabil...
Scopus March 2012 release overview: New Document Details Pages, Interoperabil...Scopus March 2012 release overview: New Document Details Pages, Interoperabil...
Scopus March 2012 release overview: New Document Details Pages, Interoperabil...
 
SNEAK PREVIEW Scopus Analyze Results: Overview and use case
SNEAK PREVIEW Scopus Analyze Results: Overview and use caseSNEAK PREVIEW Scopus Analyze Results: Overview and use case
SNEAK PREVIEW Scopus Analyze Results: Overview and use case
 
From Academic Library 2.0 to (Literature) Research 2.0
From Academic Library 2.0  to (Literature) Research 2.0From Academic Library 2.0  to (Literature) Research 2.0
From Academic Library 2.0 to (Literature) Research 2.0
 
Scholarly Reputation Management Online : The Challenges and Opportunities of ...
Scholarly Reputation Management Online: The Challenges and Opportunities of ...Scholarly Reputation Management Online: The Challenges and Opportunities of ...
Scholarly Reputation Management Online : The Challenges and Opportunities of ...
 
Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0
Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0
Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0
 

Último

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...Amil baba
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxPooja Bhuva
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSCeline George
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsKarakKing
 
latest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answerslatest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answersdalebeck957
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxUmeshTimilsina1
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfDr Vijay Vishwakarma
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxPooja Bhuva
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxmarlenawright1
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
OSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & SystemsOSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & SystemsSandeep D Chaudhary
 
Tatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf artsTatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf artsNbelano25
 

Último (20)

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
latest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answerslatest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answers
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptx
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
OSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & SystemsOSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & Systems
 
Tatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf artsTatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf arts
 

Connecting Publications & Data: Raising visibility of local data collections through linking with international publication databases

  • 1. Abstract: Connecting locally hosted data repositories to internationally hosted related articles has never been easier. With APIs and other web services becoming standardized at the same time that new linking standards, such as Datacite DOIs, are being adopted, new ways to distribute and mashup content are now possible. This presentation will explore emerging trends in linking scholarly literature to data. Both entity linking and data linking will be discussed. Examples will be presented demonstrating how these technologies are being employed by publishers and A&I vendors in cooperation with local data repositories. __________________________________________ Before I get started, I would like to take a minute to set some expectations for this talk. The examples used will primarily be about hard sciences, my challenge to you is to figure out how to apply these technologies and methods to the digital humanities. 1
  • 2. This is a theoretical framework for looking at the different ways that publications can be connected to data. This is also the agenda for the talk. I will first speak about the top left quadrant and then work my way to the bottom right. This means starting from the easiest to apply to the humanities and working through to the hardest. 2
  • 3. This quadrant is primarily about publications to supplemental data. 3
  • 4. Supplemental data submitted as a file with an article is the traditional way. It has its place, but that is not what I am talking about today. 4
  • 5. Instead, new tools now enable display and direct manipulation of data in new and interesting ways. This example is an application that displays KML files on a Google Map: http://www.applications.sciverse.com/action/appDetail/298231?zone=main&pageOrigin=appGallery &activity=display 5
  • 6. Next on the agenda is automating the connection between publications and whole supplementary or related datasets. 6
  • 7. One example of this is the PANGAEA app which searches PANGAEA apis by article DOI and retrieves the coordinates of where supplementary data was collected and then charts these on a Google map displayed directly on the ScienceDirect article page. 7
  • 8. This also works on Scopus record pages (so for lot’s of publishers and journals). From deciding to put it on Scopus as well it took less than 24 hours for the PANGAEA developer to implement. This was enabled by the SciVerse Applications platform. 8
  • 9. Users can link through to the main record for the dataset on PANGAEA. One thing I would like to mention here is that there is also a DOI for the dataset. This was done through DataCite. 9
  • 10. So what is DataCite and why is it important? It is also very important for creating links to data in repositories. 10
  • 11. Takeaway points: International DOI Foundation enables CrossRef to give out DOIs. DataCite roughly equivalent to CrossRef. Learn more at the DataCite website. A central institution in Serbia might want to become a Member Institute. 11
  • 12. So those were examples of linking to whole datasets and displaying them in new and interesting ways. Next to discuss is linking to entities. 12
  • 13. Traditional linking involves an author marking up an entity such as a protein so that it can be easily linked to additional information about that entity in a different database. While this is useful, it is not what I wish to share with you today. Why make a user follow a link when… 13
  • 14. You can now embed a 3D interactive model of the protein directly in context in the article. In this example the PDB Protein Viewer is embedded directly in the article. 14
  • 15. In this example an author adds key structures to the article and they are then embedded using Reaxys information and software. 15
  • 16. 16
  • 17. The last examples still required an Author to manually mark up entities. Through text analysis and mining, this is no longer always necessary. 17
  • 18. In this example, our partner NextBio automatically recognizes entities in the text of the article. Easily extendable to new / other entities Works retrospectively on older content Does create recall / precision errors 18
  • 19. Not only can it display them in the sidebar, but the application framework enables adding links to the entities in the text on the fly. 19
  • 20. A reader can then click those links for additional information form multiple databases. 20
  • 21. 1. Colours & tags genes, proteins, molecule names 2. Clicking shows a summary of features for the term (ie: sequence or 2D structure) 3. User can click on links in the pop-up leading out to more information 21
  • 22. 22
  • 23. * To summarize, we started with very traditional linking of datasets where an author submits the dataset with the article. One example of how this can be improved was the Interactive map viewer that displays supplementary KML files rather than simple attaching the files to the article. * Next we discussed automated linking to datasets. This included the example of searching PANGAEA APIs for related datasets and then displaying the locations the data was collected. This will be driven by new standards such as DataCite. * Third, authors manually mark up entities that can be linked to in other databases. Now it is possible to embed content from other databases using APIs. * Last, is totally automated entity recognition using text analysis and mining, Again, information from third party databases can be embedded directly in the article itself. * While I haven’t spoken too much about the technologies enabling these new ways of linking articles to data, one example is the SciVerse Application Framework, which now enables all of the examples discussed today. http://www.applications.sciverse.com/action/userhome 23
  • 24. I would like to close with the same questions I opened with. Thank you. 24

Notas do Editor

  1. Title: Connecting Publications & Data: Raising visibility of local data collections through linking with international publication databases   Abstract: Connecting locally hosted data repositories to internationally hosted related articles has never been easier. With APIs and other web services becoming standardized at the same time that new linking standards, such as Datacite DOIs, are being adopted, new ways to distribute and mashup content are now possible. This presentation will explore emerging trends in linking scholarly literature to data. Both entity linking and data linking will be discussed. Examples will be presented demonstrating how these technologies are being employed by publishers and A&I vendors in cooperation with local data repositories. __________________________________________ Before I get started, I would like to take a minute to set some expectations for this talk. The examples used will primarily be about hard sciences, my challenge to you is to figure out how to apply these technologies and methods to the digital humanities.
  2. This is a theoretical framework for looking at the different ways that publications can be connected to data. This is also the agenda for the talk. I will first speak about the top left quadrant and then work my way to the bottom right. This means starting from the easiest to apply to the humanities and working through to the hardest.
  3. This quadrant is primarily about publications to supplemental data.
  4. Supplemental data submitted as a file with an article is the traditional way. It has its place, but that is not what I am talking about today.
  5. Instead, new tools now enable display and direct manipulation of data in new and interesting ways. This example is an application that displays KML files on a Google Map: http://www.applications.sciverse.com/action/appDetail/298231?zone=main&pageOrigin=appGallery&activity=display
  6. Next on the agenda is automating the connection between publications and whole supplementary or related datasets.
  7. One example of this is the PANGAEA app which searches PANGAEA apis by article DOI and retrieves the coordinates of where supplementary data was collected and then charts these on a Google map displayed directly on the ScienceDirect article page.
  8. This also works on Scopus record pages (so for lot’s of publishers and journals). From deciding to put it on Scopus as well it took less than 24 hours for the PANGAEA developer to implement. This was enabled by the SciVerse Applications platform.
  9. Users can link through to the main record for the dataset on PANGAEA. One thing I would like to mention here is that there is also a DOI for the dataset. This was done through DataCite.
  10. So what is DataCite and why is it important? It is also very important for creating links to data in repositories.
  11. Takeaway points: International DOI Foundation enables CrossRef to give out DOIs. DataCite roughly equivalent to CrossRef. Learn more at the DataCite website. A central institution in Serbia might want to become a Member Institute.
  12. So those were examples of linking to whole datasets and displaying them in new and interesting ways. Next to discuss is linking to entities.
  13. Traditional linking involves an author marking up an entity such as a protein so that it can be easily linked to additional information about that entity in a different database. While this is useful, it is not what I wish to share with you today. Why make a user follow a link when…
  14. You can now embed a 3D interactive model of the protein directly in context in the article. In this example the PDB Protein Viewer is embedded directly in the article.
  15. In this example an author adds key structures to the article and they are then embedded using Reaxys information and software.
  16. The last examples still required an Author to manually mark up entities. Through text analysis and mining, this is no longer always necessary.
  17. In this example, our partner NextBio automatically recognizes entities in the text of the article. Easily extendable to new / other entities Works retrospectively on older content Does create recall / precision errors
  18. Not only can it display them in the sidebar, but the application framework enables adding links to the entities in the text on the fly.
  19. A reader can then click those links for additional information form multiple databases.
  20. Colours & tags genes, proteins, molecule names Clicking shows a summary of features for the term (ie: sequence or 2D structure) User can click on links in the pop-up leading out to more information
  21. Colours & tags genes, proteins, molecule names Clicking shows a summary of features for the term (ie: sequence or 2D structure) User can click on links in the pop-up leading out to more information
  22. To summarize, we started with very traditional linking of datasets where an author submits the dataset with the article. One example of how this can be improved was the Interactive map viewer that displays supplementary KML files rather than simple attaching the files to the article. Next we discussed automated linking to datasets. This included the example of searching PANGAEA APIs for related datasets and then displaying the locations the data was collected. This will be driven by new standards such as DataCite. Third, authors manually mark up entities that can be linked to in other databases. Now it is possible to embed content from other databases using APIs. Last, is totally automated entity recognition using text analysis and mining, Again, information from third party databases can be embedded directly in the article itself. While I haven’t spoken too much about the technologies enabling these new ways of linking articles to data, one example is the SciVerse Application Framework, which now enables all of the examples discussed today. http://www.applications.sciverse.com/action/userhome
  23. I would like to close with the same questions I opened with. Thank you.