SlideShare uma empresa Scribd logo
1 de 15
Describing Theses and Dissertations Using 
Schema.org 
Project Report Presentation and Update 
DCMI 2014 Austin, Texas 
October 10, 2014 
Jeff Mixter - OCLC Research 
Patrick OBrien - Montana State Univeristy 
Kenning Arlitsch - Montana State University
Background 
• This project is based on an IMLS Grant that 
Kenning and Patrick were awarded in 2010 
• Initial scope was to improve indexing and 
visibility of digital collections in Search Engines 
• Since the release of Schema.org in 2011 the 
scope has expanded to include modeling IR 
material in a way that make them more visible to 
traditional search engines 
2
Schema.org 
• Released in 2011 by Bing, Google, Yahoo and 
Yandex 
• Lingua franca for describing things on the web 
• W3C Working Group SchemaBibExtend was 
created to help make bibliographic 
recommendations and suggestions to 
Schema.org 
3
Data Sample 
• 1,909 DC records from the Montana State 
University ScholarWorks IR 
• They had already undergone extensive 
metadata clean-up 
4
Data Model 
• Started with Schema.org as the base 
• We created an extension vocabulary using the 
same mechanics and conventions used in 
Schema.org 
– RDFS vocabulary 
– It is published as RDFa 
– http://purl.org/montana-state/library/ 
5
6 
schema: http://schema.org/ 
dcterms: http://purl.org/dc/terms/ 
mont: http://purl.org/montana-state/library/
Classes 
• There was a need to add more specificity to the 
existing Creative Work branch classes 
– Mont:Thesis 
– Mont:Concept 
• There was also a need to describe entities 
unique to IRs and Universities that are not 
covered in Schema.org’s current vocabulary 
– Mont:InstitutionalRepository 
– Mont:AcademicDepartment 
7
Properties 
• Create more granular relationships between 
classes 
– Mont:committeeMember 
• Describe important attributes of Theses and 
Dissertations that were not included in 
Schema.org* 
– Mont:firstPage** 
• Highlight and model unique relationships that 
were otherwise locked in the metadata records 
– Mont:advisor 
* Schema.org underwent an update following the publication of the project report 
** This property has since been replaced by the schema:pageStart 
8
• Inferring additional 
information from the 
record 
• This has the potential of 
allowing Universities to 
aggregate a large amount 
of data about Academic 
Output and use it for 
reviews/marketing 
• This highlights the idea of 
developing a graph of 
university entities 
9
Process Model 
• Data was loaded into OpenRefine 
• Data was reconciled against Dbpedia.org, LCSH 
and VIAF 
– Matching was made easier by the specific metadata 
fields that the records used 
– dc:subjects.lcsh matched 78% 
• Generated our own internal URIs*** 
*** The URI pattern for the current production data differs from that used in the 
example data presented in the project report 
10
Syndication of RDF data 
• Data from three records was published online along 
with an HTML page that described all of the entities 
referenced in the CBDs 
– Serialized at RDFa 
• Since then we have loaded all 1,909 RDF 
descriptions back into the ScholarWorks repository 
and tweaked the Dspace instance to pull over and 
display JSON-ld data 
• All newly created entities are loaded into a Triple 
Store with a Pubby front end 
– http://54.191.234.158:8081/resource/department/7 
11
Google Webmaster Tools 
12
Next Steps 
• Setup a more production ready Pubby interface 
• Make modifications to the ScholarWorks 
structured data 
• Make libraries visible on the Web 
– Build the presence of the library and its sub-organizations 
on the Semantic Web 
– Kenning, A., OBrien, P., Clark, J. A., Young, S. W. H. 
& Rossmann, D. (2014). Demonstrating Library Value 
at Network Scale: Leveraging the Semantic Web With 
New Knowledge Work. Journal of Library 
Administration, 54(5), 413-425. 
13
Questions? 
14
Thank You! 
Jeff Mixter 
mixterj@oclc.org 
@JeffMixter 
614-761-5159 
©2014 OCLC. This work is licensed under a Creative Commons Attribution 3.0 Unported License. Suggested attribution: “This 
work uses content from [presentation title] © OCLC, used under a Creative Commons Attribution license: 
http://creativecommons.org/licenses/by/3.0/”

Mais conteúdo relacionado

Mais procurados

What business are we in?
What business are we in?What business are we in?
What business are we in?lisld
 
Working collaboratively: scaling infrastructure, services, learning and innov...
Working collaboratively: scaling infrastructure, services, learning and innov...Working collaboratively: scaling infrastructure, services, learning and innov...
Working collaboratively: scaling infrastructure, services, learning and innov...lisld
 
Towards collaboration at scale: Libraries, the social and the technical
Towards collaboration at scale:  Libraries, the social and the technicalTowards collaboration at scale:  Libraries, the social and the technical
Towards collaboration at scale: Libraries, the social and the technicallisld
 
Library discovery: past, present and some futures
Library discovery: past, present and some futuresLibrary discovery: past, present and some futures
Library discovery: past, present and some futureslisld
 
Collection Directions: Some Reflections on Libraries and Stewardship of the ...
 Collection Directions: Some Reflections on Libraries and Stewardship of the ... Collection Directions: Some Reflections on Libraries and Stewardship of the ...
Collection Directions: Some Reflections on Libraries and Stewardship of the ...OCLC
 
Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Janifer Gatenby
 
Thinking about technology .... differently
Thinking about technology .... differentlyThinking about technology .... differently
Thinking about technology .... differentlylisld
 
Cloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructureCloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructureOCLC Research
 
The research library: scalable efficiency and scalable learning
The research library: scalable efficiency and scalable learningThe research library: scalable efficiency and scalable learning
The research library: scalable efficiency and scalable learninglisld
 
Infrastructure, engagement, innovation: library directions
Infrastructure, engagement, innovation: library directionsInfrastructure, engagement, innovation: library directions
Infrastructure, engagement, innovation: library directionslisld
 
The Library in the Life of the User: Two Collection Directions
The Library in the Life of the User: Two Collection DirectionsThe Library in the Life of the User: Two Collection Directions
The Library in the Life of the User: Two Collection Directionslisld
 
Library collections and the emerging scholarly record
Library collections and the emerging scholarly recordLibrary collections and the emerging scholarly record
Library collections and the emerging scholarly recordlisld
 
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...Charleston Conference
 
Challenges and opportunities for academic libraries
Challenges and opportunities for academic librariesChallenges and opportunities for academic libraries
Challenges and opportunities for academic librarieslisld
 
Virtual Research Networks : Towards Research 2.0
Virtual Research Networks : Towards Research 2.0Virtual Research Networks : Towards Research 2.0
Virtual Research Networks : Towards Research 2.0Guus van den Brekel
 
The library in the life of the user
The library in the life of the userThe library in the life of the user
The library in the life of the userlisld
 
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI PresentationOpen Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentationekansa
 
The facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environmentThe facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environmentlisld
 
The Inside Out Library.
The Inside Out Library. The Inside Out Library.
The Inside Out Library. lisld
 
OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC
 

Mais procurados (20)

What business are we in?
What business are we in?What business are we in?
What business are we in?
 
Working collaboratively: scaling infrastructure, services, learning and innov...
Working collaboratively: scaling infrastructure, services, learning and innov...Working collaboratively: scaling infrastructure, services, learning and innov...
Working collaboratively: scaling infrastructure, services, learning and innov...
 
Towards collaboration at scale: Libraries, the social and the technical
Towards collaboration at scale:  Libraries, the social and the technicalTowards collaboration at scale:  Libraries, the social and the technical
Towards collaboration at scale: Libraries, the social and the technical
 
Library discovery: past, present and some futures
Library discovery: past, present and some futuresLibrary discovery: past, present and some futures
Library discovery: past, present and some futures
 
Collection Directions: Some Reflections on Libraries and Stewardship of the ...
 Collection Directions: Some Reflections on Libraries and Stewardship of the ... Collection Directions: Some Reflections on Libraries and Stewardship of the ...
Collection Directions: Some Reflections on Libraries and Stewardship of the ...
 
Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19
 
Thinking about technology .... differently
Thinking about technology .... differentlyThinking about technology .... differently
Thinking about technology .... differently
 
Cloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructureCloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructure
 
The research library: scalable efficiency and scalable learning
The research library: scalable efficiency and scalable learningThe research library: scalable efficiency and scalable learning
The research library: scalable efficiency and scalable learning
 
Infrastructure, engagement, innovation: library directions
Infrastructure, engagement, innovation: library directionsInfrastructure, engagement, innovation: library directions
Infrastructure, engagement, innovation: library directions
 
The Library in the Life of the User: Two Collection Directions
The Library in the Life of the User: Two Collection DirectionsThe Library in the Life of the User: Two Collection Directions
The Library in the Life of the User: Two Collection Directions
 
Library collections and the emerging scholarly record
Library collections and the emerging scholarly recordLibrary collections and the emerging scholarly record
Library collections and the emerging scholarly record
 
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
 
Challenges and opportunities for academic libraries
Challenges and opportunities for academic librariesChallenges and opportunities for academic libraries
Challenges and opportunities for academic libraries
 
Virtual Research Networks : Towards Research 2.0
Virtual Research Networks : Towards Research 2.0Virtual Research Networks : Towards Research 2.0
Virtual Research Networks : Towards Research 2.0
 
The library in the life of the user
The library in the life of the userThe library in the life of the user
The library in the life of the user
 
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI PresentationOpen Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
 
The facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environmentThe facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environment
 
The Inside Out Library.
The Inside Out Library. The Inside Out Library.
The Inside Out Library.
 
OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.
 

Semelhante a Describing Theses and Dissertations Using Schema.org

The Web of data and web data commons
The Web of data and web data commonsThe Web of data and web data commons
The Web of data and web data commonsJesse Wang
 
Marc and beyond: 3 Linked Data Choices
 Marc and beyond: 3 Linked Data Choices  Marc and beyond: 3 Linked Data Choices
Marc and beyond: 3 Linked Data Choices Richard Wallis
 
The Canadian Linked Data Initiative: Charting a Path to a Linked Data Future
The Canadian Linked Data Initiative: Charting a Path to a Linked Data FutureThe Canadian Linked Data Initiative: Charting a Path to a Linked Data Future
The Canadian Linked Data Initiative: Charting a Path to a Linked Data FutureNASIG
 
Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...rmacneil88
 
Elns and repositories, American Chemical Society, Dallas, March 2014
Elns and repositories, American Chemical Society, Dallas, March 2014Elns and repositories, American Chemical Society, Dallas, March 2014
Elns and repositories, American Chemical Society, Dallas, March 2014ResearchSpace
 
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...DuraSpace
 
Southwickc lampert lodlam_training
Southwickc lampert lodlam_trainingSouthwickc lampert lodlam_training
Southwickc lampert lodlam_trainingssouthwick
 
NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)Christine Stohn
 
Duraspace Hot Topics Series 6: Metadata and Repository Services
Duraspace Hot Topics Series 6: Metadata and Repository ServicesDuraspace Hot Topics Series 6: Metadata and Repository Services
Duraspace Hot Topics Series 6: Metadata and Repository ServicesMatthew Critchlow
 
Hide the Stack: Toward Usable Linked Data
Hide the Stack:Toward Usable Linked DataHide the Stack:Toward Usable Linked Data
Hide the Stack: Toward Usable Linked Dataaba-sah
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Anja Jentzsch
 
Establishing the Connection: Creating a Linked Data Version of the BNB
Establishing the Connection: Creating a Linked Data Version of the BNBEstablishing the Connection: Creating a Linked Data Version of the BNB
Establishing the Connection: Creating a Linked Data Version of the BNBnw13
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so farEnrico Daga
 
The Digital Library Federation Aquifer Initiative
The Digital Library Federation Aquifer InitiativeThe Digital Library Federation Aquifer Initiative
The Digital Library Federation Aquifer InitiativeJenn Riley
 
Using Linked Data Resources to generate web pages based on a BBC case study
Using Linked Data Resources to generate web pages based on a BBC case studyUsing Linked Data Resources to generate web pages based on a BBC case study
Using Linked Data Resources to generate web pages based on a BBC case studyLeila Zemmouchi-Ghomari
 
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13DataDryad
 
Linked Energy Data Generation
Linked Energy Data GenerationLinked Energy Data Generation
Linked Energy Data GenerationFilip Radulovic
 

Semelhante a Describing Theses and Dissertations Using Schema.org (20)

The Web of data and web data commons
The Web of data and web data commonsThe Web of data and web data commons
The Web of data and web data commons
 
Marc and beyond: 3 Linked Data Choices
 Marc and beyond: 3 Linked Data Choices  Marc and beyond: 3 Linked Data Choices
Marc and beyond: 3 Linked Data Choices
 
The Canadian Linked Data Initiative: Charting a Path to a Linked Data Future
The Canadian Linked Data Initiative: Charting a Path to a Linked Data FutureThe Canadian Linked Data Initiative: Charting a Path to a Linked Data Future
The Canadian Linked Data Initiative: Charting a Path to a Linked Data Future
 
NISO Webinar: Library Linked Data: From Vision to Reality
NISO Webinar: Library Linked Data: From Vision to RealityNISO Webinar: Library Linked Data: From Vision to Reality
NISO Webinar: Library Linked Data: From Vision to Reality
 
Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...
 
Elns and repositories, American Chemical Society, Dallas, March 2014
Elns and repositories, American Chemical Society, Dallas, March 2014Elns and repositories, American Chemical Society, Dallas, March 2014
Elns and repositories, American Chemical Society, Dallas, March 2014
 
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
 
Southwickc lampert lodlam_training
Southwickc lampert lodlam_trainingSouthwickc lampert lodlam_training
Southwickc lampert lodlam_training
 
NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)
 
Duraspace Hot Topics Series 6: Metadata and Repository Services
Duraspace Hot Topics Series 6: Metadata and Repository ServicesDuraspace Hot Topics Series 6: Metadata and Repository Services
Duraspace Hot Topics Series 6: Metadata and Repository Services
 
Hide the Stack: Toward Usable Linked Data
Hide the Stack:Toward Usable Linked DataHide the Stack:Toward Usable Linked Data
Hide the Stack: Toward Usable Linked Data
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)
 
Establishing the Connection: Creating a Linked Data Version of the BNB
Establishing the Connection: Creating a Linked Data Version of the BNBEstablishing the Connection: Creating a Linked Data Version of the BNB
Establishing the Connection: Creating a Linked Data Version of the BNB
 
Open University Data
Open University DataOpen University Data
Open University Data
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so far
 
The Digital Library Federation Aquifer Initiative
The Digital Library Federation Aquifer InitiativeThe Digital Library Federation Aquifer Initiative
The Digital Library Federation Aquifer Initiative
 
An A+ Plan to Transform Your Library with Linked Data
An A+ Plan to Transform Your Library with Linked DataAn A+ Plan to Transform Your Library with Linked Data
An A+ Plan to Transform Your Library with Linked Data
 
Using Linked Data Resources to generate web pages based on a BBC case study
Using Linked Data Resources to generate web pages based on a BBC case studyUsing Linked Data Resources to generate web pages based on a BBC case study
Using Linked Data Resources to generate web pages based on a BBC case study
 
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
 
Linked Energy Data Generation
Linked Energy Data GenerationLinked Energy Data Generation
Linked Energy Data Generation
 

Mais de OCLC

Communicating library impact beyond library walls: Findings from an action-or...
Communicating library impact beyond library walls: Findings from an action-or...Communicating library impact beyond library walls: Findings from an action-or...
Communicating library impact beyond library walls: Findings from an action-or...OCLC
 
"You can just tell whether a website looks reliable or not." People's modes o...
"You can just tell whether a website looks reliable or not." People's modes o..."You can just tell whether a website looks reliable or not." People's modes o...
"You can just tell whether a website looks reliable or not." People's modes o...OCLC
 
Factors influencing research data management programs.
Factors influencing research data management programs.Factors influencing research data management programs.
Factors influencing research data management programs.OCLC
 
Teaching research methods in LIS programs: Approaches, formats, and innovativ...
Teaching research methods in LIS programs: Approaches, formats, and innovativ...Teaching research methods in LIS programs: Approaches, formats, and innovativ...
Teaching research methods in LIS programs: Approaches, formats, and innovativ...OCLC
 
OCLC ALISE Library & Information Science Research Grant Program
OCLC ALISE Library & Information Science Research Grant ProgramOCLC ALISE Library & Information Science Research Grant Program
OCLC ALISE Library & Information Science Research Grant ProgramOCLC
 
Investing in library users and potential users: The Many Faces of Digital Vi...
 Investing in library users and potential users: The Many Faces of Digital Vi... Investing in library users and potential users: The Many Faces of Digital Vi...
Investing in library users and potential users: The Many Faces of Digital Vi...OCLC
 
Academic library impact: Improving practice and essential areas to research
Academic library impact: Improving practice and essential areas to researchAcademic library impact: Improving practice and essential areas to research
Academic library impact: Improving practice and essential areas to researchOCLC
 
Studying information behavior: The Many Faces of Digital Visitors and Residents
Studying information behavior: The Many Faces of Digital Visitors and ResidentsStudying information behavior: The Many Faces of Digital Visitors and Residents
Studying information behavior: The Many Faces of Digital Visitors and ResidentsOCLC
 
Online engagement and information literacy: The Many Face of Digital Visitors...
Online engagement and information literacy: The Many Face of Digital Visitors...Online engagement and information literacy: The Many Face of Digital Visitors...
Online engagement and information literacy: The Many Face of Digital Visitors...OCLC
 
People's mode of online engagement: The Many Faces of Digital Visitors and R...
 People's mode of online engagement: The Many Faces of Digital Visitors and R... People's mode of online engagement: The Many Faces of Digital Visitors and R...
People's mode of online engagement: The Many Faces of Digital Visitors and R...OCLC
 
Applying research methods: Investigating the Many Faces of Digital Visitors &...
Applying research methods: Investigating the Many Faces of Digital Visitors &...Applying research methods: Investigating the Many Faces of Digital Visitors &...
Applying research methods: Investigating the Many Faces of Digital Visitors &...OCLC
 
OCLC RLP @ RLUK
OCLC RLP @ RLUKOCLC RLP @ RLUK
OCLC RLP @ RLUKOCLC
 
Using Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive WorkshopUsing Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive WorkshopOCLC
 
Visitors and Residents: The Hows and Whys of Engagement with Technology
Visitors and Residents: The Hows and Whys of Engagement with TechnologyVisitors and Residents: The Hows and Whys of Engagement with Technology
Visitors and Residents: The Hows and Whys of Engagement with TechnologyOCLC
 
Action-Oriented Research Agenda on Library Contributions to Student Learning ...
Action-Oriented Research Agenda on Library Contributions to Student Learning ...Action-Oriented Research Agenda on Library Contributions to Student Learning ...
Action-Oriented Research Agenda on Library Contributions to Student Learning ...OCLC
 
Visitors and Residents: Interactive Mapping Exercise Workshop
Visitors and Residents: Interactive Mapping Exercise WorkshopVisitors and Residents: Interactive Mapping Exercise Workshop
Visitors and Residents: Interactive Mapping Exercise WorkshopOCLC
 
The Library in the Life of the User
The Library in the Life of the UserThe Library in the Life of the User
The Library in the Life of the UserOCLC
 
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...OCLC
 
Changing Tack: A Future-Focused ACRL Research Agenda
Changing Tack: A Future-Focused ACRL Research AgendaChanging Tack: A Future-Focused ACRL Research Agenda
Changing Tack: A Future-Focused ACRL Research AgendaOCLC
 
Qualitative Research Methods in LIS
Qualitative Research Methods in LISQualitative Research Methods in LIS
Qualitative Research Methods in LISOCLC
 

Mais de OCLC (20)

Communicating library impact beyond library walls: Findings from an action-or...
Communicating library impact beyond library walls: Findings from an action-or...Communicating library impact beyond library walls: Findings from an action-or...
Communicating library impact beyond library walls: Findings from an action-or...
 
"You can just tell whether a website looks reliable or not." People's modes o...
"You can just tell whether a website looks reliable or not." People's modes o..."You can just tell whether a website looks reliable or not." People's modes o...
"You can just tell whether a website looks reliable or not." People's modes o...
 
Factors influencing research data management programs.
Factors influencing research data management programs.Factors influencing research data management programs.
Factors influencing research data management programs.
 
Teaching research methods in LIS programs: Approaches, formats, and innovativ...
Teaching research methods in LIS programs: Approaches, formats, and innovativ...Teaching research methods in LIS programs: Approaches, formats, and innovativ...
Teaching research methods in LIS programs: Approaches, formats, and innovativ...
 
OCLC ALISE Library & Information Science Research Grant Program
OCLC ALISE Library & Information Science Research Grant ProgramOCLC ALISE Library & Information Science Research Grant Program
OCLC ALISE Library & Information Science Research Grant Program
 
Investing in library users and potential users: The Many Faces of Digital Vi...
 Investing in library users and potential users: The Many Faces of Digital Vi... Investing in library users and potential users: The Many Faces of Digital Vi...
Investing in library users and potential users: The Many Faces of Digital Vi...
 
Academic library impact: Improving practice and essential areas to research
Academic library impact: Improving practice and essential areas to researchAcademic library impact: Improving practice and essential areas to research
Academic library impact: Improving practice and essential areas to research
 
Studying information behavior: The Many Faces of Digital Visitors and Residents
Studying information behavior: The Many Faces of Digital Visitors and ResidentsStudying information behavior: The Many Faces of Digital Visitors and Residents
Studying information behavior: The Many Faces of Digital Visitors and Residents
 
Online engagement and information literacy: The Many Face of Digital Visitors...
Online engagement and information literacy: The Many Face of Digital Visitors...Online engagement and information literacy: The Many Face of Digital Visitors...
Online engagement and information literacy: The Many Face of Digital Visitors...
 
People's mode of online engagement: The Many Faces of Digital Visitors and R...
 People's mode of online engagement: The Many Faces of Digital Visitors and R... People's mode of online engagement: The Many Faces of Digital Visitors and R...
People's mode of online engagement: The Many Faces of Digital Visitors and R...
 
Applying research methods: Investigating the Many Faces of Digital Visitors &...
Applying research methods: Investigating the Many Faces of Digital Visitors &...Applying research methods: Investigating the Many Faces of Digital Visitors &...
Applying research methods: Investigating the Many Faces of Digital Visitors &...
 
OCLC RLP @ RLUK
OCLC RLP @ RLUKOCLC RLP @ RLUK
OCLC RLP @ RLUK
 
Using Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive WorkshopUsing Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive Workshop
 
Visitors and Residents: The Hows and Whys of Engagement with Technology
Visitors and Residents: The Hows and Whys of Engagement with TechnologyVisitors and Residents: The Hows and Whys of Engagement with Technology
Visitors and Residents: The Hows and Whys of Engagement with Technology
 
Action-Oriented Research Agenda on Library Contributions to Student Learning ...
Action-Oriented Research Agenda on Library Contributions to Student Learning ...Action-Oriented Research Agenda on Library Contributions to Student Learning ...
Action-Oriented Research Agenda on Library Contributions to Student Learning ...
 
Visitors and Residents: Interactive Mapping Exercise Workshop
Visitors and Residents: Interactive Mapping Exercise WorkshopVisitors and Residents: Interactive Mapping Exercise Workshop
Visitors and Residents: Interactive Mapping Exercise Workshop
 
The Library in the Life of the User
The Library in the Life of the UserThe Library in the Life of the User
The Library in the Life of the User
 
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...
 
Changing Tack: A Future-Focused ACRL Research Agenda
Changing Tack: A Future-Focused ACRL Research AgendaChanging Tack: A Future-Focused ACRL Research Agenda
Changing Tack: A Future-Focused ACRL Research Agenda
 
Qualitative Research Methods in LIS
Qualitative Research Methods in LISQualitative Research Methods in LIS
Qualitative Research Methods in LIS
 

Último

Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxMaryGraceBautista27
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 

Último (20)

LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptx
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 

Describing Theses and Dissertations Using Schema.org

  • 1. Describing Theses and Dissertations Using Schema.org Project Report Presentation and Update DCMI 2014 Austin, Texas October 10, 2014 Jeff Mixter - OCLC Research Patrick OBrien - Montana State Univeristy Kenning Arlitsch - Montana State University
  • 2. Background • This project is based on an IMLS Grant that Kenning and Patrick were awarded in 2010 • Initial scope was to improve indexing and visibility of digital collections in Search Engines • Since the release of Schema.org in 2011 the scope has expanded to include modeling IR material in a way that make them more visible to traditional search engines 2
  • 3. Schema.org • Released in 2011 by Bing, Google, Yahoo and Yandex • Lingua franca for describing things on the web • W3C Working Group SchemaBibExtend was created to help make bibliographic recommendations and suggestions to Schema.org 3
  • 4. Data Sample • 1,909 DC records from the Montana State University ScholarWorks IR • They had already undergone extensive metadata clean-up 4
  • 5. Data Model • Started with Schema.org as the base • We created an extension vocabulary using the same mechanics and conventions used in Schema.org – RDFS vocabulary – It is published as RDFa – http://purl.org/montana-state/library/ 5
  • 6. 6 schema: http://schema.org/ dcterms: http://purl.org/dc/terms/ mont: http://purl.org/montana-state/library/
  • 7. Classes • There was a need to add more specificity to the existing Creative Work branch classes – Mont:Thesis – Mont:Concept • There was also a need to describe entities unique to IRs and Universities that are not covered in Schema.org’s current vocabulary – Mont:InstitutionalRepository – Mont:AcademicDepartment 7
  • 8. Properties • Create more granular relationships between classes – Mont:committeeMember • Describe important attributes of Theses and Dissertations that were not included in Schema.org* – Mont:firstPage** • Highlight and model unique relationships that were otherwise locked in the metadata records – Mont:advisor * Schema.org underwent an update following the publication of the project report ** This property has since been replaced by the schema:pageStart 8
  • 9. • Inferring additional information from the record • This has the potential of allowing Universities to aggregate a large amount of data about Academic Output and use it for reviews/marketing • This highlights the idea of developing a graph of university entities 9
  • 10. Process Model • Data was loaded into OpenRefine • Data was reconciled against Dbpedia.org, LCSH and VIAF – Matching was made easier by the specific metadata fields that the records used – dc:subjects.lcsh matched 78% • Generated our own internal URIs*** *** The URI pattern for the current production data differs from that used in the example data presented in the project report 10
  • 11. Syndication of RDF data • Data from three records was published online along with an HTML page that described all of the entities referenced in the CBDs – Serialized at RDFa • Since then we have loaded all 1,909 RDF descriptions back into the ScholarWorks repository and tweaked the Dspace instance to pull over and display JSON-ld data • All newly created entities are loaded into a Triple Store with a Pubby front end – http://54.191.234.158:8081/resource/department/7 11
  • 13. Next Steps • Setup a more production ready Pubby interface • Make modifications to the ScholarWorks structured data • Make libraries visible on the Web – Build the presence of the library and its sub-organizations on the Semantic Web – Kenning, A., OBrien, P., Clark, J. A., Young, S. W. H. & Rossmann, D. (2014). Demonstrating Library Value at Network Scale: Leveraging the Semantic Web With New Knowledge Work. Journal of Library Administration, 54(5), 413-425. 13
  • 15. Thank You! Jeff Mixter mixterj@oclc.org @JeffMixter 614-761-5159 ©2014 OCLC. This work is licensed under a Creative Commons Attribution 3.0 Unported License. Suggested attribution: “This work uses content from [presentation title] © OCLC, used under a Creative Commons Attribution license: http://creativecommons.org/licenses/by/3.0/”

Notas do Editor

  1. My name is Jeff Mixter and I work as a Research Support Specialist at OCLC. My colleagues on this IMLS grant funded project are Kenning Arlitsch, Dean of Montana State University Libraries, and Patrick Obrien, Semantic Web Researcher at Montana State University Library. Kenning and Patrick are not able to attend the conference but we would all like to thank DCMI for inviting me to present today.
  2. To start out I would like to provide a bit of background information about the project. As mentioned this project is part of an IMLS funded grant project entitled “Getting Found” that Kenning and Patrick were awarded in 2010. The initial scope of the project was to help improve the indexing and visibility of digital collections in search engines. Since the release of Schema.org in 2011, the project has expanded to include modeling Institutional Repository materials, specifically Theses and Dissertations, using terms that are understood and consumed by the major search engines.
  3. I am sure that everyone here is aware of what Schema.org is but just to give the 10,000 foot view of  the vocabulary, here are a few important facts. Of particular importance to the library community is the W3C SchemaBibExtend working group that works to propose extension terms to Schema.org in order to allow for better description of bibliographic materials.
  4. The data sample that we used for this project was taken from the Montana State University ScholarWorks database. It included 1,909 student theses and dissertations records cataloged using qualified Dublin Core. It should be noted that the metadata underwent rigorous clean-up prior to this study. This will be important when I talk about the process model that we used for converting the data into RDF.
  5. The initial phase of the work involved developing a data model for the sample set. We chose to use Schema.org as the primary vocabulary for the project. In order to describe in adequate detail the theses and dissertations, we needed to develop a set of extension terms that could be used to further refine the terms already in Schema.org. This set of terms was published using the same conventions as the Schema.org vocabulary. Primarily it was published as an RDFS vocabulary.
  6. This is a general conception of the model that we developed for the theses and dissertations.
  7. As we were developing the set of extension classes, they were grouped into two general categories. The first was those that were created to add more specificity to the existing CreativeWork branch Schema.org terms. For example, we wanted to describe an item as being not only a schema:Creative Work but more specifically a Thesis. The second category of classes were those that in essence did not exist within the Schema.org vocabulary. The Institutional Repository class was one such example. Although there are schema:Organization and schema:LocalBusiness classes (although some people might argue that an IR is not a 'business' per  se), we felt that that the IR class was unique enough to warrant its inclusion in this category. That being said, we still positioned the IR class as a sub-class of schema:Organization. Also one should not read too much into the two categories, we simply used them as a means to help organize the modeling process and to help influence ideas for future work (which I will talk about briefly)
  8. The development of the extension properties were also grouped into two general categories, those that added more granularity to existing schema.org properties and those that were needed (and did not already exist) to adequately describe Theses and Dissertations. So here I have listed mont:committeeMember. This property is listed as a sub-property of the schema:member property. In addition to this committee member relationship we also coined a mont:committeeChair property. So in the end, both properties are directly related to the generic schema:member property but each carries with it unique context and meaning that is important when describing theses and dissertations. As stated on the slides the mont:firstPage property was created to help account/model for all of the existing properties in the original DC metadata. It should be noted that since the model was developed, Schema.org has updated its vocabulary to include firstPage, lastPage properties along with other terms that allow for better descriptions of periodicals.
  9. We found after some of the initial modeling work that we were able to infer a lot of information that would otherwise require human reading and interpretation. The example here illustrates a few of these inferences. This type of information could be of huge potential to universities as a means to better understand the academic output and impact of their faculty staff and students. More specifically this could help university libraries better cement their role as the 'centers for university information and knowledge management'. I should say that while I present this idea as novel, it is, in essence, just developing a graph for university data. Its novelty lies in leveraging the library and library data to do so.
  10. So I have gone on a bit now about the data model, the other major part of this project was developing a process model for creating and publishing the RDF. We began by loading the data in OpenRefine and used its features to reconcile the existing strings to entities in VIAF, LCSH and Dbpedia. Some stats are on the screen so I will not go over them. If there are specific questions about the reconciliation process or results, please feel free to ask me afterwards. This process has been well documented by Ruben Verborgh and Max De Wilde in their book “Using OpenRefine”. This process has also been used in various library projects to convert flat data into RDF. After the reconciliation process, we took all of the remaining 'strings' and created  our own entities from them. This was particularly important for student and faculty names, and university departments, since there are very few repositories that have this type of data in them. With the rise in importance of services such as ISNI and ORCID, is makes a lot of sense for libraries to being to better understand and deal with the unique person entities that exist in their metadata. One could imagine theses entities becoming the type of data that a university library could aggregate and then submit to ISNI or ORCID for bulk upload. It is important to note that we also used OpenRefine to help identify and merge people that had variations in the spelling of their name. Again, this is a topic that I would love to talk about after the presentation if people are interested.
  11. So the final step in the process was syndicating that data. As part of our initial study, we published three sample records as Concise Bounded Descriptions (or CBD) within html pages using RDFa. These were added to the ScholarWorks website. We then published descriptions for all of the referenced entities (that were not reconciled) on an HTML page using RDFa, differentiating each descriptions using hash URIs. Since then we have loaded CDB for all 1,909 sample records back into the Dspace database and are now delivering them using JSON-ld on the HTML pages. All of the newly created entity descriptions are stored in a triple store (Apache Fuseki) that is hosted on an Amazon Web Service (AWS) Elastic Compute Cloud (EC2) instance. We have set up a front end for display of the data in the triple store using Pubby, a simple Linked Data front end for SPARQL endpoints.
  12. This slide shows the amount of structured data that Google Webmaster tools is now harvesting from the Montana State University ScholarWorks website. Note this does not serve as evidence of improved SEO but rather illustrates the initial transition from, what Eric Miller described as, the invisible web to the visible web.
  13. The immediate next steps that we will be working on will be to set up a more 'production-ized' triple store and Pubby interface. This primarily means positioning the services at standard port 80 locations and then making the necessary URI pattern changes in the sample data. We also plan to make some tweaks to the structured data that is being delivered to the Dspace html pages in order to improve how it is consumed and indexed by search engines. We might even abandon JSON-ld in favor of RDFa in order to allow users to 'interact' with the structured data (click on and follow links to external entities). The major next step that we plan to undertake as part of a follow-up grant that we were just awarded from IMLS will be to make libraries more visible on the web. This includes building the presence of libraries and library related organizations on the semantic web. An article recently published by my colleagues at Montana State in the Journal of Library Administration outlines the importance of this work and presents a sample that was done at Montana State University. In addition to making libraries present in the semantic web, this work will also help up in future projects that attempt to convert existing metadata into RDF by creating entity datasets that can be used for reconciliation.
  14. Again, on behalf of everyone on our team, I would like to thank DCMI for the invitation to speak today and invite folks to ask any questions you might have.