SlideShare a Scribd company logo
1 of 27
SSP Annual Conference
June 2013
Jay Henry
Introduction
Web Scale Discovery in brief & why it matters
Metadata – new ruler of the realm
Life Cycle of Metadata – Publisher as Parent
Evangelic Appeal for Standards
Strategies, Tactics & Pitfalls to Avoid
Many terms tossed around…
Federated search, Metasearch, NextGen catalogs, discovery
layers --- and now “Web Scale Discovery Service”
An improved search experience has always been the
motivation behind innovation…
The latest generation of tools are something different.
A Definition
A pre-harvested central index coupled with a richly featured
discovery layer providing a single search across a library’s
local, open access, and subscription collections.
…but it’s more than that
Not Just Another Search
PDA/DDA are purchasing models that were ahead of
technologies ability to properly accommodate. The
acquisition systems developed in conjunction with WSD
represent a logical progression of capabilities
Patron-driven acquisition, or PDA, is not new, but it is on the
rise. Approximately 400 to 600 libraries worldwide have
switched to a patron-driven system for purchasing new
works, and that number is likely to double over the next year
and a half (2012)
Simple Logical Progression
The Players
Content is King?
Metadata is the real ruler of the realm
Using descriptions of content to generate purchase and use
is more important now than ever
So, if we know what the target is, how do we create the best
possible metadata?
The Black Box
The people who know how these systems work aren’t telling
Lifecycle of Metadata
The Basics (More Is Better)
Title
Author
Format
ISBN
Subject categories
Imprint
Link to publisher’s dedicated page
Publication Date
Price
Data = Sales
Titles that meet the BIC Basic standard see average sales 98%
higher than those that don’t meet the standard
Records with complete BIC Basic data but no image have
average sales…of 473% [higher] in comparison to those
records which have neither the complete BIC Basic data
elements or an image.
The difference in average sales between records which…
don’t have enhanced metadata, and records which do…have
enhanced metadata elements is on average over2,600 units,
which represents an increase of almost 700%
Standard Identifiers… please.
How identifiers help
Proper understanding of the customer, whether author,
reader or institution
Provides a simple basis for wider data governance:
Data governance, as defined at Ringgold, is the processes,
policies, standards, organization, and technologies required
to manage and ensure the availability, accessibility, quality,
consistency, auditability, and security of data.
The supply chain
Consortium
Author
Submission
and Peer
Review
System
Publisher
Technology
Partner
Subscription
Agent or
Sales Agent
Fulfilment
House or
System
Library
Discovery
Service
WSDs
End User
Data
Syndication
Targets
Consortium
Societies
FundersCitation
The supply chain
Consortium
Submission
and Peer
Review
System
Technology
Partner
Subscription
Agent or
Sales Agent
Fulfilment
House or
System
End User
Consortium
Societies
FundersCitation
The supply chain using identifiers
Consortium
The supply chain using identifiers
Consortium
The supply chain using identifiers
Consortium
Strategy Suggestions
Create the most complete metadata possible
Distribute widely and efficiently
Adhere to standards
Uniquely describe each manifestation of a work
Develop an internal policy to create uniform data across all
published works
Practical Tactics
Require Authors to establish an ORCID profile
Create links into content, the more specific the better
Develop concise descriptions of content (not jacket copy)
Include as much as practical – e.g. abstracts of chapters are
often written by the authors themselves
Apply unique identifiers to establish longevity of the
metadata (e.g. ORCID, ISBN, ISSN, DOIs Ringgold ID, ISNI)
Evaluate the benefits of working with outside partners to
assist in metadata development, application and syndication
Pitfalls to Avoid
Non-Standardised Naming Conventions
Result: Poorly associated data in the supply chain.
Example 1: Inconsistent author listings, e.g. John Smith, J Smith,
Smith J etc.
Solution: use ORCID numbers
Example 2: Lack of affiliations between authors and institutional
customers.
Solution: use the Ringgold or ISNI number
Example 3: Inability to link author and customer data together.
Solution: use the Ringgold or ISNI number
Pitfalls to Avoid (continued)
Lack of or Inadequate Subject Classifications and Keywords:
Result: Dramatic negative effect the positioning of content in
relevancy rankings in discovery or search services
Example 1: Applying non-standard subject classifications causes a
mismatch against what is expected by libraries or end-users
Solution: Understand the standards and best practices being applied by current
systems and similar publishers; provide information in a form that will most
easily utilized by the systems presenting your data
Example 2: DDA sales are lost because subjects were applied without
using an international standard resulting in poor search results among
international users; cross-discipline keywords lacking entirely e.g.
Football in the US does not mean the same as Football in Europe.
Solution: Adopt an internal policy to adhere to an accepted standard at the core of
subject description, and then expand the description using keywords in the
abstract/summary copy.
Pitfalls to Avoid (continued)
Format and versions:
Result: Confusion within sales and distribution channels
Example 1: Users fail to find a compatible format for the title they
want
Solution: Apply ISBNs correctly – unique identifier for each e-edition
Example 2: Citations are incorrect or inconsistent
Solution: Apply version-specific pagination if appropriate
Example 3: Links to content fail over time
Solution: Apply DOIs to establish a persistent and reliable link
Example 4: Data is not fully utilized/indexed by discovery systems
Solution: Output information in industry standard formats (ONIX)
Pitfalls
Lack of high quality information reduces the likelihood of
content to be discovered.
References
 The Ins and Outs of Evaluating Web-Scale Discovery Services by Athena Hoeppner
http://www.infotoday.com/cilmag/apr12/Hoeppner-Web-Scale-Discovery-Services.shtml
 Stakeholders Strive to Define Standards for Web-Scale Discovery Systems By Michael Kelley on October 11, 2012
http://www.thedigitalshift.com/2012/10/discovery/coming-into-focus-web-scale-discovery-services-face-growing-need-for-best-practices
 White Paper: The Link Between Metadata and Sales By Andre Breedt, Head of Publisher Account Management; David
Walter, Research and Development Analyst, 2012
http://www.isbn.nielsenbook.co.uk/uploads/3971_Nielsen_Metadata_white_paper_A4(3).pdf
 The BIC Basic standards for bibliographic data provision
http://www.bic.org.uk/17/BIC-Basic/
 Web-Scale Discovery in an Academic Health Sciences Library: Development and Implementation of the
EBSCO Discovery Service DOI:10.1080/02763869.2013.749111JoLinda L. Thompsona*
, Kathe S. Obriga
& Laura E. Abatea
Medical Reference Services Quarterly Volume 32, Issue 1, 2013
http://www.tandfonline.com/doi/abs/10.1080/02763869.2013.749111
 Discoverability Challenges and Collaboration Opportunities within the Scholarly Communications Ecosystem:
A SAGE White Paper Update by Mary M. Somerville, University of Colorado Denver;Lettie Y. Conrad, SAGE Collaborative
Librarianship Vol 5, No 1 (2013)
 Affection for PDA By Steve Kolowich 2012 Inside Higher Ed
http://www.insidehighered.com/news/2012/06/20/research-foresees-demand-driven-book-acquisition-replacing-librarians-
discretion#ixzz2VWOAqWoU
Jay Henry
Vice President
Ringgold Inc.
Jay.henry@ringgold.com

More Related Content

What's hot

Information Extraction and Aggregation from Unstructured Web Data for Busines...
Information Extraction and Aggregation from Unstructured Web Data for Busines...Information Extraction and Aggregation from Unstructured Web Data for Busines...
Information Extraction and Aggregation from Unstructured Web Data for Busines...Alexander Michels
 
Role of metadata in transportation agency data programs
Role of metadata in transportation agency data programsRole of metadata in transportation agency data programs
Role of metadata in transportation agency data programsJoseph Busch
 
Marketing Research and Competitive Intelligence
Marketing Research and Competitive IntelligenceMarketing Research and Competitive Intelligence
Marketing Research and Competitive IntelligenceAugust Jackson
 
Metadata strategies for transportation agencies: An information management pe...
Metadata strategies for transportation agencies: An information management pe...Metadata strategies for transportation agencies: An information management pe...
Metadata strategies for transportation agencies: An information management pe...Joseph Busch
 
Dissemination Documentation
Dissemination DocumentationDissemination Documentation
Dissemination Documentationannegrete
 
Building an Enterprise Metadata Repository
Building an Enterprise Metadata RepositoryBuilding an Enterprise Metadata Repository
Building an Enterprise Metadata RepositoryEmbarcadero Technologies
 
Linked Open Data in the World of Patents
Linked Open Data in the World of Patents Linked Open Data in the World of Patents
Linked Open Data in the World of Patents Dr. Haxel Consult
 
Metadata and Analytics
Metadata and AnalyticsMetadata and Analytics
Metadata and Analyticsbrunomase
 
Secondary Research in Applied Marketing Research
Secondary Research in Applied Marketing ResearchSecondary Research in Applied Marketing Research
Secondary Research in Applied Marketing ResearchKelly Page
 
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”VOGIN-academie
 
Graph-based Discovery and Analytics at Enterprise Scale
Graph-based Discovery and Analytics at Enterprise ScaleGraph-based Discovery and Analytics at Enterprise Scale
Graph-based Discovery and Analytics at Enterprise ScaleCambridge Semantics
 
Econ3323 Proquest ABI inform
Econ3323 Proquest ABI informEcon3323 Proquest ABI inform
Econ3323 Proquest ABI informLucia Ravi
 
SLA CI Division Webinar: Using the Internet to Research Private Companies
SLA CI Division Webinar: Using the Internet to Research Private CompaniesSLA CI Division Webinar: Using the Internet to Research Private Companies
SLA CI Division Webinar: Using the Internet to Research Private CompaniesAugust Jackson
 
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...Ringgold Inc
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRMichel Dumontier
 
Introduction to Mergent and DataAnalysis
Introduction to Mergent and DataAnalysisIntroduction to Mergent and DataAnalysis
Introduction to Mergent and DataAnalysisLucia Ravi
 

What's hot (20)

web mining
web miningweb mining
web mining
 
Information Extraction and Aggregation from Unstructured Web Data for Busines...
Information Extraction and Aggregation from Unstructured Web Data for Busines...Information Extraction and Aggregation from Unstructured Web Data for Busines...
Information Extraction and Aggregation from Unstructured Web Data for Busines...
 
Opentext Decisiv
Opentext DecisivOpentext Decisiv
Opentext Decisiv
 
Semantic Technology in Publishing & Finance
Semantic Technology in Publishing & FinanceSemantic Technology in Publishing & Finance
Semantic Technology in Publishing & Finance
 
Role of metadata in transportation agency data programs
Role of metadata in transportation agency data programsRole of metadata in transportation agency data programs
Role of metadata in transportation agency data programs
 
Marketing Research and Competitive Intelligence
Marketing Research and Competitive IntelligenceMarketing Research and Competitive Intelligence
Marketing Research and Competitive Intelligence
 
Metadata strategies for transportation agencies: An information management pe...
Metadata strategies for transportation agencies: An information management pe...Metadata strategies for transportation agencies: An information management pe...
Metadata strategies for transportation agencies: An information management pe...
 
Dissemination Documentation
Dissemination DocumentationDissemination Documentation
Dissemination Documentation
 
Building an Enterprise Metadata Repository
Building an Enterprise Metadata RepositoryBuilding an Enterprise Metadata Repository
Building an Enterprise Metadata Repository
 
Linked Open Data in the World of Patents
Linked Open Data in the World of Patents Linked Open Data in the World of Patents
Linked Open Data in the World of Patents
 
Primary secondarydata
Primary secondarydataPrimary secondarydata
Primary secondarydata
 
Metadata and Analytics
Metadata and AnalyticsMetadata and Analytics
Metadata and Analytics
 
Secondary Research in Applied Marketing Research
Secondary Research in Applied Marketing ResearchSecondary Research in Applied Marketing Research
Secondary Research in Applied Marketing Research
 
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
 
Graph-based Discovery and Analytics at Enterprise Scale
Graph-based Discovery and Analytics at Enterprise ScaleGraph-based Discovery and Analytics at Enterprise Scale
Graph-based Discovery and Analytics at Enterprise Scale
 
Econ3323 Proquest ABI inform
Econ3323 Proquest ABI informEcon3323 Proquest ABI inform
Econ3323 Proquest ABI inform
 
SLA CI Division Webinar: Using the Internet to Research Private Companies
SLA CI Division Webinar: Using the Internet to Research Private CompaniesSLA CI Division Webinar: Using the Internet to Research Private Companies
SLA CI Division Webinar: Using the Internet to Research Private Companies
 
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIR
 
Introduction to Mergent and DataAnalysis
Introduction to Mergent and DataAnalysisIntroduction to Mergent and DataAnalysis
Introduction to Mergent and DataAnalysis
 

Viewers also liked

Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016
Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016
Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016Ringgold Inc
 
Unique Identifiers for Business Partners: progress with ISNI, the Ringgold ID...
Unique Identifiers for Business Partners: progress with ISNI, the Ringgold ID...Unique Identifiers for Business Partners: progress with ISNI, the Ringgold ID...
Unique Identifiers for Business Partners: progress with ISNI, the Ringgold ID...Ringgold Inc
 
Spring Cleaning: Easy Ways to Tidy Your Customer Data
Spring Cleaning: Easy Ways to Tidy Your Customer DataSpring Cleaning: Easy Ways to Tidy Your Customer Data
Spring Cleaning: Easy Ways to Tidy Your Customer DataCharleston Conference
 
Using Data to Drive Discovery of New Scholarly Works
Using Data to Drive Discovery of New Scholarly WorksUsing Data to Drive Discovery of New Scholarly Works
Using Data to Drive Discovery of New Scholarly WorksRinggold Inc
 
Persistent Identifiers - The 5 Things You Need To Know
Persistent Identifiers - The 5 Things You Need To KnowPersistent Identifiers - The 5 Things You Need To Know
Persistent Identifiers - The 5 Things You Need To KnowRinggold Inc
 
Institutional Identifiers - Phil Nicolson at ALPSP 'Setting The Standard' 2015
Institutional Identifiers - Phil Nicolson at ALPSP 'Setting The Standard' 2015Institutional Identifiers - Phil Nicolson at ALPSP 'Setting The Standard' 2015
Institutional Identifiers - Phil Nicolson at ALPSP 'Setting The Standard' 2015Ringgold Inc
 
Metadata Standards: A Golden Age Arrives? - Christine Orr at STM
Metadata Standards: A Golden Age Arrives? - Christine Orr at STMMetadata Standards: A Golden Age Arrives? - Christine Orr at STM
Metadata Standards: A Golden Age Arrives? - Christine Orr at STMRinggold Inc
 
Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henr...
Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henr...Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henr...
Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henr...Ringgold Inc
 
Metadata & Standards in Scholarly Communication
Metadata & Standards in Scholarly CommunicationMetadata & Standards in Scholarly Communication
Metadata & Standards in Scholarly CommunicationRinggold Inc
 
Pulling Together: information flow throughout the scholarly supply chain
Pulling Together: information flow throughout the scholarly supply chainPulling Together: information flow throughout the scholarly supply chain
Pulling Together: information flow throughout the scholarly supply chainRinggold Inc
 
Institutional Identifiers in Practice: Christine Orr at CESSE 2015
Institutional Identifiers in Practice: Christine Orr at CESSE 2015Institutional Identifiers in Practice: Christine Orr at CESSE 2015
Institutional Identifiers in Practice: Christine Orr at CESSE 2015Ringgold Inc
 
Small Data, Big Benefits - Christine Orr at SSP 2016
Small Data, Big Benefits - Christine Orr at SSP 2016Small Data, Big Benefits - Christine Orr at SSP 2016
Small Data, Big Benefits - Christine Orr at SSP 2016Ringgold Inc
 

Viewers also liked (13)

Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016
Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016
Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016
 
Unique Identifiers for Business Partners: progress with ISNI, the Ringgold ID...
Unique Identifiers for Business Partners: progress with ISNI, the Ringgold ID...Unique Identifiers for Business Partners: progress with ISNI, the Ringgold ID...
Unique Identifiers for Business Partners: progress with ISNI, the Ringgold ID...
 
Spring Cleaning: Easy Ways to Tidy Your Customer Data
Spring Cleaning: Easy Ways to Tidy Your Customer DataSpring Cleaning: Easy Ways to Tidy Your Customer Data
Spring Cleaning: Easy Ways to Tidy Your Customer Data
 
Using Data to Drive Discovery of New Scholarly Works
Using Data to Drive Discovery of New Scholarly WorksUsing Data to Drive Discovery of New Scholarly Works
Using Data to Drive Discovery of New Scholarly Works
 
Persistent Identifiers - The 5 Things You Need To Know
Persistent Identifiers - The 5 Things You Need To KnowPersistent Identifiers - The 5 Things You Need To Know
Persistent Identifiers - The 5 Things You Need To Know
 
Institutional Identifiers - Phil Nicolson at ALPSP 'Setting The Standard' 2015
Institutional Identifiers - Phil Nicolson at ALPSP 'Setting The Standard' 2015Institutional Identifiers - Phil Nicolson at ALPSP 'Setting The Standard' 2015
Institutional Identifiers - Phil Nicolson at ALPSP 'Setting The Standard' 2015
 
Metadata Standards: A Golden Age Arrives? - Christine Orr at STM
Metadata Standards: A Golden Age Arrives? - Christine Orr at STMMetadata Standards: A Golden Age Arrives? - Christine Orr at STM
Metadata Standards: A Golden Age Arrives? - Christine Orr at STM
 
Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henr...
Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henr...Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henr...
Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henr...
 
Nicolson
NicolsonNicolson
Nicolson
 
Metadata & Standards in Scholarly Communication
Metadata & Standards in Scholarly CommunicationMetadata & Standards in Scholarly Communication
Metadata & Standards in Scholarly Communication
 
Pulling Together: information flow throughout the scholarly supply chain
Pulling Together: information flow throughout the scholarly supply chainPulling Together: information flow throughout the scholarly supply chain
Pulling Together: information flow throughout the scholarly supply chain
 
Institutional Identifiers in Practice: Christine Orr at CESSE 2015
Institutional Identifiers in Practice: Christine Orr at CESSE 2015Institutional Identifiers in Practice: Christine Orr at CESSE 2015
Institutional Identifiers in Practice: Christine Orr at CESSE 2015
 
Small Data, Big Benefits - Christine Orr at SSP 2016
Small Data, Big Benefits - Christine Orr at SSP 2016Small Data, Big Benefits - Christine Orr at SSP 2016
Small Data, Big Benefits - Christine Orr at SSP 2016
 

Similar to What Publishers Need to Know About Web Scale Discovery

Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...Ringgold Inc
 
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)Denodo
 
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)Denodo
 
conceptClassifier For SharePoint Driving Business Value
conceptClassifier For SharePoint Driving Business ValueconceptClassifier For SharePoint Driving Business Value
conceptClassifier For SharePoint Driving Business Valuemartingarland
 
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)Denodo
 
The future of scholarly publishing under digital transformation data, ai an...
The future of scholarly publishing under digital transformation   data, ai an...The future of scholarly publishing under digital transformation   data, ai an...
The future of scholarly publishing under digital transformation data, ai an...Xiaofeng Chen
 
BEA Pathways - Expertise Location
BEA Pathways - Expertise LocationBEA Pathways - Expertise Location
BEA Pathways - Expertise LocationHutch Carpenter
 
Managing Electronic Resources for Public Libraries: Part 2
Managing Electronic Resources for Public Libraries: Part 2Managing Electronic Resources for Public Libraries: Part 2
Managing Electronic Resources for Public Libraries: Part 2ALATechSource
 
Data Profiling: The First Step to Big Data Quality
Data Profiling: The First Step to Big Data QualityData Profiling: The First Step to Big Data Quality
Data Profiling: The First Step to Big Data QualityPrecisely
 
Optimising Your Content for Findability
Optimising Your Content for FindabilityOptimising Your Content for Findability
Optimising Your Content for FindabilityFindwise
 
Information Search
Information SearchInformation Search
Information Searchallerhed
 
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...IJTET Journal
 
If You Tag it, Will They Come? Metadata Quality and Repository Management
If You Tag it, Will They Come? Metadata Quality and Repository ManagementIf You Tag it, Will They Come? Metadata Quality and Repository Management
If You Tag it, Will They Come? Metadata Quality and Repository ManagementSarah Currier
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolutionitnewsafrica
 
How a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewHow a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewDenodo
 
Managing Electronic Resources for Public Libraries, Part 1
Managing Electronic Resources for Public Libraries, Part 1Managing Electronic Resources for Public Libraries, Part 1
Managing Electronic Resources for Public Libraries, Part 1ALATechSource
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIDenodo
 
Enterprise Data Marketplace: A Centralized Portal for All Your Data Assets
Enterprise Data Marketplace: A Centralized Portal for All Your Data AssetsEnterprise Data Marketplace: A Centralized Portal for All Your Data Assets
Enterprise Data Marketplace: A Centralized Portal for All Your Data AssetsDenodo
 

Similar to What Publishers Need to Know About Web Scale Discovery (20)

Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
 
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
 
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
 
conceptClassifier For SharePoint Driving Business Value
conceptClassifier For SharePoint Driving Business ValueconceptClassifier For SharePoint Driving Business Value
conceptClassifier For SharePoint Driving Business Value
 
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
 
The future of scholarly publishing under digital transformation data, ai an...
The future of scholarly publishing under digital transformation   data, ai an...The future of scholarly publishing under digital transformation   data, ai an...
The future of scholarly publishing under digital transformation data, ai an...
 
BEA Pathways - Expertise Location
BEA Pathways - Expertise LocationBEA Pathways - Expertise Location
BEA Pathways - Expertise Location
 
Managing Electronic Resources for Public Libraries: Part 2
Managing Electronic Resources for Public Libraries: Part 2Managing Electronic Resources for Public Libraries: Part 2
Managing Electronic Resources for Public Libraries: Part 2
 
Data Profiling: The First Step to Big Data Quality
Data Profiling: The First Step to Big Data QualityData Profiling: The First Step to Big Data Quality
Data Profiling: The First Step to Big Data Quality
 
Optimising Your Content for Findability
Optimising Your Content for FindabilityOptimising Your Content for Findability
Optimising Your Content for Findability
 
Information Search
Information SearchInformation Search
Information Search
 
uae views on big data
  uae views on  big data  uae views on  big data
uae views on big data
 
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
 
If You Tag it, Will They Come? Metadata Quality and Repository Management
If You Tag it, Will They Come? Metadata Quality and Repository ManagementIf You Tag it, Will They Come? Metadata Quality and Repository Management
If You Tag it, Will They Come? Metadata Quality and Repository Management
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
How a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewHow a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 View
 
Managing Electronic Resources for Public Libraries, Part 1
Managing Electronic Resources for Public Libraries, Part 1Managing Electronic Resources for Public Libraries, Part 1
Managing Electronic Resources for Public Libraries, Part 1
 
Content analytics
Content analyticsContent analytics
Content analytics
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
 
Enterprise Data Marketplace: A Centralized Portal for All Your Data Assets
Enterprise Data Marketplace: A Centralized Portal for All Your Data AssetsEnterprise Data Marketplace: A Centralized Portal for All Your Data Assets
Enterprise Data Marketplace: A Centralized Portal for All Your Data Assets
 

More from Ringgold Inc

Identify Database User Group Meeting 2017 UK
Identify Database User Group Meeting 2017 UKIdentify Database User Group Meeting 2017 UK
Identify Database User Group Meeting 2017 UKRinggold Inc
 
Using your Data to Drive Revenue – Laura Cox at London Book Fair 2018
Using your Data to Drive Revenue – Laura Cox at London Book Fair 2018 Using your Data to Drive Revenue – Laura Cox at London Book Fair 2018
Using your Data to Drive Revenue – Laura Cox at London Book Fair 2018 Ringgold Inc
 
Ringgold Webinar Series: 4. 30-Minute Workout - Quick Tips for Better Custome...
Ringgold Webinar Series: 4. 30-Minute Workout - Quick Tips for Better Custome...Ringgold Webinar Series: 4. 30-Minute Workout - Quick Tips for Better Custome...
Ringgold Webinar Series: 4. 30-Minute Workout - Quick Tips for Better Custome...Ringgold Inc
 
Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Found...
Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Found...Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Found...
Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Found...Ringgold Inc
 
Ringgold Webinar Series: 1. Taking Stock – Commitment to Healthy Data
Ringgold Webinar Series: 1. Taking Stock – Commitment to Healthy DataRinggold Webinar Series: 1. Taking Stock – Commitment to Healthy Data
Ringgold Webinar Series: 1. Taking Stock – Commitment to Healthy DataRinggold Inc
 
Rubbish in Rubbish out: applying good data governance techniques to gain maxi...
Rubbish in Rubbish out: applying good data governance techniques to gain maxi...Rubbish in Rubbish out: applying good data governance techniques to gain maxi...
Rubbish in Rubbish out: applying good data governance techniques to gain maxi...Ringgold Inc
 

More from Ringgold Inc (8)

Identify Database User Group Meeting 2017 UK
Identify Database User Group Meeting 2017 UKIdentify Database User Group Meeting 2017 UK
Identify Database User Group Meeting 2017 UK
 
Using your Data to Drive Revenue – Laura Cox at London Book Fair 2018
Using your Data to Drive Revenue – Laura Cox at London Book Fair 2018 Using your Data to Drive Revenue – Laura Cox at London Book Fair 2018
Using your Data to Drive Revenue – Laura Cox at London Book Fair 2018
 
Ringgold Webinar Series: 4. 30-Minute Workout - Quick Tips for Better Custome...
Ringgold Webinar Series: 4. 30-Minute Workout - Quick Tips for Better Custome...Ringgold Webinar Series: 4. 30-Minute Workout - Quick Tips for Better Custome...
Ringgold Webinar Series: 4. 30-Minute Workout - Quick Tips for Better Custome...
 
Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Found...
Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Found...Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Found...
Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Found...
 
Ringgold Webinar Series: 1. Taking Stock – Commitment to Healthy Data
Ringgold Webinar Series: 1. Taking Stock – Commitment to Healthy DataRinggold Webinar Series: 1. Taking Stock – Commitment to Healthy Data
Ringgold Webinar Series: 1. Taking Stock – Commitment to Healthy Data
 
Rubbish in Rubbish out: applying good data governance techniques to gain maxi...
Rubbish in Rubbish out: applying good data governance techniques to gain maxi...Rubbish in Rubbish out: applying good data governance techniques to gain maxi...
Rubbish in Rubbish out: applying good data governance techniques to gain maxi...
 
Identify database
Identify databaseIdentify database
Identify database
 
CDO
CDOCDO
CDO
 

Recently uploaded

Marketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent ChirchirMarketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent Chirchirictsugar
 
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort ServiceCall US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Servicecallgirls2057
 
Buy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail AccountsBuy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail AccountsBuy Verified Accounts
 
Annual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesAnnual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesKeppelCorporation
 
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!Doge Mining Website
 
The-Ethical-issues-ghhhhhhhhjof-Byjus.pptx
The-Ethical-issues-ghhhhhhhhjof-Byjus.pptxThe-Ethical-issues-ghhhhhhhhjof-Byjus.pptx
The-Ethical-issues-ghhhhhhhhjof-Byjus.pptxmbikashkanyari
 
Guide Complete Set of Residential Architectural Drawings PDF
Guide Complete Set of Residential Architectural Drawings PDFGuide Complete Set of Residential Architectural Drawings PDF
Guide Complete Set of Residential Architectural Drawings PDFChandresh Chudasama
 
PSCC - Capability Statement Presentation
PSCC - Capability Statement PresentationPSCC - Capability Statement Presentation
PSCC - Capability Statement PresentationAnamaria Contreras
 
Call Us 📲8800102216📞 Call Girls In DLF City Gurgaon
Call Us 📲8800102216📞 Call Girls In DLF City GurgaonCall Us 📲8800102216📞 Call Girls In DLF City Gurgaon
Call Us 📲8800102216📞 Call Girls In DLF City Gurgaoncallgirls2057
 
Investment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy CheruiyotInvestment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy Cheruiyotictsugar
 
Cyber Security Training in Office Environment
Cyber Security Training in Office EnvironmentCyber Security Training in Office Environment
Cyber Security Training in Office Environmentelijahj01012
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Kirill Klimov
 
International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...ssuserf63bd7
 
Digital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfDigital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfJos Voskuil
 
Market Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMarket Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMintel Group
 
Church Building Grants To Assist With New Construction, Additions, And Restor...
Church Building Grants To Assist With New Construction, Additions, And Restor...Church Building Grants To Assist With New Construction, Additions, And Restor...
Church Building Grants To Assist With New Construction, Additions, And Restor...Americas Got Grants
 
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckPitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckHajeJanKamps
 
Kenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby AfricaKenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby Africaictsugar
 

Recently uploaded (20)

Marketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent ChirchirMarketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent Chirchir
 
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort ServiceCall US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
 
Buy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail AccountsBuy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail Accounts
 
Annual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesAnnual General Meeting Presentation Slides
Annual General Meeting Presentation Slides
 
Enjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCR
Enjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCREnjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCR
Enjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCR
 
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!
Unlocking the Future: Explore Web 3.0 Workshop to Start Earning Today!
 
The-Ethical-issues-ghhhhhhhhjof-Byjus.pptx
The-Ethical-issues-ghhhhhhhhjof-Byjus.pptxThe-Ethical-issues-ghhhhhhhhjof-Byjus.pptx
The-Ethical-issues-ghhhhhhhhjof-Byjus.pptx
 
Guide Complete Set of Residential Architectural Drawings PDF
Guide Complete Set of Residential Architectural Drawings PDFGuide Complete Set of Residential Architectural Drawings PDF
Guide Complete Set of Residential Architectural Drawings PDF
 
PSCC - Capability Statement Presentation
PSCC - Capability Statement PresentationPSCC - Capability Statement Presentation
PSCC - Capability Statement Presentation
 
Call Us 📲8800102216📞 Call Girls In DLF City Gurgaon
Call Us 📲8800102216📞 Call Girls In DLF City GurgaonCall Us 📲8800102216📞 Call Girls In DLF City Gurgaon
Call Us 📲8800102216📞 Call Girls In DLF City Gurgaon
 
Investment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy CheruiyotInvestment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy Cheruiyot
 
Cyber Security Training in Office Environment
Cyber Security Training in Office EnvironmentCyber Security Training in Office Environment
Cyber Security Training in Office Environment
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024
 
International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...
 
No-1 Call Girls In Goa 93193 VIP 73153 Escort service In North Goa Panaji, Ca...
No-1 Call Girls In Goa 93193 VIP 73153 Escort service In North Goa Panaji, Ca...No-1 Call Girls In Goa 93193 VIP 73153 Escort service In North Goa Panaji, Ca...
No-1 Call Girls In Goa 93193 VIP 73153 Escort service In North Goa Panaji, Ca...
 
Digital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfDigital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdf
 
Market Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMarket Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 Edition
 
Church Building Grants To Assist With New Construction, Additions, And Restor...
Church Building Grants To Assist With New Construction, Additions, And Restor...Church Building Grants To Assist With New Construction, Additions, And Restor...
Church Building Grants To Assist With New Construction, Additions, And Restor...
 
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckPitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
 
Kenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby AfricaKenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby Africa
 

What Publishers Need to Know About Web Scale Discovery

  • 1. SSP Annual Conference June 2013 Jay Henry
  • 2. Introduction Web Scale Discovery in brief & why it matters Metadata – new ruler of the realm Life Cycle of Metadata – Publisher as Parent Evangelic Appeal for Standards Strategies, Tactics & Pitfalls to Avoid
  • 3. Many terms tossed around… Federated search, Metasearch, NextGen catalogs, discovery layers --- and now “Web Scale Discovery Service” An improved search experience has always been the motivation behind innovation… The latest generation of tools are something different.
  • 4. A Definition A pre-harvested central index coupled with a richly featured discovery layer providing a single search across a library’s local, open access, and subscription collections. …but it’s more than that
  • 5. Not Just Another Search PDA/DDA are purchasing models that were ahead of technologies ability to properly accommodate. The acquisition systems developed in conjunction with WSD represent a logical progression of capabilities Patron-driven acquisition, or PDA, is not new, but it is on the rise. Approximately 400 to 600 libraries worldwide have switched to a patron-driven system for purchasing new works, and that number is likely to double over the next year and a half (2012)
  • 8. Content is King? Metadata is the real ruler of the realm Using descriptions of content to generate purchase and use is more important now than ever So, if we know what the target is, how do we create the best possible metadata?
  • 9. The Black Box The people who know how these systems work aren’t telling
  • 11. The Basics (More Is Better) Title Author Format ISBN Subject categories Imprint Link to publisher’s dedicated page Publication Date Price
  • 12. Data = Sales Titles that meet the BIC Basic standard see average sales 98% higher than those that don’t meet the standard Records with complete BIC Basic data but no image have average sales…of 473% [higher] in comparison to those records which have neither the complete BIC Basic data elements or an image. The difference in average sales between records which… don’t have enhanced metadata, and records which do…have enhanced metadata elements is on average over2,600 units, which represents an increase of almost 700%
  • 14. How identifiers help Proper understanding of the customer, whether author, reader or institution Provides a simple basis for wider data governance: Data governance, as defined at Ringgold, is the processes, policies, standards, organization, and technologies required to manage and ensure the availability, accessibility, quality, consistency, auditability, and security of data.
  • 15. The supply chain Consortium Author Submission and Peer Review System Publisher Technology Partner Subscription Agent or Sales Agent Fulfilment House or System Library Discovery Service WSDs End User Data Syndication Targets Consortium Societies FundersCitation
  • 16. The supply chain Consortium Submission and Peer Review System Technology Partner Subscription Agent or Sales Agent Fulfilment House or System End User Consortium Societies FundersCitation
  • 17. The supply chain using identifiers Consortium
  • 18. The supply chain using identifiers Consortium
  • 19. The supply chain using identifiers Consortium
  • 20. Strategy Suggestions Create the most complete metadata possible Distribute widely and efficiently Adhere to standards Uniquely describe each manifestation of a work Develop an internal policy to create uniform data across all published works
  • 21. Practical Tactics Require Authors to establish an ORCID profile Create links into content, the more specific the better Develop concise descriptions of content (not jacket copy) Include as much as practical – e.g. abstracts of chapters are often written by the authors themselves Apply unique identifiers to establish longevity of the metadata (e.g. ORCID, ISBN, ISSN, DOIs Ringgold ID, ISNI) Evaluate the benefits of working with outside partners to assist in metadata development, application and syndication
  • 22. Pitfalls to Avoid Non-Standardised Naming Conventions Result: Poorly associated data in the supply chain. Example 1: Inconsistent author listings, e.g. John Smith, J Smith, Smith J etc. Solution: use ORCID numbers Example 2: Lack of affiliations between authors and institutional customers. Solution: use the Ringgold or ISNI number Example 3: Inability to link author and customer data together. Solution: use the Ringgold or ISNI number
  • 23. Pitfalls to Avoid (continued) Lack of or Inadequate Subject Classifications and Keywords: Result: Dramatic negative effect the positioning of content in relevancy rankings in discovery or search services Example 1: Applying non-standard subject classifications causes a mismatch against what is expected by libraries or end-users Solution: Understand the standards and best practices being applied by current systems and similar publishers; provide information in a form that will most easily utilized by the systems presenting your data Example 2: DDA sales are lost because subjects were applied without using an international standard resulting in poor search results among international users; cross-discipline keywords lacking entirely e.g. Football in the US does not mean the same as Football in Europe. Solution: Adopt an internal policy to adhere to an accepted standard at the core of subject description, and then expand the description using keywords in the abstract/summary copy.
  • 24. Pitfalls to Avoid (continued) Format and versions: Result: Confusion within sales and distribution channels Example 1: Users fail to find a compatible format for the title they want Solution: Apply ISBNs correctly – unique identifier for each e-edition Example 2: Citations are incorrect or inconsistent Solution: Apply version-specific pagination if appropriate Example 3: Links to content fail over time Solution: Apply DOIs to establish a persistent and reliable link Example 4: Data is not fully utilized/indexed by discovery systems Solution: Output information in industry standard formats (ONIX)
  • 25. Pitfalls Lack of high quality information reduces the likelihood of content to be discovered.
  • 26. References  The Ins and Outs of Evaluating Web-Scale Discovery Services by Athena Hoeppner http://www.infotoday.com/cilmag/apr12/Hoeppner-Web-Scale-Discovery-Services.shtml  Stakeholders Strive to Define Standards for Web-Scale Discovery Systems By Michael Kelley on October 11, 2012 http://www.thedigitalshift.com/2012/10/discovery/coming-into-focus-web-scale-discovery-services-face-growing-need-for-best-practices  White Paper: The Link Between Metadata and Sales By Andre Breedt, Head of Publisher Account Management; David Walter, Research and Development Analyst, 2012 http://www.isbn.nielsenbook.co.uk/uploads/3971_Nielsen_Metadata_white_paper_A4(3).pdf  The BIC Basic standards for bibliographic data provision http://www.bic.org.uk/17/BIC-Basic/  Web-Scale Discovery in an Academic Health Sciences Library: Development and Implementation of the EBSCO Discovery Service DOI:10.1080/02763869.2013.749111JoLinda L. Thompsona* , Kathe S. Obriga & Laura E. Abatea Medical Reference Services Quarterly Volume 32, Issue 1, 2013 http://www.tandfonline.com/doi/abs/10.1080/02763869.2013.749111  Discoverability Challenges and Collaboration Opportunities within the Scholarly Communications Ecosystem: A SAGE White Paper Update by Mary M. Somerville, University of Colorado Denver;Lettie Y. Conrad, SAGE Collaborative Librarianship Vol 5, No 1 (2013)  Affection for PDA By Steve Kolowich 2012 Inside Higher Ed http://www.insidehighered.com/news/2012/06/20/research-foresees-demand-driven-book-acquisition-replacing-librarians- discretion#ixzz2VWOAqWoU
  • 27. Jay Henry Vice President Ringgold Inc. Jay.henry@ringgold.com

Editor's Notes

  1. Good afternoon, my name is Jay Henry and I’m with Ringgold – we are a data services company with offices in Portland and near Oxford UK. Our business has two main areas of focus- working with publishers to normalize (clean) and uniquely identify their internal data and on another side of the business we provide metadata creation and dissemination services. [may want to mention Book News] Today I will be speaking as an advocate for excellent metadata, and while I believe everything worth creating is worth thoroughly describing, for the purposes of this talk I will be focusing my comments on scholarly monographs and basic data elements common to all areas of publishing. The content of this presentation is meant to inform the initiated and educate those new to the concept of “Web Scale Discovery Services”, but my focus on metadata should apply to any publishing strategy regardless of the downstream target of your data. I will touch on how the emergence of this technology is enabling new types of acquisition models, highlight the challenges to publishers, and provide some practical information for you to consider when deciding how to approach metadata creation. Please understand, I will be speaking about only a small portion of the supply chain directly related to exposure and discovery of content. Specifically, the linkage between Publishers & their contributors, intermediaries, libraries and their patrons, and the effect and importance of WSDs in that context. Of course, the benefits of well-formed metadata are so profound as to provide a direct benefit to scholarship… I won’t go down the road of making a philosophical argument that publisher’s have a moral obligation to strive for the highest standards, but you can see how I’m thinking about this topic. Let’s just not forget that good quality metadata has a positive effect in many areas of the supply chain, and natural efficiencies are the result; that should be reason enough to attempt to stay awake at this awkward hour for consciousness. I will be making the case that the emergence of WSD in conjunction with new acquisition models represents a real change in the supply chain which requires attention from publishers to ensure they are doing everything possible to ensure their content will be in the best possible position to be discovered.
  2. There are a lot of terms tossed around when we talk about search [read terms] – read next A quick clarification on definitions - Are we hearing different names for the same things? No. The term Web Scale or “discovery services” is being used throughout the publishing industry as the most recent darling buzzword…and for good reason. Web search utilities (Google, Bing, etc.) have transformed library patron and researcher behavior. “Search” is maturing as a concept and taking on new dimensions within libraries as they strive to compete with mainstream search services. Define WebScale as the next step in focused, de-cluttered, search capability that provides visibility to resources beyond the library, and puts more power in the hands of patrons to influence purchasing– not only through DDA, but by their behavior and the extent to which they interact with content (circ stats as a means of judging/vetting quality/utility  often the same– and then making purchase/renewal decisions. About discovery – and systems like these… this is what exposes content, not catalogues or flyers or special promotional emails…what sells and circulates content is putting the right information in front of the right consumer and enabling access– users (especially librarian buyers) will spend vast amounts of their time with a handful of familiar tools—presenting the right data within those tools should be a top priority for every publisher.
  3. The term “discovery services” is being used throughout the publishing industry as the most recent darling buzzword…and for good reason. Web search utilities (Google, Bing, etc.) have transformed library patron and researcher behavior. “Search” is maturing as a concept and taking on new dimensions within libraries as they strive to compete with mainstream search services.
  4. More of a “game changer” I believe WSD services represent a truly mature search technology for libraries that will provide benefits to users and the libraries themselves by allowing non-owned resources to be part of the central index. DDA Emerging as an important new way to present title information to patrons This model delivers what patrons want – and users have driven adoption of change more than any other factor The proliferation of WSD goes beyond the main players that I mentioned earlier; some system vendors (of current ILS installations within libraries) have begun to integrate WSD services by partnership and technology integration.
  5. Re: web search – the ability to search across the web changed user behavior and their expectations– federated search has been trying to delivery a similar experience to users, but only now is there the potential to delivery a vastly improved, yet focused, search for academic research. Non-linear lending– might want to mention ProQuest/EBL/Ebrary as innovators in experimenting with new acquisition models;
  6. Complexity can be managed by systems—in fact, whenever a need arises, a solution appears; however, the best solutions can not work with poor quality data—the old cliché of ‘garbage in garbage out’ still applies. There is more content to describe than ever, and as a result, unique identifiers are the best way to disambiguate and link your data to relevant sources. metadata has been cooped up for a while, and is not feeling it’s old strength I’m here to talk about the importance of good quality metadata (and what is meant by “good quality”) in the context of web scale discovery systems not because the term is the flavor of the month, but because they matter—this is an important trend that I believe will become the standard model not only within academic institutions, but everywhere. ---COUNTER DATA---???? ---Some publishers are better than others– there is a range, and those doing the best job tend to be the largest and most well recognized brands which increases their ability to ensure their content is discovered; more than ever, descriptive data is a competitive factor
  7. WSDS – the importance of complete metadata in order to support systems no one really understands The only solution is to provide as much data as possible in order to provide the broadest description possible to provide the algorithms at work the raw material that will ultimately produce hits and increased visibility
  8. Publishers must drive the creation and initial proliferation of complete and high quality metadata: Reference -Nielsen study Publishers are the first, and should be the best, source of metadata for a title. Still, much of what can and will be added as part of a ‘description’ of a work will be created after the thing is actually published, and so, metadata grows within the supply chain over time; those records that have a strong start will be the most utilized and afford the greatest benefit to the publisher In my introduction I used the term, ‘Publisher as Parent”… one thing a good parent provides for it’s newly created work is a unique name; in the case of monographs it is possible and advantageous to uniquely identify not only the work and it’s various manifestations, but also content within the work; deep linking content and expanding the descriptive data associated with each discrete chunk (e.g. chapters) provides an excellent start to a young work’s descriptive foundation. Ultimately, publishers benefit from looking at meta-metadata…metrics that allow publishers to evaluate their publishing strategies and focus on areas where they experience greater success or can see trends in user behavior. Just as important, content will increasingly be judged based on usage– the same data that exposes titles for purchase drives ongoing circulation and renewals.
  9. I’ve listed the bare minimum here; the BIC bibliographic standard is a good list of what should be supplied, but of course, more is better, always always always. The important thing to remember about creating good quality metadata is to adhere to standards and uniquely identify everything possible.
  10. Let’s get specific about what kind of metadata is worthy of adjectives like, ‘better’, or ‘complete’– Unique Identifiers allow content to be disambiguation, internal, external, etc… standards grease the wheels of the supply chain
  11. In my introduction I used the term, ‘Publisher as Parent”… one thing a good parent provides for it’s newly created work is a unique name; in the case of monographs it is possible and advantageous to uniquely identify not only the work and it’s various manifestations, but also content within the work; deep linking content and expanding the descriptive data associated with each discrete chunk (e.g. chapters) provides an excellent start to a young work’s descriptive foundation. If we take a few of the participants, apply standard identifiers and adopt a data distribution policy that spreads and enhances the initial record, and things begin to change.
  12. Highlight slide
  13. Highlight slide
  14. Highlight slide – metadata combined with standard identifiers CHANGE the supply chain...merging at an ever increasing rate and the flow of information across systems will be key to exposing content and realizing sales and use of works.
  15. After everything I’ve said to this point, this slide is really a summary of what I’ve already advocated – more is better, wide distribution, standards, unique identification and a policy to create consistent descriptive output… from that strategic foundation, more can be done, but this is the minimum. What do I mean by complete? –deep, chapters, summaries of chapters, links to chapters, images, etc. Efficient distribution means pointing data in directions which “trickle out” and which leads to further enrichment of the description, including the addition of user generated content/reviews etc. Powerful new tools are now widely available to create clear metrics that provide the basis for better informed decisions by institutional purchasers.
  16. Re: apply unique ids – “once uniquely identified, always uniquely identified”Each format through which you publish your book requires its own ISBNbecause this thirteen-digit numeral unmistakably identifies the title, edition, binding, and publisher of a given work. So your paper book will have its own ISBN, the audiobook will have its own ISBN, and the ebookits own ISBN. Re: Evaluate: Many publishers have the resources to do a good job and are doing so, others simply don’t have the resources to put a complete plan together and execute—nonetheless creating the best possible data for your content is critical regardless of how it’s accomplished.
  17. “ Once uniquely identified, always uniquely identified”… by definition
  18. If you remember nothing else, remember this!