SlideShare uma empresa Scribd logo
1 de 24
Overview of Bowker’s
            Metadata Processes


Patricia Payton
Senior Director, Publisher Relations & Content Development for
Bowker
908.219.0241 Patricia.Payton@bowker.com
Agenda
• Bowker’s role in the marketplace
   – Customer workflows
   – Selected client lists
   – Bowker products
• Bowker metadata management
   –   Aggregated and enhanced content
   –   Value added processes
   –   Processing of publisher data and audits
   –   Testing new data feeds
   –   Publisher outreach priorities
• Next cooperative steps
Bowker’s Role in the Marketplace
Representative Clients by Market Segment
   Publishers:           Retailers:        Libraries:
   Random House          Barnes & Noble    New York Public
   HarperCollins         Follett College   Brooklyn Public
   Hachette              Stores            Chicago Public
   Elsevier              Indigo            Johns Hopkins University
   Macmillan             Abebooks.com      Harvard
   Cengage               Hastings          Yale
   Wiley                 Sony              Princeton
                         Apple             Queens Borough Public
   Schools:              eBay              State of Oklahoma
   NYC DOE

   web services:
   Anthology             Blackboard.com
   Columbia University   New York Times
   MIT                   EdMap
Bowker eBook Customers Today
• 45 customers currently purchase eBook data feeds
    – Borders, B&N, SWETS, NY Times
• Libraries need central repository to better identify all eContent
• All products incorporate eBook metadata
    – Publisher data represents 82% of collection
    – Aggregator and conversion house data also stored
Search & Discovery Products
Bowker Books In Print®
    •   > 1,200 retail & library clients (>10K locations) make buying decisions using this online
        bibliographic reference tool
    •   Content is aggregated and standardized
    •   > 20M records; > 13M “active” book records
Search & Discovery Products
Bowker ® Syndetic Solutions
    •   Library catalog (OPAC) enrichment service
    •   2.3B queries/month; >11M content elements; updated weekly
    •   Cover images, Tables of Contents, Summaries, Reviews, First chapters, Author
        notes, Awards, & Knowledge Profiles
    •   Includes books, videos & music in English, Spanish, German, Swedish & Italian
    •   Analytics show users search “long tail”—29K hits, most requested title 18
Traditional vs. Content Searching
                                    Searches select
                                    metadata fields only




Searches all
available content
Search & Discovery Products
Bowker Data Licensing
   •    Embed data in customer
        acquisition & workflow
        processes
   •    60+ clients including major
        retailers, small startups,
        eBook platforms, and
        search engines
   •    User controls processing
        rules
   •    Works via pull or push
        methods
Metadata Management
              Customer and
              Product Needs


Audits and                     Aggregated
Gap Filling                     Metadata




  Value Added                 Enhanced
   Processes                   Content
Aggregated & Enhanced Content
Content from the Supply Chain
    •   Data Feeds – National Libraries, Publishers, & Distributors
    •   Price & Availability Notifications – Wholesalers
Licensed Content
    •   Full Text Reviews – PW, LJ and SLJ, NYT (adding UK sources in 2011)
    •   Review citations – 10 trusted sources Included on > 145K ISBNs
           •     NY Times Book Review, Los Angeles Times, San Fran. Chronicle, and USA Today
Bowker Created Content
    •   Author Biographies – > 80,000 authors
    •   Bestseller lists – 23 sources
           •     Including New York Times, Los Angeles Times, USA Today, and The Wall Street Journal
           •     Included on >225K ISBNs even Audio, Video, Print and E-Book
           •     > 100 Years At a Glance synopses
           •     Detail listings for PW and NY Times on position and length on list
    •   Media mentions – 25 sources
           •     Business Week, Entertainment Weekly, Time, Good Morning America, Oprah, NPR
    •   Awards – > 400 sources
    •   Knowledge Profiles – 225K unique across all subjects
           •     Genre and sub-genre, Author, Title, Characters of book and traits, themes, keyword related
           •     U.S. titles only
Knowledge Profile Creation
Value Added Processes
Subject Classification
     •   Bowker stores and forwards publisher-assigned BISAC subject codes
     •   Many of Bowker customers use our more specific subject terms
           • Bowker’s scheme has > 80K Bowker subject terms compared to BISAC’s 3700 codes
           • All Bowker codes are mapped to BISAC and BIC codes for easy updating
 Title Linking
     •   ISBNs of the same intellectual work are linked
     •   Title, subtitle, and first contributor matches are given a unique title record number
     •   Unique title record number links all editions to valued-added data such as:
           • Bowker subject classifications
           • Reviews & review citations
           • Awards
           • Media mentions
           • Bestseller notations
           • Chapter excerpts
           • Dewey, Library of Congress and British Library classification schemes
           • Lexile measures from Metametrics (for children’s books)
Linking Enriched Content Across Formats
Linking eBook Metadata
• Feature vendor specific information
• Display of agency and institutional pricing
Processing of Publisher Data
File Process
     •   Process goal is 48 hours of receipt
     •   Automated process pulls from FTP and submits each file
     •   Data locks down 90 days past publication date
           • Only updates to status, returns, and price related fields are allowed
Individual file audit reports run
     •   Exclude Report--
           • ISBN is invalid (e.g., 9 digits, or check-digit will not validate)
           • Publisher is not properly linked to current Distributor
           • New Imprint for publisher is in file but not in Bowker’s Publisher Authority database
           • ISBN status is “No Longer Stocked by Us” or “Refer to another Supplier” (meaning
               the supplier of the file no longer carries that ISBN)
     •   Title Change Report
     •   Contributor Change Report
Processes vary for print, eBooks, and cover images
Database Audit Processes
Daily
 • Query/review prices over $400

Weekly
 • High profile titles

Monthly
 • Un-fielded data
 • Upper case titles
 • Undefined articles
 • Bestselling and classic authors are cleaned
 • Bad contributor cleaning
 • Research ISBNs with “untitled” titles
 • Remove pipe characters, carriage returns and line feeds from titles and contributors
On demand
 • Review for timeliness of data
 • Bad publisher/imprint symbols
Testing Process for New Feeds
     Publisher                            Data Integration                 Quality Assurance                    Production
     Relations                        •     Map file                      •     In-depth quality          •     FTP account set up
                                            imprint/publishers                  review of all titles      •     Statement of Use
•   Validation of
                                            to our database               •     Compare file to                 supplied to
    ONIX files
                                      •     Load data to test                   data already in BIP             publisher
•   Check required                          system                        •     Review                    •     Cover images
    data fields                       •     Work excludes                       completeness of                 requested
    present                           •     Supply audit of                     data
•   Brief quality scan                      records to QA                 •     For Excel
                                                                                files, verify scripting
•   Determine                                                                   was correct
    quantity of
    records supplied
•   Write script for
    conversion of
    Excel files to
    ONIX
File can move on                                                              File can move on
in process or be                                                              in process or be
   returned to                                                                   returned to
    publisher             6 weeks                                                 publisher
                          average
                            wait
    1 week on            time due                          2 weeks on average to complete the testing process
     average               to files
                         in queue
Publisher Outreach Priorities
Gap filling
     •   Forthcoming titles (i.e. price, annotation, and cover image at 60 days prior to
         publication)
     •   Validating that older titles (pre 2000) that are still active in our system are still available
     •   Identifying issues around items lacking prices in our system
           • Including items that were cancelled, are not for sale separately, or are no longer
              distributed
Establishing eBook metadata feeds
     • With publishers, eBook aggregators and distributors
Free full content indexing service
     •   Whereby Bowker extracts keywords and phrases with relevancy and frequency scores to
         embed behind the scenes in products
Understanding the use of ISBNs for digital products
Next Cooperative Steps
•       Data Submission Guides
•       Additional documents available
    –     Data integrity document (more detail on audit reports and processes)
    –     Publisher profile data (details on current state of your data)
•       Exchange contact details for particular types of issues
•       Discuss file format and data fields best for your title set
•       Set date for test file submission
About Bowker
 Bowker is the world's leading provider of bibliographic information
 management solutions designed to help
 publishers, booksellers, and libraries better serve their customers.
 The company is focused on developing various tools and products
 that make books easier for people to discover, evaluate, order, and
 experience, as well as providing services to publishers that help
 them better understand and meet the interests of readers
 worldwide. Bowker is an affiliated business of ProQuest and is
 headquartered in New Providence, New Jersey, with additional
 operations in England and Australia.

 For more information, please visit www.bowker.com.
 Follow Us On Twitter @DiscoverBowker

Mais conteúdo relacionado

Semelhante a Overview of Bowker's Metdata Processes

BISAC Identification Committee -- ISTC
BISAC Identification Committee -- ISTCBISAC Identification Committee -- ISTC
BISAC Identification Committee -- ISTC
bisg
 
POOFCharleston2011
POOFCharleston2011POOFCharleston2011
POOFCharleston2011
2CULPOOF
 
Building a Better Knowledgebase: An Investigation of Current Practical Uses a...
Building a Better Knowledgebase: An Investigation of Current Practical Uses a...Building a Better Knowledgebase: An Investigation of Current Practical Uses a...
Building a Better Knowledgebase: An Investigation of Current Practical Uses a...
NASIG
 
Kuali update v4 - mw
Kuali update   v4 - mwKuali update   v4 - mw
Kuali update v4 - mw
sarnoa
 
Building enterprise records management solutions for share point 2010
Building enterprise records management solutions for share point 2010Building enterprise records management solutions for share point 2010
Building enterprise records management solutions for share point 2010
Eric Shupps
 
Publishing Partnerships: Why, When, and How Collaboration Sometimes Trumps Co...
Publishing Partnerships: Why, When, and How Collaboration Sometimes Trumps Co...Publishing Partnerships: Why, When, and How Collaboration Sometimes Trumps Co...
Publishing Partnerships: Why, When, and How Collaboration Sometimes Trumps Co...
cuyeki
 

Semelhante a Overview of Bowker's Metdata Processes (20)

Register "New Directions in Cataloging and Metadata Creation"
Register "New Directions in Cataloging and Metadata Creation"Register "New Directions in Cataloging and Metadata Creation"
Register "New Directions in Cataloging and Metadata Creation"
 
Bowker's Metadata Flow
Bowker's Metadata FlowBowker's Metadata Flow
Bowker's Metadata Flow
 
Converting your e resource records to rda-guajardo
Converting your e resource records to rda-guajardoConverting your e resource records to rda-guajardo
Converting your e resource records to rda-guajardo
 
Maximising sales through quality metadata
Maximising sales through quality metadataMaximising sales through quality metadata
Maximising sales through quality metadata
 
ER&L 2019 - Forming a More Perfect Knowledgebase: A Tale of Publisher, Vendor...
ER&L 2019 - Forming a More Perfect Knowledgebase: A Tale of Publisher, Vendor...ER&L 2019 - Forming a More Perfect Knowledgebase: A Tale of Publisher, Vendor...
ER&L 2019 - Forming a More Perfect Knowledgebase: A Tale of Publisher, Vendor...
 
BISAC Identification Committee -- ISTC
BISAC Identification Committee -- ISTCBISAC Identification Committee -- ISTC
BISAC Identification Committee -- ISTC
 
NASIG 2021 Don't wait automate! Industry perspectives on KBART automation
NASIG 2021   Don't wait automate! Industry perspectives on KBART automationNASIG 2021   Don't wait automate! Industry perspectives on KBART automation
NASIG 2021 Don't wait automate! Industry perspectives on KBART automation
 
Meeting the aims of Plan M and streamlining metadata workflows with the BDS A...
Meeting the aims of Plan M and streamlining metadata workflows with the BDS A...Meeting the aims of Plan M and streamlining metadata workflows with the BDS A...
Meeting the aims of Plan M and streamlining metadata workflows with the BDS A...
 
POOFCharleston2011
POOFCharleston2011POOFCharleston2011
POOFCharleston2011
 
38 cc 4_a_r-rosy
38 cc 4_a_r-rosy38 cc 4_a_r-rosy
38 cc 4_a_r-rosy
 
Building a Better Knowledgebase: An Investigation of Current Practical Uses a...
Building a Better Knowledgebase: An Investigation of Current Practical Uses a...Building a Better Knowledgebase: An Investigation of Current Practical Uses a...
Building a Better Knowledgebase: An Investigation of Current Practical Uses a...
 
Levin, Benson, Johnson, Heaton, and Rathemacher "KBART 101: An Introduction t...
Levin, Benson, Johnson, Heaton, and Rathemacher "KBART 101: An Introduction t...Levin, Benson, Johnson, Heaton, and Rathemacher "KBART 101: An Introduction t...
Levin, Benson, Johnson, Heaton, and Rathemacher "KBART 101: An Introduction t...
 
Kent State University Libraries Develops a New System for Resource Selection
Kent State University Libraries Develops a New System for Resource SelectionKent State University Libraries Develops a New System for Resource Selection
Kent State University Libraries Develops a New System for Resource Selection
 
Kuali update v4 - mw
Kuali update   v4 - mwKuali update   v4 - mw
Kuali update v4 - mw
 
UKSG 2023 - Streamlining monograph metadata supply with BDS
UKSG 2023 - Streamlining monograph metadata supply with BDSUKSG 2023 - Streamlining monograph metadata supply with BDS
UKSG 2023 - Streamlining monograph metadata supply with BDS
 
Accelerating Delivery of Data Products - The EBSCO Way
Accelerating Delivery of Data Products - The EBSCO WayAccelerating Delivery of Data Products - The EBSCO Way
Accelerating Delivery of Data Products - The EBSCO Way
 
ER&L 2022 - Set It and Forget It: Librarian, Publisher, and Vendor Perspectiv...
ER&L 2022 - Set It and Forget It: Librarian, Publisher, and Vendor Perspectiv...ER&L 2022 - Set It and Forget It: Librarian, Publisher, and Vendor Perspectiv...
ER&L 2022 - Set It and Forget It: Librarian, Publisher, and Vendor Perspectiv...
 
How companies use NoSQL and Couchbase
How companies use NoSQL and CouchbaseHow companies use NoSQL and Couchbase
How companies use NoSQL and Couchbase
 
Building enterprise records management solutions for share point 2010
Building enterprise records management solutions for share point 2010Building enterprise records management solutions for share point 2010
Building enterprise records management solutions for share point 2010
 
Publishing Partnerships: Why, When, and How Collaboration Sometimes Trumps Co...
Publishing Partnerships: Why, When, and How Collaboration Sometimes Trumps Co...Publishing Partnerships: Why, When, and How Collaboration Sometimes Trumps Co...
Publishing Partnerships: Why, When, and How Collaboration Sometimes Trumps Co...
 

Mais de Bowker

Mais de Bowker (20)

Ebook Central Submission Guide for Content Providers -- Revised, July 2020
Ebook Central Submission Guide for Content Providers -- Revised, July 2020Ebook Central Submission Guide for Content Providers -- Revised, July 2020
Ebook Central Submission Guide for Content Providers -- Revised, July 2020
 
2018 Metadata Tips A to Z
2018 Metadata Tips A to Z2018 Metadata Tips A to Z
2018 Metadata Tips A to Z
 
Enhanced Metadata for Discovery -- Beyond the Basics
Enhanced Metadata for Discovery -- Beyond the BasicsEnhanced Metadata for Discovery -- Beyond the Basics
Enhanced Metadata for Discovery -- Beyond the Basics
 
BEA Content & Digital Conference Leading Readers to Your Children's and YA Co...
BEA Content & Digital Conference Leading Readers to Your Children's and YA Co...BEA Content & Digital Conference Leading Readers to Your Children's and YA Co...
BEA Content & Digital Conference Leading Readers to Your Children's and YA Co...
 
BEA Content & Digital Conference Maximizing Metadata & Improving the Bottom Line
BEA Content & Digital Conference Maximizing Metadata & Improving the Bottom LineBEA Content & Digital Conference Maximizing Metadata & Improving the Bottom Line
BEA Content & Digital Conference Maximizing Metadata & Improving the Bottom Line
 
IDPF Digicon Future of Metadata
IDPF Digicon Future of MetadataIDPF Digicon Future of Metadata
IDPF Digicon Future of Metadata
 
BEA Content & Digital Conference & IDPF 2016
BEA Content & Digital Conference & IDPF 2016BEA Content & Digital Conference & IDPF 2016
BEA Content & Digital Conference & IDPF 2016
 
PSP Subject Discovery
PSP Subject DiscoveryPSP Subject Discovery
PSP Subject Discovery
 
The Higher Education Persona
The Higher Education PersonaThe Higher Education Persona
The Higher Education Persona
 
Creating Effective ONIX Metadata: Five Keys to Promote Discovery
Creating Effective ONIX Metadata:  Five Keys to Promote DiscoveryCreating Effective ONIX Metadata:  Five Keys to Promote Discovery
Creating Effective ONIX Metadata: Five Keys to Promote Discovery
 
BNC Educational Standards
BNC Educational StandardsBNC Educational Standards
BNC Educational Standards
 
UPU 2015 Get Discovered By the Right Readers--Keywords
UPU 2015 Get Discovered By the Right Readers--KeywordsUPU 2015 Get Discovered By the Right Readers--Keywords
UPU 2015 Get Discovered By the Right Readers--Keywords
 
BEA 2015 Generating Metadata by Machine
BEA 2015 Generating Metadata by MachineBEA 2015 Generating Metadata by Machine
BEA 2015 Generating Metadata by Machine
 
5 Cool Things You Didn't Know You Could Do With Metadata
5 Cool Things You Didn't Know You Could Do With Metadata5 Cool Things You Didn't Know You Could Do With Metadata
5 Cool Things You Didn't Know You Could Do With Metadata
 
BEA 2015 Demystifying Subject Codes and Keywords
BEA 2015 Demystifying Subject Codes and KeywordsBEA 2015 Demystifying Subject Codes and Keywords
BEA 2015 Demystifying Subject Codes and Keywords
 
Pubwest metadata exposed
Pubwest metadata exposedPubwest metadata exposed
Pubwest metadata exposed
 
Improving Subject Coding
Improving Subject CodingImproving Subject Coding
Improving Subject Coding
 
AAUP 2014--Metadata Standards
AAUP 2014--Metadata StandardsAAUP 2014--Metadata Standards
AAUP 2014--Metadata Standards
 
BEA 2014--Let Common Core Power Your Publishing Accompanying Script
BEA 2014--Let Common Core Power Your Publishing Accompanying ScriptBEA 2014--Let Common Core Power Your Publishing Accompanying Script
BEA 2014--Let Common Core Power Your Publishing Accompanying Script
 
BEA 2014--Let Common Core Power Your Publishing
BEA 2014--Let Common Core Power Your PublishingBEA 2014--Let Common Core Power Your Publishing
BEA 2014--Let Common Core Power Your Publishing
 

Último

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Último (20)

Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 

Overview of Bowker's Metdata Processes

  • 1. Overview of Bowker’s Metadata Processes Patricia Payton Senior Director, Publisher Relations & Content Development for Bowker 908.219.0241 Patricia.Payton@bowker.com
  • 2. Agenda • Bowker’s role in the marketplace – Customer workflows – Selected client lists – Bowker products • Bowker metadata management – Aggregated and enhanced content – Value added processes – Processing of publisher data and audits – Testing new data feeds – Publisher outreach priorities • Next cooperative steps
  • 3. Bowker’s Role in the Marketplace
  • 4.
  • 5.
  • 6.
  • 7. Representative Clients by Market Segment Publishers: Retailers: Libraries: Random House Barnes & Noble New York Public HarperCollins Follett College Brooklyn Public Hachette Stores Chicago Public Elsevier Indigo Johns Hopkins University Macmillan Abebooks.com Harvard Cengage Hastings Yale Wiley Sony Princeton Apple Queens Borough Public Schools: eBay State of Oklahoma NYC DOE web services: Anthology Blackboard.com Columbia University New York Times MIT EdMap
  • 8. Bowker eBook Customers Today • 45 customers currently purchase eBook data feeds – Borders, B&N, SWETS, NY Times • Libraries need central repository to better identify all eContent • All products incorporate eBook metadata – Publisher data represents 82% of collection – Aggregator and conversion house data also stored
  • 9. Search & Discovery Products Bowker Books In Print® • > 1,200 retail & library clients (>10K locations) make buying decisions using this online bibliographic reference tool • Content is aggregated and standardized • > 20M records; > 13M “active” book records
  • 10. Search & Discovery Products Bowker ® Syndetic Solutions • Library catalog (OPAC) enrichment service • 2.3B queries/month; >11M content elements; updated weekly • Cover images, Tables of Contents, Summaries, Reviews, First chapters, Author notes, Awards, & Knowledge Profiles • Includes books, videos & music in English, Spanish, German, Swedish & Italian • Analytics show users search “long tail”—29K hits, most requested title 18
  • 11. Traditional vs. Content Searching Searches select metadata fields only Searches all available content
  • 12. Search & Discovery Products Bowker Data Licensing • Embed data in customer acquisition & workflow processes • 60+ clients including major retailers, small startups, eBook platforms, and search engines • User controls processing rules • Works via pull or push methods
  • 13. Metadata Management Customer and Product Needs Audits and Aggregated Gap Filling Metadata Value Added Enhanced Processes Content
  • 14. Aggregated & Enhanced Content Content from the Supply Chain • Data Feeds – National Libraries, Publishers, & Distributors • Price & Availability Notifications – Wholesalers Licensed Content • Full Text Reviews – PW, LJ and SLJ, NYT (adding UK sources in 2011) • Review citations – 10 trusted sources Included on > 145K ISBNs • NY Times Book Review, Los Angeles Times, San Fran. Chronicle, and USA Today Bowker Created Content • Author Biographies – > 80,000 authors • Bestseller lists – 23 sources • Including New York Times, Los Angeles Times, USA Today, and The Wall Street Journal • Included on >225K ISBNs even Audio, Video, Print and E-Book • > 100 Years At a Glance synopses • Detail listings for PW and NY Times on position and length on list • Media mentions – 25 sources • Business Week, Entertainment Weekly, Time, Good Morning America, Oprah, NPR • Awards – > 400 sources • Knowledge Profiles – 225K unique across all subjects • Genre and sub-genre, Author, Title, Characters of book and traits, themes, keyword related • U.S. titles only
  • 16. Value Added Processes Subject Classification • Bowker stores and forwards publisher-assigned BISAC subject codes • Many of Bowker customers use our more specific subject terms • Bowker’s scheme has > 80K Bowker subject terms compared to BISAC’s 3700 codes • All Bowker codes are mapped to BISAC and BIC codes for easy updating Title Linking • ISBNs of the same intellectual work are linked • Title, subtitle, and first contributor matches are given a unique title record number • Unique title record number links all editions to valued-added data such as: • Bowker subject classifications • Reviews & review citations • Awards • Media mentions • Bestseller notations • Chapter excerpts • Dewey, Library of Congress and British Library classification schemes • Lexile measures from Metametrics (for children’s books)
  • 17. Linking Enriched Content Across Formats
  • 18. Linking eBook Metadata • Feature vendor specific information • Display of agency and institutional pricing
  • 19. Processing of Publisher Data File Process • Process goal is 48 hours of receipt • Automated process pulls from FTP and submits each file • Data locks down 90 days past publication date • Only updates to status, returns, and price related fields are allowed Individual file audit reports run • Exclude Report-- • ISBN is invalid (e.g., 9 digits, or check-digit will not validate) • Publisher is not properly linked to current Distributor • New Imprint for publisher is in file but not in Bowker’s Publisher Authority database • ISBN status is “No Longer Stocked by Us” or “Refer to another Supplier” (meaning the supplier of the file no longer carries that ISBN) • Title Change Report • Contributor Change Report Processes vary for print, eBooks, and cover images
  • 20. Database Audit Processes Daily • Query/review prices over $400 Weekly • High profile titles Monthly • Un-fielded data • Upper case titles • Undefined articles • Bestselling and classic authors are cleaned • Bad contributor cleaning • Research ISBNs with “untitled” titles • Remove pipe characters, carriage returns and line feeds from titles and contributors On demand • Review for timeliness of data • Bad publisher/imprint symbols
  • 21. Testing Process for New Feeds Publisher Data Integration Quality Assurance Production Relations • Map file • In-depth quality • FTP account set up imprint/publishers review of all titles • Statement of Use • Validation of to our database • Compare file to supplied to ONIX files • Load data to test data already in BIP publisher • Check required system • Review • Cover images data fields • Work excludes completeness of requested present • Supply audit of data • Brief quality scan records to QA • For Excel files, verify scripting • Determine was correct quantity of records supplied • Write script for conversion of Excel files to ONIX File can move on File can move on in process or be in process or be returned to returned to publisher 6 weeks publisher average wait 1 week on time due 2 weeks on average to complete the testing process average to files in queue
  • 22. Publisher Outreach Priorities Gap filling • Forthcoming titles (i.e. price, annotation, and cover image at 60 days prior to publication) • Validating that older titles (pre 2000) that are still active in our system are still available • Identifying issues around items lacking prices in our system • Including items that were cancelled, are not for sale separately, or are no longer distributed Establishing eBook metadata feeds • With publishers, eBook aggregators and distributors Free full content indexing service • Whereby Bowker extracts keywords and phrases with relevancy and frequency scores to embed behind the scenes in products Understanding the use of ISBNs for digital products
  • 23. Next Cooperative Steps • Data Submission Guides • Additional documents available – Data integrity document (more detail on audit reports and processes) – Publisher profile data (details on current state of your data) • Exchange contact details for particular types of issues • Discuss file format and data fields best for your title set • Set date for test file submission
  • 24. About Bowker Bowker is the world's leading provider of bibliographic information management solutions designed to help publishers, booksellers, and libraries better serve their customers. The company is focused on developing various tools and products that make books easier for people to discover, evaluate, order, and experience, as well as providing services to publishers that help them better understand and meet the interests of readers worldwide. Bowker is an affiliated business of ProQuest and is headquartered in New Providence, New Jersey, with additional operations in England and Australia. For more information, please visit www.bowker.com. Follow Us On Twitter @DiscoverBowker