SlideShare uma empresa Scribd logo
1 de 18
Baixar para ler offline
From Records To
   Statements
    Taking the Leap
What’s different
about statement
     data?

Library data compliance
has been defined by
consensus since MARC
was a pup
But outside the MARC
silo we need different
strategies
To accomplish this we
need to look at value,
costs and investments
very differently

                          2
                              Flickr photo by Robert Jagendorf ALA Dallas, 1/20/12
What Are Statements?

• A MARC record can be viewed as an aggregation
  of statements
  • All the attribute = value pairs relate to the same
    resource

• In a linked data world, statements are dis-
  aggregated and each carries the relationship to a
  resource as the ‘subject’ of each triple
• Though it seems more complicated to deal with
  statements in isolation, it is really simpler (the
  complications are that we know little about it)

                             3                           ALA Dallas, 1/20/12
Future Metadata
              Strategies
• Statement level rather than record level management
• Records as units of transport rather than units of
  management
• Emphasis on evaluation coming in and provenance
  going out
• Shift in human effort from creating standard cataloging
  to careful human intervention in machine-based
  processes
• Extensive use of data created outside libraries
• Intelligent re-use of our legacy data
                             4                         ALA Dallas, 1/20/12
Managing Statements




       http://dcpapers.dublincore.org/ojs/pubs/article/view/770/766
            5                                       ALA Dallas, 1/20/12
[Possible] New Roles for
           Librarians
• Aggregators of relevant metadata content
  • Developing methods to expose & redistribute without a
    central node

• Modeling and documenting best practices in metadata
  creation, improvement and exposure
  • Application profiles important in this effort

• Developers of vocabularies using bibliographic
  relationships
• Innovators in using social networks to enhance
  bibliographic description


                                6                       ALA Dallas, 1/20/12
Re-Thinking Metadata
        Management




7               ALA Dallas, 1/20/12
8   ALA Dallas, 1/20/12
Harvest/Ingest Plan

• Choosing data sources
  • There are known sources out there, some of them
    are of good quality, others are usable, with
    improvement

• Tools are needed to help pull data, validate it,
  cache it, and set it up for evaluation
  • Most of these tasks can/should be set up with
    automated processes, with alerts to human minders
    when something goes wrong


                           9                         ALA Dallas, 1/20/12
10   ALA Dallas, 1/20/12
Metadata Evaluation

• Evaluation needs to scale well beyond random
  sampling

• Statistical and data mining tools need to be
  brought into the process, to provide both
  ‘overview’ and specifics of whole data sets

• Improvement specifications, techniques, quality
  criteria and tools need to be iterative, granular,
  and shareable


                           11                      ALA Dallas, 1/20/12
12   ALA Dallas, 1/20/12
Testing, Monitoring & Re-
               evaluation

• Data will change, and processes must be able to
  detect that, based on data profiles
  • Human intervention should be limited

• Tools need to be built so that non-programmers
  can run them
  • Reading logs, monitoring error reports, checking
    results, writing specs, can/should be done by data
    specialists (a.k.a. catalogers w/training)
  • Looking for opportunities for programmers and
    catalogers to learn together is essential

                           13                        ALA Dallas, 1/20/12
14   ALA Dallas, 1/20/12
Re-distribution Plan

• If we improve data, we need to expose how we
  did it (and what we did), for the use of
  downstream consumers
  • New metadata provenance efforts designed to do
    this at the statement level

• This strategy can only exist successfully where
  open licenses allow innovation and wide re-use

• Ideally, distribution AND redistribution should be
  accomplished with Application Profiles

                          15                      ALA Dallas, 1/20/12
Will This Shift Cost Too
               Much?
• It’s the human effort that costs us
  • Cost of traditional cataloging is far too high, for
    increasingly dubious value

• Our current investments have reached the end of their
  usefulness
  • All the possible efficiencies for traditional cataloging have
    already been accomplished

• Waiting for leadership from the big players costs us
  valuable time with no guarantees of results
• We need to figure out how to invest in more distributed
  innovation and focused collaboration

                                16                           ALA Dallas, 1/20/12
ROI in the LOD World

• Free metadata is essential in a ‘culture economy’
  • We need eyeballs, attention, connection for our
    content!

• Thinking about ROI based on recovering the cost
  of creating metadata is a dead end

• To drive people to your content, you need to put
  your data out there
  • But once it’s there, it’s out of your control, and we
    need to get comfortable with that

                             17                         ALA Dallas, 1/20/12
Thank you!
 Questions?
Contact info:
metadata.maven
@gmail.com


Metadata
Matters:
http://managemet
adata.com/blog



       ALA Dallas, 1/20/12   18

Mais conteúdo relacionado

Semelhante a Managing statements

Data Catalog as a Business Enabler
Data Catalog as a Business EnablerData Catalog as a Business Enabler
Data Catalog as a Business EnablerSrinivasan Sankar
 
Data Systems Integration & Business Value Pt. 1: Metadata
Data Systems Integration & Business Value Pt. 1: MetadataData Systems Integration & Business Value Pt. 1: Metadata
Data Systems Integration & Business Value Pt. 1: MetadataDATAVERSITY
 
Data-Ed: Data Systems Integration & Business Value PT. 1: Metadata
Data-Ed: Data Systems Integration & Business Value PT. 1: MetadataData-Ed: Data Systems Integration & Business Value PT. 1: Metadata
Data-Ed: Data Systems Integration & Business Value PT. 1: MetadataData Blueprint
 
The Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They NeedThe Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They NeedDunn Solutions Group
 
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Denodo
 
chapter1-220725121543-7c158b33.pdf
chapter1-220725121543-7c158b33.pdfchapter1-220725121543-7c158b33.pdf
chapter1-220725121543-7c158b33.pdfMahmoudSOLIMAN380726
 
Chapter 1: The Importance of Data Assets
Chapter 1: The Importance of Data AssetsChapter 1: The Importance of Data Assets
Chapter 1: The Importance of Data AssetsAhmed Alorage
 
Metadata Strategies - Data Squared
Metadata Strategies - Data SquaredMetadata Strategies - Data Squared
Metadata Strategies - Data SquaredDATAVERSITY
 
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...DATAVERSITY
 
Navigating the BI Stack _
Navigating the BI Stack _Navigating the BI Stack _
Navigating the BI Stack _Michael Phipps
 
Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures DATAVERSITY
 
Data-Ed: Design and Manage Data Structures
Data-Ed: Design and Manage Data Structures Data-Ed: Design and Manage Data Structures
Data-Ed: Design and Manage Data Structures Data Blueprint
 
Maximizing The Value of Your Structured and Unstructured Data with Data Catal...
Maximizing The Value of Your Structured and Unstructured Data with Data Catal...Maximizing The Value of Your Structured and Unstructured Data with Data Catal...
Maximizing The Value of Your Structured and Unstructured Data with Data Catal...Molly Alexander
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationDatabricks
 
Differences between data lakes and datawarehouse
  Differences between data lakes and datawarehouse  Differences between data lakes and datawarehouse
Differences between data lakes and datawarehouseamarkayam
 
Data-Ed Online Webinar: Metadata Strategies
Data-Ed Online Webinar: Metadata StrategiesData-Ed Online Webinar: Metadata Strategies
Data-Ed Online Webinar: Metadata StrategiesDATAVERSITY
 
Metadata Strategies
Metadata StrategiesMetadata Strategies
Metadata StrategiesDATAVERSITY
 
RDMRose 1.6 Research data services
RDMRose 1.6 Research data servicesRDMRose 1.6 Research data services
RDMRose 1.6 Research data servicesRDMRose
 

Semelhante a Managing statements (20)

Data Catalog as a Business Enabler
Data Catalog as a Business EnablerData Catalog as a Business Enabler
Data Catalog as a Business Enabler
 
Data Systems Integration & Business Value Pt. 1: Metadata
Data Systems Integration & Business Value Pt. 1: MetadataData Systems Integration & Business Value Pt. 1: Metadata
Data Systems Integration & Business Value Pt. 1: Metadata
 
Data-Ed: Data Systems Integration & Business Value PT. 1: Metadata
Data-Ed: Data Systems Integration & Business Value PT. 1: MetadataData-Ed: Data Systems Integration & Business Value PT. 1: Metadata
Data-Ed: Data Systems Integration & Business Value PT. 1: Metadata
 
How to build a successful Data Lake
How to build a successful Data LakeHow to build a successful Data Lake
How to build a successful Data Lake
 
The Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They NeedThe Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They Need
 
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
 
chapter1-220725121543-7c158b33.pdf
chapter1-220725121543-7c158b33.pdfchapter1-220725121543-7c158b33.pdf
chapter1-220725121543-7c158b33.pdf
 
Chapter 1: The Importance of Data Assets
Chapter 1: The Importance of Data AssetsChapter 1: The Importance of Data Assets
Chapter 1: The Importance of Data Assets
 
Metadata Strategies - Data Squared
Metadata Strategies - Data SquaredMetadata Strategies - Data Squared
Metadata Strategies - Data Squared
 
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
 
Navigating the BI Stack _
Navigating the BI Stack _Navigating the BI Stack _
Navigating the BI Stack _
 
Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures
 
Data-Ed: Design and Manage Data Structures
Data-Ed: Design and Manage Data Structures Data-Ed: Design and Manage Data Structures
Data-Ed: Design and Manage Data Structures
 
Maximizing The Value of Your Structured and Unstructured Data with Data Catal...
Maximizing The Value of Your Structured and Unstructured Data with Data Catal...Maximizing The Value of Your Structured and Unstructured Data with Data Catal...
Maximizing The Value of Your Structured and Unstructured Data with Data Catal...
 
Big Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARLBig Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARL
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with Alation
 
Differences between data lakes and datawarehouse
  Differences between data lakes and datawarehouse  Differences between data lakes and datawarehouse
Differences between data lakes and datawarehouse
 
Data-Ed Online Webinar: Metadata Strategies
Data-Ed Online Webinar: Metadata StrategiesData-Ed Online Webinar: Metadata Strategies
Data-Ed Online Webinar: Metadata Strategies
 
Metadata Strategies
Metadata StrategiesMetadata Strategies
Metadata Strategies
 
RDMRose 1.6 Research data services
RDMRose 1.6 Research data servicesRDMRose 1.6 Research data services
RDMRose 1.6 Research data services
 

Mais de Diane Hillmann

RDA and Linked Data: where's the beef
RDA and Linked Data: where's the beefRDA and Linked Data: where's the beef
RDA and Linked Data: where's the beefDiane Hillmann
 
RDA: Alive and Well and Still Speaking MARC
RDA: Alive and Well and Still Speaking MARCRDA: Alive and Well and Still Speaking MARC
RDA: Alive and Well and Still Speaking MARCDiane Hillmann
 
Vocabulary Development for Local Use: A DIY Introduction
Vocabulary Development for Local Use: A DIY IntroductionVocabulary Development for Local Use: A DIY Introduction
Vocabulary Development for Local Use: A DIY IntroductionDiane Hillmann
 
What Can We Do About Our Legacy Data?
What Can We Do About Our Legacy Data?What Can We Do About Our Legacy Data?
What Can We Do About Our Legacy Data?Diane Hillmann
 
Moving to an open world
Moving to an open worldMoving to an open world
Moving to an open worldDiane Hillmann
 
Versioning for Authorities, presentation at Midwinter Chicago 2015
Versioning  for Authorities, presentation at Midwinter Chicago 2015Versioning  for Authorities, presentation at Midwinter Chicago 2015
Versioning for Authorities, presentation at Midwinter Chicago 2015Diane Hillmann
 
RDA as linked data (RDA Forum)
RDA as linked data (RDA Forum)RDA as linked data (RDA Forum)
RDA as linked data (RDA Forum)Diane Hillmann
 
What is an RDA Record?
What is an RDA Record?What is an RDA Record?
What is an RDA Record?Diane Hillmann
 
The RDA Vocabularies: What They Are, How They Work
The RDA Vocabularies: What They Are, How They WorkThe RDA Vocabularies: What They Are, How They Work
The RDA Vocabularies: What They Are, How They WorkDiane Hillmann
 
Oregon State visit 2011
Oregon State visit 2011Oregon State visit 2011
Oregon State visit 2011Diane Hillmann
 
RDA & the New World of Metadata
RDA & the New World of MetadataRDA & the New World of Metadata
RDA & the New World of MetadataDiane Hillmann
 
The Other Side of Linked Open Data: Managing Metadata Aggregation
The Other Side of Linked Open Data: Managing Metadata AggregationThe Other Side of Linked Open Data: Managing Metadata Aggregation
The Other Side of Linked Open Data: Managing Metadata AggregationDiane Hillmann
 
A Consideration of Library Holdings in the World Beyond MARC
A Consideration of Library Holdings in the World Beyond MARCA Consideration of Library Holdings in the World Beyond MARC
A Consideration of Library Holdings in the World Beyond MARCDiane Hillmann
 
Maps & gaps: strategies for vocabulary design and development
Maps & gaps: strategies for vocabulary design and developmentMaps & gaps: strategies for vocabulary design and development
Maps & gaps: strategies for vocabulary design and developmentDiane Hillmann
 
NISO Bibliographic Roadmap Meeting Proposal
NISO Bibliographic Roadmap Meeting ProposalNISO Bibliographic Roadmap Meeting Proposal
NISO Bibliographic Roadmap Meeting ProposalDiane Hillmann
 
Challenges for a new era
Challenges for a new eraChallenges for a new era
Challenges for a new eraDiane Hillmann
 

Mais de Diane Hillmann (20)

RDA and Linked Data: where's the beef
RDA and Linked Data: where's the beefRDA and Linked Data: where's the beef
RDA and Linked Data: where's the beef
 
RDA: Alive and Well and Still Speaking MARC
RDA: Alive and Well and Still Speaking MARCRDA: Alive and Well and Still Speaking MARC
RDA: Alive and Well and Still Speaking MARC
 
Vocabulary Development for Local Use: A DIY Introduction
Vocabulary Development for Local Use: A DIY IntroductionVocabulary Development for Local Use: A DIY Introduction
Vocabulary Development for Local Use: A DIY Introduction
 
What Can We Do About Our Legacy Data?
What Can We Do About Our Legacy Data?What Can We Do About Our Legacy Data?
What Can We Do About Our Legacy Data?
 
Moving to an open world
Moving to an open worldMoving to an open world
Moving to an open world
 
Why change?
Why change?Why change?
Why change?
 
Versioning for Authorities, presentation at Midwinter Chicago 2015
Versioning  for Authorities, presentation at Midwinter Chicago 2015Versioning  for Authorities, presentation at Midwinter Chicago 2015
Versioning for Authorities, presentation at Midwinter Chicago 2015
 
RDA as linked data (RDA Forum)
RDA as linked data (RDA Forum)RDA as linked data (RDA Forum)
RDA as linked data (RDA Forum)
 
What's goin' on?
What's goin' on?What's goin' on?
What's goin' on?
 
Playing with Jane
Playing with JanePlaying with Jane
Playing with Jane
 
What is an RDA Record?
What is an RDA Record?What is an RDA Record?
What is an RDA Record?
 
The RDA Vocabularies: What They Are, How They Work
The RDA Vocabularies: What They Are, How They WorkThe RDA Vocabularies: What They Are, How They Work
The RDA Vocabularies: What They Are, How They Work
 
Oregon State visit 2011
Oregon State visit 2011Oregon State visit 2011
Oregon State visit 2011
 
RDA & the New World of Metadata
RDA & the New World of MetadataRDA & the New World of Metadata
RDA & the New World of Metadata
 
The Other Side of Linked Open Data: Managing Metadata Aggregation
The Other Side of Linked Open Data: Managing Metadata AggregationThe Other Side of Linked Open Data: Managing Metadata Aggregation
The Other Side of Linked Open Data: Managing Metadata Aggregation
 
Mapmakers
MapmakersMapmakers
Mapmakers
 
A Consideration of Library Holdings in the World Beyond MARC
A Consideration of Library Holdings in the World Beyond MARCA Consideration of Library Holdings in the World Beyond MARC
A Consideration of Library Holdings in the World Beyond MARC
 
Maps & gaps: strategies for vocabulary design and development
Maps & gaps: strategies for vocabulary design and developmentMaps & gaps: strategies for vocabulary design and development
Maps & gaps: strategies for vocabulary design and development
 
NISO Bibliographic Roadmap Meeting Proposal
NISO Bibliographic Roadmap Meeting ProposalNISO Bibliographic Roadmap Meeting Proposal
NISO Bibliographic Roadmap Meeting Proposal
 
Challenges for a new era
Challenges for a new eraChallenges for a new era
Challenges for a new era
 

Último

Cybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxCybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxGDSC PJATK
 
UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1DianaGray10
 
Linked Data in Production: Moving Beyond Ontologies
Linked Data in Production: Moving Beyond OntologiesLinked Data in Production: Moving Beyond Ontologies
Linked Data in Production: Moving Beyond OntologiesDavid Newbury
 
UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6DianaGray10
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration WorkflowsIgniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration WorkflowsSafe Software
 
UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...
UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...
UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...UbiTrack UK
 
Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )Brian Pichman
 
UiPath Community: AI for UiPath Automation Developers
UiPath Community: AI for UiPath Automation DevelopersUiPath Community: AI for UiPath Automation Developers
UiPath Community: AI for UiPath Automation DevelopersUiPathCommunity
 
UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7DianaGray10
 
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Will Schroeder
 
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostKubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostMatt Ray
 
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdf
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdfIaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdf
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdfDaniel Santiago Silva Capera
 
Comparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and IstioComparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and IstioChristian Posta
 
VoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBXVoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBXTarek Kalaji
 
NIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopNIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopBachir Benyammi
 
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAAnypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAshyamraj55
 
Building AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptxBuilding AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptxUdaiappa Ramachandran
 
Nanopower In Semiconductor Industry.pdf
Nanopower  In Semiconductor Industry.pdfNanopower  In Semiconductor Industry.pdf
Nanopower In Semiconductor Industry.pdfPedro Manuel
 
AI Fame Rush Review – Virtual Influencer Creation In Just Minutes
AI Fame Rush Review – Virtual Influencer Creation In Just MinutesAI Fame Rush Review – Virtual Influencer Creation In Just Minutes
AI Fame Rush Review – Virtual Influencer Creation In Just MinutesMd Hossain Ali
 

Último (20)

Cybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxCybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptx
 
201610817 - edge part1
201610817 - edge part1201610817 - edge part1
201610817 - edge part1
 
UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1
 
Linked Data in Production: Moving Beyond Ontologies
Linked Data in Production: Moving Beyond OntologiesLinked Data in Production: Moving Beyond Ontologies
Linked Data in Production: Moving Beyond Ontologies
 
UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration WorkflowsIgniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
 
UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...
UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...
UWB Technology for Enhanced Indoor and Outdoor Positioning in Physiological M...
 
Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )
 
UiPath Community: AI for UiPath Automation Developers
UiPath Community: AI for UiPath Automation DevelopersUiPath Community: AI for UiPath Automation Developers
UiPath Community: AI for UiPath Automation Developers
 
UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7
 
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
 
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostKubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
 
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdf
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdfIaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdf
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdf
 
Comparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and IstioComparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and Istio
 
VoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBXVoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBX
 
NIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopNIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 Workshop
 
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAAnypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
 
Building AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptxBuilding AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptx
 
Nanopower In Semiconductor Industry.pdf
Nanopower  In Semiconductor Industry.pdfNanopower  In Semiconductor Industry.pdf
Nanopower In Semiconductor Industry.pdf
 
AI Fame Rush Review – Virtual Influencer Creation In Just Minutes
AI Fame Rush Review – Virtual Influencer Creation In Just MinutesAI Fame Rush Review – Virtual Influencer Creation In Just Minutes
AI Fame Rush Review – Virtual Influencer Creation In Just Minutes
 

Managing statements

  • 1. From Records To Statements Taking the Leap
  • 2. What’s different about statement data? Library data compliance has been defined by consensus since MARC was a pup But outside the MARC silo we need different strategies To accomplish this we need to look at value, costs and investments very differently 2 Flickr photo by Robert Jagendorf ALA Dallas, 1/20/12
  • 3. What Are Statements? • A MARC record can be viewed as an aggregation of statements • All the attribute = value pairs relate to the same resource • In a linked data world, statements are dis- aggregated and each carries the relationship to a resource as the ‘subject’ of each triple • Though it seems more complicated to deal with statements in isolation, it is really simpler (the complications are that we know little about it) 3 ALA Dallas, 1/20/12
  • 4. Future Metadata Strategies • Statement level rather than record level management • Records as units of transport rather than units of management • Emphasis on evaluation coming in and provenance going out • Shift in human effort from creating standard cataloging to careful human intervention in machine-based processes • Extensive use of data created outside libraries • Intelligent re-use of our legacy data 4 ALA Dallas, 1/20/12
  • 5. Managing Statements http://dcpapers.dublincore.org/ojs/pubs/article/view/770/766 5 ALA Dallas, 1/20/12
  • 6. [Possible] New Roles for Librarians • Aggregators of relevant metadata content • Developing methods to expose & redistribute without a central node • Modeling and documenting best practices in metadata creation, improvement and exposure • Application profiles important in this effort • Developers of vocabularies using bibliographic relationships • Innovators in using social networks to enhance bibliographic description 6 ALA Dallas, 1/20/12
  • 7. Re-Thinking Metadata Management 7 ALA Dallas, 1/20/12
  • 8. 8 ALA Dallas, 1/20/12
  • 9. Harvest/Ingest Plan • Choosing data sources • There are known sources out there, some of them are of good quality, others are usable, with improvement • Tools are needed to help pull data, validate it, cache it, and set it up for evaluation • Most of these tasks can/should be set up with automated processes, with alerts to human minders when something goes wrong 9 ALA Dallas, 1/20/12
  • 10. 10 ALA Dallas, 1/20/12
  • 11. Metadata Evaluation • Evaluation needs to scale well beyond random sampling • Statistical and data mining tools need to be brought into the process, to provide both ‘overview’ and specifics of whole data sets • Improvement specifications, techniques, quality criteria and tools need to be iterative, granular, and shareable 11 ALA Dallas, 1/20/12
  • 12. 12 ALA Dallas, 1/20/12
  • 13. Testing, Monitoring & Re- evaluation • Data will change, and processes must be able to detect that, based on data profiles • Human intervention should be limited • Tools need to be built so that non-programmers can run them • Reading logs, monitoring error reports, checking results, writing specs, can/should be done by data specialists (a.k.a. catalogers w/training) • Looking for opportunities for programmers and catalogers to learn together is essential 13 ALA Dallas, 1/20/12
  • 14. 14 ALA Dallas, 1/20/12
  • 15. Re-distribution Plan • If we improve data, we need to expose how we did it (and what we did), for the use of downstream consumers • New metadata provenance efforts designed to do this at the statement level • This strategy can only exist successfully where open licenses allow innovation and wide re-use • Ideally, distribution AND redistribution should be accomplished with Application Profiles 15 ALA Dallas, 1/20/12
  • 16. Will This Shift Cost Too Much? • It’s the human effort that costs us • Cost of traditional cataloging is far too high, for increasingly dubious value • Our current investments have reached the end of their usefulness • All the possible efficiencies for traditional cataloging have already been accomplished • Waiting for leadership from the big players costs us valuable time with no guarantees of results • We need to figure out how to invest in more distributed innovation and focused collaboration 16 ALA Dallas, 1/20/12
  • 17. ROI in the LOD World • Free metadata is essential in a ‘culture economy’ • We need eyeballs, attention, connection for our content! • Thinking about ROI based on recovering the cost of creating metadata is a dead end • To drive people to your content, you need to put your data out there • But once it’s there, it’s out of your control, and we need to get comfortable with that 17 ALA Dallas, 1/20/12
  • 18. Thank you! Questions? Contact info: metadata.maven @gmail.com Metadata Matters: http://managemet adata.com/blog ALA Dallas, 1/20/12 18

Notas do Editor

  1. (a la the open source OPAC replacements)