SlideShare uma empresa Scribd logo
1 de 20
Baixar para ler offline
Metadata Magic
By Mary S. Alexander
Metadata Librarian, The University of Alabama Libraries



  Given by Jeannine Keefer
  Moderator

  VRA Conference
  Case Studies IV
  March 21, 2012
The beginning of digitization
Dublin Core
•   15 elements
•   element qualifiers
•   alpha tags
•   broad definitions
•   easy to use
•   optional elements.
•   repeatable elements
Moving from Endeavor to
CONTENTdm
Data in CONTENTdm
•   <dc.title>
•   <dc.creator>
•   <dc.creator> displayLabel=“Artist”
•   <dc.creator> displayLabel=“Author”
•   <dc.creator> displayLabel=“Sender”
•   <dc.date>
•   <dc.publisher>
•   <dc.format>
•   <dc.coverage>
•   <dc.contributor>
Production growth
Display labels for master
spreadsheet
•   Title
•   Other title
•   Cover title
•   First Line of Text
•   First Line of Chorus
•   Masthead
•   Series Title
•   Special Issue
•   Title from plate
•   Subjects(s)
•   And more
Catalysts for change
• Institutional repositories
        Born-digital resources
        Electronic thesis and dissertations
• More metadata schemas
       Metadata Object Description Schema (MODS)
       Metadata Encoding and Transmission Standard (METS)
       Encoding Archival Description (EAD)
• Emphasizes on discovery of hidden collections
• Mass digitization
Local changes
Local development for a web-based search and retrieval system
was favored. It is now know as Acumen.

Mass digitization and processing workflows would be
implemented.

MODS would be used for descriptive metadata schema.
DC to MODS crosswalk
displayLabel   DC element          MODS element
Title          Title               <title>
Other Title    Title-alternative   <title type=“alternative”
Subject(s)     Subject             <subject authority=“lcsh”
Description    Description         <description>
Creator(s)     Creator             <name type=‘personal’>
Author(s)      Creator             <name type=“personal”>
Editor         Contributor         <name type=“personal”>
Creating MODS
<?xml version="1.0" encoding="UTF-8" ?>
<mods xmlns="http://www.loc.gov/mods/v3"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://www.loc.gov/mods/v3
http://www.loc.gov/standards/mods/v3/mods-3-4.xsd"
xmlns:xlink="http://www.w3.org/1999/xlink" version="3.4">
- <titleInfo>
  <title>{{Title}}</title>
  <subTitle>{{Subtitle}}</subTitle>
  <partNumber>{{Part Number}}</partNumber>
  <partName>{{Part Name}}</partName>
….
Archivist Utility
Editing rawMODS
<modsCollection xsi:schemaLocation="http://www.loc.gov/mods/v3
http://www.loc.gov/standards/mods/v3/mods-3-3.xsd">
<mods>
<titleInfo displayLabel="title">
  <title>Cotton lint</title>
</titleInfo>
<name type="personal">
  <namePart>Smith, Marjorie L.</namePart>
<role>
      <roleTerm authority="marcRelator" type="text">Photographer
     </roleTerm>
</role>
</name>
Real magic, scripting
Name processing

Name                Role         Type       Authority VIAF_ID           LCCN            Source History Comments




Coffman, J. Henry   Sender(s)    personal   local                                                     no matches




King, Helen         Sender(s)    personal   local                                                     check VIAF again




King, Margaret      Recipient(s) personal   local                                                     1+ matches--check VIAF again




King, Robert S.     Sender(s)    Personal   naf                 76561235 nb2004005427
Subject processing
Subject Master list                               Tagging key

Incorporation#x--West Virginia#z--Ohio County#z   #x topics
                                                  #z geographic location


Ingram, T.#p--Finance, Personal#x                 #p personal name

Jefferson County (Ala.). Tax collector#c          #c corporate name
PURLS
•   <Root>
•    <Row>
•     <identifier>u0003_0000520_0000001</identifier>
•     <purl>http://purl.lib.ua.edu/20394</purl>
•    </Row>
•    <Row>
•     <identifier>u0003_0000520_0000002</identifier>
•     <purl>http://purl.lib.ua.edu/20424</purl>
•    </Row>
•    <Row>
•     <identifier>u0003_0000520_0000003</identifier>
•     <purl>http://purl.lib.ua.edu/20425</purl>
•    </Row>
•   </Root>
More magic
<name type="personal" authority="naf">
     <namePart>King, Robert S.</namePart>
</name>

<subject authority="lcsh">
       <topic>Debtor and creditor</topic>
</subject>

<identifier type="local">u0003_0002865_0000001</identifier>
<identifier type="uri">http://purl.lib.ua.edu/35547</identifier>
Practical lessons learned
• By moving to the directory structure, we are able to implement
 OAI, ETDs, and EADS.

• This structure allows flexibility to adopt other schemas.

• It is important to know the standards.

 Knowing how to manipulate large amount of data is invaluable!
       Current records and digital objects available:
       Items available: 74197
       Scans available: 282091
acumen.lib.ua.edu
Mary S. Alexander
Metadata Librarian
Cataloging and Metadata Services Dept.
University of Alabama
Tuscaloosa, AL 35487

malexand@ua.edu
voice: 205-348-1490

Mais conteúdo relacionado

Semelhante a Metadata Magic: Transforming Data for Discovery

RDA: Are We There Yet? Carterette Webinar S
RDA: Are We There Yet? Carterette Webinar SRDA: Are We There Yet? Carterette Webinar S
RDA: Are We There Yet? Carterette Webinar SEmily Nimsakont
 
"RDFa - what, why and how?" by Mike Hewett and Shamod Lacoul
"RDFa - what, why and how?" by Mike Hewett and Shamod Lacoul"RDFa - what, why and how?" by Mike Hewett and Shamod Lacoul
"RDFa - what, why and how?" by Mike Hewett and Shamod LacoulShamod Lacoul
 
Getting Started with Graph Databases
Getting Started with Graph DatabasesGetting Started with Graph Databases
Getting Started with Graph DatabasesDataStax Academy
 
Mapping Relational Databases to Linked Data
Mapping Relational Databases to Linked DataMapping Relational Databases to Linked Data
Mapping Relational Databases to Linked DataEUCLID project
 
DBpedia Framework - BBC Talk
DBpedia Framework - BBC TalkDBpedia Framework - BBC Talk
DBpedia Framework - BBC TalkGeorgi Kobilarov
 
Schema.org: What It Means For You and Your Library
Schema.org: What It Means For You and Your LibrarySchema.org: What It Means For You and Your Library
Schema.org: What It Means For You and Your LibraryRichard Wallis
 
The Web Comes Alive with Data! Schema.org and Structured Data on the Web: Pas...
The Web Comes Alive with Data! Schema.org and Structured Data on the Web: Pas...The Web Comes Alive with Data! Schema.org and Structured Data on the Web: Pas...
The Web Comes Alive with Data! Schema.org and Structured Data on the Web: Pas...Jay Myers
 
Better design than sorry - let's design our DB schema
Better design than sorry - let's design our DB schemaBetter design than sorry - let's design our DB schema
Better design than sorry - let's design our DB schemaLiron Amitzi
 
RDA is Here: Are You Ready?
RDA is Here: Are You Ready?RDA is Here: Are You Ready?
RDA is Here: Are You Ready?Emily Nimsakont
 
Radically Open Cultural Heritage Data on the Web
Radically Open Cultural Heritage Data on the WebRadically Open Cultural Heritage Data on the Web
Radically Open Cultural Heritage Data on the WebJulie Allinson
 
Libraries and Linked Data: Looking to the Future (2)
Libraries and Linked Data: Looking to the Future (2)Libraries and Linked Data: Looking to the Future (2)
Libraries and Linked Data: Looking to the Future (2)ALATechSource
 
No sql for sql professionals
No sql for sql professionalsNo sql for sql professionals
No sql for sql professionalsRic Centre
 
The Internet Is Your New Database: An Introduction To The Semantic Web
The Internet Is Your New Database: An Introduction To The Semantic WebThe Internet Is Your New Database: An Introduction To The Semantic Web
The Internet Is Your New Database: An Introduction To The Semantic WebWill Strinz
 
So MANY databases, which one do I pick?
So MANY databases, which one do I pick?So MANY databases, which one do I pick?
So MANY databases, which one do I pick?kristinferrier
 
Mongo db 101 dc group
Mongo db 101 dc groupMongo db 101 dc group
Mongo db 101 dc groupJohn Ragan
 
Linked Data Presentation at TDWI Mpls
Linked Data Presentation at TDWI MplsLinked Data Presentation at TDWI Mpls
Linked Data Presentation at TDWI MplsJay Myers
 
Graph Databases: insight, scandal and the speed you always wanted!
Graph Databases: insight, scandal and the speed you always wanted!Graph Databases: insight, scandal and the speed you always wanted!
Graph Databases: insight, scandal and the speed you always wanted!Data Con LA
 

Semelhante a Metadata Magic: Transforming Data for Discovery (20)

RDA: Are We There Yet? Carterette Webinar S
RDA: Are We There Yet? Carterette Webinar SRDA: Are We There Yet? Carterette Webinar S
RDA: Are We There Yet? Carterette Webinar S
 
NCompass Live: RDA: Are We There Yet?
NCompass Live: RDA: Are We There Yet?NCompass Live: RDA: Are We There Yet?
NCompass Live: RDA: Are We There Yet?
 
"RDFa - what, why and how?" by Mike Hewett and Shamod Lacoul
"RDFa - what, why and how?" by Mike Hewett and Shamod Lacoul"RDFa - what, why and how?" by Mike Hewett and Shamod Lacoul
"RDFa - what, why and how?" by Mike Hewett and Shamod Lacoul
 
Getting Started with Graph Databases
Getting Started with Graph DatabasesGetting Started with Graph Databases
Getting Started with Graph Databases
 
Mapping Relational Databases to Linked Data
Mapping Relational Databases to Linked DataMapping Relational Databases to Linked Data
Mapping Relational Databases to Linked Data
 
DBpedia Framework - BBC Talk
DBpedia Framework - BBC TalkDBpedia Framework - BBC Talk
DBpedia Framework - BBC Talk
 
Schema.org: What It Means For You and Your Library
Schema.org: What It Means For You and Your LibrarySchema.org: What It Means For You and Your Library
Schema.org: What It Means For You and Your Library
 
The Web Comes Alive with Data! Schema.org and Structured Data on the Web: Pas...
The Web Comes Alive with Data! Schema.org and Structured Data on the Web: Pas...The Web Comes Alive with Data! Schema.org and Structured Data on the Web: Pas...
The Web Comes Alive with Data! Schema.org and Structured Data on the Web: Pas...
 
Ws rest
Ws restWs rest
Ws rest
 
Better design than sorry - let's design our DB schema
Better design than sorry - let's design our DB schemaBetter design than sorry - let's design our DB schema
Better design than sorry - let's design our DB schema
 
RDA is Here: Are You Ready?
RDA is Here: Are You Ready?RDA is Here: Are You Ready?
RDA is Here: Are You Ready?
 
Radically Open Cultural Heritage Data on the Web
Radically Open Cultural Heritage Data on the WebRadically Open Cultural Heritage Data on the Web
Radically Open Cultural Heritage Data on the Web
 
Libraries and Linked Data: Looking to the Future (2)
Libraries and Linked Data: Looking to the Future (2)Libraries and Linked Data: Looking to the Future (2)
Libraries and Linked Data: Looking to the Future (2)
 
No sql for sql professionals
No sql for sql professionalsNo sql for sql professionals
No sql for sql professionals
 
The Internet Is Your New Database: An Introduction To The Semantic Web
The Internet Is Your New Database: An Introduction To The Semantic WebThe Internet Is Your New Database: An Introduction To The Semantic Web
The Internet Is Your New Database: An Introduction To The Semantic Web
 
So MANY databases, which one do I pick?
So MANY databases, which one do I pick?So MANY databases, which one do I pick?
So MANY databases, which one do I pick?
 
Mongo db 101 dc group
Mongo db 101 dc groupMongo db 101 dc group
Mongo db 101 dc group
 
Deep Dive on Amazon DynamoDB
Deep Dive on Amazon DynamoDBDeep Dive on Amazon DynamoDB
Deep Dive on Amazon DynamoDB
 
Linked Data Presentation at TDWI Mpls
Linked Data Presentation at TDWI MplsLinked Data Presentation at TDWI Mpls
Linked Data Presentation at TDWI Mpls
 
Graph Databases: insight, scandal and the speed you always wanted!
Graph Databases: insight, scandal and the speed you always wanted!Graph Databases: insight, scandal and the speed you always wanted!
Graph Databases: insight, scandal and the speed you always wanted!
 

Mais de Visual Resources Association

Comparative Study and Expansion of Metadata Standards for Historic Fashion Co...
Comparative Study and Expansion of Metadata Standards for Historic Fashion Co...Comparative Study and Expansion of Metadata Standards for Historic Fashion Co...
Comparative Study and Expansion of Metadata Standards for Historic Fashion Co...Visual Resources Association
 
The Medieval Kingdom of Sicily Image Database Project: From Concept to Reality
The Medieval Kingdom of Sicily Image Database Project: From Concept to RealityThe Medieval Kingdom of Sicily Image Database Project: From Concept to Reality
The Medieval Kingdom of Sicily Image Database Project: From Concept to RealityVisual Resources Association
 
Interactive Topography with IIIF: Open Access to Photographs from the Ernest ...
Interactive Topography with IIIF: Open Access to Photographs from the Ernest ...Interactive Topography with IIIF: Open Access to Photographs from the Ernest ...
Interactive Topography with IIIF: Open Access to Photographs from the Ernest ...Visual Resources Association
 
Recreating a 19th-Century Spectacle: The 3D Glass Stereo Project
Recreating a 19th-Century Spectacle: The 3D Glass Stereo ProjectRecreating a 19th-Century Spectacle: The 3D Glass Stereo Project
Recreating a 19th-Century Spectacle: The 3D Glass Stereo ProjectVisual Resources Association
 
Cradle of Texas Gay Liberty: An Alternate History of the Alamo City
Cradle of Texas Gay Liberty: An Alternate History of the Alamo CityCradle of Texas Gay Liberty: An Alternate History of the Alamo City
Cradle of Texas Gay Liberty: An Alternate History of the Alamo CityVisual Resources Association
 
Material Order: A Discovery Group, Shared Catalog, and Research Platform for ...
Material Order: A Discovery Group, Shared Catalog, and Research Platform for ...Material Order: A Discovery Group, Shared Catalog, and Research Platform for ...
Material Order: A Discovery Group, Shared Catalog, and Research Platform for ...Visual Resources Association
 
Disinformation and Deepfakes: The Urgent Need for Visual Literacy
Disinformation and Deepfakes: The Urgent Need for Visual LiteracyDisinformation and Deepfakes: The Urgent Need for Visual Literacy
Disinformation and Deepfakes: The Urgent Need for Visual LiteracyVisual Resources Association
 
Pattern and Representation: Critical Cataloging for a New Perspective on Camp...
Pattern and Representation: Critical Cataloging for a New Perspective on Camp...Pattern and Representation: Critical Cataloging for a New Perspective on Camp...
Pattern and Representation: Critical Cataloging for a New Perspective on Camp...Visual Resources Association
 
Stories from the Stop (and Re-Start?): Visual Resources Professionals Face Re...
Stories from the Stop (and Re-Start?): Visual Resources Professionals Face Re...Stories from the Stop (and Re-Start?): Visual Resources Professionals Face Re...
Stories from the Stop (and Re-Start?): Visual Resources Professionals Face Re...Visual Resources Association
 
Supporting Art History Students' Digital Projects at American University
Supporting Art History Students' Digital Projects at American UniversitySupporting Art History Students' Digital Projects at American University
Supporting Art History Students' Digital Projects at American UniversityVisual Resources Association
 
Assessing the use of Qualitative Data Analysis Software (QDAS) by Art Histori...
Assessing the use of Qualitative Data Analysis Software (QDAS) by Art Histori...Assessing the use of Qualitative Data Analysis Software (QDAS) by Art Histori...
Assessing the use of Qualitative Data Analysis Software (QDAS) by Art Histori...Visual Resources Association
 
Describing Art on the Street: The Graffiti Art Community Voice
Describing Art on the Street: The Graffiti Art Community VoiceDescribing Art on the Street: The Graffiti Art Community Voice
Describing Art on the Street: The Graffiti Art Community VoiceVisual Resources Association
 
Photographic Glass Plates and Birthdates: Secrets to Optimizing AI-Generated ...
Photographic Glass Plates and Birthdates: Secrets to Optimizing AI-Generated ...Photographic Glass Plates and Birthdates: Secrets to Optimizing AI-Generated ...
Photographic Glass Plates and Birthdates: Secrets to Optimizing AI-Generated ...Visual Resources Association
 
Accessibility Guidance for Digital Cultural Heritage
Accessibility Guidance for Digital Cultural HeritageAccessibility Guidance for Digital Cultural Heritage
Accessibility Guidance for Digital Cultural HeritageVisual Resources Association
 

Mais de Visual Resources Association (20)

Comparative Study and Expansion of Metadata Standards for Historic Fashion Co...
Comparative Study and Expansion of Metadata Standards for Historic Fashion Co...Comparative Study and Expansion of Metadata Standards for Historic Fashion Co...
Comparative Study and Expansion of Metadata Standards for Historic Fashion Co...
 
Unsettling Collections: Bias in the Visual Canon
Unsettling Collections: Bias in the Visual CanonUnsettling Collections: Bias in the Visual Canon
Unsettling Collections: Bias in the Visual Canon
 
The Medieval Kingdom of Sicily Image Database Project: From Concept to Reality
The Medieval Kingdom of Sicily Image Database Project: From Concept to RealityThe Medieval Kingdom of Sicily Image Database Project: From Concept to Reality
The Medieval Kingdom of Sicily Image Database Project: From Concept to Reality
 
Interactive Topography with IIIF: Open Access to Photographs from the Ernest ...
Interactive Topography with IIIF: Open Access to Photographs from the Ernest ...Interactive Topography with IIIF: Open Access to Photographs from the Ernest ...
Interactive Topography with IIIF: Open Access to Photographs from the Ernest ...
 
Recreating a 19th-Century Spectacle: The 3D Glass Stereo Project
Recreating a 19th-Century Spectacle: The 3D Glass Stereo ProjectRecreating a 19th-Century Spectacle: The 3D Glass Stereo Project
Recreating a 19th-Century Spectacle: The 3D Glass Stereo Project
 
Cradle of Texas Gay Liberty: An Alternate History of the Alamo City
Cradle of Texas Gay Liberty: An Alternate History of the Alamo CityCradle of Texas Gay Liberty: An Alternate History of the Alamo City
Cradle of Texas Gay Liberty: An Alternate History of the Alamo City
 
Material Order: A Discovery Group, Shared Catalog, and Research Platform for ...
Material Order: A Discovery Group, Shared Catalog, and Research Platform for ...Material Order: A Discovery Group, Shared Catalog, and Research Platform for ...
Material Order: A Discovery Group, Shared Catalog, and Research Platform for ...
 
Personal Archiving for Undergraduate Students
Personal Archiving for Undergraduate StudentsPersonal Archiving for Undergraduate Students
Personal Archiving for Undergraduate Students
 
Disinformation and Deepfakes: The Urgent Need for Visual Literacy
Disinformation and Deepfakes: The Urgent Need for Visual LiteracyDisinformation and Deepfakes: The Urgent Need for Visual Literacy
Disinformation and Deepfakes: The Urgent Need for Visual Literacy
 
Jean Charlot: Artist as Archivist
Jean Charlot: Artist as ArchivistJean Charlot: Artist as Archivist
Jean Charlot: Artist as Archivist
 
Pattern and Representation: Critical Cataloging for a New Perspective on Camp...
Pattern and Representation: Critical Cataloging for a New Perspective on Camp...Pattern and Representation: Critical Cataloging for a New Perspective on Camp...
Pattern and Representation: Critical Cataloging for a New Perspective on Camp...
 
Stories from the Stop (and Re-Start?): Visual Resources Professionals Face Re...
Stories from the Stop (and Re-Start?): Visual Resources Professionals Face Re...Stories from the Stop (and Re-Start?): Visual Resources Professionals Face Re...
Stories from the Stop (and Re-Start?): Visual Resources Professionals Face Re...
 
Supporting Art History Students' Digital Projects at American University
Supporting Art History Students' Digital Projects at American UniversitySupporting Art History Students' Digital Projects at American University
Supporting Art History Students' Digital Projects at American University
 
Material Objects and Special Collections
Material Objects and Special CollectionsMaterial Objects and Special Collections
Material Objects and Special Collections
 
Digital Art History
Digital Art HistoryDigital Art History
Digital Art History
 
Assessing the use of Qualitative Data Analysis Software (QDAS) by Art Histori...
Assessing the use of Qualitative Data Analysis Software (QDAS) by Art Histori...Assessing the use of Qualitative Data Analysis Software (QDAS) by Art Histori...
Assessing the use of Qualitative Data Analysis Software (QDAS) by Art Histori...
 
Describing Art on the Street: The Graffiti Art Community Voice
Describing Art on the Street: The Graffiti Art Community VoiceDescribing Art on the Street: The Graffiti Art Community Voice
Describing Art on the Street: The Graffiti Art Community Voice
 
Photographic Glass Plates and Birthdates: Secrets to Optimizing AI-Generated ...
Photographic Glass Plates and Birthdates: Secrets to Optimizing AI-Generated ...Photographic Glass Plates and Birthdates: Secrets to Optimizing AI-Generated ...
Photographic Glass Plates and Birthdates: Secrets to Optimizing AI-Generated ...
 
Crowdsourcing Collection Development
Crowdsourcing Collection DevelopmentCrowdsourcing Collection Development
Crowdsourcing Collection Development
 
Accessibility Guidance for Digital Cultural Heritage
Accessibility Guidance for Digital Cultural HeritageAccessibility Guidance for Digital Cultural Heritage
Accessibility Guidance for Digital Cultural Heritage
 

Último

Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDhatriParmar
 
Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research DiscourseAnita GoswamiGiri
 
The role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenshipThe role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenshipKarl Donert
 
6 ways Samsung’s Interactive Display powered by Android changes the classroom
6 ways Samsung’s Interactive Display powered by Android changes the classroom6 ways Samsung’s Interactive Display powered by Android changes the classroom
6 ways Samsung’s Interactive Display powered by Android changes the classroomSamsung Business USA
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdfMr Bounab Samir
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQuiz Club NITW
 
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptxMan or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptxDhatriParmar
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQuiz Club NITW
 
Indexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfIndexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfChristalin Nelson
 
Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...
Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...
Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...HetalPathak10
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxSayali Powar
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxkarenfajardo43
 
How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17Celine George
 
Shark introduction Morphology and its behaviour characteristics
Shark introduction Morphology and its behaviour characteristicsShark introduction Morphology and its behaviour characteristics
Shark introduction Morphology and its behaviour characteristicsArubSultan
 
Employablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptxEmployablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptxryandux83rd
 
4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptx4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptxmary850239
 

Último (20)

Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
 
Spearman's correlation,Formula,Advantages,
Spearman's correlation,Formula,Advantages,Spearman's correlation,Formula,Advantages,
Spearman's correlation,Formula,Advantages,
 
Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research Discourse
 
The role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenshipThe role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenship
 
6 ways Samsung’s Interactive Display powered by Android changes the classroom
6 ways Samsung’s Interactive Display powered by Android changes the classroom6 ways Samsung’s Interactive Display powered by Android changes the classroom
6 ways Samsung’s Interactive Display powered by Android changes the classroom
 
prashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Professionprashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Profession
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdf
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
 
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptxMan or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
 
Indexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfIndexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdf
 
Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...
Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...
Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
 
How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17
 
Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
 
Shark introduction Morphology and its behaviour characteristics
Shark introduction Morphology and its behaviour characteristicsShark introduction Morphology and its behaviour characteristics
Shark introduction Morphology and its behaviour characteristics
 
Employablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptxEmployablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptx
 
Chi-Square Test Non Parametric Test Categorical Variable
Chi-Square Test Non Parametric Test Categorical VariableChi-Square Test Non Parametric Test Categorical Variable
Chi-Square Test Non Parametric Test Categorical Variable
 
4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptx4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptx
 

Metadata Magic: Transforming Data for Discovery

  • 1. Metadata Magic By Mary S. Alexander Metadata Librarian, The University of Alabama Libraries Given by Jeannine Keefer Moderator VRA Conference Case Studies IV March 21, 2012
  • 2. The beginning of digitization
  • 3. Dublin Core • 15 elements • element qualifiers • alpha tags • broad definitions • easy to use • optional elements. • repeatable elements
  • 4. Moving from Endeavor to CONTENTdm
  • 5. Data in CONTENTdm • <dc.title> • <dc.creator> • <dc.creator> displayLabel=“Artist” • <dc.creator> displayLabel=“Author” • <dc.creator> displayLabel=“Sender” • <dc.date> • <dc.publisher> • <dc.format> • <dc.coverage> • <dc.contributor>
  • 7. Display labels for master spreadsheet • Title • Other title • Cover title • First Line of Text • First Line of Chorus • Masthead • Series Title • Special Issue • Title from plate • Subjects(s) • And more
  • 8. Catalysts for change • Institutional repositories Born-digital resources Electronic thesis and dissertations • More metadata schemas Metadata Object Description Schema (MODS) Metadata Encoding and Transmission Standard (METS) Encoding Archival Description (EAD) • Emphasizes on discovery of hidden collections • Mass digitization
  • 9. Local changes Local development for a web-based search and retrieval system was favored. It is now know as Acumen. Mass digitization and processing workflows would be implemented. MODS would be used for descriptive metadata schema.
  • 10. DC to MODS crosswalk displayLabel DC element MODS element Title Title <title> Other Title Title-alternative <title type=“alternative” Subject(s) Subject <subject authority=“lcsh” Description Description <description> Creator(s) Creator <name type=‘personal’> Author(s) Creator <name type=“personal”> Editor Contributor <name type=“personal”>
  • 11. Creating MODS <?xml version="1.0" encoding="UTF-8" ?> <mods xmlns="http://www.loc.gov/mods/v3" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-4.xsd" xmlns:xlink="http://www.w3.org/1999/xlink" version="3.4"> - <titleInfo> <title>{{Title}}</title> <subTitle>{{Subtitle}}</subTitle> <partNumber>{{Part Number}}</partNumber> <partName>{{Part Name}}</partName> ….
  • 13. Editing rawMODS <modsCollection xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd"> <mods> <titleInfo displayLabel="title"> <title>Cotton lint</title> </titleInfo> <name type="personal"> <namePart>Smith, Marjorie L.</namePart> <role> <roleTerm authority="marcRelator" type="text">Photographer </roleTerm> </role> </name>
  • 15. Name processing Name Role Type Authority VIAF_ID LCCN Source History Comments Coffman, J. Henry Sender(s) personal local no matches King, Helen Sender(s) personal local check VIAF again King, Margaret Recipient(s) personal local 1+ matches--check VIAF again King, Robert S. Sender(s) Personal naf 76561235 nb2004005427
  • 16. Subject processing Subject Master list Tagging key Incorporation#x--West Virginia#z--Ohio County#z #x topics #z geographic location Ingram, T.#p--Finance, Personal#x #p personal name Jefferson County (Ala.). Tax collector#c #c corporate name
  • 17. PURLS • <Root> • <Row> • <identifier>u0003_0000520_0000001</identifier> • <purl>http://purl.lib.ua.edu/20394</purl> • </Row> • <Row> • <identifier>u0003_0000520_0000002</identifier> • <purl>http://purl.lib.ua.edu/20424</purl> • </Row> • <Row> • <identifier>u0003_0000520_0000003</identifier> • <purl>http://purl.lib.ua.edu/20425</purl> • </Row> • </Root>
  • 18. More magic <name type="personal" authority="naf"> <namePart>King, Robert S.</namePart> </name> <subject authority="lcsh"> <topic>Debtor and creditor</topic> </subject> <identifier type="local">u0003_0002865_0000001</identifier> <identifier type="uri">http://purl.lib.ua.edu/35547</identifier>
  • 19. Practical lessons learned • By moving to the directory structure, we are able to implement OAI, ETDs, and EADS. • This structure allows flexibility to adopt other schemas. • It is important to know the standards. Knowing how to manipulate large amount of data is invaluable! Current records and digital objects available: Items available: 74197 Scans available: 282091
  • 20. acumen.lib.ua.edu Mary S. Alexander Metadata Librarian Cataloging and Metadata Services Dept. University of Alabama Tuscaloosa, AL 35487 malexand@ua.edu voice: 205-348-1490

Notas do Editor

  1. Mary Alexander is one of three metadata librarians at the University of Alabama Libraries where she has been employed since 1993. In 2003, she transition to a new position working with emerging metadata schemas and their related standards.
  2. Across the nation digitization efforts were in production when the University of Alabama Libraries and the two large state institutions began to write their first grant to digitize archival collections.We had experienced personnel in archives, cataloging, and systems. We were rich in archival collections and limited in staffing and resources.
  3. The grant was awarded in 2003. A metadata group representing the three institutions collaborated on a best practice document. The Dublin Core metadata schema was chosen based on its usage with large collaborative projects. While the group was discussing essential schema elements, individually the institutions proceeded in their efforts.The UA Libraries was already in possession of the digital management system, Endeavor’s Encompass. We began.
  4. Within a short time after starting the project, Endeavor was acquired by a competing company. Encompass would no longer be supported.OCLC’s CONTENTdm was recommended as the next system for UA Libraries Digital Collections. A consultant was hired to get data out of Encompass. The exported data was delivered in a tab-delimited file.Those involved with the state-wide grant also choose CONTENTdm based on its connectivity with other sites. It provided a way for state collections to be discovered through one web site. The state-wide group discussed standards for digitization and metadata. The Dublin Core Metadata Best Practices by the CDP Metadata Working Group of the Collaborative Digitation Program was adopted.
  5. In CONTENTdm, there was a digital collection for every physical collection.The Qualified Dublin Core schema had been implemented in all collections.Each collection had fields tailored to the collections need. Number of fields ranged from 17 to 27. Production was driven by a workflow based on spreadsheets. Each collection had a corresponding spreadsheet. Many of the collection’s fields were repeated elements. Each creator element had a unique display label that reflected the role of the person or corporation. Author, composer, artist, photographer are some of the roles/display labels. These were used as the header for columns in the spreadsheet.
  6. Over a 5 year period, there were 28 collections in CONTENTdm each with a slightly different set of data fields. There were approximately 6,972 digital objects and records in CONTENTdm. A license plateau was quickly approaching.
  7. The Cataloging and Metadata Services Department and the Digital Services Unit decided that a master list of fields should be adopted across collections. This would enable CONTENTdm collections to be combined resulting in fewer collections. Fields usage was analyzed to their frequency and definitions. Common fields were kept. The least used fields were discarded.A master list was developed. New fields were added for creating finding aids or EAD records for special collections. The display labels for field names were applied to a master spreadsheet. Existing CONTENTdm collections were modified to reflect the master data dictionary. When this process was completed, 28 collections were reduced to 12 collections. CONTENTdm collections were called containers locally and renamed using time spans significant to the holdings of the William Stanley Hoole Special Collections Library. One additional collection was added for University of Alabama digital objects.
  8. The UA digital planning group looked forward to build a digital program.Institutional repositories and born-digital resources that would populate them were hot topics!Emerging schemas (MODS, a descriptive metadata schema; METS, a structure metadata schema, and EAD, a descriptive metadata schema for archival collections) were receiving a lot of attention.The Association of Research Libraries continue to emphasize exposing hidden collections.Mass digitization was being practice.With these factors and more, UA Libraries wanted to be in position to implement these possibilities. A Digital Services Unit was created to digitize and manage these collections.
  9. Combining collections in CONTENTdm delayed reaching the next license plateau. The potential of digital objects filling CONTENTdm was a fact.The Digital Services Unit and the Library Office of Technology decided on moving away from CONTENTdm and Dublin Core. Plans were under way for digital objects and their metadata records to reside in a directory. A display and retrieval system would be built over SQL database for users.The metadata workflow was changed. Now the master spreadsheet begun with the archivists. After titles, names, dates, and other description information was added, the spreadsheets would move to the Digital Services Unit for the file names to be added as a step in the digitization process. The spreadsheets would be transformed to MODS records producing a preliminary record. This temporary record provided some access to resources while the Metadata Unit reviewed the spreadsheet for quality control, establishing names, adding abstracts, and assigning subject and genres terms.
  10. The master spreadsheet field was mapped to MODS using EXCEL for a visual crosswalk.The next step was to create a style sheet (xls) for the spreadsheet conversion to MODS records.
  11. This is the template used for the conversion of files to MODS records.
  12. Archivist Utility uses a style sheet template and tab-delimited files derived from the spreadsheets to create raw MODS records. AU was created by a programmer working with the Digital Services Unit leader. Data could be viewed as text or as a MODS record. These preliminary views provide an opportunity for quality control. The error log reports unused or missing columns from tab-delimited files. Errors are corrected in the spreadsheet and loaded again into the utility. When the librarian is satisfied with the records, they are saved to a folder for the next step.[The is the icon for the Archivist Utility. The University’s mascot is Big Al, a elephant.]
  13. The raw MODS records only contained data in the top-element MODS tags.Very simple edits were performed before loading to the server.
  14. A second metadata librarian with hired. She wrote scripts to pull data for names and subjects files, to replace data, and to transform rawMODS data into robust MODS records.
  15. Names were collected from the “name” columns of the spreadsheet to a tab-delimited file using a Python script. The processing included searching names in viag.org. If found, the viaf and LCCN columns would be populated. Notes stating search results for each name would be given. After reviewing this file, a metadata librarian would run a script to place the names and numbers into a MADS file.
  16. Subject processing uses a Window powershell script. It pulls the subjects into a master list for tagging names, topics, geographic, events, genres, occupations, and other headings. After tagging, another script is run to replace subject headings with their tags into the tab-delimited file derived from the spreadsheet.
  17. PURLs, Persistent Uniform Resource Locators, are needed for the digital object and the digital collection. A master list for collection-level PURLS were collected monthly and passed to the Metadata Services Unit.A Python script is used to pull identifiers from spreadsheets for the purpose of gathering PURLs for digital objects from the server. The results are provided in xml list.
  18. A transformation puts the pieces together.The rawMODS records are in a project folder used in xmlSpy. A XSLT transformation populates the rawMODS records from the generated files created earlier. The authority attribute used with the name tag is populated from the MADS files. From the xml list, PURLS are added to their appropriate identifier tags. The subject tags (#c, #p, #x, etc.) are recognized so sub elements for name, topic, geographic, and other headings are assigned.The final step includes validation through Schematron using the item-level PURLs. When the Schematron validation is free of errors, final MODS records are loaded to the server replacing preliminary MODS records.
  19. By moving to adirectory structure, we are able to implement OAI, ETDs, and EADS.This structure allows flexibility to adopt other schemas.It is important to know the standards.Knowing how to manipulate large amount of data is invaluable!
  20. Please visit Acumen. Thank you!