Developing the AIP Thesaurus:
 The Platform for an Ontology



        Mark Cassar
 American Institute of Physics

          Jack Bruce
 Marjorie (Margie) M.K. Hlava
     Access Innovations:
        505-998-0800
Background

• Physics and Astronomy Classification Scheme (PACS)
• Six-digit code scheme used for indexing scholarly
  content
• 10 digit based
   – domain headings with subcategories nested under
     each domain.
• Precoordinated system
   – Combine terms (concepts) at the time of indexing
Why Change?
• Improve searchability
• Move to a post-coordinated system
   – Combine terms at the time of search
• Semantic enrichment
• Flexible metadata for many applications
• Naturalize the vocabulary
   – Represent concepts succinctly and concisely
   – Easily add new concepts based on new and emerging
     technologies and applications
   – Allow unlimited hierarchy levels and polyhierarchy
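
The move from pre-coordination to post-coordination can be illustrated with a minimal sketch. It assumes each article is tagged with single-concept terms and that the searcher combines them at query time with a Boolean AND. The article IDs, the index terms, and the `search` helper are all invented for illustration and are not taken from the AIP data.

```python
# Hypothetical post-coordinated index: each single-concept term maps to
# the set of article IDs tagged with it (IDs and terms are invented).
index = {
    "Electron diffraction": {"a1", "a2", "a4"},
    "Condensed matter": {"a2", "a3", "a4"},
    "Thin films": {"a4", "a5"},
}


def search(index, *terms):
    """Return the articles tagged with every requested term (Boolean AND)."""
    postings = [index.get(term, set()) for term in terms]
    if not postings:
        return set()
    return set.intersection(*postings)


hits = search(index, "Electron diffraction", "Condensed matter")
print(sorted(hits))
```

In a pre-coordinated scheme the combination "electron diffraction in condensed matter" would need its own code assigned at indexing time; here the combination is made only when the query is run.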
Better ROI

• Rules-assisted indexing
   – Provide end users with a swift indexing solution
     based on the Machine-Aided Indexer (M.A.I.)
     engine.
   – Batch index a large corpus of scholarly content, as
     well as future content.
• Reduce costs
   – Automate a large portion of electronic indexing
   – Less overhead for indexing
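
As a rough illustration of rules-assisted indexing, the sketch below maps text patterns to preferred terms. It is not the M.A.I. engine or its rule syntax, which the slides do not show. The `RULES` table and the `suggest_terms` function are hypothetical, and the patterns are made up.

```python
import re

# Hypothetical rule base: each text pattern suggests one preferred term.
# Patterns and terms are illustrative, not taken from a real rule base.
RULES = {
    r"\bLEED\b": "Low energy electron diffraction",
    r"\blow[- ]energy electron diffraction\b": "Low energy electron diffraction",
    r"\bRHEED\b": "Reflection high energy electron diffraction",
}


def suggest_terms(text):
    """Return the sorted preferred terms whose rules match the text."""
    found = set()
    for pattern, term in RULES.items():
        if re.search(pattern, text, flags=re.IGNORECASE):
            found.add(term)
    return sorted(found)


abstract = "We report LEED and RHEED measurements of a cleaved surface."
print(suggest_terms(abstract))
```

Because every rule resolves to a preferred term, an acronym and its spelled-out form lead to the same index entry.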
Roadmap of the AIP Thesaurus
• Data Collection
   – Load PACS codes and terms
   – Incorporate search logs; add the most-searched concepts to the
     vocabulary
• Analysis of Content
   – Compare test indexing against human-indexed articles
• Thesaurus Construction
   – Separate, disambiguate, and migrate concepts; break up top
     domains
   – Apply thesaurus and taxonomy standardization to each term
   – Multiple reviews for each top section
• Evaluation and Feedback
   – Send back working draft to AIP for review
   – Gather feedback from subject matter experts and incorporate the
     changes into the thesaurus
• Finalization and Product Delivery
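
The comparison of machine indexing with human-indexed articles, listed under Analysis of Content, could be scored in several ways. The slides do not say which measure was used. The sketch below assumes precision and recall over sets of terms, and the function name and sample terms are invented.

```python
def compare_indexing(machine_terms, human_terms):
    """Score machine-suggested terms against a human indexer's terms."""
    machine, human = set(machine_terms), set(human_terms)
    agreed = machine & human
    return {
        # Share of machine terms the human indexer also chose.
        "precision": len(agreed) / len(machine) if machine else 0.0,
        # Share of human terms the machine also found.
        "recall": len(agreed) / len(human) if human else 0.0,
        "missed": sorted(human - machine),
        "extra": sorted(machine - human),
    }


result = compare_indexing(
    machine_terms=["Electron diffraction", "Thin films", "Surface structure"],
    human_terms=["Electron diffraction", "Surface structure", "Crystal growth", "Epitaxy"],
)
print(result)
```

The `missed` and `extra` lists point to terms or rules that may need review.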
Source Data

• PACS 2009 ed.
• 1999 ed. of AIP Thesaurus (out of date)
• Terms added to INSPEC since 2000
• Internal and external search logs
• Cumulative journal indexes
   – Digital (2006 through 2009)
• List of AIP divisions and their internal classifications
Analysis of Content


• Organizational warrant
   – PACS 2009 (2010)
   – www.aip.org
   – UniPHY
• Literary warrant
   – Where we found the term used
• Most frequent search terms loaded into thesaurus
Thesaurus Creation Process
• Load data (vocabulary) into Data Harmony MAIstro™
• PACS
   – Restructure top domains
   – Separate into discrete concepts
   – Disambiguate terms
   – Remove parenthetical qualifiers
   – Create post-coordinated terms
   – Migrate separated terms into new/relevant categories
• Sort flat lists (search logs) into main categories determined
• Use multiple reviewers for each physics domain
• About 8181 preferred terms and 5217 synonyms
PACS term:
   – Low-energy electron diffraction (LEED) and reflection
     high-energy electron diffraction (RHEED) (condensed
     matter structure determination)
Becomes:
   – BT Condensed matter structure determination
      • NT Low energy electron diffraction
         – Synonym LEED
      • NT Reflection high energy electron diffraction
         – Synonym RHEED
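
The result of splitting this PACS entry can be written down as thesaurus records. The sketch below is only an illustration: the `Term` class, its field names, and the `add_narrower` helper are assumptions and do not reflect how Data Harmony stores terms.

```python
from dataclasses import dataclass, field


@dataclass
class Term:
    """A minimal thesaurus record (field names are illustrative)."""
    label: str
    broader: list = field(default_factory=list)   # BT
    narrower: list = field(default_factory=list)  # NT
    synonyms: list = field(default_factory=list)  # non-preferred terms


def add_narrower(parent, child):
    """Link two terms reciprocally, so every NT has a matching BT."""
    parent.narrower.append(child.label)
    child.broader.append(parent.label)


parent = Term("Condensed matter structure determination")
leed = Term("Low energy electron diffraction", synonyms=["LEED"])
rheed = Term("Reflection high energy electron diffraction", synonyms=["RHEED"])
for child in (leed, rheed):
    add_narrower(parent, child)

print(parent.narrower)
print(leed.broader, leed.synonyms)
```

Because `broader` is a list, a term can sit under more than one parent, which is what polyhierarchy requires.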
Evaluation and Feedback


• Weekly scheduled live demos of the thesaurus
• Free web-hosted version of the thesaurus and
  periodic spreadsheet exports
• Collect feedback from subject matter experts (SMEs) and AIP
  PACS experts
   – Correspondence via email
• Incorporate changes into thesaurus
Available versions


• Electronic copy of AIP thesaurus supplied in
   – XML
   – Excel
   – Web-based, read-only versions (Thesviewer)
   – MARC, SKOS, OWL, CSV, etc.
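
To give a sense of what a SKOS rendering of one term might look like, the sketch below writes a single concept as Turtle using the standard SKOS properties `skos:prefLabel`, `skos:altLabel`, and `skos:broader`. The structure of the actual AIP export is not shown in the slides. The `http://example.org/thes/` namespace, the `slug` helper, and the `to_skos_turtle` function are placeholders, and the sketch does not escape quotation marks inside labels.

```python
SKOS_PREFIX = "@prefix skos: <http://www.w3.org/2004/02/skos/core#> .\n"
BASE = "http://example.org/thes/"  # placeholder namespace, not the real one


def slug(label):
    """Turn a label into a simple URI fragment (illustrative only)."""
    return label.lower().replace(" ", "-")


def to_skos_turtle(label, synonyms=(), broader=()):
    """Render one preferred term as a SKOS concept in Turtle syntax."""
    lines = [f"<{BASE}{slug(label)}> a skos:Concept ;"]
    lines.append(f'    skos:prefLabel "{label}"@en')
    for syn in synonyms:
        lines[-1] += " ;"
        lines.append(f'    skos:altLabel "{syn}"@en')
    for parent in broader:
        lines[-1] += " ;"
        lines.append(f"    skos:broader <{BASE}{slug(parent)}>")
    lines[-1] += " ."
    return "\n".join(lines)


out = to_skos_turtle(
    "Low energy electron diffraction",
    synonyms=["LEED"],
    broader=["Condensed matter structure determination"],
)
print(SKOS_PREFIX + out)
```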
[Screenshots: Taxonomy view; Thesaurus Term Record view]
To make an ontology


• Define additional Associative relationships
• Define additional Hierarchical relationships
   – IsA, IsPartOf, HasA
• Define additional Equivalence relationships
       • Multilingual options
       • Weights and measures
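Clearer disambiguation?

[Diagram: "Mercury" at the center, linked to five concepts by typed relationships]
• Mercury
   – IsA: Planets
   – IsA: Roman god
   – IsA: Metallic element
   – TypeOf: Temperature
   – BrandOf: Automobile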
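
The typed relationships in the Mercury diagram can be sketched as subject-relation-object triples. The pairing of each relationship label with a concept is read off the slide layout, and the direction of each link is an assumption. The `senses` helper is hypothetical.

```python
# Typed relationships as (subject, relation, object) triples.
# Pairings follow the slide layout; link direction is an assumption.
TRIPLES = [
    ("Mercury", "IsA", "Planets"),
    ("Mercury", "IsA", "Roman god"),
    ("Mercury", "IsA", "Metallic element"),
    ("Mercury", "TypeOf", "Temperature"),
    ("Mercury", "BrandOf", "Automobile"),
]


def senses(term, relation=None):
    """List the concepts linked to a term, optionally by one relation."""
    return [
        obj
        for subj, rel, obj in TRIPLES
        if subj == term and (relation is None or rel == relation)
    ]


print(senses("Mercury", "IsA"))
print(senses("Mercury", "BrandOf"))
```

A thesaurus with a single generic relationship type would list all five concepts the same way. Naming the relationship lets a query tell the automobile brand apart from the planet or the element.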
Knowledge Organization Systems
(listed from not complex to highly complex)
• Uncontrolled list
• Name authority file
• Synonym set/ring
• Controlled vocabulary
• Taxonomy
• Thesaurus (the AIP Thesaurus is here)
• Ontology
• Semantic network
Lessons Learned
• Learning the style for indexing
• Tendency to revert to the PACS style of language and
  classification
• SME feedback turnaround
   – Sit with them: 2 hours
   – Incorporate suggestions: 8 hours
   – 2117 terms added
   – 1354 terms changed or updated
   – 1333 terms deleted
   – 11259 other actions
Where are we now?
• Platform is established
• OWL and other formats available
• One kind of Associative relationship
   – (Related terms)
• One kind of Hierarchical relationship
   – Broader/Narrower (Parent/Child)
   – Multiple broader terms for interdisciplinary options
• One kind of Equivalence relationship
   – Synonyms (non-preferred terms)
• Built using the Z39.19 standard - interoperable
To Review the AIP Thesaurus
• Use a web browser
• http://thesview.accessinn.com/aipThes/
• You are asked for a username and password twice; in all cases
  both are 'aip'.
• A Java app launches in your browser and shows the
  thesaurus starting from the top level of the hierarchy.
• Use the collaboration module to comment and
  discuss
Thank you


          Marjorie Hlava
    mhlava@accessin.com
          505-998-0800
