Marjorie M.K. Hlava, President and founder of Access Innovations, Inc., unveils the newest version and module updates of the Data Harmony indexing software suite.
2. DH Technical Support Team
ī¯ Development
programming team
īŽ Lamine Idjeraoui **
īŽ Allexander Lyons
īŽ Daniel Vasicek
īŽ Scott Roberts
īŽ Doug Vendcat
ī¯ Customer support
īŽ Mary Garcia **
īŽ Jack Bruce
īŽ Gabe Carr
īŽ Samantha Lewis
ī¯ Documentation
īŽ Jack Bruce **
īŽ Kirk Sanders
īŽ Gena San Nicolas
īŽ Barbara Gilles
ī¯ Systems
īŽ Tom Peterson**
īŽ SWCP
3. DH Customer Support Team
ī¯ Sales and Licensing
īŽ Marjorie Hlava
īŽ Janice McIntyre
īŽ Bill Richardson
īŽ Jay Ven Eman **
īŽ Leland Yates
ī¯ Blog and Web team
īŽ Barbara Gilles
īŽ Melody Smith **
īŽ Timothy Soholt **
ī¯ Marketing
īŽ Heather Kotula **
īŽ Ashley Beard
4. Editorial Team
Taxonomy and Rule Building
ī¯ Gabe Carr
ī¯ Jack Bruce
ī¯ Kathy Brown
ī¯ Barbara Gilles
ī¯ Bob Kasenchak **
ī¯ Samantha Lewis
ī¯ Kirk Sanders
ī¯ Tim Soholt
ī¯ Gena San Nicolas
ī¯ Alice Redmond-Neal
ī¯ Eric Ziecker
5. Access Integrity
ī¯ Kathy Brown
ī¯ Jerry Jorgeson
ī¯ John Kuranz**
ī¯ Leland Yates
ī¯ Access Rule Building Team
ī¯ Access Programming Team
6. Whoâs Who?
ī¯ Introduce yourself
ī¯ Relationship to Data Harmony
ī¯ Where do you use Data Harmony
ī¯ Project Name(s)
8. Four Divisions
ī¯ Database Services
ī¯ Data Harmony
īŽ NewsIndexer
ī¯ National Information Center for
Educational Media (NICEM)
īŽ MediaSleuth
ī¯ Access Integrity
īŽ Medical Claims Compliance
īŽ Integracoder
11. Database Services - 3
ī¯ Applications development
īŽ Search â Lucene and Solr
īŽ Search Harmony interface
īŽ Web services layer
ī¯ Link to user experience or user interface
ī¯ Web calls
īŽ API setup and linking
ī¯ www.accessinn.com
12. Data Harmony
ī¯ Built for our use starting in 1987
ī¯ Visual Basic C++ Java
ī¯ Aid to the editorial and indexing processes
ī¯ Alleviate the clerical aspects
ī¯ Speed the tagging process
ī¯ Guarantee accuracy, consistency, and
depth of indexing
13. Data Harmony Suite â
Main Modules
ī¯ M.A.I.
ī¯ Thesaurus Master
ī¯ XIS
īŽ XML Intranet System
ī¯ Administrative configuration module
ī¯ âThe Data Harmony Suiteâ
14. Tech stuff
ī¯ Downloadable
ī¯ Documentation revised 2014
ī¯ APIs for client server versions
ī¯ Internet accessible Cloud and SaaS
ī¯ Full multilingual display
ī¯ Unicode - Accepts ASCII data
ī¯ Entification tables converted
ī¯ Drivers for display and print
īŽ For most languages
15. Data Harmony
ī¯ Java
īŽ Platform independent
īŽ Applet modules
īŽ Web services
īŽ APIs
ī¯ XML
ī¯ TCP/IP
ī¯ JSON and SSL on WEB Start
ī¯ GlassFish for extension support
ī¯ www.dataharmony.com
17. Data Harmony
ī¯ Machine Aided Indexing (M.A.I.)
īŽ Semantic, syntactic, morphological, etc. layer
īŽ Rule Builder for users
īŽ Concept Extractor for text
īŽ Statistics for Machine Learning
īŽ Use in automatic, batch, or assisted mode
ī¯ Thesaurus Master
īŽ For creating taxonomies, thesauri, ontologies, and
authority files
ī¯ MAIstro
īŽ Thesaurus Master and M.A.I. combined
23. Data Harmony Forum
ī¯ Discussion threads
ī¯ Solutions to reported problems
ī¯ Access to the newest documentation
ī¯ Announcements of features
ī¯ Bug reports
ī¯ Enhancement requests
24. Data Harmony Partners
ī¯ EJ Press
ī¯ MarkLogic
īŽ Really strategies (R Suite)
īŽ Yuxi
īŽ Xquire
ī¯ Publishing Technology
ī¯ More âĻ.
25. Some DH Connectors & ExportsâĻ
ACD/Labsâ
Lucene(org.&Solr)
PerfectSearch
Oracle/StellentUniversal
ContentManagement
JiveSoftwareâs
Clearspace
EJPress
PublishingTechnology
OpenOffice
MarkLogicâsMarkLogic
Server
MicrosoftâsSharePoint
NorthPlains
Temis
Synaptica
and moreâĻ
26. Other DH offerings
ī¯ Off-the-shelf taxonomy
īŽ Term records
īŽ Browseable list
īŽ Rule bases
ī¯ Consulting
īŽ Information architecture
īŽ DTD and schema creation
ī¯ Search implementation
27. Knowledge Domains
in over 40 subject areas.
âĸ Agriculture
âĸ Applied Technologies
âĸ Business (popular)
âĸ Business and Finance
âĸ Communications
âĸ Computer and Information
Science (popular)
âĸ Computer Science
âĸ Consumer and Homemaking
Education
âĸ Corporate Names
âĸ Counseling and Guidance
âĸ Economics
âĸ Education
âĸ Engineering
âĸ Environment
âĸ Geography (subject)
âĸ Geographical Place Names
âĸ Health and Safety
âĸ History
âĸ Language Arts
âĸ Languages
âĸ Literature and Drama
âĸ Mathematics
âĸ News
âĸ Occupations
âĸ Organizational Names
âĸ Personal Names
âĸ Physical Education and
Recreation
âĸ Political Science
âĸ Psychology
âĸ Religion and Philosophy
âĸ Science (popular)
âĸ Science, Technology, and
Medicine (STM)
âĸ Society
âĸ Sports
âĸ Technology
âĸ Visual and Performing Arts
âĸ US Industrial Codes (NAICS)
âĸ US Zip Codes and Places
Go to
TaxoBank
formore!
28. NewsIndexer
ī¯ Automatic indexing of newspapers
ī¯ 8 topical areas
ī¯ Maps to IPTC, NAICS, ICB, and GICS
codes
ī¯ Popular, automatic, and fast
ī¯ Remote submission / ASP
ī¯ 13 levels Filter to 3
ī¯ License and augment
ī¯ www.newsindexer.com
29. National Information Center for
Educational Media - NICEM
ī¯ 667,000 records for non-print educational
media
ī¯ 23,000 producers and distributors
ī¯ Based on school curriculum needs
ī¯ Online and CD-ROMs
ī¯ MARC cataloging
ī¯ Thesaurus
ī¯ Print
ī¯ www.nicem.com
30. MediaSleuth
ī¯ Online ordering of media from NICEM
ī¯ Search Harmony implementation
ī¯ Full e-commerce platform for ordering
ī¯ Educational and popular materials
ī¯ www.mediasleuth.com
31. Access Integrity, Inc. (AI2)
ī¯ Medical Claims Compliance
ī¯ Automatic IDC-9 suggestions
ī¯ CPT rule base
ī¯ HCPCS rule base
ī¯ ICD-9 V 3 Hospitals
ī¯ ICD-10
ī¯ Accurate, deep, consistent coding
ī¯ Making medical billing efficient
32. Corporate Information
ī¯ Closely held
ī¯ Financed by
īŽ Sweat and Persistence
īŽ Good Cash Flow and Management
ī¯ Since 1978 - 35 years in business
ī¯ Marjorie M.K. Hlava
ī¯ Jay Ven Eman
ī¯ Joanna Ginter
ī¯ www.accessinn.com
Woman Owned Small Business
36. Data Harmony 2014 Stack
Rule
Base
Term
Key
Record
Concept
Extractor
Statistics
Module
Taxonomy
Authority files
All terms
Alphabetic
Permuted view
XML (Extensible Markup Language) - Unicode
Java Virtual Machine
TCP/IP Transmission Control Protocol / Internet Protocol
Native XML
Content
Creation
Repository
OWL
Zthes
SKOS
XML
MARC, etc.
Administrative modules
37. Admin Module
ī¯ Configuration of Thesaurus Master,
M.A.I., MAIstro
ī¯ Separate Admin Module for XIS
ī¯ MAIBatch added to MAIstro Admin
Module
38. The author pastes
the data into the
document
template,
attaching images,
graphs, etc. as
necessary:
Copyright Š 2013 Access Innovations, Inc.
Author Submission
Module
39. Author Submission Module
Copyright Š 2013 Access Innovations, Inc.
The author fills in the data to the document template, attaching images
and graphs as necessary.
An API calls Data Harmony and generates a list of indexing terms
based on the content.
40. Authors review the
indexing and may
change it.
Content is stored
into a data
repository as
HTML, XML, etc.
Author Submission Module
Copyright Š 2013 Access Innovations, Inc.
41.
42. DH Author Submission
System
ī¯ Leveraging Records Management with
Documentum, Author Submission, and
MAIstro
Marjorie M.K. Hlava and Leland Yates,
Access Innovations, Inc.
45. ī¯ Configure any field
ī¯ Index on any field
ī¯ XML or XHTML
ī¯ Link to the CMS
Author Submission
System Configuration
Module
46. Author Disambiguation
ī¯ Build a file of authors
īŽ Name: first, second, surname
īŽ DOIs published
īŽ Publication rank (first author, etc.)
īŽ Keywords for those DOIs
īŽ Affiliation(s)
īŽ Location(s) city, state, country, etc.
īŽ Co-authors (inferred by DOI)
īŽ Etc.
47. Affiliation Disambiguation
ī¯ Build a file of affiliations
īŽ Name
ī¯ Lab, institute, etc. name
īŽ DOI
īŽ Location
īŽ Full address
īŽ Keywords
īŽ Etc.
48. Author Disambiguation
ī¯ Link the two databases
ī¯ Build a web service to accept files
ī¯ Auto-disambiguate incoming files
ī¯ Review new or non-match to ensure
accuracy
ī¯ Leveraging Semantic Fingerprinting for
Building Author Networks
Bob Kasenchak, Wednesday @ 9:30 AM
49. Inline Tagging
ī¯ Full text tagging
ī¯ Send search query directly to the place in
the document where the concept is
mentioned.
ī¯ Flexible in XML and HTML views
ī¯ Inline Tagging and Dictionary Connection
Gena San Nicolas, Wednesday @ 2:15
50. Inline tagging Web service
ī¯ Use M.A.I. to put terms in context for
high-precision indexing
51. Inline Tagging
Shows the exact point where the
concept is mentioned
Mouse over to view the term record
Statistical summary, showing the
number of times each term is
mentioned in the article
53. Metadata Extractor
ī¯ Automatic creation from PDF digital layer
ī¯ Position training needed
ī¯ Dublin Core metadata
ī¯ Bibliographic citation created
ī¯ Automatic summarization added
ī¯ Uses M.A.I. on full text
ī¯ Can be linked to Author Disambiguation
58. Or use with HTML Pages
. <document>
<title>Access Innovations -
Knowledge Management Professionals</title>
<document-type>Web Page</document-type>
<copyright>Š 2007 Access Innovations, Inc.</copyright>
<address>
<street>131 Adams NE</street>
<city>Albuquerque</city>
<state>New Mexico</state>
</address>
<subject-terms>
<term>Data Harmony</term>
<term>Indexing</term>
<term>Taxonomies</term>
</subject-terms>
</document>
59. M.A.I.
ī¯ M.A.I. is used to describe or categorize
items by matching text to controlled
vocabulary terms
īŽ Rule Builder
īŽ Concept Extractor
īŽ Statistics Collector
īŽ Test MAI
60. M.A.I. 2014
ī¯ Find in Test MAI
ī¯ Export Fields function
ī¯ Expanded warning and information labels
ī¯ Expanded print functions
ī¯ Rule error details
ī¯ Emphasis tags
ī¯ MAIBatch GUI
73. MAIBatch XML
ī¯ Add Custom tags
ī¯ Click on âXML tagsâ in
the Settings menu.
74. MAIBatch - Adding files
Viewing results
Upload File/Directory
Row of asterisks separates each document
file path of a document
suggested thesaurus terms
75. Log Statistics
ī¯ From source data to
compare accuracy
ī¯ By human editors
assigning values
īŽ HIT
īŽ MISS
īŽ NOISE
From source file data
<USEDTERMS>
<TERM>Term 1</TERM>
<TERM>Term 2</TERM>
</USEDTERMS>
79. Ontology Master
ī¯ Sneak Peek
ī¯ Built on Thesaurus Master
ī¯ Full OWL and SKOS exports
ī¯ Full directional relationships
ī¯ Same extensive functionality
ī¯ Bob Kasenchak â Wednesday @ 1:15
PM
83. Search Harmony
ī¯ Built to leverage semantically enriched
text
ī¯ Uses the thesaurus sections
īŽ BT-NT relationships for taxonomy tree
īŽ Type ahead from tab, permuted index
īŽ Related terms
īŽ Narrower terms
84. Copyright Š 2005 - Access Innovations, Inc.
Taxonomy
view
Thesaurus
Term Record
view
87. Search Presentation Layer
The Hierarchical view of the thesaurus
is also a browseable view of the
content.
The numbers include the number of hits
1. For the term
2. For the branch
88. Semantic Fingerprinting
ī¯ People / Authors
ī¯ Articles
ī¯ Medical records
ī¯ Organizations and affiliations
ī¯ Point ads to users
ī¯ Related to author disambiguation
89. Thesaurus
Master
Machine Aided
Indexer
(M.A.I.âĸ)
Repository
Search
Presentation:
90% accuracy
Browse by
Subject
Auto-completion
Broader Terms
Narrower Terms
Related Terms
Client Taxonomy
Inline Tagging
Metadata and
Entity Extractor
Automatic
Summarization
Search
Software
Client Data
Full Text
HTML, PDF,
Data Feeds, etc.
Client
taxonomy
Fully integrated SharePoint
Copyright Š 2013 Access Innovations, Inc.
[Data Harmony fully integrated with MOSS.]
90. Select term store management
located under Site Administration
Edit term sets to accurately reflect your document libraries and
content types. Term sets can be individual taxonomies or flat
controlled vocabulary lists. 90
91. Thesaurus Master - 2014
ī¯ Built for vocabulary control
īŽ Taxonomy
īŽ Thesaurus
īŽ Entities
ī¯ Full standards compliance
īŽ ISO 25964 Parts 1 and 2
īŽ NISO Z39.19 â 2010
92. Emphasis Is Available
for Preferred Terms
ī¯ bold, italics, or underline
īŽ Term with emphasized words
īŽ Term with enriched words
īŽ Change Term dialog with enhancement buttons
94. Full Path Export
ī¯ Data Harmony Custom Features as
Implemented for Triumph Learning
ī¯ Kirk Sanders Wednesday @ 11:00
ī¯ Emphasis
ī¯ Full path export
95. Thesaurus Master 2014
ī¯ Emphasis tags â more
ī¯ Wednesday @ 11:00
ī¯ Data Harmony Custom Features as
Implemented for Triumph Learning
Kirk Sanders, Access Innovations, Inc.
98. Web Start
ī¯ Replacing WebThes and ThesViewer
ī¯ Allows auto-start from the browser
ī¯ Full featured
ī¯ Password access control
ī¯ Everything from view only to full access
103. XIS
ī¯ A XIS project consists of the following:
īŽ Folders that XIS uses. These are the âproject
folders.â
īŽ A schema (configuration file) called
projects.MyProject.xml.
īŽ A XIS DTD, called âprojects.dtd.â
105. XIS and Lucene
Search within a search (recursive search)
New Lucene search
Using Lucene for Search within XIS
Allexander Lyons,
Wednesday @ 11:45
106. DHUG 2015
ī¯ Albuquerque
ī¯ February 16 â 20
ī¯ Call for papers is now open
ī¯ Ideas for what to do better and differently
VERY welcome
107. We Apply Imagination
Keep the System Flexible
Make the Applications Fun
Thank you!
Marjorie M.K. Hlava, President,
Access Innovations
505-998-0800
mhlava@accessinn.com