How to Get Enterprise Search Right
Juan J. Celaya
President & CEO
COMPU-DATA International, LLC
jcelaya@cdlac.com
Twitter @conceptsearch
Ken Lemons
VP Federal Programs
Concept Searching
kenl@conceptsearching.com
Expert Speakers
Ken Lemons – VP Federal Programs at Concept Searching
has over 25 years’ experience in the IT industry, with a track record in
consulting, solutions delivery, sales and project management in the federal
sector. He has managed Microsoft consulting practices for several US
government integrators, latterly as VP of Business Development for Air Force
and DoD programs. Ken has provided US DoD agencies with solutions to
address a range of challenges, leveraging a combination of Microsoft and third
party technology solutions.
Juan Celaya – President and CEO at COMPU-DATA International
founded COMPU-DATA International, LLC in 1988 and has been successfully
delivering content and data integration solutions for ECM implementations
through the use of capture, administration, collaboration, retrieval technologies
and solutions. Juan has a broad Information Technology background covering
30 years of experience, with successful system implementations in Energy,
Healthcare, Pharmaceutical and Transportation companies, the US Department
of Defense and the National Nuclear Security Administration.
Agenda
• Introductions
• Concept Searching
• Our Approach and Technologies
• The Challenge of Search
• Products
• COMPU-DATA International
• Why Searching Raw Corporate Content is not Enough
• The Managed Metadata Environment – Your Organization’s “Controlled Vocabulary”
• CDI’s “IGT” Model – A simple approach to the Managed Metadata Environment
• Initial Setup Sample
  • Rules
  • Taxonomy
• Leveraging Automatically Applied Metadata to Deliver
  • Improved Search Results
• Utilizing the Combined Power of
  • Classification and Taxonomy Management Technology
• Company founded in 2002
• Product launched in 2003
• Focus on management of structured and unstructured information
• Technology Platform
• Delivered as a web service
• Automatic concept identification, content tagging, auto-classification,
taxonomy management
• Only statistical vendor that can extract conceptual metadata
• 2009, 2010, 2011, 2012, 2013 ‘100 Companies that Matter in KM’
(KMWorld Magazine) and Trend Setting product of 2009, 2010, 2011, 2012,
2013
• Authority to Operate enterprise wide US Air Force and enterprise wide
NETCON US Army
• Locations: US, UK, and South Africa
• Client base: Fortune 500/1000 organizations
• Managed Partner under Microsoft global ISV Program - ‘go to partner’
for Microsoft for auto-classification and taxonomy management
• Smart Content Framework™ for Information Governance
• Product Suite: conceptSearch, conceptTaxonomyManager, conceptClassifier,
conceptClassifier for SharePoint, conceptTaxonomyWorkflow, conceptContentTypeUpdater for SharePoint
Concept Searching – The Industry Leader in
Managed Metadata Solutions
• Metadata driven application and enforcement of policies - conceptClassifier has been
deployed since 2003 to automatically generate metadata and use that metadata to apply and enforce
policies. Most clients are using the platform to support their information governance strategy.
• Proven, mature functionality out of the box - The platform has been deployed in numerous sites
and applications across the enterprise, including MOSS and SharePoint 2010, 2013, Office 365, Stellent,
Documentum, SQL, Oracle, File Shares, and Exchange via SharePoint.
Smart Content Framework™
Getting It Right
• Concept Searching’s statistical concept identification underpins all technologies
• Multi-word suggestion is more valuable than single-term suggestion
• conceptClassifier generates conceptual metadata by extracting multi-word terms,
identifying ‘triple heart bypass’ as one concept rather than as three single keywords
• conceptTaxonomyManager uses statistical concept
identification to provide real-time feedback during the process
of building, testing, refining, and deploying taxonomies
• Metadata can be used by any search engine index or any
application/process that uses metadata.
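The value of multi-word terms over single keywords can be illustrated with a minimal Python sketch. It is a plain n-gram counter, not Concept Searching's statistical concept identification, which the slides do not describe in detail; the function name, the stopword list and the thresholds are assumptions made for illustration only.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list for this illustration.
STOPWORDS = {"a", "an", "the", "of", "for", "and", "was", "is", "to", "after", "in"}

def candidate_terms(text, max_words=3, min_count=2):
    """Return multi-word sequences (2..max_words words) that repeat in the text."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter()
    for n in range(2, max_words + 1):
        for i in range(len(words) - n + 1):
            gram = words[i:i + n]
            # Skip sequences that start or end with a stopword.
            if gram[0] in STOPWORDS or gram[-1] in STOPWORDS:
                continue
            counts[" ".join(gram)] += 1
    return {term: c for term, c in counts.items() if c >= min_count}

sample = (
    "The patient was scheduled for a triple heart bypass. "
    "Recovery after a triple heart bypass takes several weeks."
)
terms = candidate_terms(sample)
```

In this sample the phrase "triple heart bypass" survives as a candidate term, while the single words "triple", "heart" and "bypass" are never proposed on their own.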
Concept Searching provides Automatic Concept Term Extraction
[Figure: taken one word at a time, “triple heart bypass” is ambiguous]
Triple: Baseball, Three
Heart: Organ, Center
Bypass: Highway, Avoid
Industry Unique Technology
A Manual Metadata Approach Will Fail 95%+ Of The Time
Issue: Organizational Impact
• Inconsistent: Less than 50% of content is correctly indexed, meta-tagged or efficiently searchable, rendering it unusable to the organization. (IDC)
• Risky: 59% of middle managers miss valuable information every day because they can’t find it or never see it. (Accenture)
• Cumbersome - expensive: Average cost of manually tagging one item runs from $4 to $7 per document and does not factor in the accuracy of the meta tags nor the repercussions from mis-tagged content. (Hoovers)
• Malicious compliance: End users select first value in list. (Perspectives on Metadata, Sarah Courier)
• No perceived value for end user: What’s in it for me? End user does not see value for organization nor risks associated with litigation and non-conformance to policies. Less than 14% of end users receive training. (AIIM)
• What have you seen? Metadata will continue to be a problem due to inconsistent human behavior.
The answer to consistent metadata is an automated approach that can extract the meaning
from content, eliminating manual metadata generation yet still providing the ability to manage
knowledge assets in alignment with the unique corporate knowledge infrastructure.
Manual Approach Leads to Failure
Learning to Search
• Searchers do not know “how to search”
  • 56% constructed poor queries
  • Proficiency with the machine does not translate into proficiency with the software
• Searchers get lost in the data
  • 33% had difficulty navigating/orienting search results
  • 28% had difficulty maintaining orientation on a website
• Loss of capacity for discernment
  • 36% did not go beyond the first 3 search results (not pages…results on page 1)
  • 91% did not go beyond the first page of search results
  • 55% selected irrelevant results 1 or more times
• Enterprise versus Internet search is a different animal
  • In the enterprise end users know the information is there if they could only find it
  • As a result, they will spend more time and aggravation looking for that one asset and don’t want to give up (IDC)
• Enterprise end users expect information to be found within 4 minutes but will actually spend 2.5 hours per day looking for information
• How do enterprise users overcome poor search results?
  • Recreate information
  • Use outdated or older versions of information
  • Interrupt a co-worker
  • Forget about finding the information
  • Just don’t start the task
The Typical Search Approach
The Hidden Costs of Search
“There is a debilitating disconnect between the proliferation of electronic information and the
constant need to quickly and accurately find all of the information and expertise that is
essential for work every day. From top to bottom, enterprises have failed to take seriously the
high cost of being grossly inadequate at finding information, data, documents, experts. Instead
they have settled for low performance, low-return techniques to… sort of handle Search.”
Julie Hunt - Search Consultant
The cost to a 500 employee company is
$2.4 million per year in inefficiencies
and lost productivity.
Gartner Group
The Intelligent Search Approach
• People explore concepts – computers find keywords
• Recall versus Precision
• Location Search
• User knows what they are seeking
• Search engine must retrieve exactly and only the information required
• Discovery Search
• User does not know precisely what they are seeking
• Search engine must retrieve content that “appears” to answer the query
• Search engine must be able to accommodate both types of searching
• The hierarchy provided by a taxonomy addresses the two different search
approaches. Location based searches appear simple, but in fact are not.
• If the end user does not immediately find what they are looking for, they can use the
hierarchical structure to drill down by searching the concepts or taxonomy nodes.
• Outcome: Identify associations and relationships that are typically not obvious in
searching
• More relevant information being found more quickly
• Accessing inter-related ideas and concepts supports a fundamental change in
user focus and activity and transforms it from searching to insight and discovery
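The "Recall versus Precision" trade-off between location and discovery search can be made concrete with a short Python sketch. Precision is the share of retrieved documents that are relevant, and recall is the share of relevant documents that were retrieved. The document IDs and result sets are hypothetical and only illustrate the two search styles named in the bullets.

```python
def precision_recall(retrieved, relevant):
    """Compute precision and recall for one query from sets of document IDs."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = retrieved & relevant
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall

# Hypothetical relevance judgments and result sets.
relevant = {"doc1", "doc2", "doc3", "doc4"}
location_search = {"doc1"}  # "exactly and only" the item sought
discovery_search = {"doc1", "doc2", "doc3", "doc7", "doc8", "doc9"}  # broader net

p_loc, r_loc = precision_recall(location_search, relevant)
p_dis, r_dis = precision_recall(discovery_search, relevant)
```

With these made-up sets the location-style result is fully precise but recalls only a quarter of the relevant material, while the discovery-style result recalls more at the cost of precision.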
Taxonomy Navigation
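Drilling down through taxonomy nodes can be sketched in a few lines of Python. This is an assumption-laden illustration, not the conceptClassifier API: the node names "Weather" and "Turbulence Encounter" appear elsewhere in this deck, but their parent/child relationship, the "Icing" node and all file names are hypothetical.

```python
# Hypothetical taxonomy: child node -> parent node (None marks the root).
PARENT = {
    "Weather": None,
    "Turbulence Encounter": "Weather",
    "Icing": "Weather",
}

# Hypothetical documents and the taxonomy nodes they were tagged with.
DOC_TAGS = {
    "report-a.docx": {"Turbulence Encounter"},
    "report-b.docx": {"Icing"},
    "memo-c.docx": {"Weather"},
}

def descendants(node):
    """Return the node plus every node beneath it in the hierarchy."""
    found = {node}
    changed = True
    while changed:
        changed = False
        for child, parent in PARENT.items():
            if parent in found and child not in found:
                found.add(child)
                changed = True
    return found

def drill_down(node):
    """List documents tagged with the node or any of its descendants."""
    nodes = descendants(node)
    return sorted(doc for doc, tags in DOC_TAGS.items() if tags & nodes)
```

Calling `drill_down("Weather")` returns every document under that branch, and `drill_down("Icing")` narrows the list to a single node, which is the navigation pattern the slide describes.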
It’s Not Just About Search!!
• Data Privacy
• Records Management
• Migration
• Enterprise Content Management
• Information Governance
• Legal, eDiscovery, FOIA
• Collaboration/Social
• Text Analytics
conceptClassifier for SharePoint
• conceptClassifier for SharePoint
• Combination of automatic classification, taxonomy management and Concept Searching’s
APIs packaged for delivery into the SharePoint environment
• Single code base that can be deployed with SharePoint 2007, 2010, 2013, and Office 365
• Provides clients with the choice of on-premise, cloud based, or hybrid solutions to best meet
their needs
• Integrates with any search engine (SharePoint, former FAST products, Google Search
Appliance, etc.)
• Classifies content as it is created or ingested from diverse repositories within and outside of
SharePoint
• conceptTaxonomyWorkflow
• Optional component that can perform an action on a document following a classification
decision when the criteria are met
• Built with a plug-in architecture enabling the simple development of content sources
• Uses records retention codes, semantic, and security metadata associated to data assets to
identify and process the automatic application of content types
• Once documents have the appropriate content type, based on natural language and
automatically applied metadata, workflows can be initiated.
• Workflow source type works in SharePoint 2007, 2010, and 2013, and for all
SharePoint, FILE, and HTTP document types
conceptClassifier for Office 365
• Runs natively and bi-directionally with the SharePoint Term Store in any
environment
• Portability – ubiquitous access to information regardless of where it resides or
how it is stored
• Provides “intelligent” migration capabilities
• Enables management of one term store for on premise and Office 365 use
• Maintains GUIDs
• Delivers enterprise class automatic document classification for all SharePoint,
FILE, and HTTP document types
• Protects records and confidential information from inadvertently being placed in
the cloud to avoid data breaches and unauthorized access to information
• Enables concept based search and retrieval integrated with Microsoft search
solutions
• Provides a method to enable text analytics from multiple data sources without
impacting on-premise server utilization
Getting Enterprise Search Right by Using Your Own Vocabulary for Indexing and Searching
Preserving the World's Knowledge - Available Anytime Anywhere (SM)
©2013 COMPU-DATA International, LLC, All Rights Reserved
COMPU-DATA International, LLC
Many Thanks To
• Why Searching Raw Corporate Content is not enough
• The Managed Metadata Environment – Your Organization’s “Controlled Vocabulary”
• CDI’s “IGT” Model – A simple approach to the Managed Metadata Environment
• Initial Setup Sample:
  • Rules
  • Taxonomy
• Leveraging Automatically Applied Metadata to Deliver:
  • Improved Search Results
• Utilizing the combined power of:
  • Classification & Taxonomy Management Technology
Presentation Overview
Searching Raw Corporate Content is not enough
• Limited Resources
• Corporate Content Control
  • Not written for SEO
• Not HTML
  • No Title Tags
  • Header Tags
  • Content missing keywords
  • No Meta Tags
• How do you make content relevant?
  • Add corporate vocabulary (Tags, Tags and more Tags). Keywords too!
  • Adapt to the changing business environment
  • Leverage automatic metadata tagging
You are not Google, Bing or Yahoo!
“The managed meta data environment represents the architectural
components, people and processes that are required to properly and
systematically gather, retain and disseminate meta data throughout the
enterprise.”
Components:
1. Meta data sourcing layer
2. Meta data integration layer
3. Meta data repository
4. Meta data management layer
5. Meta data marts
6. Meta data delivery layer
Managed Metadata
Environment*
*From the book Universal Meta Data Models by David Marco
and Michael Jennings (John Wiley & Sons)
The “IGT” Model:
1. Metadata Identification
2. Automatic Metadata Generation
3. Automatic Metadata Tagging
Delivering a Dependable
Automatic Data Enhancement Solution
Applicable Across the Enterprise
Implementing a Simple
Managed Metadata Environment
The “IGT” Model:
1. Metadata Identification
Metadata Identification
1. The process to create the rules by which Metadata can be identified for electronic content and data files.
2. Requires technologies to process text content, such as that from Concept Searching, and others that can be used on images, audio, and video.
3. Requires Expertise: Data Science, Classification, Statistical Theory, Content SMEs, Library Sciences, Records Management, and other areas for which content must be identified.
4. Recursive process receiving input from actual results and new input from business needs and requirements.
Implementing a Simple
Managed Metadata Environment
The “IGT” Model:
1. Metadata Identification
2. Automatic Metadata Generation
Automatic Metadata Generation
1. The process to automatically provide Metadata to a requestor based on a predefined set of rules and processes.
2. Functionality is either included in the tool and technology used or built to support this function.
3. Requires Expertise: Programming and integration, knowledge of tool and technology APIs and functionality.
Implementing a Simple
Managed Metadata Environment
The “IGT” Model:
1. Metadata Identification
2. Automatic Metadata Generation
3. Automatic Metadata Tagging
Automatic Metadata Tagging
1. The process and methods by which content is enhanced when one or more Metadata Tags are attached, assigned, joined or associated with the content’s electronic file through an automated process.
2. Functionality is either included in the tool and technology used, provided by a 3rd party, or built internally.
3. Requires Expertise: Programming and integration, knowledge of tool and technology APIs and process functionality.
4. Provides feedback to improve Metadata Identification.
Implementing a Simple
Managed Metadata Environment
Overview of CDI’s “IGT” Model steps to a Successful Automatic Metadata Tagging Solution
STEP #1: Metadata Identification
The process to create the rules by which Metadata can be identified.
Tools & Technologies: Text based technology such as Concept Searching, and others that can be used on images, audio, and video.
Required Expertise: Data Science, Classification, Statistical Theory, Content SMEs, Library Sciences, Records Management.
STEP #2: Automatic Metadata Generation
The process to Automatically Generate Metadata for content by using the technologies used and rules defined in the Metadata Identification process.
Functionality either included in the tool and technology used or it is built to support the tool and technology.
Required Expertise: Programming and integration, knowledge of tool and technology APIs and functionality.
STEP #3: Automatic Metadata Tagging
The process and methods by which one or more Metadata Tags are attached, assigned, joined or associated to an electronic file.
Functionality either included in the tool and technology used, it is provided from a 3rd party or it is built internally.
Required Expertise: Programming and integration, knowledge of tool and technology APIs and process functionality.
Feedback from actual results
Implementing a Simple
Managed Metadata Environment
Advantages provided by each step in the “IGT” Model
Metadata Identification
1. Standardizes identification rules and establishes a common metadata vocabulary.
2. Can be managed centrally or in a distributed way for different levels of the organization as needed.
3. Provides for ad hoc support based on needs arising from eDiscovery and litigation.
4. Improves reliability in the result lists delivered by the enterprise search technologies used.
5. Multiple approaches and technologies can be used that are applicable to varying content.
6. Easy to distribute improvements as feedback is applied.
Automatic Metadata Generation
1. SOP across the organization for getting corporate, departmental, project and any other applicable metadata.
2. Single interface for access to varying metadata identification technologies.
Automatic Metadata Tagging
1. Enforces rules established in the organization and reduces costs due to changes in regulations/business.
2. Eliminates human factors and behaviors that contribute to negative outcomes associated with metadata assignment.
3. Makes it easier to achieve “findability”, enforce access controls, apply corporate governance and implement a record retention schedule.
4. Centralized and distributed tagging integration and control of data sources.
5. Bridges the implementation of any metadata identification technology to any data source.
Note: Advantages of each step are not “all inclusive”
[Diagram: three supported business functions (labeled A, B and C) and the taxonomies of the Taxonomy Management Environment]
• Supports Storage & Preservation Business Function: application of record retention codes with metadata deriving storage & preservation
• Supports Information Rights Management Business Functions: content types drive automatic application of RMS templates
• Supports Search Business Function: application of metadata drives search precision
TAXONOMY MANAGEMENT ENVIRONMENT (Developed using Taxonomy Management Technology)
• Data Privacy and Security Compliance Taxonomies (Data Privacy & Security Enforcement, see (A, B))
• Data Transparency by Metadata Tagging Compliance (Access Policies & Rights Management, see (A, C))
• Records Management Compliance Taxonomies (Records Retention Schedules, see (A))
• Improved Enterprise Search (Search Precision, see (B, C))
• The Taxonomy Management
Environment (TME) is made up of the
technologies, processes and utilities
used to create and administer
Taxonomies.
• Within the TME subject matter experts,
librarians and operations personnel
work together to create taxonomies
which define the business rules by
which content files are analyzed.
• These taxonomies are then used to
classify electronic content and associate
each applicable rule to the content file
processed.
• Content is “Automatically Tagged” with
the appropriate rule identifier and
corresponding vocabularies used within
the organization.
Taxonomy Management Environment
A B C
SharePoint Implementation
[Diagram labels: Document Creation/Modification by User; SharePoint; conceptClassifier Classification Engine; Tagging Processes; Update Content Types; Enterprise Search (search on user’s compliance domain, based on document content & metadata); Document Search by User. The diagram also repeats the Taxonomy Management Environment taxonomies and the three supported business functions (A, B, C) from the previous slide.]
Enterprise Implementation
[Diagram labels: Document Creation/Modification by User; Shared Drives; SharePoint; Other Data Sources; Real Time Tagging; Metadata Tagging & Workflow of Data Source Content Managed by DES*; Auto Tagging of Content Files; Sensitive Info? (Y/N); Move file to another location; Move to SharePoint; Manage Relations of Rules & Content; conceptClassifier Classification Engine; Update Content Types; Enterprise Search (search on user’s compliance domain, based on document content & metadata); Document Search by User. The diagram also repeats the Taxonomy Management Environment taxonomies and the three supported business functions (A, B, C).]
DES* automatically tags & classifies content based on semantics contained within actual content files.
*Data Enhancement System™ (DES) is a patent pending product from COMPU-DATA International, LLC.
CDI’s DES (Data Enhancement System™) Management Layer
Automated Metadata Tagging of Content To/From Data Sources
[Diagram labels: Taxonomy Management Environment; Taxonomy Managers & Subject Matter Experts; Librarian/Client-Generated Taxonomies & Fields (Taxonomy A with TA Fields, Taxonomy B with TB Fields, ..., Taxonomy N with TN Fields); Data store for testing new taxonomies; Metadata Generation Technologies (conceptClassifier Tool, Other File Format Tools); Manage Automatic Metadata Tagging of Content; Direct Data Source Monitoring; Automated Tagging of Content; Data Sources (SharePoint® in the Cloud, SharePoint® On-Premise, Shared Drives, Other ECMs/APPs). ©2012 - 2013 COMPU-DATA International, LLC, All Rights Reserved]
CDI’s DES (Data Enhancement System™) Management Layer
Automated Metadata Tagging of Content To/From Data Generation Sources
[Diagram labels: Taxonomy Managers & Subject Matter Experts; Librarian/Client-Generated Taxonomies & Fields (Taxonomy A with TA Fields, Taxonomy B with TB Fields, ..., Taxonomy N with TN Fields); Data store for testing new taxonomies; Metadata Generation Technologies (conceptClassifier Tool, Other File Format Tools); Manage Automatic Metadata Tagging of Content; Integration with Capture, Processes & Applications; Automated Tagging & Delivery of Content; Business & Application Processes; Kodak Capture Pro; Info Activate; Data Sources (SharePoint® in the Cloud, SharePoint® On-Premise, Shared Drives, Other ECMs/APPs).]
Data Enhancement Through Automatic Metadata Tagging
[Diagram: the three supported business functions (Storage & Preservation, Information Rights Management, Search) are linked by the labels A, B and C to four outcomes]
• Data Privacy and Security Compliance: Data Privacy & Security Enforcement
• Data Transparency by Metadata Tagging Compliance: Access Policies & Rights Management
• Records Management Compliance: Records Retention Schedules
• Improved Enterprise Search: Search Precision
Data Enhancement Through Automatic Metadata Tagging
Automatic Metadata Tagging to all Data Sources using your own Vocabulary
Identify, Retrieve, Secure & Control Access to Data Assets across Distinct Business & Service Delivery Units
• Supports Search Business Function: application of metadata drives search precision (Improved Enterprise Search, Search Precision)
• Supports Information Rights Management Business Functions: content types drive automatic application of RMS templates
Taxonomy Management
Enabling the Automatic Meta-tagging and Auto-Classification of Documents and Records
Each node is a piece of metadata that gets tagged to a document or record based upon the
prevalence of a clue within the document
Manually Created Metadata associated
with the concept of “Weather”
Distribution Statement A: Approved for public release; distribution is unlimited
311 ABG/PA No. 09-488, 16 Oct 2009
Automatic Metadata Generation
Unique IP of Compound Term Processing enables the identification of compound terms
(not keywords) from highly relevant content that can be used to trigger the automatic
meta-tagging and auto-classification processes
Automatically Generated
Metadata associated with
the concept of “Weather”
Automatic Metadata Generation
Automatically generated metadata is added to original metadata for the category/folder
Outcome: more semantics that can be linked to a document or record result in information that
becomes more actionable (the document/record is now retrievable and classifiable)
Highly relevant metadata generated by
Taxonomy Manager added to original clue
set for the concept of “Weather”
Automatic Meta-tagging
Metatags are automatically added to the properties field of each document
enhancing the document’s value to the organization by increasing
the ability of the document to be retrieved using enterprise search
solutions that use keywords and metadata to retrieve information
Automatic Meta-tagging in Action
One of the Metatags for the document was “Turbulence Encounter” however when
we search for this term within the document we do not find it
Why did this happen?
Automatic Meta-tagging in Action
Turbulence Encounter is only one of 4 “clues” that must exist within a document in order for
that document to be automatically meta-tagged with the concept of Turbulence Encounter
Automatic Meta-tagging in Action
When we search the document using another clue for Turbulence Encounter, “Windshear”,
we see that its existence within the document triggered the automated meta-tagging event
that resulted in the document being tagged with “Turbulence Encounter”
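The Windshear example can be reproduced with a minimal Python sketch of clue-based tagging. It assumes that a single matching clue is enough to apply a tag, as the example above suggests; a real classifier may weight or score clues. Only the node name "Turbulence Encounter" and the clue "Windshear" come from the slides; the function name and the two placeholder clues are hypothetical.

```python
# Hypothetical clue set for one taxonomy node. Only the node name and the
# "windshear" clue come from the slides; the last two entries are placeholders.
CLUES = {
    "Turbulence Encounter": [
        "turbulence encounter",
        "windshear",
        "placeholder clue 3",
        "placeholder clue 4",
    ],
}

def auto_tag(text, clues=CLUES):
    """Return {tag: [clues found]} for every node with at least one clue in the text."""
    lowered = text.lower()
    result = {}
    for tag, phrases in clues.items():
        found = [p for p in phrases if p in lowered]
        if found:
            result[tag] = found
    return result

doc = "The crew reported windshear on final approach."
tags = auto_tag(doc)
```

Here the document is tagged "Turbulence Encounter" although that phrase never occurs in its text, and the returned clue list shows which clue triggered the tag.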
Automatic Meta-tagging
Metatags are automatically added to the properties field of each document
making the document more valuable to the organization by increasing
the ability of the document to be retrieved using enterprise search
solutions that use keywords and metadata to retrieve information
Automatically Apply Semantic Metadata, Records Retention Codes, and Data Privacy & Security Metadata
To Every Document in SharePoint, File Shares and other Data Sources
[Screenshot labels: “Corporate Vocabulary” as the Metadata Environment (Taxonomy); “Non-Preferred Vocabulary Terms”]
• US Army - Records Management and Declassification Agency (RMDA)
• Solution
• An integrated search solution to consolidate over 20 data sources and
databases into just a few repositories, with access to high data volumes
• Benefits
• Centralized data standardization
• Increased findability
• Terabyte size data store support
• Increased productivity
• Scalable
Use Case - RMDA
• A global company which designs, manufactures, and distributes
appliances to more than 70 countries
• Solution
• An intranet and content management solution to improve search,
sensitive/confidential information protection, and records identification
and tagging
• Benefits
• Accurate search
• Data privacy
• Effective records management
Use Case – Global Appliance Company
Time for a Demo
Let’s Tie all of this Together
With a Live Demo
Thank You
Juan J. Celaya
President & CEO
COMPU-DATA International, LLC
jcelaya@cdlac.com
Twitter @conceptsearch
Ken Lemons
VP Federal Programs
Concept Searching
kenl@conceptsearching.com

Transform Your Downstream Cloud Analytics with Data Quality Precisely
 
Optimising Your Content for Findability
Optimising Your Content for FindabilityOptimising Your Content for Findability
Optimising Your Content for FindabilityFindwise
 
Sweeny group think-ias2015
Sweeny group think-ias2015Sweeny group think-ias2015
Sweeny group think-ias2015Marianne Sweeny
 
The evolution of Search spscinci
The evolution of Search spscinciThe evolution of Search spscinci
The evolution of Search spscinciJohnny Lopez
 

Mais procurados (20)

Climbing the Slippery Slope of SharePoint Migrations Webinar
Climbing the Slippery Slope of SharePoint Migrations WebinarClimbing the Slippery Slope of SharePoint Migrations Webinar
Climbing the Slippery Slope of SharePoint Migrations Webinar
 
Data-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesData-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance Strategies
 
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy WebinarThe Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
 
Data-Ed Online: Trends in Data Modeling
Data-Ed Online: Trends in Data ModelingData-Ed Online: Trends in Data Modeling
Data-Ed Online: Trends in Data Modeling
 
Data Profiling: The First Step to Big Data Quality
Data Profiling: The First Step to Big Data QualityData Profiling: The First Step to Big Data Quality
Data Profiling: The First Step to Big Data Quality
 
Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By Design
 
ARMA Calgary Spring Seminar: The Nuts and Bolts of Metadata Tagging and Taxon...
ARMA Calgary Spring Seminar: The Nuts and Bolts of Metadata Tagging and Taxon...ARMA Calgary Spring Seminar: The Nuts and Bolts of Metadata Tagging and Taxon...
ARMA Calgary Spring Seminar: The Nuts and Bolts of Metadata Tagging and Taxon...
 
2016 Building Bridges - Need for a Data Management Strategy
2016 Building Bridges - Need for a Data Management Strategy2016 Building Bridges - Need for a Data Management Strategy
2016 Building Bridges - Need for a Data Management Strategy
 
Km share point and semantik-va
Km   share point and semantik-vaKm   share point and semantik-va
Km share point and semantik-va
 
Data-Ed Online: Emerging Trends in Data Jobs
Data-Ed Online: Emerging Trends in Data JobsData-Ed Online: Emerging Trends in Data Jobs
Data-Ed Online: Emerging Trends in Data Jobs
 
DataEd Online: Building the Case for the Top Data Job
DataEd Online: Building the Case for the Top Data JobDataEd Online: Building the Case for the Top Data Job
DataEd Online: Building the Case for the Top Data Job
 
Metadata Matters
Metadata MattersMetadata Matters
Metadata Matters
 
A Data Integration Case Study - Avoid Creating a “Franken-Beast”
A Data Integration Case Study - Avoid  Creating a “Franken-Beast”A Data Integration Case Study - Avoid  Creating a “Franken-Beast”
A Data Integration Case Study - Avoid Creating a “Franken-Beast”
 
How to Get Started with Your MongoDB Pilot Project
How to Get Started with Your MongoDB Pilot ProjectHow to Get Started with Your MongoDB Pilot Project
How to Get Started with Your MongoDB Pilot Project
 
Benchmarking IT Agility Final Report
Benchmarking IT Agility Final ReportBenchmarking IT Agility Final Report
Benchmarking IT Agility Final Report
 
Transform Your Downstream Cloud Analytics with Data Quality 
Transform Your Downstream Cloud Analytics with Data Quality Transform Your Downstream Cloud Analytics with Data Quality 
Transform Your Downstream Cloud Analytics with Data Quality 
 
Optimising Your Content for Findability
Optimising Your Content for FindabilityOptimising Your Content for Findability
Optimising Your Content for Findability
 
Sweeny group think-ias2015
Sweeny group think-ias2015Sweeny group think-ias2015
Sweeny group think-ias2015
 
The evolution of Search spscinci
The evolution of Search spscinciThe evolution of Search spscinci
The evolution of Search spscinci
 
2014 dqe handouts
2014 dqe handouts2014 dqe handouts
2014 dqe handouts
 

Semelhante a How to Get Enterprise Search Right with Automatic Metadata

Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...Concept Searching, Inc
 
Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...
Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...
Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...Concept Searching, Inc
 
Why Metadata Matters in SharePoint Search and Information Governance Webinar
Why Metadata Matters in SharePoint Search and Information Governance WebinarWhy Metadata Matters in SharePoint Search and Information Governance Webinar
Why Metadata Matters in SharePoint Search and Information Governance WebinarConcept Searching, Inc
 
Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!Concept Searching, Inc
 
Data Breaches and Security Rights in SharePoint Webinar
Data Breaches and Security Rights in SharePoint WebinarData Breaches and Security Rights in SharePoint Webinar
Data Breaches and Security Rights in SharePoint WebinarConcept Searching, Inc
 
84% of Migration Projects Fail – Getting it Right in SharePoint Webinar
84% of Migration Projects Fail – Getting it Right in SharePoint Webinar84% of Migration Projects Fail – Getting it Right in SharePoint Webinar
84% of Migration Projects Fail – Getting it Right in SharePoint WebinarConcept Searching, Inc
 
Going Meta in SharePoint – Tricks of the Trade
Going Meta in SharePoint – Tricks of the TradeGoing Meta in SharePoint – Tricks of the Trade
Going Meta in SharePoint – Tricks of the TradeConcept Searching, Inc
 
Using Metadata-Driven Taxonomies to Solve Business Problems
Using Metadata-Driven Taxonomies to Solve Business ProblemsUsing Metadata-Driven Taxonomies to Solve Business Problems
Using Metadata-Driven Taxonomies to Solve Business ProblemsConcept Searching, Inc
 
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...Concept Searching, Inc
 
Metadata Matters – Collaboration, Search, and Information Governance at Brail...
Metadata Matters – Collaboration, Search, and Information Governance at Brail...Metadata Matters – Collaboration, Search, and Information Governance at Brail...
Metadata Matters – Collaboration, Search, and Information Governance at Brail...Concept Searching, Inc
 
Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...
Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...
Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...Concept Searching, Inc
 
Groundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search WebinarGroundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search WebinarConcept Searching, Inc
 
conceptClassifier For SharePoint Driving Business Value
conceptClassifier For SharePoint Driving Business ValueconceptClassifier For SharePoint Driving Business Value
conceptClassifier For SharePoint Driving Business Valuemartingarland
 
Concept Searching Webinar P
Concept Searching Webinar PConcept Searching Webinar P
Concept Searching Webinar PPaul_Billingham
 
How to Get the Most Out of Search Webinar
How to Get the Most Out of Search WebinarHow to Get the Most Out of Search Webinar
How to Get the Most Out of Search WebinarConcept Searching, Inc
 
Enterprise Search, Simple, Complex and Powerful
Enterprise Search, Simple, Complex and PowerfulEnterprise Search, Simple, Complex and Powerful
Enterprise Search, Simple, Complex and PowerfulFindwise
 
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...Concept Searching, Inc
 
Search Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint Summit
Search Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint SummitSearch Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint Summit
Search Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint SummitJoel Oleson
 

Semelhante a How to Get Enterprise Search Right with Automatic Metadata (20)

Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
 
Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...
Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...
Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...
 
Why Metadata Matters in SharePoint Search and Information Governance Webinar
Why Metadata Matters in SharePoint Search and Information Governance WebinarWhy Metadata Matters in SharePoint Search and Information Governance Webinar
Why Metadata Matters in SharePoint Search and Information Governance Webinar
 
Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!
 
Data Breaches and Security Rights in SharePoint Webinar
Data Breaches and Security Rights in SharePoint WebinarData Breaches and Security Rights in SharePoint Webinar
Data Breaches and Security Rights in SharePoint Webinar
 
SharePoint Fest Chicago Presentation
SharePoint Fest Chicago PresentationSharePoint Fest Chicago Presentation
SharePoint Fest Chicago Presentation
 
84% of Migration Projects Fail – Getting it Right in SharePoint Webinar
84% of Migration Projects Fail – Getting it Right in SharePoint Webinar84% of Migration Projects Fail – Getting it Right in SharePoint Webinar
84% of Migration Projects Fail – Getting it Right in SharePoint Webinar
 
Going Meta in SharePoint – Tricks of the Trade
Going Meta in SharePoint – Tricks of the TradeGoing Meta in SharePoint – Tricks of the Trade
Going Meta in SharePoint – Tricks of the Trade
 
Using Metadata-Driven Taxonomies to Solve Business Problems
Using Metadata-Driven Taxonomies to Solve Business ProblemsUsing Metadata-Driven Taxonomies to Solve Business Problems
Using Metadata-Driven Taxonomies to Solve Business Problems
 
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
 
Metadata Matters – Collaboration, Search, and Information Governance at Brail...
Metadata Matters – Collaboration, Search, and Information Governance at Brail...Metadata Matters – Collaboration, Search, and Information Governance at Brail...
Metadata Matters – Collaboration, Search, and Information Governance at Brail...
 
Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...
Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...
Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...
 
FAST Search-webinar-06-29-2010
FAST Search-webinar-06-29-2010FAST Search-webinar-06-29-2010
FAST Search-webinar-06-29-2010
 
Groundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search WebinarGroundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search Webinar
 
conceptClassifier For SharePoint Driving Business Value
conceptClassifier For SharePoint Driving Business ValueconceptClassifier For SharePoint Driving Business Value
conceptClassifier For SharePoint Driving Business Value
 
Concept Searching Webinar P
Concept Searching Webinar PConcept Searching Webinar P
Concept Searching Webinar P
 
How to Get the Most Out of Search Webinar
How to Get the Most Out of Search WebinarHow to Get the Most Out of Search Webinar
How to Get the Most Out of Search Webinar
 
Enterprise Search, Simple, Complex and Powerful
Enterprise Search, Simple, Complex and PowerfulEnterprise Search, Simple, Complex and Powerful
Enterprise Search, Simple, Complex and Powerful
 
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...
 
Search Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint Summit
Search Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint SummitSearch Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint Summit
Search Strategy for Enterprise SharePoint 2013 - Vancouver SharePoint Summit
 

Mais de Concept Searching, Inc

ARMA NOVA’s Auto-Categorization Showcase
ARMA NOVA’s Auto-Categorization Showcase ARMA NOVA’s Auto-Categorization Showcase
ARMA NOVA’s Auto-Categorization Showcase Concept Searching, Inc
 
Why You Need Metadata-Driven Records Management Webinar
Why You Need Metadata-Driven Records Management WebinarWhy You Need Metadata-Driven Records Management Webinar
Why You Need Metadata-Driven Records Management WebinarConcept Searching, Inc
 
Using Metadata and Classification in Records Management
Using Metadata and Classification in Records ManagementUsing Metadata and Classification in Records Management
Using Metadata and Classification in Records ManagementConcept Searching, Inc
 
Discovery, Risk, and Insight in a Metadata-Driven World Webinar
Discovery, Risk, and Insight in a Metadata-Driven World WebinarDiscovery, Risk, and Insight in a Metadata-Driven World Webinar
Discovery, Risk, and Insight in a Metadata-Driven World WebinarConcept Searching, Inc
 
Drowning in Data and Starving for Information
Drowning in Dataand Starving for InformationDrowning in Dataand Starving for Information
Drowning in Data and Starving for InformationConcept Searching, Inc
 
Metadata-Driven Cleanup of Files, Content, and Email Webinar
Metadata-Driven Cleanup of Files, Content, and Email WebinarMetadata-Driven Cleanup of Files, Content, and Email Webinar
Metadata-Driven Cleanup of Files, Content, and Email WebinarConcept Searching, Inc
 
Going Meta – How to use Metadata in SharePoint
Going Meta – How to use Metadata in SharePointGoing Meta – How to use Metadata in SharePoint
Going Meta – How to use Metadata in SharePointConcept Searching, Inc
 
Why You Need Intelligent Metadata and Auto-classification in Records Management
Why You Need Intelligent Metadata and Auto-classification in Records ManagementWhy You Need Intelligent Metadata and Auto-classification in Records Management
Why You Need Intelligent Metadata and Auto-classification in Records ManagementConcept Searching, Inc
 
Enough Talk – Solving GDPR Problems Through Metadata-Driven Compliance Webinar
Enough Talk – Solving GDPR Problems Through Metadata-Driven Compliance WebinarEnough Talk – Solving GDPR Problems Through Metadata-Driven Compliance Webinar
Enough Talk – Solving GDPR Problems Through Metadata-Driven Compliance WebinarConcept Searching, Inc
 
What You Don’t Know May Hurt You – Achieving Insight and Knowledge Discovery
What You Don’t Know May Hurt You – Achieving Insight and Knowledge DiscoveryWhat You Don’t Know May Hurt You – Achieving Insight and Knowledge Discovery
What You Don’t Know May Hurt You – Achieving Insight and Knowledge DiscoveryConcept Searching, Inc
 
Going Meta – How to Use Metadata in SharePoint and Office 365
Going Meta – How to Use Metadata in SharePoint and Office 365Going Meta – How to Use Metadata in SharePoint and Office 365
Going Meta – How to Use Metadata in SharePoint and Office 365Concept Searching, Inc
 
Eliminate the 49% of Documents that Contain Data Breaches Webinar
Eliminate the 49% of Documents that Contain Data Breaches WebinarEliminate the 49% of Documents that Contain Data Breaches Webinar
Eliminate the 49% of Documents that Contain Data Breaches WebinarConcept Searching, Inc
 
SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...
SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...
SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...Concept Searching, Inc
 
Why Most Migration Projects Fail – Don’t Be a Statistic Webinar
Why Most Migration Projects Fail – Don’t Be a Statistic WebinarWhy Most Migration Projects Fail – Don’t Be a Statistic Webinar
Why Most Migration Projects Fail – Don’t Be a Statistic WebinarConcept Searching, Inc
 
ECM or CLM? A Fight to the Finish Webinar
ECM or CLM? A Fight to the Finish WebinarECM or CLM? A Fight to the Finish Webinar
ECM or CLM? A Fight to the Finish WebinarConcept Searching, Inc
 
Collaboration Can Be Dangerous Webinar
Collaboration Can Be Dangerous WebinarCollaboration Can Be Dangerous Webinar
Collaboration Can Be Dangerous WebinarConcept Searching, Inc
 
SharePoint and Office 365 State of the Market Survey Results Webinar
SharePoint and Office 365 State of the Market Survey Results WebinarSharePoint and Office 365 State of the Market Survey Results Webinar
SharePoint and Office 365 State of the Market Survey Results WebinarConcept Searching, Inc
 
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...Concept Searching, Inc
 
The Value of Adding Managed Metadata to Microsoft Online Search
The Value of Adding Managed Metadata to Microsoft Online SearchThe Value of Adding Managed Metadata to Microsoft Online Search
The Value of Adding Managed Metadata to Microsoft Online SearchConcept Searching, Inc
 
Exploring Automatic Metadata Generation Based on SharePoint Term Sets
Exploring Automatic Metadata Generation Based on SharePoint Term SetsExploring Automatic Metadata Generation Based on SharePoint Term Sets
Exploring Automatic Metadata Generation Based on SharePoint Term SetsConcept Searching, Inc
 

Mais de Concept Searching, Inc (20)

ARMA NOVA’s Auto-Categorization Showcase
ARMA NOVA’s Auto-Categorization Showcase ARMA NOVA’s Auto-Categorization Showcase
ARMA NOVA’s Auto-Categorization Showcase
 
Why You Need Metadata-Driven Records Management Webinar
Why You Need Metadata-Driven Records Management WebinarWhy You Need Metadata-Driven Records Management Webinar
Why You Need Metadata-Driven Records Management Webinar
 
Using Metadata and Classification in Records Management
Using Metadata and Classification in Records ManagementUsing Metadata and Classification in Records Management
Using Metadata and Classification in Records Management
 
Discovery, Risk, and Insight in a Metadata-Driven World Webinar
Discovery, Risk, and Insight in a Metadata-Driven World WebinarDiscovery, Risk, and Insight in a Metadata-Driven World Webinar
Discovery, Risk, and Insight in a Metadata-Driven World Webinar
 
Drowning in Data and Starving for Information
Drowning in Dataand Starving for InformationDrowning in Dataand Starving for Information
Drowning in Data and Starving for Information
 
Metadata-Driven Cleanup of Files, Content, and Email Webinar
Metadata-Driven Cleanup of Files, Content, and Email WebinarMetadata-Driven Cleanup of Files, Content, and Email Webinar
Metadata-Driven Cleanup of Files, Content, and Email Webinar
 
Going Meta – How to use Metadata in SharePoint
Going Meta – How to use Metadata in SharePointGoing Meta – How to use Metadata in SharePoint
Going Meta – How to use Metadata in SharePoint
 
Why You Need Intelligent Metadata and Auto-classification in Records Management
Why You Need Intelligent Metadata and Auto-classification in Records ManagementWhy You Need Intelligent Metadata and Auto-classification in Records Management
Why You Need Intelligent Metadata and Auto-classification in Records Management
 
Enough Talk – Solving GDPR Problems Through Metadata-Driven Compliance Webinar
Enough Talk – Solving GDPR Problems Through Metadata-Driven Compliance WebinarEnough Talk – Solving GDPR Problems Through Metadata-Driven Compliance Webinar
Enough Talk – Solving GDPR Problems Through Metadata-Driven Compliance Webinar
 
What You Don’t Know May Hurt You – Achieving Insight and Knowledge Discovery
What You Don’t Know May Hurt You – Achieving Insight and Knowledge DiscoveryWhat You Don’t Know May Hurt You – Achieving Insight and Knowledge Discovery
What You Don’t Know May Hurt You – Achieving Insight and Knowledge Discovery
 
Going Meta – How to Use Metadata in SharePoint and Office 365
Going Meta – How to Use Metadata in SharePoint and Office 365Going Meta – How to Use Metadata in SharePoint and Office 365
Going Meta – How to Use Metadata in SharePoint and Office 365
 
Eliminate the 49% of Documents that Contain Data Breaches Webinar
Eliminate the 49% of Documents that Contain Data Breaches WebinarEliminate the 49% of Documents that Contain Data Breaches Webinar
Eliminate the 49% of Documents that Contain Data Breaches Webinar
 
SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...
SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...
SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...
 
Why Most Migration Projects Fail – Don’t Be a Statistic Webinar
Why Most Migration Projects Fail – Don’t Be a Statistic WebinarWhy Most Migration Projects Fail – Don’t Be a Statistic Webinar
Why Most Migration Projects Fail – Don’t Be a Statistic Webinar
 
ECM or CLM? A Fight to the Finish Webinar
ECM or CLM? A Fight to the Finish WebinarECM or CLM? A Fight to the Finish Webinar
ECM or CLM? A Fight to the Finish Webinar
 
Collaboration Can Be Dangerous Webinar
Collaboration Can Be Dangerous WebinarCollaboration Can Be Dangerous Webinar
Collaboration Can Be Dangerous Webinar
 
SharePoint and Office 365 State of the Market Survey Results Webinar
SharePoint and Office 365 State of the Market Survey Results WebinarSharePoint and Office 365 State of the Market Survey Results Webinar
SharePoint and Office 365 State of the Market Survey Results Webinar
 
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
 
The Value of Adding Managed Metadata to Microsoft Online Search
The Value of Adding Managed Metadata to Microsoft Online SearchThe Value of Adding Managed Metadata to Microsoft Online Search
The Value of Adding Managed Metadata to Microsoft Online Search
 
Exploring Automatic Metadata Generation Based on SharePoint Term Sets
Exploring Automatic Metadata Generation Based on SharePoint Term SetsExploring Automatic Metadata Generation Based on SharePoint Term Sets
Exploring Automatic Metadata Generation Based on SharePoint Term Sets
 

Último

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
How to Get Enterprise Search Right with Automatic Metadata

  • 1. How to Get Enterprise Search Right Juan J. Celaya President & CEO COMPU-DATA International, LLC jcelaya@cdlac.com Twitter @conceptsearch Ken Lemons VP Federal Programs Concept Searching kenl@conceptsearching.com
  • 2. Expert Speakers Ken Lemons – VP Federal Programs at Concept Searching has over 25 years’ experience in the IT industry, with a track record in consulting, solutions delivery, sales and project management in the federal sector. He has managed Microsoft consulting practices for several US government integrators, latterly as VP of Business Development for Air Force and DoD programs. Ken has provided US DoD agencies with solutions to address a range of challenges, leveraging a combination of Microsoft and third party technology solutions. Juan Celaya – President and CEO at COMPU-DATA International founded COMPU-DATA International, LLC in 1988 and has been successfully delivering content and data integration solutions for ECM implementations through the use of capture, administration, collaboration, retrieval technologies and solutions. Juan has a broad Information Technology background covering 30 years of experience, with successful system implementations in Energy, Healthcare, Pharmaceutical and Transportation companies, the US Department of Defense and the National Nuclear Security Administration.
  • 3. Agenda • Introductions • Concept Searching • Our Approach and Technologies • The Challenge of Search • Products • COMPU-DATA International • Why Searching Raw Corporate Content is not Enough • The Managed Metadata Environment – Your Organization’s “Controlled Vocabulary” • CDI’s “IGT” Model – A simple approach to the Managed Metadata Environment • Initial Setup Sample • Rules • Taxonomy • Leveraging Automatically Applied Metadata to Deliver • Improved Search Results • Utilizing the Combined Power of • Classification and Taxonomy Management Technology
  • 4. • Company founded in 2002 • Product launched in 2003 • Focus on management of structured and unstructured information • Technology Platform • Delivered as a web service • Automatic concept identification, content tagging, auto-classification, taxonomy management • Only statistical vendor that can extract conceptual metadata • 2009, 2010, 2011, 2012, 2013 ‘100 Companies that Matter in KM’ (KMWorld Magazine) and Trend Setting product of 2009, 2010, 2011, 2012, 2013 • Authority to Operate enterprise wide US Air Force and enterprise wide NETCON US Army • Locations: US, UK, and South Africa • Client base: Fortune 500/1000 organizations • Managed Partner under Microsoft global ISV Program - ‘go to partner’ for Microsoft for auto-classification and taxonomy management • Smart Content Framework™ for Information Governance • Product Suite: conceptSearch, conceptTaxonomyManager, conceptClassifier, conceptClassifier for SharePoint, conceptTaxonomyWorkflow, conceptContentTypeUpdater for SharePoint Concept Searching – The Industry Leader in Managed Metadata Solutions
  • 5. • Metadata driven application and enforcement of policies - conceptClassifier has been deployed since 2003 to automatically generate metadata and use that metadata to apply and enforce policies. Most clients are using the platform to support their information governance strategy. • Proven, mature functionality out of the box - The platform has been deployed in numerous sites and applications across the enterprise, including MOSS and SharePoint 2010, 2013, Office 365, Stellent, Documentum, SQL, Oracle, File Shares, Exchange via SharePoint and across the enterprise. Smart Content Framework™ Getting It Right
  • 6. • Concept Searching’s statistical concept identification underpins all technologies • Multi-word suggestion is explicitly more valuable than single term suggestion algorithms • conceptClassifier will generate conceptual metadata by extracting multi-word terms that identify ‘triple heart bypass’ as a concept as opposed to single keywords • conceptTaxonomyManager uses statistical concept identification to provide real-time feedback during the process of building, testing, refining, and deploying taxonomies • Metadata can be used by any search engine index or any application/process that uses metadata. Concept Searching provides Automatic Concept Term Extraction [Figure: single keywords taken alone are ambiguous: Triple (Baseball, Three), Heart (Organ, Center), Bypass (Highway, Avoid)] Industry Unique Technology
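As a rough illustration of the multi-word idea on slide 6, the Python sketch below counts three-word phrases that recur across documents. It is a plain frequency count, not Concept Searching's statistical algorithm, which the slides do not describe; the function name `candidate_phrases`, its parameters, and the sample sentences are hypothetical.

```python
import re
from collections import Counter


def candidate_phrases(documents, n=3, min_count=2):
    """Return n-word phrases seen at least min_count times across documents.

    Assumption: simple n-gram counting stands in for real concept extraction.
    """
    counts = Counter()
    for doc in documents:
        # Lowercase and keep alphabetic words only.
        words = re.findall(r"[a-z]+", doc.lower())
        # Slide a window of n words over the document.
        counts.update(" ".join(words[i:i + n]) for i in range(len(words) - n + 1))
    return [phrase for phrase, count in counts.most_common() if count >= min_count]


docs = [
    "Patient scheduled for triple heart bypass surgery.",
    "Recovery after a triple heart bypass takes weeks.",
]
print(candidate_phrases(docs))
```

Treating the phrase as one unit keeps "triple", "heart", and "bypass" from being matched separately against unrelated senses such as baseball or highways.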
  • 7. A Manual Metadata Approach Will Fail 95%+ Of The Time. Issue and Organizational Impact: • Inconsistent: Less than 50% of content is correctly indexed, meta-tagged or efficiently searchable, rendering it unusable to the organization. (IDC) • Risky: 59% of middle managers miss valuable information every day because they can’t find it or never see it. (Accenture) • Cumbersome and expensive: The average cost of manually tagging one item runs from $4 to $7 per document and does not factor in the accuracy of the meta tags or the repercussions from mis-tagged content. (Hoovers) • Malicious compliance: End users select the first value in the list. (Perspectives on Metadata, Sarah Courier) • No perceived value for end user: What’s in it for me? The end user does not see the value for the organization or the risks associated with litigation and non-conformance to policies. Less than 14% of end users receive training. (AIIM) • What have you seen? Metadata will continue to be a problem due to inconsistent human behavior. The answer to consistent metadata is an automated approach that can extract the meaning from content, eliminating manual metadata generation yet still providing the ability to manage knowledge assets in alignment with the unique corporate knowledge infrastructure. Manual Approach Leads to Failure
  • 8. Learning to Search • Searchers do not know “how to search” • 56% constructed poor queries • Proficiency with the machine does not translate into proficiency with the software • Searchers get lost in the data • 33% had difficulty navigating/orienting search results • 28% had difficulty maintaining orientation on a website • Loss of capacity for discernment • 36% did not go beyond the first 3 search results • (not pages…results on page 1) • 91% did not go beyond the first page of search results • 55% selected irrelevant results 1 or more times
  • 9. • Enterprise versus Internet search is a different animal • In the enterprise end users know the information is there if they could only find it • As a result, they will spend more time and aggravation looking for that one asset and don’t want to give up (IDC) • Enterprise end users expect information to be found within 4 minutes but will actually spend 2.5 hours per day looking for information • How do enterprise users overcome poor search results? • Recreate information • Use outdated or older versions of information • Interrupt a co-worker • Forget about finding the information • Just don’t start the task The Typical Search Approach
  • 10. The Hidden Costs of Search “There is a debilitating disconnect between the proliferation of electronic information and the constant need to quickly and accurately find all of the information and expertise that is essential for work every day. From top to bottom, enterprises have failed to take seriously the high cost of being grossly inadequate at finding information, data, documents, experts. Instead they have settled for low performance, low-return techniques to… sort of handle Search.” Julie Hunt - Search Consultant The cost to a 500 employee company is $2.4 million per year in inefficiencies and lost productivity. Gartner Group
  • 12. • People explore concepts – computers find keywords • Recall versus Precision • Location Search • User knows what they are seeking • Search engine must retrieve exactly and only the information required • Discovery Search • User does not know precisely what they are seeking • Search engine must retrieve content that “appears” to answer the query • Search engine must be able to accommodate both types of searching • The hierarchy provided by a taxonomy addresses the two different search approaches. Location based searches appear simple, but in fact are not. • If the end user does not immediately find what they are looking for, they can use the hierarchical structure to drill down by searching the concepts or taxonomy nodes. • Outcome: Identify associations and relationships that are typically not obvious in searching • More relevant information being found more quickly • Accessing inter-related ideas and concepts supports a fundamental change in user focus and activity and transforms it from searching to insight and discovery Taxonomy Navigation
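The "Recall versus Precision" trade-off on slide 12 can be made concrete with the standard definitions: precision is the share of retrieved items that are relevant, and recall is the share of relevant items that were retrieved. The Python sketch below is a minimal illustration; the function name and the document IDs are hypothetical.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query's result list."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = retrieved & relevant  # relevant items that were actually returned
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Four results returned, three documents are truly relevant, two overlap.
p, r = precision_recall(["a", "b", "c", "d"], ["a", "b", "e"])
print(f"precision={p:.2f} recall={r:.2f}")
```

In these terms, a location search needs high precision (exactly and only the item sought), while a discovery search leans on recall.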
  • 13. It’s Not Just About Search!! • Data Privacy • Records Management • Migration • Enterprise Content Management • Information Governance • Legal, eDiscovery, FOIA • Collaboration/Social • Text Analytics
  • 14. conceptClassifier for SharePoint • conceptClassifier for SharePoint • Combination of automatic classification, taxonomy management and Concept Searching’s APIs packaged for delivery into the SharePoint environment • Single code able to be deployed with SharePoint 2007, 2010, 2013, and Office 365 • Provides clients with the choice of on-premise, cloud based, or hybrid solutions to best meet their needs • Integrates with any search engine (SharePoint, former FAST products, Google Search Appliance, etc.) • Classifies content as it is created or ingested from diverse repositories within and outside of SharePoint • conceptTaxonomyWorkflow • Optional component that can perform an action on a document following a classification decision when the criteria are met • Built with a plug-in architecture enabling the simple development of content sources • Uses records retention codes, semantic, and security metadata associated to data assets to identify and process the automatic application of content types • Once documents have the appropriate content type, based on natural language and automatically applied metadata, workflows can be initiated. • Workflow source type works in the SharePoint 2007, 2010, and 2013 as well as for all document types, FILE document types, and HTTP document types
  • 15. conceptClassifier for Office 365 • Runs natively and bi-directionally with the SharePoint Term Store in any environment • Portability – ubiquitous access to information regardless of where it resides or how it is stored • Provides “intelligent” migration capabilities • Enables management of one term store for on premise and Office 365 use • Maintains GUIDs • Delivers enterprise class automatic document classification for all SharePoint, FILE, and HTTP document types • Protects records and confidential information from inadvertently being placed in the cloud to avoid data breaches and unauthorized access to information • Enables concept based search and retrieval integrated with Microsoft search solutions • Provides a method to enable text analytics from multiple data sources without impacting on-premise server utilization
  • 16. Preserving the World's Knowledge    Available Anytime Anywhere SM Getting Enterprise Search Right by Using Your Own Vocabulary For Indexing and Searching
  • 17. Preserving the Worlds Knowledge - Available Anytime AnywhereSM ©2013 COMPU-DATA International, LLC, All Rights Reserved COMPU-DATA International, LLC Many Thanks To
  • 18. • Why Searching Raw Corporate Content is not enough • The Managed Metadata Environment – Your Organization’s “Controlled Vocabulary” • CDI’s “IGT” Model – A simple approach to the Managed Metadata Environment • Initial Setup Sample: ◦ Rules ◦ Taxonomy • Leveraging Automatically Applied Metadata to Deliver: ◦ Improved Search Results • Utilizing the combined power of: ◦ Classification & Taxonomy Management Technology Presentation Overview
  • 19. Searching Raw Corporate Content is not enough • Limited Resources • Corporate Content Control ◦ Not written for SEO ◦ Not HTML • Not HTML ◦ No Title Tags ◦ Header Tags ◦ Content missing keywords ◦ No Meta Tags • How do you make content relevant? ◦ Add corporate vocabulary (Tags, Tags and more Tags) Keywords too! ◦ Adapt to the changing business environment ◦ Leverage automatic metadata tagging You are not Google, Bing or Yahoo!
  • 20. “The managed meta data environment represents the architectural components, people and processes that are required to properly and systematically gather, retain and disseminate meta data throughout the enterprise.” Components: 1. Meta data sourcing layer 2. Meta data integration layer 3. Meta data repository 4. Meta data management layer 5. Meta data marts 6. Meta data delivery layer Managed Metadata Environment* *From the book Universal Meta Data Models by David Marco and Michael Jennings (John Wiley & Sons)
  • 21. The “IGT” Model: 1. Metadata Identification 2. Automatic Metadata Generation 3. Automatic Metadata Tagging Delivering a Dependable Automatic Data Enhancement Solution Applicable Across the Enterprise Implementing a Simple Managed Metadata Environment
  • 22. The “IGT” Model: 1. Metadata Identification Metadata Identification 1. The process to create the rules by which Metadata can be identified for electronic content and data files. 2. Requires technologies to process text content such as that from Concept Searching and others that can be used on images, audio, and video. 3. Requires Expertise: Data Science, Classification, Statistical Theory, Content SMEs, Library Sciences, Records Management. Other areas for which content must be identified. 4. Recursive process receiving input from actual results and new input from business needs and requirements. Implementing a Simple Managed Metadata Environment
  • 23. The “IGT” Model: 1. Metadata Identification 2. Automatic Metadata Generation Automatic Metadata Generation 1. Is the process to automatically provide Metadata to a requestor based on a predefined set of rules and processes. 2. Functionality either included in the tool and technology used or it is built to support this function. 3. Requires Expertise: Programming and integration, knowledge of tool and technology APIs and functionality. Implementing a Simple Managed Metadata Environment
  • 24. The “IGT” Model: 1. Metadata Identification 2. Automatic Metadata Generation 3. Automatic Metadata Tagging Automatic Metadata Tagging 1. Is the process and methods by which content is enhanced when one or more Metadata Tags are attached, assigned, joined or associated with the content’s electronic file through an automated process. 2. Functionality either included in the tool and technology used, it is provided from a 3rd party or it is built internally. 3. Requires Expertise: Programming and integration, knowledge of tool and technology APIs and process functionality. 4. Provides feedback to improve Metadata Identification. Implementing a Simple Managed Metadata Environment
  • 25. Overview of CDI’s “IGT” Model steps to a Successful Automatic Metadata Tagging Solution. STEP #1: Metadata Identification. The process to create the rules by which Metadata can be identified. Tools & Technologies: Text based technology such as Concept Searching, and others that can be used on images, audio, and video. Required Expertise: Data Science, Classification, Statistical Theory, Content SMEs, Library Sciences, Records Management. STEP #2: Automatic Metadata Generation. The process to Automatically Generate Metadata for content by using the technologies and rules defined in the Metadata Identification process. Functionality either included in the tool and technology used or it is built to support the tool and technology. Required Expertise: Programming and integration, knowledge of tool and technology APIs and functionality. STEP #3: Automatic Metadata Tagging. The process and methods by which one or more Metadata Tags are attached, assigned, joined or associated to an electronic file. Functionality either included in the tool and technology used, it is provided from a 3rd party or it is built internally. Required Expertise: Programming and integration, knowledge of tool and technology APIs and process functionality. Feedback from actual results. Implementing a Simple Managed Metadata Environment
  • 26. Advantages provided by each step in the “IGT” Model. Metadata Identification: 1. Standardize identification rules and establish a common metadata vocabulary. 2. Able to manage metadata centrally or in a distributed way for different levels of the organization as needed. 3. Provides for ad hoc support based on needs arising from eDiscovery and litigation. 4. Improve reliability in the result lists delivered by enterprise search technologies used. 5. Multiple approaches and technologies can be used that are applicable to varying content. 6. Easy to distribute improvements as feedback is applied. Automatic Metadata Generation: 1. SOP across the organization for getting corporate, departmental, project and any other applicable metadata. 2. Single interface for access to varying metadata identification technologies. Automatic Metadata Tagging: 1. Enforce rules established in the organization and reduce costs due to changes in regulations/business. 2. Eliminates human factors and behaviors that contribute to negative outcomes associated with metadata assignment. 3. Makes it easier to achieve “findability”, enforce access controls, apply corporate governance and implement a record retention schedule. 4. Centralized and distributed tagging integration and control of data sources. 5. Bridges the implementation of any metadata identification technology to any data source. Note: Advantages of each step are not “all inclusive”
  • 27. Supports Storage & Preservation Business Function Application of record retention codes with metadata deriving storage & preservation Supports Information Rights Management Business Functions Content types drive automatic application of RMS templates Supports Search Business Function Application of metadata drives search precision TAXONOMY MANAGEMENT ENVIRONMENT (Developed using Taxonomy Management Technology) Data Privacy and Security Compliance Taxonomies (Data Privacy & Security Enforcement, see (A, B)) Data Transparency by Metadata Tagging Compliance (Access Policies & Rights Management, see (A, C)) Records Management Compliance Taxonomies (Records Retention Schedules, see (A)) Improved Enterprise Search (Search Precision, see (B, C)) • The Taxonomy Management Environment (TME) is made up of the technologies, processes and utilities used to create and administer Taxonomies. • Within the TME subject matter experts, librarians and operations personnel work together to create taxonomies which define the business rules by which content files are analyzed. • These taxonomies are then used to classify electronic content and associate each applicable rule to the content file processed. • Content is “Automatically Tagged” with the appropriate rule identifier and corresponding vocabularies used within the organization. Taxonomy Management Environment A B C
  • 28. Document Creation/ Modification by User SharePoint Update Content Types Enterprise Search Search on user’s compliance domain (based on document content & metadata) Document Search by User Supports Storage & Preservation Business Function Application of record retention codes with metadata deriving storage & preservation Supports Information Rights Management Business Functions Content types drive automatic application of RMS templates Supports Search Business Function Application of metadata drives search precision A TAXONOMY MANAGEMENT ENVIRONMENT (Developed using Taxonomy Management Technology) Data Privacy and Security Compliance Taxonomies (Data Privacy & Security Enforcement, see (A, B)) Data Transparency by Metadata Tagging Compliance (Access Policies & Rights Management, see (A, C)) Records Management Compliance Taxonomies (Records Retention Schedules, see (A)) Improved Enterprise Search (Search Precision, see (B, C)) B C SharePoint Implementation conceptClassifier Classification Engine Tagging Processes
  • 29. Document Creation/ Modification by User Shared Drives SharePoint Other Data Sources Update Content Types Enterprise Search Search on user’s compliance domain (based on document content & metadata) Document Search by User Supports Storage & Preservation Business Function Application of record retention codes with metadata deriving storage & preservation Supports Information Rights Management Business Functions Content types drive automatic application of RMS templates Supports Search Business Function Application of metadata drives search precision A TAXONOMY MANAGEMENT ENVIRONMENT (Developed using Taxonomy Management Technology) Data Privacy and Security Compliance Taxonomies (Data Privacy & Security Enforcement, see (A, B)) Data Transparency by Metadata Tagging Compliance (Access Policies & Rights Management, see (A, C)) Records Management Compliance Taxonomies (Records Retention Schedules, see (A)) Improved Enterprise Search (Search Precision, see (B, C)) B C Enterprise Implementation Real Time Tagging Metadata Tagging & Workflow of Data Source Content Managed by DES* Move file to another location Sensitive Info? Auto Tagging of Content Files DES* automatically tags & classifies content based on semantics contained within actual content files. N Y Move to SharePoint Manage Relations of Rules & Content conceptClassifier Classification Engine *Data Enhancement System™ (DES) is a patent pending product from COMPU-DATA International, LLC.
  • 30. conceptClassifier Tool CDI’s DES* (Data Enhancement System™) Management Layer Automated Metadata Tagging of Content To/From Data Sources Other File Format Tools Data store for testing new taxonomies Data store for testing new taxonomies Librarian/Client-Generated Taxonomies & Fields Taxonomy A TA - Fields Taxonomy B TB - Fields ● ● ● ● ● ● Taxonomy N TN - Fields Librarian/Client-Generated Taxonomies & Fields Taxonomy A TA - Fields Taxonomy B TB - Fields ● ● ● ● ● ● Taxonomy N TN - Fields Metadata Generation Technologies SharePoint® in the Cloud SharePoint® On-Premise Shared Drives Other ECMs/APPs DATA SOURCES ● ● ● Automated Tagging of Content Automated Tagging of Content Automated Tagging of Content DIRECT DATA SOURCE MONITORING Taxonomy Managers & Subject Matter Experts ● ● ● ©2012 - 2013 COMPU- DATA International, LLC, All Rights Reserved TAXONOMY MANAGEMENT ENVIRONMENT MANAGE AUTOMATIC METADATA TAGGING OF CONTENT *Data Enhancement System™ (DES) is a patent pending product from COMPU-DATA International, LLC.
  • 31. conceptClassifier Tool CDI’s DES * (Data Enhancement System™) Management Layer Automated Metadata Tagging of Content To/From Data Generation Sources Other File Format Tools Data store for testing new taxonomies Data store for testing new taxonomies Librarian/Client-Generated Taxonomies & Fields Taxonomy A TA - Fields Taxonomy B TB - Fields ● ● ● ● ● ● Taxonomy N TN - Fields Librarian/Client-Generated Taxonomies & Fields Taxonomy A TA - Fields Taxonomy B TB - Fields ● ● ● ● ● ● Taxonomy N TN - Fields Metadata Generation Technologies Automated Tagging & Delivery of Content INTEGRATION WITH CAPTURE, PROCESSES & APPLICATIONS SharePoint® In the Cloud SharePoint® On-Premise Shared Drives Others ECMs/APPs DATA SOURCES ● ● ● Business & Application Processes Taxonomy Managers & Subject Matter Experts Kodak Capture Pro ● ● ● ©2012 - 2013 COMPU- DATA International, LLC, All Rights Reserved Info Activate MANAGE AUTOMATIC METADATA TAGGING OF CONTENT *Data Enhancement System™ (DES) is a patent pending product from COMPU-DATA International, LLC.
  • 32. Supports Storage & Preservation Business Function Application of record retention codes with metadata deriving storage & preservation Supports Information Rights Management Business Functions Content types drive automatic application of RMS templates Supports Search Business Function Application of metadata drives search precision Data Privacy and Security Compliance Data Privacy & Security Enforcement Data Transparency by Metadata Tagging Compliance Access Policies & Rights Management Records Management Compliance Records Retention Schedules Improved Enterprise Search Search Precision Data Enhancement Through Automatic Metadata Tagging
  • 33. Supports Search Business Function Application of metadata drives search precision Improved Enterprise Search Search Precision Automatic Metadata Tagging to all Data Sources using your own Vocabulary Identify, Retrieve, Secure & Control Access to Data Assets across Distinct Business & Service Delivery Units Supports Information Rights Management Business Functions Content types drive automatic application of RMS templates Data Enhancement Through Automatic Metadata Tagging
  • 34. Taxonomy Management Enabling the Automatic Meta-tagging and Auto-Classification of Documents and Records Each node is a piece of metadata that gets tagged to a document or record based upon the prevalence of a clue within the document Manually Created Metadata associated with the concept of “Weather” Distribution Statement A: Approved for public release; distribution is unlimited 311 ABG/PA No. 09-488, 16 Oct 2009
  • 35. Automatic Metadata Generation: The unique IP of Compound Term Processing identifies compound terms (not keywords) in highly relevant content; these terms can then be used to trigger the automatic meta-tagging and auto-classification processes. Shown: automatically generated metadata associated with the concept of “Weather”.
  • 36. Automatic Metadata Generation: Automatically generated metadata is added to the original metadata for the category/folder. Outcome: the more semantics that can be linked to a document or record, the more actionable the information becomes (the document/record is now retrievable and classifiable). Shown: highly relevant metadata generated by Taxonomy Manager, added to the original clue set for the concept of “Weather”.
  • 37. Automatic Meta-tagging: Metatags are automatically added to the properties field of each document, enhancing the document’s value to the organization by making it easier to retrieve with enterprise search solutions that use keywords and metadata to retrieve information.
  • 38. Automatic Meta-tagging in Action: One of the metatags for the document was “Turbulence Encounter”; however, when we search for this term within the document, we do not find it. Why did this happen?
  • 39. Automatic Meta-tagging in Action: “Turbulence Encounter” is only one of 4 “clues”; at least one of them must exist within a document for that document to be automatically meta-tagged with the concept of Turbulence Encounter.
  • 40. Automatic Meta-tagging in Action: When we search the document using another clue for Turbulence Encounter, “Windshear”, we see that its presence within the document triggered the automated meta-tagging event that resulted in the document being tagged with “Turbulence Encounter”.
  • 41. Automatic Meta-tagging: Metatags are automatically added to the properties field of each document, making the document more valuable to the organization by making it easier to retrieve with enterprise search solutions that use keywords and metadata to retrieve information.
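The “Turbulence Encounter” walkthrough in slides 38 to 40 can be sketched in a few lines of Python. This is a minimal illustration, not the conceptClassifier implementation: it assumes a plain case-insensitive substring match and that any single clue is enough to trigger the tag, whereas slide 34 suggests the product weighs the prevalence of clues. The names `TaxonomyNode` and `auto_tag` are hypothetical, and only two of the four clues are named in the slides, so only those two appear here.

```python
from dataclasses import dataclass, field


@dataclass
class TaxonomyNode:
    """A taxonomy node: the tag name plus the clues that can trigger it."""
    name: str
    clues: set[str] = field(default_factory=set)


def auto_tag(text: str, nodes: list[TaxonomyNode]) -> list[str]:
    """Return the names of all nodes that have at least one clue in the text.

    Assumption: a simple case-insensitive substring match stands in for
    whatever matching and scoring the real product performs.
    """
    lowered = text.lower()
    return [
        node.name
        for node in nodes
        if any(clue.lower() in lowered for clue in node.clues)
    ]


# Hypothetical clue set. Only "Turbulence Encounter" and "Windshear" are
# named in the slides; the other two clues are not given, so they are omitted.
turbulence = TaxonomyNode(
    name="Turbulence Encounter",
    clues={"turbulence encounter", "windshear"},
)

# Hypothetical document text that never contains the tag name itself.
document = "The crew reported windshear on final approach."
tags = auto_tag(document, [turbulence])
print(tags)
```

The document receives the tag even though the phrase “Turbulence Encounter” never occurs in it, which is the behavior slide 38 asks about.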
  • 43. Automatically Apply Semantic Metadata, Records Retention Codes, and Data Privacy & Security Metadata To Every Document in SharePoint, File Shares and other Data Sources. “Corporate Vocabulary” as the Metadata Environment (Taxonomy); “Non-Preferred Vocabulary Terms”.
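One way to picture the relationship between the corporate vocabulary and its non-preferred terms is a lookup that normalizes every variant to a single preferred term before tagging. The sketch below is an assumption about how such a mapping could work, not a description of the DES or conceptClassifier internals; the vocabulary entries and the names `VOCABULARY`, `build_lookup`, and `preferred_terms` are invented for illustration.

```python
# Hypothetical controlled vocabulary: preferred term -> non-preferred variants.
VOCABULARY = {
    "Windshear": ["wind shear", "wind-shear"],
    "Records Retention": ["retention schedule", "record keeping"],
}


def build_lookup(vocabulary: dict[str, list[str]]) -> dict[str, str]:
    """Map every preferred and non-preferred term (lowercased) to its preferred term."""
    lookup: dict[str, str] = {}
    for preferred, variants in vocabulary.items():
        lookup[preferred.lower()] = preferred
        for variant in variants:
            lookup[variant.lower()] = preferred
    return lookup


def preferred_terms(text: str, lookup: dict[str, str]) -> list[str]:
    """Return the sorted preferred terms whose variants occur in the text."""
    lowered = text.lower()
    return sorted({preferred for term, preferred in lookup.items() if term in lowered})


LOOKUP = build_lookup(VOCABULARY)
result = preferred_terms(
    "Pilot reported wind shear; see the retention schedule.", LOOKUP
)
print(result)
```

Because tags are always emitted as preferred terms, documents that use different wording for the same concept end up with the same metadata value, which is what allows a search on one term to find all of them.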
  • 44. Use Case - RMDA: US Army - Records Management and Declassification Agency (RMDA). Solution: an integrated search solution to consolidate over 20 data sources and databases into just a few repositories, with access to high data volumes. Benefits: centralized data standardization; increased findability; terabyte-size data store support; increased productivity; scalable.
  • 45. Use Case - Global Appliance Company: A global company which designs, manufactures, and distributes appliances to more than 70 countries. Solution: an intranet and content management solution to improve search, sensitive/confidential information protection, and records identification and tagging. Benefits: accurate search; data privacy; effective records management.
  • 46. Time for a Demo: Let’s Tie all of this Together With a Live Demo
  • 47. Thank You. Juan J. Celaya, President & CEO, COMPU-DATA International, LLC; jcelaya@cdlac.com; Twitter @conceptsearch. Ken Lemons, VP Federal Programs, Concept Searching; kenl@conceptsearching.com