SlideShare a Scribd company logo
1 of 11
Download to read offline
Beyond the Enterprise Data Warehouse

           The emerging Enterprise Search solutions

           Findwise Göteborg
           2009-02-25




                                                                                             Helge Legernes
                                                                                         helge.legernes@findwise.se




www.findwise .se | info@findwise.se | +46 31 288 400 | Drottningg. 5 | 411 14 Göteborg
Abstract
This document elaborates the role of the enterprise search technology as an intelligent retrieval platform for
structured data, a role traditionally held by the Relational Database Management Systems (RDBMS).
Furthermore it investigates the great possibility by enterprise search solutions to derive insights and patterns
by also analyzing the unstructured data, which is not possible to do with traditional data warehouse systems
based on RDBMS.




© 2009 Findwise AB                                                                                            1
Table of Contents
Enterprise search as enterprise data warehouse ..............................................................4

  An example ..................................................................................................................................................................5


Enterprise search vs. EDW solution .................................................................................6

  Scalability and distributed architecture ....................................................................................................................6
  True Ad-hoc Querying and Relevancy .......................................................................................................................6
  User interaction ..........................................................................................................................................................6
  More content – better analysis..................................................................................................................................7
  Real time content – not batch oriented content ......................................................................................................7
  Low implementation and maintenance costs with great flexibility ........................................................................7
  One common index used by all users in the enterprise ...........................................................................................8


Conclusion ....................................................................................................................9

References .................................................................................................................. 10




© 2009 Findwise AB                                                                                                                                                            2
Introduction
It has long been held as convention that intelligent exploration of corporate operational systems requires a
commitment to Business Intelligence and Data Warehousing technologies. Recent demands, however, have
compelled organizations to look for ways to enhance their technical portfolio with capabilities similar to those
found at public Web search portals or with their current Content Management Systems. Generally, the demand
arises from extreme requirements in performance and scalability, a desire to incorporate unstructured data, or
adopting more effective analytics via concept and context based textual mining.
Every once in a while an industry develops a peculiar ability to sustain two seemingly identical technologies in
fundamentally incompatible ways. Think of the media industry with the computer and the television. Both
involve a source of electronic information, a vehicle for transmission, and a device for visual display. Yet how
long did it take before we could share the same monitor, or record a show on the same machine as the family
photos?

Now consider OLAP (Online Analytical Processing) and Search. Both include a central data store; mechanisms
for extracting data from multiple sources; logic for intelligent, ad-hoc retrieval; and visualization paradigms to
organize the results. They even have parallel jargon: knowledge and data management, text and data mining,
search and query, document and row, and so on. And they are entirely incompatible with each other – or were.
We are all quite familiar with the RDBMS as the de facto standard, but the extraction and analysis process, the
traditional prevue of the OLAP environment – ETL (Extract Transform and Load), data warehouse, data mart,
and business intelligence – is beginning to strain the relational model. Why is it so hard to support natural
language queries? How is meaning extracted from textual content? Why can’t the system be more real-time?
How can it provide better analytics? Why is the whole process so expensive to manage?

The RDBMS vendors are just now releasing new versions of their database engines embedded with the
beginnings of search technology. Perhaps, they “peered over the wall” at the search vendors to see how
they’ve been solving these problems. If they did, they would have been surprised to see the search vendors
staring right back at them, for they have been mining both structured and unstructured content for some time
now, and doing so in ways quite different than the RDBMS.
Conventional approaches to information discovery and access on structured data are running out of steam.
Traditional proprietary applications are often too complex for people to use without expertise and specific
training.

Furthermore software licenses are too expensive for wide-scale deployment to all the users who may need
access.




© 2009 Findwise AB                                                                                              3
Enterprise search as enterprise data warehouse
When corporate users want to know something, they are looking inward, using search techniques to seek out
clues that are scattered across the entire corporate IT resource—from the data submerged in silos including
ERP, ordering and financial systems, intranets, e-mail and Web servers, and users’ own workstations. This
involves going through multiple sources and linking together the pieces of information to give a starting point
for decisions. The past three to four years we have seen a real focus on enabling corporate users to do this
more efficiently, because without effective companywide search techniques, users end up wasting time as they
hunt across multiple systems for a piece of information. As a consequence the companies suffer as well.

Over time, the Enterprise Search Engine System will
certainly become the most shared system within the                  Unstructured information
enterprise. A big difference until now has been that the            Data held in Word and PDF documents on
EDW systems have mostly dealt with structured data, while           servers and users’ desktops that are not
the ESE systems have dealt with both structured and                 indexed, tagged or archived for easy location.
unstructured data. So, how can business users utilize the           European and U.S. analysts agree that about 80
unstructured and structured information that resides both           percent of information within businesses is
in their companies as well as out on the Internet to make           unstructured.
better tactical and strategic decisions?

This collection of information might include business intelligence reports as well as the data underlying these
reports, blogs, news feeds, or other information that resides in the company or on the Internet.




                                  Figure 1 – Typical enterprise search architecture



We need to set some definitions: First, Enterprise search refers to technology that indexes and retrieves
information in structured and unstructured sources in both internal data sources and web sites for a company's
user base. Typically, this search is still goal-oriented—the users know what they are looking for. Enterprises use
search for a variety of purposes including simple site search as well as finding basic answers to questions such
as: “What is my authorization limit?” or, “How do I order a new computer?” As well as more advanced analysis
that leverages text analytics.




© 2009 Findwise AB                                                                                                   4
Second, Text analytics is a set of techniques that can discover and extract unstructured text and transform it
into structured information that can then be leveraged in various ways. Various extraction techniques exist
including entity extraction (e.g. people, place, financial amount), concept extraction (e.g. unhappy customer) or
fact extraction (e.g. an event specification), to name a few. For example, concepts may be created using
various linguistic- and statistics-based processes. Different vendors make use of different techniques and have
created patents for these techniques.

Lastly, Convergence is bringing these two technologies together and then searching using extraction indexes
created by the text analytics. In other words, search results are filtered by concepts, entities, facts, or other
extraction types and utilized to answer questions.


An example
Consider the following prime example of what might be possible utilizing this approach. A manufacturing
company is interested in gathering intelligence on why it is losing market share. In the past, it would have
looked at numerous reports including: sales reports, warranty reports, customer satisfaction surveys and
analyst reports. It would have also had business analysts searching the Internet to find any relevant
information. Leveraging this new approach, the organization might create concepts that it uses to search
through all of its internal documents and business intelligence reports as well as any external news feeds and
articles. These concepts might include “unhappy customer”, and/or utilize various entities like “competitors
making financial transactions.”

The concept itself may come from a taxonomy or ontology that has already been created by the organization or
a third party. Or, it may have been created by a subject matter expert working with this information via a GUI
that is part of a software package. This extracted information is sent to a search engine interface. The results
might appear as they would with any search engine. The results might also be piped to a business intelligence
product to produce plots of percent unhappy, happy, and neutral (no comment) customers. These table plots
might have been derived from the text of customer care centres or customer surveys and would have drill
down capabilities to enable line of business users to explore why these customers are unhappy and do further
analysis. Levering this kind of approach, companies can then derive insights that were not possible before.




© 2009 Findwise AB                                                                                             5
Enterprise search vs. EDW solution
There are several reasons for choosing enterprise search, some of which are discussed in the following
sections.


Scalability and distributed architecture
Traditional enterprise architectures gain performance primarily through increased use of CPU and memory.
This is scalability through more grid iron and its costs increase exponentially with demand. RDBMS technologies
adopted this model from the beginning, which is understandable since it was the prevailing solution when the
industry was new and growing. They have recently begun a move towards more distributed models, but the
journey must be difficult. To be truly multi-dimensionally distributed means a basic rewrite of your core.

New enterprise architectures are distributed from the ground up, balancing demand for CPU, memory, and disk
to reflect the realities of commodity hardware, which is any collection of off-the-shelf inexpensive computers,
typically “Lintel/Wintel”, networked together in a grid. Since performance is gained through simply adding
another computer to the grid, costs are linear in growth. This is scalability through grid computing, and IT
organizations are beginning to adopt this model for all their enterprise requirements.


True Ad-hoc Querying and Relevancy
The relational model is exactly that – a model. The fact that there is a schema at all means the RDBMS works
better for some queries over others. The issue is not so much who has the fastest query but who has the
slowest. For example, the difference between a 20 and 50 millisecond delay is not discernable to the user, but
a 30 second response time is. Variance is often far more important than average query time.

The problem lies with the fact that a schema is counter to the notion of a purely ad-hoc query environment.
The industry devised new denormalized designs, such as star and snowflake schemas, and even introduced
multi-dimensional database engines to alleviate the problem as much as possible, but while these technologies
were good for queries generally focused on sales, financial, and production analyses, they were no good for
real-time processing, heavy analytics, historical comparisons, complex updates (e.g. data corrections),
textually-centric content, or environments requiring a large number of dimensions. A truly unbiased model is
one that has no schema at all (i.e. hyper-denormalized to one table). This is the index of an enterprise search
engine.

Another problem is the language itself. SQL is designed to query systems where the results support a binary
inclusion model. In other words, data is either part of the answer or not. The order of the data itself in the
result set has no relevant meaning other than a preference for sorting and summation. This model leaves out
the whole world of relative inclusion, where data has a graduated score of relevance in the result list.
The omission is understandable. Business intelligence focuses almost exclusively on financial and other
numerical data, really to support monetary trends to measure an organization’s health or assess risk. However,
a lot of information is left on the table when one elects to ignore the textual content or relegates it to simple
identifiers for qualifying these analyses


User interaction
The popularity of the Web search vendors (Google, Yahoo!, and Microsoft) has had an interesting effect on the
expectations of users in the enterprise market. The simplicity of the search bar, the power of navigators, and
the convenience of simple ranked results are now basic requirements. “Why can’t our stuff be as simple as


© 2009 Findwise AB                                                                                             6
Google?” is a common cry. It is not surprising this is the case, since what better an audience to test the
acceptability of a user interface than the world at large?

The net effect is a shift in user interaction design (UI is more about “interaction” these days than “interface”) as
it relates to information retrieval, exploration, and analysis. The new model recognizes the need for an
integrated approach to querying and exploring content. The interaction is a combination of techniques that all
resolve to the same action: asking for information and getting results. How intelligently the system allows you
to ask your question measures the quality of the interaction.

The basic model is this: providing both a search box and several smart navigation approaches offers the best
interface for query and exploration, and an intelligent balance of the two is the most efficient interaction
model for intelligent extraction of information. A good second generation enterprise search engine supports all
of these capabilities out of the box. The RDBMS vendors are just beginning to incorporate some of them in
their latest versions.


More content – better analysis
As mentioned earlier one often claim that 80 percent of an organization’s content resides in unstructured
repositories (email, documents, etc.), while only 20 percent resides in databases. The content management
vendors use this statistic often. While true, it may also be the case that the 20 percent in the database has 80
percent of the importance. The argument is largely academic, because the real answer is to make the entire
100 percent available.

You can’t find something that isn’t there, so why not make sure it is in the corpus of searchable content? This is
simple insurance against the “not knowing what you don’t know” problem that inevitably inflicts the worst pain
on an organization. Furthermore, the integration of structured and unstructured content allows for new
investigative approaches. Imagine monitoring the efficacy of an advertising campaign by tracking product sales
and market intelligence in a coordinated fashion


Real time content – not batch oriented content
We often think of content as primarily historical, a snapshot of activity that occurs over a period of time (which
may be as recent as the last 24 hours). This is such a standard assumption the batch orientation of the data
warehousing market requires that it be true. But it is not always true. What if I’m monitoring stock transactions
and I wish to get answers before the trader executes the manager’s requests? What if my job is to look for
patterns on the newswire? The data in these scenarios is streamed in real time, not digested in batches, and it
should be supported as such. Note the importance of latency here. Streamed content should be searchable
with minimal delay, often less than a second.

Second generation enterprise search engines support alerts and sub-second latency. In an OLAP environment
this would be considerably more difficult.


Low implementation and maintenance costs with great flexibility
In a typical OLAP environment you will find one or more data warehouses, several operational data stores
and/or data marts, assorted ETL technologies, and a myriad of business intelligence repositories that define
reports, queries and other assorted meta-data. Creating and maintaining these components requires a host of
engineers, DBAs, and knowledge workers (who we pretend do not need to understand relational concepts but
really do if they build reports or queries, even graphically). Now add to this environment meta-data
management, keyword and acronym dictionaries, multiple languages, thesauri, etc. and you can see how the
management costs can proliferate.



© 2009 Findwise AB                                                                                                7
Search engines, especially those with a history of servicing the Web, are designed for the widest audience,
consumers who know absolutely nothing about technology. The knowledge worker does not exist in the
consumer market, so the ad hoc report at its most simplest is merely a search bar that returns ranked results
and sometimes dynamically generated navigators to assist in exploration. The management process, as you
might imagine, is also much easier, if for no other reason that there is one single component to install,
configure, and manage rather than several. Please note that a DBA is no longer required.




                                           Figure 2 – Search vs. EDW

The “single component” comment requires some explanation. Recall that data marts and operational data
stores were created to overcome the limitations of the relational model for handling ad-hoc querying –
avoiding the “killer query” – and because the RDBMS was never able to efficiently support high volume data
updates and query traffic at the same time. The search engine has no such limitations.


One common index used by all users in the enterprise
As mentioned earlier an enterprise search solution could serve as the basis for one “single component” where
the index can serve many different applications and different users and their needs.

Many of the search solutions have a range of powerful presentation-layer functions. These enable overlaying of
results of multiple related searches—allowing users to explore commonalities between groups or search
results, building an in-depth picture of results from seemingly unconnected data.

The enterprise search solutions can also feed different BI tools, but of course as time evolves the search
vendors will enhance their products with integrated BI tools.




© 2009 Findwise AB                                                                                          8
Figure 3 – Enterprise search: one common platform and shared data




Conclusion
Enterprise search technology may not have been around as long as it’s RDMS counterpart, but it has been
much more aggressive in its intelligent retrieval of information from both structured and unstructured content.
Whatever the reasons, for organizations to compete effectively, report accurately, defend aggressively, or most
any other activity, the key to their success is their ability to get the right information to the right people at the
right time (and at the right cost); from any source, internal or external to the organization. The enterprise
search solution should be considered to be the best candidate that can effectively deliver.




© 2009 Findwise AB                                                                                                 9
References
Ardentia ”Zen and the art of enterprise search”, by Richard Lewis,

FAST Search & Transfer “Search in a structured data environment”, by Davor Peter Sutija,

Hurwitz & Associates “Enterprise search evolves”, by Fern Halper.




© 2009 Findwise AB                                                                         10

More Related Content

What's hot

PROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked DataPROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked DataSemantic Web Company
 
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET Journal
 
Making Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High ExpectationsMaking Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High ExpectationsRackspace
 
Information economics and big data
Information economics and big dataInformation economics and big data
Information economics and big dataMark Albala
 
Extract the Analyzed Information from Dark Data
Extract the Analyzed Information from Dark DataExtract the Analyzed Information from Dark Data
Extract the Analyzed Information from Dark Dataijtsrd
 
IRJET - Big Data Analysis its Challenges
IRJET - Big Data Analysis its ChallengesIRJET - Big Data Analysis its Challenges
IRJET - Big Data Analysis its ChallengesIRJET Journal
 
IBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep diveIBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep diveKun Le
 
Intro to big data and applications - day 1
Intro to big data and applications - day 1Intro to big data and applications - day 1
Intro to big data and applications - day 1Parviz Vakili
 
Python's Role in the Future of Data Analysis
Python's Role in the Future of Data AnalysisPython's Role in the Future of Data Analysis
Python's Role in the Future of Data AnalysisPeter Wang
 
01. Introduction to Data Mining and BI
01. Introduction to Data Mining and BI01. Introduction to Data Mining and BI
01. Introduction to Data Mining and BIAchmad Solichin
 
Semantics as the Basis of Advanced Cognitive Computing
Semantics as the Basis of Advanced Cognitive ComputingSemantics as the Basis of Advanced Cognitive Computing
Semantics as the Basis of Advanced Cognitive ComputingSemantic Web Company
 
Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...
Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...
Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...emermell
 
Disruptive Data Science Series: Transforming Your Company into a Data Science...
Disruptive Data Science Series: Transforming Your Company into a Data Science...Disruptive Data Science Series: Transforming Your Company into a Data Science...
Disruptive Data Science Series: Transforming Your Company into a Data Science...EMC
 
Why Big Data Analytics Needs Business Intelligence Too
Why Big Data Analytics Needs Business Intelligence Too Why Big Data Analytics Needs Business Intelligence Too
Why Big Data Analytics Needs Business Intelligence Too Barry Devlin
 
A Primer for a layman about Big Data, Business Analytics and Cloud
A Primer for a layman  about Big Data, Business Analytics and CloudA Primer for a layman  about Big Data, Business Analytics and Cloud
A Primer for a layman about Big Data, Business Analytics and CloudRajagopalan V
 
big-data-analytics-and-iot-in-logistics-a-case-study-2018.pdf
big-data-analytics-and-iot-in-logistics-a-case-study-2018.pdfbig-data-analytics-and-iot-in-logistics-a-case-study-2018.pdf
big-data-analytics-and-iot-in-logistics-a-case-study-2018.pdfAkuhuruf
 

What's hot (19)

PROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked DataPROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked Data
 
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
 
Abstract
AbstractAbstract
Abstract
 
Making Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High ExpectationsMaking Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High Expectations
 
Information economics and big data
Information economics and big dataInformation economics and big data
Information economics and big data
 
Extract the Analyzed Information from Dark Data
Extract the Analyzed Information from Dark DataExtract the Analyzed Information from Dark Data
Extract the Analyzed Information from Dark Data
 
IRJET - Big Data Analysis its Challenges
IRJET - Big Data Analysis its ChallengesIRJET - Big Data Analysis its Challenges
IRJET - Big Data Analysis its Challenges
 
Opentext Decisiv
Opentext DecisivOpentext Decisiv
Opentext Decisiv
 
Big data and oracle
Big data and oracleBig data and oracle
Big data and oracle
 
IBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep diveIBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep dive
 
Intro to big data and applications - day 1
Intro to big data and applications - day 1Intro to big data and applications - day 1
Intro to big data and applications - day 1
 
Python's Role in the Future of Data Analysis
Python's Role in the Future of Data AnalysisPython's Role in the Future of Data Analysis
Python's Role in the Future of Data Analysis
 
01. Introduction to Data Mining and BI
01. Introduction to Data Mining and BI01. Introduction to Data Mining and BI
01. Introduction to Data Mining and BI
 
Semantics as the Basis of Advanced Cognitive Computing
Semantics as the Basis of Advanced Cognitive ComputingSemantics as the Basis of Advanced Cognitive Computing
Semantics as the Basis of Advanced Cognitive Computing
 
Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...
Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...
Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...
 
Disruptive Data Science Series: Transforming Your Company into a Data Science...
Disruptive Data Science Series: Transforming Your Company into a Data Science...Disruptive Data Science Series: Transforming Your Company into a Data Science...
Disruptive Data Science Series: Transforming Your Company into a Data Science...
 
Why Big Data Analytics Needs Business Intelligence Too
Why Big Data Analytics Needs Business Intelligence Too Why Big Data Analytics Needs Business Intelligence Too
Why Big Data Analytics Needs Business Intelligence Too
 
A Primer for a layman about Big Data, Business Analytics and Cloud
A Primer for a layman  about Big Data, Business Analytics and CloudA Primer for a layman  about Big Data, Business Analytics and Cloud
A Primer for a layman about Big Data, Business Analytics and Cloud
 
big-data-analytics-and-iot-in-logistics-a-case-study-2018.pdf
big-data-analytics-and-iot-in-logistics-a-case-study-2018.pdfbig-data-analytics-and-iot-in-logistics-a-case-study-2018.pdf
big-data-analytics-and-iot-in-logistics-a-case-study-2018.pdf
 

Similar to Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Emerging Enterprise Search Solution

Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...Findwise
 
Oea big-data-guide-1522052
Oea big-data-guide-1522052Oea big-data-guide-1522052
Oea big-data-guide-1522052kavi172
 
The Case for Business Modeling
The Case for Business ModelingThe Case for Business Modeling
The Case for Business ModelingNeil Raden
 
Joining It All Up - KIM Legal
Joining It All Up - KIM LegalJoining It All Up - KIM Legal
Joining It All Up - KIM LegalKate Simpson
 
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”voginip
 
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”VOGIN-academie
 
enterprise-data-everywhere
enterprise-data-everywhereenterprise-data-everywhere
enterprise-data-everywhereBill Peer
 
Business Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search EngineBusiness Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search Engineankur881120
 
Solving data discovery in the enterprise
Solving data discovery in the enterpriseSolving data discovery in the enterprise
Solving data discovery in the enterpriseJesus Rodriguez
 
Make compliance fulfillment count double
Make compliance fulfillment count doubleMake compliance fulfillment count double
Make compliance fulfillment count doubleDirk Ortloff
 
Data Mining – A Perspective Approach
Data Mining – A Perspective ApproachData Mining – A Perspective Approach
Data Mining – A Perspective ApproachIRJET Journal
 
Operational Analytics: Best Software For Sourcing Actionable Insights 2013
Operational Analytics: Best Software For Sourcing Actionable Insights 2013Operational Analytics: Best Software For Sourcing Actionable Insights 2013
Operational Analytics: Best Software For Sourcing Actionable Insights 2013Newton Day Uploads
 
Top Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practicesTop Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practicesSpringPeople
 
Semantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data LakeSemantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data LakeCognizant
 
12 Things the Semantic Web Should Know about Content Analytics
12 Things the Semantic Web Should Know about Content Analytics12 Things the Semantic Web Should Know about Content Analytics
12 Things the Semantic Web Should Know about Content AnalyticsSeth Grimes
 

Similar to Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Emerging Enterprise Search Solution (20)

Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
 
Oea big-data-guide-1522052
Oea big-data-guide-1522052Oea big-data-guide-1522052
Oea big-data-guide-1522052
 
Essay Database
Essay DatabaseEssay Database
Essay Database
 
Unlocking big data
Unlocking big dataUnlocking big data
Unlocking big data
 
Database Essay
Database EssayDatabase Essay
Database Essay
 
The Case for Business Modeling
The Case for Business ModelingThe Case for Business Modeling
The Case for Business Modeling
 
Joining It All Up - KIM Legal
Joining It All Up - KIM LegalJoining It All Up - KIM Legal
Joining It All Up - KIM Legal
 
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
 
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
 
enterprise-data-everywhere
enterprise-data-everywhereenterprise-data-everywhere
enterprise-data-everywhere
 
Business Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search EngineBusiness Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search Engine
 
Solving data discovery in the enterprise
Solving data discovery in the enterpriseSolving data discovery in the enterprise
Solving data discovery in the enterprise
 
Make compliance fulfillment count double
Make compliance fulfillment count doubleMake compliance fulfillment count double
Make compliance fulfillment count double
 
Data Mining – A Perspective Approach
Data Mining – A Perspective ApproachData Mining – A Perspective Approach
Data Mining – A Perspective Approach
 
FAST Search-webinar-06-29-2010
FAST Search-webinar-06-29-2010FAST Search-webinar-06-29-2010
FAST Search-webinar-06-29-2010
 
Operational Analytics: Best Software For Sourcing Actionable Insights 2013
Operational Analytics: Best Software For Sourcing Actionable Insights 2013Operational Analytics: Best Software For Sourcing Actionable Insights 2013
Operational Analytics: Best Software For Sourcing Actionable Insights 2013
 
Top Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practicesTop Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practices
 
Semantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data LakeSemantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data Lake
 
12 Things the Semantic Web Should Know about Content Analytics
12 Things the Semantic Web Should Know about Content Analytics12 Things the Semantic Web Should Know about Content Analytics
12 Things the Semantic Web Should Know about Content Analytics
 
Taxonomy and seo sla 05-06-10(jc)
Taxonomy and seo   sla 05-06-10(jc)Taxonomy and seo   sla 05-06-10(jc)
Taxonomy and seo sla 05-06-10(jc)
 

More from Findwise

White Arkitekter - Findability Day Roadshow 2017
White Arkitekter - Findability Day Roadshow 2017White Arkitekter - Findability Day Roadshow 2017
White Arkitekter - Findability Day Roadshow 2017Findwise
 
AI och maskininlärning - Findability Day Roadshow 2017
AI och maskininlärning - Findability Day Roadshow 2017AI och maskininlärning - Findability Day Roadshow 2017
AI och maskininlärning - Findability Day Roadshow 2017Findwise
 
De kognitiva eran med IBM Watson - Findability Day Roadshow 2017
De kognitiva eran med IBM Watson - Findability Day Roadshow 2017De kognitiva eran med IBM Watson - Findability Day Roadshow 2017
De kognitiva eran med IBM Watson - Findability Day Roadshow 2017Findwise
 
Findwise and IBM Watson
Findwise and IBM WatsonFindwise and IBM Watson
Findwise and IBM WatsonFindwise
 
Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016Findwise
 
Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016Findwise
 
Findability Day 2016 - Big data analytics and machine learning
Findability Day 2016 - Big data analytics and machine learningFindability Day 2016 - Big data analytics and machine learning
Findability Day 2016 - Big data analytics and machine learningFindwise
 
Findability Day 2016 - Enterprise social collaboration
Findability Day 2016 - Enterprise social collaborationFindability Day 2016 - Enterprise social collaboration
Findability Day 2016 - Enterprise social collaborationFindwise
 
Findability Day 2016 - SKF case study
Findability Day 2016 - SKF case studyFindability Day 2016 - SKF case study
Findability Day 2016 - SKF case studyFindwise
 
Findability Day 2016 - Structuring content for user experience
Findability Day 2016 - Structuring content for user experienceFindability Day 2016 - Structuring content for user experience
Findability Day 2016 - Structuring content for user experienceFindwise
 
Findability Day 2016 - Augmented intelligence
Findability Day 2016 - Augmented intelligenceFindability Day 2016 - Augmented intelligence
Findability Day 2016 - Augmented intelligenceFindwise
 
Findability Day 2016 - What is GDPR?
Findability Day 2016 - What is GDPR?Findability Day 2016 - What is GDPR?
Findability Day 2016 - What is GDPR?Findwise
 
Findability Day 2016 - Get started with GDPR
Findability Day 2016 - Get started with GDPRFindability Day 2016 - Get started with GDPR
Findability Day 2016 - Get started with GDPRFindwise
 
Digital workplace och informationshantering i office 365
Digital workplace och informationshantering i office 365Digital workplace och informationshantering i office 365
Digital workplace och informationshantering i office 365Findwise
 
Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...
Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...
Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...Findwise
 
Findability Day 2015 - Abby Covert - Keynote - How to make sense of any mess
Findability Day 2015 - Abby Covert - Keynote - How to make sense of any messFindability Day 2015 - Abby Covert - Keynote - How to make sense of any mess
Findability Day 2015 - Abby Covert - Keynote - How to make sense of any messFindwise
 
Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...
Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...
Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...Findwise
 
Findability Day 2015 Mattias Ellison - Findwise - Enterprise Search and fin...
Findability Day 2015   Mattias Ellison - Findwise - Enterprise Search and fin...Findability Day 2015   Mattias Ellison - Findwise - Enterprise Search and fin...
Findability Day 2015 Mattias Ellison - Findwise - Enterprise Search and fin...Findwise
 
Findability Day 2015 - Martin White - The future is search!
Findability Day 2015 - Martin White - The future is search!Findability Day 2015 - Martin White - The future is search!
Findability Day 2015 - Martin White - The future is search!Findwise
 
Findability Day 2015 Liam Holley - Dassault systems - Insight and discovery...
Findability Day 2015   Liam Holley - Dassault systems - Insight and discovery...Findability Day 2015   Liam Holley - Dassault systems - Insight and discovery...
Findability Day 2015 Liam Holley - Dassault systems - Insight and discovery...Findwise
 

More from Findwise (20)

White Arkitekter - Findability Day Roadshow 2017
White Arkitekter - Findability Day Roadshow 2017White Arkitekter - Findability Day Roadshow 2017
White Arkitekter - Findability Day Roadshow 2017
 
AI och maskininlärning - Findability Day Roadshow 2017
AI och maskininlärning - Findability Day Roadshow 2017AI och maskininlärning - Findability Day Roadshow 2017
AI och maskininlärning - Findability Day Roadshow 2017
 
De kognitiva eran med IBM Watson - Findability Day Roadshow 2017
De kognitiva eran med IBM Watson - Findability Day Roadshow 2017De kognitiva eran med IBM Watson - Findability Day Roadshow 2017
De kognitiva eran med IBM Watson - Findability Day Roadshow 2017
 
Findwise and IBM Watson
Findwise and IBM WatsonFindwise and IBM Watson
Findwise and IBM Watson
 
Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016
 
Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016
 
Findability Day 2016 - Big data analytics and machine learning
Findability Day 2016 - Big data analytics and machine learningFindability Day 2016 - Big data analytics and machine learning
Findability Day 2016 - Big data analytics and machine learning
 
Findability Day 2016 - Enterprise social collaboration
Findability Day 2016 - Enterprise social collaborationFindability Day 2016 - Enterprise social collaboration
Findability Day 2016 - Enterprise social collaboration
 
Findability Day 2016 - SKF case study
Findability Day 2016 - SKF case studyFindability Day 2016 - SKF case study
Findability Day 2016 - SKF case study
 
Findability Day 2016 - Structuring content for user experience
Findability Day 2016 - Structuring content for user experienceFindability Day 2016 - Structuring content for user experience
Findability Day 2016 - Structuring content for user experience
 
Findability Day 2016 - Augmented intelligence
Findability Day 2016 - Augmented intelligenceFindability Day 2016 - Augmented intelligence
Findability Day 2016 - Augmented intelligence
 
Findability Day 2016 - What is GDPR?
Findability Day 2016 - What is GDPR?Findability Day 2016 - What is GDPR?
Findability Day 2016 - What is GDPR?
 
Findability Day 2016 - Get started with GDPR
Findability Day 2016 - Get started with GDPRFindability Day 2016 - Get started with GDPR
Findability Day 2016 - Get started with GDPR
 
Digital workplace och informationshantering i office 365
Digital workplace och informationshantering i office 365Digital workplace och informationshantering i office 365
Digital workplace och informationshantering i office 365
 
Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...
Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...
Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...
 
Findability Day 2015 - Abby Covert - Keynote - How to make sense of any mess
Findability Day 2015 - Abby Covert - Keynote - How to make sense of any messFindability Day 2015 - Abby Covert - Keynote - How to make sense of any mess
Findability Day 2015 - Abby Covert - Keynote - How to make sense of any mess
 
Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...
Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...
Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...
 
Findability Day 2015 Mattias Ellison - Findwise - Enterprise Search and fin...
Findability Day 2015   Mattias Ellison - Findwise - Enterprise Search and fin...Findability Day 2015   Mattias Ellison - Findwise - Enterprise Search and fin...
Findability Day 2015 Mattias Ellison - Findwise - Enterprise Search and fin...
 
Findability Day 2015 - Martin White - The future is search!
Findability Day 2015 - Martin White - The future is search!Findability Day 2015 - Martin White - The future is search!
Findability Day 2015 - Martin White - The future is search!
 
Findability Day 2015 Liam Holley - Dassault systems - Insight and discovery...
Findability Day 2015   Liam Holley - Dassault systems - Insight and discovery...Findability Day 2015   Liam Holley - Dassault systems - Insight and discovery...
Findability Day 2015 Liam Holley - Dassault systems - Insight and discovery...
 

Recently uploaded

Falcon Invoice Discounting: Aviate Your Cash Flow Challenges
Falcon Invoice Discounting: Aviate Your Cash Flow ChallengesFalcon Invoice Discounting: Aviate Your Cash Flow Challenges
Falcon Invoice Discounting: Aviate Your Cash Flow Challengeshemanthkumar470700
 
Mifepristone Available in Muscat +918761049707^^ €€ Buy Abortion Pills in Oman
Mifepristone Available in Muscat +918761049707^^ €€ Buy Abortion Pills in OmanMifepristone Available in Muscat +918761049707^^ €€ Buy Abortion Pills in Oman
Mifepristone Available in Muscat +918761049707^^ €€ Buy Abortion Pills in Omaninstagramfab782445
 
Lucknow Housewife Escorts by Sexy Bhabhi Service 8250092165
Lucknow Housewife Escorts  by Sexy Bhabhi Service 8250092165Lucknow Housewife Escorts  by Sexy Bhabhi Service 8250092165
Lucknow Housewife Escorts by Sexy Bhabhi Service 8250092165meghakumariji156
 
Putting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptxPutting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptxCynthia Clay
 
Famous Olympic Siblings from the 21st Century
Famous Olympic Siblings from the 21st CenturyFamous Olympic Siblings from the 21st Century
Famous Olympic Siblings from the 21st Centuryrwgiffor
 
Lundin Gold - Q1 2024 Conference Call Presentation (Revised)
Lundin Gold - Q1 2024 Conference Call Presentation (Revised)Lundin Gold - Q1 2024 Conference Call Presentation (Revised)
Lundin Gold - Q1 2024 Conference Call Presentation (Revised)Adnet Communications
 
Katrina Personal Brand Project and portfolio 1
Katrina Personal Brand Project and portfolio 1Katrina Personal Brand Project and portfolio 1
Katrina Personal Brand Project and portfolio 1kcpayne
 
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAIGetting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAITim Wilson
 
Falcon Invoice Discounting: The best investment platform in india for investors
Falcon Invoice Discounting: The best investment platform in india for investorsFalcon Invoice Discounting: The best investment platform in india for investors
Falcon Invoice Discounting: The best investment platform in india for investorsFalcon Invoice Discounting
 
Rice Manufacturers in India | Shree Krishna Exports
Rice Manufacturers in India | Shree Krishna ExportsRice Manufacturers in India | Shree Krishna Exports
Rice Manufacturers in India | Shree Krishna ExportsShree Krishna Exports
 
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdfDr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdfAdmir Softic
 
Call 7737669865 Vadodara Call Girls Service at your Door Step Available All Time
Call 7737669865 Vadodara Call Girls Service at your Door Step Available All TimeCall 7737669865 Vadodara Call Girls Service at your Door Step Available All Time
Call 7737669865 Vadodara Call Girls Service at your Door Step Available All Timegargpaaro
 
TVB_The Vietnam Believer Newsletter_May 6th, 2024_ENVol. 006.pdf
TVB_The Vietnam Believer Newsletter_May 6th, 2024_ENVol. 006.pdfTVB_The Vietnam Believer Newsletter_May 6th, 2024_ENVol. 006.pdf
TVB_The Vietnam Believer Newsletter_May 6th, 2024_ENVol. 006.pdfbelieveminhh
 
How to Get Started in Social Media for Art League City
How to Get Started in Social Media for Art League CityHow to Get Started in Social Media for Art League City
How to Get Started in Social Media for Art League CityEric T. Tung
 
Cracking the 'Career Pathing' Slideshare
Cracking the 'Career Pathing' SlideshareCracking the 'Career Pathing' Slideshare
Cracking the 'Career Pathing' SlideshareWorkforce Group
 
Cannabis Legalization World Map: 2024 Updated
Cannabis Legalization World Map: 2024 UpdatedCannabis Legalization World Map: 2024 Updated
Cannabis Legalization World Map: 2024 UpdatedCannaBusinessPlans
 
Pre Engineered Building Manufacturers Hyderabad.pptx
Pre Engineered  Building Manufacturers Hyderabad.pptxPre Engineered  Building Manufacturers Hyderabad.pptx
Pre Engineered Building Manufacturers Hyderabad.pptxRoofing Contractor
 
Falcon Invoice Discounting: Unlock Your Business Potential
Falcon Invoice Discounting: Unlock Your Business PotentialFalcon Invoice Discounting: Unlock Your Business Potential
Falcon Invoice Discounting: Unlock Your Business PotentialFalcon investment
 
Arti Languages Pre Seed Teaser Deck 2024.pdf
Arti Languages Pre Seed Teaser Deck 2024.pdfArti Languages Pre Seed Teaser Deck 2024.pdf
Arti Languages Pre Seed Teaser Deck 2024.pdfwill854175
 

Recently uploaded (20)

Falcon Invoice Discounting: Aviate Your Cash Flow Challenges
Falcon Invoice Discounting: Aviate Your Cash Flow ChallengesFalcon Invoice Discounting: Aviate Your Cash Flow Challenges
Falcon Invoice Discounting: Aviate Your Cash Flow Challenges
 
Mifepristone Available in Muscat +918761049707^^ €€ Buy Abortion Pills in Oman
Mifepristone Available in Muscat +918761049707^^ €€ Buy Abortion Pills in OmanMifepristone Available in Muscat +918761049707^^ €€ Buy Abortion Pills in Oman
Mifepristone Available in Muscat +918761049707^^ €€ Buy Abortion Pills in Oman
 
Lucknow Housewife Escorts by Sexy Bhabhi Service 8250092165
Lucknow Housewife Escorts  by Sexy Bhabhi Service 8250092165Lucknow Housewife Escorts  by Sexy Bhabhi Service 8250092165
Lucknow Housewife Escorts by Sexy Bhabhi Service 8250092165
 
Putting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptxPutting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptx
 
Famous Olympic Siblings from the 21st Century
Famous Olympic Siblings from the 21st CenturyFamous Olympic Siblings from the 21st Century
Famous Olympic Siblings from the 21st Century
 
Lundin Gold - Q1 2024 Conference Call Presentation (Revised)
Lundin Gold - Q1 2024 Conference Call Presentation (Revised)Lundin Gold - Q1 2024 Conference Call Presentation (Revised)
Lundin Gold - Q1 2024 Conference Call Presentation (Revised)
 
Katrina Personal Brand Project and portfolio 1
Katrina Personal Brand Project and portfolio 1Katrina Personal Brand Project and portfolio 1
Katrina Personal Brand Project and portfolio 1
 
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAIGetting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
 
Falcon Invoice Discounting: The best investment platform in india for investors
Falcon Invoice Discounting: The best investment platform in india for investorsFalcon Invoice Discounting: The best investment platform in india for investors
Falcon Invoice Discounting: The best investment platform in india for investors
 
!~+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUD...
!~+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUD...!~+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUD...
!~+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUD...
 
Rice Manufacturers in India | Shree Krishna Exports
Rice Manufacturers in India | Shree Krishna ExportsRice Manufacturers in India | Shree Krishna Exports
Rice Manufacturers in India | Shree Krishna Exports
 
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdfDr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
 
Call 7737669865 Vadodara Call Girls Service at your Door Step Available All Time
Call 7737669865 Vadodara Call Girls Service at your Door Step Available All TimeCall 7737669865 Vadodara Call Girls Service at your Door Step Available All Time
Call 7737669865 Vadodara Call Girls Service at your Door Step Available All Time
 
TVB_The Vietnam Believer Newsletter_May 6th, 2024_ENVol. 006.pdf
TVB_The Vietnam Believer Newsletter_May 6th, 2024_ENVol. 006.pdfTVB_The Vietnam Believer Newsletter_May 6th, 2024_ENVol. 006.pdf
TVB_The Vietnam Believer Newsletter_May 6th, 2024_ENVol. 006.pdf
 
How to Get Started in Social Media for Art League City
How to Get Started in Social Media for Art League CityHow to Get Started in Social Media for Art League City
How to Get Started in Social Media for Art League City
 
Cracking the 'Career Pathing' Slideshare
Cracking the 'Career Pathing' SlideshareCracking the 'Career Pathing' Slideshare
Cracking the 'Career Pathing' Slideshare
 
Cannabis Legalization World Map: 2024 Updated
Cannabis Legalization World Map: 2024 UpdatedCannabis Legalization World Map: 2024 Updated
Cannabis Legalization World Map: 2024 Updated
 
Pre Engineered Building Manufacturers Hyderabad.pptx
Pre Engineered  Building Manufacturers Hyderabad.pptxPre Engineered  Building Manufacturers Hyderabad.pptx
Pre Engineered Building Manufacturers Hyderabad.pptx
 
Falcon Invoice Discounting: Unlock Your Business Potential
Falcon Invoice Discounting: Unlock Your Business PotentialFalcon Invoice Discounting: Unlock Your Business Potential
Falcon Invoice Discounting: Unlock Your Business Potential
 
Arti Languages Pre Seed Teaser Deck 2024.pdf
Arti Languages Pre Seed Teaser Deck 2024.pdfArti Languages Pre Seed Teaser Deck 2024.pdf
Arti Languages Pre Seed Teaser Deck 2024.pdf
 

Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Emerging Enterprise Search Solution

  • 1. Beyond the Enterprise Data Warehouse The emerging Enterprise Search solutions Findwise Göteborg 2009-02-25 Helge Legernes helge.legernes@findwise.se www.findwise .se | info@findwise.se | +46 31 288 400 | Drottningg. 5 | 411 14 Göteborg
  • 2. Abstract This document elaborates the role of the enterprise search technology as an intelligent retrieval platform for structured data, a role traditionally held by the Relational Database Management Systems (RDBMS). Furthermore it investigates the great possibility by enterprise search solutions to derive insights and patterns by also analyzing the unstructured data, which is not possible to do with traditional data warehouse systems based on RDBMS. © 2009 Findwise AB 1
  • 3. Table of Contents Enterprise search as enterprise data warehouse ..............................................................4 An example ..................................................................................................................................................................5 Enterprise search vs. EDW solution .................................................................................6 Scalability and distributed architecture ....................................................................................................................6 True Ad-hoc Querying and Relevancy .......................................................................................................................6 User interaction ..........................................................................................................................................................6 More content – better analysis..................................................................................................................................7 Real time content – not batch oriented content ......................................................................................................7 Low implementation and maintenance costs with great flexibility ........................................................................7 One common index used by all users in the enterprise ...........................................................................................8 Conclusion ....................................................................................................................9 References .................................................................................................................. 10 © 2009 Findwise AB 2
  • 4. Introduction It has long been held as convention that intelligent exploration of corporate operational systems requires a commitment to Business Intelligence and Data Warehousing technologies. Recent demands, however, have compelled organizations to look for ways to enhance their technical portfolio with capabilities similar to those found at public Web search portals or with their current Content Management Systems. Generally, the demand arises from extreme requirements in performance and scalability, a desire to incorporate unstructured data, or adopting more effective analytics via concept and context based textual mining. Every once in a while an industry develops a peculiar ability to sustain two seemingly identical technologies in fundamentally incompatible ways. Think of the media industry with the computer and the television. Both involve a source of electronic information, a vehicle for transmission, and a device for visual display. Yet how long did it take before we could share the same monitor, or record a show on the same machine as the family photos? Now consider OLAP (Online Analytical Processing) and Search. Both include a central data store; mechanisms for extracting data from multiple sources; logic for intelligent, ad-hoc retrieval; and visualization paradigms to organize the results. They even have parallel jargon: knowledge and data management, text and data mining, search and query, document and row, and so on. And they are entirely incompatible with each other – or were. We are all quite familiar with the RDBMS as the de facto standard, but the extraction and analysis process, the traditional prevue of the OLAP environment – ETL (Extract Transform and Load), data warehouse, data mart, and business intelligence – is beginning to strain the relational model. Why is it so hard to support natural language queries? How is meaning extracted from textual content? Why can’t the system be more real-time? How can it provide better analytics? Why is the whole process so expensive to manage? The RDBMS vendors are just now releasing new versions of their database engines embedded with the beginnings of search technology. Perhaps, they “peered over the wall” at the search vendors to see how they’ve been solving these problems. If they did, they would have been surprised to see the search vendors staring right back at them, for they have been mining both structured and unstructured content for some time now, and doing so in ways quite different than the RDBMS. Conventional approaches to information discovery and access on structured data are running out of steam. Traditional proprietary applications are often too complex for people to use without expertise and specific training. Furthermore software licenses are too expensive for wide-scale deployment to all the users who may need access. © 2009 Findwise AB 3
  • 5. Enterprise search as enterprise data warehouse When corporate users want to know something, they are looking inward, using search techniques to seek out clues that are scattered across the entire corporate IT resource—from the data submerged in silos including ERP, ordering and financial systems, intranets, e-mail and Web servers, and users’ own workstations. This involves going through multiple sources and linking together the pieces of information to give a starting point for decisions. The past three to four years we have seen a real focus on enabling corporate users to do this more efficiently, because without effective companywide search techniques, users end up wasting time as they hunt across multiple systems for a piece of information. As a consequence the companies suffer as well. Over time, the Enterprise Search Engine System will certainly become the most shared system within the Unstructured information enterprise. A big difference until now has been that the Data held in Word and PDF documents on EDW systems have mostly dealt with structured data, while servers and users’ desktops that are not the ESE systems have dealt with both structured and indexed, tagged or archived for easy location. unstructured data. So, how can business users utilize the European and U.S. analysts agree that about 80 unstructured and structured information that resides both percent of information within businesses is in their companies as well as out on the Internet to make unstructured. better tactical and strategic decisions? This collection of information might include business intelligence reports as well as the data underlying these reports, blogs, news feeds, or other information that resides in the company or on the Internet. Figure 1 – Typical enterprise search architecture We need to set some definitions: First, Enterprise search refers to technology that indexes and retrieves information in structured and unstructured sources in both internal data sources and web sites for a company's user base. Typically, this search is still goal-oriented—the users know what they are looking for. Enterprises use search for a variety of purposes including simple site search as well as finding basic answers to questions such as: “What is my authorization limit?” or, “How do I order a new computer?” As well as more advanced analysis that leverages text analytics. © 2009 Findwise AB 4
  • 6. Second, Text analytics is a set of techniques that can discover and extract unstructured text and transform it into structured information that can then be leveraged in various ways. Various extraction techniques exist including entity extraction (e.g. people, place, financial amount), concept extraction (e.g. unhappy customer) or fact extraction (e.g. an event specification), to name a few. For example, concepts may be created using various linguistic- and statistics-based processes. Different vendors make use of different techniques and have created patents for these techniques. Lastly, Convergence is bringing these two technologies together and then searching using extraction indexes created by the text analytics. In other words, search results are filtered by concepts, entities, facts, or other extraction types and utilized to answer questions. An example Consider the following prime example of what might be possible utilizing this approach. A manufacturing company is interested in gathering intelligence on why it is losing market share. In the past, it would have looked at numerous reports including: sales reports, warranty reports, customer satisfaction surveys and analyst reports. It would have also had business analysts searching the Internet to find any relevant information. Leveraging this new approach, the organization might create concepts that it uses to search through all of its internal documents and business intelligence reports as well as any external news feeds and articles. These concepts might include “unhappy customer”, and/or utilize various entities like “competitors making financial transactions.” The concept itself may come from a taxonomy or ontology that has already been created by the organization or a third party. Or, it may have been created by a subject matter expert working with this information via a GUI that is part of a software package. This extracted information is sent to a search engine interface. The results might appear as they would with any search engine. The results might also be piped to a business intelligence product to produce plots of percent unhappy, happy, and neutral (no comment) customers. These table plots might have been derived from the text of customer care centres or customer surveys and would have drill down capabilities to enable line of business users to explore why these customers are unhappy and do further analysis. Levering this kind of approach, companies can then derive insights that were not possible before. © 2009 Findwise AB 5
  • 7. Enterprise search vs. EDW solution There are several reasons for choosing enterprise search, some of which are discussed in the following sections. Scalability and distributed architecture Traditional enterprise architectures gain performance primarily through increased use of CPU and memory. This is scalability through more grid iron and its costs increase exponentially with demand. RDBMS technologies adopted this model from the beginning, which is understandable since it was the prevailing solution when the industry was new and growing. They have recently begun a move towards more distributed models, but the journey must be difficult. To be truly multi-dimensionally distributed means a basic rewrite of your core. New enterprise architectures are distributed from the ground up, balancing demand for CPU, memory, and disk to reflect the realities of commodity hardware, which is any collection of off-the-shelf inexpensive computers, typically “Lintel/Wintel”, networked together in a grid. Since performance is gained through simply adding another computer to the grid, costs are linear in growth. This is scalability through grid computing, and IT organizations are beginning to adopt this model for all their enterprise requirements. True Ad-hoc Querying and Relevancy The relational model is exactly that – a model. The fact that there is a schema at all means the RDBMS works better for some queries over others. The issue is not so much who has the fastest query but who has the slowest. For example, the difference between a 20 and 50 millisecond delay is not discernable to the user, but a 30 second response time is. Variance is often far more important than average query time. The problem lies with the fact that a schema is counter to the notion of a purely ad-hoc query environment. The industry devised new denormalized designs, such as star and snowflake schemas, and even introduced multi-dimensional database engines to alleviate the problem as much as possible, but while these technologies were good for queries generally focused on sales, financial, and production analyses, they were no good for real-time processing, heavy analytics, historical comparisons, complex updates (e.g. data corrections), textually-centric content, or environments requiring a large number of dimensions. A truly unbiased model is one that has no schema at all (i.e. hyper-denormalized to one table). This is the index of an enterprise search engine. Another problem is the language itself. SQL is designed to query systems where the results support a binary inclusion model. In other words, data is either part of the answer or not. The order of the data itself in the result set has no relevant meaning other than a preference for sorting and summation. This model leaves out the whole world of relative inclusion, where data has a graduated score of relevance in the result list. The omission is understandable. Business intelligence focuses almost exclusively on financial and other numerical data, really to support monetary trends to measure an organization’s health or assess risk. However, a lot of information is left on the table when one elects to ignore the textual content or relegates it to simple identifiers for qualifying these analyses User interaction The popularity of the Web search vendors (Google, Yahoo!, and Microsoft) has had an interesting effect on the expectations of users in the enterprise market. The simplicity of the search bar, the power of navigators, and the convenience of simple ranked results are now basic requirements. “Why can’t our stuff be as simple as © 2009 Findwise AB 6
  • 8. Google?” is a common cry. It is not surprising this is the case, since what better an audience to test the acceptability of a user interface than the world at large? The net effect is a shift in user interaction design (UI is more about “interaction” these days than “interface”) as it relates to information retrieval, exploration, and analysis. The new model recognizes the need for an integrated approach to querying and exploring content. The interaction is a combination of techniques that all resolve to the same action: asking for information and getting results. How intelligently the system allows you to ask your question measures the quality of the interaction. The basic model is this: providing both a search box and several smart navigation approaches offers the best interface for query and exploration, and an intelligent balance of the two is the most efficient interaction model for intelligent extraction of information. A good second generation enterprise search engine supports all of these capabilities out of the box. The RDBMS vendors are just beginning to incorporate some of them in their latest versions. More content – better analysis As mentioned earlier one often claim that 80 percent of an organization’s content resides in unstructured repositories (email, documents, etc.), while only 20 percent resides in databases. The content management vendors use this statistic often. While true, it may also be the case that the 20 percent in the database has 80 percent of the importance. The argument is largely academic, because the real answer is to make the entire 100 percent available. You can’t find something that isn’t there, so why not make sure it is in the corpus of searchable content? This is simple insurance against the “not knowing what you don’t know” problem that inevitably inflicts the worst pain on an organization. Furthermore, the integration of structured and unstructured content allows for new investigative approaches. Imagine monitoring the efficacy of an advertising campaign by tracking product sales and market intelligence in a coordinated fashion Real time content – not batch oriented content We often think of content as primarily historical, a snapshot of activity that occurs over a period of time (which may be as recent as the last 24 hours). This is such a standard assumption the batch orientation of the data warehousing market requires that it be true. But it is not always true. What if I’m monitoring stock transactions and I wish to get answers before the trader executes the manager’s requests? What if my job is to look for patterns on the newswire? The data in these scenarios is streamed in real time, not digested in batches, and it should be supported as such. Note the importance of latency here. Streamed content should be searchable with minimal delay, often less than a second. Second generation enterprise search engines support alerts and sub-second latency. In an OLAP environment this would be considerably more difficult. Low implementation and maintenance costs with great flexibility In a typical OLAP environment you will find one or more data warehouses, several operational data stores and/or data marts, assorted ETL technologies, and a myriad of business intelligence repositories that define reports, queries and other assorted meta-data. Creating and maintaining these components requires a host of engineers, DBAs, and knowledge workers (who we pretend do not need to understand relational concepts but really do if they build reports or queries, even graphically). Now add to this environment meta-data management, keyword and acronym dictionaries, multiple languages, thesauri, etc. and you can see how the management costs can proliferate. © 2009 Findwise AB 7
  • 9. Search engines, especially those with a history of servicing the Web, are designed for the widest audience, consumers who know absolutely nothing about technology. The knowledge worker does not exist in the consumer market, so the ad hoc report at its most simplest is merely a search bar that returns ranked results and sometimes dynamically generated navigators to assist in exploration. The management process, as you might imagine, is also much easier, if for no other reason that there is one single component to install, configure, and manage rather than several. Please note that a DBA is no longer required. Figure 2 – Search vs. EDW The “single component” comment requires some explanation. Recall that data marts and operational data stores were created to overcome the limitations of the relational model for handling ad-hoc querying – avoiding the “killer query” – and because the RDBMS was never able to efficiently support high volume data updates and query traffic at the same time. The search engine has no such limitations. One common index used by all users in the enterprise As mentioned earlier an enterprise search solution could serve as the basis for one “single component” where the index can serve many different applications and different users and their needs. Many of the search solutions have a range of powerful presentation-layer functions. These enable overlaying of results of multiple related searches—allowing users to explore commonalities between groups or search results, building an in-depth picture of results from seemingly unconnected data. The enterprise search solutions can also feed different BI tools, but of course as time evolves the search vendors will enhance their products with integrated BI tools. © 2009 Findwise AB 8
  • 10. Figure 3 – Enterprise search: one common platform and shared data Conclusion Enterprise search technology may not have been around as long as it’s RDMS counterpart, but it has been much more aggressive in its intelligent retrieval of information from both structured and unstructured content. Whatever the reasons, for organizations to compete effectively, report accurately, defend aggressively, or most any other activity, the key to their success is their ability to get the right information to the right people at the right time (and at the right cost); from any source, internal or external to the organization. The enterprise search solution should be considered to be the best candidate that can effectively deliver. © 2009 Findwise AB 9
  • 11. References Ardentia ”Zen and the art of enterprise search”, by Richard Lewis, FAST Search & Transfer “Search in a structured data environment”, by Davor Peter Sutija, Hurwitz & Associates “Enterprise search evolves”, by Fern Halper. © 2009 Findwise AB 10