SlideShare uma empresa Scribd logo
1 de 52
© COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Matt Turner, CTO Media & Entertainment
Introduction to MarkLogic NoSQL
SLIDE: 2 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Outline
• Something’s Happening Here
• The Old and the New
 Data models
 Data access
• Discussion
Analysis Operations Access
DATA MAKES AN IMPACT
SLIDE: 4 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Stress on Traditional Data Approaches
Complexity
 Structured
 Unstructured
 Semi-structured
 Raw
 Streams of data
 Constant change
 Agile analytics
 Fail-fast
Volume
Velocity Variety
Volume • Many months of system log files
• Every tweet
• Years of articles
• Relative to current size of
operation
Velocity • Streams of customer feedback
to determine sentiment
• Real-time risk analysis
• Real-time Business Intelligence
Variety • Database feeds
• Raw logs
• Web crawl data
• Articles
• Multi-media
• ALSO: questions!
Examples
Big Data: Gartner coined the “three V’s” description
 Data: Petabyte
scale
 Nodes:
Thousands
SLIDE: 5 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
SLIDE: 6 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Stress on Traditional Data Approaches
Complexity
 Structured
 Unstructured
 Semi-structured
 Raw
 Streams of data
 Constant change
 Agile analytics
 Fail-fast
Volume
Velocity Variety
Volume • Many months of system log files
• Every tweet
• Years of articles
• Relative to current size of
operation
Velocity • Streams of customer feedback
to determine sentiment
• Real-time risk analysis
• Real-time Business Intelligence
Variety • Database feeds
• Raw logs
• Web crawl data
• Articles
• Multi-media
• ALSO: questions!
Examples
Big Data: Gartner coined the “three V’s” description
 Data: Petabyte
scale
 Nodes:
Thousands
SLIDE: 7 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Leader Quadrant
Online Transaction
Processing RDBS
(May 2002)
SLIDE: 8 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Leader Quadrant
Operational DBMS
(Oct 2014)
Traditional
Mainstays
Upstarts Storm
the Field
SLIDE: 9 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
MarkLogic:
Best Operational
Data Warehouse
(Aug 2014)
SLIDE: 10 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
A Unified, Actionable
360 View of Data
WHAT BUSINESSES WANT
Analysis Operations Access
DATA MAKES AN IMPACT
SLIDE: 13 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Data Is In Silos
 Data is spread across disconnected databases
 M&A outpaces the speed of data integration
 Data needs to be delivered in real time
THE REALITY
SLIDE: 14 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
80% OF TIME
By data scientists just
wrangling data
WASTED
In 2015 on creating relational
data silos
Of data warehouse projects
is on ETL
The Massive Cost of Integrating Data From Silos
36BILLION IN
SPENDING
$% OF THE
COST60
SLIDE: 15 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Relational Databases with
ETL Sacrifice Agility,
Timeliness, and Cost
 All future data needs must be predictable
 New SQL queries require database re-indexing
 Siloed database changes require ETL re-writes
THE IT CHALLENGE
ETL
OLTP
ARCHIVES
ETL
ETL
ETL
DATA MARTS
ETL
WAREHOUSE
REFERENCE DATA
SLIDE: 16 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
OLTP
Warehouse
Data MartsArchives
“Unstructured”
“ ”
Video
Audio
Signals,
Logs,
Streams
Social
Documents,
Messages
{ }
Metadata
Search🔍
Reference
Data
It’s Complicated
The OLD:
Let’s Design the Application
(And pretend it’s the 80s)
SLIDE: 18 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Name Hair Colour Fulltime Employee? Car type
Paul Blond Y
Alex Auburn Y Porsche
Dom Black Y Hummer
name hr_colr flltme_empl car_tp
Let’s Begin… Cast Members
{
How many characters wide should this be? 8? 16? 32?
{
{
{
SLIDE: 19 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
New Schema – Extend Ours!
name hr_colr flltme_empl car_tp
Paul Blond Y
Alex Auburn Y porsche
Dom Black Y Hummer
house_road town city postcode
11d Yonge Pk Finsbury London N4 3NU
Reading
London N43
• Hang on
• If this table had 10k rows, issues?
• First create new big schema
• Then import rows across
• Delete old table?
• Maybe not, legacy programs might use it!
• What if we want to select “Road” only?
• Split out again
• More extensions?
• House name and number?
SLIDE: 20 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
There is another way!
Create a new table and point to it from the old one!
name hr_colr flltme_empl car_tp Address
Paul Blond Y
Alex Auburn Y porsche
Dom Black Y Hummer
house_road town city postcode
11d Yonge Pk Finsbury London N4 3NU
Reading
London N43
SLIDE: 21 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
…
SLIDE: 22 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Now Let’s Store Something More . . . Complicated
Transcript / Book
Info
Title = “NL April 14”
Author = “SNL Cast”
Section
• Chapter
Page
Paragraph =
“I love penguins because…”
Page
Paragraph =
“On the subject of food…”
• Chapter
Page
Section
• Chapter
• Chapter
• Chapter
• Paragraph
• Paragraph
title author Section
I love
Penguins
S. Lion
Issues with Sections? How many columns?
SLIDE: 23 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Don’t Forget Taxonomies
Hierarchical levels of metadata
Fixed to a specific business purpose
 Can’t be re-used in new contexts
Each record can only be associated with
one level
 How many category fields?
Category
Feature
Series
Action
Drama
Comedy
Documentary
…
Cable
Broadcast
Drama
Comedy
…
Action
Drama
Family
Documentary
…
SLIDE: 24 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Result
Requires everything to be defined up front
Data to be transformed and processed to
fit the system
Needs to be redone as information
changes
Costly to create, maintain and only
captures part of the data!
Title ProductionDate Category AssetType Length
Film1 3/1/14 Feature HD Master 2:40
Show1 6/4/13 Series HD720 0:40
Film2 6/4/05 Feature Archive 1:55
Category
Feature
Series
Action
Drama
Comedy
Documentary
…
Cable
Broadcast
Drama
Comedy
…
Action
Drama
Family
Documentary
…
?
Traditional Technology
SLIDE: 26 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
OLTP
Warehouse
Data MartsArchives
“Unstructured”
“ ”
Video
Audio
Signals,
Logs,
Streams
Social
Documents,
Messages
{ }
Metadata
Search🔍
Reference
Data
*NOTE: We only did this
little bit!
Remember?
SLIDE: 27 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
The NEW! Enter NoSQL
Category Description Examples
Key-value • Persistent hash-table “on steroids”
• Typically no single modeling paradigm (e.g. columns
can be primitives, data structures, binaries, etc.)
• Amazon
DynamoDB
• Redis
• Riak
Columnar • Similar to K-V in some ways
• Column may be arranged in groups (families)
• Data types are usually the expected “primitives”
• Works well with “value crunching” (e.g. time series)
• HBase
• Cassandra
Document • URI-mapped (i.e. keyed) documents in lieu of rows
• Supports structured and unstructured content
• Nested context
• MarkLogic
• MongoDB
• Couchbase
Graph • Deals with inter-object graphs
• Relationship oriented
• Think object cache (with pointers) “on steroids”
• Neo4J
• AllegroGraph
• InfoGrid
SLIDE: 28 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
A Database That
Integrates Data Better,
Faster, with Less Cost
THE DESIRED SOLUTION
SLIDE: 29 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
The MarkLogic Alternative
An Operational and Transactional Enterprise NoSQL Database
 Data ingested as is (no ETL)
 Structured and unstructured data
 Data and metadata together
 Adapts to changing data
and changing data structures
EASY TO
GET DATA IN
Flexible Data Model
 Index once and query endlessly
 Real-time and lightning fast
 Query across JSON, XML, text,
geospatial, and semantic triples
in one database
EASY TO
GET DATA OUT
Ask Anything Universal Index
 Reliable data and transactions
(100% ACID compliant)
 Out-of-the-box automatic
failover, replication, and
backup/recovery
 Enterprise-grade security and
Common Criteria certified
100%
TRUSTED
Enterprise Ready
SLIDE: 30 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
The SNL App
SLIDE: 31 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
No need to define up front
Matched to complex content and
metadata data modeling
Data is managed in its most
accessible, natural form
XML, JSON, RDF, geospatial
Flexible Data Model
Schema-agnostic, structure-aware
SLIDE: 32 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Instead of THIS
SLIDE: 33 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Do it like THIS!
SLIDE: 34 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Search and Query
Search to find answers in documents, relationships, and metadata
 Automatic indexing of every data value, text and data
structure
 Specialized indexes for data values (analytics, facets,
sorting), geospatial and triples
 All updated in the context of ACID transactions to
ensure data integrity and real-time access
 Accessible via fully programmable search API with full-
text search, type-ahead suggestions, facets, snippeting,
highlighted search terms, proximity boosting, relevance
ranking, and language support
JavaScript XQuery SPARQL
Rich Query
Capability
In-database
MapReduce
Full-text
Search
Semantic
Search
Geospatial
Search
Timing
Context
Who’s Smarter?
VS
Do domestic dogs interpret pointing as a command?
Animal Cognition (2012): 1-12 , November 09, 2012
By Scheider, Linda; Kaminski, Juliane; Call, Josep; Tomasello, Michael
Context!
© COPYRIGHT 2015 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.SLIDE: 39
Machines Don’t Get Context . . .
Manu Sporny Founder/CEO - Digital Bazaar, Inc.
http://www.cambridgesemantics.com/semantic-university/what-is-linked-data
SLIDE: 40 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Enter Semantics!
Manu Sporny Founder/CEO - Digital Bazaar, Inc.
http://www.cambridgesemantics.com/semantic-university/what-is-linked-data
SLIDE: 41 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Semantics
Enterprise triple store, document store, and database combined
 Store and query billions of facts and relationships
 Leverage ontologies for domain and role specific
context access to data and documents
 Efficient metadata management with relationships
to ontologies
 Standards-based for ease of use and integration
– RDF, SPARQL, and standard REST
interfaces
SLIDE: 42 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Semantics to Model Relationships
Data model to manage relationships and link together data
‘triples’ describe single facts
Collections of facts describe complex real-world scenarios
“Chevy” ”NBC"
isOn
”SNL"
isOn
isOn
!
SLIDE: 43 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Ontologies Instead of Categories
Actually model information as it is in
the real world
Not limited to a single purpose
 Ontologies for all categories of
metadata
 Even ‘impossible’ categories
like fictional worlds
SLIDE: 44 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
NoSQL and Semantics!
SLIDE: 45 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Real-Time Analytics
Range indexes can be used for
 Faceted search
 Aggregation and visualization
 Analytics…
…including custom user-defined functions
 Co-occurrence
 SQL, ODBC, and BI integration
SLIDE: 46 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Scalability, Elasticity and Cloud
Massive enterprise scalability and elasticity
 Scale horizontally in clusters on commodity
hardware to hundreds of nodes, petabytes of
data, and billions of documents
 Process thousands of multi-document multi-
statement transactions per second
 Start small and scale up or down to meet capacity
and performance demands without over-
provisioning or over-spending
 Fully cloud enabled for automated deployment
and management on EC2
 Leverage dynamic configurations with Tiered
Storage
D-NODE D-NODE
E-NODE E-NODE
D-NODE
Result: Enterprise-ready to power mission critical products
SLIDE: 47 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Use Case: Deliver Better Information
Present information based on
relationships
Go beyond traditional technology with
depth of content
Drive efficiency using semantic approach
to tagging
SLIDE: 48 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Use Case: Go Beyond Search
• Concept instead of keyword search
• Related content and information
drive the content discovery and new
interactions
 SNL40 continuous viewing
• Dynamically tailored to the users
specific attributes or activity
SLIDE: 49 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Use Case: Integrate Data
• Integrate data across the automoti
Bob Pilz
Taxonomy Manager
Mitchell1
SLIDE: 51 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Semantics-driven search
Talent
Kristen Wiig
Acted in
Episode 4
Anne Hathaway and Killers
Part of
Played
Character
Maharelle Sister
Season 34
Segment
The Lawrence Welk Show
Aired on
Date
10/4/08
Era
Acted in
Includes
Part of
SLIDE: 52 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
Intelligent recommendation engine

Mais conteúdo relacionado

Mais procurados

Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital.AI
 
Smarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing PlatformSmarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing PlatformOntotext
 
Vital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent AppsVital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent AppsVital.AI
 
PoolParty GraphSearch - The Fusion of Search, Recommendation and Analytics
PoolParty GraphSearch - The Fusion of Search, Recommendation and AnalyticsPoolParty GraphSearch - The Fusion of Search, Recommendation and Analytics
PoolParty GraphSearch - The Fusion of Search, Recommendation and AnalyticsSemantic Web Company
 
Linking Open, Big Data Using Semantic Web Technologies - An Introduction
Linking Open, Big Data Using Semantic Web Technologies - An IntroductionLinking Open, Big Data Using Semantic Web Technologies - An Introduction
Linking Open, Big Data Using Semantic Web Technologies - An IntroductionRonald Ashri
 
Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios
Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios
Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios SpagoWorld
 
One Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACLOne Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACLConnected Data World
 
Vital AI: Big Data Modeling
Vital AI: Big Data ModelingVital AI: Big Data Modeling
Vital AI: Big Data ModelingVital.AI
 
How Graphs Continue to Revolutionize The Prevention of Financial Crime & Frau...
How Graphs Continue to Revolutionize The Prevention of Financial Crime & Frau...How Graphs Continue to Revolutionize The Prevention of Financial Crime & Frau...
How Graphs Continue to Revolutionize The Prevention of Financial Crime & Frau...Connected Data World
 
Schneller Nutzen mit Neo4j: das Beispiel Panama Papers
Schneller Nutzen mit Neo4j: das Beispiel Panama PapersSchneller Nutzen mit Neo4j: das Beispiel Panama Papers
Schneller Nutzen mit Neo4j: das Beispiel Panama PapersNeo4j
 
Running complex data queries in a distributed system
Running complex data queries in a distributed systemRunning complex data queries in a distributed system
Running complex data queries in a distributed systemArangoDB Database
 
Paris Spark Meetup - Trifacta - 03_04_2017
Paris Spark Meetup - Trifacta - 03_04_2017Paris Spark Meetup - Trifacta - 03_04_2017
Paris Spark Meetup - Trifacta - 03_04_2017Modern Data Stack France
 
Stephen Buxton: When RDF alone is not enough - triples, documents, and data i...
Stephen Buxton: When RDF alone is not enough - triples, documents, and data i...Stephen Buxton: When RDF alone is not enough - triples, documents, and data i...
Stephen Buxton: When RDF alone is not enough - triples, documents, and data i...Semantic Web Company
 
II-SDV 2017: Custom Open Source Search Engine with Drupal 8 and Solr at Frenc...
II-SDV 2017: Custom Open Source Search Engine with Drupal 8 and Solr at Frenc...II-SDV 2017: Custom Open Source Search Engine with Drupal 8 and Solr at Frenc...
II-SDV 2017: Custom Open Source Search Engine with Drupal 8 and Solr at Frenc...Dr. Haxel Consult
 
Hadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and MoreHadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and MoreTrendwise Analytics
 
An introduction to multi-model databases
An introduction to multi-model databasesAn introduction to multi-model databases
An introduction to multi-model databasesBerta Hermida Plaza
 
Supporting GDPR Compliance through effectively governing Data Lineage and Dat...
Supporting GDPR Compliance through effectively governing Data Lineage and Dat...Supporting GDPR Compliance through effectively governing Data Lineage and Dat...
Supporting GDPR Compliance through effectively governing Data Lineage and Dat...Connected Data World
 
Rob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopRob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopGhassan Al-Yafie
 
Leveraging Taxonomy Management with Machine Learning
Leveraging Taxonomy Management with Machine LearningLeveraging Taxonomy Management with Machine Learning
Leveraging Taxonomy Management with Machine LearningSemantic Web Company
 
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...Dr. Haxel Consult
 

Mais procurados (20)

Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
 
Smarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing PlatformSmarter content with a Dynamic Semantic Publishing Platform
Smarter content with a Dynamic Semantic Publishing Platform
 
Vital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent AppsVital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent Apps
 
PoolParty GraphSearch - The Fusion of Search, Recommendation and Analytics
PoolParty GraphSearch - The Fusion of Search, Recommendation and AnalyticsPoolParty GraphSearch - The Fusion of Search, Recommendation and Analytics
PoolParty GraphSearch - The Fusion of Search, Recommendation and Analytics
 
Linking Open, Big Data Using Semantic Web Technologies - An Introduction
Linking Open, Big Data Using Semantic Web Technologies - An IntroductionLinking Open, Big Data Using Semantic Web Technologies - An Introduction
Linking Open, Big Data Using Semantic Web Technologies - An Introduction
 
Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios
Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios
Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios
 
One Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACLOne Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACL
 
Vital AI: Big Data Modeling
Vital AI: Big Data ModelingVital AI: Big Data Modeling
Vital AI: Big Data Modeling
 
How Graphs Continue to Revolutionize The Prevention of Financial Crime & Frau...
How Graphs Continue to Revolutionize The Prevention of Financial Crime & Frau...How Graphs Continue to Revolutionize The Prevention of Financial Crime & Frau...
How Graphs Continue to Revolutionize The Prevention of Financial Crime & Frau...
 
Schneller Nutzen mit Neo4j: das Beispiel Panama Papers
Schneller Nutzen mit Neo4j: das Beispiel Panama PapersSchneller Nutzen mit Neo4j: das Beispiel Panama Papers
Schneller Nutzen mit Neo4j: das Beispiel Panama Papers
 
Running complex data queries in a distributed system
Running complex data queries in a distributed systemRunning complex data queries in a distributed system
Running complex data queries in a distributed system
 
Paris Spark Meetup - Trifacta - 03_04_2017
Paris Spark Meetup - Trifacta - 03_04_2017Paris Spark Meetup - Trifacta - 03_04_2017
Paris Spark Meetup - Trifacta - 03_04_2017
 
Stephen Buxton: When RDF alone is not enough - triples, documents, and data i...
Stephen Buxton: When RDF alone is not enough - triples, documents, and data i...Stephen Buxton: When RDF alone is not enough - triples, documents, and data i...
Stephen Buxton: When RDF alone is not enough - triples, documents, and data i...
 
II-SDV 2017: Custom Open Source Search Engine with Drupal 8 and Solr at Frenc...
II-SDV 2017: Custom Open Source Search Engine with Drupal 8 and Solr at Frenc...II-SDV 2017: Custom Open Source Search Engine with Drupal 8 and Solr at Frenc...
II-SDV 2017: Custom Open Source Search Engine with Drupal 8 and Solr at Frenc...
 
Hadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and MoreHadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and More
 
An introduction to multi-model databases
An introduction to multi-model databasesAn introduction to multi-model databases
An introduction to multi-model databases
 
Supporting GDPR Compliance through effectively governing Data Lineage and Dat...
Supporting GDPR Compliance through effectively governing Data Lineage and Dat...Supporting GDPR Compliance through effectively governing Data Lineage and Dat...
Supporting GDPR Compliance through effectively governing Data Lineage and Dat...
 
Rob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopRob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoop
 
Leveraging Taxonomy Management with Machine Learning
Leveraging Taxonomy Management with Machine LearningLeveraging Taxonomy Management with Machine Learning
Leveraging Taxonomy Management with Machine Learning
 
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
 

Destaque

Warum NoSQL Datenbanken auf dem Vormarsch sind
Warum NoSQL Datenbanken auf dem Vormarsch sindWarum NoSQL Datenbanken auf dem Vormarsch sind
Warum NoSQL Datenbanken auf dem Vormarsch sindRegina Holzapfel
 
Warum NoSQL? Wann macht der Einsatz von NoSQL Datenbanken Sinn?
Warum NoSQL? Wann macht der Einsatz von NoSQL Datenbanken Sinn?Warum NoSQL? Wann macht der Einsatz von NoSQL Datenbanken Sinn?
Warum NoSQL? Wann macht der Einsatz von NoSQL Datenbanken Sinn?Regina Holzapfel
 
Marklogic and the Linked Data Connection
Marklogic and the Linked Data ConnectionMarklogic and the Linked Data Connection
Marklogic and the Linked Data Connectionpfennell
 
Cassandra Consistency: Tradeoffs and Limitations
Cassandra Consistency: Tradeoffs and LimitationsCassandra Consistency: Tradeoffs and Limitations
Cassandra Consistency: Tradeoffs and LimitationsPanagiotis Papadopoulos
 
SharePoint 2013 Javascript Object Model
SharePoint 2013 Javascript Object ModelSharePoint 2013 Javascript Object Model
SharePoint 2013 Javascript Object ModelInnoTech
 
Share point hosted add ins munich
Share point hosted add ins munichShare point hosted add ins munich
Share point hosted add ins munichSonja Madsen
 
Essential Knowledge for SharePoint Add-Ins
Essential Knowledge for SharePoint Add-InsEssential Knowledge for SharePoint Add-Ins
Essential Knowledge for SharePoint Add-InsInnoTech
 
Build and Deploy Provider-hosted SharePoint Add-ins
Build and Deploy Provider-hosted SharePoint Add-insBuild and Deploy Provider-hosted SharePoint Add-ins
Build and Deploy Provider-hosted SharePoint Add-insDanny Jessee
 
Chris O'Brien - Comparing SharePoint add-ins (apps) with Office 365 apps
Chris O'Brien - Comparing SharePoint add-ins (apps) with Office 365 appsChris O'Brien - Comparing SharePoint add-ins (apps) with Office 365 apps
Chris O'Brien - Comparing SharePoint add-ins (apps) with Office 365 appsChris O'Brien
 
Rev Your Engines - SharePoint Performance Best Practices
Rev Your Engines - SharePoint Performance Best PracticesRev Your Engines - SharePoint Performance Best Practices
Rev Your Engines - SharePoint Performance Best PracticesEric Shupps
 
Real World SharePoint Add-In Development
Real World SharePoint Add-In DevelopmentReal World SharePoint Add-In Development
Real World SharePoint Add-In DevelopmentEric Shupps
 
Develop a SharePoint App in 45 Minutes
Develop a SharePoint App in 45 MinutesDevelop a SharePoint App in 45 Minutes
Develop a SharePoint App in 45 MinutesTom Resing
 
Top 10 sharepoint interview questions with answers
Top 10 sharepoint interview questions with answersTop 10 sharepoint interview questions with answers
Top 10 sharepoint interview questions with answerswillhoward459
 
10 Reasons your SharePoint Migration Failed
10 Reasons your SharePoint Migration Failed10 Reasons your SharePoint Migration Failed
10 Reasons your SharePoint Migration FailedBenjamin Niaulin
 
10 Reasons to Avoid Folders in SharePoint 2013/2010
10 Reasons to Avoid Folders in SharePoint 2013/201010 Reasons to Avoid Folders in SharePoint 2013/2010
10 Reasons to Avoid Folders in SharePoint 2013/2010Bobby Chang
 
10 Best SharePoint Features You’ve Never Used (But Should)
10 Best SharePoint Features You’ve Never Used (But Should)10 Best SharePoint Features You’ve Never Used (But Should)
10 Best SharePoint Features You’ve Never Used (But Should)Christian Buckley
 
Databases, CAP, ACID, BASE, NoSQL... oh my!
Databases, CAP, ACID, BASE, NoSQL... oh my!Databases, CAP, ACID, BASE, NoSQL... oh my!
Databases, CAP, ACID, BASE, NoSQL... oh my!DATAVERSITY
 
SharePoint Permissions Worst Practices
SharePoint Permissions Worst PracticesSharePoint Permissions Worst Practices
SharePoint Permissions Worst PracticesBobby Chang
 
10 Best Productivity Features in SharePoint 2013
10 Best Productivity Features in SharePoint 201310 Best Productivity Features in SharePoint 2013
10 Best Productivity Features in SharePoint 2013Christian Buckley
 

Destaque (20)

Warum NoSQL Datenbanken auf dem Vormarsch sind
Warum NoSQL Datenbanken auf dem Vormarsch sindWarum NoSQL Datenbanken auf dem Vormarsch sind
Warum NoSQL Datenbanken auf dem Vormarsch sind
 
Warum NoSQL? Wann macht der Einsatz von NoSQL Datenbanken Sinn?
Warum NoSQL? Wann macht der Einsatz von NoSQL Datenbanken Sinn?Warum NoSQL? Wann macht der Einsatz von NoSQL Datenbanken Sinn?
Warum NoSQL? Wann macht der Einsatz von NoSQL Datenbanken Sinn?
 
Marklogic and the Linked Data Connection
Marklogic and the Linked Data ConnectionMarklogic and the Linked Data Connection
Marklogic and the Linked Data Connection
 
Cassandra Consistency: Tradeoffs and Limitations
Cassandra Consistency: Tradeoffs and LimitationsCassandra Consistency: Tradeoffs and Limitations
Cassandra Consistency: Tradeoffs and Limitations
 
SharePoint 2013 Javascript Object Model
SharePoint 2013 Javascript Object ModelSharePoint 2013 Javascript Object Model
SharePoint 2013 Javascript Object Model
 
Share point hosted add ins munich
Share point hosted add ins munichShare point hosted add ins munich
Share point hosted add ins munich
 
Essential Knowledge for SharePoint Add-Ins
Essential Knowledge for SharePoint Add-InsEssential Knowledge for SharePoint Add-Ins
Essential Knowledge for SharePoint Add-Ins
 
Xquery
XqueryXquery
Xquery
 
Build and Deploy Provider-hosted SharePoint Add-ins
Build and Deploy Provider-hosted SharePoint Add-insBuild and Deploy Provider-hosted SharePoint Add-ins
Build and Deploy Provider-hosted SharePoint Add-ins
 
Chris O'Brien - Comparing SharePoint add-ins (apps) with Office 365 apps
Chris O'Brien - Comparing SharePoint add-ins (apps) with Office 365 appsChris O'Brien - Comparing SharePoint add-ins (apps) with Office 365 apps
Chris O'Brien - Comparing SharePoint add-ins (apps) with Office 365 apps
 
Rev Your Engines - SharePoint Performance Best Practices
Rev Your Engines - SharePoint Performance Best PracticesRev Your Engines - SharePoint Performance Best Practices
Rev Your Engines - SharePoint Performance Best Practices
 
Real World SharePoint Add-In Development
Real World SharePoint Add-In DevelopmentReal World SharePoint Add-In Development
Real World SharePoint Add-In Development
 
Develop a SharePoint App in 45 Minutes
Develop a SharePoint App in 45 MinutesDevelop a SharePoint App in 45 Minutes
Develop a SharePoint App in 45 Minutes
 
Top 10 sharepoint interview questions with answers
Top 10 sharepoint interview questions with answersTop 10 sharepoint interview questions with answers
Top 10 sharepoint interview questions with answers
 
10 Reasons your SharePoint Migration Failed
10 Reasons your SharePoint Migration Failed10 Reasons your SharePoint Migration Failed
10 Reasons your SharePoint Migration Failed
 
10 Reasons to Avoid Folders in SharePoint 2013/2010
10 Reasons to Avoid Folders in SharePoint 2013/201010 Reasons to Avoid Folders in SharePoint 2013/2010
10 Reasons to Avoid Folders in SharePoint 2013/2010
 
10 Best SharePoint Features You’ve Never Used (But Should)
10 Best SharePoint Features You’ve Never Used (But Should)10 Best SharePoint Features You’ve Never Used (But Should)
10 Best SharePoint Features You’ve Never Used (But Should)
 
Databases, CAP, ACID, BASE, NoSQL... oh my!
Databases, CAP, ACID, BASE, NoSQL... oh my!Databases, CAP, ACID, BASE, NoSQL... oh my!
Databases, CAP, ACID, BASE, NoSQL... oh my!
 
SharePoint Permissions Worst Practices
SharePoint Permissions Worst PracticesSharePoint Permissions Worst Practices
SharePoint Permissions Worst Practices
 
10 Best Productivity Features in SharePoint 2013
10 Best Productivity Features in SharePoint 201310 Best Productivity Features in SharePoint 2013
10 Best Productivity Features in SharePoint 2013
 

Semelhante a Introduction to NoSQL Databases with MarkLogic

Data Lake, Virtual Database, or Data Hub - How to Choose?
Data Lake, Virtual Database, or Data Hub - How to Choose?Data Lake, Virtual Database, or Data Hub - How to Choose?
Data Lake, Virtual Database, or Data Hub - How to Choose?DATAVERSITY
 
The Value of Metadata
The Value of MetadataThe Value of Metadata
The Value of MetadataDATAVERSITY
 
Operational Analytics Using Spark and NoSQL Data Stores
Operational Analytics Using Spark and NoSQL Data StoresOperational Analytics Using Spark and NoSQL Data Stores
Operational Analytics Using Spark and NoSQL Data StoresDATAVERSITY
 
Cwin16 - Lyon - partner mark logic - the rise of nosql
Cwin16 - Lyon - partner mark logic - the rise of nosqlCwin16 - Lyon - partner mark logic - the rise of nosql
Cwin16 - Lyon - partner mark logic - the rise of nosqlCapgemini
 
Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...
Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...
Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...semanticsconference
 
The Impact of Smart Content
The Impact of Smart ContentThe Impact of Smart Content
The Impact of Smart ContentMatt Turner
 
Enabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseEnabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseHortonworks
 
A New Way of Thinking About MDM
A New Way of Thinking About MDMA New Way of Thinking About MDM
A New Way of Thinking About MDMDATAVERSITY
 
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache Spark
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache SparkData-Driven Transformation: Leveraging Big Data at Showtime with Apache Spark
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache SparkDatabricks
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopInside Analysis
 
Insight Platforms Accelerate Digital Transformation
Insight Platforms Accelerate Digital TransformationInsight Platforms Accelerate Digital Transformation
Insight Platforms Accelerate Digital TransformationMapR Technologies
 
MongoDB & Hadoop - Understanding Your Big Data
MongoDB & Hadoop - Understanding Your Big DataMongoDB & Hadoop - Understanding Your Big Data
MongoDB & Hadoop - Understanding Your Big DataMongoDB
 
Data-Centric Infrastructure for Agile Development
Data-Centric Infrastructure for Agile DevelopmentData-Centric Infrastructure for Agile Development
Data-Centric Infrastructure for Agile DevelopmentDATAVERSITY
 
TIBCO Advanced Analytics Meetup (TAAM) - June 2015
TIBCO Advanced Analytics Meetup (TAAM) - June 2015TIBCO Advanced Analytics Meetup (TAAM) - June 2015
TIBCO Advanced Analytics Meetup (TAAM) - June 2015Bipin Singh
 
Foundation for Success: How Big Data Fits in an Information Architecture
Foundation for Success: How Big Data Fits in an Information ArchitectureFoundation for Success: How Big Data Fits in an Information Architecture
Foundation for Success: How Big Data Fits in an Information ArchitectureInside Analysis
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Denodo
 
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)Kent Graziano
 
Mastering Your Customer Data on Apache Spark by Elliott Cordo
Mastering Your Customer Data on Apache Spark by Elliott CordoMastering Your Customer Data on Apache Spark by Elliott Cordo
Mastering Your Customer Data on Apache Spark by Elliott CordoSpark Summit
 
Key Methodologies for Migrating from Oracle to Postgres
Key Methodologies for Migrating from Oracle to PostgresKey Methodologies for Migrating from Oracle to Postgres
Key Methodologies for Migrating from Oracle to PostgresEDB
 

Semelhante a Introduction to NoSQL Databases with MarkLogic (20)

Data Lake, Virtual Database, or Data Hub - How to Choose?
Data Lake, Virtual Database, or Data Hub - How to Choose?Data Lake, Virtual Database, or Data Hub - How to Choose?
Data Lake, Virtual Database, or Data Hub - How to Choose?
 
The Value of Metadata
The Value of MetadataThe Value of Metadata
The Value of Metadata
 
Operational Analytics Using Spark and NoSQL Data Stores
Operational Analytics Using Spark and NoSQL Data StoresOperational Analytics Using Spark and NoSQL Data Stores
Operational Analytics Using Spark and NoSQL Data Stores
 
Cwin16 - Lyon - partner mark logic - the rise of nosql
Cwin16 - Lyon - partner mark logic - the rise of nosqlCwin16 - Lyon - partner mark logic - the rise of nosql
Cwin16 - Lyon - partner mark logic - the rise of nosql
 
Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...
Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...
Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...
 
The Impact of Smart Content
The Impact of Smart ContentThe Impact of Smart Content
The Impact of Smart Content
 
Enabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseEnabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical Enterprise
 
A6 big data_in_the_cloud
A6 big data_in_the_cloudA6 big data_in_the_cloud
A6 big data_in_the_cloud
 
A New Way of Thinking About MDM
A New Way of Thinking About MDMA New Way of Thinking About MDM
A New Way of Thinking About MDM
 
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache Spark
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache SparkData-Driven Transformation: Leveraging Big Data at Showtime with Apache Spark
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache Spark
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of Hadoop
 
Insight Platforms Accelerate Digital Transformation
Insight Platforms Accelerate Digital TransformationInsight Platforms Accelerate Digital Transformation
Insight Platforms Accelerate Digital Transformation
 
MongoDB & Hadoop - Understanding Your Big Data
MongoDB & Hadoop - Understanding Your Big DataMongoDB & Hadoop - Understanding Your Big Data
MongoDB & Hadoop - Understanding Your Big Data
 
Data-Centric Infrastructure for Agile Development
Data-Centric Infrastructure for Agile DevelopmentData-Centric Infrastructure for Agile Development
Data-Centric Infrastructure for Agile Development
 
TIBCO Advanced Analytics Meetup (TAAM) - June 2015
TIBCO Advanced Analytics Meetup (TAAM) - June 2015TIBCO Advanced Analytics Meetup (TAAM) - June 2015
TIBCO Advanced Analytics Meetup (TAAM) - June 2015
 
Foundation for Success: How Big Data Fits in an Information Architecture
Foundation for Success: How Big Data Fits in an Information ArchitectureFoundation for Success: How Big Data Fits in an Information Architecture
Foundation for Success: How Big Data Fits in an Information Architecture
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)
 
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
 
Mastering Your Customer Data on Apache Spark by Elliott Cordo
Mastering Your Customer Data on Apache Spark by Elliott CordoMastering Your Customer Data on Apache Spark by Elliott Cordo
Mastering Your Customer Data on Apache Spark by Elliott Cordo
 
Key Methodologies for Migrating from Oracle to Postgres
Key Methodologies for Migrating from Oracle to PostgresKey Methodologies for Migrating from Oracle to Postgres
Key Methodologies for Migrating from Oracle to Postgres
 

Mais de Matt Turner

Data In Action: Business Value of Data
Data In Action: Business Value of DataData In Action: Business Value of Data
Data In Action: Business Value of DataMatt Turner
 
Data2030 Summit MEA: Data Chaos to Data Culture March 2023
Data2030 Summit MEA: Data Chaos to Data Culture March 2023Data2030 Summit MEA: Data Chaos to Data Culture March 2023
Data2030 Summit MEA: Data Chaos to Data Culture March 2023Matt Turner
 
Data2030 Summit Data Megatrends Turner Sept 2022.pptx
Data2030 Summit Data Megatrends Turner Sept 2022.pptxData2030 Summit Data Megatrends Turner Sept 2022.pptx
Data2030 Summit Data Megatrends Turner Sept 2022.pptxMatt Turner
 
From Data Chaos to Data Culture
From Data Chaos to Data CultureFrom Data Chaos to Data Culture
From Data Chaos to Data CultureMatt Turner
 
How Data is Driving AI Innovation
How Data is Driving AI InnovationHow Data is Driving AI Innovation
How Data is Driving AI InnovationMatt Turner
 
Principles of Information Access
Principles of Information AccessPrinciples of Information Access
Principles of Information AccessMatt Turner
 
Securing the Right Metadata and Making it Work for You
Securing the Right Metadata and Making it Work for YouSecuring the Right Metadata and Making it Work for You
Securing the Right Metadata and Making it Work for YouMatt Turner
 
Operationalize Your Data and Lead Your Business Transformation
Operationalize Your Data and Lead Your Business TransformationOperationalize Your Data and Lead Your Business Transformation
Operationalize Your Data and Lead Your Business TransformationMatt Turner
 
Three Cool Things You Can Do with Standards
Three Cool Things You Can Do with StandardsThree Cool Things You Can Do with Standards
Three Cool Things You Can Do with StandardsMatt Turner
 
Mark logic Industrialize Your Data IOT Berlin Sept 2019
Mark logic Industrialize Your Data IOT Berlin Sept 2019Mark logic Industrialize Your Data IOT Berlin Sept 2019
Mark logic Industrialize Your Data IOT Berlin Sept 2019Matt Turner
 
BBC olympics 2012 experience oct18
BBC olympics 2012 experience oct18BBC olympics 2012 experience oct18
BBC olympics 2012 experience oct18Matt Turner
 
Operationalize Your Linked Data
Operationalize Your Linked DataOperationalize Your Linked Data
Operationalize Your Linked DataMatt Turner
 
Smart Content Summit: Unlock the Value with the Right Data Pattern
Smart Content Summit: Unlock the Value with the Right Data PatternSmart Content Summit: Unlock the Value with the Right Data Pattern
Smart Content Summit: Unlock the Value with the Right Data PatternMatt Turner
 
Data Security and the Hard Outer Shell
Data Security and the Hard Outer ShellData Security and the Hard Outer Shell
Data Security and the Hard Outer ShellMatt Turner
 
Media publishing meetup ocean of data july 2016
Media publishing meetup ocean of data july 2016Media publishing meetup ocean of data july 2016
Media publishing meetup ocean of data july 2016Matt Turner
 
Metadata Madness: Semantics Takes Center Stage
Metadata Madness: Semantics Takes Center StageMetadata Madness: Semantics Takes Center Stage
Metadata Madness: Semantics Takes Center StageMatt Turner
 
New Trends in Data Management in the Information Industries
New Trends in Data Management in the Information Industries New Trends in Data Management in the Information Industries
New Trends in Data Management in the Information Industries Matt Turner
 
Smart Content Summit - Unlocking Content With Semantics and Metadata
Smart Content Summit - Unlocking Content With Semantics and MetadataSmart Content Summit - Unlocking Content With Semantics and Metadata
Smart Content Summit - Unlocking Content With Semantics and MetadataMatt Turner
 
Kloptek Publishers Forum Keynote May 2014
Kloptek Publishers Forum Keynote May 2014Kloptek Publishers Forum Keynote May 2014
Kloptek Publishers Forum Keynote May 2014Matt Turner
 
Hollywood IT Summit Metadata Panel
Hollywood IT Summit Metadata PanelHollywood IT Summit Metadata Panel
Hollywood IT Summit Metadata PanelMatt Turner
 

Mais de Matt Turner (20)

Data In Action: Business Value of Data
Data In Action: Business Value of DataData In Action: Business Value of Data
Data In Action: Business Value of Data
 
Data2030 Summit MEA: Data Chaos to Data Culture March 2023
Data2030 Summit MEA: Data Chaos to Data Culture March 2023Data2030 Summit MEA: Data Chaos to Data Culture March 2023
Data2030 Summit MEA: Data Chaos to Data Culture March 2023
 
Data2030 Summit Data Megatrends Turner Sept 2022.pptx
Data2030 Summit Data Megatrends Turner Sept 2022.pptxData2030 Summit Data Megatrends Turner Sept 2022.pptx
Data2030 Summit Data Megatrends Turner Sept 2022.pptx
 
From Data Chaos to Data Culture
From Data Chaos to Data CultureFrom Data Chaos to Data Culture
From Data Chaos to Data Culture
 
How Data is Driving AI Innovation
How Data is Driving AI InnovationHow Data is Driving AI Innovation
How Data is Driving AI Innovation
 
Principles of Information Access
Principles of Information AccessPrinciples of Information Access
Principles of Information Access
 
Securing the Right Metadata and Making it Work for You
Securing the Right Metadata and Making it Work for YouSecuring the Right Metadata and Making it Work for You
Securing the Right Metadata and Making it Work for You
 
Operationalize Your Data and Lead Your Business Transformation
Operationalize Your Data and Lead Your Business TransformationOperationalize Your Data and Lead Your Business Transformation
Operationalize Your Data and Lead Your Business Transformation
 
Three Cool Things You Can Do with Standards
Three Cool Things You Can Do with StandardsThree Cool Things You Can Do with Standards
Three Cool Things You Can Do with Standards
 
Mark logic Industrialize Your Data IOT Berlin Sept 2019
Mark logic Industrialize Your Data IOT Berlin Sept 2019Mark logic Industrialize Your Data IOT Berlin Sept 2019
Mark logic Industrialize Your Data IOT Berlin Sept 2019
 
BBC olympics 2012 experience oct18
BBC olympics 2012 experience oct18BBC olympics 2012 experience oct18
BBC olympics 2012 experience oct18
 
Operationalize Your Linked Data
Operationalize Your Linked DataOperationalize Your Linked Data
Operationalize Your Linked Data
 
Smart Content Summit: Unlock the Value with the Right Data Pattern
Smart Content Summit: Unlock the Value with the Right Data PatternSmart Content Summit: Unlock the Value with the Right Data Pattern
Smart Content Summit: Unlock the Value with the Right Data Pattern
 
Data Security and the Hard Outer Shell
Data Security and the Hard Outer ShellData Security and the Hard Outer Shell
Data Security and the Hard Outer Shell
 
Media publishing meetup ocean of data july 2016
Media publishing meetup ocean of data july 2016Media publishing meetup ocean of data july 2016
Media publishing meetup ocean of data july 2016
 
Metadata Madness: Semantics Takes Center Stage
Metadata Madness: Semantics Takes Center StageMetadata Madness: Semantics Takes Center Stage
Metadata Madness: Semantics Takes Center Stage
 
New Trends in Data Management in the Information Industries
New Trends in Data Management in the Information Industries New Trends in Data Management in the Information Industries
New Trends in Data Management in the Information Industries
 
Smart Content Summit - Unlocking Content With Semantics and Metadata
Smart Content Summit - Unlocking Content With Semantics and MetadataSmart Content Summit - Unlocking Content With Semantics and Metadata
Smart Content Summit - Unlocking Content With Semantics and Metadata
 
Kloptek Publishers Forum Keynote May 2014
Kloptek Publishers Forum Keynote May 2014Kloptek Publishers Forum Keynote May 2014
Kloptek Publishers Forum Keynote May 2014
 
Hollywood IT Summit Metadata Panel
Hollywood IT Summit Metadata PanelHollywood IT Summit Metadata Panel
Hollywood IT Summit Metadata Panel
 

Último

Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtimeandrehoraa
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfMarharyta Nedzelska
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based projectAnoyGreter
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Natan Silnitsky
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprisepreethippts
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
 
How to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfHow to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfLivetecs LLC
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesŁukasz Chruściel
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commercemanigoyal112
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
 
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in NoidaBuds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in Noidabntitsolutionsrishis
 

Último (20)

Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtime
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdf
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based project
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprise
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
 
How to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfHow to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdf
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New Features
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commerce
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
 
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in NoidaBuds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
 

Introduction to NoSQL Databases with MarkLogic

  • 1. © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Matt Turner, CTO Media & Entertainment Introduction to MarkLogic NoSQL
  • 2. SLIDE: 2 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Outline • Something’s Happening Here • The Old and the New  Data models  Data access • Discussion
  • 4. SLIDE: 4 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Stress on Traditional Data Approaches Complexity  Structured  Unstructured  Semi-structured  Raw  Streams of data  Constant change  Agile analytics  Fail-fast Volume Velocity Variety Volume • Many months of system log files • Every tweet • Years of articles • Relative to current size of operation Velocity • Streams of customer feedback to determine sentiment • Real-time risk analysis • Real-time Business Intelligence Variety • Database feeds • Raw logs • Web crawl data • Articles • Multi-media • ALSO: questions! Examples Big Data: Gartner coined the “three V’s” description  Data: Petabyte scale  Nodes: Thousands
  • 5. SLIDE: 5 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
  • 6. SLIDE: 6 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Stress on Traditional Data Approaches Complexity  Structured  Unstructured  Semi-structured  Raw  Streams of data  Constant change  Agile analytics  Fail-fast Volume Velocity Variety Volume • Many months of system log files • Every tweet • Years of articles • Relative to current size of operation Velocity • Streams of customer feedback to determine sentiment • Real-time risk analysis • Real-time Business Intelligence Variety • Database feeds • Raw logs • Web crawl data • Articles • Multi-media • ALSO: questions! Examples Big Data: Gartner coined the “three V’s” description  Data: Petabyte scale  Nodes: Thousands
  • 7. SLIDE: 7 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Leader Quadrant Online Transaction Processing RDBS (May 2002)
  • 8. SLIDE: 8 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Leader Quadrant Operational DBMS (Oct 2014) Traditional Mainstays Upstarts Storm the Field
  • 9. SLIDE: 9 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. MarkLogic: Best Operational Data Warehouse (Aug 2014)
  • 10. SLIDE: 10 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. A Unified, Actionable 360 View of Data WHAT BUSINESSES WANT
  • 12.
  • 13. SLIDE: 13 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Data Is In Silos  Data is spread across disconnected databases  M&A outpaces the speed of data integration  Data needs to be delivered in real time THE REALITY
  • 14. SLIDE: 14 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. 80% OF TIME By data scientists just wrangling data WASTED In 2015 on creating relational data silos Of data warehouse projects is on ETL The Massive Cost of Integrating Data From Silos 36BILLION IN SPENDING $% OF THE COST60
  • 15. SLIDE: 15 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Relational Databases with ETL Sacrifice Agility, Timeliness, and Cost  All future data needs must be predictable  New SQL queries require database re-indexing  Siloed database changes require ETL re-writes THE IT CHALLENGE ETL OLTP ARCHIVES ETL ETL ETL DATA MARTS ETL WAREHOUSE REFERENCE DATA
  • 16. SLIDE: 16 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. OLTP Warehouse Data MartsArchives “Unstructured” “ ” Video Audio Signals, Logs, Streams Social Documents, Messages { } Metadata Search🔍 Reference Data It’s Complicated
  • 17. The OLD: Let’s Design the Application (And pretend it’s the 80s)
  • 18. SLIDE: 18 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Name Hair Colour Fulltime Employee? Car type Paul Blond Y Alex Auburn Y Porsche Dom Black Y Hummer name hr_colr flltme_empl car_tp Let’s Begin… Cast Members { How many characters wide should this be? 8? 16? 32? { { {
  • 19. SLIDE: 19 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. New Schema – Extend Ours! name hr_colr flltme_empl car_tp Paul Blond Y Alex Auburn Y porsche Dom Black Y Hummer house_road town city postcode 11d Yonge Pk Finsbury London N4 3NU Reading London N43 • Hang on • If this table had 10k rows, issues? • First create new big schema • Then import rows across • Delete old table? • Maybe not, legacy programs might use it! • What if we want to select “Road” only? • Split out again • More extensions? • House name and number?
  • 20. SLIDE: 20 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. There is another way! Create a new table and point to it from the old one! name hr_colr flltme_empl car_tp Address Paul Blond Y Alex Auburn Y porsche Dom Black Y Hummer house_road town city postcode 11d Yonge Pk Finsbury London N4 3NU Reading London N43
  • 21. SLIDE: 21 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. …
  • 22. SLIDE: 22 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Now Let’s Store Something More . . . Complicated Transcript / Book Info Title = “NL April 14” Author = “SNL Cast” Section • Chapter Page Paragraph = “I love penguins because…” Page Paragraph = “On the subject of food…” • Chapter Page Section • Chapter • Chapter • Chapter • Paragraph • Paragraph title author Section I love Penguins S. Lion Issues with Sections? How many columns?
  • 23. SLIDE: 23 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Don’t Forget Taxonomies Hierarchical levels of metadata Fixed to a specific business purpose  Can’t be re-used in new contexts Each record can only be associated with one level  How many category fields? Category Feature Series Action Drama Comedy Documentary … Cable Broadcast Drama Comedy … Action Drama Family Documentary …
  • 24. SLIDE: 24 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Result Requires everything to be defined up front Data to be transformed and processed to fit the system Needs to be redone as information changes Costly to create, maintain and only captures part of the data! Title ProductionDate Category AssetType Length Film1 3/1/14 Feature HD Master 2:40 Show1 6/4/13 Series HD720 0:40 Film2 6/4/05 Feature Archive 1:55 Category Feature Series Action Drama Comedy Documentary … Cable Broadcast Drama Comedy … Action Drama Family Documentary … ?
  • 26. SLIDE: 26 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. OLTP Warehouse Data MartsArchives “Unstructured” “ ” Video Audio Signals, Logs, Streams Social Documents, Messages { } Metadata Search🔍 Reference Data *NOTE: We only did this little bit! Remember?
  • 27. SLIDE: 27 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. The NEW! Enter NoSQL Category Description Examples Key-value • Persistent hash-table “on steroids” • Typically no single modeling paradigm (e.g. columns can be primitives, data structures, binaries, etc.) • Amazon DynamoDB • Redis • Riak Columnar • Similar to K-V in some ways • Column may be arranged in groups (families) • Data types are usually the expected “primitives” • Works well with “value crunching” (e.g. time series) • HBase • Cassandra Document • URI-mapped (i.e. keyed) documents in lieu of rows • Supports structured and unstructured content • Nested context • MarkLogic • MongoDB • Couchbase Graph • Deals with inter-object graphs • Relationship oriented • Think object cache (with pointers) “on steroids” • Neo4J • AllegroGraph • InfoGrid
  • 28. SLIDE: 28 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. A Database That Integrates Data Better, Faster, with Less Cost THE DESIRED SOLUTION
  • 29. SLIDE: 29 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. The MarkLogic Alternative An Operational and Transactional Enterprise NoSQL Database  Data ingested as is (no ETL)  Structured and unstructured data  Data and metadata together  Adapts to changing data and changing data structures EASY TO GET DATA IN Flexible Data Model  Index once and query endlessly  Real-time and lightning fast  Query across JSON, XML, text, geospatial, and semantic triples in one database EASY TO GET DATA OUT Ask Anything Universal Index  Reliable data and transactions (100% ACID compliant)  Out-of-the-box automatic failover, replication, and backup/recovery  Enterprise-grade security and Common Criteria certified 100% TRUSTED Enterprise Ready
  • 30. SLIDE: 30 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. The SNL App
  • 31. SLIDE: 31 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. No need to define up front Matched to complex content and metadata data modeling Data is managed in its most accessible, natural form XML, JSON, RDF, geospatial Flexible Data Model Schema-agnostic, structure-aware
  • 32. SLIDE: 32 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Instead of THIS
  • 33. SLIDE: 33 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Do it like THIS!
  • 34. SLIDE: 34 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Search and Query Search to find answers in documents, relationships, and metadata  Automatic indexing of every data value, text and data structure  Specialized indexes for data values (analytics, facets, sorting), geospatial and triples  All updated in the context of ACID transactions to ensure data integrity and real-time access  Accessible via fully programmable search API with full- text search, type-ahead suggestions, facets, snippeting, highlighted search terms, proximity boosting, relevance ranking, and language support JavaScript XQuery SPARQL Rich Query Capability In-database MapReduce Full-text Search Semantic Search Geospatial Search
  • 38. Do domestic dogs interpret pointing as a command? Animal Cognition (2012): 1-12 , November 09, 2012 By Scheider, Linda; Kaminski, Juliane; Call, Josep; Tomasello, Michael Context!
  • 39. © COPYRIGHT 2015 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.SLIDE: 39 Machines Don’t Get Context . . . Manu Sporny Founder/CEO - Digital Bazaar, Inc. http://www.cambridgesemantics.com/semantic-university/what-is-linked-data
  • 40. SLIDE: 40 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Enter Semantics! Manu Sporny Founder/CEO - Digital Bazaar, Inc. http://www.cambridgesemantics.com/semantic-university/what-is-linked-data
  • 41. SLIDE: 41 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Semantics Enterprise triple store, document store, and database combined  Store and query billions of facts and relationships  Leverage ontologies for domain and role specific context access to data and documents  Efficient metadata management with relationships to ontologies  Standards-based for ease of use and integration – RDF, SPARQL, and standard REST interfaces
  • 42. SLIDE: 42 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Semantics to Model Relationships Data model to manage relationships and link together data ‘triples’ describe single facts Collections of facts describe complex real-world scenarios “Chevy” ”NBC" isOn ”SNL" isOn isOn !
  • 43. SLIDE: 43 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Ontologies Instead of Categories Actually model information as it is in the real world Not limited to a single purpose  Ontologies for all categories of metadata  Even ‘impossible’ categories like fictional worlds
  • 44. SLIDE: 44 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. NoSQL and Semantics!
  • 45. SLIDE: 45 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Real-Time Analytics Range indexes can be used for  Faceted search  Aggregation and visualization  Analytics… …including custom user-defined functions  Co-occurrence  SQL, ODBC, and BI integration
  • 46. SLIDE: 46 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Scalability, Elasticity and Cloud Massive enterprise scalability and elasticity  Scale horizontally in clusters on commodity hardware to hundreds of nodes, petabytes of data, and billions of documents  Process thousands of multi-document multi- statement transactions per second  Start small and scale up or down to meet capacity and performance demands without over- provisioning or over-spending  Fully cloud enabled for automated deployment and management on EC2  Leverage dynamic configurations with Tiered Storage D-NODE D-NODE E-NODE E-NODE D-NODE Result: Enterprise-ready to power mission critical products
  • 47. SLIDE: 47 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Use Case: Deliver Better Information Present information based on relationships Go beyond traditional technology with depth of content Drive efficiency using semantic approach to tagging
  • 48. SLIDE: 48 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Use Case: Go Beyond Search • Concept instead of keyword search • Related content and information drive the content discovery and new interactions  SNL40 continuous viewing • Dynamically tailored to the users specific attributes or activity
  • 49. SLIDE: 49 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Use Case: Integrate Data • Integrate data across the automoti
  • 51. SLIDE: 51 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Semantics-driven search Talent Kristen Wiig Acted in Episode 4 Anne Hathaway and Killers Part of Played Character Maharelle Sister Season 34 Segment The Lawrence Welk Show Aired on Date 10/4/08 Era Acted in Includes Part of
  • 52. SLIDE: 52 © COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. Intelligent recommendation engine

Notas do Editor

  1. 3
  2. 11
  3. Sources: 80% of time spent by data scientists on just wrangling data “Data scientists, according to interviews and expert estimates, spend from 50 percent to 80 percent of their time mired in this more mundane labor of collecting and preparing unruly digital data, before it can be explored for useful nuggets.” Steve Lohr. “For Big-Data Scientists, ‘Janitor Work’ Is Key Hurdle to Insights.” The New York Times. August 17, 2014. <http://www.nytimes.com/2014/08/18/technology/for-big-data-scientists-hurdle-to-insights-is-janitor-work.html> 60% of the cost of data warehouse projects is on ETL “In a report sponsored by Informatica, analysts at TDWI estimate between 60% and 80% of the total cost of a data warehouse project may be taken up by ETL software and processes.” $36 Billion in spending on database management systems in 2015 Gartner. Forecast: Enterprise Software Markets, Worldwide, 2011-2018, 4Q14. 2014. <https://www.gartner.com/doc/2944023/forecast-enterprise-software-markets-worldwide>