SlideShare uma empresa Scribd logo
1 de 28
Baixar para ler offline
CIGNEX Datamatics Confidential www.cignex.com
Webinar:
Faster Big Data Analytics with MongoDB
Case Study: Building Large Scale Data Processing and Data Analysis Platform using
MongoDB
Date: 06th April 2016
Speakers:
Buzz Moschetti
Enterprise Architecture and Special Programs
MongoDB
Anurag Seth
VP, Big Data Analytics & IoT Practice
CIGNEX Datamatics
CIGNEX Datamatics Confidential www.cignex.com
Buzz Moschetti,
Enterprise Architecture and Special Programs
MongoDB
Buzz works with F1000 companies to help them design
next-generation solutions and develop strategies for
overall technology transformation. He is also the CTO of
the partner program at MongoDB and a liason to
Engineering, Product Management, and Marketing
groups.
– 25+ years experience in the field, mostly in financial
services as CAO of the Investment Bank at
JPMorganChase and Bear Stearns before that
Anurag Seth,
VP, Big Data Analytics & Internet of Things (IoT)
Practice, CIGNEX Datamatics
Anurag has unique blend of technology expertise from
deep tech VLSI chip design to complex high performance
algorithmic software development in EDA (Electronic
Design Automation) to embedded system design to
predictive modelling & Big Data Analytics deployment
for compelling use-cases (including IOT).
– 25 years of strong experience in technology
development & delivery – product as well as services
across VLSI/EDA, Healthcare , Enterprise Big Data
Implementations & IOT
– Has served on board of the VLSI Lab at IIT
Kharagpur, been the general chair of the
International conference on VLSI Design &
Embedded Systems (2009) and still continues to
serve on the steering committee of the conference
2
Who are we ?
CIGNEX Datamatics Confidential www.cignex.com
• Big Data Analytics: Opportunity & Challenges
• Case Study: Building Large Scale Data Processing and Analysis Platform using MongoDB
– Business Needs
– Our Approach
– Solution Architecture
– MongoDB - A Great Fit for Data Processing and Analytics
– MongoDB Performance Tuning - Our Holistic Approach
– Recommended Best Practices
• Why MongoDB ?
• Why CIGNEX Datamatics ?
Topics
3
CIGNEX Datamatics Confidential www.cignex.com
Over 88% of data sources and
types are not being analyzed..
4
Big Data Analytics: Business Opportunities
Transactional &
Application Data
Machine Data
Enterprise
Content
Social Data
Reduce Operational
Costs
Improved Risk
Management
Many
more..
Volume
Structured
Velocity
Semi-structured
Variety
Un-structured
Variety
Un-structured
Sensor Data
Velocity
Semi-structured
CIGNEX Datamatics Confidential www.cignex.com
The organizations that uses Big Data
Analytics to integrate, process and
analyze these data sources are up to 25x
more likely to outperform their
competitors.
5
Big Data Analytics: Business Opportunities
Improve Process Efficiency
(Sales, Marketing, Finance, Operations)
Product/Service
Innovation
Monetize
Information
Improved
Collaboration
Improve customer
experience
Reduce Operational
Costs
Improved Risk
Management
CIGNEX Datamatics Confidential www.cignex.com
• Getting the right data & Infra
architecture for performance &
scalability
• Leverage investments in existing
technologies
• Integrating multi-channel & variety of
data sources at the modern volume
• Data quality & accuracy challenges
• Big data technologies are evolving too
quickly to adapt
• Scarcity of skills and capabilities
6
Big Data Analytics - Implementation to Production Challenges
• Hard ROI from Big Data?
– Identify & monetize existing & new
Data Streams
• Turn-around time for big data
(predictive modelling) deployments
• Difficult to make big data fit-for-
purpose (uncertainty), assess the level
of trust, and ensure security & privacy
• Lack of domain centricity
Technical Business
CIGNEX Datamatics Confidential www.cignex.com
Case Study:
Building Large Scale Data Processing and Analysis Platform using MongoDB
7
CIGNEX Datamatics Confidential www.cignex.com
• SaaS based sales analytics platform that acquires, processes and enriches accessible
public data to deliver data-driven customer and business insights that:
– Enhances efficacy of customer acquisition
– Improve operational efficiency
– Competitive & complementary selling opportunities
– Determine buying propensity, influencers & decision makers
8
Business Need
PUBLIC DATA ACQUISITION SOCIAL LISTENING CUSTOMER/BUSINESS INSIGHTS
CIGNEX Datamatics Confidential www.cignex.com9
Our Approach
Segment data by influential characteristics as the
best variables to use, use case centric
2. DATA PREPARATION
Evaluate and combine multiple models or
techniques that lead to higher efficiency
3. MODELING
Dashboard for Big Data Analytics
4. ANALYTICS
Define data sources that could
influence the outcome.
1. DATA ACQUISITION
Extensive multi-step rule-based ETL process which involves de-
duplication, geo-coding, smart-filtering over huge dataset etc.
Machine Learning ?
Augment with ML algorithms in
the longer run.
Semantic associations ?
Leverage the power of semantic associations
(NLP for Entity Extraction, Entity Associations)
to process millions of entities & implement
complex business rules for data enrichment and
refinement
Social listening that integrate 20+ Open public data sources using REST APIs.
Store and manage 1billion+ objects expected to be ingested and processed by
leveraging elastic scalability of AWS cloud compute
Front-end application with
intuitive search/mining and
dashboard with graphical
visualization of thousands of
records with faster response
time.
CIGNEX Datamatics Confidential www.cignex.com10
Solution Architecture (High Level)
Data Processing Data Visualization
Social Data
Market Data
External Data
Location Data
Data Enrichment Data Processing Cluster
Customized
Core Java based
ETLs and Java scripts
Third Party ETL Cluster (one of these)
Front-End Application
Full Text Search Engine
(one of these)
MongoDB Cluster
Customer Data
Amazon Cloud Hosting (Elastic Cloud Computing - EC2)
MongoDB
Secondary
MongoDB
Primary
MongoDB
Secondary
MongoDB Cluster
MongoDB Primary MongoDB Secondary
MongoDB Secondary
Jasper/ Tableau/ C3/D3.js
Visualization
Front End Application
Framework
CIGNEX Datamatics Confidential www.cignex.com
Requirement MongoDB Features
• Support multiple data processing pipelines
– Via ETL Tool
– Via Custom Code
– Via Custom Scripts
• Integration with leading data integration tools – Alteryx,
Talend, Pentaho
• Java Driver to create custom business logic
• Support for server side JavaScript to trigger custom business
Logic
• Sustain write throughput with increasing data
volumes
• Sharding to scale out horizontally and distribute load
• WiredTiger storage engine (>=Version 3) with features such
as document level concurrency facilitating excellent write
performance, optimal memory usage, data compression for
faster data access and efficient storage
• Provide low latency
• Support large number of concurrent user and
sustain response times
• Sharding to route/distribute read requests to separate nodes
• Data & index compression features in in WiredTiger storage
engine facilitate better performance
• Store indexes on separate mounts and improve read
throughput
11
MongoDB - A Great Fit for Data Processing and Analytics
CIGNEX Datamatics Confidential www.cignex.com13
Implementation Challenges
Implementation Challenges Solution
• Unifying different Data Processing
components(ETL, Custom Code) & overall ETL
efficiency
• Created custom / configurable orchestration engine which
allows full / partial execution of data processing steps
• Created a dashboard which shows monitoring of the
execution steps – allows re-start from anywhere in the
multi-step ETL process
• Performance Tuning of Data Processing &
Analysis frameworks
• Holistic approach to performance tuning (Covered in detail
later)
• Serve different data analysis use cases (Full
Text Search, Sub second response times,
Persistent Data storage)
• Utilize complimentary technologies
– MongoDB for persistent storage, horizontal scalability,
analytics
– Elastic Search or Solr for full text search use cases
• Data Quality • We initially underestimated the extent of quality issues
with the data (more so, since most of the data was public).
During the execution, we budgeted and hired a dedicated
experienced BA who assumed responsibility of data quality
& cleaning-up
CIGNEX Datamatics Confidential www.cignex.com
Best Practices
To be successful, you must address your overall design and
technology stack, not just schema design.
14
CIGNEX Datamatics Confidential www.cignex.com
A Holistic Approach to MongoDB Performance Tuning
Infrastructure Layer
Storage Engine
Data Model
Query Language
Application Layer
Cluster Sizing & Configuration
• Right Size
• Optimum Price benefit
Replica set sizing, Sharding
Map to use case, R/W Heaviness
Access pattern based Schema
Indexes, Query Tuning
• MongoDB Drivers
• Architecture & Design
15
CIGNEX Datamatics Confidential www.cignex.com
• Infrastructure Sizing:
– SSDs provide VERY SIGNIFICANT performance boost specially for write-heavy
workloads
– Investment in CPU with more cores often delivers more benefits than
investing in faster CPU
– Ensure that your working-set fits in the RAM (use db.serverStatus() command
to view an estimate of the the current working set size)
– Evaluate thoroughly whether journaling is needed. Remember that, with
journaling turned on MongoDB ends up using double the RAM.
• Cloud Infrastructure Capacity Planning:
– Leverage cloud platform with the right instance type by evaluating access
patterns, workloads & storage requirements.
16
A Holistic Approach to MongoDB Performance Tuning
Future Scalability
Query Tuning
Design Approach,
Schema Design
OS & Storage
Optimisation
Infrastructure Sizing
& Capacity Planning
CIGNEX Datamatics Confidential www.cignex.com
• Storage Optimization:
– Recommend use of WiredTiger as storage engine
• OS Optimization:
– Disable NUMA – non uniform memory access- not good for operational
database (configure a memory interleave policy )
– Don’t use Huge Pages virtual memory pages – mongo performs better with
normal virtual memory pages
– Readahead size should be set to 32 (use the blockdev --setra <value>)
– Increase ulimit (>20,000)
– Turn off atime for the storage volume containing database files
17
A Holistic Approach to MongoDB Performance Tuning
Future Scalability
Query Tuning
Design Approach,
Schema Design
OS & Storage
Optimisation
Infrastructure Sizing
& Capacity Planning
CIGNEX Datamatics Confidential www.cignex.com
• Schema Design:
– Always invest time in schema design, dynamic schema only means
additional flexibility !!
– Don’t store empty fields in documents
– Create the indexes very carefully. More indexes != more performance.
Indexes not fitting not fitting in RAM are often counterproductive for
performance
– No Index creation on the FLY
– Index creation in designated “Maintenance Window“
– Use Bulk API feature whenever possible. We have often witnessed
significant gains in the write throughput
– Use index optimizations available in the WiredTiger storage engine
18
A Holistic Approach to MongoDB Performance Tuning
Future Scalability
Query Tuning
Design Approach,
Schema Design
OS & Storage
Optimisation
Infrastructure Sizing
& Capacity Planning
CIGNEX Datamatics Confidential www.cignex.com
• Scalability:
– Horizontal scaling through sharding
– Use MongoDB aggregation framework
– Always keep the NFRs on top from design to implementation.
• Query Tuning:
– Effective use of indexes to support queries
– Avoid negation in queries & scatter-gather queries
– Reduce query result set size where-ever possible using limit and
projections
– Effective & frequent use of MongoDB query profiler & explain command
– Leverage each utility provided by MongoDB - mongoperf, mongosniff,
mongostat, mongotop
19
A Holistic Approach to MongoDB Performance Tuning
Future Scalability
Query Tuning
Design Approach,
Schema Design
OS & Storage
Optimisation
Infrastructure Sizing
& Capacity Planning
CIGNEX Datamatics Confidential www.cignex.com
• Simplified solution architecture with the right technologies for the use case
• Performance Tuning & scalability initiated from Day 1
– Holistic approach to performance tuning reduced response times from ~ 2- 3 minutes to
~ 3 -5 seconds
• Proprietary & Open Source can coexist
– Leverage existing investments proprietary tools and Open Source technologies that
reduce licensing costs
– Leverage open source java script components for visualization
• Team composition played critical – Need complimentary skills:
– Solution Architecture | Dev-Ops | Business Analysis/Data Science
• Elastic compute storage
– Leverage AWS cloud features of elastic scalability to upsize/downsize compute power
based on data processing workloads.
20
Benefits Delivered
CIGNEX Datamatics Confidential www.cignex.com
MongoDB Vital Stats
500+ employees 2000+ customers
Over $311 million in funding
Offices in NY & Palo Alto and
across EMEA, and APAC
21
CIGNEX Datamatics Confidential www.cignex.com
The best way to run
MongoDB
Automated.
Supported.
Secured.
Features beyond those in the
community edition:
Enterprise-Grade Support
Commercial License
Ops Manager or Cloud Manager Premium
Encrypted & In-Memory Storage Engines
MongoDB Compass
BI Connector (SQL Bridge)
Advanced Security
Platform Certification
On-Demand Training
MongoDB Enterprise Edition
22
CIGNEX Datamatics Confidential www.cignex.com
{
_id: “123”,
title: "MongoDB: The Definitive Guide",
authors: [
{ _id: "kchodorow", name: "Kristina Chodorow“ },
{ _id: "mdirold", name: “Mike Dirolf“ }
],
published_date: ISODate(”2010-09-24”),
pages: 216,
language: "English",
thumbnail: BinData(0,"AREhMQ=="),
publisher: {
name: "O’Reilly Media",
founded: 1980,
locations: ["CA”, ”NY” ]
}
}
The Data Is The Schema
23
CIGNEX Datamatics Confidential www.cignex.com
> db.authors.find()
{
_id: ”X12",
name: { first: "Kristina”, last: “Chodorow” },
personalData: {
favoritePets: [ “bird”, “dog” ],
awards: [ {name: “Hugo”, when: 1983}, {name: “SSFX”, when: 1992} ]
}
}
{
_id: ”Y45",
name: { first: ”Mike”, last: “Dirolf” } ,
personalData: {
dob: ISODate(“1970-04-05”)
}
}
Treat Your Data More Like Objects
24
CIGNEX Datamatics Confidential www.cignex.com
7x-10x Performance, 50%-80% Less Storage
MongoDB 3.0 Set The Stage…
How: WiredTiger Storage Engine
• Same data model, query language, & ops
• 100% backwards compatible API
• Non-disruptive upgrade
• Storage savings driven by native
compression
• Write performance gains driven by
– Document-level concurrency control
– More efficient use of HW threads
• Much better ability to scale vertically
MongoDB 3.0MongoDB 2.6
Performance
25
CIGNEX Datamatics Confidential www.cignex.com
MongoDB Sweet Spot Use Cases
Big Data Product & Asset
Catalogs
Security &
Fraud
Internet of
Things
Database-as-a-
Service
Mobile
Apps
Customer Data
Management Single View
Social &
Collaboration
Content
Management
Intelligence
Agencies
Top Investment
and Retail Banks
Top Global
Shipping
Company
Top Industrial
Equipment
Manufacturer
Top Media
Company
Top Investment
and Retail Banks
Complex Data
Management
Top Investment
and Retail Banks
Embedded /
ISV
Cushman &
Wakefield
26
CIGNEX Datamatics Confidential www.cignex.com27
CIGNEX Datamatics - Established in 2000, USA
12+ Open Source
Framework/ Components#1 Pure Play Open
Source Services Company
15 Open Source
Books Authored
Global Offices
13+Business Engagement
Platforms4+
Open Source
Community Contributions5000+Open Source
Implementations500+Open Source
Consultants500+
Portals, Content & Collaboration
Portals
Enterprise Integration
Identity Relationship Management
Enterprise Content Management
Document Management
Web Content Management
Learning/Knowledge Management
Imaging and Scanning - OCR/Digitization
Enterprise Search
Business Process Management
E-Commerce
B2B e-Commerce
B2C e-Commerce
Internet of Things (IoT)
Big Data Analytics
Data Integration
Information Delivery
Data Analysis
Open Source Solutions
Business Engagement Platforms
CIGNEX Datamatics Confidential www.cignex.com28
At Glance – CIGNEX Datamatics Big Data Analytics & IoT Case Studies
Improve performance through real-time
intelligence by efficient device
management. & issue identification
GPS Services Company Networking Company
Increase customer satisfaction &
revenue due to uninterrupted video
experience anywhere anytime on any
device
Modernization of legacy Quote Portal
resulting into competitive advantage –
Quote in 5 minutes
Insurance Company
First mover advantage with timely
launch of Sentiment and Trending
Analysis service
SaaS Start-up Company B2B Market Intelligence Services
100% Increase in Conversion Rate with
Single View of Business and Market
Intelligence
E-Learning Community Portal
7x-10x Efficient User Data Management
with Improved application performance
and data security
CIGNEX Datamatics Confidential www.cignex.com29
Questions ?
Test Drive Big Data Analytics & IoT
Engage us for Proof-of-Concept (PoC)
Website: www.cignex.com | Email: info@cignex.com

Mais conteúdo relacionado

Mais procurados

Overcoming Today's Data Challenges with MongoDB
Overcoming Today's Data Challenges with MongoDBOvercoming Today's Data Challenges with MongoDB
Overcoming Today's Data Challenges with MongoDBMongoDB
 
Webinar: Simplifying the Database Experience with MongoDB Atlas
Webinar: Simplifying the Database Experience with MongoDB AtlasWebinar: Simplifying the Database Experience with MongoDB Atlas
Webinar: Simplifying the Database Experience with MongoDB AtlasMongoDB
 
Webinar: An Enterprise Architect’s View of MongoDB
Webinar: An Enterprise Architect’s View of MongoDBWebinar: An Enterprise Architect’s View of MongoDB
Webinar: An Enterprise Architect’s View of MongoDBMongoDB
 
MongoDB: Agile Combustion Engine
MongoDB: Agile Combustion EngineMongoDB: Agile Combustion Engine
MongoDB: Agile Combustion EngineNorberto Leite
 
MongoDB in a Mainframe World
MongoDB in a Mainframe WorldMongoDB in a Mainframe World
MongoDB in a Mainframe WorldMongoDB
 
MongoDB vs Mysql. A devops point of view
MongoDB vs Mysql. A devops point of viewMongoDB vs Mysql. A devops point of view
MongoDB vs Mysql. A devops point of viewPierre Baillet
 
Maximizing MongoDB Performance on AWS
Maximizing MongoDB Performance on AWSMaximizing MongoDB Performance on AWS
Maximizing MongoDB Performance on AWSMongoDB
 
MongoATL: How Sourceforge is Using MongoDB
MongoATL: How Sourceforge is Using MongoDBMongoATL: How Sourceforge is Using MongoDB
MongoATL: How Sourceforge is Using MongoDBRick Copeland
 
MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB present...
MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB  present...MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB  present...
MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB present...MongoDB
 
Bye Bye Legacy: Simplifying the Journey
Bye Bye Legacy: Simplifying the JourneyBye Bye Legacy: Simplifying the Journey
Bye Bye Legacy: Simplifying the JourneyMongoDB
 
Partner Solutions: CIGNEX Datamatics – Alfresco integration with Liferay Port...
Partner Solutions: CIGNEX Datamatics – Alfresco integration with Liferay Port...Partner Solutions: CIGNEX Datamatics – Alfresco integration with Liferay Port...
Partner Solutions: CIGNEX Datamatics – Alfresco integration with Liferay Port...Alfresco Software
 
Building a Microservices-based ERP System
Building a Microservices-based ERP SystemBuilding a Microservices-based ERP System
Building a Microservices-based ERP SystemMongoDB
 
When to Use MongoDB...and When You Should Not...
When to Use MongoDB...and When You Should Not...When to Use MongoDB...and When You Should Not...
When to Use MongoDB...and When You Should Not...MongoDB
 
MongoDB 3.2 Feature Preview
MongoDB 3.2 Feature PreviewMongoDB 3.2 Feature Preview
MongoDB 3.2 Feature PreviewNorberto Leite
 
MongoDB and RDBMS: Using Polyglot Persistence at Equifax
MongoDB and RDBMS: Using Polyglot Persistence at Equifax MongoDB and RDBMS: Using Polyglot Persistence at Equifax
MongoDB and RDBMS: Using Polyglot Persistence at Equifax MongoDB
 
Apache Spark and MongoDB - Turning Analytics into Real-Time Action
Apache Spark and MongoDB - Turning Analytics into Real-Time ActionApache Spark and MongoDB - Turning Analytics into Real-Time Action
Apache Spark and MongoDB - Turning Analytics into Real-Time ActionJoão Gabriel Lima
 
Experian Health: Moving Universal Identity Manager from ANSI SQL to MongoDB
Experian Health: Moving Universal Identity Manager from ANSI SQL to MongoDBExperian Health: Moving Universal Identity Manager from ANSI SQL to MongoDB
Experian Health: Moving Universal Identity Manager from ANSI SQL to MongoDBMongoDB
 
Migrating from RDBMS to MongoDB
Migrating from RDBMS to MongoDBMigrating from RDBMS to MongoDB
Migrating from RDBMS to MongoDBMongoDB
 

Mais procurados (20)

Overcoming Today's Data Challenges with MongoDB
Overcoming Today's Data Challenges with MongoDBOvercoming Today's Data Challenges with MongoDB
Overcoming Today's Data Challenges with MongoDB
 
Webinar: Simplifying the Database Experience with MongoDB Atlas
Webinar: Simplifying the Database Experience with MongoDB AtlasWebinar: Simplifying the Database Experience with MongoDB Atlas
Webinar: Simplifying the Database Experience with MongoDB Atlas
 
Webinar: An Enterprise Architect’s View of MongoDB
Webinar: An Enterprise Architect’s View of MongoDBWebinar: An Enterprise Architect’s View of MongoDB
Webinar: An Enterprise Architect’s View of MongoDB
 
MongoDB: Agile Combustion Engine
MongoDB: Agile Combustion EngineMongoDB: Agile Combustion Engine
MongoDB: Agile Combustion Engine
 
MongoDB in a Mainframe World
MongoDB in a Mainframe WorldMongoDB in a Mainframe World
MongoDB in a Mainframe World
 
MongoDB vs Mysql. A devops point of view
MongoDB vs Mysql. A devops point of viewMongoDB vs Mysql. A devops point of view
MongoDB vs Mysql. A devops point of view
 
Maximizing MongoDB Performance on AWS
Maximizing MongoDB Performance on AWSMaximizing MongoDB Performance on AWS
Maximizing MongoDB Performance on AWS
 
MongoDB on Azure
MongoDB on AzureMongoDB on Azure
MongoDB on Azure
 
MongoATL: How Sourceforge is Using MongoDB
MongoATL: How Sourceforge is Using MongoDBMongoATL: How Sourceforge is Using MongoDB
MongoATL: How Sourceforge is Using MongoDB
 
MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB present...
MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB  present...MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB  present...
MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB present...
 
Bye Bye Legacy: Simplifying the Journey
Bye Bye Legacy: Simplifying the JourneyBye Bye Legacy: Simplifying the Journey
Bye Bye Legacy: Simplifying the Journey
 
Partner Solutions: CIGNEX Datamatics – Alfresco integration with Liferay Port...
Partner Solutions: CIGNEX Datamatics – Alfresco integration with Liferay Port...Partner Solutions: CIGNEX Datamatics – Alfresco integration with Liferay Port...
Partner Solutions: CIGNEX Datamatics – Alfresco integration with Liferay Port...
 
Building a Microservices-based ERP System
Building a Microservices-based ERP SystemBuilding a Microservices-based ERP System
Building a Microservices-based ERP System
 
When to Use MongoDB...and When You Should Not...
When to Use MongoDB...and When You Should Not...When to Use MongoDB...and When You Should Not...
When to Use MongoDB...and When You Should Not...
 
MongoDB 3.2 Feature Preview
MongoDB 3.2 Feature PreviewMongoDB 3.2 Feature Preview
MongoDB 3.2 Feature Preview
 
Mongo db 3.4 Overview
Mongo db 3.4 OverviewMongo db 3.4 Overview
Mongo db 3.4 Overview
 
MongoDB and RDBMS: Using Polyglot Persistence at Equifax
MongoDB and RDBMS: Using Polyglot Persistence at Equifax MongoDB and RDBMS: Using Polyglot Persistence at Equifax
MongoDB and RDBMS: Using Polyglot Persistence at Equifax
 
Apache Spark and MongoDB - Turning Analytics into Real-Time Action
Apache Spark and MongoDB - Turning Analytics into Real-Time ActionApache Spark and MongoDB - Turning Analytics into Real-Time Action
Apache Spark and MongoDB - Turning Analytics into Real-Time Action
 
Experian Health: Moving Universal Identity Manager from ANSI SQL to MongoDB
Experian Health: Moving Universal Identity Manager from ANSI SQL to MongoDBExperian Health: Moving Universal Identity Manager from ANSI SQL to MongoDB
Experian Health: Moving Universal Identity Manager from ANSI SQL to MongoDB
 
Migrating from RDBMS to MongoDB
Migrating from RDBMS to MongoDBMigrating from RDBMS to MongoDB
Migrating from RDBMS to MongoDB
 

Destaque

Corporate Presentation - May 2016
Corporate Presentation - May 2016Corporate Presentation - May 2016
Corporate Presentation - May 2016Marc Nader
 
Creating Real-time Systems of Engagement with Analytics and Big Data
Creating Real-time Systems of Engagement with Analytics and Big DataCreating Real-time Systems of Engagement with Analytics and Big Data
Creating Real-time Systems of Engagement with Analytics and Big DataMongoDB
 
Rtvl Corporate Presentation Final
Rtvl   Corporate Presentation FinalRtvl   Corporate Presentation Final
Rtvl Corporate Presentation FinalNeeraj kumar
 
Thoughts on MongoDB Analytics
Thoughts on MongoDB AnalyticsThoughts on MongoDB Analytics
Thoughts on MongoDB Analyticsrogerbodamer
 
Klmug presentation - Simple Analytics with MongoDB
Klmug presentation - Simple Analytics with MongoDBKlmug presentation - Simple Analytics with MongoDB
Klmug presentation - Simple Analytics with MongoDBRoss Affandy
 
Social Analytics on MongoDB at MongoNYC
Social Analytics on MongoDB at MongoNYCSocial Analytics on MongoDB at MongoNYC
Social Analytics on MongoDB at MongoNYCPatrick Stokes
 
Big Data Analytics 1: Driving Personalized Experiences Using Customer Profiles
Big Data Analytics 1: Driving Personalized Experiences Using Customer ProfilesBig Data Analytics 1: Driving Personalized Experiences Using Customer Profiles
Big Data Analytics 1: Driving Personalized Experiences Using Customer ProfilesMongoDB
 
Blazing Fast Analytics with MongoDB & Spark
Blazing Fast Analytics with MongoDB & SparkBlazing Fast Analytics with MongoDB & Spark
Blazing Fast Analytics with MongoDB & SparkMongoDB
 
MongoDB for Analytics
MongoDB for AnalyticsMongoDB for Analytics
MongoDB for AnalyticsMongoDB
 
Webinar: How Penton Uses MongoDB As an Analytics Platform within their Drupal...
Webinar: How Penton Uses MongoDB As an Analytics Platform within their Drupal...Webinar: How Penton Uses MongoDB As an Analytics Platform within their Drupal...
Webinar: How Penton Uses MongoDB As an Analytics Platform within their Drupal...MongoDB
 
Webinar: MongoDB and Analytics: Building Solutions with the MongoDB BI Connector
Webinar: MongoDB and Analytics: Building Solutions with the MongoDB BI ConnectorWebinar: MongoDB and Analytics: Building Solutions with the MongoDB BI Connector
Webinar: MongoDB and Analytics: Building Solutions with the MongoDB BI ConnectorMongoDB
 
Real Time Data Analytics with MongoDB and Fluentd at Wish
Real Time Data Analytics with MongoDB and Fluentd at WishReal Time Data Analytics with MongoDB and Fluentd at Wish
Real Time Data Analytics with MongoDB and Fluentd at WishMongoDB
 
MongoDB World 2016: The Best IoT Analytics with MongoDB
MongoDB World 2016: The Best IoT Analytics with MongoDBMongoDB World 2016: The Best IoT Analytics with MongoDB
MongoDB World 2016: The Best IoT Analytics with MongoDBMongoDB
 
MongoDB IoT City Tour STUTTGART: Industrial Internet, Industry 4.0, Smart Fac...
MongoDB IoT City Tour STUTTGART: Industrial Internet, Industry 4.0, Smart Fac...MongoDB IoT City Tour STUTTGART: Industrial Internet, Industry 4.0, Smart Fac...
MongoDB IoT City Tour STUTTGART: Industrial Internet, Industry 4.0, Smart Fac...MongoDB
 

Destaque (15)

Corporate Presentation - May 2016
Corporate Presentation - May 2016Corporate Presentation - May 2016
Corporate Presentation - May 2016
 
eInfochips Corporate PPT - Oct 2014 Rev 1 0
eInfochips Corporate PPT - Oct 2014 Rev 1 0eInfochips Corporate PPT - Oct 2014 Rev 1 0
eInfochips Corporate PPT - Oct 2014 Rev 1 0
 
Creating Real-time Systems of Engagement with Analytics and Big Data
Creating Real-time Systems of Engagement with Analytics and Big DataCreating Real-time Systems of Engagement with Analytics and Big Data
Creating Real-time Systems of Engagement with Analytics and Big Data
 
Rtvl Corporate Presentation Final
Rtvl   Corporate Presentation FinalRtvl   Corporate Presentation Final
Rtvl Corporate Presentation Final
 
Thoughts on MongoDB Analytics
Thoughts on MongoDB AnalyticsThoughts on MongoDB Analytics
Thoughts on MongoDB Analytics
 
Klmug presentation - Simple Analytics with MongoDB
Klmug presentation - Simple Analytics with MongoDBKlmug presentation - Simple Analytics with MongoDB
Klmug presentation - Simple Analytics with MongoDB
 
Social Analytics on MongoDB at MongoNYC
Social Analytics on MongoDB at MongoNYCSocial Analytics on MongoDB at MongoNYC
Social Analytics on MongoDB at MongoNYC
 
Big Data Analytics 1: Driving Personalized Experiences Using Customer Profiles
Big Data Analytics 1: Driving Personalized Experiences Using Customer ProfilesBig Data Analytics 1: Driving Personalized Experiences Using Customer Profiles
Big Data Analytics 1: Driving Personalized Experiences Using Customer Profiles
 
Blazing Fast Analytics with MongoDB & Spark
Blazing Fast Analytics with MongoDB & SparkBlazing Fast Analytics with MongoDB & Spark
Blazing Fast Analytics with MongoDB & Spark
 
MongoDB for Analytics
MongoDB for AnalyticsMongoDB for Analytics
MongoDB for Analytics
 
Webinar: How Penton Uses MongoDB As an Analytics Platform within their Drupal...
Webinar: How Penton Uses MongoDB As an Analytics Platform within their Drupal...Webinar: How Penton Uses MongoDB As an Analytics Platform within their Drupal...
Webinar: How Penton Uses MongoDB As an Analytics Platform within their Drupal...
 
Webinar: MongoDB and Analytics: Building Solutions with the MongoDB BI Connector
Webinar: MongoDB and Analytics: Building Solutions with the MongoDB BI ConnectorWebinar: MongoDB and Analytics: Building Solutions with the MongoDB BI Connector
Webinar: MongoDB and Analytics: Building Solutions with the MongoDB BI Connector
 
Real Time Data Analytics with MongoDB and Fluentd at Wish
Real Time Data Analytics with MongoDB and Fluentd at WishReal Time Data Analytics with MongoDB and Fluentd at Wish
Real Time Data Analytics with MongoDB and Fluentd at Wish
 
MongoDB World 2016: The Best IoT Analytics with MongoDB
MongoDB World 2016: The Best IoT Analytics with MongoDBMongoDB World 2016: The Best IoT Analytics with MongoDB
MongoDB World 2016: The Best IoT Analytics with MongoDB
 
MongoDB IoT City Tour STUTTGART: Industrial Internet, Industry 4.0, Smart Fac...
MongoDB IoT City Tour STUTTGART: Industrial Internet, Industry 4.0, Smart Fac...MongoDB IoT City Tour STUTTGART: Industrial Internet, Industry 4.0, Smart Fac...
MongoDB IoT City Tour STUTTGART: Industrial Internet, Industry 4.0, Smart Fac...
 

Semelhante a MongoDB Webinar Case Study Big Data Analytics

2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise AnalyticsDATAVERSITY
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Denodo
 
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data VirtualizationDAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data VirtualizationDenodo
 
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demands
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demandsMongoDB .local Chicago 2019: MongoDB – Powering the new age data demands
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demandsMongoDB
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
InfoSphere BigInsights
InfoSphere BigInsightsInfoSphere BigInsights
InfoSphere BigInsightsWilfried Hoge
 
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...Denodo
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Denodo
 
Connecting Silos in Real Time with Data Virtualization
Connecting Silos in Real Time with Data VirtualizationConnecting Silos in Real Time with Data Virtualization
Connecting Silos in Real Time with Data VirtualizationDenodo
 
Data & Analytics with CIS & Microsoft Platforms
Data & Analytics with CIS & Microsoft PlatformsData & Analytics with CIS & Microsoft Platforms
Data & Analytics with CIS & Microsoft PlatformsSonata Software
 
Big Data and Semantic Web in Manufacturing
Big Data and Semantic Web in ManufacturingBig Data and Semantic Web in Manufacturing
Big Data and Semantic Web in ManufacturingNitesh Khilwani
 
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATIONBig Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATIONMatt Stubbs
 
How to Increase Performance in IBM Cognos
How to Increase Performance in IBM CognosHow to Increase Performance in IBM Cognos
How to Increase Performance in IBM CognosCresco International
 
Accelerating a Path to Digital With a Cloud Data Strategy
Accelerating a Path to Digital With a Cloud Data StrategyAccelerating a Path to Digital With a Cloud Data Strategy
Accelerating a Path to Digital With a Cloud Data StrategyMongoDB
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesAmazon Web Services
 
Overcoming Data Gravity in Multi-Cloud Enterprise Architectures
Overcoming Data Gravity in Multi-Cloud Enterprise ArchitecturesOvercoming Data Gravity in Multi-Cloud Enterprise Architectures
Overcoming Data Gravity in Multi-Cloud Enterprise ArchitecturesVMware Tanzu
 

Semelhante a MongoDB Webinar Case Study Big Data Analytics (20)

BigData Analysis
BigData AnalysisBigData Analysis
BigData Analysis
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
 
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data VirtualizationDAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
 
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demands
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demandsMongoDB .local Chicago 2019: MongoDB – Powering the new age data demands
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demands
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 
InfoSphere BigInsights
InfoSphere BigInsightsInfoSphere BigInsights
InfoSphere BigInsights
 
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
 
Connecting Silos in Real Time with Data Virtualization
Connecting Silos in Real Time with Data VirtualizationConnecting Silos in Real Time with Data Virtualization
Connecting Silos in Real Time with Data Virtualization
 
Data & Analytics with CIS & Microsoft Platforms
Data & Analytics with CIS & Microsoft PlatformsData & Analytics with CIS & Microsoft Platforms
Data & Analytics with CIS & Microsoft Platforms
 
Big Data and Semantic Web in Manufacturing
Big Data and Semantic Web in ManufacturingBig Data and Semantic Web in Manufacturing
Big Data and Semantic Web in Manufacturing
 
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATIONBig Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
 
How to Increase Performance in IBM Cognos
How to Increase Performance in IBM CognosHow to Increase Performance in IBM Cognos
How to Increase Performance in IBM Cognos
 
About CDAP
About CDAPAbout CDAP
About CDAP
 
Accelerating a Path to Digital With a Cloud Data Strategy
Accelerating a Path to Digital With a Cloud Data StrategyAccelerating a Path to Digital With a Cloud Data Strategy
Accelerating a Path to Digital With a Cloud Data Strategy
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business Outcomes
 
Machine Data Analytics
Machine Data AnalyticsMachine Data Analytics
Machine Data Analytics
 
Overcoming Data Gravity in Multi-Cloud Enterprise Architectures
Overcoming Data Gravity in Multi-Cloud Enterprise ArchitecturesOvercoming Data Gravity in Multi-Cloud Enterprise Architectures
Overcoming Data Gravity in Multi-Cloud Enterprise Architectures
 

Mais de MongoDB

MongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Migrate Anything* to MongoDB AtlasMongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Migrate Anything* to MongoDB AtlasMongoDB
 
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!MongoDB
 
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...MongoDB
 
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDBMongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDBMongoDB
 
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...MongoDB
 
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series DataMongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series DataMongoDB
 
MongoDB SoCal 2020: MongoDB Atlas Jump Start
 MongoDB SoCal 2020: MongoDB Atlas Jump Start MongoDB SoCal 2020: MongoDB Atlas Jump Start
MongoDB SoCal 2020: MongoDB Atlas Jump StartMongoDB
 
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]MongoDB
 
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2MongoDB
 
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...MongoDB
 
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!MongoDB
 
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your MindsetMongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your MindsetMongoDB
 
MongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: MongoDB Atlas JumpstartMongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: MongoDB Atlas JumpstartMongoDB
 
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...MongoDB
 
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++MongoDB
 
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...MongoDB
 
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep DiveMongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep DiveMongoDB
 
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & GolangMongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & GolangMongoDB
 
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...MongoDB
 
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...MongoDB
 

Mais de MongoDB (20)

MongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Migrate Anything* to MongoDB AtlasMongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
 
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
 
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
 
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDBMongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
 
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
 
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series DataMongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
 
MongoDB SoCal 2020: MongoDB Atlas Jump Start
 MongoDB SoCal 2020: MongoDB Atlas Jump Start MongoDB SoCal 2020: MongoDB Atlas Jump Start
MongoDB SoCal 2020: MongoDB Atlas Jump Start
 
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
 
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
 
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
 
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
 
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your MindsetMongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
 
MongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: MongoDB Atlas JumpstartMongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
 
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
 
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
 
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
 
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep DiveMongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
 
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & GolangMongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
 
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
 
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
 

Último

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 

Último (20)

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 

MongoDB Webinar Case Study Big Data Analytics

  • 1. CIGNEX Datamatics Confidential www.cignex.com Webinar: Faster Big Data Analytics with MongoDB Case Study: Building Large Scale Data Processing and Data Analysis Platform using MongoDB Date: 06th April 2016 Speakers: Buzz Moschetti Enterprise Architecture and Special Programs MongoDB Anurag Seth VP, Big Data Analytics & IoT Practice CIGNEX Datamatics
  • 2. CIGNEX Datamatics Confidential www.cignex.com Buzz Moschetti, Enterprise Architecture and Special Programs MongoDB Buzz works with F1000 companies to help them design next-generation solutions and develop strategies for overall technology transformation. He is also the CTO of the partner program at MongoDB and a liason to Engineering, Product Management, and Marketing groups. – 25+ years experience in the field, mostly in financial services as CAO of the Investment Bank at JPMorganChase and Bear Stearns before that Anurag Seth, VP, Big Data Analytics & Internet of Things (IoT) Practice, CIGNEX Datamatics Anurag has unique blend of technology expertise from deep tech VLSI chip design to complex high performance algorithmic software development in EDA (Electronic Design Automation) to embedded system design to predictive modelling & Big Data Analytics deployment for compelling use-cases (including IOT). – 25 years of strong experience in technology development & delivery – product as well as services across VLSI/EDA, Healthcare , Enterprise Big Data Implementations & IOT – Has served on board of the VLSI Lab at IIT Kharagpur, been the general chair of the International conference on VLSI Design & Embedded Systems (2009) and still continues to serve on the steering committee of the conference 2 Who are we ?
  • 3. CIGNEX Datamatics Confidential www.cignex.com • Big Data Analytics: Opportunity & Challenges • Case Study: Building Large Scale Data Processing and Analysis Platform using MongoDB – Business Needs – Our Approach – Solution Architecture – MongoDB - A Great Fit for Data Processing and Analytics – MongoDB Performance Tuning - Our Holistic Approach – Recommended Best Practices • Why MongoDB ? • Why CIGNEX Datamatics ? Topics 3
  • 4. CIGNEX Datamatics Confidential www.cignex.com Over 88% of data sources and types are not being analyzed.. 4 Big Data Analytics: Business Opportunities Transactional & Application Data Machine Data Enterprise Content Social Data Reduce Operational Costs Improved Risk Management Many more.. Volume Structured Velocity Semi-structured Variety Un-structured Variety Un-structured Sensor Data Velocity Semi-structured
  • 5. CIGNEX Datamatics Confidential www.cignex.com The organizations that uses Big Data Analytics to integrate, process and analyze these data sources are up to 25x more likely to outperform their competitors. 5 Big Data Analytics: Business Opportunities Improve Process Efficiency (Sales, Marketing, Finance, Operations) Product/Service Innovation Monetize Information Improved Collaboration Improve customer experience Reduce Operational Costs Improved Risk Management
  • 6. CIGNEX Datamatics Confidential www.cignex.com • Getting the right data & Infra architecture for performance & scalability • Leverage investments in existing technologies • Integrating multi-channel & variety of data sources at the modern volume • Data quality & accuracy challenges • Big data technologies are evolving too quickly to adapt • Scarcity of skills and capabilities 6 Big Data Analytics - Implementation to Production Challenges • Hard ROI from Big Data? – Identify & monetize existing & new Data Streams • Turn-around time for big data (predictive modelling) deployments • Difficult to make big data fit-for- purpose (uncertainty), assess the level of trust, and ensure security & privacy • Lack of domain centricity Technical Business
  • 7. CIGNEX Datamatics Confidential www.cignex.com Case Study: Building Large Scale Data Processing and Analysis Platform using MongoDB 7
  • 8. CIGNEX Datamatics Confidential www.cignex.com • SaaS based sales analytics platform that acquires, processes and enriches accessible public data to deliver data-driven customer and business insights that: – Enhances efficacy of customer acquisition – Improve operational efficiency – Competitive & complementary selling opportunities – Determine buying propensity, influencers & decision makers 8 Business Need PUBLIC DATA ACQUISITION SOCIAL LISTENING CUSTOMER/BUSINESS INSIGHTS
  • 9. CIGNEX Datamatics Confidential www.cignex.com9 Our Approach Segment data by influential characteristics as the best variables to use, use case centric 2. DATA PREPARATION Evaluate and combine multiple models or techniques that lead to higher efficiency 3. MODELING Dashboard for Big Data Analytics 4. ANALYTICS Define data sources that could influence the outcome. 1. DATA ACQUISITION Extensive multi-step rule-based ETL process which involves de- duplication, geo-coding, smart-filtering over huge dataset etc. Machine Learning ? Augment with ML algorithms in the longer run. Semantic associations ? Leverage the power of semantic associations (NLP for Entity Extraction, Entity Associations) to process millions of entities & implement complex business rules for data enrichment and refinement Social listening that integrate 20+ Open public data sources using REST APIs. Store and manage 1billion+ objects expected to be ingested and processed by leveraging elastic scalability of AWS cloud compute Front-end application with intuitive search/mining and dashboard with graphical visualization of thousands of records with faster response time.
  • 10. CIGNEX Datamatics Confidential www.cignex.com10 Solution Architecture (High Level) Data Processing Data Visualization Social Data Market Data External Data Location Data Data Enrichment Data Processing Cluster Customized Core Java based ETLs and Java scripts Third Party ETL Cluster (one of these) Front-End Application Full Text Search Engine (one of these) MongoDB Cluster Customer Data Amazon Cloud Hosting (Elastic Cloud Computing - EC2) MongoDB Secondary MongoDB Primary MongoDB Secondary MongoDB Cluster MongoDB Primary MongoDB Secondary MongoDB Secondary Jasper/ Tableau/ C3/D3.js Visualization Front End Application Framework
  • 11. CIGNEX Datamatics Confidential www.cignex.com Requirement MongoDB Features • Support multiple data processing pipelines – Via ETL Tool – Via Custom Code – Via Custom Scripts • Integration with leading data integration tools – Alteryx, Talend, Pentaho • Java Driver to create custom business logic • Support for server side JavaScript to trigger custom business Logic • Sustain write throughput with increasing data volumes • Sharding to scale out horizontally and distribute load • WiredTiger storage engine (>=Version 3) with features such as document level concurrency facilitating excellent write performance, optimal memory usage, data compression for faster data access and efficient storage • Provide low latency • Support large number of concurrent user and sustain response times • Sharding to route/distribute read requests to separate nodes • Data & index compression features in in WiredTiger storage engine facilitate better performance • Store indexes on separate mounts and improve read throughput 11 MongoDB - A Great Fit for Data Processing and Analytics
  • 12. CIGNEX Datamatics Confidential www.cignex.com13 Implementation Challenges Implementation Challenges Solution • Unifying different Data Processing components(ETL, Custom Code) & overall ETL efficiency • Created custom / configurable orchestration engine which allows full / partial execution of data processing steps • Created a dashboard which shows monitoring of the execution steps – allows re-start from anywhere in the multi-step ETL process • Performance Tuning of Data Processing & Analysis frameworks • Holistic approach to performance tuning (Covered in detail later) • Serve different data analysis use cases (Full Text Search, Sub second response times, Persistent Data storage) • Utilize complimentary technologies – MongoDB for persistent storage, horizontal scalability, analytics – Elastic Search or Solr for full text search use cases • Data Quality • We initially underestimated the extent of quality issues with the data (more so, since most of the data was public). During the execution, we budgeted and hired a dedicated experienced BA who assumed responsibility of data quality & cleaning-up
  • 13. CIGNEX Datamatics Confidential www.cignex.com Best Practices To be successful, you must address your overall design and technology stack, not just schema design. 14
  • 14. CIGNEX Datamatics Confidential www.cignex.com A Holistic Approach to MongoDB Performance Tuning Infrastructure Layer Storage Engine Data Model Query Language Application Layer Cluster Sizing & Configuration • Right Size • Optimum Price benefit Replica set sizing, Sharding Map to use case, R/W Heaviness Access pattern based Schema Indexes, Query Tuning • MongoDB Drivers • Architecture & Design 15
  • 15. CIGNEX Datamatics Confidential www.cignex.com • Infrastructure Sizing: – SSDs provide VERY SIGNIFICANT performance boost specially for write-heavy workloads – Investment in CPU with more cores often delivers more benefits than investing in faster CPU – Ensure that your working-set fits in the RAM (use db.serverStatus() command to view an estimate of the the current working set size) – Evaluate thoroughly whether journaling is needed. Remember that, with journaling turned on MongoDB ends up using double the RAM. • Cloud Infrastructure Capacity Planning: – Leverage cloud platform with the right instance type by evaluating access patterns, workloads & storage requirements. 16 A Holistic Approach to MongoDB Performance Tuning Future Scalability Query Tuning Design Approach, Schema Design OS & Storage Optimisation Infrastructure Sizing & Capacity Planning
  • 16. CIGNEX Datamatics Confidential www.cignex.com • Storage Optimization: – Recommend use of WiredTiger as storage engine • OS Optimization: – Disable NUMA – non uniform memory access- not good for operational database (configure a memory interleave policy ) – Don’t use Huge Pages virtual memory pages – mongo performs better with normal virtual memory pages – Readahead size should be set to 32 (use the blockdev --setra <value>) – Increase ulimit (>20,000) – Turn off atime for the storage volume containing database files 17 A Holistic Approach to MongoDB Performance Tuning Future Scalability Query Tuning Design Approach, Schema Design OS & Storage Optimisation Infrastructure Sizing & Capacity Planning
  • 17. CIGNEX Datamatics Confidential www.cignex.com • Schema Design: – Always invest time in schema design, dynamic schema only means additional flexibility !! – Don’t store empty fields in documents – Create the indexes very carefully. More indexes != more performance. Indexes not fitting not fitting in RAM are often counterproductive for performance – No Index creation on the FLY – Index creation in designated “Maintenance Window“ – Use Bulk API feature whenever possible. We have often witnessed significant gains in the write throughput – Use index optimizations available in the WiredTiger storage engine 18 A Holistic Approach to MongoDB Performance Tuning Future Scalability Query Tuning Design Approach, Schema Design OS & Storage Optimisation Infrastructure Sizing & Capacity Planning
  • 18. CIGNEX Datamatics Confidential www.cignex.com • Scalability: – Horizontal scaling through sharding – Use MongoDB aggregation framework – Always keep the NFRs on top from design to implementation. • Query Tuning: – Effective use of indexes to support queries – Avoid negation in queries & scatter-gather queries – Reduce query result set size where-ever possible using limit and projections – Effective & frequent use of MongoDB query profiler & explain command – Leverage each utility provided by MongoDB - mongoperf, mongosniff, mongostat, mongotop 19 A Holistic Approach to MongoDB Performance Tuning Future Scalability Query Tuning Design Approach, Schema Design OS & Storage Optimisation Infrastructure Sizing & Capacity Planning
  • 19. CIGNEX Datamatics Confidential www.cignex.com • Simplified solution architecture with the right technologies for the use case • Performance Tuning & scalability initiated from Day 1 – Holistic approach to performance tuning reduced response times from ~ 2- 3 minutes to ~ 3 -5 seconds • Proprietary & Open Source can coexist – Leverage existing investments proprietary tools and Open Source technologies that reduce licensing costs – Leverage open source java script components for visualization • Team composition played critical – Need complimentary skills: – Solution Architecture | Dev-Ops | Business Analysis/Data Science • Elastic compute storage – Leverage AWS cloud features of elastic scalability to upsize/downsize compute power based on data processing workloads. 20 Benefits Delivered
  • 20. CIGNEX Datamatics Confidential www.cignex.com MongoDB Vital Stats 500+ employees 2000+ customers Over $311 million in funding Offices in NY & Palo Alto and across EMEA, and APAC 21
  • 21. CIGNEX Datamatics Confidential www.cignex.com The best way to run MongoDB Automated. Supported. Secured. Features beyond those in the community edition: Enterprise-Grade Support Commercial License Ops Manager or Cloud Manager Premium Encrypted & In-Memory Storage Engines MongoDB Compass BI Connector (SQL Bridge) Advanced Security Platform Certification On-Demand Training MongoDB Enterprise Edition 22
  • 22. CIGNEX Datamatics Confidential www.cignex.com { _id: “123”, title: "MongoDB: The Definitive Guide", authors: [ { _id: "kchodorow", name: "Kristina Chodorow“ }, { _id: "mdirold", name: “Mike Dirolf“ } ], published_date: ISODate(”2010-09-24”), pages: 216, language: "English", thumbnail: BinData(0,"AREhMQ=="), publisher: { name: "O’Reilly Media", founded: 1980, locations: ["CA”, ”NY” ] } } The Data Is The Schema 23
  • 23. CIGNEX Datamatics Confidential www.cignex.com > db.authors.find() { _id: ”X12", name: { first: "Kristina”, last: “Chodorow” }, personalData: { favoritePets: [ “bird”, “dog” ], awards: [ {name: “Hugo”, when: 1983}, {name: “SSFX”, when: 1992} ] } } { _id: ”Y45", name: { first: ”Mike”, last: “Dirolf” } , personalData: { dob: ISODate(“1970-04-05”) } } Treat Your Data More Like Objects 24
  • 24. CIGNEX Datamatics Confidential www.cignex.com 7x-10x Performance, 50%-80% Less Storage MongoDB 3.0 Set The Stage… How: WiredTiger Storage Engine • Same data model, query language, & ops • 100% backwards compatible API • Non-disruptive upgrade • Storage savings driven by native compression • Write performance gains driven by – Document-level concurrency control – More efficient use of HW threads • Much better ability to scale vertically MongoDB 3.0MongoDB 2.6 Performance 25
  • 25. CIGNEX Datamatics Confidential www.cignex.com MongoDB Sweet Spot Use Cases Big Data Product & Asset Catalogs Security & Fraud Internet of Things Database-as-a- Service Mobile Apps Customer Data Management Single View Social & Collaboration Content Management Intelligence Agencies Top Investment and Retail Banks Top Global Shipping Company Top Industrial Equipment Manufacturer Top Media Company Top Investment and Retail Banks Complex Data Management Top Investment and Retail Banks Embedded / ISV Cushman & Wakefield 26
  • 26. CIGNEX Datamatics Confidential www.cignex.com27 CIGNEX Datamatics - Established in 2000, USA 12+ Open Source Framework/ Components#1 Pure Play Open Source Services Company 15 Open Source Books Authored Global Offices 13+Business Engagement Platforms4+ Open Source Community Contributions5000+Open Source Implementations500+Open Source Consultants500+ Portals, Content & Collaboration Portals Enterprise Integration Identity Relationship Management Enterprise Content Management Document Management Web Content Management Learning/Knowledge Management Imaging and Scanning - OCR/Digitization Enterprise Search Business Process Management E-Commerce B2B e-Commerce B2C e-Commerce Internet of Things (IoT) Big Data Analytics Data Integration Information Delivery Data Analysis Open Source Solutions Business Engagement Platforms
  • 27. CIGNEX Datamatics Confidential www.cignex.com28 At Glance – CIGNEX Datamatics Big Data Analytics & IoT Case Studies Improve performance through real-time intelligence by efficient device management. & issue identification GPS Services Company Networking Company Increase customer satisfaction & revenue due to uninterrupted video experience anywhere anytime on any device Modernization of legacy Quote Portal resulting into competitive advantage – Quote in 5 minutes Insurance Company First mover advantage with timely launch of Sentiment and Trending Analysis service SaaS Start-up Company B2B Market Intelligence Services 100% Increase in Conversion Rate with Single View of Business and Market Intelligence E-Learning Community Portal 7x-10x Efficient User Data Management with Improved application performance and data security
  • 28. CIGNEX Datamatics Confidential www.cignex.com29 Questions ? Test Drive Big Data Analytics & IoT Engage us for Proof-of-Concept (PoC) Website: www.cignex.com | Email: info@cignex.com