SlideShare uma empresa Scribd logo
1 de 37
Docker Summit
Containerization and Data Democratization for Cloud Native apps
Chris Grabosky – Solutions Architect @MongoDB
gsky.us | github.com/graboskyc | linkedin.com/in/chris-grabosky
“name”: “Chris Grabosky”,
“employment_history”: [
{
“company”: “MongoDB”,
“job”: “Sr. Solutions Architect”,
“from”: new Date(“2018-08”),
“background”: [“operational workloads”, “BI”,
“application modernization”, “devops”]
},{
“company”: “QualiSystems”,
“background”: [“devops”, “cloud management”, “infrastructure”,
“automation”, “hybrid cloud”, “containerization”, “virtualization”]
},{
“company”: “Comcast”,
“background”: [“network automation”,“devops”, “app dev”]
}
],“education”: [
{ “degree”: “BS”, “field”: “IS”, “school”: “Drexel” },
{ “degree”: “MS”, “field”: “IS”, “school”: “Drexel” }
],
“likes”: [“family”, “programming”, “paintball”],
“links”: [“gsky.us”, “github.com/graboskyc”,
“linkedin.com/in/chris-grabosky/”],
About Me
The world has changed
Large Data Center Virtual Machines VMs in the Cloud (EC2) Containers Serverless & Services
Manage H/W Manage less H/W Size & provision VMs Size & provision containers Just send in requests
Wasted resources Better utilization Rent, not buy Rent with less waste Pay as you go
$$$$$ $$$$ $$$ $$ $
Evolution of Computing Models
5
So you decided to build your app with MongoDB and Docker…
• Easy: Work with data in a natural, intuitive
way
• Flexible: Adapt and make changes quickly
• Fast: Get great performance with less code
• Versatile: Supports a wide variety of data
models and queries
• Modern platform for all applications
• Faster time to market
• Developer productivity
• Developer velocity
• IT infra reduction
• IT ops efficiency
• Faster issue resolution
6
Focus on Developers
Focus on Time to Market
Focus on What Matters
Same in Dev as Prod
So you decided to build your app with MongoDB and Docker…
7
FROM mongo
RUN apt-get clean && apt-get update && apt-get install -y git nodejs npm
COPY source dest
RUN mongod&
CMD npm run start
So, let’s start simply…
8
Everything works great until
a new business requirement:
Analytics
Congrats! Your application is a success!
For more than a decade,
organizations have been pursuing the promise of
Digital Transformation
88%
CIOs believe they have yet to benefit
from their digital strategy
Source: Harvey Nash / KPMG CIO Survey 2017
Source: https://www.datasciencecentral.com/profiles/blogs/economics-monetization-and-the-new-order-automobile-industry
Industry and Business Model Disruption Happens…
A New Wave of Industries Are Under Attack
For Example, the Auto/Transportation Industry
“The source of the value is no
longer the vehicle itself”
Millennials don’t care about owning cars
People will always need to get from “A” to “B”
Must find new sources of product/service value
“Cars as Product” to “Transportation as a Service”
Disruptors Force Incumbents to Transform
Disruptors
Incumbents
Consumer
Appliances/Industrial
Manufacturer
Tire Manufacturer
Internet/Marketing
Known For...
but the
DATAS I L O E D | C O M P L E X | T R A P P E D
We accelerated time to value to get new apps and new features to our customers
is still
Code user authentication
Code data access controls
Provision backend server
Install runtime environment
Add code to make backend HA
Add code to scale backend
Monitor & manage backend infrastructure
Code REST API for frontend to use backend
Code backend application logic
Code application frontend
Code against each external service API
Continuously poll database for changes
Old World
Simple JSON Config
Handled automatically by services
Code frontend using single SDK/API to
access backend services
New World
Backend
Data Access
Frontend
Provide code for Functions
Data
Democratization:
Getting data to the right people in the right format at the right
time so that they may turn it into actionable information,
knowledge, and wisdom.
18
Data Democratization – What and Why?
“We spend more time shoveling coal than steering the ship. We want to shift our energy to looking at the data and
navigating where we are going.”
-- Robert Kagarise, director of population health informatics and IT for the Delaware Valley Accountable Care Organization
What it is... Why it exists...
Data and insights accessible to all who need them
Data is siloed/complex, making it difficult to reconcile and deliver
insights to consumers of varying technical abilities
No technical barriers to accessing the data and insights
Time intensive for IT/Sys Admin/Data Stewards to wrangle data,
delaying insights and their business benefit
Proper access control and governance Security is non-negotiable (esp. in healthcare)
19
Data Democratization – Who and How?
Technicians
• Create
foundation for
data that’s easy
to organize and
enrich over time
• Reduce
complexity and
technical debt
• Define security
logic that works
across orgs and
apps
Developers
• Connect to the
data via APIs &
microservices
vs duplication
or tight
coupling to
monoliths
Analysts
• Utilize self-
service analytic
s through tools
Front-LineWorkers
• Access
insights via
pre-built
dashboards or
embedded in
apps
20
Both the Opportunity and the Challenge is Data...
Patients
Members
Billing
Physicians
Hospitals
Pharmacy
Lab Results
Procedures
Medications
Sales
Enrollment
Claims
Web
Mobile
Social
...but data across the ecosystem is shaped very differently
21
Tabular (Relational) Data Model
Related data split across multiple records and tables
Document Data Model
Related data contained in a single, rich document
{
"_id" : ObjectId("5ad88534e3632e1a35a58d00"),
"name" : {
"first" : "John",
"last" : "Doe" },
"address" : [
{ "location" : "work",
"address" : {
"street" : "16 Hatfields",
"city" : "London",
"postal_code" : "SE1 8DJ"},
"geo" : { "type" : "Point", "coord" : [
51.5065752,-0.109081]}},
+ {...}
],
"phone" : [
{ "location" : "work",
"number" : "+44-1234567890"},
+ {...}
],
"dob" : ISODate("1977-04-01T05:00:00Z"),
"retirement_fund" : NumberDecimal("1292815.75")
}
Need to Think About the Data Differently
22
Tabular (Relational) Data Model
Related data split across multiple records and tables
Document Data Model
Related data contained in a single, rich document
{
"_id" : ObjectId("5ad88534e3632e1a35a58d00"),
"name" : {
"first" : "John",
"last" : "Doe" },
"address" : [
{ "location" : "work",
"address" : {
"street" : "16 Hatfields",
"city" : "London",
"postal_code" : "SE1 8DJ"},
"geo" : { "type" : "Point", "coord" : [
51.5065752,-0.109081]}},
+ {...}
],
"phone" : [
{ "location" : "work",
"number" : "+44-1234567890"},
+ {...}
],
"dob" : ISODate("1977-04-01T05:00:00Z"),
"retirement_fund" : NumberDecimal("1292815.75")
}
This becomes complex & rigid to change
23
How to Approach This Transformation?
Data Layer Optimization:
• unlocks the value of data stored in
silos and legacy systems
• drives rapid, iterative integration
of data sources for new and
existing consuming applications
• supports enterprise data
governance efforts
• builds deep technical expertise
and best practices
Data Source
Integration
Data
APIs
Data Loading
and
Streaming
Legacy
Systems
Offloading
Data
Analytics
Data
Governance
Luckily, we started
with two highly scalable
and portable solutions
25
Freedom to run anywhere
Local
On-premises
Server & Mainframe Private cloud
Fully managed
cloud serviceHybrid cloud Public cloud
• Database that runs the same everywhere
• Leverage the benefits of a multi-cloud strategy
• Global coverage
• Avoid lock-in
Convenience: same codebase, same APIs, same tools, wherever you run
26
Docker Deployment Options with MongoDB
Docker Container with MongoD
Swarm & Compose with Rep Set
Docker & K8S with RS / SC & K8S Operator in Ops Mgr
Cloud-native & MongoDB Atlas
27
MongoDB Solution: Replica Sets
Replica Set – up to 50 nodes
Self-healing
Data Center Aware
Addresses availability considerations:
• High Availability
• Disaster Recovery
• Maintenance
Workload Isolation: operational & analytics
Application
Driver
Primary
Secondary
Secondary
Replication
28
Put data where you need it: Workload Isolation
Enable different workloads on the same data
• Combine operational and analytical workloads on a single data
platform
• Extract live insights from real-time data to enrich applications
• One set of nodes serving operational apps, replicating to
dedicated nodes serving analytics: up to 50 nodes in a single
replica set
• ETL-free
10 ms
2 ms
45 ms
35 ms
85 ms
75 ms
105 ms
10 ms
2 ms
10 ms
4 ms
4 ms
2 ms
10 ms
2 ms
31
Put data where you need it: Scalability with Sharding
Auto-Sharding
● Automatically scale beyond the constraints of a single node
● Application transparent
● Scale and rebalance incrementally, in real time
● Unlike NoSQL systems that randomly spray data across a
cluster, MongoDB exposes multiple data distribution policies to
optimize for query patterns and locality
•••Shard 1 Shard 2 Shard 3 Shard N
● Multiple sharding policies:
hashed, ranged, zoned
● Increase or decrease capacity as you go
● Automatic balancing for elasticity
Horizontally Scalable
32
Rich Query Language
JSON Documents Tabular Key-Value Text GraphGeospatial SQL (BI-Connector)
Declarative Data Access Rules
Fine-grained data access controls
Authorize the user, not just the app
34
ConsumersAPI Layer
Declarative Access Controls
basicProfile
(Read Rule)
{
"%or": [
{"$$pipeline.currentRole": "Nurse"},
{"$$pipeline.currentRole": "Doctor"}
]
}
basicProfile
(Write Rule)
{"$$pipeline.currentRole": "Doctor"}
completeHistory
(Read/Write Rules)
{"$$pipeline.currentRole": "Doctor"}
Doctor
Scientist
Nurse
{Aggregated Data}
API Layer
Declarative Access Controls
Pipeline for Analysis
{"$$pipeline.currentRole": "Scientist"}
Optimized Data Layer
Patient Profile
{
"patient_id": … ,
"basicProfile": {…} ,
"completeHistory": {…}
}
{Read Basic Data}
{Read/Write All}
● Fine-grained data access controls
● Base access on document, field, or value
● Authorize the user, not just the app
● Associate with user profile or any other info
● Defined with JSON rules, not code
API Layer is about more than just serving data
API Calls
• 24% CAGR: Hadoop,
Spark & Streaming
• 18% CAGR: Databases
• Databases are key
components within the
big data landscape
“Big Data” is More than Just Hadoop
Distributed processing/analytics
Custom analytics
workload
Custom analytics
workload
MDB Query
Lang
BI
Connector
Thank You
Chris Grabosky
gsky.us | github.com/graboskyc | linkedin.com/in/chris-grabosky/
chris.grabosky@mongodb.com
cloud.mongodb.com | mongodb.com/download-center

Mais conteúdo relacionado

Mais procurados

Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Denodo
 

Mais procurados (20)

Unlock Data-driven Insights in Databricks Using Location Intelligence
Unlock Data-driven Insights in Databricks Using Location IntelligenceUnlock Data-driven Insights in Databricks Using Location Intelligence
Unlock Data-driven Insights in Databricks Using Location Intelligence
 
Building Your Data Hub to Support Digital
Building Your Data Hub to Support DigitalBuilding Your Data Hub to Support Digital
Building Your Data Hub to Support Digital
 
MPS Enterprise Content Management Solutions
MPS Enterprise Content Management SolutionsMPS Enterprise Content Management Solutions
MPS Enterprise Content Management Solutions
 
Partena 2010.02.10
Partena 2010.02.10Partena 2010.02.10
Partena 2010.02.10
 
Next Gen Analytics Going Beyond Data Warehouse
Next Gen Analytics Going Beyond Data WarehouseNext Gen Analytics Going Beyond Data Warehouse
Next Gen Analytics Going Beyond Data Warehouse
 
BLU Acceleration on the Cloud – 101
BLU Acceleration on the Cloud – 101BLU Acceleration on the Cloud – 101
BLU Acceleration on the Cloud – 101
 
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
 
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
IDERA Slides: Managing the Transition to Hybrid Cloud
IDERA Slides: Managing the Transition to Hybrid CloudIDERA Slides: Managing the Transition to Hybrid Cloud
IDERA Slides: Managing the Transition to Hybrid Cloud
 
How to Evaluate Cloud Databases for eCommerce
How to Evaluate Cloud Databases for eCommerceHow to Evaluate Cloud Databases for eCommerce
How to Evaluate Cloud Databases for eCommerce
 
CWIN17 Singapore / Darmadi komo (microsoft) modern data estate
CWIN17 Singapore / Darmadi komo (microsoft)   modern data estateCWIN17 Singapore / Darmadi komo (microsoft)   modern data estate
CWIN17 Singapore / Darmadi komo (microsoft) modern data estate
 
Data Virtualization for Compliance – Creating a Controlled Data Environment
Data Virtualization for Compliance – Creating a Controlled Data EnvironmentData Virtualization for Compliance – Creating a Controlled Data Environment
Data Virtualization for Compliance – Creating a Controlled Data Environment
 
Big Data Analytics on the Cloud
Big Data Analytics on the CloudBig Data Analytics on the Cloud
Big Data Analytics on the Cloud
 
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
 
Laurel Group Thought Leaders Print 0213
Laurel Group Thought Leaders Print 0213Laurel Group Thought Leaders Print 0213
Laurel Group Thought Leaders Print 0213
 
8.17.11 big data and hadoop with informatica slideshare
8.17.11 big data and hadoop with informatica slideshare8.17.11 big data and hadoop with informatica slideshare
8.17.11 big data and hadoop with informatica slideshare
 
Myth Busters: I’m Building a Data Lake, So I Don’t Need Data Virtualization (...
Myth Busters: I’m Building a Data Lake, So I Don’t Need Data Virtualization (...Myth Busters: I’m Building a Data Lake, So I Don’t Need Data Virtualization (...
Myth Busters: I’m Building a Data Lake, So I Don’t Need Data Virtualization (...
 

Semelhante a Docker Summit MongoDB - Data Democratization

Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache Spark
Caserta
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Denodo
 

Semelhante a Docker Summit MongoDB - Data Democratization (20)

La Modernizzazione dei Dati come base per la Trasformazione Digitale
La Modernizzazione dei Dati come base per la Trasformazione DigitaleLa Modernizzazione dei Dati come base per la Trasformazione Digitale
La Modernizzazione dei Dati come base per la Trasformazione Digitale
 
Introduction: Relational to Graphs
Introduction: Relational to GraphsIntroduction: Relational to Graphs
Introduction: Relational to Graphs
 
PgConf 2018 - Postgres in a World of DevOps
PgConf 2018 - Postgres in a World of DevOpsPgConf 2018 - Postgres in a World of DevOps
PgConf 2018 - Postgres in a World of DevOps
 
Confluent & MongoDB APAC Lunch & Learn
Confluent & MongoDB APAC Lunch & LearnConfluent & MongoDB APAC Lunch & Learn
Confluent & MongoDB APAC Lunch & Learn
 
OpenSistemas Corporate Presentation
OpenSistemas Corporate PresentationOpenSistemas Corporate Presentation
OpenSistemas Corporate Presentation
 
Data Treatment MongoDB
Data Treatment MongoDBData Treatment MongoDB
Data Treatment MongoDB
 
Why Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionWhy Data Virtualization? An Introduction
Why Data Virtualization? An Introduction
 
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?¿Cómo modernizar una arquitectura de TI con la virtualización de datos?
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache Spark
 
MongoDB company and case studies - john hong
MongoDB company and case studies - john hong MongoDB company and case studies - john hong
MongoDB company and case studies - john hong
 
Big Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft AzureBig Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft Azure
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
 
Matt McIlwain opening keynote
Matt McIlwain opening keynoteMatt McIlwain opening keynote
Matt McIlwain opening keynote
 
Evolving From Monolithic to Distributed Architecture Patterns in the Cloud
Evolving From Monolithic to Distributed Architecture Patterns in the CloudEvolving From Monolithic to Distributed Architecture Patterns in the Cloud
Evolving From Monolithic to Distributed Architecture Patterns in the Cloud
 
Cloud Seeding
Cloud SeedingCloud Seeding
Cloud Seeding
 
Roadmap for Enterprise Graph Strategy
Roadmap for Enterprise Graph StrategyRoadmap for Enterprise Graph Strategy
Roadmap for Enterprise Graph Strategy
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)
 
SegmentOfOne
SegmentOfOneSegmentOfOne
SegmentOfOne
 

Último

CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
mohitmore19
 

Último (20)

10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdf
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 

Docker Summit MongoDB - Data Democratization

  • 1. Docker Summit Containerization and Data Democratization for Cloud Native apps Chris Grabosky – Solutions Architect @MongoDB gsky.us | github.com/graboskyc | linkedin.com/in/chris-grabosky
  • 2. “name”: “Chris Grabosky”, “employment_history”: [ { “company”: “MongoDB”, “job”: “Sr. Solutions Architect”, “from”: new Date(“2018-08”), “background”: [“operational workloads”, “BI”, “application modernization”, “devops”] },{ “company”: “QualiSystems”, “background”: [“devops”, “cloud management”, “infrastructure”, “automation”, “hybrid cloud”, “containerization”, “virtualization”] },{ “company”: “Comcast”, “background”: [“network automation”,“devops”, “app dev”] } ],“education”: [ { “degree”: “BS”, “field”: “IS”, “school”: “Drexel” }, { “degree”: “MS”, “field”: “IS”, “school”: “Drexel” } ], “likes”: [“family”, “programming”, “paintball”], “links”: [“gsky.us”, “github.com/graboskyc”, “linkedin.com/in/chris-grabosky/”], About Me
  • 3. The world has changed
  • 4. Large Data Center Virtual Machines VMs in the Cloud (EC2) Containers Serverless & Services Manage H/W Manage less H/W Size & provision VMs Size & provision containers Just send in requests Wasted resources Better utilization Rent, not buy Rent with less waste Pay as you go $$$$$ $$$$ $$$ $$ $ Evolution of Computing Models
  • 5. 5 So you decided to build your app with MongoDB and Docker… • Easy: Work with data in a natural, intuitive way • Flexible: Adapt and make changes quickly • Fast: Get great performance with less code • Versatile: Supports a wide variety of data models and queries • Modern platform for all applications • Faster time to market • Developer productivity • Developer velocity • IT infra reduction • IT ops efficiency • Faster issue resolution
  • 6. 6 Focus on Developers Focus on Time to Market Focus on What Matters Same in Dev as Prod So you decided to build your app with MongoDB and Docker…
  • 7. 7 FROM mongo RUN apt-get clean && apt-get update && apt-get install -y git nodejs npm COPY source dest RUN mongod& CMD npm run start So, let’s start simply…
  • 8. 8 Everything works great until a new business requirement: Analytics Congrats! Your application is a success!
  • 9. For more than a decade, organizations have been pursuing the promise of Digital Transformation
  • 10. 88% CIOs believe they have yet to benefit from their digital strategy Source: Harvey Nash / KPMG CIO Survey 2017
  • 12. A New Wave of Industries Are Under Attack
  • 13. For Example, the Auto/Transportation Industry “The source of the value is no longer the vehicle itself” Millennials don’t care about owning cars People will always need to get from “A” to “B” Must find new sources of product/service value “Cars as Product” to “Transportation as a Service”
  • 14. Disruptors Force Incumbents to Transform Disruptors Incumbents Consumer Appliances/Industrial Manufacturer Tire Manufacturer Internet/Marketing Known For...
  • 15. but the DATAS I L O E D | C O M P L E X | T R A P P E D We accelerated time to value to get new apps and new features to our customers is still
  • 16. Code user authentication Code data access controls Provision backend server Install runtime environment Add code to make backend HA Add code to scale backend Monitor & manage backend infrastructure Code REST API for frontend to use backend Code backend application logic Code application frontend Code against each external service API Continuously poll database for changes Old World Simple JSON Config Handled automatically by services Code frontend using single SDK/API to access backend services New World Backend Data Access Frontend Provide code for Functions
  • 17. Data Democratization: Getting data to the right people in the right format at the right time so that they may turn it into actionable information, knowledge, and wisdom.
  • 18. 18 Data Democratization – What and Why? “We spend more time shoveling coal than steering the ship. We want to shift our energy to looking at the data and navigating where we are going.” -- Robert Kagarise, director of population health informatics and IT for the Delaware Valley Accountable Care Organization What it is... Why it exists... Data and insights accessible to all who need them Data is siloed/complex, making it difficult to reconcile and deliver insights to consumers of varying technical abilities No technical barriers to accessing the data and insights Time intensive for IT/Sys Admin/Data Stewards to wrangle data, delaying insights and their business benefit Proper access control and governance Security is non-negotiable (esp. in healthcare)
  • 19. 19 Data Democratization – Who and How? Technicians • Create foundation for data that’s easy to organize and enrich over time • Reduce complexity and technical debt • Define security logic that works across orgs and apps Developers • Connect to the data via APIs & microservices vs duplication or tight coupling to monoliths Analysts • Utilize self- service analytic s through tools Front-LineWorkers • Access insights via pre-built dashboards or embedded in apps
  • 20. 20 Both the Opportunity and the Challenge is Data... Patients Members Billing Physicians Hospitals Pharmacy Lab Results Procedures Medications Sales Enrollment Claims Web Mobile Social ...but data across the ecosystem is shaped very differently
  • 21. 21 Tabular (Relational) Data Model Related data split across multiple records and tables Document Data Model Related data contained in a single, rich document { "_id" : ObjectId("5ad88534e3632e1a35a58d00"), "name" : { "first" : "John", "last" : "Doe" }, "address" : [ { "location" : "work", "address" : { "street" : "16 Hatfields", "city" : "London", "postal_code" : "SE1 8DJ"}, "geo" : { "type" : "Point", "coord" : [ 51.5065752,-0.109081]}}, + {...} ], "phone" : [ { "location" : "work", "number" : "+44-1234567890"}, + {...} ], "dob" : ISODate("1977-04-01T05:00:00Z"), "retirement_fund" : NumberDecimal("1292815.75") } Need to Think About the Data Differently
  • 22. 22 Tabular (Relational) Data Model Related data split across multiple records and tables Document Data Model Related data contained in a single, rich document { "_id" : ObjectId("5ad88534e3632e1a35a58d00"), "name" : { "first" : "John", "last" : "Doe" }, "address" : [ { "location" : "work", "address" : { "street" : "16 Hatfields", "city" : "London", "postal_code" : "SE1 8DJ"}, "geo" : { "type" : "Point", "coord" : [ 51.5065752,-0.109081]}}, + {...} ], "phone" : [ { "location" : "work", "number" : "+44-1234567890"}, + {...} ], "dob" : ISODate("1977-04-01T05:00:00Z"), "retirement_fund" : NumberDecimal("1292815.75") } This becomes complex & rigid to change
  • 23. 23 How to Approach This Transformation? Data Layer Optimization: • unlocks the value of data stored in silos and legacy systems • drives rapid, iterative integration of data sources for new and existing consuming applications • supports enterprise data governance efforts • builds deep technical expertise and best practices Data Source Integration Data APIs Data Loading and Streaming Legacy Systems Offloading Data Analytics Data Governance
  • 24. Luckily, we started with two highly scalable and portable solutions
  • 25. 25 Freedom to run anywhere Local On-premises Server & Mainframe Private cloud Fully managed cloud serviceHybrid cloud Public cloud • Database that runs the same everywhere • Leverage the benefits of a multi-cloud strategy • Global coverage • Avoid lock-in Convenience: same codebase, same APIs, same tools, wherever you run
  • 26. 26 Docker Deployment Options with MongoDB Docker Container with MongoD Swarm & Compose with Rep Set Docker & K8S with RS / SC & K8S Operator in Ops Mgr Cloud-native & MongoDB Atlas
  • 27. 27 MongoDB Solution: Replica Sets Replica Set – up to 50 nodes Self-healing Data Center Aware Addresses availability considerations: • High Availability • Disaster Recovery • Maintenance Workload Isolation: operational & analytics Application Driver Primary Secondary Secondary Replication
  • 28. 28 Put data where you need it: Workload Isolation Enable different workloads on the same data • Combine operational and analytical workloads on a single data platform • Extract live insights from real-time data to enrich applications • One set of nodes serving operational apps, replicating to dedicated nodes serving analytics: up to 50 nodes in a single replica set • ETL-free
  • 29. 10 ms 2 ms 45 ms 35 ms 85 ms 75 ms 105 ms
  • 30. 10 ms 2 ms 10 ms 4 ms 4 ms 2 ms 10 ms 2 ms
  • 31. 31 Put data where you need it: Scalability with Sharding Auto-Sharding ● Automatically scale beyond the constraints of a single node ● Application transparent ● Scale and rebalance incrementally, in real time ● Unlike NoSQL systems that randomly spray data across a cluster, MongoDB exposes multiple data distribution policies to optimize for query patterns and locality •••Shard 1 Shard 2 Shard 3 Shard N ● Multiple sharding policies: hashed, ranged, zoned ● Increase or decrease capacity as you go ● Automatic balancing for elasticity Horizontally Scalable
  • 32. 32 Rich Query Language JSON Documents Tabular Key-Value Text GraphGeospatial SQL (BI-Connector)
  • 33. Declarative Data Access Rules Fine-grained data access controls Authorize the user, not just the app
  • 34. 34 ConsumersAPI Layer Declarative Access Controls basicProfile (Read Rule) { "%or": [ {"$$pipeline.currentRole": "Nurse"}, {"$$pipeline.currentRole": "Doctor"} ] } basicProfile (Write Rule) {"$$pipeline.currentRole": "Doctor"} completeHistory (Read/Write Rules) {"$$pipeline.currentRole": "Doctor"} Doctor Scientist Nurse {Aggregated Data} API Layer Declarative Access Controls Pipeline for Analysis {"$$pipeline.currentRole": "Scientist"} Optimized Data Layer Patient Profile { "patient_id": … , "basicProfile": {…} , "completeHistory": {…} } {Read Basic Data} {Read/Write All} ● Fine-grained data access controls ● Base access on document, field, or value ● Authorize the user, not just the app ● Associate with user profile or any other info ● Defined with JSON rules, not code API Layer is about more than just serving data API Calls
  • 35. • 24% CAGR: Hadoop, Spark & Streaming • 18% CAGR: Databases • Databases are key components within the big data landscape “Big Data” is More than Just Hadoop
  • 36. Distributed processing/analytics Custom analytics workload Custom analytics workload MDB Query Lang BI Connector
  • 37. Thank You Chris Grabosky gsky.us | github.com/graboskyc | linkedin.com/in/chris-grabosky/ chris.grabosky@mongodb.com cloud.mongodb.com | mongodb.com/download-center