SlideShare uma empresa Scribd logo
1 de 25
Healthcare Considerations for Modern Data
Architectures
Pitfalls, Challenges and Best Practices
Data Day Health 2017
Presented by:
Toby Owen,VP Product Development
OnRamp
- Industry leading high security and hybrid hosting provider
- Operates multiple enterprise class data centers located in Austin,
Texas and Raleigh, North Carolina
- SSAE 16 SOC II and SOC 3 Audited, PCI and HIPAA compliant
company
- Specializes in helping organizations meet their rigorous
compliance requirement and keep their data safe
Toby Owen
- Vice President, Product Development, OnRamp
- 20 year IT veteran with operations and engineering
background
- Security, IT ops at scale, hybrid cloud, compliant
workload hosting
AGENDA
GOAL: Designing an app for Healthcare… that’s compliant!
Data Stores
App Design
Where to Run It
Dev Lifecycle
Takeaways
Q & A
Refresher on (or intro to) databases
CAP theorem
C = Consistency
A = Availability
P = PartitionTolerance
Database ReferenceGuide – at a glance
*Adapted from http://blog.nahurst.com/visual-guide-to-nosql-systems
Why do we care?
• Scaling vertically versus horizontally
- Costs of scaling up can grow exponentially
- Scaling horizontally is linear
- Limits to scaling vertically, “indefinite” horizontal
scale limit
• Data sources are increasingly distributed
• Horizontal scaling provides better geo-
resiliency at the same time
• Not all data needs strict ACID compliance
 More arguments favor distributed data stores
RDBMS and ACID
• Definition: Atomicity, Consistency, Isolation, Durability
• Favors Consistency overAvailability
• Examples
- MSSQL
- MySQL
- Postgres
- Greenplum
- VoltDB
Is scalability and ACID a false tradeoff?
• Scalability and ACID are difficult to satisfy at the same time
• Not all data requires strict ACID compliance
• Relational can be a bottleneck
- Simpler models might simplify operations – easier and more efficient
• New relational DBs can be very fast AND scalable
• Many NoSQL DB’s adding features to look more like RDBMS
• Take-away: understand your data (shape and use case) and pick the
right solution
NoSQL and BASE
• NoSQL Definition
- SOME of the following: non-relational, distributed, open-source, horizontally
scalable, schema free, easy replication support, simple API
• BASE Definition: Basically Available, Soft state, Eventual consistency
- All data reads will eventually yield the same result
• Favors Availability over Consistency
• Let’s focus some time here exploring NoSQL databases/datastores
- Considerations based on scalability, encryption and key management
• Document oriented Database (JSON). Considered “semi-structured” data
• Scalability - built in via automatic sharding (range, hash, zone)
- EA FIFA game (250+ servers),Yandex (10’s billion objects,TBs of data, growing at 10MM files uploads/day)
• Security – encryption in-transit
- SSL/TLS client support (data in-transit)
- MongoDB Enterprise Advanced supports FIPS 140-2
- Atlas (Mongo-aaS on Amazon) does NOT support FIPS mode
• Security – encryption at-rest
- App level, external filesystem, disk level, or natively (encrypted storage engine). Native suports FIOPS 140-2
• Security – key management
- Each DB has a separate Key
- Can be integrated with external KMS
- Supports key rotation without downtime (via rolling restarts of replica set)
- Native encryption is only available via Enterprise Advanced version!
• Row-oriented
• Scalability – peer-to-peer distributed system, data across all nodes
- Each node contains commit log, exchanges data across cluster every second
- All writes are automatically partitioned and replicated throughout cluster
- Apple (75,000 nodes, 10PB); Netflix (2,500 nodes, 420TB, 1 trillion requests/day)
• Security – encryption in-transit
- SupportsTLS/SSL, separate configs for client-server and server-server
- FIPS compliance supported
• Security – encryption at-rest
- Open-sourceCassandra relies on filesystem encryption
- Datastax (commercial version) supports at-rest encryption
• Security – key management
- Open-sourceCassandra relies on filesystem encryption’s key management tools (can be complex)
- Datastax (commercial version) has native KMIP support
• Not really a database – distributed filesystem (HDFS) plus application interface (MapReduce)
• Scalability – designed for large file distribution across 100’s and 1000’s of servers, streaming
access and large data sets
- (compute cheaper to move than data)
- Facebook (21PB, 2000 machines), Spotify (1300 nodes, 42PB storage, 20TB a day ingested, 200TB a day
generated by Hadoop)
• Security – encryption in-transit
- HDFS supports transparent encryption
• Security – encryption at-rest
- Supported by HDFS, application, database, or disk-level
- Lots of options for commercial support and tools to simplify management
• Security – key management
- Natively supports it’s own KMS
- Again, more commercial options exist to simplify
LOTS of others
• KeyValue
- Redis
- DynamoDB
• Document Oriented
- CouchDB
- DocumentDB
• Time Series
• Graph
• + 225 more! (nosql-database.org for basic info and comparisons)
So you’ve chosen your datastore(s)
Now what?
Application architecture!
Application design
SOME Considerations for HIPAA and HITECH
• HITECH – each app zone requires firewall isolation
- Web, app, database
• Key Management
- Key Management System (KMS)
- Hardware Security Module (HSM)
- Keys database
- Key splitting – for transferring clear-text cipher keys
ReferenceArchitecture
And more
• Many other security considerations around compliant application
architecture
- Shared storage resources and shared IaaS
Supporting encryption at-rest may not be enough to achieve HIPAA or HITRUST
compliance.
- Verifiable (compliant) destruction of data in a shared environment
- Encryption keys need to be managed in accordance with shared secrets or
‘key splitting’ schemes (e.g. Shamir’s secret sharing)
Next?
We’ve chosen the right datastores…
We’ve designed our application to
support HITRUST or HIPAA…
Where will the app run?
Hybrid is the likely reality
• Consuming 3rd party data sources
• Capabilities of each data or app
component provider
• BAA with each provider
• Peril of failing to plan
How to keep all this compliant?
• Lots to consider to get it right
• Start at the beginning – your development
lifecycle
• Automate everything
• Dev/Test/Staging/Production should all account
for secure design
• Use Containers ?
• Maybe get some help
KeyTakeaways
• Distributed data is becoming the new norm
• Data is different – data usage should dictate data technology
- (no one-size-fits-all)
• ApplicationArchitecture is key to achieving compliance
• Must consider all locations where app is running
• Consider compliance in all phases of app development (starting with
design)
• Automation in development pipeline is key to building-in and maintaining
compliance throughout app lifecycle
• Final consideration – are you now a service provider?
Toby Owen
VP, Product Development
OnRamp
towen@onr.com
@tobydowen
linkedin.com/in/tobyowen
Resources
• Databases and scaling:
- http://stackoverflow.com/questions/12215002/why-are-relational-databases-having-scalability-issues
- http://blog.nahurst.com/visual-guide-to-nosql-systems
- http://nosql-database.org/
• MongoDB
- https://www.mongodb.com/mongodb-architecture
- https://webassets.mongodb.com/_com_assets/collateral/MongoDB_Security_Architecture_WP.pdf
• Cassandra
- http://cassandra.apache.org/doc/latest/operating/security.html?highlight=encryption
- http://stackoverflow.com/questions/32584253/how-to-use-cassandra-with-tde-transparent-data-encryption
- http://dba.stackexchange.com/questions/6909/cassandra-encryption-at-rest
- http://www.datastax.com/products/datastax-enterprise
• Hadoop
- https://hadoop.apache.org/docs/r2.7.2/hadoop-project-dist/hadoop-hdfs/TransparentEncryption.html
- Hadoop at Scale: Spotify http://cdn.oreillystatic.com/en/assets/1/event/118/The%20Evolution%20of%20Hadoop%20at%20Spotify-
%20Through%20Failures%20and%20Pain%20Presentation.pdf
• Key management
- https://en.wikipedia.org/wiki/Shamir%27s_Secret_Sharing

Mais conteúdo relacionado

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 

Destaque

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Destaque (20)

Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 

Data Day Health IT - Data Architecture

  • 1. Healthcare Considerations for Modern Data Architectures Pitfalls, Challenges and Best Practices Data Day Health 2017 Presented by: Toby Owen,VP Product Development
  • 2. OnRamp - Industry leading high security and hybrid hosting provider - Operates multiple enterprise class data centers located in Austin, Texas and Raleigh, North Carolina - SSAE 16 SOC II and SOC 3 Audited, PCI and HIPAA compliant company - Specializes in helping organizations meet their rigorous compliance requirement and keep their data safe Toby Owen - Vice President, Product Development, OnRamp - 20 year IT veteran with operations and engineering background - Security, IT ops at scale, hybrid cloud, compliant workload hosting
  • 3.
  • 4. AGENDA GOAL: Designing an app for Healthcare… that’s compliant! Data Stores App Design Where to Run It Dev Lifecycle Takeaways Q & A
  • 5. Refresher on (or intro to) databases CAP theorem C = Consistency A = Availability P = PartitionTolerance
  • 6. Database ReferenceGuide – at a glance *Adapted from http://blog.nahurst.com/visual-guide-to-nosql-systems
  • 7. Why do we care? • Scaling vertically versus horizontally - Costs of scaling up can grow exponentially - Scaling horizontally is linear - Limits to scaling vertically, “indefinite” horizontal scale limit • Data sources are increasingly distributed • Horizontal scaling provides better geo- resiliency at the same time • Not all data needs strict ACID compliance  More arguments favor distributed data stores
  • 8. RDBMS and ACID • Definition: Atomicity, Consistency, Isolation, Durability • Favors Consistency overAvailability • Examples - MSSQL - MySQL - Postgres - Greenplum - VoltDB
  • 9. Is scalability and ACID a false tradeoff? • Scalability and ACID are difficult to satisfy at the same time • Not all data requires strict ACID compliance • Relational can be a bottleneck - Simpler models might simplify operations – easier and more efficient • New relational DBs can be very fast AND scalable • Many NoSQL DB’s adding features to look more like RDBMS • Take-away: understand your data (shape and use case) and pick the right solution
  • 10. NoSQL and BASE • NoSQL Definition - SOME of the following: non-relational, distributed, open-source, horizontally scalable, schema free, easy replication support, simple API • BASE Definition: Basically Available, Soft state, Eventual consistency - All data reads will eventually yield the same result • Favors Availability over Consistency • Let’s focus some time here exploring NoSQL databases/datastores - Considerations based on scalability, encryption and key management
  • 11. • Document oriented Database (JSON). Considered “semi-structured” data • Scalability - built in via automatic sharding (range, hash, zone) - EA FIFA game (250+ servers),Yandex (10’s billion objects,TBs of data, growing at 10MM files uploads/day) • Security – encryption in-transit - SSL/TLS client support (data in-transit) - MongoDB Enterprise Advanced supports FIPS 140-2 - Atlas (Mongo-aaS on Amazon) does NOT support FIPS mode • Security – encryption at-rest - App level, external filesystem, disk level, or natively (encrypted storage engine). Native suports FIOPS 140-2 • Security – key management - Each DB has a separate Key - Can be integrated with external KMS - Supports key rotation without downtime (via rolling restarts of replica set) - Native encryption is only available via Enterprise Advanced version!
  • 12. • Row-oriented • Scalability – peer-to-peer distributed system, data across all nodes - Each node contains commit log, exchanges data across cluster every second - All writes are automatically partitioned and replicated throughout cluster - Apple (75,000 nodes, 10PB); Netflix (2,500 nodes, 420TB, 1 trillion requests/day) • Security – encryption in-transit - SupportsTLS/SSL, separate configs for client-server and server-server - FIPS compliance supported • Security – encryption at-rest - Open-sourceCassandra relies on filesystem encryption - Datastax (commercial version) supports at-rest encryption • Security – key management - Open-sourceCassandra relies on filesystem encryption’s key management tools (can be complex) - Datastax (commercial version) has native KMIP support
  • 13. • Not really a database – distributed filesystem (HDFS) plus application interface (MapReduce) • Scalability – designed for large file distribution across 100’s and 1000’s of servers, streaming access and large data sets - (compute cheaper to move than data) - Facebook (21PB, 2000 machines), Spotify (1300 nodes, 42PB storage, 20TB a day ingested, 200TB a day generated by Hadoop) • Security – encryption in-transit - HDFS supports transparent encryption • Security – encryption at-rest - Supported by HDFS, application, database, or disk-level - Lots of options for commercial support and tools to simplify management • Security – key management - Natively supports it’s own KMS - Again, more commercial options exist to simplify
  • 14. LOTS of others • KeyValue - Redis - DynamoDB • Document Oriented - CouchDB - DocumentDB • Time Series • Graph • + 225 more! (nosql-database.org for basic info and comparisons)
  • 15. So you’ve chosen your datastore(s) Now what? Application architecture!
  • 16. Application design SOME Considerations for HIPAA and HITECH • HITECH – each app zone requires firewall isolation - Web, app, database • Key Management - Key Management System (KMS) - Hardware Security Module (HSM) - Keys database - Key splitting – for transferring clear-text cipher keys
  • 18. And more • Many other security considerations around compliant application architecture - Shared storage resources and shared IaaS Supporting encryption at-rest may not be enough to achieve HIPAA or HITRUST compliance. - Verifiable (compliant) destruction of data in a shared environment - Encryption keys need to be managed in accordance with shared secrets or ‘key splitting’ schemes (e.g. Shamir’s secret sharing)
  • 19. Next? We’ve chosen the right datastores… We’ve designed our application to support HITRUST or HIPAA… Where will the app run?
  • 20. Hybrid is the likely reality • Consuming 3rd party data sources • Capabilities of each data or app component provider • BAA with each provider • Peril of failing to plan
  • 21. How to keep all this compliant? • Lots to consider to get it right • Start at the beginning – your development lifecycle • Automate everything • Dev/Test/Staging/Production should all account for secure design • Use Containers ? • Maybe get some help
  • 22. KeyTakeaways • Distributed data is becoming the new norm • Data is different – data usage should dictate data technology - (no one-size-fits-all) • ApplicationArchitecture is key to achieving compliance • Must consider all locations where app is running • Consider compliance in all phases of app development (starting with design) • Automation in development pipeline is key to building-in and maintaining compliance throughout app lifecycle • Final consideration – are you now a service provider?
  • 23.
  • 24. Toby Owen VP, Product Development OnRamp towen@onr.com @tobydowen linkedin.com/in/tobyowen
  • 25. Resources • Databases and scaling: - http://stackoverflow.com/questions/12215002/why-are-relational-databases-having-scalability-issues - http://blog.nahurst.com/visual-guide-to-nosql-systems - http://nosql-database.org/ • MongoDB - https://www.mongodb.com/mongodb-architecture - https://webassets.mongodb.com/_com_assets/collateral/MongoDB_Security_Architecture_WP.pdf • Cassandra - http://cassandra.apache.org/doc/latest/operating/security.html?highlight=encryption - http://stackoverflow.com/questions/32584253/how-to-use-cassandra-with-tde-transparent-data-encryption - http://dba.stackexchange.com/questions/6909/cassandra-encryption-at-rest - http://www.datastax.com/products/datastax-enterprise • Hadoop - https://hadoop.apache.org/docs/r2.7.2/hadoop-project-dist/hadoop-hdfs/TransparentEncryption.html - Hadoop at Scale: Spotify http://cdn.oreillystatic.com/en/assets/1/event/118/The%20Evolution%20of%20Hadoop%20at%20Spotify- %20Through%20Failures%20and%20Pain%20Presentation.pdf • Key management - https://en.wikipedia.org/wiki/Shamir%27s_Secret_Sharing

Notas do Editor

  1. C= Consistency – a read request gets the most recent write (or an error) A=Availability – Every request receives a response, which may or may not be the most recent version of the record P=Partition Tolerance – system operates despite a network partition (messages being dropped or delayed)   In the presence of a network partition, one has to choose between consistency and availability.
  2. a transaction is all or nothing, DB stays in a valid state, transactions are separate and non-interfering, and commits stay committed
  3. Lots of databases are built on top of Hadoop, or are building integrated connectivity to hadoop
  4. Key Management System (KMS) – software used to store and provide access to encryption keys. Utilizes Local Master Key to encrypt keys stored locally. Must be under dual control. Hardware Security Module (HSM) – tamper-resistant hardware specialized for key management – generation, export, ciphering and storage of encryption keys. The HSM must be at a minimum certified and configured at FIPS 140-2 Level3. Keys database – key management database - must be encrypted and able to support all attributes needed for the key lifecycle management. Databases managing Live and Test keys must be physically separate and utilize different HSMs. Key splitting – usually in 3 parts, each part has it’s own custodian (on both sides of the transaction)
  5. Shared storage resources that support encryption at rest may not be enough to support claims of HIPAA or HITRUST compliance. Shared IaaS means that the traditional method of managing data remanence through storage media destruction is not possible. Highly efficient storage arrays write data across many disks. Verifiable (compliant) destruction of data in a shared environment requires that each tenant of a shared storage resource be utilizing unique encryption keys. Furthermore, encryption keys need to be managed in accordance with shared secrets or ‘key splitting’ schemes in order to meet HITRUST requirements (e.g. Sharmir’s secret sharing). Ex: https://en.wikipedia.org/wiki/Shamir%27s_Secret_Sharing
  6. Likely you are consuming and/or writing to external (legacy) data sources that you don’t control – means you will be hybrid All of the above considerations need to be considered against each location where these are hosted You will need a BAA with each provider Example – you can write an app in dev on a single AMI, then you add a few data sources based on new requirements, then you need to scale to multiple compute instances. Is it auditable? You are already broken – you need to rewrite the app stack to account for proxied DB connections, etc. Consider your providers capabilities: HSM, KMS, encryption (fips certified?), key management, auditability, IAM, interoperability of all of the above Ignore these considerations at your peril – not just an app rewrite, you may need to replatform or migrate Attempting to bolt on component after component on top of public cloud resources becomes a very costly proposition. Add to that the management of audit activities related to the deployed infrastructure and you’ve pretty well sapped all you investment dollars in just purchasing a ticket to the dance. Retrofitting IT environment to address this LATER is a problem – need to design in from the start. App tier and DB tier separate – changes to how data is retrieved can be a big lift once you are in production
  7. Consider from the design phase, include compliance requirements in your entire dev lifecycle tool chain and process Automation gets you repeatability and auditability, removes human error – also forces understanding and analysis on every process Containers – allow you portability and atomic version control = predictability and audit trail, versioning The agile healthcare SaaS provider needs a managed IT partner who knows more than they do about how to run compliant infrastructure and can expose easy to consume services that accelerate scale, not hamstring it with regulatory process.
  8. am I consuming a shared resource? am I providing a shared resource?