Strategic Advisor, Technical Evangelist, Data Vault Master and Certified Data Vault 2.0 Architect (CDVP2), Oracle ACE Director (Alumni), Data Architect, Trainer at Data Warrior
Dec 15, 2021 • 0 likes • 1,397 views
Data Mesh for Dinner
Wonder what this data mesh stuff is all about? What are the principles of data mesh? Can you or should you consider data mesh as the approach for your analytics platform? And most important - how can Snowflake help?
Given in Montreal on 14-Dec-2021
8. Data Mesh: Domain-centric Architecture
[Diagram: data sources from different domains → data domains → consumers]
• Domain-centric ownership of data sources, pipelines, and data quality
• Ownership sits with domain knowledge --> better data quality for consumers
• Domain teams can react faster to source format changes or quality issues
• Overall easier to scale the number of sources & consumers
• Consumers pull from one or more domains
• Data assets offered as products
• “Serve & pull” instead of “push & ingest” model
[Diagram labels: Data Domains 1 to 6, each with its own ETL/ELT pipelines and data model, on a shared layer of Interoperability Standards, Federated Governance, Data Catalog]
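The “serve & pull” idea from this slide can be illustrated with a minimal Python sketch. It is not taken from the deck: the names `DataProduct`, `Domain`, `serve`, and `pull`, and the sample rows, are hypothetical and only model the pattern of domains owning their products while consumers pull from one or more of them.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

Row = Dict[str, object]


@dataclass
class DataProduct:
    """A data asset that a domain offers as a product (hypothetical model)."""
    name: str
    owner: str                      # the domain that owns pipeline and quality
    read: Callable[[], List[Row]]   # evaluated at pull time, not at serve time


@dataclass
class Domain:
    """A data domain that serves products instead of pushing them downstream."""
    name: str
    products: Dict[str, DataProduct] = field(default_factory=dict)

    def serve(self, product_name: str, read: Callable[[], List[Row]]) -> None:
        self.products[product_name] = DataProduct(product_name, self.name, read)


def pull(domains: Dict[str, Domain],
         wanted: List[Tuple[str, str]]) -> Dict[str, List[Row]]:
    """Consumer side: pull the named products from one or more domains."""
    return {
        f"{domain}.{product}": domains[domain].products[product].read()
        for domain, product in wanted
    }


# Hypothetical example data: two domains, one consumer pulling from both.
orders = [{"order_id": 1, "customer_id": 7}]
sales = Domain("sales")
sales.serve("orders", lambda: list(orders))
customer = Domain("customer")
customer.serve("profiles", lambda: [{"customer_id": 7, "segment": "retail"}])

mesh = {d.name: d for d in (sales, customer)}
orders.append({"order_id": 2, "customer_id": 7})  # source changes after serving
pulled = pull(mesh, [("sales", "orders"), ("customer", "profiles")])
```

Because `read` runs when the consumer pulls, the consumer sees the row added after the product was served; nothing was pushed into a consumer-side store.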
14. Data Mesh Reference Architecture
[Diagram: data sources feed the domains Customer, Sales, Products, Marketing, and Customer 360 inside the Snowflake Data Cloud; an inventory of shared data products connects the domains to consumers, including through a Snowflake Reader Account. Foundation layer: Interoperability Standards, Federated Governance, 3rd Party Tools]
Snowflake Data Sharing as the preferred interoperability standard. Data Marketplace makes data discoverable.
Data Marketplace / Catalog:
• Connects providers to consumers
• Inventory of available assets
• No central storage of shared data
• Providers retain full control over shared assets (data, functions)
• Consumers access live provider data, no copies or ETL required
Data domains:
• Can consume and share data or functions
• Control access policies, data masking, etc. for downstream consumers
• Can share external tables, i.e. provide access to data outside of Snowflake
• Can provide reader accounts for non-Snowflake consumers
Data Consumers:
• Register shared data for local SQL access in their environment (no copy)
[Diagram labels: Snowflake Data Marketplace or 3rd-party catalog; 3rd party marketing agency; Reseller; Sales Analysts; Churn & Retention; Business optimization; Finance & Controlling]
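The sharing behaviour listed on this slide (no central storage, providers keep control, consumers read live data without a copy) can be mimicked in a few lines of plain Python. This is only a conceptual sketch and does not use or represent any Snowflake API; the `Share` class, the `catalog` dictionary, and the sample data are assumptions made for illustration.

```python
from typing import Callable, Dict, List, Optional, Set

Row = Dict[str, object]


class Share:
    """A provider-controlled handle on a table (hypothetical model).

    Consumers query through the handle; they never hold the table itself.
    """

    def __init__(self, provider: str, table: List[Row],
                 mask: Optional[Callable[[Row], Row]] = None) -> None:
        self.provider = provider
        self._table = table                    # reference to the provider's data
        self._mask = mask or (lambda row: row)
        self._grants: Set[str] = set()

    def grant(self, consumer: str) -> None:
        self._grants.add(consumer)

    def revoke(self, consumer: str) -> None:
        self._grants.discard(consumer)

    def query(self, consumer: str) -> List[Row]:
        if consumer not in self._grants:
            raise PermissionError(f"{consumer} has no access to this share")
        # Masking runs per query on a per-row copy, so the source stays intact.
        return [self._mask(dict(row)) for row in self._table]


# The catalog is only an inventory of handles; it stores no shared data itself.
catalog: Dict[str, Share] = {}

customers = [{"id": 1, "email": "a@example.com"}]
catalog["customer.profiles"] = Share(
    "customer", customers, mask=lambda row: {**row, "email": "***"}
)
catalog["customer.profiles"].grant("sales_analysts")

linked = catalog["customer.profiles"]        # consumer registers the share locally
customers.append({"id": 2, "email": "b@example.com"})   # provider adds a row
live_rows = linked.query("sales_analysts")   # consumer sees it without any ETL

catalog["customer.profiles"].revoke("sales_analysts")   # provider keeps control
try:
    linked.query("sales_analysts")
    revoked = False
except PermissionError:
    revoked = True
```

The design point is that the consumer holds a reference, not data: a later provider-side change or revocation takes effect on the consumer's next query.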
16. Global and Multi-Cloud Data Mesh
[Diagram: Data Domains 1 to 5 and consumers, including a Snowflake Reader Account, spread across regions such as US East and FRA. Foundation layer: Interoperability Standards, Federated Governance, 3rd Party Tools]
Snowflake enables a truly global and multi-cloud data mesh across cloud platforms and regions.
• Data sources, data domains, and consumers can sit in different regions and different cloud platforms
• Snowflake enables a truly global and multi-cloud data mesh
[Diagram labels: Tokyo; Zurich; Data Sources; Snowflake Data Cloud; Inventory of shared data products; Snowflake Data Marketplace or 3rd-party catalog]
17. Data Mesh Reference Architecture: Governance
[Diagram: data sources feed Data Domains 1 to 5 in the Snowflake Data Cloud; consumers, including a Snowflake Reader Account, connect to the domains. Foundation layer: Interoperability Standards, Federated Governance, 3rd Party Tools]
In the Snowflake Data Cloud, governance follows the data!
[Diagram: numbered markers 1 to 6; arrows labeled "Create listing" and "Create local link" (twice)]
Data Mesh with Federated Governance:
• Domain 2 shares a data asset by creating a listing in the marketplace
• Domain 2 also defines access restrictions on that data asset, based on consumer roles and other attributes
• Domain 5 and consumer 3 discover the asset in the inventory and link to it in their local environment for live SQL access against the table in domain 2
• Domain 5 and consumer 3 are subject to the access restrictions defined by domain 2
• Domain 5 transforms the data, defines additional access restrictions, and shares a copy with consumer 6
• Consumer 6 can use the data, subject to the propagated access restrictions that domain 5 has added.
[Diagram label: Inventory of shared data-as-a-product assets via Data Marketplace or third-party tools]
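The federated-governance flow above can be walked through with a small Python model. It is a sketch under stated assumptions, not a description of how Snowflake enforces policies: the `Asset` class, the attribute names `region` and `role`, and the sample rows are hypothetical, and the sketch assumes that restrictions on an upstream asset carry over to anything derived from it, which is one reading of "governance follows the data".

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

Consumer = Dict[str, str]
Policy = Callable[[Consumer], bool]


@dataclass
class Asset:
    """A shared data asset with an owner-defined access policy (hypothetical)."""
    name: str
    owner: str
    rows: List[dict]
    policy: Policy
    upstream: Optional["Asset"] = None   # the asset this one was derived from

    def effective_policies(self) -> List[Policy]:
        # Assumption: upstream restrictions propagate to derived assets.
        inherited = self.upstream.effective_policies() if self.upstream else []
        return inherited + [self.policy]

    def query(self, consumer: Consumer) -> List[dict]:
        if not all(policy(consumer) for policy in self.effective_policies()):
            raise PermissionError(f"{consumer['name']} may not read {self.name}")
        return list(self.rows)


def can_read(asset: Asset, consumer: Consumer) -> bool:
    try:
        asset.query(consumer)
        return True
    except PermissionError:
        return False


# Hypothetical actors, described by the attributes the policies look at.
domain_5 = {"name": "domain_5", "region": "EU", "role": "engineer"}
consumer_3 = {"name": "consumer_3", "region": "EU", "role": "analyst"}
consumer_6 = {"name": "consumer_6", "region": "EU", "role": "analyst"}
outsider = {"name": "outsider", "region": "US", "role": "analyst"}

# Domain 2 lists an asset and restricts it by a consumer attribute.
orders = Asset(
    "orders", "domain_2",
    rows=[{"order_id": 1, "amount": 120}, {"order_id": 2, "amount": 80}],
    policy=lambda c: c["region"] == "EU",
)

# Domain 5 links to it, transforms it, and adds its own restriction.
order_totals = Asset(
    "order_totals", "domain_5",
    rows=[{"total": sum(r["amount"] for r in orders.query(domain_5))}],
    policy=lambda c: c["role"] == "analyst",
    upstream=orders,
)

totals_for_6 = order_totals.query(consumer_6)
```

In this model `consumer_6` passes both the restriction from domain 2 and the one added by domain 5, while the hypothetical `outsider` is rejected by the inherited domain 2 restriction even though it satisfies the domain 5 one.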
24. SOURCES
Zhamak's initial article introducing the data mesh concept: https://martinfowler.com/articles/data-monolith-to-mesh.html
Sven Balnojan's article on how one could implement a data mesh: https://towardsdatascience.com/data-mesh-applied-21bed87876f2
Zhamak's second article, delving deeper: https://martinfowler.com/articles/data-mesh-principles.html
Barr Moses' article on Data Mesh basics: https://towardsdatascience.com/what-is-a-data-mesh-and-how-not-to-mesh-it-up-210710bb41e0
Zhamak Dehghani | Kafka Summit Europe 2021 Keynote: How to Build the Data Mesh Foundation: https://www.youtube.com/watch?v=QF41q10NSAs
https://datameshlearning.github.io/intro-to-data-mesh/