SlideShare uma empresa Scribd logo
1 de 14
Data virtualization at
Statistics Netherlands
“Modern data integration systems: the
experience of Istat and other institutes”
Ran van den Boom
Program Manager Data Strategy | Statistics Netherlands
30.11-1.12//2021
Definition of Data Integration according to Wikipedia: “Data integration involves combining data residing in
different sources and providing users with a unified view of them”
We need to describe phenomena, we need to increase the efficiency and we have a lot of challenges to
implement all innovations. How is data integration and data virtualization going to help us?
o Why
o What
o How
o Lessons learned so far
What are the secret ingredients?
DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM
Indice / agenda
2
• Increasing demand of our statistics
• Strengthening of phenomenon-oriented
measurement & description of society such as
Covid-19 effects on the economy, on social
division
• Increase the role of CBS as data partner of the
Government
This requires trusted data, faster time-to-market,
data-driven work, and more collaboration
between various organizations.
DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM
Why: external drivers for change
3
• External drivers lead to Innovation in existing processes and future-focused trajectories (new
data sources)
• Internal processes need to become more efficient
• Being able to access more sources, also external
• Becoming more effective: understand the data
This requires innovation of our processes:
• Introduction of silos for data in rest at interfaces
• Helps to unravel process steps (data on the move) from interfaces (data in rest)
• Avoiding stove pipes, data driven instead of process driven
• More options for gathering external data sources, flexible and standardized
DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM
Why: internal drivers for change
4
DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM
Our current situation on data integration
5
(Source: Wikipedia)
Ideal database:
everything fits
DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM
Our current situation on data integration
6
(Source: Wikipedia)
Ideal database:
everything fits
In our situation several islands, with hardly any connection
• Microdata stored in Data Service Centre – but not all
• Published data stored in StatLine – but not all
• Raw data in Data collection
• Other datasets stored anywhere
2012: Data Virtualization combines disparate data sources into a
single “virtual” data layer that provides integrated data services to
consuming applications in real-time.
Could that be a solution?
DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM
Data Integration = Sharing Data
Statistical process,
maturity levels
7
1st interface: raw data
2nd interface: standardized data
3d interface: processed data
4th interface: statistics
5th interface: published data
External sources
Surveys
1. Being able to share data
2. Being able to share metadata
3. Governance
Our secret ingredients:
4. Collaboration (WII4Me), Agile approach at first
Note: this is a simplified picture
We also need Storage facilities, Privacy Preserving Techniques and tools, Tools for preparing
metadata and data for publishing, Publishing capabilities for external metadata, etc.
DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM
8
What do we need to support the Why
1st interface: raw data
2nd interface: standardized data
3d interface: processed data
4th interface: statistics
5th interface: published data
Sharing data
Metadata
Governance
Metadata
• Fit for purpose
• Patterns
• Maturity levels
• Metadata modelling
• Requirements
• Metadata Management System
DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM
9
What do we need to support the Why: the products
1st interface: raw data
2nd interface: standardized data
3d interface: processed data
4th interface: statistics
5th interface: published data
Sharing data
Metadata
Governance
Data Abstraction
Layer (DAL)
Denodo
Metadata catalog
including
taxonomy
Best practices
• People
• Knowledge
• Processes
• Organization
• Governance
Using several approaches:
• Change management
• Agile
• Project Management
• Step by step
DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM
10
How do we realize that?
Data Abstraction Layer
Source A
Source B Source C
Source D
Other metadata
(MMS)
Classifications,
codelists (CLS)
Metadata catalog and search engine
Taxonomy
Solutions
DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM
11
Demo: a Data Abstraction Layer is not abstract at all
Advantages
• One standardized method of
accessing data (SQL)
• Dataset (“view”) is created
run-time
• No copies
• Overview of all datasets
• Regardless the location of
the data or its shape/format
Disadvantages
• Source system is used for
queries
• Permissions, technical
access
MS Access to retrieve data through the DAL
Denodo Data Catalog
Designing a view
Steps to achieve these ambitions
DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM
12
Lessons learned so far: strategy
Maturity level
of data
integration
Characteristics Purpose Requires
Low Overview of datasets Find and use data DAL/Denodo
Create views
Medium Well described datasets Understanding the data
Explaining phenomena
Automated validation
Resolving inconsistencies
Metadata MMS (COTS), CLS
(own development)
High Standardized, harmonized
metadata
Metadata-driven processes
Data sharing
Automatically generate datasets
based on metadata
Information Dialog
Efficiency
Internal discussions
Time (years) and resources
Adapted processes, change
management
Tools
Here we are
Challenges
Finding use cases: nobody could imagine what it
would mean (WII4Me) – at first
Lack of expertise, technical issues
Difficult to get every one on the same line for the
governance: stability vs. autonomy led to
discussion control vs. anarchy
Too many innovations going on at the same time
Organizing this requires more than an Agile
approach
Success factors
Support from management, one goal
One Business Owner
Find the opportunities, e.g. distribution problem for
owners of many datasets; heavy consumers of
data with no overview (e.g. national accounts,
large companies)
Create show cases
Denodo training, hire external expertise
Reduce the number of innovations: other metadata
is postponed until 2023; steering the innovations is
subject of discussions
Introduction of Service Owners to implement Data
sharing and Metadata
DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM
13
Lessons learned so far
grazie
Ran van den Boom
Program Manager Data Strategy | Statistics Netherlands
r.vandenboom@cbs.nl
per l’attenzione

Mais conteúdo relacionado

Mais procurados

Data as a service
Data as a serviceData as a service
Data as a serviceZoltan Nagy
 
Meeting today’s dissemination challenges – Implementing International Standar...
Meeting today’s dissemination challenges – Implementing International Standar...Meeting today’s dissemination challenges – Implementing International Standar...
Meeting today’s dissemination challenges – Implementing International Standar...Jonathan Challener
 
Open data presentation on tools and automation
Open data presentation on tools and automationOpen data presentation on tools and automation
Open data presentation on tools and automationPia Waugh
 
20140902 LinDa Workshop Semantincs2014 - LinDA Project Overview
20140902 LinDa Workshop Semantincs2014 - LinDA Project Overview20140902 LinDa Workshop Semantincs2014 - LinDA Project Overview
20140902 LinDa Workshop Semantincs2014 - LinDA Project OverviewLinDa_FP7
 
Open Data Presentation v1.3 - Nov 2014
Open Data Presentation v1.3 - Nov 2014Open Data Presentation v1.3 - Nov 2014
Open Data Presentation v1.3 - Nov 2014Pia Waugh
 
Core Activities
Core ActivitiesCore Activities
Core ActivitiesSemic.eu
 
From open data to data-driven services
From open data to data-driven servicesFrom open data to data-driven services
From open data to data-driven servicesSlim Turki, Dr.
 
Introduction to the new DAD-IS architecture
Introduction to the new DAD-IS architecture Introduction to the new DAD-IS architecture
Introduction to the new DAD-IS architecture FAO
 
DAD-IS project overview and future perspectives
DAD-IS project overview and future perspectives DAD-IS project overview and future perspectives
DAD-IS project overview and future perspectives FAO
 
Service innovation: the hidden value of open data
Service innovation: the hidden value of open dataService innovation: the hidden value of open data
Service innovation: the hidden value of open dataSlim Turki, Dr.
 
Social Sentiment Indices Powered by X-Scores
Social Sentiment Indices Powered by X-ScoresSocial Sentiment Indices Powered by X-Scores
Social Sentiment Indices Powered by X-Scoreskcortis
 
20141030 LinDA Workshop echallenges2014 - LinDA project overview
20141030 LinDA Workshop echallenges2014 - LinDA project overview20141030 LinDA Workshop echallenges2014 - LinDA project overview
20141030 LinDA Workshop echallenges2014 - LinDA project overviewLinDa_FP7
 
BDE Technical Webinar 1 : Requirements elicitation
BDE Technical Webinar 1 : Requirements elicitationBDE Technical Webinar 1 : Requirements elicitation
BDE Technical Webinar 1 : Requirements elicitationBigData_Europe
 
How Government Agencies are Using MongoDB to Build Data as a Service Solutions
How Government Agencies are Using MongoDB to Build Data as a Service SolutionsHow Government Agencies are Using MongoDB to Build Data as a Service Solutions
How Government Agencies are Using MongoDB to Build Data as a Service SolutionsMongoDB
 
A Business-Critical SharePoint Solution From adesso AG
A Business-CriticalSharePoint SolutionFrom adesso AGA Business-CriticalSharePoint SolutionFrom adesso AG
A Business-Critical SharePoint Solution From adesso AGadesso AG
 
CDMA Migration to AnalytiX™ Mapping Manager®
CDMA Migration to AnalytiX™ Mapping Manager®CDMA Migration to AnalytiX™ Mapping Manager®
CDMA Migration to AnalytiX™ Mapping Manager®Mohammad Azad
 
Towards Bottom up semantic services definition
Towards Bottom up semantic services definitionTowards Bottom up semantic services definition
Towards Bottom up semantic services definitionCristian Vasquez
 

Mais procurados (19)

Data as a service
Data as a serviceData as a service
Data as a service
 
Meeting today’s dissemination challenges – Implementing International Standar...
Meeting today’s dissemination challenges – Implementing International Standar...Meeting today’s dissemination challenges – Implementing International Standar...
Meeting today’s dissemination challenges – Implementing International Standar...
 
Open data presentation on tools and automation
Open data presentation on tools and automationOpen data presentation on tools and automation
Open data presentation on tools and automation
 
20140902 LinDa Workshop Semantincs2014 - LinDA Project Overview
20140902 LinDa Workshop Semantincs2014 - LinDA Project Overview20140902 LinDa Workshop Semantincs2014 - LinDA Project Overview
20140902 LinDa Workshop Semantincs2014 - LinDA Project Overview
 
Open Data Presentation v1.3 - Nov 2014
Open Data Presentation v1.3 - Nov 2014Open Data Presentation v1.3 - Nov 2014
Open Data Presentation v1.3 - Nov 2014
 
Core Activities
Core ActivitiesCore Activities
Core Activities
 
From open data to data-driven services
From open data to data-driven servicesFrom open data to data-driven services
From open data to data-driven services
 
Introduction to the new DAD-IS architecture
Introduction to the new DAD-IS architecture Introduction to the new DAD-IS architecture
Introduction to the new DAD-IS architecture
 
Navistools Standard
Navistools StandardNavistools Standard
Navistools Standard
 
DAD-IS project overview and future perspectives
DAD-IS project overview and future perspectives DAD-IS project overview and future perspectives
DAD-IS project overview and future perspectives
 
Service innovation: the hidden value of open data
Service innovation: the hidden value of open dataService innovation: the hidden value of open data
Service innovation: the hidden value of open data
 
Census Hub Project
Census Hub ProjectCensus Hub Project
Census Hub Project
 
Social Sentiment Indices Powered by X-Scores
Social Sentiment Indices Powered by X-ScoresSocial Sentiment Indices Powered by X-Scores
Social Sentiment Indices Powered by X-Scores
 
20141030 LinDA Workshop echallenges2014 - LinDA project overview
20141030 LinDA Workshop echallenges2014 - LinDA project overview20141030 LinDA Workshop echallenges2014 - LinDA project overview
20141030 LinDA Workshop echallenges2014 - LinDA project overview
 
BDE Technical Webinar 1 : Requirements elicitation
BDE Technical Webinar 1 : Requirements elicitationBDE Technical Webinar 1 : Requirements elicitation
BDE Technical Webinar 1 : Requirements elicitation
 
How Government Agencies are Using MongoDB to Build Data as a Service Solutions
How Government Agencies are Using MongoDB to Build Data as a Service SolutionsHow Government Agencies are Using MongoDB to Build Data as a Service Solutions
How Government Agencies are Using MongoDB to Build Data as a Service Solutions
 
A Business-Critical SharePoint Solution From adesso AG
A Business-CriticalSharePoint SolutionFrom adesso AGA Business-CriticalSharePoint SolutionFrom adesso AG
A Business-Critical SharePoint Solution From adesso AG
 
CDMA Migration to AnalytiX™ Mapping Manager®
CDMA Migration to AnalytiX™ Mapping Manager®CDMA Migration to AnalytiX™ Mapping Manager®
CDMA Migration to AnalytiX™ Mapping Manager®
 
Towards Bottom up semantic services definition
Towards Bottom up semantic services definitionTowards Bottom up semantic services definition
Towards Bottom up semantic services definition
 

Semelhante a Data Virtualization at Statistics Netherlands

¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?Denodo
 
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...Denodo
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Nathan Bijnens
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An IntroductionDenodo
 
Total Data Industry Report
Total Data Industry ReportTotal Data Industry Report
Total Data Industry ReportRan Zhang
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Denodo
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Denodo
 
Bridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItBridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItDenodo
 
An Introduction to Data Virtualization in 2018
An Introduction to Data Virtualization in 2018An Introduction to Data Virtualization in 2018
An Introduction to Data Virtualization in 2018Denodo
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
3 Reasons Data Virtualization Matters in Your Portfolio
3 Reasons Data Virtualization Matters in Your Portfolio3 Reasons Data Virtualization Matters in Your Portfolio
3 Reasons Data Virtualization Matters in Your PortfolioDenodo
 
Cloud Migration Strategies that Ensure Greater Value for the Business
Cloud Migration Strategies that Ensure Greater Value for the BusinessCloud Migration Strategies that Ensure Greater Value for the Business
Cloud Migration Strategies that Ensure Greater Value for the BusinessDenodo
 
Crossing the bridge - how do we link end-user-computing and formal tech for d...
Crossing the bridge - how do we link end-user-computing and formal tech for d...Crossing the bridge - how do we link end-user-computing and formal tech for d...
Crossing the bridge - how do we link end-user-computing and formal tech for d...J On The Beach
 
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Denodo
 
Tdwi solution spotlight presentation slides
Tdwi solution spotlight   presentation slidesTdwi solution spotlight   presentation slides
Tdwi solution spotlight presentation slidesWilliam Lam
 
Multi-Cloud Integration with Data Virtualization (ASEAN)
Multi-Cloud Integration with Data Virtualization (ASEAN)Multi-Cloud Integration with Data Virtualization (ASEAN)
Multi-Cloud Integration with Data Virtualization (ASEAN)Denodo
 
Five Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceFive Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceDATAVERSITY
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationDenodo
 
Delivering Faster Insights with a Logical Data Fabric
Delivering Faster Insights with a Logical Data FabricDelivering Faster Insights with a Logical Data Fabric
Delivering Faster Insights with a Logical Data FabricDenodo
 

Semelhante a Data Virtualization at Statistics Netherlands (20)

¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
 
Big data
Big dataBig data
Big data
 
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
Total Data Industry Report
Total Data Industry ReportTotal Data Industry Report
Total Data Industry Report
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)
 
Bridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItBridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need It
 
An Introduction to Data Virtualization in 2018
An Introduction to Data Virtualization in 2018An Introduction to Data Virtualization in 2018
An Introduction to Data Virtualization in 2018
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
3 Reasons Data Virtualization Matters in Your Portfolio
3 Reasons Data Virtualization Matters in Your Portfolio3 Reasons Data Virtualization Matters in Your Portfolio
3 Reasons Data Virtualization Matters in Your Portfolio
 
Cloud Migration Strategies that Ensure Greater Value for the Business
Cloud Migration Strategies that Ensure Greater Value for the BusinessCloud Migration Strategies that Ensure Greater Value for the Business
Cloud Migration Strategies that Ensure Greater Value for the Business
 
Crossing the bridge - how do we link end-user-computing and formal tech for d...
Crossing the bridge - how do we link end-user-computing and formal tech for d...Crossing the bridge - how do we link end-user-computing and formal tech for d...
Crossing the bridge - how do we link end-user-computing and formal tech for d...
 
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
 
Tdwi solution spotlight presentation slides
Tdwi solution spotlight   presentation slidesTdwi solution spotlight   presentation slides
Tdwi solution spotlight presentation slides
 
Multi-Cloud Integration with Data Virtualization (ASEAN)
Multi-Cloud Integration with Data Virtualization (ASEAN)Multi-Cloud Integration with Data Virtualization (ASEAN)
Multi-Cloud Integration with Data Virtualization (ASEAN)
 
Five Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceFive Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data Governance
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data Virtualization
 
Delivering Faster Insights with a Logical Data Fabric
Delivering Faster Insights with a Logical Data FabricDelivering Faster Insights with a Logical Data Fabric
Delivering Faster Insights with a Logical Data Fabric
 

Mais de Istituto nazionale di statistica

Mais de Istituto nazionale di statistica (20)

Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
 
14a Conferenza Nazionale di Statisticacnstatistica14
14a Conferenza Nazionale di Statisticacnstatistica1414a Conferenza Nazionale di Statisticacnstatistica14
14a Conferenza Nazionale di Statisticacnstatistica14
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 

Último

Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxChelloAnnAsuncion2
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxnelietumpap1
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 

Último (20)

Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptx
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 

Data Virtualization at Statistics Netherlands

  • 1. Data virtualization at Statistics Netherlands “Modern data integration systems: the experience of Istat and other institutes” Ran van den Boom Program Manager Data Strategy | Statistics Netherlands 30.11-1.12//2021
  • 2. Definition of Data Integration according to Wikipedia: “Data integration involves combining data residing in different sources and providing users with a unified view of them” We need to describe phenomena, we need to increase the efficiency and we have a lot of challenges to implement all innovations. How is data integration and data virtualization going to help us? o Why o What o How o Lessons learned so far What are the secret ingredients? DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM Indice / agenda 2
  • 3. • Increasing demand of our statistics • Strengthening of phenomenon-oriented measurement & description of society such as Covid-19 effects on the economy, on social division • Increase the role of CBS as data partner of the Government This requires trusted data, faster time-to-market, data-driven work, and more collaboration between various organizations. DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM Why: external drivers for change 3
  • 4. • External drivers lead to Innovation in existing processes and future-focused trajectories (new data sources) • Internal processes need to become more efficient • Being able to access more sources, also external • Becoming more effective: understand the data This requires innovation of our processes: • Introduction of silos for data in rest at interfaces • Helps to unravel process steps (data on the move) from interfaces (data in rest) • Avoiding stove pipes, data driven instead of process driven • More options for gathering external data sources, flexible and standardized DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM Why: internal drivers for change 4
  • 5. DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM Our current situation on data integration 5 (Source: Wikipedia) Ideal database: everything fits
  • 6. DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM Our current situation on data integration 6 (Source: Wikipedia) Ideal database: everything fits
  • 7. In our situation several islands, with hardly any connection • Microdata stored in Data Service Centre – but not all • Published data stored in StatLine – but not all • Raw data in Data collection • Other datasets stored anywhere 2012: Data Virtualization combines disparate data sources into a single “virtual” data layer that provides integrated data services to consuming applications in real-time. Could that be a solution? DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM Data Integration = Sharing Data Statistical process, maturity levels 7 1st interface: raw data 2nd interface: standardized data 3d interface: processed data 4th interface: statistics 5th interface: published data External sources Surveys
  • 8. 1. Being able to share data 2. Being able to share metadata 3. Governance Our secret ingredients: 4. Collaboration (WII4Me), Agile approach at first Note: this is a simplified picture We also need Storage facilities, Privacy Preserving Techniques and tools, Tools for preparing metadata and data for publishing, Publishing capabilities for external metadata, etc. DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM 8 What do we need to support the Why 1st interface: raw data 2nd interface: standardized data 3d interface: processed data 4th interface: statistics 5th interface: published data Sharing data Metadata Governance
  • 9. Metadata • Fit for purpose • Patterns • Maturity levels • Metadata modelling • Requirements • Metadata Management System DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM 9 What do we need to support the Why: the products 1st interface: raw data 2nd interface: standardized data 3d interface: processed data 4th interface: statistics 5th interface: published data Sharing data Metadata Governance Data Abstraction Layer (DAL) Denodo Metadata catalog including taxonomy Best practices
  • 10. • People • Knowledge • Processes • Organization • Governance Using several approaches: • Change management • Agile • Project Management • Step by step DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM 10 How do we realize that? Data Abstraction Layer Source A Source B Source C Source D Other metadata (MMS) Classifications, codelists (CLS) Metadata catalog and search engine Taxonomy Solutions
  • 11. DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM 11 Demo: a Data Abstraction Layer is not abstract at all Advantages • One standardized method of accessing data (SQL) • Dataset (“view”) is created run-time • No copies • Overview of all datasets • Regardless the location of the data or its shape/format Disadvantages • Source system is used for queries • Permissions, technical access MS Access to retrieve data through the DAL Denodo Data Catalog Designing a view
  • 12. Steps to achieve these ambitions DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM 12 Lessons learned so far: strategy Maturity level of data integration Characteristics Purpose Requires Low Overview of datasets Find and use data DAL/Denodo Create views Medium Well described datasets Understanding the data Explaining phenomena Automated validation Resolving inconsistencies Metadata MMS (COTS), CLS (own development) High Standardized, harmonized metadata Metadata-driven processes Data sharing Automatically generate datasets based on metadata Information Dialog Efficiency Internal discussions Time (years) and resources Adapted processes, change management Tools Here we are
  • 13. Challenges Finding use cases: nobody could imagine what it would mean (WII4Me) – at first Lack of expertise, technical issues Difficult to get every one on the same line for the governance: stability vs. autonomy led to discussion control vs. anarchy Too many innovations going on at the same time Organizing this requires more than an Agile approach Success factors Support from management, one goal One Business Owner Find the opportunities, e.g. distribution problem for owners of many datasets; heavy consumers of data with no overview (e.g. national accounts, large companies) Create show cases Denodo training, hire external expertise Reduce the number of innovations: other metadata is postponed until 2023; steering the innovations is subject of discussions Introduction of Service Owners to implement Data sharing and Metadata DATA VIRTUALIZATION AT STATISTICS NETHERLANDS | RAN VAN DEN BOOM 13 Lessons learned so far
  • 14. grazie Ran van den Boom Program Manager Data Strategy | Statistics Netherlands r.vandenboom@cbs.nl per l’attenzione

Notas do Editor

  1. Islands of data and metadata, hardly any connection and some of them are missing
  2. Islands of data and metadata, hardly any connection and some of them are missing
  3. We hoped to be further after 9 years, but: it’s no dolce vita