SlideShare a Scribd company logo
1 of 21
The Data Flow Task
Encapsulates the data flow engine
Exists in the context of an overall control flow
Performs traditional ETL in addition to other extended scenarios
Is fast and scalable
Data Flow Components
Extract data from Sources
Load data into Destinations
Modify data with Transformations
Service Paths
Connect data flow components
Create the pipeline
Encapsulates the data flow engine
Load
Extract
Transform
A Path connects two components in a data flow by connecting the
output of one data flow component to the input of another
component. A path has a source and a destination.
In SSIS, a source is the data flow component that extracts data from different
external data sources and makes it available to the other components in the data
flow.
Sources have one regular output, and many sources in addition also have one
error output.
All the output columns are available as input columns to the next data flow
component in the data flow.
Sources extract data from:
Relational tables and views
Files
Analysis Services databases
OLEDB Oracle
Connection
Data Source
Source Adapter
Destinations are the data flow components that load the data from a data
flow into different types of data sources or create an in-memory dataset.
Destinations have one input and one error output.
Destinations load data to:
Relational tables and views
Files
Analysis Services databases and objects
DataReaders and Recordsets
Enterprise Edition only
ADO.NET
Connection TargetDestination
Adapter
SSIS Transformations are the components in the data flow of a package
that give you the ability to modify and manipulate data in the data flow.
A transformation performs an operation either on one row of data at a
time or on several rows of data at once.
For example aggregate, merge, distribute, and modify data and also
can perform lookup operations and generate sample datasets.
DimProduct
ProductKey
Color
Name
Cost
DimProduct
ProductKey
Color
Name
Cost
DimProduct
ProductKey
Color
Name
Cost
Source
Transformation
Destination
Best Practices
We can logically group them by functionality:
Row Transformations
Rowset Transformations
Split and Join Transformations
Auditing Transformations
Business Intelligence Transformations
Custom Transformations
The most common and easily configured transformations perform operations
on rows without needing other rows from the source. These transformations,
which logically work at the row level, often perform very well.
Update column values or create new columns
Transform each row in the pipeline input
Create new rowsets that can include
Aggregated values
Sorted values
Sample rowsets
Pivoted or unpivoted rowsets
Distribute rows to different outputs
Create copies of the transformation inputs
Join multiple inputs into one output
Perform lookup operations
Integration Services includes the following transformations to
add audit information and count rows.
cleaning data
updating of a slowly changing dimension
looks up values
mining text
running data mining prediction queries
The final grouping of transformations lets you perform
advanced operations on rows in the data flow pipeline.
ADO.NET
Connection
Sources
Transformations
Destinations
OLEDB Oracle
Connection
EXCEL
Connection
SSIS 2008 R2 data flow

More Related Content

What's hot

Apache Kafka® and the Data Mesh
Apache Kafka® and the Data MeshApache Kafka® and the Data Mesh
Apache Kafka® and the Data MeshConfluentInc1
 
Parnab Nandy SAP BW Consultant Resume
Parnab Nandy SAP BW Consultant Resume Parnab Nandy SAP BW Consultant Resume
Parnab Nandy SAP BW Consultant Resume Parnab Nandy
 
Azure Data Factory Data Flows Training v005
Azure Data Factory Data Flows Training v005Azure Data Factory Data Flows Training v005
Azure Data Factory Data Flows Training v005Mark Kromer
 
Dimensional model | | Fact Tables | | Types
Dimensional model | | Fact Tables | | TypesDimensional model | | Fact Tables | | Types
Dimensional model | | Fact Tables | | Typesumair saeed
 
Azure Data Factory Data Flow
Azure Data Factory Data FlowAzure Data Factory Data Flow
Azure Data Factory Data FlowMark Kromer
 
How to govern and secure a Data Mesh?
How to govern and secure a Data Mesh?How to govern and secure a Data Mesh?
How to govern and secure a Data Mesh?confluent
 
Architecture of integration services
Architecture of integration servicesArchitecture of integration services
Architecture of integration servicesSlava Kokaev
 
Building Dynamic Data Pipelines in Azure Data Factory (Microsoft Ignite 2019)
Building Dynamic Data Pipelines in Azure Data Factory (Microsoft Ignite 2019)Building Dynamic Data Pipelines in Azure Data Factory (Microsoft Ignite 2019)
Building Dynamic Data Pipelines in Azure Data Factory (Microsoft Ignite 2019)Cathrine Wilhelmsen
 
Azure Data Factory for Azure Data Week
Azure Data Factory for Azure Data WeekAzure Data Factory for Azure Data Week
Azure Data Factory for Azure Data WeekMark Kromer
 
Data platform architecture
Data platform architectureData platform architecture
Data platform architectureSudheer Kondla
 
Data Migration Steps PowerPoint Presentation Slides
Data Migration Steps PowerPoint Presentation Slides Data Migration Steps PowerPoint Presentation Slides
Data Migration Steps PowerPoint Presentation Slides SlideTeam
 
5 Level of MDM Maturity
5 Level of MDM Maturity5 Level of MDM Maturity
5 Level of MDM MaturityPanaEk Warawit
 
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureData Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureKent Graziano
 
Free Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseDatabricks
 

What's hot (20)

Apache Kafka® and the Data Mesh
Apache Kafka® and the Data MeshApache Kafka® and the Data Mesh
Apache Kafka® and the Data Mesh
 
Ssis 2008
Ssis 2008Ssis 2008
Ssis 2008
 
MDM and Reference Data
MDM and Reference DataMDM and Reference Data
MDM and Reference Data
 
Fact table facts
Fact table factsFact table facts
Fact table facts
 
Parnab Nandy SAP BW Consultant Resume
Parnab Nandy SAP BW Consultant Resume Parnab Nandy SAP BW Consultant Resume
Parnab Nandy SAP BW Consultant Resume
 
Azure Data Factory Data Flows Training v005
Azure Data Factory Data Flows Training v005Azure Data Factory Data Flows Training v005
Azure Data Factory Data Flows Training v005
 
Dimensional model | | Fact Tables | | Types
Dimensional model | | Fact Tables | | TypesDimensional model | | Fact Tables | | Types
Dimensional model | | Fact Tables | | Types
 
Azure Data Factory Data Flow
Azure Data Factory Data FlowAzure Data Factory Data Flow
Azure Data Factory Data Flow
 
How to govern and secure a Data Mesh?
How to govern and secure a Data Mesh?How to govern and secure a Data Mesh?
How to govern and secure a Data Mesh?
 
Architecture of integration services
Architecture of integration servicesArchitecture of integration services
Architecture of integration services
 
Database management systems
Database management systemsDatabase management systems
Database management systems
 
Building Dynamic Data Pipelines in Azure Data Factory (Microsoft Ignite 2019)
Building Dynamic Data Pipelines in Azure Data Factory (Microsoft Ignite 2019)Building Dynamic Data Pipelines in Azure Data Factory (Microsoft Ignite 2019)
Building Dynamic Data Pipelines in Azure Data Factory (Microsoft Ignite 2019)
 
Azure Data Factory for Azure Data Week
Azure Data Factory for Azure Data WeekAzure Data Factory for Azure Data Week
Azure Data Factory for Azure Data Week
 
Data platform architecture
Data platform architectureData platform architecture
Data platform architecture
 
Data lake ppt
Data lake pptData lake ppt
Data lake ppt
 
Data Migration Steps PowerPoint Presentation Slides
Data Migration Steps PowerPoint Presentation Slides Data Migration Steps PowerPoint Presentation Slides
Data Migration Steps PowerPoint Presentation Slides
 
5 Level of MDM Maturity
5 Level of MDM Maturity5 Level of MDM Maturity
5 Level of MDM Maturity
 
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureData Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
 
Free Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a Lakehouse
 
Data Vault and DW2.0
Data Vault and DW2.0Data Vault and DW2.0
Data Vault and DW2.0
 

Viewers also liked

05 SSIS Control Flow
05 SSIS Control Flow05 SSIS Control Flow
05 SSIS Control FlowSlava Kokaev
 
Advanced integration services on microsoft ssis 1
Advanced integration services on microsoft ssis 1Advanced integration services on microsoft ssis 1
Advanced integration services on microsoft ssis 1Skillwise Group
 
Business intelligence the next generation of knowledge management (1)
Business intelligence the next generation of knowledge  management (1)Business intelligence the next generation of knowledge  management (1)
Business intelligence the next generation of knowledge management (1)ichsanovsky
 
03 Integration Services Project
03 Integration Services Project03 Integration Services Project
03 Integration Services ProjectSlava Kokaev
 
SQL Server Integration Services – Enterprise Manageability
SQL Server Integration Services – Enterprise ManageabilitySQL Server Integration Services – Enterprise Manageability
SQL Server Integration Services – Enterprise ManageabilityDan English
 
Professional Recycling - SSIS Custom Control Flow Components With Visual Stud...
Professional Recycling - SSIS Custom Control Flow Components With Visual Stud...Professional Recycling - SSIS Custom Control Flow Components With Visual Stud...
Professional Recycling - SSIS Custom Control Flow Components With Visual Stud...Wolfgang Strasser
 
9\9 SSIS 2008R2_Training - Package Reliability and Package Execution
9\9 SSIS 2008R2_Training - Package Reliability and Package Execution9\9 SSIS 2008R2_Training - Package Reliability and Package Execution
9\9 SSIS 2008R2_Training - Package Reliability and Package ExecutionPramod Singla
 
Step by Step design cube using SSAS
Step by Step design cube using SSASStep by Step design cube using SSAS
Step by Step design cube using SSASAhsan Kabir
 
SSIS Basic Data Flow
SSIS Basic Data FlowSSIS Basic Data Flow
SSIS Basic Data FlowRam Kedem
 
5\9 SSIS 2008R2_Training - DataFlow Basics
5\9 SSIS 2008R2_Training - DataFlow Basics5\9 SSIS 2008R2_Training - DataFlow Basics
5\9 SSIS 2008R2_Training - DataFlow BasicsPramod Singla
 
2\9.SSIS 2008R2 _Training - Control Flow
2\9.SSIS 2008R2 _Training - Control Flow2\9.SSIS 2008R2 _Training - Control Flow
2\9.SSIS 2008R2 _Training - Control FlowPramod Singla
 
SSIS Data Flow Tasks
SSIS Data Flow Tasks SSIS Data Flow Tasks
SSIS Data Flow Tasks Ram Kedem
 
6.2\9 SSIS 2008R2_Training - DataFlow Transformations
6.2\9 SSIS 2008R2_Training - DataFlow Transformations6.2\9 SSIS 2008R2_Training - DataFlow Transformations
6.2\9 SSIS 2008R2_Training - DataFlow TransformationsPramod Singla
 
Business Intelligence with SQL Server
Business Intelligence with SQL ServerBusiness Intelligence with SQL Server
Business Intelligence with SQL ServerPeter Gfader
 
Developing ssas cube
Developing ssas cubeDeveloping ssas cube
Developing ssas cubeSlava Kokaev
 
Introduction to Azure Data Factory
Introduction to Azure Data FactoryIntroduction to Azure Data Factory
Introduction to Azure Data FactorySlava Kokaev
 

Viewers also liked (20)

05 SSIS Control Flow
05 SSIS Control Flow05 SSIS Control Flow
05 SSIS Control Flow
 
06 SSIS Data Flow
06 SSIS Data Flow06 SSIS Data Flow
06 SSIS Data Flow
 
Advanced integration services on microsoft ssis 1
Advanced integration services on microsoft ssis 1Advanced integration services on microsoft ssis 1
Advanced integration services on microsoft ssis 1
 
Business intelligence the next generation of knowledge management (1)
Business intelligence the next generation of knowledge  management (1)Business intelligence the next generation of knowledge  management (1)
Business intelligence the next generation of knowledge management (1)
 
03 Integration Services Project
03 Integration Services Project03 Integration Services Project
03 Integration Services Project
 
SQL Server Integration Services – Enterprise Manageability
SQL Server Integration Services – Enterprise ManageabilitySQL Server Integration Services – Enterprise Manageability
SQL Server Integration Services – Enterprise Manageability
 
Ssis
SsisSsis
Ssis
 
Professional Recycling - SSIS Custom Control Flow Components With Visual Stud...
Professional Recycling - SSIS Custom Control Flow Components With Visual Stud...Professional Recycling - SSIS Custom Control Flow Components With Visual Stud...
Professional Recycling - SSIS Custom Control Flow Components With Visual Stud...
 
SQLDay2013_ChrisWebb_CubeDesign&PerformanceTuning
SQLDay2013_ChrisWebb_CubeDesign&PerformanceTuningSQLDay2013_ChrisWebb_CubeDesign&PerformanceTuning
SQLDay2013_ChrisWebb_CubeDesign&PerformanceTuning
 
9\9 SSIS 2008R2_Training - Package Reliability and Package Execution
9\9 SSIS 2008R2_Training - Package Reliability and Package Execution9\9 SSIS 2008R2_Training - Package Reliability and Package Execution
9\9 SSIS 2008R2_Training - Package Reliability and Package Execution
 
Step by Step design cube using SSAS
Step by Step design cube using SSASStep by Step design cube using SSAS
Step by Step design cube using SSAS
 
SSIS Basic Data Flow
SSIS Basic Data FlowSSIS Basic Data Flow
SSIS Basic Data Flow
 
5\9 SSIS 2008R2_Training - DataFlow Basics
5\9 SSIS 2008R2_Training - DataFlow Basics5\9 SSIS 2008R2_Training - DataFlow Basics
5\9 SSIS 2008R2_Training - DataFlow Basics
 
2\9.SSIS 2008R2 _Training - Control Flow
2\9.SSIS 2008R2 _Training - Control Flow2\9.SSIS 2008R2 _Training - Control Flow
2\9.SSIS 2008R2 _Training - Control Flow
 
SSIS Data Flow Tasks
SSIS Data Flow Tasks SSIS Data Flow Tasks
SSIS Data Flow Tasks
 
6.2\9 SSIS 2008R2_Training - DataFlow Transformations
6.2\9 SSIS 2008R2_Training - DataFlow Transformations6.2\9 SSIS 2008R2_Training - DataFlow Transformations
6.2\9 SSIS 2008R2_Training - DataFlow Transformations
 
Business Intelligence with SQL Server
Business Intelligence with SQL ServerBusiness Intelligence with SQL Server
Business Intelligence with SQL Server
 
Developing ssas cube
Developing ssas cubeDeveloping ssas cube
Developing ssas cube
 
Introduction to Azure Data Factory
Introduction to Azure Data FactoryIntroduction to Azure Data Factory
Introduction to Azure Data Factory
 
Informatica power center 9 Online Training
Informatica power center 9 Online TrainingInformatica power center 9 Online Training
Informatica power center 9 Online Training
 

Similar to SSIS 2008 R2 data flow

Introduction of ssis
Introduction of ssisIntroduction of ssis
Introduction of ssisdeepakk073
 
White jason presentation
White jason presentationWhite jason presentation
White jason presentationWhiteJason
 
Mapping Data Flows Training deck Q1 CY22
Mapping Data Flows Training deck Q1 CY22Mapping Data Flows Training deck Q1 CY22
Mapping Data Flows Training deck Q1 CY22Mark Kromer
 
Azure Data Factory Data Flows Training (Sept 2020 Update)
Azure Data Factory Data Flows Training (Sept 2020 Update)Azure Data Factory Data Flows Training (Sept 2020 Update)
Azure Data Factory Data Flows Training (Sept 2020 Update)Mark Kromer
 
Mapping Data Flows Training April 2021
Mapping Data Flows Training April 2021Mapping Data Flows Training April 2021
Mapping Data Flows Training April 2021Mark Kromer
 
introductionofssis-130418034853-phpapp01.pptx
introductionofssis-130418034853-phpapp01.pptxintroductionofssis-130418034853-phpapp01.pptx
introductionofssis-130418034853-phpapp01.pptxYashaswiniSrinivasan1
 
Informatica overview
Informatica overviewInformatica overview
Informatica overviewkarthik kumar
 
Informatica overview
Informatica overviewInformatica overview
Informatica overviewkarthik kumar
 
Data mining with ms access
Data mining with ms accessData mining with ms access
Data mining with ms accessDhatri Jain
 
Managing Data Integration Initiatives
Managing Data Integration InitiativesManaging Data Integration Initiatives
Managing Data Integration InitiativesAllinConsulting
 
SAS Online Training Institute in Hyderabad - C-Point
SAS Online Training Institute in Hyderabad - C-PointSAS Online Training Institute in Hyderabad - C-Point
SAS Online Training Institute in Hyderabad - C-Pointcpointss
 
Business Intelligence Portfolio
Business Intelligence PortfolioBusiness Intelligence Portfolio
Business Intelligence Portfoliopleeloy
 
6.1\9 SSIS 2008R2_Training - DataFlow Transformations
6.1\9 SSIS 2008R2_Training - DataFlow Transformations6.1\9 SSIS 2008R2_Training - DataFlow Transformations
6.1\9 SSIS 2008R2_Training - DataFlow TransformationsPramod Singla
 
Informatica Designer Module
Informatica Designer ModuleInformatica Designer Module
Informatica Designer Moduleganblues
 
Test Data Transfer Tool
Test Data Transfer ToolTest Data Transfer Tool
Test Data Transfer ToolHai Nguyen
 

Similar to SSIS 2008 R2 data flow (20)

Introduction of ssis
Introduction of ssisIntroduction of ssis
Introduction of ssis
 
White jason presentation
White jason presentationWhite jason presentation
White jason presentation
 
Mapping Data Flows Training deck Q1 CY22
Mapping Data Flows Training deck Q1 CY22Mapping Data Flows Training deck Q1 CY22
Mapping Data Flows Training deck Q1 CY22
 
Azure Data Factory Data Flows Training (Sept 2020 Update)
Azure Data Factory Data Flows Training (Sept 2020 Update)Azure Data Factory Data Flows Training (Sept 2020 Update)
Azure Data Factory Data Flows Training (Sept 2020 Update)
 
Mapping Data Flows Training April 2021
Mapping Data Flows Training April 2021Mapping Data Flows Training April 2021
Mapping Data Flows Training April 2021
 
introductionofssis-130418034853-phpapp01.pptx
introductionofssis-130418034853-phpapp01.pptxintroductionofssis-130418034853-phpapp01.pptx
introductionofssis-130418034853-phpapp01.pptx
 
AWS RDS Migration Tool
AWS RDS Migration Tool AWS RDS Migration Tool
AWS RDS Migration Tool
 
Olap
OlapOlap
Olap
 
Informatica overview
Informatica overviewInformatica overview
Informatica overview
 
Informatica overview
Informatica overviewInformatica overview
Informatica overview
 
adf.docx
adf.docxadf.docx
adf.docx
 
Data mining with ms access
Data mining with ms accessData mining with ms access
Data mining with ms access
 
2 designer
2 designer2 designer
2 designer
 
Managing Data Integration Initiatives
Managing Data Integration InitiativesManaging Data Integration Initiatives
Managing Data Integration Initiatives
 
SAS Online Training Institute in Hyderabad - C-Point
SAS Online Training Institute in Hyderabad - C-PointSAS Online Training Institute in Hyderabad - C-Point
SAS Online Training Institute in Hyderabad - C-Point
 
Business Intelligence Portfolio
Business Intelligence PortfolioBusiness Intelligence Portfolio
Business Intelligence Portfolio
 
6.1\9 SSIS 2008R2_Training - DataFlow Transformations
6.1\9 SSIS 2008R2_Training - DataFlow Transformations6.1\9 SSIS 2008R2_Training - DataFlow Transformations
6.1\9 SSIS 2008R2_Training - DataFlow Transformations
 
Informatica Designer Module
Informatica Designer ModuleInformatica Designer Module
Informatica Designer Module
 
CIS 282 Final Review
CIS 282 Final ReviewCIS 282 Final Review
CIS 282 Final Review
 
Test Data Transfer Tool
Test Data Transfer ToolTest Data Transfer Tool
Test Data Transfer Tool
 

More from Slava Kokaev

Introduction to Azure Stream Analytics
Introduction to Azure Stream AnalyticsIntroduction to Azure Stream Analytics
Introduction to Azure Stream AnalyticsSlava Kokaev
 
Business process modeling and analysis for data warehouse design
Business process modeling and analysis for data warehouse designBusiness process modeling and analysis for data warehouse design
Business process modeling and analysis for data warehouse designSlava Kokaev
 
Introduction BI Semantic Model with Sql Server Data Tools copy
Introduction BI Semantic Model with Sql Server Data Tools   copyIntroduction BI Semantic Model with Sql Server Data Tools   copy
Introduction BI Semantic Model with Sql Server Data Tools copySlava Kokaev
 
Architecture modeling with UML and Visual Studio 2010 Ultimate
Architecture modeling with UML and Visual Studio 2010 UltimateArchitecture modeling with UML and Visual Studio 2010 Ultimate
Architecture modeling with UML and Visual Studio 2010 UltimateSlava Kokaev
 
SSIS Connection managers and data sources
SSIS Connection managers and data sourcesSSIS Connection managers and data sources
SSIS Connection managers and data sourcesSlava Kokaev
 
Data visualization
Data visualizationData visualization
Data visualizationSlava Kokaev
 
Business intelligence architecture
Business intelligence architectureBusiness intelligence architecture
Business intelligence architectureSlava Kokaev
 
Designing and developing Business Process dimensional Model or Data Warehouse
Designing and developing  Business Process dimensional Model  or Data WarehouseDesigning and developing  Business Process dimensional Model  or Data Warehouse
Designing and developing Business Process dimensional Model or Data WarehouseSlava Kokaev
 
SSAS R2 and SharePoint 2010 – Business Intelligence
SSAS R2 and SharePoint 2010 – Business IntelligenceSSAS R2 and SharePoint 2010 – Business Intelligence
SSAS R2 and SharePoint 2010 – Business IntelligenceSlava Kokaev
 
MS SQL Server Analysis Services 2008 and Enterprise Data Warehousing
MS SQL Server Analysis Services 2008 and Enterprise Data WarehousingMS SQL Server Analysis Services 2008 and Enterprise Data Warehousing
MS SQL Server Analysis Services 2008 and Enterprise Data WarehousingSlava Kokaev
 
01 Architecture Of Integration Services
01 Architecture Of Integration Services01 Architecture Of Integration Services
01 Architecture Of Integration ServicesSlava Kokaev
 
Bi Architecture And Conceptual Framework
Bi Architecture And Conceptual FrameworkBi Architecture And Conceptual Framework
Bi Architecture And Conceptual FrameworkSlava Kokaev
 

More from Slava Kokaev (12)

Introduction to Azure Stream Analytics
Introduction to Azure Stream AnalyticsIntroduction to Azure Stream Analytics
Introduction to Azure Stream Analytics
 
Business process modeling and analysis for data warehouse design
Business process modeling and analysis for data warehouse designBusiness process modeling and analysis for data warehouse design
Business process modeling and analysis for data warehouse design
 
Introduction BI Semantic Model with Sql Server Data Tools copy
Introduction BI Semantic Model with Sql Server Data Tools   copyIntroduction BI Semantic Model with Sql Server Data Tools   copy
Introduction BI Semantic Model with Sql Server Data Tools copy
 
Architecture modeling with UML and Visual Studio 2010 Ultimate
Architecture modeling with UML and Visual Studio 2010 UltimateArchitecture modeling with UML and Visual Studio 2010 Ultimate
Architecture modeling with UML and Visual Studio 2010 Ultimate
 
SSIS Connection managers and data sources
SSIS Connection managers and data sourcesSSIS Connection managers and data sources
SSIS Connection managers and data sources
 
Data visualization
Data visualizationData visualization
Data visualization
 
Business intelligence architecture
Business intelligence architectureBusiness intelligence architecture
Business intelligence architecture
 
Designing and developing Business Process dimensional Model or Data Warehouse
Designing and developing  Business Process dimensional Model  or Data WarehouseDesigning and developing  Business Process dimensional Model  or Data Warehouse
Designing and developing Business Process dimensional Model or Data Warehouse
 
SSAS R2 and SharePoint 2010 – Business Intelligence
SSAS R2 and SharePoint 2010 – Business IntelligenceSSAS R2 and SharePoint 2010 – Business Intelligence
SSAS R2 and SharePoint 2010 – Business Intelligence
 
MS SQL Server Analysis Services 2008 and Enterprise Data Warehousing
MS SQL Server Analysis Services 2008 and Enterprise Data WarehousingMS SQL Server Analysis Services 2008 and Enterprise Data Warehousing
MS SQL Server Analysis Services 2008 and Enterprise Data Warehousing
 
01 Architecture Of Integration Services
01 Architecture Of Integration Services01 Architecture Of Integration Services
01 Architecture Of Integration Services
 
Bi Architecture And Conceptual Framework
Bi Architecture And Conceptual FrameworkBi Architecture And Conceptual Framework
Bi Architecture And Conceptual Framework
 

Recently uploaded

The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 

Recently uploaded (20)

The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 

SSIS 2008 R2 data flow

  • 1. The Data Flow Task Encapsulates the data flow engine Exists in the context of an overall control flow Performs traditional ETL in addition to other extended scenarios Is fast and scalable Data Flow Components Extract data from Sources Load data into Destinations Modify data with Transformations Service Paths Connect data flow components Create the pipeline
  • 2. Encapsulates the data flow engine Load Extract Transform
  • 3.
  • 4.
  • 5. A Path connects two components in a data flow by connecting the output of one data flow component to the input of another component. A path has a source and a destination.
  • 6. In SSIS, a source is the data flow component that extracts data from different external data sources and makes it available to the other components in the data flow. Sources have one regular output, and many sources in addition also have one error output. All the output columns are available as input columns to the next data flow component in the data flow. Sources extract data from: Relational tables and views Files Analysis Services databases
  • 8. Destinations are the data flow components that load the data from a data flow into different types of data sources or create an in-memory dataset. Destinations have one input and one error output. Destinations load data to: Relational tables and views Files Analysis Services databases and objects DataReaders and Recordsets Enterprise Edition only
  • 10. SSIS Transformations are the components in the data flow of a package that give you the ability to modify and manipulate data in the data flow. A transformation performs an operation either on one row of data at a time or on several rows of data at once. For example aggregate, merge, distribute, and modify data and also can perform lookup operations and generate sample datasets.
  • 11.
  • 14. We can logically group them by functionality: Row Transformations Rowset Transformations Split and Join Transformations Auditing Transformations Business Intelligence Transformations Custom Transformations
  • 15. The most common and easily configured transformations perform operations on rows without needing other rows from the source. These transformations, which logically work at the row level, often perform very well. Update column values or create new columns Transform each row in the pipeline input
  • 16. Create new rowsets that can include Aggregated values Sorted values Sample rowsets Pivoted or unpivoted rowsets
  • 17. Distribute rows to different outputs Create copies of the transformation inputs Join multiple inputs into one output Perform lookup operations
  • 18. Integration Services includes the following transformations to add audit information and count rows.
  • 19. cleaning data updating of a slowly changing dimension looks up values mining text running data mining prediction queries The final grouping of transformations lets you perform advanced operations on rows in the data flow pipeline.

Editor's Notes

  1. For beginners, the data flow can look very similar to the control flow. Ensure that students appreciate the difference between the arrows in control flow (precedence constraints) and data flow (service paths).Extended scenarios can consist of, for example, non-ETL data transfers, data cleansing, or text mining.When describing the sample data flow on the slide, use the analogy of a conveyor belt in a factory. Raw material (source data) is placed on the conveyor belt and passes through various processes (transformations). Quality assurance components might reject some material, in which case it can be scrapped (logged) or fixed and blended back in with the quality material. Eventually, finished goods (clean, valid data) arrive at the end of the conveyor belt (data warehouse).
  2. Developing Project Data Sources and Package ConnectionsBecause the main purpose of SSIS is to move data from sources to destinations, the nextmost important step is to add the pointers to these sources and destinations. These pointersare called data sources and connections. Data sources are stored at the project level and arefound in Solution Explorer under the logical folder named Data Sources. Connections, on theother hand, are defined within packages and are found in the Connection Managers pane atthe bottom of the Control Flow or Data Flow tab. Connections can be based on project datasources or can stand alone within packages. The next sections walk you through the uses andimplementation of project data sources and package connections.
  3. To work with the Data Flow Task, you can either drag a Data Flow Task from the Control Flow toolbox onto the workspace and then double-click it,or you can click the Data Flow tab within the SSIS Designer. After clicking the Data Flow tab, you see the Data Flow Designer, where you can use the data flow to handle and transform datasets. Lesson 2, “Creating and Editing Control Flow Objects,” showed how to use control flow tasksand containers.Notice the difference between the ControlFlow toolbox items and the Data Flow toolbox items.When you are working in the data flow, the toolbox shows items related to data flow development,including data flow sources, data flow transformations, and data flow destinations.In this lesson, you will look at the details of the source and destination adapters as well asthe transformations.
  4. If you run a package in SSIS Designer, you can view the data in a data flow by attaching data viewers to a path. A data viewer can be configured to display data in a grid, histogram, scatter plot, or column chart. A data viewer is a useful debugging tool. For more information, see Debugging Data Flow. For example, if a path connects an OLE DB source and a Sort transformation, the OLE DB source is the source of the path, and the Sort transformation is the destination of the path. The source is the component where the path starts, and the destination is the component where the path ends. The SSIS Designer provides the Data Flow Path Editor dialog box for setting path properties, viewing the metadata of the data columns that pass through the path, and configuring data viewers.
  5. ADO.NET SourceProvides connections to tables or queries through an ADO.NETprovider.Excel SourceAllows extractions from an Excel worksheet defined in an Excelfile.Flat File Source Connects to a delimited or fixed-width file created with differentcode pages.OLE DB Source Connects to installed OLE DB providers, such as SQL Server,Access, SSAS, and Oracle.Raw File SourceStores native SSIS data in a binary file type useful for data staging.XML SourceAllows raw data to be extracted from an XML file; requires an XMLschema to define data associations.
  6. Data flow source adapters use package connections, which point to the server instance or file location of the data source. (The only exception is the raw file adapter, which does not use a package connection.) A source adapter extracts data from sources and moves it into the data flow, where it will be modified and sent to a destination.The source for a data flow typically has one regular output. The regular output contains output columns, which are columns the source adds to the data flow.
  7. Data flow destinations are similar to sources in that they use package connections. However, destinations are the endpoints in a package, defining the location to which the data shouldbe pushed.For example, if you are sending data to an Excel file from a database table, your destination will be an Excel Destination adapter.All the source adapters (except the Data Reader source) have matching destination adapters in the SSIS data flow. And there are other destination adapters that let you send data toeven more destinationsADO.NET Destination Allows insertion of data by using an ADO.NET managedprovider.Data Mining Model Training Lets you pass data from the data flow into a data miningmodel in SSAS.DataReader Destination Lets you put data in an ADO.NET record set that can beprogrammatically referenced.Dimension Processing Lets SSAS dimensions be processed directly from dataflowing through the data flow.Excel Destination Used for inserting data into Excel, including Excel 2007.Flat File DestinationAllows insertion of data to a flat file such as a comma delimitedor tab-delimited file.OLE DB Destination Uses the OLE DB provider to insert rows into a destinationsystem that allows an OLE DB connection.Partition Processing Allows SSAS partitions to be processed directly fromdata flowing through the data flow.Raw File Destination Stores native SSIS data in a binary fi le type useful fordata staging.Recordset Destination Takes the data flow data and creates a record set in a package variable of type object.SQL Server Compact DestinationLets you send data to a mobile device running SQL Mobile.SQL Server Destination Provides a high-speed destination specific to SQL Server2008 if the package is running on SQL Server.
  8. Like the source, the destination adapter requires configuration,both in the connection and table that the rows should be inserted into as well as in mapping the data flow columns to thedestination table columns.
  9. As with the source and destination adapters, you drag transformations from the DataFlow toolbox onto the Data Flow tab of the SSIS Designer, and edit them by right-clicking thetransformation you want to change and then clicking Edit. You connect sources, transformations,and destinations through data paths, which you create by dragging the output arrowonto another component in the data flow. The green data path arrows are for rows that aresuccessfully transformed, and the red output path arrows are for rows that failed the transformationbecause of an error, such as a truncation or conversion error. Figure 1-25 shows adata flow that connects a source to several transformations through data paths and onto adestination.
  10. Notice that this OLE DB Destination uses the AdventureWorksDW2008 connection and is configured by default to use the Table Or View—Fast Load option of the Data Access Mode drop-down list. This means that records will be processed with bulk insert statements rather than one row at a time.Figure 1-24 shows the Mappings tab of the same OLE DB Destination Editor. This is where you map columns available from the data flow to the destination columns in the destinationadapter. All the destination adapters have a Mappings tab. FIGURE 1-24 Each destination adapter requires you to map data from the data flow input columns to thedestination columns.Notice that not all columns are mapped. However, if one of the unmapped destination columnsis marked as NOT NULL, the destination fails the package when it is run. In the sectiontitled “Using Transformations” later in this lesson, you see how to use the Slowly ChangingDimension Transformation to handle new records and updates.
  11. Transformations perform a wide variety of operations on the underlying data, and the transformation you choose depends on your data processing requirements. Some transformations operate similarly to other transformations; therefore, we can categorize them into naturalgroupings of like components.
  12. Character MapPerforms common text operations, such as Uppercase, and allowsadvanced linguistic bit conversion operations.Copy Column Duplicates column values in each row to new named columns.Data ConversionCreates new columns in each row based on new data typesconverted from other columns—for example, converting text tonumeric.Derived Column Uses the SSIS Expression language to perform in-place calculationson existing values; alternatively, allows the addition ofnew columns based on expressions and calculations from othercolumns and variables.Export Column Exports binary large object (BLOB) columns, one row at a time,to a file.Import Column Loads binary files such as images into the pipeline; intended for aBLOB data type destination.OLE DB Command Performs database operations such as updates and deletes,one row at a time, based on mapped parameters from inputrows.
  13. Aggregate Associates records based on defined groupings and generates aggregationssuch as SUM, MAX, MIN, and COUNT.Percent Sampling Filters the input rows by allowing only a defined percent to bepassed to the output path.Pivot Takes multiple input rows and pivots the rows to generate an outputwith more columns based on the original row values.Row Sampling Outputs a fixed number of rows, sampling the data from the entireinput, no matter how much larger than the defined output theinput is.Sort Orders the input based on defined sort columns and sort directionand allows the removal of duplicates across the sort columns.UnpivotTakes a single row and outputs multiple rows, moving columnvalues to the new row based on defined columns.
  14. Cache Transform Allows data that will be used in a Lookup Transformation to becached and available for multiple Lookup components.Conditional Split Routes or filters data based on Boolean expressions to one or moreoutputs, from which each row can be sent out only one outputpath.Multicast Generates one or more identical outputs, from which every row issent out every output.Union All Combines one or more similar inputs, stacking rows one on top ofanother, based on matching columnsMerge Join Joins the rows of two sorted inputs based on a defined joincolumn(s), adding columns from each source.Lookup Allows matching between pipeline column values to external databasetables; additional columns can be added to the data flow fromthe external table.Merge Combines the rows of two similar sorted inputs, one on top of theother, based on a defined sort key.
  15. Audit Adds additional columns to each row based on system packagevariables such as ExecutionStartTime and PackageName.Row Count Tracks the number of rows that flow through the transformationand stores the number in a package variable after the final row.
  16. Slowly Changing DimensionProcesses dimension changes, including tracking dimensionhistory and updating dimension values. The Slowly ChangingDimension Transformation handles these common dimensionchange types: Historical Attributes, Fixed Attributes, and Changing Attributes.Data Mining QueryApplies input rows against a data mining model for prediction.Fuzzy Grouping Associates column values with a set of rows based on similarity,for data cleansing.Fuzzy Lookup Joins a data flow input to a reference table based on column similarity. The Similarity Threshold setting specifies the closeness of allowed matches—a high setting means that matching values are closer in similarity.Script Component Provides VB.NET scripting capabilities against rows, columns, inputs, and outputs in the data flow pipeline.Term Extraction Analyzes text input columns for English nouns and noun phrases.Term Lookup Analyzes text input columns against a user-defined set of words for association.