SlideShare uma empresa Scribd logo
1 de 23
Informatica Overview
Ten Tools for Ten Big Data Areas
Series 01 Big Data Integration
www.sparkera.ca
Ten Tools for Ten Big Data Areas – Overview
2© Sparkera. Confidential. All Rights Reserved
10 Tools
10 Areas
Programming
SearchandIndex
First ETL fully on Yarn
Data storing platform
Data computing platform
SQL & Metadata
Visualize with just few clicks
Powerful as Java
Simple as Python
real-time
streaming
Made easier
Yours
Google
Lightning-fast cluster computing
Real-time distributed data store
High throughput
distributed messaging
Agenda
3© Sparkera. Confidential. All Rights Reserved
About data integration
2 About Informatica company and its approach
3 Informatica architecture, client, server components, developer tool overview
4 Informatica why and why not
5 Informatica job trend
1
Little About DI – Data Integration
• DI involves combining data residing in different sources and
providing users with a unified view of these data.
• DI process is also called Enterprise Information Integration (EII).
• DI usually means ETL - data extract, transformation, load.
• 80% of enterprise data projects' efforts are spent on DI work.
• Data cleansing, audit, master data management are usually
considered with DI.
© Sparkera. Confidential. All Rights Reserved
About Informatica Company
• Found in 1993
• 2014 revenue – US$1.05 billion
• Average growth rate 17% per year
• Employee – 5500+
• Customers – 5000
• Value customer covers up to 70% of global top 500 company
• Partners – 500+
• Cover various business, industries and government organizations
including telecommunications, health care, financial and insurance
services.
• A company dedicate on data integration and management
• Bought out as private company on August 2015.
© Sparkera. Confidential. All Rights Reserved
The Tradition Approach
Application Database Partner Data
SWIFT NACHA HIPAA …
Cloud Computing Unstructured
87% of enterprises use hand-coding for data integration
75% of enterprises reported increased maintenance costs
Data
Warehouse
Data
Migration
Test Data
Management
& Archiving
Master Data
Management
Data
Synchronization
B2B Data
Exchange
Data
Consolidation
Complex
Event
Processing
Ultra
Messaging
© Sparkera. Confidential. All Rights Reserved
The Informatica Approach
Application Partner Data
SWIFT NACHA HIPAA …
Cloud Computing UnstructuredDatabase
Data
Warehouse
Data
Migration
Test Data
Management
& Archiving
Master Data
Management
Data
Synchronization
B2B Data
Exchange
Data
Consolidation
Complex
Event
Processing
Ultra
Messaging
© Sparkera. Confidential. All Rights Reserved
Informatica Latest Products v9.6
• Data Integration
 PowerCenter
 PowerExchange
• Master Data Management
• Cloud Integration
• Big Data
 BDE – Informatica Developer
 Big data parser
© Sparkera. Confidential. All Rights Reserved
Informatica PowerCenter Overview
• An ETL tool ( Extract, Transform and Load)
• The main advantages over other ETL tools lies in its robustness,
across OS, and high performance.
• It can read from a variety of different sources and write to as many
targets, while transforming data in between.
• The architecture design use SOA concept for better extensibility and
high availability
• Single sign on access, built-in version control, GUI development,
built-in schedule and monitoring
© Sparkera. Confidential. All Rights Reserved
Informatica PowerCenter Architecture
© Sparkera. Confidential. All Rights Reserved
Informatica PowerCenter Client Component
• Repository Manager – meta data management
• Designer – Tool to build mapping for ETL logic
• Workflow Manager – Tool to build/run session and workflow
• Workflow Monitor – Tool to monitor job running
• Administration Console (browser based) - administration
© Sparkera. Confidential. All Rights Reserved
Repository Manager
Navigate through multiple folders and repositories, export & import,
user & folder management
© Sparkera. Confidential. All Rights Reserved
Designer
Create and debug mapping & maplet including source, target,
transformations for core ETL logic.
© Sparkera. Confidential. All Rights Reserved
Workflow Manager
Create, schedule, and run session, workflow, worklet wrapping
mapping.
© Sparkera. Confidential. All Rights Reserved
Workflow Monitor
Monitor running statistics and control execution of workflows.
© Sparkera. Confidential. All Rights Reserved
Administration Console
Monitor and manager various of Informatica service, licenses, etc.
© Sparkera. Confidential. All Rights Reserved
Informatica PowerCenter Server Components
• Repository service: The Repository service manages the repository.
It retrieves, inserts, and updates metadata into the repository
database tables.
• Integration service: The Integration service runs sessions and
workflows.
• Web services hub: The Web services hub receives requests from
web service clients and exposes PowerCenter workflows as services.
• Informatica service: Overall service management and coordination
© Sparkera. Confidential. All Rights Reserved
Informatica Big Data Edition Overview
Extract, load, and transform with big data ecosystem.
© Sparkera. Confidential. All Rights Reserved
Informatica BDE Component - Developer
BDE is all in one tool and can fully push job running on Hadoop
Developer component
• Mapping – Tool to build mapping for ETL logic
• Maplet – Reusable mapping
• Workflow – Tool to build workflow
• Application – Tool to deploy mapping/workflow
Others
• Monitoring Console (browser based) – job monitoring
• Administration Console (browser based) - administration
© Sparkera. Confidential. All Rights Reserved
Why Informatica Product
• Proven technology leadership
• A track record of continuous innovation
• The most neutral trusted partner – very focus
• Long history of customer success
• Over 5000+ industry leaders relies on Informatica
• Major banks, telecom, insurance, energy, health, research
companies are using Informatica in Toronto
• Easy and popular to use
• Pull push job to Hadoop
• Connector for many kinds of source
• Performance and reliability
© Sparkera. Confidential. All Rights Reserved
Side Effect - When May Not To
• High price: 150K+ to start
• Get challenges from ELT – Leverage database for transformation.
Need investment on ETL server. Its push to database optimization
has limitations.
• Schedule, monitoring, and version control functions are limited
• BDE is relative new although the concept is great
• Alternatives - MS SSIS, Talend Studio, Pentaho Data Integration
© Sparkera. Confidential. All Rights Reserved
Informatica Job Trends
Level Junior Level
(20%)
Middle Level
(40%)
Expert Level
(40%)
Position ETL developer
Informatica dev.
DW developer
Sr. ETL developer
Data Specialist
ETL specialist
ETL designer
ETL Admin
Big data ETL dev.
BDE developer
Informatica architect
Informatica consultant
Tool PowerCenter Informatica
Developer
Other
Usage
Percentage
80% 10% 10%
© Sparkera. Confidential. All Rights Reserved
www.sparkera.ca
BIG DATA is not only about data,
but the understanding of the data
and how people use data actively to improve their life.

Mais conteúdo relacionado

Mais procurados

Unlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMA
Unlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMAUnlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMA
Unlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMA
Precisely
 
Hybrid Data Architecture: Integrating Hadoop with a Data Warehouse
Hybrid Data Architecture: Integrating Hadoop with a Data WarehouseHybrid Data Architecture: Integrating Hadoop with a Data Warehouse
Hybrid Data Architecture: Integrating Hadoop with a Data Warehouse
DataWorks Summit
 
Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data Virtualization
Denodo
 

Mais procurados (20)

Talend MDM
Talend MDMTalend MDM
Talend MDM
 
IBM - Transformation digitale et le SI des banques
IBM - Transformation digitale et le SI des banquesIBM - Transformation digitale et le SI des banques
IBM - Transformation digitale et le SI des banques
 
Webcast slides for "Low Risk and High Reward in App Decomm with InfoArchive a...
Webcast slides for "Low Risk and High Reward in App Decomm with InfoArchive a...Webcast slides for "Low Risk and High Reward in App Decomm with InfoArchive a...
Webcast slides for "Low Risk and High Reward in App Decomm with InfoArchive a...
 
Data Science Operationalization: The Journey of Enterprise AI
Data Science Operationalization: The Journey of Enterprise AIData Science Operationalization: The Journey of Enterprise AI
Data Science Operationalization: The Journey of Enterprise AI
 
Accelerate Return on Data
Accelerate Return on DataAccelerate Return on Data
Accelerate Return on Data
 
As400 mini
As400 miniAs400 mini
As400 mini
 
Information Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data LakesInformation Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data Lakes
 
Teradata Listener™: Radically Simplify Big Data Streaming
Teradata Listener™: Radically Simplify Big Data StreamingTeradata Listener™: Radically Simplify Big Data Streaming
Teradata Listener™: Radically Simplify Big Data Streaming
 
Open Development
Open DevelopmentOpen Development
Open Development
 
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
 
Unlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMA
Unlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMAUnlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMA
Unlock Insights Enabling Data-driven Decisions with Databricks, Precisely & CMA
 
Modernize your Infrastructure and Mobilize Your Data
Modernize your Infrastructure and Mobilize Your DataModernize your Infrastructure and Mobilize Your Data
Modernize your Infrastructure and Mobilize Your Data
 
Who changed my data? Need for data governance and provenance in a streaming w...
Who changed my data? Need for data governance and provenance in a streaming w...Who changed my data? Need for data governance and provenance in a streaming w...
Who changed my data? Need for data governance and provenance in a streaming w...
 
Data Offload for the Chief Data Officer – how to move data onto Hadoop withou...
Data Offload for the Chief Data Officer – how to move data onto Hadoop withou...Data Offload for the Chief Data Officer – how to move data onto Hadoop withou...
Data Offload for the Chief Data Officer – how to move data onto Hadoop withou...
 
The Manulife Journey
The Manulife JourneyThe Manulife Journey
The Manulife Journey
 
xRM - as an Evolution of CRM
xRM - as an Evolution of CRMxRM - as an Evolution of CRM
xRM - as an Evolution of CRM
 
Organising the Data Lake - Information Management in a Big Data World
Organising the Data Lake - Information Management in a Big Data WorldOrganising the Data Lake - Information Management in a Big Data World
Organising the Data Lake - Information Management in a Big Data World
 
Hybrid Data Architecture: Integrating Hadoop with a Data Warehouse
Hybrid Data Architecture: Integrating Hadoop with a Data WarehouseHybrid Data Architecture: Integrating Hadoop with a Data Warehouse
Hybrid Data Architecture: Integrating Hadoop with a Data Warehouse
 
Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data Virtualization
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
 

Destaque

Apache Hadoop India Summit 2011 talk "Informatica and Big Data" by Snajeev Kumar
Apache Hadoop India Summit 2011 talk "Informatica and Big Data" by Snajeev KumarApache Hadoop India Summit 2011 talk "Informatica and Big Data" by Snajeev Kumar
Apache Hadoop India Summit 2011 talk "Informatica and Big Data" by Snajeev Kumar
Yahoo Developer Network
 
SendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data WarehousingSendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data Warehousing
Amazon Web Services
 

Destaque (7)

ETL Using Informatica Power Center
ETL Using Informatica Power CenterETL Using Informatica Power Center
ETL Using Informatica Power Center
 
informatica training | informatica Course | informatica online training | I...
informatica training | informatica Course | informatica online training  |  I...informatica training | informatica Course | informatica online training  |  I...
informatica training | informatica Course | informatica online training | I...
 
Apache Hadoop India Summit 2011 talk "Informatica and Big Data" by Snajeev Kumar
Apache Hadoop India Summit 2011 talk "Informatica and Big Data" by Snajeev KumarApache Hadoop India Summit 2011 talk "Informatica and Big Data" by Snajeev Kumar
Apache Hadoop India Summit 2011 talk "Informatica and Big Data" by Snajeev Kumar
 
Management in Informatica Power Center
Management in Informatica Power CenterManagement in Informatica Power Center
Management in Informatica Power Center
 
SendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data WarehousingSendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data Warehousing
 
Informatica Powercenter Architecture
Informatica Powercenter ArchitectureInformatica Powercenter Architecture
Informatica Powercenter Architecture
 
Informatica PowerCenter
Informatica PowerCenterInformatica PowerCenter
Informatica PowerCenter
 

Semelhante a Ten tools for ten big data areas 01 informatica

Amit Kumar_Resume
Amit Kumar_ResumeAmit Kumar_Resume
Amit Kumar_Resume
Amit Kumar
 
Data Integration for Both Self-Service Analytics and IT Users
Data Integration for Both Self-Service Analytics and IT Users Data Integration for Both Self-Service Analytics and IT Users
Data Integration for Both Self-Service Analytics and IT Users
Senturus
 

Semelhante a Ten tools for ten big data areas 01 informatica (20)

Oracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsOracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analytics
 
Ten tools for ten big data areas 02_Tableau
Ten tools for ten big data areas 02_TableauTen tools for ten big data areas 02_Tableau
Ten tools for ten big data areas 02_Tableau
 
Hadoop and Manufacturing
Hadoop and ManufacturingHadoop and Manufacturing
Hadoop and Manufacturing
 
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017
 
Tame Big Data with Oracle Data Integration
Tame Big Data with Oracle Data IntegrationTame Big Data with Oracle Data Integration
Tame Big Data with Oracle Data Integration
 
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
 
Amit Kumar_Resume
Amit Kumar_ResumeAmit Kumar_Resume
Amit Kumar_Resume
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
 
Unlock Hadoop Success with Cloudera Navigator Optimizer
Unlock Hadoop Success with Cloudera Navigator OptimizerUnlock Hadoop Success with Cloudera Navigator Optimizer
Unlock Hadoop Success with Cloudera Navigator Optimizer
 
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
 
Simplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduSimplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache Kudu
 
Complement Your Existing Data Warehouse with Big Data & Hadoop
Complement Your Existing Data Warehouse with Big Data & HadoopComplement Your Existing Data Warehouse with Big Data & Hadoop
Complement Your Existing Data Warehouse with Big Data & Hadoop
 
Database Security, Better Audits, Lower Costs
Database Security, Better Audits, Lower CostsDatabase Security, Better Audits, Lower Costs
Database Security, Better Audits, Lower Costs
 
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
 
Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18
 
Data Integration for Both Self-Service Analytics and IT Users
Data Integration for Both Self-Service Analytics and IT Users Data Integration for Both Self-Service Analytics and IT Users
Data Integration for Both Self-Service Analytics and IT Users
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
 
Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5
Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5
Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5
 

Último

VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRLLucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
imonikaupta
 
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
@Chandigarh #call #Girls 9053900678 @Call #Girls in @Punjab 9053900678
 
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
Diya Sharma
 
( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...
( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...
( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...
nilamkumrai
 

Último (20)

APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53
 
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Booking
 
Wagholi & High Class Call Girls Pune Neha 8005736733 | 100% Gennuine High Cla...
Wagholi & High Class Call Girls Pune Neha 8005736733 | 100% Gennuine High Cla...Wagholi & High Class Call Girls Pune Neha 8005736733 | 100% Gennuine High Cla...
Wagholi & High Class Call Girls Pune Neha 8005736733 | 100% Gennuine High Cla...
 
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
 
Enjoy Night⚡Call Girls Samalka Delhi >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Samalka Delhi >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Samalka Delhi >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Samalka Delhi >༒8448380779 Escort Service
 
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort ServiceBusty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
 
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
 
Russian Call Girls in %(+971524965298 )# Call Girls in Dubai
Russian Call Girls in %(+971524965298  )#  Call Girls in DubaiRussian Call Girls in %(+971524965298  )#  Call Girls in Dubai
Russian Call Girls in %(+971524965298 )# Call Girls in Dubai
 
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRLLucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
 
Real Escorts in Al Nahda +971524965298 Dubai Escorts Service
Real Escorts in Al Nahda +971524965298 Dubai Escorts ServiceReal Escorts in Al Nahda +971524965298 Dubai Escorts Service
Real Escorts in Al Nahda +971524965298 Dubai Escorts Service
 
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
 
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
 
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
 
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
 
( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...
( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...
( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...
 
𓀤Call On 7877925207 𓀤 Ahmedguda Call Girls Hot Model With Sexy Bhabi Ready Fo...
𓀤Call On 7877925207 𓀤 Ahmedguda Call Girls Hot Model With Sexy Bhabi Ready Fo...𓀤Call On 7877925207 𓀤 Ahmedguda Call Girls Hot Model With Sexy Bhabi Ready Fo...
𓀤Call On 7877925207 𓀤 Ahmedguda Call Girls Hot Model With Sexy Bhabi Ready Fo...
 
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
 
VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...
VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...
VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...
 
Russian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl ServiceRussian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl Service
 
Moving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providersMoving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providers
 

Ten tools for ten big data areas 01 informatica

  • 1. Informatica Overview Ten Tools for Ten Big Data Areas Series 01 Big Data Integration www.sparkera.ca
  • 2. Ten Tools for Ten Big Data Areas – Overview 2© Sparkera. Confidential. All Rights Reserved 10 Tools 10 Areas Programming SearchandIndex First ETL fully on Yarn Data storing platform Data computing platform SQL & Metadata Visualize with just few clicks Powerful as Java Simple as Python real-time streaming Made easier Yours Google Lightning-fast cluster computing Real-time distributed data store High throughput distributed messaging
  • 3. Agenda 3© Sparkera. Confidential. All Rights Reserved About data integration 2 About Informatica company and its approach 3 Informatica architecture, client, server components, developer tool overview 4 Informatica why and why not 5 Informatica job trend 1
  • 4. Little About DI – Data Integration • DI involves combining data residing in different sources and providing users with a unified view of these data. • DI process is also called Enterprise Information Integration (EII). • DI usually means ETL - data extract, transformation, load. • 80% of enterprise data projects' efforts are spent on DI work. • Data cleansing, audit, master data management are usually considered with DI. © Sparkera. Confidential. All Rights Reserved
  • 5. About Informatica Company • Found in 1993 • 2014 revenue – US$1.05 billion • Average growth rate 17% per year • Employee – 5500+ • Customers – 5000 • Value customer covers up to 70% of global top 500 company • Partners – 500+ • Cover various business, industries and government organizations including telecommunications, health care, financial and insurance services. • A company dedicate on data integration and management • Bought out as private company on August 2015. © Sparkera. Confidential. All Rights Reserved
  • 6. The Tradition Approach Application Database Partner Data SWIFT NACHA HIPAA … Cloud Computing Unstructured 87% of enterprises use hand-coding for data integration 75% of enterprises reported increased maintenance costs Data Warehouse Data Migration Test Data Management & Archiving Master Data Management Data Synchronization B2B Data Exchange Data Consolidation Complex Event Processing Ultra Messaging © Sparkera. Confidential. All Rights Reserved
  • 7. The Informatica Approach Application Partner Data SWIFT NACHA HIPAA … Cloud Computing UnstructuredDatabase Data Warehouse Data Migration Test Data Management & Archiving Master Data Management Data Synchronization B2B Data Exchange Data Consolidation Complex Event Processing Ultra Messaging © Sparkera. Confidential. All Rights Reserved
  • 8. Informatica Latest Products v9.6 • Data Integration  PowerCenter  PowerExchange • Master Data Management • Cloud Integration • Big Data  BDE – Informatica Developer  Big data parser © Sparkera. Confidential. All Rights Reserved
  • 9. Informatica PowerCenter Overview • An ETL tool ( Extract, Transform and Load) • The main advantages over other ETL tools lies in its robustness, across OS, and high performance. • It can read from a variety of different sources and write to as many targets, while transforming data in between. • The architecture design use SOA concept for better extensibility and high availability • Single sign on access, built-in version control, GUI development, built-in schedule and monitoring © Sparkera. Confidential. All Rights Reserved
  • 10. Informatica PowerCenter Architecture © Sparkera. Confidential. All Rights Reserved
  • 11. Informatica PowerCenter Client Component • Repository Manager – meta data management • Designer – Tool to build mapping for ETL logic • Workflow Manager – Tool to build/run session and workflow • Workflow Monitor – Tool to monitor job running • Administration Console (browser based) - administration © Sparkera. Confidential. All Rights Reserved
  • 12. Repository Manager Navigate through multiple folders and repositories, export & import, user & folder management © Sparkera. Confidential. All Rights Reserved
  • 13. Designer Create and debug mapping & maplet including source, target, transformations for core ETL logic. © Sparkera. Confidential. All Rights Reserved
  • 14. Workflow Manager Create, schedule, and run session, workflow, worklet wrapping mapping. © Sparkera. Confidential. All Rights Reserved
  • 15. Workflow Monitor Monitor running statistics and control execution of workflows. © Sparkera. Confidential. All Rights Reserved
  • 16. Administration Console Monitor and manager various of Informatica service, licenses, etc. © Sparkera. Confidential. All Rights Reserved
  • 17. Informatica PowerCenter Server Components • Repository service: The Repository service manages the repository. It retrieves, inserts, and updates metadata into the repository database tables. • Integration service: The Integration service runs sessions and workflows. • Web services hub: The Web services hub receives requests from web service clients and exposes PowerCenter workflows as services. • Informatica service: Overall service management and coordination © Sparkera. Confidential. All Rights Reserved
  • 18. Informatica Big Data Edition Overview Extract, load, and transform with big data ecosystem. © Sparkera. Confidential. All Rights Reserved
  • 19. Informatica BDE Component - Developer BDE is all in one tool and can fully push job running on Hadoop Developer component • Mapping – Tool to build mapping for ETL logic • Maplet – Reusable mapping • Workflow – Tool to build workflow • Application – Tool to deploy mapping/workflow Others • Monitoring Console (browser based) – job monitoring • Administration Console (browser based) - administration © Sparkera. Confidential. All Rights Reserved
  • 20. Why Informatica Product • Proven technology leadership • A track record of continuous innovation • The most neutral trusted partner – very focus • Long history of customer success • Over 5000+ industry leaders relies on Informatica • Major banks, telecom, insurance, energy, health, research companies are using Informatica in Toronto • Easy and popular to use • Pull push job to Hadoop • Connector for many kinds of source • Performance and reliability © Sparkera. Confidential. All Rights Reserved
  • 21. Side Effect - When May Not To • High price: 150K+ to start • Get challenges from ELT – Leverage database for transformation. Need investment on ETL server. Its push to database optimization has limitations. • Schedule, monitoring, and version control functions are limited • BDE is relative new although the concept is great • Alternatives - MS SSIS, Talend Studio, Pentaho Data Integration © Sparkera. Confidential. All Rights Reserved
  • 22. Informatica Job Trends Level Junior Level (20%) Middle Level (40%) Expert Level (40%) Position ETL developer Informatica dev. DW developer Sr. ETL developer Data Specialist ETL specialist ETL designer ETL Admin Big data ETL dev. BDE developer Informatica architect Informatica consultant Tool PowerCenter Informatica Developer Other Usage Percentage 80% 10% 10% © Sparkera. Confidential. All Rights Reserved
  • 23. www.sparkera.ca BIG DATA is not only about data, but the understanding of the data and how people use data actively to improve their life.

Notas do Editor

  1. Lightning-fast cluster computing