Meet the experts dwo bde vds v7

•

1 gostou•849 visualizações

mmathipra

Meet the Experts: Data Warehouse Optimization with Big Data Edition and Vibe Data Stream

Dados e análise

Meet the Experts Series
How to use the Informatica Big Data
Edition and Vibe Data Stream for Hadoop-
based Data Warehouse Offloading
Informatica Product Desk
Murthy Mathiprakasam, Principal Product Marketing Manager
Sumeet Agrawal, Principal Product Manager
Jeff Rydz, Director of Big Data Solutions
Amrish Thakkar, Senior Product Manager
Knowledge
Series

Informatica PowerExchange & Vibe Data Stream
Vibe Data Stream
Real-time Data Integration
Multiple
Targets
Real-Time
Collection
Easy
Deployment
Highly
Available
Guaranteed
Delivery
Continuous
Streaming
PowerExchange
Batch Data Integration
Cloud &
SaaS Apps
Relational
& Flat Files
Hadoop &
NoSQL
MPP
Appliances
Social Data
Enterprise
Applications

4
Your Mission
Deploy the right workloads
On the right platforms
So the right people
Get the right data
At the right time
What’s the Mission of Every Data Services Team?

Data Warehouses Are Not Optimized For Modern Needs
7Source: Appfluent
More
Data
Supply
More
Data
Demand
80%
20%
Transformations
/ Data Loads
Analytical
Queries
Data Warehouse
Resource Utilization

Hadoop Can Help Drive Efficiency & Scalability
8
Machine Device,
Cloud
Relational, Mainframe
Social Media,
Web Logs
Data
Warehouse
Focused on
Analytics
Hadoop
Focused On
Data
Preparation
Source
Data

But Enterprises Are Approaching Hadoop With Caution
9
Slow Time
To Production
Challenging
to Staff
Risk of
Rework

• Robust, responsive &
reliable technology
platform
• Uniform version of
truth, high quality,
reliability,
accessibility, &
auditability
• Reduce costs, reduce
complexity, drive re-
use
• Unable to staff Big
Data projects
• Only 2 Hadoop
developers available
• Existing data
warehouse
architecture including
various technologies
• Quickly staffed Big
Data projects with
100 Informatica
developers
• Integrated with rest of
Data Warehouse
infrastructure
(e.g., Teradata)
• Logically consistent
information available
across all standard
platforms
Business need Challenge
Results With
Informatica
©2014 Informatica. Proprietary and Confidential. Do not distribute.
Case Study: Large Financial Services Firm

14
Data Is Growing and More Distributed
TB
Time
 Social media
 Web logs
 Sensor data

15
But Organizations Are Struggling to Harness It
Incomplete
Data Sets
Expensive
To Store
Low Fidelity
Analytics

Informatica Helps Lower Costs & Harness Real Time Data
16
Ingest Higher
Data Volumes
Lower Cost
Storage
Analytics On
All Data
10X Faster Streaming Technology
Informatica Vibe Data Stream

17
Informatica Vibe Data Stream
CEP (e.g.
Rulepoint,
Kinesis)
NoSQL (e.g.
Cassandra)

Use Cases for Streaming Data Into Hadoop
19
Predictive
Maintenance
Fraud
Detection
Pricing
Optimization
Machine Log
Analysis

22
Free Informatica 60-Day Trials
marketplace.informatica.com/bigdata
TRIAL DOWNLOADS
REFERENCE
ARCHITECTURES
TRAINING & WEBINARS

23
DWO Service Packages
DWO Needs
Analysis
Assessment
(Appfluent)
Architecture
Review
DWO Quick Start
Install & Configuration
Product Training
DWO Pilot
Install & Configuration
Product Training
Best Practices Knowledge Transfer

Mais conteúdo relacionado

Mais procurados

Open Source in the Energy Industry - Creating a New Operational Model for Dat...DataWorks Summit

Hadoop India Summit, Feb 2011 - InformaticaSanjeev Kumar

Capgemini Insights and Data DataWorks Summit/Hadoop Summit

Enterprise 360 - Graphs at the Center of a Data FabricPrecisely

EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...Capgemini

The Future of Data Management: The Enterprise Data HubCloudera, Inc.

Smart data for a predictive bankDataWorks Summit/Hadoop Summit

Delivering Self-Service Analytics using Big Data and Data Virtualization on t...Denodo

Unlocking data science in the enterprise - with Oracle and ClouderaCloudera, Inc.

Emergence of MongoDB as an Enterprise Data HubMongoDB

Gov & Private Sector Regulatory Compliance: Using Hadoop to Address RequirementsDataWorks Summit

Active Governance Across the Delta Lake with AlationDatabricks

Unlocking Big Data Silos in the Enterprise or the Cloud (Con7877)Jeffrey T. Pollock

The Future of Data Management: The Enterprise Data HubCloudera, Inc.

Enterprise Data Hub: The Next Big Thing in Big DataCloudera, Inc.

Informatica Becomes Part of the Business Data Lake EcosystemCapgemini

Flash session -goldengate--lht1053-lonJeffrey T. Pollock

Logical Data Warehouse and Data Lakes Denodo

Developing a Strategy for Data Lake GovernanceTony Baer

Modern Data Management for Federal ModernizationDenodo

Mais procurados (20)

Open Source in the Energy Industry - Creating a New Operational Model for Dat...

Hadoop India Summit, Feb 2011 - Informatica

Capgemini Insights and Data

Enterprise 360 - Graphs at the Center of a Data Fabric

EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...

The Future of Data Management: The Enterprise Data Hub

Smart data for a predictive bank

Delivering Self-Service Analytics using Big Data and Data Virtualization on t...

Unlocking data science in the enterprise - with Oracle and Cloudera

Emergence of MongoDB as an Enterprise Data Hub

Gov & Private Sector Regulatory Compliance: Using Hadoop to Address Requirements

Active Governance Across the Delta Lake with Alation

Unlocking Big Data Silos in the Enterprise or the Cloud (Con7877)

The Future of Data Management: The Enterprise Data Hub

Enterprise Data Hub: The Next Big Thing in Big Data

Informatica Becomes Part of the Business Data Lake Ecosystem

Flash session -goldengate--lht1053-lon

Logical Data Warehouse and Data Lakes

Developing a Strategy for Data Lake Governance

Modern Data Management for Federal Modernization

Destaque

Affecto Informatica World Tour 2015: The Age of EngagementAffecto

Idq summit2014 ronald damhof - it's all about the dataPrudenza B.V

Giip kb-hadoop sizingLowy Shin

Streaming real time data with Vibe Data StreamInformaticaMarketplace

Power Big Data Analytics with Informatica Cloud Integration for Redshift, Kin...Amazon Web Services

Cloud-Con: Informatica Vibe and Cloud Integration for the Hybrid EnterpriseDarren Cunningham

Informatica Big Data Edition - Profinit - Jan UlrychProfinit

Informatica big data and social mediaRamy Mahrous

Integrate Big Data into Your Organization with Informatica and PerficientPerficient, Inc.

Hadoop Operations - Best practices from the fieldUwe Printz

Discover Enterprise Security Features in Hortonworks Data Platform 2.1: Apach...Hortonworks

(BDT305) Lessons Learned and Best Practices for Running Hadoop on AWS | AWS r...Amazon Web Services

Hdp security overview Hortonworks

Atlanta Data Science Meetup | Qubole slidesQubole

Qubole - Big data in cloudDmitry Tolpeko

7 Big Data Challenges and How to Overcome ThemQubole

Big data architectures and the data lakeJames Serra

Data quality and data profilingShailja Khurana

Data quality architectureanicewick

Destaque (19)

Affecto Informatica World Tour 2015: The Age of Engagement

Idq summit2014 ronald damhof - it's all about the data

Giip kb-hadoop sizing

Streaming real time data with Vibe Data Stream

Power Big Data Analytics with Informatica Cloud Integration for Redshift, Kin...

Cloud-Con: Informatica Vibe and Cloud Integration for the Hybrid Enterprise

Informatica Big Data Edition - Profinit - Jan Ulrych

Informatica big data and social media

Integrate Big Data into Your Organization with Informatica and Perficient

Hadoop Operations - Best practices from the field

Discover Enterprise Security Features in Hortonworks Data Platform 2.1: Apach...

(BDT305) Lessons Learned and Best Practices for Running Hadoop on AWS | AWS r...

Hdp security overview

Atlanta Data Science Meetup | Qubole slides

Qubole - Big data in cloud

7 Big Data Challenges and How to Overcome Them

Big data architectures and the data lake

Data quality and data profiling

Data quality architecture

Semelhante a Meet the experts dwo bde vds v7

2015 02 12 talend hortonworks webinar challenges to hadoop adoptionHortonworks

Meet the Infochimps PlatformInfochimps, a CSC Big Data Business

Complement Your Existing Data Warehouse with Big Data & HadoopDatameer

Enterprise Hadoop is Here to Stay: Plan Your Evolution StrategyInside Analysis

C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...Hortonworks

Big Data Made Easy: A Simple, Scalable Solution for Getting Started with HadoopPrecisely

Big Data Tools: A Deep Dive into Essential ToolsFredReynolds2

Bridging the Big Data Gap in the Software-Driven WorldCA Technologies

Simplifying Big Data ETL with TalendEdureka!

BAR360 open data platform presentation at DAMA, SydneySai Paravastu

Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)Rittman Analytics

Exploring the Wider World of Big Data- Vasalis KapsalisNetAppUK

Transform Your Business with Big Data and Hortonworks Pactera_US

Transform You Business with Big Data and HortonworksHortonworks

Data Integration for Both Self-Service Analytics and IT Users Senturus

Talend webinarEdureka!

Hadoop as an Analytic Platform: Why Not?Inside Analysis

Better Total Value of Ownership (TVO) for Complex Analytic Workflows with the...ModusOptimum

Manipulating data with Talend. Learn how?Edureka!

Manipulating Data with Talend.Edureka!

Semelhante a Meet the experts dwo bde vds v7 (20)

2015 02 12 talend hortonworks webinar challenges to hadoop adoption

Meet the Infochimps Platform

Complement Your Existing Data Warehouse with Big Data & Hadoop

Enterprise Hadoop is Here to Stay: Plan Your Evolution Strategy

C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...

Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop

Big Data Tools: A Deep Dive into Essential Tools

Bridging the Big Data Gap in the Software-Driven World

Simplifying Big Data ETL with Talend

BAR360 open data platform presentation at DAMA, Sydney

Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)

Exploring the Wider World of Big Data- Vasalis Kapsalis

Transform Your Business with Big Data and Hortonworks

Transform You Business with Big Data and Hortonworks

Data Integration for Both Self-Service Analytics and IT Users

Talend webinar

Hadoop as an Analytic Platform: Why Not?

Better Total Value of Ownership (TVO) for Complex Analytic Workflows with the...

Manipulating data with Talend. Learn how?

Manipulating Data with Talend.

Último

Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics

Real-Time AI Streaming - AI Max PrincetonTimothy Spann

What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17

Data Analysis Project: Stroke PredictionBoston Institute of Analytics

Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann

Digital Marketing Plan, how digital marketing worksdeepakthakur548787

English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml

Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics

Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics

Networking Case Study prepared by teacher.pptxHimangsuNath

Insurance Churn Prediction Data Analysis ProjectBoston Institute of Analytics

Easter Eggs From Star Wars and in cars 1 and 217djon017

INTRODUCTION TO Natural language processingsocarem879

Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy

wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...KarteekMane1

6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)

Cyber awareness ppt on the recorded dataTecnoIncentive

Learn How Data Science Changes Our WorldEduminds Learning

Principles and Practices of Data VisualizationKianJazayeri1

FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone

Meet the experts dwo bde vds v7

1. Meet the Experts Series How to use the Informatica Big Data Edition and Vibe Data Stream for Hadoop- based Data Warehouse Offloading Informatica Product Desk Murthy Mathiprakasam, Principal Product Marketing Manager Sumeet Agrawal, Principal Product Manager Jeff Rydz, Director of Big Data Solutions Amrish Thakkar, Senior Product Manager Knowledge Series

2. Informatica Big Data Edition Standard Edition High Productivity Data Integration Governance Edition Comprehensive Data Governance Lineage & Glossary Profile Parse Discover ETL Profile Parse ETL Cleanse Includes restricted use Vibe Data Stream

3. Informatica PowerExchange & Vibe Data Stream Vibe Data Stream Real-time Data Integration Multiple Targets Real-Time Collection Easy Deployment Highly Available Guaranteed Delivery Continuous Streaming PowerExchange Batch Data Integration Cloud & SaaS Apps Relational & Flat Files Hadoop & NoSQL MPP Appliances Social Data Enterprise Applications

4. 4 Your Mission Deploy the right workloads On the right platforms So the right people Get the right data At the right time What’s the Mission of Every Data Services Team?

5. Meet the Experts Series How to use the Informatica Big Data Edition and Vibe Data Stream for Hadoop- based Data Warehouse Offloading Informatica Product Desk Murthy Mathiprakasam, Principal Product Marketing Manager Sumeet Agrawal, Principal Product Manager Jeff Rydz, Director of Big Data Solutions Amrish Thakkar, Senior Product Manager Knowledge Series

6. Big Data Edition

7. Data Warehouses Are Not Optimized For Modern Needs 7Source: Appfluent More Data Supply More Data Demand 80% 20% Transformations / Data Loads Analytical Queries Data Warehouse Resource Utilization

8. Hadoop Can Help Drive Efficiency & Scalability 8 Machine Device, Cloud Relational, Mainframe Social Media, Web Logs Data Warehouse Focused on Analytics Hadoop Focused On Data Preparation Source Data

9. But Enterprises Are Approaching Hadoop With Caution 9 Slow Time To Production Challenging to Staff Risk of Rework

10. Informatica Helps Lower Costs & Lower Risks 10 5X Developer Productivity Easier to Staff Easier to Adopt Innovations CleanseDiscover Profile Parse ETL Greater Efficiency Today, Higher Confidence For Tomorrow Informatica Big Data Edition Lineage & Glossary

11. 11 Demo

12. • Robust, responsive & reliable technology platform • Uniform version of truth, high quality, reliability, accessibility, & auditability • Reduce costs, reduce complexity, drive re- use • Unable to staff Big Data projects • Only 2 Hadoop developers available • Existing data warehouse architecture including various technologies • Quickly staffed Big Data projects with 100 Informatica developers • Integrated with rest of Data Warehouse infrastructure (e.g., Teradata) • Logically consistent information available across all standard platforms Business need Challenge Results With Informatica ©2014 Informatica. Proprietary and Confidential. Do not distribute. Case Study: Large Financial Services Firm

13. Vibe Data Stream

14. 14 Data Is Growing and More Distributed TB Time  Social media  Web logs  Sensor data

15. 15 But Organizations Are Struggling to Harness It Incomplete Data Sets Expensive To Store Low Fidelity Analytics

16. Informatica Helps Lower Costs & Harness Real Time Data 16 Ingest Higher Data Volumes Lower Cost Storage Analytics On All Data 10X Faster Streaming Technology Informatica Vibe Data Stream

17. 17 Informatica Vibe Data Stream CEP (e.g. Rulepoint, Kinesis) NoSQL (e.g. Cassandra)

18. 18 Demo

19. Use Cases for Streaming Data Into Hadoop 19 Predictive Maintenance Fraud Detection Pricing Optimization Machine Log Analysis

20. Next Steps

21. Learn More 21 www.dwoptimization.me

22. 22 Free Informatica 60-Day Trials marketplace.informatica.com/bigdata TRIAL DOWNLOADS REFERENCE ARCHITECTURES TRAINING & WEBINARS

23. 23 DWO Service Packages DWO Needs Analysis Assessment (Appfluent) Architecture Review DWO Quick Start Install & Configuration Product Training DWO Pilot Install & Configuration Product Training Best Practices Knowledge Transfer

24. Thank You! Informatica Product Desk

Notas do Editor

1
5
Data warehouse storage and CPU utilization are constrained by growing supply of data and demand for analytics Pushdown data transformations consume excess CPU cycles Analytical performance suffers Forces expansion of expensive platforms In addition, Informatica was quick to adopt this new data platform so organizations could use skills they already had today for ETL and data quality. In fact, with Informatica, developers can increase their productivity up to 5x while dramatically lowering both infrastructure costs and ongoing operational costs associated with BI/DW
Hadoop is ideally suited for unlimited data storage and processing and complex data analytics, often at 10 to 100 times less cost than traditional systems. But when Hadoop first began growing in popularity there was a lack of tooling so that developers had to resort to hand-coding ETL workloads in new languages and with a new shared-nothing paradigm called MapReduce. So while organizations could dramatically lower the cost of their infrastructure, ongoing operational labor costs continued to be a challenge. Hadoop developer skills are in high-demand and therefore can be difficult to find and retain.
Publish Subscribe Vibe Data Stream for Machine Data provides the ability to efficiently perform high volume (throughput), high velocity (speed), & high scale (large # of end points) streaming data collection across wide variety of sources over LAN & WAN environments to enable real-time & big data analytics, operational intelligence, and enterprise data warehousing. Some of the features and benefits of Vibe Data Stream are: Established high performance (>10X) real-time solution by leveraging fastest and most reliable high performance messaging technology UM messaging is a brokerless messaging system and this eliminates a lot of issues with traditional systems such as single point of failure, multiple hops, bottle neck, etc. This allows high performance and reliability with lower operational costs. High throughput solution for streaming, and guaranteed delivery Out of box support for wide variety of data sources (Sensors, Mobile Devices, log files, IoT, etc) High availability and Reliability Enterprise grade: Simplified configuration, deployment, administration and monitoring Vibe Data Stream front end is integrated in Informatica Admin Console and allows the user to manage and monitor the topology from within Admin Console. Vibe Data Stream leverages Apache zookeeper for configuration management. Once the user has defined the topology, deploying the topology will push the configuration into zookeeper. VDS nodes as they come up, will pull configuration from zookeeper to start with their operation. New Sources and Targets can be deployed without impacting the currently operational nodes. User can also add multiple nodes and load balance traffic across those nodes. VDS nodes are using Ultra Messaging as an infrastructure and as a result as very light weight. This allows you to embed VDS node in devices with limited resources (CPU, memory, etc). High performance/efficient streaming data collection over LAN/WAN GUI interface provides ease of configuration, deployment & use Continuous ingestion of real-time generated data (sensors; logs; etc.). Machine generated & other data sources Enable real-time interactions & response Real-time delivery directly to multiple targets (batch/stream processing) Highly available; efficient; scalable Available ecosystem of light weight agents (sources & targets)
Big Data Edition Trials for Cloudera and Hortonworks Free trial for Vibe Data Stream Data Warehouse Optimization reference architecture co-written with Cloudera Informatica – Cloudera collaborative training course Data Warehouse optimization whitepaper for Informatica and MapR INFA/Hortonworks/Teradata joint webinar Tuesday, September 16, 2014 Hadoop 2.0: YARN to Further Optimize Data Processing 12:00 PM Eastern / 9:00 AM Pacific Data is exponentially increasing in both types and volumes, creating opportunities for businesses. To fully realize the potential of this new data, analysts recommend the shift from a single platform to a data ecosystem. Multiple systems are needed to exploit the variety and volume of data sources. A flexible data repository such as a data lake is needed to store the data. Technologically speaking Apache Hadoop 2 enables true data lake architectures. The introduction of YARN in particular added a pluggable framework that enabled new data access patterns in addition to MapReduce. An intelligent data management layer is needed to manage metadata and usage patterns as well as track consumption across these data platforms. Join us in this webinar as our panel of experts discusses how Hadoop can be used alongside the Enterprise Data Warehouse and with Data Integration tools to enable the optimization of data processing workloads for more efficient use of resources.
24

Meet the experts dwo bde vds v7

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Destaque

Destaque (19)

Semelhante a Meet the experts dwo bde vds v7

Semelhante a Meet the experts dwo bde vds v7 (20)

Último

Último (20)

Meet the experts dwo bde vds v7

Notas do Editor