Enviar pesquisa
Carregar
Greenplum hadoop
•
1 gostou
•
498 visualizações
Chiou-Nan Chen
Seguir
Tecnologia
Denunciar
Compartilhar
Denunciar
Compartilhar
1 de 32
Baixar agora
Baixar para ler offline
Recomendados
IBM zEnterprise: Healthcare
IBM zEnterprise: Healthcare
Strategy Advisory Group
Oracle India Mop Delegation Visit to Colorado 051611
Oracle India Mop Delegation Visit to Colorado 051611
chandyGhosh
Intel and Big Data
Intel and Big Data
Amazon Web Services LATAM
Big data cloud cloud circle keynote_final laura colvine 8th november 2012
Big data cloud cloud circle keynote_final laura colvine 8th november 2012
IBM
Fujitsu keynote at Oracle OpenWorld 2012
Fujitsu keynote at Oracle OpenWorld 2012
Fujitsu Global
Agile BI : meeting the best of both worlds from departmental and enterprise BI
Agile BI : meeting the best of both worlds from departmental and enterprise BI
Jean-Michel Franco
IBM zEnterprise: Government
IBM zEnterprise: Government
Strategy Advisory Group
Managing Information Technology Services
Managing Information Technology Services
michaelmadsen
Recomendados
IBM zEnterprise: Healthcare
IBM zEnterprise: Healthcare
Strategy Advisory Group
Oracle India Mop Delegation Visit to Colorado 051611
Oracle India Mop Delegation Visit to Colorado 051611
chandyGhosh
Intel and Big Data
Intel and Big Data
Amazon Web Services LATAM
Big data cloud cloud circle keynote_final laura colvine 8th november 2012
Big data cloud cloud circle keynote_final laura colvine 8th november 2012
IBM
Fujitsu keynote at Oracle OpenWorld 2012
Fujitsu keynote at Oracle OpenWorld 2012
Fujitsu Global
Agile BI : meeting the best of both worlds from departmental and enterprise BI
Agile BI : meeting the best of both worlds from departmental and enterprise BI
Jean-Michel Franco
IBM zEnterprise: Government
IBM zEnterprise: Government
Strategy Advisory Group
Managing Information Technology Services
Managing Information Technology Services
michaelmadsen
The Disruption of Big Data - AWS India Summit 2012
The Disruption of Big Data - AWS India Summit 2012
Amazon Web Services
IBM zEnterprise: Distribution
IBM zEnterprise: Distribution
Strategy Advisory Group
Latest news phoenix
Latest news phoenix
Jeff Pearce
Investmentz Case Study
Investmentz Case Study
Sanjay Mehta
September 2 Technology Trends Rpaquet
September 2 Technology Trends Rpaquet
Tom_Webb
Intel Social Computing & Sustainability Issues
Intel Social Computing & Sustainability Issues
Umair Mohsin
IBM Storage Strategy in the Era of Smarter Computing
IBM Storage Strategy in the Era of Smarter Computing
Tony Pearson
Photizio Group Presentation
Photizio Group Presentation
LawtonSmith
Cloud Computing: da curiosidade para casos reais
Cloud Computing: da curiosidade para casos reais
soudW
E Business Integration. Enabling the Real Time Enterprise
E Business Integration. Enabling the Real Time Enterprise
Johan Blomme
Green IT - a Marketing Term or Sustainable Business, part 1
Green IT - a Marketing Term or Sustainable Business, part 1
MatsBerglind
Big Data Whitepaper - Streams and Big Insights Integration Patterns
Big Data Whitepaper - Streams and Big Insights Integration Patterns
Mauricio Godoy
Accenture Communications Research Pts Digital Lifestyle To Digital Lifeblood[1]
Accenture Communications Research Pts Digital Lifestyle To Digital Lifeblood[1]
khogan25
Lam Chee Keong
Lam Chee Keong
Amelia Velu
Cloud Computing: da curiosidade para casos reais
Cloud Computing: da curiosidade para casos reais
soudW
Cloud Computing: da curiosidade para casos reais
Cloud Computing: da curiosidade para casos reais
soudW
Kim Escherich - How Big Data Transforms Our World
Kim Escherich - How Big Data Transforms Our World
BigDataViz
IBM Software Day 2013. Turning opportunities into outcomes
IBM Software Day 2013. Turning opportunities into outcomes
IBM (Middle East and Africa)
Sales And Marketing Strategies For Sm Es By Ashim Bose Of The Aktion Group
Sales And Marketing Strategies For Sm Es By Ashim Bose Of The Aktion Group
Nasscom Chennai
101 cd 1630-1700
101 cd 1630-1700
Chiou-Nan Chen
102 1430-1445
102 1430-1445
Chiou-Nan Chen
101 cd 1315-1345
101 cd 1315-1345
Chiou-Nan Chen
Mais conteúdo relacionado
Mais procurados
The Disruption of Big Data - AWS India Summit 2012
The Disruption of Big Data - AWS India Summit 2012
Amazon Web Services
IBM zEnterprise: Distribution
IBM zEnterprise: Distribution
Strategy Advisory Group
Latest news phoenix
Latest news phoenix
Jeff Pearce
Investmentz Case Study
Investmentz Case Study
Sanjay Mehta
September 2 Technology Trends Rpaquet
September 2 Technology Trends Rpaquet
Tom_Webb
Intel Social Computing & Sustainability Issues
Intel Social Computing & Sustainability Issues
Umair Mohsin
IBM Storage Strategy in the Era of Smarter Computing
IBM Storage Strategy in the Era of Smarter Computing
Tony Pearson
Photizio Group Presentation
Photizio Group Presentation
LawtonSmith
Cloud Computing: da curiosidade para casos reais
Cloud Computing: da curiosidade para casos reais
soudW
E Business Integration. Enabling the Real Time Enterprise
E Business Integration. Enabling the Real Time Enterprise
Johan Blomme
Green IT - a Marketing Term or Sustainable Business, part 1
Green IT - a Marketing Term or Sustainable Business, part 1
MatsBerglind
Big Data Whitepaper - Streams and Big Insights Integration Patterns
Big Data Whitepaper - Streams and Big Insights Integration Patterns
Mauricio Godoy
Accenture Communications Research Pts Digital Lifestyle To Digital Lifeblood[1]
Accenture Communications Research Pts Digital Lifestyle To Digital Lifeblood[1]
khogan25
Lam Chee Keong
Lam Chee Keong
Amelia Velu
Cloud Computing: da curiosidade para casos reais
Cloud Computing: da curiosidade para casos reais
soudW
Cloud Computing: da curiosidade para casos reais
Cloud Computing: da curiosidade para casos reais
soudW
Kim Escherich - How Big Data Transforms Our World
Kim Escherich - How Big Data Transforms Our World
BigDataViz
IBM Software Day 2013. Turning opportunities into outcomes
IBM Software Day 2013. Turning opportunities into outcomes
IBM (Middle East and Africa)
Sales And Marketing Strategies For Sm Es By Ashim Bose Of The Aktion Group
Sales And Marketing Strategies For Sm Es By Ashim Bose Of The Aktion Group
Nasscom Chennai
Mais procurados
(19)
The Disruption of Big Data - AWS India Summit 2012
The Disruption of Big Data - AWS India Summit 2012
IBM zEnterprise: Distribution
IBM zEnterprise: Distribution
Latest news phoenix
Latest news phoenix
Investmentz Case Study
Investmentz Case Study
September 2 Technology Trends Rpaquet
September 2 Technology Trends Rpaquet
Intel Social Computing & Sustainability Issues
Intel Social Computing & Sustainability Issues
IBM Storage Strategy in the Era of Smarter Computing
IBM Storage Strategy in the Era of Smarter Computing
Photizio Group Presentation
Photizio Group Presentation
Cloud Computing: da curiosidade para casos reais
Cloud Computing: da curiosidade para casos reais
E Business Integration. Enabling the Real Time Enterprise
E Business Integration. Enabling the Real Time Enterprise
Green IT - a Marketing Term or Sustainable Business, part 1
Green IT - a Marketing Term or Sustainable Business, part 1
Big Data Whitepaper - Streams and Big Insights Integration Patterns
Big Data Whitepaper - Streams and Big Insights Integration Patterns
Accenture Communications Research Pts Digital Lifestyle To Digital Lifeblood[1]
Accenture Communications Research Pts Digital Lifestyle To Digital Lifeblood[1]
Lam Chee Keong
Lam Chee Keong
Cloud Computing: da curiosidade para casos reais
Cloud Computing: da curiosidade para casos reais
Cloud Computing: da curiosidade para casos reais
Cloud Computing: da curiosidade para casos reais
Kim Escherich - How Big Data Transforms Our World
Kim Escherich - How Big Data Transforms Our World
IBM Software Day 2013. Turning opportunities into outcomes
IBM Software Day 2013. Turning opportunities into outcomes
Sales And Marketing Strategies For Sm Es By Ashim Bose Of The Aktion Group
Sales And Marketing Strategies For Sm Es By Ashim Bose Of The Aktion Group
Destaque
101 cd 1630-1700
101 cd 1630-1700
Chiou-Nan Chen
102 1430-1445
102 1430-1445
Chiou-Nan Chen
101 cd 1315-1345
101 cd 1315-1345
Chiou-Nan Chen
101 ab 1415-1445
101 ab 1415-1445
Chiou-Nan Chen
101 ab 1530-1600
101 ab 1530-1600
Chiou-Nan Chen
101 cd 1345-1415
101 cd 1345-1415
Chiou-Nan Chen
101 ab 1600-1630
101 ab 1600-1630
Chiou-Nan Chen
102 1630 1700
102 1630 1700
Chiou-Nan Chen
Destaque
(8)
101 cd 1630-1700
101 cd 1630-1700
102 1430-1445
102 1430-1445
101 cd 1315-1345
101 cd 1315-1345
101 ab 1415-1445
101 ab 1415-1445
101 ab 1530-1600
101 ab 1530-1600
101 cd 1345-1415
101 cd 1345-1415
101 ab 1600-1630
101 ab 1600-1630
102 1630 1700
102 1630 1700
Semelhante a Greenplum hadoop
Rob anderson
Rob anderson
Eduserv
OSC2012: Big Data Using Open Source: Netapp Project - Technical
OSC2012: Big Data Using Open Source: Netapp Project - Technical
Accenture the Netherlands
Crunching “Big Data” to Drive 2012 Revenue Growth: The 5 Myths of Sales & Mar...
Crunching “Big Data” to Drive 2012 Revenue Growth: The 5 Myths of Sales & Mar...
MarketBridge
Intel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick Knupffer
IntelAPAC
01 im overview high level
01 im overview high level
James Findlay
Scenari evolutivi nello snellimento dei sistemi informativi
Scenari evolutivi nello snellimento dei sistemi informativi
Fondazione CUOA
September 2 Technology Trends Rpaquet
September 2 Technology Trends Rpaquet
Tom_Webb
IBM Smarter Business 2012 - PureSystems - PureData
IBM Smarter Business 2012 - PureSystems - PureData
IBM Sverige
Enterprise Services Solutions
Enterprise Services Solutions
Karya Technologies
Big Data World Forum
Big Data World Forum
bigdatawf
Information Management: Answering Today’s Enterprise Challenge
Information Management: Answering Today’s Enterprise Challenge
Bob Rhubart
Tackling big data with hadoop and open source integration
Tackling big data with hadoop and open source integration
DataWorks Summit
(ATS4-GS03) Partner Session - Intel Balanced Cloud Solutions for the Healthca...
(ATS4-GS03) Partner Session - Intel Balanced Cloud Solutions for the Healthca...
BIOVIA
Privacy final presentaiton
Privacy final presentaiton
Stanford University
Privacy lecture 7 partners
Privacy lecture 7 partners
Stanford University
Privacy lecture 8 resources
Privacy lecture 8 resources
Stanford University
The Next Big Thing: Industry Experts Share Pioneering Technical Advancements ...
The Next Big Thing: Industry Experts Share Pioneering Technical Advancements ...
Career Communications Group
Intel Cloud Summit: Big Data
Intel Cloud Summit: Big Data
IntelAPAC
48 benot-long
48 benot-long
KBIZEAU
What is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use Cases
Tony Pearson
Semelhante a Greenplum hadoop
(20)
Rob anderson
Rob anderson
OSC2012: Big Data Using Open Source: Netapp Project - Technical
OSC2012: Big Data Using Open Source: Netapp Project - Technical
Crunching “Big Data” to Drive 2012 Revenue Growth: The 5 Myths of Sales & Mar...
Crunching “Big Data” to Drive 2012 Revenue Growth: The 5 Myths of Sales & Mar...
Intel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick Knupffer
01 im overview high level
01 im overview high level
Scenari evolutivi nello snellimento dei sistemi informativi
Scenari evolutivi nello snellimento dei sistemi informativi
September 2 Technology Trends Rpaquet
September 2 Technology Trends Rpaquet
IBM Smarter Business 2012 - PureSystems - PureData
IBM Smarter Business 2012 - PureSystems - PureData
Enterprise Services Solutions
Enterprise Services Solutions
Big Data World Forum
Big Data World Forum
Information Management: Answering Today’s Enterprise Challenge
Information Management: Answering Today’s Enterprise Challenge
Tackling big data with hadoop and open source integration
Tackling big data with hadoop and open source integration
(ATS4-GS03) Partner Session - Intel Balanced Cloud Solutions for the Healthca...
(ATS4-GS03) Partner Session - Intel Balanced Cloud Solutions for the Healthca...
Privacy final presentaiton
Privacy final presentaiton
Privacy lecture 7 partners
Privacy lecture 7 partners
Privacy lecture 8 resources
Privacy lecture 8 resources
The Next Big Thing: Industry Experts Share Pioneering Technical Advancements ...
The Next Big Thing: Industry Experts Share Pioneering Technical Advancements ...
Intel Cloud Summit: Big Data
Intel Cloud Summit: Big Data
48 benot-long
48 benot-long
What is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use Cases
Mais de Chiou-Nan Chen
Moving NEON to 64 bits
Moving NEON to 64 bits
Chiou-Nan Chen
64-bit Android
64-bit Android
Chiou-Nan Chen
Intelligent Power Allocation
Intelligent Power Allocation
Chiou-Nan Chen
3. v sphere big data extensions
3. v sphere big data extensions
Chiou-Nan Chen
4. v sphere big data extensions hadoop
4. v sphere big data extensions hadoop
Chiou-Nan Chen
2. hadoop
2. hadoop
Chiou-Nan Chen
1. beyond mission critical virtualizing big data and hadoop
1. beyond mission critical virtualizing big data and hadoop
Chiou-Nan Chen
5. pivotal hd 2013
5. pivotal hd 2013
Chiou-Nan Chen
Emc keynote 1130 1200
Emc keynote 1130 1200
Chiou-Nan Chen
Emc keynote 1030 1130
Emc keynote 1030 1130
Chiou-Nan Chen
Emc keynote 0945 1030
Emc keynote 0945 1030
Chiou-Nan Chen
Emc keynote 0930 0945
Emc keynote 0930 0945
Chiou-Nan Chen
102 1600-1630
102 1600-1630
Chiou-Nan Chen
102 1530-1600
102 1530-1600
Chiou-Nan Chen
102 1430-1445
102 1430-1445
Chiou-Nan Chen
102 1315-1345
102 1315-1345
Chiou-Nan Chen
102 1445 1515
102 1445 1515
Chiou-Nan Chen
101 cd 1600-1630
101 cd 1600-1630
Chiou-Nan Chen
101 cd 1445-1515
101 cd 1445-1515
Chiou-Nan Chen
101 cd 1415-1445
101 cd 1415-1445
Chiou-Nan Chen
Mais de Chiou-Nan Chen
(20)
Moving NEON to 64 bits
Moving NEON to 64 bits
64-bit Android
64-bit Android
Intelligent Power Allocation
Intelligent Power Allocation
3. v sphere big data extensions
3. v sphere big data extensions
4. v sphere big data extensions hadoop
4. v sphere big data extensions hadoop
2. hadoop
2. hadoop
1. beyond mission critical virtualizing big data and hadoop
1. beyond mission critical virtualizing big data and hadoop
5. pivotal hd 2013
5. pivotal hd 2013
Emc keynote 1130 1200
Emc keynote 1130 1200
Emc keynote 1030 1130
Emc keynote 1030 1130
Emc keynote 0945 1030
Emc keynote 0945 1030
Emc keynote 0930 0945
Emc keynote 0930 0945
102 1600-1630
102 1600-1630
102 1530-1600
102 1530-1600
102 1430-1445
102 1430-1445
102 1315-1345
102 1315-1345
102 1445 1515
102 1445 1515
101 cd 1600-1630
101 cd 1600-1630
101 cd 1445-1515
101 cd 1445-1515
101 cd 1415-1445
101 cd 1415-1445
Último
presentation ICT roal in 21st century education
presentation ICT roal in 21st century education
jfdjdjcjdnsjd
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
MIND CTI
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
Igalia
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Edi Saputra
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Miguel Araújo
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
lior mazor
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
wesley chun
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
Zilliz
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
apidays
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
apidays
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
Martijn de Jong
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
MadyBayot
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
ThousandEyes
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
Khushali Kathiriya
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
Andrey Devyatkin
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
rafiqahmad00786416
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Drew Madelung
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
Khem
Architecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
Último
(20)
presentation ICT roal in 21st century education
presentation ICT roal in 21st century education
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
Architecting Cloud Native Applications
Architecting Cloud Native Applications
Greenplum hadoop
1.
© Copyright 2012
EMC Corporation. All rights reserved. 1
2.
整合分析結構與非結構
性資料暨應用案例 Greenplum Enable Big Data Analytics 邱垂吉 Jimmy Chiu 技術顧問/EMC Greenplum Taiwan © Copyright 2012 EMC Corporation. All rights reserved. 2
3.
Volume, Variety, Velocity,
Value + Complexity New insights on Contextual and customers, products, Velocity Volume location-aware and operations delivery to any Big Data device Variety Complexity Documents Transactional Smart Grid Images Audio Text Video Data • Volume: data volumes approaching multiple petabytes • Velocity: data being generated and ingested for analysis in real-time • Variety: tabular, documents, e-mail, metering, network, video, image, audio • Complexity: different standards, domain rules, and storage formats per data type Gartner March 2011 © Copyright 2010 EMC Corporation. All rights reserved. 3
4.
Sample Big Data
Scenarios LOAN PROCESSING AUTO INSURANCE SMART GRID ANALYTICS IN BANKING IN P&C INSURANCE IN UTILITIES/ENERGY REAL-TIME STATISTICAL PROACTIVE EMERGENCY RESPONSE VIDEO ANALYTICS IN HEALTHCARE IN RETAIL PROCESS CONTROL IN MANUFACTURING © Copyright 2010 EMC Corporation. All rights reserved. 4
5.
Big Data Analytics
For Competitive Advantage Suppliers Suppliers Who are my most valuable Manufacturing customers? Manufacturing Inventory Inventory Physical Assets Physical Assets What are my most Distribution important Services Distribution products? Personal Marketing Services Mass Additional Marketing Profits What are my most successful campaigns? Customers Customers Today’s Business Model Big Data Analytics Business Model © Copyright 2010 EMC Corporation. All rights reserved. 5
6.
Big Data meets
Fast Data Social and Personal – Every Minutes: •Google gets more than 2 million search queries •About 47,000 people download an App •Some 100,000 tweets hit Twitter •Almost 300,000 people log on to Facebook Business and Transactional: •CERN (European Organization for Nuclear Research) generates 40TB/sec of scientific data •Wal-Mart – 1 million transactions per hour •World’s top systems currently trade at faster than 50 microseconds •New York Stock Exchange generates 1TB of new trading data daily © Copyright 2010 EMC Corporation. All rights reserved. 6
7.
Working together, they
enable entirely New Business Models Big Data allows you to find opportunities you didn’t know you had. Fast Data allows you to respond to opportunities before they are gone. In the Financial Services Industry, large quantities of historical data need to be processed against a growing number of fast-moving data feeds. Batch processing is no longer a suitable solution! © Copyright 2010 EMC Corporation. All rights reserved. 7
8.
Effective Customer Segmentation
is all about blending Structured and Unstructured Data – Transaction data (structured data) tells you what the customer did. – Unstructured data can tell you why they did it, why some others did not, what else they need or want, and what problems they may have. © Copyright 2010 EMC Corporation. All rights reserved. 8
9.
Big Data Architecture
Solving Big Data challenge involves more than just Requirements managing volumes of data. ― Gartner • Multiple data types: structured, semi-structured, unstructured • Integrated data stores: real-time, traditional, data warehouse • Modern development tools: Java, lightweight messages, mobile-enabled • Cloud-enabled: elastic scale, self-healing Beware point solutions – integration is critical! © Copyright 2010 EMC Corporation. All rights reserved. 9
10.
Greenplum Overview © Copyright
2010 EMC Corporation. All rights reserved. 10
11.
Greenplum Product Line ©
Copyright 2010 EMC Corporation. All rights reserved. 11
12.
Architecture of Greenplum Flexible
framework for processing large datasets Process large datasets with support for SQL both SQL and MapReduce MapReduce Master Master Master servers optimize queries for the most efficient query execution Interconnect for continuous pipelining of data processing Segment servers process queries close to the data in parallel MPP Scatter/Gather streaming for fast loading of data © Copyright 2010 EMC Corporation. All rights reserved. 12
13.
Greenplum MPP Share-Nothing
Arch. MPP Share Share Disk Share nothing everything eg: eg: eg: Oracle RAC Greenplum Unix server Intranet Master Intranet DB DB DB DB DB DB DB DB DB SAN/FC Disk SAN Disk Disk Disk Disk Share disk © Copyright 2010 EMC Corporation. All rights reserved. 13
14.
Benefits of the
Greenplum Database Architecture • Simplicity – Parallelism is automatic – no manual partitioning required – No complex tuning required – just load and query – HA – Best of breed x86 and Ethernet networking technologies • Scalability – Linear scalability – Each node adds storage, query performance, loading performance • Flexibility – Fully parallelism for SQL92, SQL99, SQL2003 OLAP, MapReduce – Any schema (star, snowflake, 3NF, hybrid, etc) – Rich extensibility and language support (Perl, Python, R, C, etc) – Structure, semi-structure and unstructure © Copyright 2010 EMC Corporation. All rights reserved. 14
15.
Greenplum and Hadoop
Analytics Semi-Structured Structured Machine Data UnStructured ERP/CRM Logs Images/Sound Ad-hoc Analysis batch reporting on static data Dynamic Data © Copyright 2010 EMC Corporation. All rights reserved. 15
16.
Big Data Analytics The
Power of Data Co-Processing Greenplum Chorus Analytic Productivity & Tool Integration End-to-end Platform Management & Control Data Access And Query Greenplum Commander SQL, MapReduce, SAS, MADLib, Mahout, R, and others SQL Engine MapReduce Engine parallel For Unstructured Data For Structured Data data exchange •Enterprise ready Apache • In-database Advanced Analytics Hadoop • Extreme performance on •Faster, more dependable, and commodity hardware parallel easier to use data exchange Greenplum Database Greenplum Hadoop Network Parallel Loading Of All Data Types © Copyright 2010 EMC Corporation. All rights reserved. 16
17.
Greenplum Hadoop • Greenplum
HD – Enterprise-ready Apache Hadoop – Proven at Scale in 1,000 node Analytics Workbench – Single product with 2 storage options (Isilon & HDFS) • Enterprise Edition becomes Greenplum MR: – Advanced features – 100% API compatible – Software-only product © Copyright 2010 EMC Corporation. All rights reserved. 17
18.
AWB Update Analytics Workbench
Operational! •1025 nodes operational •1011 nodes with GPHD installed •8 total projects have been on boarded from university collaboration to partner technology evaluation Proposals accepted by customer engagement team – info@analyticsworkbench.com •Engagement team will learn project objectives •JEDI council approves/disproves project based on technical feasibility and alignment with company goals •Projects informed of decisions and timelines Cluster access via - http://portal.analyticsworkbench.com/ © Copyright 2010 EMC Corporation. All rights reserved. 18
19.
Apache Hadoop Pain
Points • Poor Job and Application Monitoring Monitoring Solution • Non-existent Performance Monitoring Operability • Complex System Configuration and Manageability and • No Data Format Interoperability & Manageability Storage Abstractions • Poor Dimensional Lookup Performance Performance • Very poor Random Access and Serving Performance © Copyright 2010 EMC Corporation. All rights reserved. 19
20.
Greenplum MR: Enterprise Edition
Stack 100% APACHE Enhanced Monitoring INTERFACE Hive Pig HBase Zookeeper MapReduce Framework (MapRed) Distributed File System © Copyright 2010 EMC Corporation. All rights reserved. 20
21.
Greenplum MR: Enterprise
Edition Enterprise-Ready Hadoop Platform for Unstructured Data • 2 – 5x Faster than Apache Faster Hadoop • High Availability Reliable • Mirroring Easier to • NFS mountable Use • Graphical System Management © Copyright 2010 EMC Corporation. All rights reserved. 21
22.
Greenplum MR Simple
Management • Health Monitoring • Cluster Administratio n • Application Provisioning © Copyright 2010 EMC Corporation. All rights reserved. 22
23.
Rack Level Monitoring ©
Copyright 2010 EMC Corporation. All rights reserved. 23
24.
Greenplum MR Delivers
True Return on Investment • NFS direct access to simply load and access data directly in a Hadoop cluster • Enables standard tools and utilities to work directly on data contained in Hadoop • Heatmap user interface provides full cluster visibility and control. • Eliminates all single points of failure • High Availability for Job Tracker , NameNode & NFS • Snapshots allow point-in-time data protection and recovery. • Mirroring for business continuity includes wide area replication support. • Speeds jobs by 2X – 5X • Provides faster performance with ½ the hardware • Substantial capital and operating expense savings © Copyright 2010 EMC Corporation. All rights reserved. 24
25.
EMC Greenplum
Fastest data loading Advanced analytics DATA IN IN-DATABASE ANALYTICS DECISIONS OUT Scatter/Gather Streaming Optimized for fast query execution Unified data access for greater technology for the world’s and linear scalability insight and value from data fastest data loading •Move processing closer to data •Enable parallel analysis •Eliminate data load •Shared-nothing, massively across the enterprise bottlenecks parallel processing (MPP) •Open platform with broad •Clean and integrate new data scale-out architecture language support •Several loading options, •Computing is automatically •Certified enterprise ranging from bulk load optimized and distributed connectivity and integration updates to micro-batching for across resources with most business near real-time processing • Provides the best concurrent intelligence; extract, multi-workload performance transform, and load (ETL); and management products © Copyright 2010 EMC Corporation. All rights reserved. 25
26.
EMC Big Data
Analytics Reference Architecture Data Sources Hadoop Alerts Statistics Reduce Documents Genetic Algorithms Map- Map- Ecosystem* HDFS Reduce Dashboards Mobile Key Values Documents Other NoSql Machine Reports Data Mining Data Quality NoSQL Stores Multimedia parallel data exchange Spreadsheets SQL Stores Web/Social OLAP BU 1 Operations Research Data Marts LOB data MDM Mobile Enterprise Data BU 2 ERP Warehouse Neural Nets BU 3 ETL Data Visualization CRM Federated BI as a Data Service POS Warehouse Data Data Stores and Data Presentation & Integration Input Access Analysis Delivery Structured Traditional data Traditional data Big data analytics data sources Integration warehousing ramifications *Hadoop Ecosystem includes: Hive, Pig, Mahout, HBase, ZooKeeper, Oozie, Sqoop, Avro © Copyright 2010 EMC Corporation. All rights reserved. 26
27.
Architecture for Business
Value Business Value Chorus for Collaboration Analytics Analytics Self-develop app Self-develop app Java API Analytics tools Analytics tools JDBC (Mahout) (SAS, R, MADlib and more) ODBC Hbase .csv SAS & MADlib .txt GPDB - In GPDB - In Memory MapRFS (GPMR) ETL MapRFS: C++; MR: C++ x Load Performance: 2~5X DB’s Files High Availability Stable © Copyright 2010 EMC Corporation. All rights reserved. 27
28.
Big Data And
EMC 4 New Analytic Applications Data Science 3 2 Unified Analytics Platform Petabyte Scale Data Storage 1 © Copyright 2010 EMC Corporation. All rights reserved. 29
29.
SAS / Greenplum
Product Overview SAS High Performance Computing SAS Access for SAS In-Database SAS In-Memory Integration Processing Analytics Provides integration capability to Requires SAS Enterprise Miner in New functionality from SAS that a number of databases order to be of value requires dedicated database appliance Allows for increased performance Will lead to significant Very high performance for business of Base SAS Procs improvement in performance users that can significantly increase revenues or decrease costs as a result of improved performance Products: SAS Access for Greenpum Products: SAS Access for Products: SAS Access for Greenplum, SAS Grid Manager, SAS Greenplum, SAS Grid Manager, SAS Enterprise Miner, SAS Scoring High Performance Analytics Accelerator for Greenplum © Copyright 2010 EMC Corporation. All rights reserved. 30
30.
SAS and Greenplum
UAP Integrated Architecture Data Data Data Bl LOB Scientist Engineer Analyst Analyst User SAS Business Intelligence DATA SCIENCE TEAM Greenplum Chorus - Analytic Productivity Layer SAS Analytics Data Access & Query Layer (SAS ACCESS, SQL, MapReduce) Greenplum Database Greenplum Hadoop Private/Hybrid Cloud Infrastructure or Appliance Data Platform Admin SAS Information Management © Copyright 2010 EMC Corporation. All rights reserved. 31
31.
In A Single
Unified Analytics Platform Self-Service Iterative, Agile Transparent, Real-time Collaboration Structured & Unstructured Data Analyze Petabytes Of Current Data Virtual, Scale Out Architecture © Copyright 2010 EMC Corporation. All rights reserved. 32
32.
© Copyright 2010
EMC Corporation. All rights reserved. 33
Baixar agora