SlideShare uma empresa Scribd logo
1 de 27
Baixar para ler offline
© 2015 IBM Corporation
BigInsights on Cloud
Hadoop-as-a-Service
July 28th, 2015
© 2015 IBM Corporation2
Disclaimer
IBM’s statements regarding its plans, directions, and intent are subject to change or
withdrawal without notice at IBM’s sole discretion. Information regarding potential future
products is intended to outline our general product direction and it should not be relied on in
making a purchasing decision. The information mentioned regarding potential future products
is not a commitment, promise, or legal obligation to deliver any material, code or functionality.
Information about potential future products may not be incorporated into any contract. The
development, release, and timing of any future features or functionality described for our
products remains at our sole discretion.
© 2015 IBM Corporation3
Agenda
• Evolution of the Big Data Analytics space
• Open Data Platform and IBM’s BigInsights
• Hadoop as a Service – BigInsights on Cloud Options
• IBM Analytics for Hadoop – Free, 14-day trial
• BigInsights for Apache Hadoop – Bare Metal option for Production
• Demo
• Questions & Answers
• Resources
© 2015 IBM Corporation4
“At the World Economic
Forum last month in Davos,
Switzerland, Big Data was a
marquee topic. A report by the
forum, “Big Data, Big Impact,”
declared data a new class of
economic asset, like
currency or gold.
“Companies are being
inundated with data—from
information on customer-buying
habits to supply-chain efficiency.
But many managers struggle to
make sense of the numbers.”
“Increasingly, businesses are
applying analytics to social
media such as Facebook and
Twitter, as well as to product
review websites, to try to
“understand where customers are,
what makes them tick and what
they want”, says Deepak Advani,
who heads IBM’s predictive
analytics group.”
“Big Data has arrived at Seton
Health Care Family, fortunately
accompanied by an
analytics tool that will help
deal with the complexity of
more than two million
patient contacts a year…”
“Data is the new oil.”
Clive Humby
The Oscar Senti-meter — a tool
developed by the L.A. Times, IBM
and the USC Annenberg
Innovation Lab — analyzes
opinions about the Academy
Awards race shared in millions
of public messages on Twitter.”
Big Data continues to be a hot topic in the market
“…now Watson is being put to
work digesting millions of
pages of research,
incorporating the best clinical
practices and monitoring the
outcomes to assist physicians in
treating cancer patients.”
© 2015 IBM Corporation5
An automotive company is running a
series of experiments to better
understand and adapt to shifting
landscape of urban transportation by
streaming data from sensors on cars
using InfoSphere Streams to analyze it
on Hadoop using BigInsights on Cloud
Industrial manufacturer in the United
States reduces errors and the time
required for engine calibrations by 90
percent and improves reliability and new
product design by using sensors to collect
information on its products in the field and
analyzing it using InfoSphere BigInsights
Big Data implementations are driving real
business value for IBM customers
© 2015 IBM Corporation6
Rich capabilities in IBM’s Big Data Portfolio mean
lower risk and more successful projects
On premise, Cloud, and “as a Service”
BigInsights
© 2015 IBM Corporation7
Open Data Platform and IBM BigInsights
© 2015 IBM Corporation8
Open Data Platform Initiative
Why is IBM involved?
 Strong history of leadership in open source & standards
 Supports our commitment to open source currency in all
future releases
 Accelerates our innovation within Hadoop &
surrounding applications
Open Data Platform (ODP) vs. Apache Software
Foundation (ASF)
 ODP supports the ASF mission
 ASF provides a governance model around individual
projects without looking at ecosystem
 ODP aims to provide a vendor-led consistent packaging
model for core Apache components as an ecosystem
All Standard Apache Open Source Components
HDFS
YARN
MapReduce
Ambari HBase
Spark
Flume
Hive Pig
Sqoop
HCatalog
Solr/Lucene
ODP
© 2015 IBM Corporation9
SQL on Hadoop
Big SQL – optimized ANSI compliant SQL
Application Tooling
Toolkits and accelerators
Search & Entity Matching
Watson Explorer, Big Mach
Data Visualization
BigSheets spreadsheet interface
Predictive Modeling
Big R, Machine Learning
Text Analytics
Advanced text processing with AQL, Text
extraction web interface
Real-time Analytics
Streams
Data Governance and Security
DataClick, LDAP, Secure cluster
Storage Integration
GPFS - POSIX Distributed Filesystem
Enterprise Manageability
Adaptive MapReduce, Multi-tenant
scheduling
BigInsights for Apache Hadoop
IOP + IBM Value Adds = BigInsights
Knox
Ambari
Snappy
Open JDK
Avro
Solr
Oozie
Flume
Slider
Pig
Hadoop
HDFS/MapReduce/YARN*
Zookeeper
Parquet
HBase
IBM Open Platform (IOP)
Spark
Hive
Sqoop
ODP
© 2015 IBM Corporation10
BigInsights Users & Role-Based Modules
IBM Open Platform
BigInsights for
Apache Hadoop
© 2015 IBM Corporation11
BigInsights on Cloud
© 2015 IBM Corporation12
IBM Open Platform uses Ambari
© 2015 IBM Corporation13
BigInsights Home
© 2015 IBM Corporation14
IBM BigInsights – BigSheets
Spreadsheet style analysis tool for business users
Easily visualize big data using
rich built-in graphing and
analytic functions
© 2015 IBM Corporation15
Big SQL in BigInsights
Data Sources
Hive Tables HBase Tables
BigSQL Engine
BigInsights
Application
SQL Language
JDBC / ODBC Driver
JDBC / ODBC Server
Native Sources
CSV SEQ
Parquet RC
AVRO ORC
JSON Custom
 ANSI SQL 2011 Compliant
 IBM’s SQL for Hadoop
• Makes Hadoop data accessible
to a wider audience
• Familiar, widely known syntax
• Leverage native Hadoop
data sources
 Complements the Data
Warehouse
• Exploratory analytics
• Sandbox, Data Lake
 Included in BigInsights
 Use familiar SQL tools
• Cognos, SPSS, Tableau,
MicroStrategy
© 2015 IBM Corporation16
Example of text analytic tooling: Graphical
interface to describe structure of various
textual formats – from log file data to natural
language. Users do not need to now AQL
IBM BigInsights – Text Analytics
Information Extraction Framework for Text Analytics
© 2015 IBM Corporation17
R Clients
Embedded R Execution
R Packages
1
2
 Explore, visualize, transform, and
model big data using familiar R
syntax and paradigm
 Scale out R
 Partitioning of large data (“divide”)
 Parallel cluster execution of
pushed down R code (“conquer”)
 All of this from within the R
environment (Jaql, Map/Reduce
are hidden from you)
 Almost any R package can run in
this environment
Pull data
summaries to R
client
Or, push R
functions right
on the data
Data sources
R Packages
IBM BigInsights – Big R
End-to-end integration of R into BigInsights
© 2015 IBM Corporation18
 Prototype, create mash-ups in
the cloud for non-production use
 Empowers developers to rapidly
drive insight from all data
 Two-node Docker Instance
 Enterprise features – BigSheets,
Big SQL, Text, and Big R
 Delivered via IBM Bluemix
 50 GB – input data space
 Extendable, Free 14-day Trial
 For Production deployments at scale
in the cloud
 Delivers flexibility and efficiency
with BYOL and PAYG pricing
 Scale to meet spikes in demand
without on-premise infrastructure
 Perform enterprise-class, complex
analytics on Big Data Available via
the IBM Cloud Marketplace
 Web-based UI for Sizing/Pricing
IBM BigInsights – Cloud deployment options
Manage less, analyze more
IBM Analytics for Hadoop BigInsights for Apache Hadoop
© 2015 IBM Corporation19
IBM Analytics for Hadoop Details
 Free 14-day trial on www.bluemix.net
© 2015 IBM Corporation20
BigInsights for Apache Hadoop – Options
Secure, Dedicated Bare-metal
Infrastructure
IBM Open Platform
BigInsights for
Apache Hadoop
© 2015 IBM Corporation21
IBM BigInsights on Cloud – Security
 Dedicated, isolated environment for every client
 Administrative control owned by customer at Hadoop
and BigInsights level
 Native HDFS encryption; optional Guardium encryption
 Firewalls provide perimeter security and private network isolation
 Aiming for ISO 27K1 compliance in 2015
 Example Configuration…
Non-shared physical machines for added security & performance
© 2015 IBM Corporation22
BigInsights on Cloud
Demonstration
© 2015 IBM Corporation23
The IBM Difference
 IBM delivers the foundation for Big Data – now and in the future
 Embraces open source
 Establishes standards
 Integrates with familiar interfaces and established systems
 Delivers advanced analytic capabilities
 IBM is the only vendor providing…
 Hadoop as a Managed Service in the Cloud
 A single company providing Hadoop-base software, cloud and services
 Provides expertise to help you on your journey
 6,000 partners
 Analytics services and solution centers
© 2015 IBM Corporation24
IBM BigInsights on Cloud – unique capability
Built-in Twitter Decahose service
 Scaled down random sample of Twitter Firehose
 Easily land Twitter data into BigInsights HDFS
 Manipulate and visualize data using BigSheets
 Incorporate sentiment data into analytic models
 Easily store and accommodate vast data sets
© 2015 IBM Corporation25
Check out more data management services at www.bluemix.net
Cloudant dashDB
BigInsights on
Cloud
DB2 on Cloud
© 2015 IBM Corporation26
 Big Data University – Free Training
http://bigdatauniversity.com/
 Powered by Hadoop
http://wiki.apache.org/hadoop/PoweredBy
 Free Trial Software (both for on-premise and cloud)
http://www-01.ibm.com/software/data/infosphere/hadoop/trials.html
 YouTube Videos
 Watson
• The Science Behind the Answer (~7 minutes)
• Watson: Final Jeopardy (~11 minute summary)
 Big Data Channel
• http://www.youtube.com/user/ibmbigdata
Resources
© 2015 IBM Corporation27
Thank You
Merci
Grazie
Gracias Obrigado
Danke
Japanese
French
German
Italian
Spanish
Portuguese
Traditional Chinese
Simplified Chinese
Romanian
Multumesc
Turkish
Teşekkür ederim
English

Mais conteúdo relacionado

Mais procurados

Ibm big data ibm marriage of hadoop and data warehousing
Ibm big dataibm marriage of hadoop and data warehousingIbm big dataibm marriage of hadoop and data warehousing
Ibm big data ibm marriage of hadoop and data warehousing
DataWorks Summit
 
Hadoop in the cloud – The what, why and how from the experts
Hadoop in the cloud – The what, why and how from the expertsHadoop in the cloud – The what, why and how from the experts
Hadoop in the cloud – The what, why and how from the experts
DataWorks Summit
 
The convergence of reporting and interactive BI on Hadoop
The convergence of reporting and interactive BI on HadoopThe convergence of reporting and interactive BI on Hadoop
The convergence of reporting and interactive BI on Hadoop
DataWorks Summit
 

Mais procurados (20)

Designing Data Pipelines for Automous and Trusted Analytics
Designing Data Pipelines for Automous and Trusted AnalyticsDesigning Data Pipelines for Automous and Trusted Analytics
Designing Data Pipelines for Automous and Trusted Analytics
 
Introduction to Microsoft Azure HD Insight by Dattatrey Sindhol
Introduction to Microsoft Azure HD Insight by Dattatrey Sindhol Introduction to Microsoft Azure HD Insight by Dattatrey Sindhol
Introduction to Microsoft Azure HD Insight by Dattatrey Sindhol
 
Empowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine LearningEmpowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine Learning
 
Hadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - JaspersoftHadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - Jaspersoft
 
OpenPOWER Update
OpenPOWER UpdateOpenPOWER Update
OpenPOWER Update
 
Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes
 
C* Summit EU 2013: Leveraging the Power of Cassandra: Operational Reporting a...
C* Summit EU 2013: Leveraging the Power of Cassandra: Operational Reporting a...C* Summit EU 2013: Leveraging the Power of Cassandra: Operational Reporting a...
C* Summit EU 2013: Leveraging the Power of Cassandra: Operational Reporting a...
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
 
Securing your Big Data Environments in the Cloud
Securing your Big Data Environments in the CloudSecuring your Big Data Environments in the Cloud
Securing your Big Data Environments in the Cloud
 
Ibm big data ibm marriage of hadoop and data warehousing
Ibm big dataibm marriage of hadoop and data warehousingIbm big dataibm marriage of hadoop and data warehousing
Ibm big data ibm marriage of hadoop and data warehousing
 
Hadoop in the cloud – The what, why and how from the experts
Hadoop in the cloud – The what, why and how from the expertsHadoop in the cloud – The what, why and how from the experts
Hadoop in the cloud – The what, why and how from the experts
 
Ironfan: Your Foundation for Flexible Big Data Infrastructure
Ironfan: Your Foundation for Flexible Big Data InfrastructureIronfan: Your Foundation for Flexible Big Data Infrastructure
Ironfan: Your Foundation for Flexible Big Data Infrastructure
 
IlOUG Tech Days 2016 - Unlock the Value in your Data Reservoir using Oracle B...
IlOUG Tech Days 2016 - Unlock the Value in your Data Reservoir using Oracle B...IlOUG Tech Days 2016 - Unlock the Value in your Data Reservoir using Oracle B...
IlOUG Tech Days 2016 - Unlock the Value in your Data Reservoir using Oracle B...
 
Machine Learning for z/OS
Machine Learning for z/OSMachine Learning for z/OS
Machine Learning for z/OS
 
The convergence of reporting and interactive BI on Hadoop
The convergence of reporting and interactive BI on HadoopThe convergence of reporting and interactive BI on Hadoop
The convergence of reporting and interactive BI on Hadoop
 
Breakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data StoreBreakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data Store
 
Red Hat Openshift on Microsoft Azure
Red Hat Openshift on Microsoft AzureRed Hat Openshift on Microsoft Azure
Red Hat Openshift on Microsoft Azure
 
Big Data: Myths and Realities
Big Data: Myths and RealitiesBig Data: Myths and Realities
Big Data: Myths and Realities
 
451 Research Impact Report
451 Research Impact Report451 Research Impact Report
451 Research Impact Report
 
A Mayo Clinic Big Data Implementation
A Mayo Clinic Big Data ImplementationA Mayo Clinic Big Data Implementation
A Mayo Clinic Big Data Implementation
 

Semelhante a Get Started Quickly with IBM's Hadoop as a Service

IBM Smarter Analytics
IBM Smarter AnalyticsIBM Smarter Analytics
IBM Smarter Analytics
Adrian Turcu
 
IBM CDS Overview
IBM CDS OverviewIBM CDS Overview
IBM CDS Overview
Jean Tan
 
ds_Pivotal_Big_Data_Suite_Product_Suite
ds_Pivotal_Big_Data_Suite_Product_Suiteds_Pivotal_Big_Data_Suite_Product_Suite
ds_Pivotal_Big_Data_Suite_Product_Suite
Robin Fong 方俊强
 
Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data
Pactera_US
 
Future of Power: Power Strategy and Offerings for Denmark - Steve Sibley
Future of Power: Power Strategy and Offerings for Denmark - Steve SibleyFuture of Power: Power Strategy and Offerings for Denmark - Steve Sibley
Future of Power: Power Strategy and Offerings for Denmark - Steve Sibley
IBM Danmark
 

Semelhante a Get Started Quickly with IBM's Hadoop as a Service (20)

IBM Smarter Analytics
IBM Smarter AnalyticsIBM Smarter Analytics
IBM Smarter Analytics
 
Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise
 
The sensor data challenge - Innovations (not only) for the Internet of Things
The sensor data challenge - Innovations (not only) for the Internet of ThingsThe sensor data challenge - Innovations (not only) for the Internet of Things
The sensor data challenge - Innovations (not only) for the Internet of Things
 
IBM CDS Overview
IBM CDS OverviewIBM CDS Overview
IBM CDS Overview
 
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
 
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with Bluemix
 
Modern Thinking área digital MSKM 21/09/2017
Modern Thinking área digital MSKM 21/09/2017Modern Thinking área digital MSKM 21/09/2017
Modern Thinking área digital MSKM 21/09/2017
 
SQL + Hadoop: The High Performance Advantage�
SQL + Hadoop:  The High Performance Advantage�SQL + Hadoop:  The High Performance Advantage�
SQL + Hadoop: The High Performance Advantage�
 
ds_Pivotal_Big_Data_Suite_Product_Suite
ds_Pivotal_Big_Data_Suite_Product_Suiteds_Pivotal_Big_Data_Suite_Product_Suite
ds_Pivotal_Big_Data_Suite_Product_Suite
 
Libera la potenza del Machine Learning
Libera la potenza del Machine LearningLibera la potenza del Machine Learning
Libera la potenza del Machine Learning
 
Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data
 
Hadoop in the Cloud
Hadoop in the CloudHadoop in the Cloud
Hadoop in the Cloud
 
Why Infrastructure matters?!
Why Infrastructure matters?!Why Infrastructure matters?!
Why Infrastructure matters?!
 
Big Data on Public Cloud
Big Data on Public CloudBig Data on Public Cloud
Big Data on Public Cloud
 
The Big Picture on Big Data and Cognos
The Big Picture on Big Data and CognosThe Big Picture on Big Data and Cognos
The Big Picture on Big Data and Cognos
 
Cloud what is the best model for vietnam
Cloud   what is the best model for vietnamCloud   what is the best model for vietnam
Cloud what is the best model for vietnam
 
Big Data Companies and Apache Software
Big Data Companies and Apache SoftwareBig Data Companies and Apache Software
Big Data Companies and Apache Software
 
Future of Power: Power Strategy and Offerings for Denmark - Steve Sibley
Future of Power: Power Strategy and Offerings for Denmark - Steve SibleyFuture of Power: Power Strategy and Offerings for Denmark - Steve Sibley
Future of Power: Power Strategy and Offerings for Denmark - Steve Sibley
 
Accelerating Innovation with Hybrid Cloud
Accelerating Innovation with Hybrid CloudAccelerating Innovation with Hybrid Cloud
Accelerating Innovation with Hybrid Cloud
 
Better Total Value of Ownership (TVO) for Complex Analytic Workflows with the...
Better Total Value of Ownership (TVO) for Complex Analytic Workflows with the...Better Total Value of Ownership (TVO) for Complex Analytic Workflows with the...
Better Total Value of Ownership (TVO) for Complex Analytic Workflows with the...
 

Mais de IBM Cloud Data Services

Machine Learning with Apache Spark
Machine Learning with Apache SparkMachine Learning with Apache Spark
Machine Learning with Apache Spark
IBM Cloud Data Services
 

Mais de IBM Cloud Data Services (20)

CouchDB Day NYC 2017: Full Text Search
CouchDB Day NYC 2017: Full Text SearchCouchDB Day NYC 2017: Full Text Search
CouchDB Day NYC 2017: Full Text Search
 
CouchDB Day NYC 2017: Using Geospatial Data in Cloudant & CouchDB
CouchDB Day NYC 2017: Using Geospatial Data in Cloudant & CouchDBCouchDB Day NYC 2017: Using Geospatial Data in Cloudant & CouchDB
CouchDB Day NYC 2017: Using Geospatial Data in Cloudant & CouchDB
 
CouchDB Day NYC 2017: MapReduce Views
CouchDB Day NYC 2017: MapReduce ViewsCouchDB Day NYC 2017: MapReduce Views
CouchDB Day NYC 2017: MapReduce Views
 
CouchDB Day NYC 2017: Replication
CouchDB Day NYC 2017: ReplicationCouchDB Day NYC 2017: Replication
CouchDB Day NYC 2017: Replication
 
CouchDB Day NYC 2017: Mango
CouchDB Day NYC 2017: MangoCouchDB Day NYC 2017: Mango
CouchDB Day NYC 2017: Mango
 
CouchDB Day NYC 2017: JSON Documents
CouchDB Day NYC 2017: JSON DocumentsCouchDB Day NYC 2017: JSON Documents
CouchDB Day NYC 2017: JSON Documents
 
CouchDB Day NYC 2017: Core HTTP API
CouchDB Day NYC 2017: Core HTTP APICouchDB Day NYC 2017: Core HTTP API
CouchDB Day NYC 2017: Core HTTP API
 
CouchDB Day NYC 2017: Introduction to CouchDB 2.0
CouchDB Day NYC 2017: Introduction to CouchDB 2.0CouchDB Day NYC 2017: Introduction to CouchDB 2.0
CouchDB Day NYC 2017: Introduction to CouchDB 2.0
 
Practical Use of a NoSQL
Practical Use of a NoSQLPractical Use of a NoSQL
Practical Use of a NoSQL
 
I See NoSQL Document Stores in Geospatial Applications
I See NoSQL Document Stores in Geospatial ApplicationsI See NoSQL Document Stores in Geospatial Applications
I See NoSQL Document Stores in Geospatial Applications
 
Webinar: The Anatomy of the Cloudant Data Layer
Webinar: The Anatomy of the Cloudant Data LayerWebinar: The Anatomy of the Cloudant Data Layer
Webinar: The Anatomy of the Cloudant Data Layer
 
NoSQL for SQL Users
NoSQL for SQL UsersNoSQL for SQL Users
NoSQL for SQL Users
 
dashDB: the GIS professional’s bridge to mainstream IT systems
dashDB: the GIS professional’s bridge to mainstream IT systemsdashDB: the GIS professional’s bridge to mainstream IT systems
dashDB: the GIS professional’s bridge to mainstream IT systems
 
Cloud Data Services: A Brand New Ballgame for Business
Cloud Data Services: A  Brand New Ballgame for BusinessCloud Data Services: A  Brand New Ballgame for Business
Cloud Data Services: A Brand New Ballgame for Business
 
Practical Use of a NoSQL Database
Practical Use of a NoSQL DatabasePractical Use of a NoSQL Database
Practical Use of a NoSQL Database
 
SQL To NoSQL - Top 6 Questions Before Making The Move
SQL To NoSQL - Top 6 Questions Before Making The MoveSQL To NoSQL - Top 6 Questions Before Making The Move
SQL To NoSQL - Top 6 Questions Before Making The Move
 
Machine Learning with Apache Spark
Machine Learning with Apache SparkMachine Learning with Apache Spark
Machine Learning with Apache Spark
 
Mobile App Development With IBM Cloudant
Mobile App Development With IBM CloudantMobile App Development With IBM Cloudant
Mobile App Development With IBM Cloudant
 
IBM Cognos Business Intelligence using dashDB
IBM Cognos Business Intelligence using dashDBIBM Cognos Business Intelligence using dashDB
IBM Cognos Business Intelligence using dashDB
 
Run Oracle Apps in the Cloud with dashDB
Run Oracle Apps in the Cloud with dashDBRun Oracle Apps in the Cloud with dashDB
Run Oracle Apps in the Cloud with dashDB
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Último (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 

Get Started Quickly with IBM's Hadoop as a Service

  • 1. © 2015 IBM Corporation BigInsights on Cloud Hadoop-as-a-Service July 28th, 2015
  • 2. © 2015 IBM Corporation2 Disclaimer IBM’s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice at IBM’s sole discretion. Information regarding potential future products is intended to outline our general product direction and it should not be relied on in making a purchasing decision. The information mentioned regarding potential future products is not a commitment, promise, or legal obligation to deliver any material, code or functionality. Information about potential future products may not be incorporated into any contract. The development, release, and timing of any future features or functionality described for our products remains at our sole discretion.
  • 3. © 2015 IBM Corporation3 Agenda • Evolution of the Big Data Analytics space • Open Data Platform and IBM’s BigInsights • Hadoop as a Service – BigInsights on Cloud Options • IBM Analytics for Hadoop – Free, 14-day trial • BigInsights for Apache Hadoop – Bare Metal option for Production • Demo • Questions & Answers • Resources
  • 4. © 2015 IBM Corporation4 “At the World Economic Forum last month in Davos, Switzerland, Big Data was a marquee topic. A report by the forum, “Big Data, Big Impact,” declared data a new class of economic asset, like currency or gold. “Companies are being inundated with data—from information on customer-buying habits to supply-chain efficiency. But many managers struggle to make sense of the numbers.” “Increasingly, businesses are applying analytics to social media such as Facebook and Twitter, as well as to product review websites, to try to “understand where customers are, what makes them tick and what they want”, says Deepak Advani, who heads IBM’s predictive analytics group.” “Big Data has arrived at Seton Health Care Family, fortunately accompanied by an analytics tool that will help deal with the complexity of more than two million patient contacts a year…” “Data is the new oil.” Clive Humby The Oscar Senti-meter — a tool developed by the L.A. Times, IBM and the USC Annenberg Innovation Lab — analyzes opinions about the Academy Awards race shared in millions of public messages on Twitter.” Big Data continues to be a hot topic in the market “…now Watson is being put to work digesting millions of pages of research, incorporating the best clinical practices and monitoring the outcomes to assist physicians in treating cancer patients.”
  • 5. © 2015 IBM Corporation5 An automotive company is running a series of experiments to better understand and adapt to shifting landscape of urban transportation by streaming data from sensors on cars using InfoSphere Streams to analyze it on Hadoop using BigInsights on Cloud Industrial manufacturer in the United States reduces errors and the time required for engine calibrations by 90 percent and improves reliability and new product design by using sensors to collect information on its products in the field and analyzing it using InfoSphere BigInsights Big Data implementations are driving real business value for IBM customers
  • 6. © 2015 IBM Corporation6 Rich capabilities in IBM’s Big Data Portfolio mean lower risk and more successful projects On premise, Cloud, and “as a Service” BigInsights
  • 7. © 2015 IBM Corporation7 Open Data Platform and IBM BigInsights
  • 8. © 2015 IBM Corporation8 Open Data Platform Initiative Why is IBM involved?  Strong history of leadership in open source & standards  Supports our commitment to open source currency in all future releases  Accelerates our innovation within Hadoop & surrounding applications Open Data Platform (ODP) vs. Apache Software Foundation (ASF)  ODP supports the ASF mission  ASF provides a governance model around individual projects without looking at ecosystem  ODP aims to provide a vendor-led consistent packaging model for core Apache components as an ecosystem All Standard Apache Open Source Components HDFS YARN MapReduce Ambari HBase Spark Flume Hive Pig Sqoop HCatalog Solr/Lucene ODP
  • 9. © 2015 IBM Corporation9 SQL on Hadoop Big SQL – optimized ANSI compliant SQL Application Tooling Toolkits and accelerators Search & Entity Matching Watson Explorer, Big Mach Data Visualization BigSheets spreadsheet interface Predictive Modeling Big R, Machine Learning Text Analytics Advanced text processing with AQL, Text extraction web interface Real-time Analytics Streams Data Governance and Security DataClick, LDAP, Secure cluster Storage Integration GPFS - POSIX Distributed Filesystem Enterprise Manageability Adaptive MapReduce, Multi-tenant scheduling BigInsights for Apache Hadoop IOP + IBM Value Adds = BigInsights Knox Ambari Snappy Open JDK Avro Solr Oozie Flume Slider Pig Hadoop HDFS/MapReduce/YARN* Zookeeper Parquet HBase IBM Open Platform (IOP) Spark Hive Sqoop ODP
  • 10. © 2015 IBM Corporation10 BigInsights Users & Role-Based Modules IBM Open Platform BigInsights for Apache Hadoop
  • 11. © 2015 IBM Corporation11 BigInsights on Cloud
  • 12. © 2015 IBM Corporation12 IBM Open Platform uses Ambari
  • 13. © 2015 IBM Corporation13 BigInsights Home
  • 14. © 2015 IBM Corporation14 IBM BigInsights – BigSheets Spreadsheet style analysis tool for business users Easily visualize big data using rich built-in graphing and analytic functions
  • 15. © 2015 IBM Corporation15 Big SQL in BigInsights Data Sources Hive Tables HBase Tables BigSQL Engine BigInsights Application SQL Language JDBC / ODBC Driver JDBC / ODBC Server Native Sources CSV SEQ Parquet RC AVRO ORC JSON Custom  ANSI SQL 2011 Compliant  IBM’s SQL for Hadoop • Makes Hadoop data accessible to a wider audience • Familiar, widely known syntax • Leverage native Hadoop data sources  Complements the Data Warehouse • Exploratory analytics • Sandbox, Data Lake  Included in BigInsights  Use familiar SQL tools • Cognos, SPSS, Tableau, MicroStrategy
  • 16. © 2015 IBM Corporation16 Example of text analytic tooling: Graphical interface to describe structure of various textual formats – from log file data to natural language. Users do not need to now AQL IBM BigInsights – Text Analytics Information Extraction Framework for Text Analytics
  • 17. © 2015 IBM Corporation17 R Clients Embedded R Execution R Packages 1 2  Explore, visualize, transform, and model big data using familiar R syntax and paradigm  Scale out R  Partitioning of large data (“divide”)  Parallel cluster execution of pushed down R code (“conquer”)  All of this from within the R environment (Jaql, Map/Reduce are hidden from you)  Almost any R package can run in this environment Pull data summaries to R client Or, push R functions right on the data Data sources R Packages IBM BigInsights – Big R End-to-end integration of R into BigInsights
  • 18. © 2015 IBM Corporation18  Prototype, create mash-ups in the cloud for non-production use  Empowers developers to rapidly drive insight from all data  Two-node Docker Instance  Enterprise features – BigSheets, Big SQL, Text, and Big R  Delivered via IBM Bluemix  50 GB – input data space  Extendable, Free 14-day Trial  For Production deployments at scale in the cloud  Delivers flexibility and efficiency with BYOL and PAYG pricing  Scale to meet spikes in demand without on-premise infrastructure  Perform enterprise-class, complex analytics on Big Data Available via the IBM Cloud Marketplace  Web-based UI for Sizing/Pricing IBM BigInsights – Cloud deployment options Manage less, analyze more IBM Analytics for Hadoop BigInsights for Apache Hadoop
  • 19. © 2015 IBM Corporation19 IBM Analytics for Hadoop Details  Free 14-day trial on www.bluemix.net
  • 20. © 2015 IBM Corporation20 BigInsights for Apache Hadoop – Options Secure, Dedicated Bare-metal Infrastructure IBM Open Platform BigInsights for Apache Hadoop
  • 21. © 2015 IBM Corporation21 IBM BigInsights on Cloud – Security  Dedicated, isolated environment for every client  Administrative control owned by customer at Hadoop and BigInsights level  Native HDFS encryption; optional Guardium encryption  Firewalls provide perimeter security and private network isolation  Aiming for ISO 27K1 compliance in 2015  Example Configuration… Non-shared physical machines for added security & performance
  • 22. © 2015 IBM Corporation22 BigInsights on Cloud Demonstration
  • 23. © 2015 IBM Corporation23 The IBM Difference  IBM delivers the foundation for Big Data – now and in the future  Embraces open source  Establishes standards  Integrates with familiar interfaces and established systems  Delivers advanced analytic capabilities  IBM is the only vendor providing…  Hadoop as a Managed Service in the Cloud  A single company providing Hadoop-base software, cloud and services  Provides expertise to help you on your journey  6,000 partners  Analytics services and solution centers
  • 24. © 2015 IBM Corporation24 IBM BigInsights on Cloud – unique capability Built-in Twitter Decahose service  Scaled down random sample of Twitter Firehose  Easily land Twitter data into BigInsights HDFS  Manipulate and visualize data using BigSheets  Incorporate sentiment data into analytic models  Easily store and accommodate vast data sets
  • 25. © 2015 IBM Corporation25 Check out more data management services at www.bluemix.net Cloudant dashDB BigInsights on Cloud DB2 on Cloud
  • 26. © 2015 IBM Corporation26  Big Data University – Free Training http://bigdatauniversity.com/  Powered by Hadoop http://wiki.apache.org/hadoop/PoweredBy  Free Trial Software (both for on-premise and cloud) http://www-01.ibm.com/software/data/infosphere/hadoop/trials.html  YouTube Videos  Watson • The Science Behind the Answer (~7 minutes) • Watson: Final Jeopardy (~11 minute summary)  Big Data Channel • http://www.youtube.com/user/ibmbigdata Resources
  • 27. © 2015 IBM Corporation27 Thank You Merci Grazie Gracias Obrigado Danke Japanese French German Italian Spanish Portuguese Traditional Chinese Simplified Chinese Romanian Multumesc Turkish Teşekkür ederim English