SlideShare uma empresa Scribd logo
1 de 66
© 2016 MapR Technologies© 2016 MapR Technologies
MapR 5.2: Getting More Value from the MapR
Converged Community Edition
Sep 14, 2016
© 2016 MapR Technologies
Today’s Presenters
Deborah Littlefield
Technical Curriculum Developer
Ankur Desai
Sr. Manager, Platform and Products
© 2016 MapR Technologies 3
Today’s Agenda
• Recent updates to the MapR Converged Data Platform
• Latest Ecosystem Support in MapR 5.2
• How to upgrade to the latest version of the Community Edition
• Q&A
© 2016 MapR Technologies 4
The MapR Converged Data Platform
© 2016 MapR Technologies 5
4 Major Additions to the MapR Platform in the past
12 months
• Taking cluster monitoring to the next level with the Spyglass
Initiative
• Real-time streaming with MapR Streams
• MapR-DB JSON document database and application
development with OJAI
• Securing your data with access control expressions (ACEs)
© 2016 MapR Technologies 6© 2016 MapR Technologies
Project Spyglass
© 2016 MapR Technologies 7
MapR Vision: Maximizing User/Operator Productivity
Deep
Visibility
Another
sample
Easy
Management
Full
Control
© 2016 MapR Technologies 8
The MapR Spyglass Initiative
• New approach for increasing user and administrator productivity
– Comprehensive, open, extensible
• Simplifies the management of growing big data deployments
• Starts with 5.2 release
– Phase 1 – MapR Monitoring
– Initial focus on operational visibility
• Helps community innovate faster
– Extensive use of open source visualization and dashboarding tools
© 2016 MapR Technologies 9
Spyglass Initiative Phase 1 - MapR Monitoring
Empower administrators with cluster
monitoring capabilities, including
metric and log collection from nodes,
services, and jobs, with dashboards to
display information in a useful way.
Converged
Customizable
Extensible
© 2016 MapR Technologies 10
Collection VisualizationAggregation &
Storage
MapR Monitoring Architecture
Future
Data Sources
Log Shippers
Metrics
Collectors
Alerting
Node
Environmentals
(CPU, Mem, I/O)
Service
Daemons
(YARN, Drill,
Hive, etc.)
MapR Control System
…
© 2014 MapR Technologies 11
Project Spyglass – Monitoring All You Care About
Node/Infrastructure Monitoring
• Global Aggregates (Average, Min, Max)
Charts (e.g. CPU, Disk utilization)
• Per-node charts (e.g. I/O Throughput
by disk)
• MFS read/writes and throughput
• DB puts, gets, scans and cache metrics
© 2014 MapR Technologies 12
Project Spyglass – Monitoring All You Care About
Node/Infrastructure Monitoring
• Global Aggregates (Average, Min, Max)
Charts (e.g. CPU, Disk utilization)
• Per-node charts (e.g. I/O Throughput
by disk)
• MFS read/writes and throughput
• DB puts, gets, scans and cache metrics
Cluster Space Utilization Monitoring
• Cluster wide storage utilization
• Storage Utilization Trend
• Utilization per volume and per accountable
entity (data, volume, snapshot and total size)
© 2014 MapR Technologies 13
Project Spyglass – Monitoring All You Care About
Node/Infrastructure Monitoring
• Global Aggregates (Average, Min, Max)
Charts (e.g. CPU, Disk utilization)
• Per-node charts (e.g. I/O Throughput
by disk)
• MFS read/writes and throughput
• DB puts, gets, scans and cache metrics
Cluster Space Utilization Monitoring
• Cluster wide storage utilization
• Storage Utilization Trend
• Utilization per volume and per accountable
entity (data, volume, snapshot and total size)
YARN/MR Application Monitoring
• Global YARN trend graphs
• Containers - Pending, Active
• vCores & RAM - Allocated & Used
• Per Queue charts - containers, vCores, RAM
© 2014 MapR Technologies 14
Project Spyglass – Monitoring All You Care About
Node/Infrastructure Monitoring
• Global Aggregates (Average, Min, Max)
Charts (e.g. CPU, Disk utilization)
• Per-node charts (e.g. I/O Throughput
by disk)
• MFS read/writes and throughput
• DB puts, gets, scans and cache metrics
Cluster Space Utilization Monitoring
• Cluster wide storage utilization
• Storage Utilization Trend
• Utilization per volume and per accountable
entity (data, volume, snapshot and total size)
YARN/MR Application Monitoring
• Global YARN trend graphs
• Containers - Pending, Active
• vCores & RAM - Allocated & Used
• Per Queue charts - containers, vCores, RAM
Service Daemon Monitoring
• Per-service charts with for (CPU Usage by
type, Memory)
• Centralized, searchable logs
• MapR core and ecosystem services
(includes YARN, Drill and Spark)
© 2016 MapR Technologies 15
Customizable
Dashboards
for Visualizing Metrics
Log
Analytics
© 2016 MapR Technologies 16
Destination to Learn and Collaborate
Blog about topics and ideas
Share code snippets and dashboards
View demos, tutorials, and videos
Engage in use case discussion/development
© 2016 MapR Technologies 17
Dashboards are defined with JSON
and easy to export and import in
Grafana and Kibana
Extend/Integrate using REST API
The Exchange
© 2016 MapR Technologies 18
Dashboards
can be viewed
on mobile
devices.
© 2016 MapR Technologies 19
Summary
● Data collection and storage infrastructure (packaged
and supported)
○ Collection/storage of metrics & logs across node, storage,
services
● Visualization dashboard (Driven via community)
○ Sample dashboards for Grafana & Kibana
5.2 - Spyglass 1.0 GA
CUSTOMIZABLE, shareable and mobile-ready dashboards
CONVERGED monitoring with deep search
EXTENSIBLE and easy to integrate with REST API
© 2016 MapR Technologies 20© 2016 MapR Technologies
MapR Streams
© 2016 MapR Technologies 21
MapR Streams: Enabling Continuous Data Processing
To enable continuous,
globally scalable streaming of
event data, allowing developers to
create real-time applications
that their business can depend on.
Converged
Continuous
Global
© 2016 MapR Technologies 22
MapR Streams:
Publish-subscribe Event Streaming System for Big Data
Producers publish billions of
messages/sec to a topic in a stream.
Guaranteed, immediate delivery
to all consumers.
Standard real-time API (Kafka).
Integrates with Spark Streaming,
Storm, Apex, and Flink
Direct data access (OJAI API) from
analytics frameworks.
To
pi
c
Stream
Producers
Remote sites and consumers
Batch analytics
Topic
Replication
Consumers
Consumers
Available in the Enterprise Edition Only
© 2016 MapR Technologies 23
MapR Streams: Building Faster and Simpler Apps
Simpler and
Faster
Architecture
• Converged platform with file storage and database
reduces data movement, data latency, hardware
cost, and administration cost
• Event streaming and stream processing in the same
cluster enables faster processing
• Unified security framework with files and database
tables reduces administration cost around setting
up and enforcing security policies
• Multi-tenant - topic isolation, quotas, data
placement control allows multiple isolated streaming
applications to run on the same cluster reducing
hardware cost and data movement
© 2016 MapR Technologies 24
Scalable.
• Ingest more events to enable faster insights
• Hold on to events longer to enable deeper insights
• Develop app once and apply to short & long-term
data (i.e. run analysis on 15-days data AND 1-year
data using same application)
MapR Streams: Building Faster and Simpler Apps
© 2016 MapR Technologies 25© 2016 MapR Technologies
MapR-DB JSON document database
and application development with OJAI
© 2016 MapR Technologies 26
Open Source OJAI API for JSON-Based Applications
Open JSON Application Interface (OJAI)
Databases Streams
MapR-Client
File Systems
{JSON}
MapR-Client
© 2016 MapR Technologies 27
Familiar JSON Paradigm – Similar API Constructs
MapR-DB
Document record = Json.newDocument()
.set("firstName", "John")
.set("lastName", "Doe")
.set("age", 50);
table.insert("jdoe", record);
MongoDB
BasicDBObject doc = new BasicDBObject
("firstName", "John")
.append("lastName", "Doe")
.append("age", 50);
coll.insert(doc);
© 2016 MapR Technologies 28
JSON: Easy Variation with Documents
{
"_id" : "rp-prod132546",
"name" : "Marvel T2 Athena”,
"brand" : "Pinarello",
"category" : "bike",
"type" : "Road Bike”,
"price" : 2949.99,
"size" : "55cm",
"wheel_size" : "700c",
"frameset" : {
"frame" : "Carbon Toryaca",
"fork" : "Onda 2V C"
},
"groupset" : {
"chainset" : "Camp. Athena 50/34",
"brake" : "Camp."
},
"wheelset" : {
"wheels" : "Camp. Zonda",
"tyres" : "Vittoria Pro"
}
}
{
"_id" : "rp-prod106702",
"name" : " Ultegra SPD-SL 6800”,
"brand" : "Shimano",
"category" : "pedals",
"type" : "Components,
"price" : 112.99,
"features" : [
"Low profile design increases ...",
"Supplied with floating SH11 cleats",
"Weight: 260g (pair)"
]
}
{
"_id" : "rp-prod113104",
"name" : "Bianchi Pride Jersey SS15”,
"brand" : "Nalini",
"category" : "Jersey",
"type" : "Clothing,
"price" : 76.99,
"features" : [
"100% Polyester",
"3/4 hidden zip",
"3 rear pocket"
],
"color" : "black"
}
jerseypedalbike
© 2016 MapR Technologies 29
Product Catalog - RDBMS
To get a single product“Entity Value Attribute” pattern
SELECT * FROM (
SELECT
ce.sku,
ea.attribute_id,
ea.attribute_code,
CASE ea.backend_type
WHEN 'varchar' THEN ce_varchar.value
WHEN 'int' THEN ce_int.value
WHEN 'text' THEN ce_text.value
WHEN 'decimal' THEN ce_decimal.value
WHEN 'datetime' THEN ce_datetime.value
ELSE ea.backend_type
END AS value,
ea.is_required AS required
FROM catalog_product_entity AS ce
LEFT JOIN eav_attribute AS ea
ON ce.entity_type_id = ea.entity_type_id
LEFT JOIN catalog_product_entity_varchar AS ce_varchar
ON ce.entity_id = ce_varchar.entity_id
AND ea.attribute_id = ce_varchar.attribute_id
AND ea.backend_type = 'varchar'
LEFT JOIN catalog_product_entity_text AS ce_text
ON ce.entity_id = ce_text.entity_id
AND ea.attribute_id = ce_text.attribute_id
AND ea.backend_type = 'text'
LEFT JOIN catalog_product_entity_decimal AS ce_decimal
ON ce.entity_id = ce_decimal.entity_id
AND ea.attribute_id = ce_decimal.attribute_id
AND ea.backend_type = 'decimal'
LEFT JOIN catalog_product_entity_datetime AS ce_datetime
ON ce.entity_id = ce_datetime.entity_id
AND ea.attribute_id = ce_datetime.attribute_id
AND ea.backend_type = 'datetime'
WHERE ce.sku = ‘rp-prod132546’
) AS tab
WHERE tab.value != ’’;
© 2016 MapR Technologies 30
Store the product “as a business object” To get a single product
{
"_id" : "rp-prod132546",
"name" : "Marvel T2 Athena”,
"brand" : "Pinarello",
"category" : "bike",
"type" : "Road Bike”,
"price" : 2949.99,
"size" : "55cm",
"wheel_size" : "700c",
"frameset" : {
"frame" : "Carbon Toryaca",
"fork" : "Onda 2V C"
},
"groupset" : {
"chainset" : "Camp. Athena 50/34",
"brake" : "Camp."
},
"wheelset" : {
"wheels" : "Camp. Zonda",
"tyres" : "Vittoria Pro"
}
}
products
.findById(“rp-prod132546”)
Product Catalog - NoSQL/Document
© 2016 MapR Technologies 31
Native JSON Support in MapR-DB
{
order_num: 5555,
products: [
{ product_id: 348752,
quantity: 1,
unit_price: 149.99,
total_price: 149.99
},
{ product_id: 439322,
quantity: 1,
unit_price: 99.99,
total_price: 99.99
},
{ product_id: 953923,
quantity: 1,
unit_price: 49.99,
total_price: 49.99
},
]
}
Reads/writes at element level
• Granular disk reads/writes
• Less network traffic
• Higher concurrency
Any new elements added on demand
• No predefined schemas
• Easy to store evolving data
Not all NoSQL databases treat JSON as a native data type.
© 2016 MapR Technologies 32
Leverage the Column Family Construct (Optional)
/
{a:
{a1:
{b1: "v1",
b2: [
{c1: "v1",
c2: "v2"}
]
},
a2:
{
e1: "v1",
e2: <inline jpg>
}
}
}
Column Family 1
Column Family 2
Control layout for faster data access
Different TTL requirements
Separate Table Replication settings
Specific data placement policies
Efficient ACEs
© 2016 MapR Technologies 33
Fine Grained Security for JSON Documents
{
“fname”: “John”,
“lname”: “Doe”,
“address”: “111 Main St.”,
“city”: “San Jose”,
“state”: “CA”,
“zip”: “95134”,
“credit_cards”: [
{“issuer”: “Visa”,
“number”: “4444555566667777”},
{“issuer”: “MasterCard”,
“number”: “5555666677778888”}
]
}
Entire document
Element: “fname”
Array: “credit_cards”
Sub-element in array element:
“credit_cards[*].number”
Specify different permissions levels within the document.
© 2016 MapR Technologies 34
Comprehensive Data Type Support for MapR-DB
• NULL
• Boolean
• String
• Map
• Array
• Float, Double
• Binary
• Byte, Short, Int, Long
• Date
• Decimal
• Interval
• Time
• Timestamp
Examples:
{
“sample_int”: {"$numberLong”: 2147483647},
“sample_date”: {“$dateDay”: “2016-02-22”},
“sample_decimal”:{“$decimal”: “1234567890.23456789”},
“sample_time”: {“$time”: “10:26:12.487”},
“sample_timestamp”: {“$date”: “2016-02-22T10:26:12.487+Z”}
}
© 2016 MapR Technologies 35© 2016 MapR Technologies
Data Security with Access Control
Expressions
© 2016 MapR Technologies 36
File ACEs – Key Features
Intuitive
Inheritance
Subdirectories
and files inherit
perms from parent
directory
Whole-Volume
ACEs
Volume-level filter –
useful in multitenant
environments.
Roles
Arbitrary grouping
of users according
to your business
needs
High Performance
No performance hit
Boolean Operators
Allowing for
ultra fine-grain
permissions
AUTHORIZATION
© 2016 MapR Technologies 37
File ACEs: Whole Volume ACE Example
Whole-Volume ACE
r: group:finance
Jane grants read access to Bob.
File: /finance/final_report.csv
r: user:bob
Bob cannot read the file
/finance/final_report.csv because
the whole-volume ACE is set to
allow read-access to finance only.
Jane
(Finance)
Bob
(Developer)
Whole-Volume ACE
AUTHORIZATION
© 2016 MapR Technologies 38
POSIX ACLs vs ACEs
r : user:sally |
(group:dev_team & group:managers)
Access Control Lists
MapR Access Control Expressions
AUTHORIZATION
Which one is easier to set and understand?
Which one allows for higher granularity?
© 2016 MapR Technologies 39
MapR Has ACEs for Files and MapR-DB Records
Example: user:mary | (group:admins & group:VP) & user:!bob
Permissions on files, tables, column families, columns, JSON documents and sub-documents
Use Access Control Expressions (ACEs) to set granular permissions.
AUTHORIZATION
© 2016 MapR Technologies 40© 2016 MapR Technologies
Ecosystem Updates
© 2016 MapR Technologies 41
5.2 Ecosystem Support
These are the only component version changes in MEP 1.0 from 5.2 release date
and all of these have been out for 5.1 already.
Eco on 5.1 today MEP 1.0 on 5.2
Component Released with 5.1
Subsequently released for
5.1
Drill 1.4 1.6 1.6
Spark 1.5.2 1.6.1 1.6.1 (2.0 in dev
preview)
Impala 2.2.0 2.5 2.5
Storm 0.10.0 0.10.1 0.10.1
Mahout 0.11.2 0.12.2 0.12.2
© 2016 MapR Technologies 42
Converging SQL and JSON with Apache Drill 1.6
• Flexible and operational analytics on NoSQL
– MapR-DB plugin allows analysts to perform SQL queries directly on JSON data in MapR-DB tables
– Pushdown capabilities provide optimal interactive experience
• Enhanced query performance
– Provides better query performance via partition pruning, metadata caching and other optimizations
– Delivers up to 10-60X performance gains in query planning compared to the previous releases of Drill
• Better memory management
– Delivers greater stability and scale which enables customers to run not only larger but also more SQL
workloads on a MapR cluster
• Improved integration with visualization tools like Tableau
– Introduces client impersonation for end-to-end security from the visualization tool to data in Hadoop.
– Enhanced SQL Window functions
© 2016 MapR Technologies 43
What’s New in Spark 2.0?
• Structured Streaming with Spark SQL
– The ability to perform interactive queries against live streaming data.
– Output can now be aggregated in a stream for continuous applications.
– Pre-computation of analytics in a continuous fashion can occur as the data is generated
• Whole Stage Code-gen
– Provided by the second-generation Tungsten engine.
– Eliminates the need for multiple JVM calls by flattening SQL queries into one single
function evaluated as bytecode at runtime.
• Dataframe API’s
– Runs on the same engine as SparkSQL.
– Allows access to data from a variety of different data sources.
– Can run database-like operations or allow for passing in custom code.
© 2016 MapR Technologies 44© 2016 MapR Technologies
Upgrade to the Latest MapR
Converged Community Edition
© 2016 MapR Technologies 45
Select an Upgrade Method
Takes advantage of
high-availability features
Offline
Installer
Time
Complexity
Rolling
Manual
Rolling
Scripted
Offline
Manual
Cluster offline during upgrade
© 2016 MapR Technologies 46
Community Edition and Rolling Upgrades
• Expect interruptions to cluster operations when nodes running the
only copy of a service (for example, CLDB) are upgraded
• Minimize cluster access
• With 10 or fewer nodes,
offline upgrade probably
makes the most sense
Offline
Installer
Rolling
Manual
Rolling
Scripted
Offline
Manual
© 2016 MapR Technologies 47
Supported Upgrade Methods
From Version Offline Installer Offline Manual Rolling Manual Rolling Scripted
3.x
4.0
4.1
5.0
5.1
* Supported for clusters that were installed using the MapR Installer. This is the only
method that also upgrades ecosystem components.
© 2016 MapR Technologies 48
High-Level Overview
2
Prepare
1
Plan! Upgrade
3
© 2016 MapR Technologies 49
Plan: Determine What to Include
MapR Core
Ecosystem components not at supported MEP
MapR clients
New features
?
?
© 2016 MapR Technologies 50
Plan: Develop a Test Plan
• Run tests before and after each upgrade step
– Compare results
• Test basic functionality
– Verify cluster access and volumes
– Use maprcli, hadoop fs, MCS
• Test jobs and queries
– Based on the components you use
© 2016 MapR Technologies 51
Plan: Create an Upgrade Schedule
What needs to
happen after the
upgrade?
What can be done
days ahead?
What needs to
happen the day of
the upgrade?
What can be done
weeks ahead?
© 2016 MapR Technologies 52
Prepare: Weeks Ahead
• Review Release Notes
• Verify node specifications
– Update the JDK if needed
• Upgrade on a test cluster
– Document surprises
– Prepare configuration files
Weeks
Ahead
Critical!Critical!
© 2016 MapR Technologies 53
Prepare: Days Ahead
• Download the installer, packages, etc.
• Run tests and record results
• Back up critical data
Days
Ahead
© 2016 MapR Technologies 54
Prepare: Day of Upgrade
• Verify cluster health and clear alarms
• Empty job queue/terminate jobs
• Stop cross-cluster operations
– Volume mirroring
– Table replication
© 2016 MapR Technologies 55
Upgrade Order
1. MapR core
2. Ecosystem components
• Upgraded manually, unless using MapR Installer
3. MapR clients
4. Enable new features
© 2016 MapR Technologies 56
Upgrade MapR Core
Component Includes
MapReduce binaries
MapR Core
Webserver
maprcli command binaries, MCS, REST API
Other services
New features, performance enhancements (varies by release)
© 2016 MapR Technologies 57
Upgrade MapR Core: Config Files
New default configuration files created:
Active Configuration Files
(do not change during upgrade)
New Configuration Files
(added with upgrade)
/opt/mapr/conf /opt/mapr/conf.new
/opt/mapr/conf/conf.d /opt/mapr/conf.d.new
/opt/mapr/hadoop/hadoop-<ver>/conf opt/mapr/hadoop/hadoop-<ver>/conf.new
© 2016 MapR Technologies 58
Upgrade MapR Core: Config Files
New default configuration files created:
Active Configuration Files
(do not change during upgrade)
New Configuration Files
(added with upgrade)
/opt/mapr/conf /opt/mapr/conf.new
/opt/mapr/conf/conf.d /opt/mapr/conf.d.new
/opt/mapr/hadoop/hadoop-<ver>/conf opt/mapr/hadoop/hadoop-<ver>/conf.new
Important! Merge
required changes into
active configuration files
© 2016 MapR Technologies 59
Upgrade MapR Core: Hadoop Common Version
1. New Hadoop directory created at:
/opt/mapr/hadoop/hadoop-<version>
2. Existing Hadoop directory moved to:
/opt/mapr/hadoop/OLD_HADOOP_VERSIONS
3. Links updated for new version:
/opt/mapr/lib/*.jar
4. Paths in service configuration files updated:
/opt/mapr/conf/conf.d/warden.<service name>.conf
© 2016 MapR Technologies 60
Upgrade MapR Core: Post-Upgrade Tasks
• If upgrading from 5.0 or earlier, copy new license file into place on each
node:
cp /opt/mapr/conf.new/BaseLicense.txt /opt/mapr/conf/
• After a manual (rolling, or offline) upgrade, update Hadoop configuration
file with new version:
/opt/mapr/conf/hadoop_version
• Resume cross-cluster operations
– Volume mirroring
– Table replication
© 2016 MapR Technologies 61
Upgrade Ecosystem Components
• Follow pre- and post-upgrade
steps in documentation
• As of MapR 5.2, must upgrade
to ecosystem components that
belong to the same MapR
Ecosystem Pack (MEP)
http://maprdocs.mapr.com/home/InteropMatrix/r_MEP_52.html
© 2016 MapR Technologies 62
Upgrade MapR Clients
MapR Client
(Windows, Mac, Linux)
Cluster
hadoop fs –ls /
maprcli volume list
© 2016 MapR Technologies 63
Upgrade MapR POSIX Clients
• Loopback POSIX client
• FUSE-based POSIX client
– FUSE-based new in MapR 5.1
• Recommend: upgrade to
FUSE-based POSIX client
MapR POSIX Client
(Linux only)
© 2016 MapR Technologies 64
Upgrading from MapR 3.x
• To run MapReduce v1 jobs, change the default MapReduce
mode or submit them with the appropriate command
• May need to recompile MapReduce jobs
• May need to add YARN services to cluster
http://maprdocs.mapr.com/home/UpgradeGuide/RunningMRjobsYarn.html
© 2016 MapR Technologies 65
Other Upgrade Considerations
• Mirroring between clusters
– Volumes must be mirrored to a cluster at the same, or higher, revision
– Upgrade the destination cluster first!
– Consider disabling mirror operations during the upgrades, to avoid
alarms and maximize available bandwidth
• Table replication between clusters
– Clusters involved in table replication can be at different versions
© 2016 MapR Technologies 66
Q&AEngage with us!
• Spyglass Initiative
o https://www.mapr.com/products/spyglass-initiative
• Try out MapR Streams and MapR-DB in the free MapR Community
Edition
o https://www.mapr.com/products/hadoop-download
• Try out MapR Streams and MapR-DB in the MapR Sandbox (virtual
machine)
o https://www.mapr.com/products/mapr-sandbox-hadoop

Mais conteúdo relacionado

Mais procurados

MapR Streams and MapR Converged Data Platform
MapR Streams and MapR Converged Data PlatformMapR Streams and MapR Converged Data Platform
MapR Streams and MapR Converged Data PlatformMapR Technologies
 
Dchug m7-30 apr2013
Dchug m7-30 apr2013Dchug m7-30 apr2013
Dchug m7-30 apr2013jdfiori
 
Open Source Innovations in the MapR Ecosystem Pack 2.0
Open Source Innovations in the MapR Ecosystem Pack 2.0Open Source Innovations in the MapR Ecosystem Pack 2.0
Open Source Innovations in the MapR Ecosystem Pack 2.0MapR Technologies
 
How Spark is Enabling the New Wave of Converged Cloud Applications
How Spark is Enabling the New Wave of Converged Cloud Applications How Spark is Enabling the New Wave of Converged Cloud Applications
How Spark is Enabling the New Wave of Converged Cloud Applications MapR Technologies
 
Build a Time Series Application with Apache Spark and Apache HBase
Build a Time Series Application with Apache Spark and Apache  HBaseBuild a Time Series Application with Apache Spark and Apache  HBase
Build a Time Series Application with Apache Spark and Apache HBaseCarol McDonald
 
Deep Learning vs. Cheap Learning
Deep Learning vs. Cheap LearningDeep Learning vs. Cheap Learning
Deep Learning vs. Cheap LearningMapR Technologies
 
20140228 - Singapore - BDAS - Ensuring Hadoop Production Success
20140228 - Singapore - BDAS - Ensuring Hadoop Production Success20140228 - Singapore - BDAS - Ensuring Hadoop Production Success
20140228 - Singapore - BDAS - Ensuring Hadoop Production SuccessAllen Day, PhD
 
Architectural Overview of MapR's Apache Hadoop Distribution
Architectural Overview of MapR's Apache Hadoop DistributionArchitectural Overview of MapR's Apache Hadoop Distribution
Architectural Overview of MapR's Apache Hadoop Distributionmcsrivas
 
Evolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQLEvolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQLMapR Technologies
 
Real-time Hadoop: The Ideal Messaging System for Hadoop
Real-time Hadoop: The Ideal Messaging System for Hadoop Real-time Hadoop: The Ideal Messaging System for Hadoop
Real-time Hadoop: The Ideal Messaging System for Hadoop DataWorks Summit/Hadoop Summit
 
Design, Scale and Performance of MapR's Distribution for Hadoop
Design, Scale and Performance of MapR's Distribution for HadoopDesign, Scale and Performance of MapR's Distribution for Hadoop
Design, Scale and Performance of MapR's Distribution for Hadoopmcsrivas
 
Real Time and Big Data – It’s About Time
Real Time and Big Data – It’s About TimeReal Time and Big Data – It’s About Time
Real Time and Big Data – It’s About TimeMapR Technologies
 
Hive at Yahoo: Letters from the trenches
Hive at Yahoo: Letters from the trenchesHive at Yahoo: Letters from the trenches
Hive at Yahoo: Letters from the trenchesDataWorks Summit
 
Anomaly Detection in Telecom with Spark - Tugdual Grall - Codemotion Amsterda...
Anomaly Detection in Telecom with Spark - Tugdual Grall - Codemotion Amsterda...Anomaly Detection in Telecom with Spark - Tugdual Grall - Codemotion Amsterda...
Anomaly Detection in Telecom with Spark - Tugdual Grall - Codemotion Amsterda...Codemotion
 
Big Data Performance and Capacity Management
Big Data Performance and Capacity ManagementBig Data Performance and Capacity Management
Big Data Performance and Capacity Managementrightsize
 
Back to School - St. Louis Hadoop Meetup September 2016
Back to School - St. Louis Hadoop Meetup September 2016Back to School - St. Louis Hadoop Meetup September 2016
Back to School - St. Louis Hadoop Meetup September 2016Adam Doyle
 
A New "Sparkitecture" for modernizing your data warehouse
A New "Sparkitecture" for modernizing your data warehouseA New "Sparkitecture" for modernizing your data warehouse
A New "Sparkitecture" for modernizing your data warehouseDataWorks Summit/Hadoop Summit
 
Hadoop from Hive with Stinger to Tez
Hadoop from Hive with Stinger to TezHadoop from Hive with Stinger to Tez
Hadoop from Hive with Stinger to TezJan Pieter Posthuma
 
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...Mathieu Dumoulin
 

Mais procurados (20)

Philly DB MapR Overview
Philly DB MapR OverviewPhilly DB MapR Overview
Philly DB MapR Overview
 
MapR Streams and MapR Converged Data Platform
MapR Streams and MapR Converged Data PlatformMapR Streams and MapR Converged Data Platform
MapR Streams and MapR Converged Data Platform
 
Dchug m7-30 apr2013
Dchug m7-30 apr2013Dchug m7-30 apr2013
Dchug m7-30 apr2013
 
Open Source Innovations in the MapR Ecosystem Pack 2.0
Open Source Innovations in the MapR Ecosystem Pack 2.0Open Source Innovations in the MapR Ecosystem Pack 2.0
Open Source Innovations in the MapR Ecosystem Pack 2.0
 
How Spark is Enabling the New Wave of Converged Cloud Applications
How Spark is Enabling the New Wave of Converged Cloud Applications How Spark is Enabling the New Wave of Converged Cloud Applications
How Spark is Enabling the New Wave of Converged Cloud Applications
 
Build a Time Series Application with Apache Spark and Apache HBase
Build a Time Series Application with Apache Spark and Apache  HBaseBuild a Time Series Application with Apache Spark and Apache  HBase
Build a Time Series Application with Apache Spark and Apache HBase
 
Deep Learning vs. Cheap Learning
Deep Learning vs. Cheap LearningDeep Learning vs. Cheap Learning
Deep Learning vs. Cheap Learning
 
20140228 - Singapore - BDAS - Ensuring Hadoop Production Success
20140228 - Singapore - BDAS - Ensuring Hadoop Production Success20140228 - Singapore - BDAS - Ensuring Hadoop Production Success
20140228 - Singapore - BDAS - Ensuring Hadoop Production Success
 
Architectural Overview of MapR's Apache Hadoop Distribution
Architectural Overview of MapR's Apache Hadoop DistributionArchitectural Overview of MapR's Apache Hadoop Distribution
Architectural Overview of MapR's Apache Hadoop Distribution
 
Evolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQLEvolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQL
 
Real-time Hadoop: The Ideal Messaging System for Hadoop
Real-time Hadoop: The Ideal Messaging System for Hadoop Real-time Hadoop: The Ideal Messaging System for Hadoop
Real-time Hadoop: The Ideal Messaging System for Hadoop
 
Design, Scale and Performance of MapR's Distribution for Hadoop
Design, Scale and Performance of MapR's Distribution for HadoopDesign, Scale and Performance of MapR's Distribution for Hadoop
Design, Scale and Performance of MapR's Distribution for Hadoop
 
Real Time and Big Data – It’s About Time
Real Time and Big Data – It’s About TimeReal Time and Big Data – It’s About Time
Real Time and Big Data – It’s About Time
 
Hive at Yahoo: Letters from the trenches
Hive at Yahoo: Letters from the trenchesHive at Yahoo: Letters from the trenches
Hive at Yahoo: Letters from the trenches
 
Anomaly Detection in Telecom with Spark - Tugdual Grall - Codemotion Amsterda...
Anomaly Detection in Telecom with Spark - Tugdual Grall - Codemotion Amsterda...Anomaly Detection in Telecom with Spark - Tugdual Grall - Codemotion Amsterda...
Anomaly Detection in Telecom with Spark - Tugdual Grall - Codemotion Amsterda...
 
Big Data Performance and Capacity Management
Big Data Performance and Capacity ManagementBig Data Performance and Capacity Management
Big Data Performance and Capacity Management
 
Back to School - St. Louis Hadoop Meetup September 2016
Back to School - St. Louis Hadoop Meetup September 2016Back to School - St. Louis Hadoop Meetup September 2016
Back to School - St. Louis Hadoop Meetup September 2016
 
A New "Sparkitecture" for modernizing your data warehouse
A New "Sparkitecture" for modernizing your data warehouseA New "Sparkitecture" for modernizing your data warehouse
A New "Sparkitecture" for modernizing your data warehouse
 
Hadoop from Hive with Stinger to Tez
Hadoop from Hive with Stinger to TezHadoop from Hive with Stinger to Tez
Hadoop from Hive with Stinger to Tez
 
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
 

Destaque

Reference Architecture: EMC Hybrid Cloud with VMware
Reference Architecture: EMC Hybrid Cloud with VMwareReference Architecture: EMC Hybrid Cloud with VMware
Reference Architecture: EMC Hybrid Cloud with VMwareEMC
 
Julie Van den Steen en Maarten Verhulst richten firma op
Julie Van den Steen en Maarten Verhulst richten firma opJulie Van den Steen en Maarten Verhulst richten firma op
Julie Van den Steen en Maarten Verhulst richten firma opThierry Debels
 
'Living Lab' for HCI - presentation made at HCI International 2009
'Living Lab' for HCI - presentation made at HCI International 2009'Living Lab' for HCI - presentation made at HCI International 2009
'Living Lab' for HCI - presentation made at HCI International 2009Ed Chi
 
Collaboration with Eclipse final
Collaboration with Eclipse finalCollaboration with Eclipse final
Collaboration with Eclipse finalKenu, GwangNam Heo
 
The Biggest Lies That Digital Marketers Tell Themselves - 3XE Digital
The Biggest Lies That Digital Marketers Tell Themselves - 3XE DigitalThe Biggest Lies That Digital Marketers Tell Themselves - 3XE Digital
The Biggest Lies That Digital Marketers Tell Themselves - 3XE DigitalEduardas Gricius
 
Legrand Group Belgium - Brochure Sfera
Legrand Group Belgium - Brochure SferaLegrand Group Belgium - Brochure Sfera
Legrand Group Belgium - Brochure SferaArchitectura
 
Digital transformation - DevOps Day - 02/02/2017
Digital transformation - DevOps Day - 02/02/2017Digital transformation - DevOps Day - 02/02/2017
Digital transformation - DevOps Day - 02/02/2017Clara Feuillet
 
And the new System Center is here... what's actually new?
And the new System Center is here... what's actually new?And the new System Center is here... what's actually new?
And the new System Center is here... what's actually new?Tomica Kaniski
 
Giip bp-giip connectivity1703
Giip bp-giip connectivity1703Giip bp-giip connectivity1703
Giip bp-giip connectivity1703Lowy Shin
 
Conociendo los servicios adicionales en big data
Conociendo los servicios adicionales en big dataConociendo los servicios adicionales en big data
Conociendo los servicios adicionales en big dataSpanishPASSVC
 
Lambda Architecture and open source technology stack for real time big data
Lambda Architecture and open source technology stack for real time big dataLambda Architecture and open source technology stack for real time big data
Lambda Architecture and open source technology stack for real time big dataTrieu Nguyen
 
How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...Vladimir Bacvanski, PhD
 
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Caserta
 
Grade 3 text structure assessment teaching guide
Grade 3 text structure assessment teaching guideGrade 3 text structure assessment teaching guide
Grade 3 text structure assessment teaching guideEmily Kissner
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Understand the Breadth and Depth of Solr via the Admin UI: Presented by Upaya...
Understand the Breadth and Depth of Solr via the Admin UI: Presented by Upaya...Understand the Breadth and Depth of Solr via the Admin UI: Presented by Upaya...
Understand the Breadth and Depth of Solr via the Admin UI: Presented by Upaya...Lucidworks
 
A4 drive dev_ops_agility_and_operational_efficiency
A4 drive dev_ops_agility_and_operational_efficiencyA4 drive dev_ops_agility_and_operational_efficiency
A4 drive dev_ops_agility_and_operational_efficiencyDr. Wilfred Lin (Ph.D.)
 

Destaque (20)

Reference Architecture: EMC Hybrid Cloud with VMware
Reference Architecture: EMC Hybrid Cloud with VMwareReference Architecture: EMC Hybrid Cloud with VMware
Reference Architecture: EMC Hybrid Cloud with VMware
 
Julie Van den Steen en Maarten Verhulst richten firma op
Julie Van den Steen en Maarten Verhulst richten firma opJulie Van den Steen en Maarten Verhulst richten firma op
Julie Van den Steen en Maarten Verhulst richten firma op
 
Unc plus delta
Unc plus deltaUnc plus delta
Unc plus delta
 
'Living Lab' for HCI - presentation made at HCI International 2009
'Living Lab' for HCI - presentation made at HCI International 2009'Living Lab' for HCI - presentation made at HCI International 2009
'Living Lab' for HCI - presentation made at HCI International 2009
 
Collaboration with Eclipse final
Collaboration with Eclipse finalCollaboration with Eclipse final
Collaboration with Eclipse final
 
The Biggest Lies That Digital Marketers Tell Themselves - 3XE Digital
The Biggest Lies That Digital Marketers Tell Themselves - 3XE DigitalThe Biggest Lies That Digital Marketers Tell Themselves - 3XE Digital
The Biggest Lies That Digital Marketers Tell Themselves - 3XE Digital
 
Legrand Group Belgium - Brochure Sfera
Legrand Group Belgium - Brochure SferaLegrand Group Belgium - Brochure Sfera
Legrand Group Belgium - Brochure Sfera
 
Digital transformation - DevOps Day - 02/02/2017
Digital transformation - DevOps Day - 02/02/2017Digital transformation - DevOps Day - 02/02/2017
Digital transformation - DevOps Day - 02/02/2017
 
And the new System Center is here... what's actually new?
And the new System Center is here... what's actually new?And the new System Center is here... what's actually new?
And the new System Center is here... what's actually new?
 
The Beauty of BAD code
The Beauty of  BAD codeThe Beauty of  BAD code
The Beauty of BAD code
 
Giip bp-giip connectivity1703
Giip bp-giip connectivity1703Giip bp-giip connectivity1703
Giip bp-giip connectivity1703
 
Conociendo los servicios adicionales en big data
Conociendo los servicios adicionales en big dataConociendo los servicios adicionales en big data
Conociendo los servicios adicionales en big data
 
Lambda Architecture and open source technology stack for real time big data
Lambda Architecture and open source technology stack for real time big dataLambda Architecture and open source technology stack for real time big data
Lambda Architecture and open source technology stack for real time big data
 
How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data using InfoSphere BigInsights...
 
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
 
Grade 3 text structure assessment teaching guide
Grade 3 text structure assessment teaching guideGrade 3 text structure assessment teaching guide
Grade 3 text structure assessment teaching guide
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Understand the Breadth and Depth of Solr via the Admin UI: Presented by Upaya...
Understand the Breadth and Depth of Solr via the Admin UI: Presented by Upaya...Understand the Breadth and Depth of Solr via the Admin UI: Presented by Upaya...
Understand the Breadth and Depth of Solr via the Admin UI: Presented by Upaya...
 
A4 drive dev_ops_agility_and_operational_efficiency
A4 drive dev_ops_agility_and_operational_efficiencyA4 drive dev_ops_agility_and_operational_efficiency
A4 drive dev_ops_agility_and_operational_efficiency
 
Migrating to aws
Migrating to awsMigrating to aws
Migrating to aws
 

Semelhante a MapR 5.2: Getting More Value from the MapR Converged Community Edition

MapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data PlatformMapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data PlatformMapR Technologies
 
Fast Cars, Big Data - How Streaming Can Help Formula 1
Fast Cars, Big Data - How Streaming Can Help Formula 1Fast Cars, Big Data - How Streaming Can Help Formula 1
Fast Cars, Big Data - How Streaming Can Help Formula 1Tugdual Grall
 
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016Mathieu Dumoulin
 
Evolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and RainEvolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and RainMapR Technologies
 
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...MapR Technologies
 
Container and Kubernetes without limits
Container and Kubernetes without limitsContainer and Kubernetes without limits
Container and Kubernetes without limitsAntje Barth
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceeRic Choo
 
Putting Apache Drill into Production
Putting Apache Drill into ProductionPutting Apache Drill into Production
Putting Apache Drill into ProductionMapR Technologies
 
Downtime is not an option - day 2 operations - Jörg Schad
Downtime is not an option - day 2 operations -  Jörg SchadDowntime is not an option - day 2 operations -  Jörg Schad
Downtime is not an option - day 2 operations - Jörg SchadCodemotion
 
Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!
Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!
Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!Tugdual Grall
 
True Reusable Code - DevSum2016
True Reusable Code - DevSum2016True Reusable Code - DevSum2016
True Reusable Code - DevSum2016Eduard Lazar
 
Map r seattle streams meetup oct 2016
Map r seattle streams meetup   oct 2016Map r seattle streams meetup   oct 2016
Map r seattle streams meetup oct 2016Nitin Kumar
 
Spark One Platform Webinar
Spark One Platform WebinarSpark One Platform Webinar
Spark One Platform WebinarCloudera, Inc.
 
Real World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in ProductionReal World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in ProductionCodemotion
 
Advanced Threat Detection on Streaming Data
Advanced Threat Detection on Streaming DataAdvanced Threat Detection on Streaming Data
Advanced Threat Detection on Streaming DataCarol McDonald
 
Hadoop Summit Brussels 2015: Architecting a Scalable Hadoop Platform - Top 10...
Hadoop Summit Brussels 2015: Architecting a Scalable Hadoop Platform - Top 10...Hadoop Summit Brussels 2015: Architecting a Scalable Hadoop Platform - Top 10...
Hadoop Summit Brussels 2015: Architecting a Scalable Hadoop Platform - Top 10...Sumeet Singh
 

Semelhante a MapR 5.2: Getting More Value from the MapR Converged Community Edition (20)

MapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data PlatformMapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data Platform
 
Fast Cars, Big Data - How Streaming Can Help Formula 1
Fast Cars, Big Data - How Streaming Can Help Formula 1Fast Cars, Big Data - How Streaming Can Help Formula 1
Fast Cars, Big Data - How Streaming Can Help Formula 1
 
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016
 
Evolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and RainEvolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and Rain
 
Streaming in the Extreme
Streaming in the ExtremeStreaming in the Extreme
Streaming in the Extreme
 
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
 
Container and Kubernetes without limits
Container and Kubernetes without limitsContainer and Kubernetes without limits
Container and Kubernetes without limits
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data Science
 
Is Spark Replacing Hadoop
Is Spark Replacing HadoopIs Spark Replacing Hadoop
Is Spark Replacing Hadoop
 
Putting Apache Drill into Production
Putting Apache Drill into ProductionPutting Apache Drill into Production
Putting Apache Drill into Production
 
Downtime is not an option - day 2 operations - Jörg Schad
Downtime is not an option - day 2 operations -  Jörg SchadDowntime is not an option - day 2 operations -  Jörg Schad
Downtime is not an option - day 2 operations - Jörg Schad
 
Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!
Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!
Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!
 
True Reusable Code - DevSum2016
True Reusable Code - DevSum2016True Reusable Code - DevSum2016
True Reusable Code - DevSum2016
 
Map r seattle streams meetup oct 2016
Map r seattle streams meetup   oct 2016Map r seattle streams meetup   oct 2016
Map r seattle streams meetup oct 2016
 
Spark One Platform Webinar
Spark One Platform WebinarSpark One Platform Webinar
Spark One Platform Webinar
 
Real World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in ProductionReal World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in Production
 
MapR Unique features
MapR Unique featuresMapR Unique features
MapR Unique features
 
APAN Cloud WG (2015/3/2)
APAN Cloud WG (2015/3/2)APAN Cloud WG (2015/3/2)
APAN Cloud WG (2015/3/2)
 
Advanced Threat Detection on Streaming Data
Advanced Threat Detection on Streaming DataAdvanced Threat Detection on Streaming Data
Advanced Threat Detection on Streaming Data
 
Hadoop Summit Brussels 2015: Architecting a Scalable Hadoop Platform - Top 10...
Hadoop Summit Brussels 2015: Architecting a Scalable Hadoop Platform - Top 10...Hadoop Summit Brussels 2015: Architecting a Scalable Hadoop Platform - Top 10...
Hadoop Summit Brussels 2015: Architecting a Scalable Hadoop Platform - Top 10...
 

Mais de MapR Technologies

Converging your data landscape
Converging your data landscapeConverging your data landscape
Converging your data landscapeMapR Technologies
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationMapR Technologies
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataMapR Technologies
 
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureEnabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureMapR Technologies
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...MapR Technologies
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsMapR Technologies
 
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMapR Technologies
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action MapR Technologies
 
Live Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIsLive Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIsMapR Technologies
 
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageBringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageMapR Technologies
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionMapR Technologies
 
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformAn Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformMapR Technologies
 
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...MapR Technologies
 
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareBest Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareMapR Technologies
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsMapR Technologies
 
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Technologies
 
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data AnalyticsMapR Technologies
 
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsCisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsMapR Technologies
 
MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR Technologies
 
Handling the Extremes: Scaling and Streaming in Finance
Handling the Extremes: Scaling and Streaming in FinanceHandling the Extremes: Scaling and Streaming in Finance
Handling the Extremes: Scaling and Streaming in FinanceMapR Technologies
 

Mais de MapR Technologies (20)

Converging your data landscape
Converging your data landscapeConverging your data landscape
Converging your data landscape
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & Evaluation
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your Data
 
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureEnabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data Capture
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning Logistics
 
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
 
Live Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIsLive Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIs
 
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageBringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
 
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformAn Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data Platform
 
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
 
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareBest Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in Healthcare
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
 
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Product Update - Spring 2017
MapR Product Update - Spring 2017
 
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics
 
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsCisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
 
MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -
 
Handling the Extremes: Scaling and Streaming in Finance
Handling the Extremes: Scaling and Streaming in FinanceHandling the Extremes: Scaling and Streaming in Finance
Handling the Extremes: Scaling and Streaming in Finance
 

Último

Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...amitlee9823
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsJoseMangaJr1
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxolyaivanovalion
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 

Último (20)

Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptx
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 

MapR 5.2: Getting More Value from the MapR Converged Community Edition

  • 1. © 2016 MapR Technologies© 2016 MapR Technologies MapR 5.2: Getting More Value from the MapR Converged Community Edition Sep 14, 2016
  • 2. © 2016 MapR Technologies Today’s Presenters Deborah Littlefield Technical Curriculum Developer Ankur Desai Sr. Manager, Platform and Products
  • 3. © 2016 MapR Technologies 3 Today’s Agenda • Recent updates to the MapR Converged Data Platform • Latest Ecosystem Support in MapR 5.2 • How to upgrade to the latest version of the Community Edition • Q&A
  • 4. © 2016 MapR Technologies 4 The MapR Converged Data Platform
  • 5. © 2016 MapR Technologies 5 4 Major Additions to the MapR Platform in the past 12 months • Taking cluster monitoring to the next level with the Spyglass Initiative • Real-time streaming with MapR Streams • MapR-DB JSON document database and application development with OJAI • Securing your data with access control expressions (ACEs)
  • 6. © 2016 MapR Technologies 6© 2016 MapR Technologies Project Spyglass
  • 7. © 2016 MapR Technologies 7 MapR Vision: Maximizing User/Operator Productivity Deep Visibility Another sample Easy Management Full Control
  • 8. © 2016 MapR Technologies 8 The MapR Spyglass Initiative • New approach for increasing user and administrator productivity – Comprehensive, open, extensible • Simplifies the management of growing big data deployments • Starts with 5.2 release – Phase 1 – MapR Monitoring – Initial focus on operational visibility • Helps community innovate faster – Extensive use of open source visualization and dashboarding tools
  • 9. © 2016 MapR Technologies 9 Spyglass Initiative Phase 1 - MapR Monitoring Empower administrators with cluster monitoring capabilities, including metric and log collection from nodes, services, and jobs, with dashboards to display information in a useful way. Converged Customizable Extensible
  • 10. © 2016 MapR Technologies 10 Collection VisualizationAggregation & Storage MapR Monitoring Architecture Future Data Sources Log Shippers Metrics Collectors Alerting Node Environmentals (CPU, Mem, I/O) Service Daemons (YARN, Drill, Hive, etc.) MapR Control System …
  • 11. © 2014 MapR Technologies 11 Project Spyglass – Monitoring All You Care About Node/Infrastructure Monitoring • Global Aggregates (Average, Min, Max) Charts (e.g. CPU, Disk utilization) • Per-node charts (e.g. I/O Throughput by disk) • MFS read/writes and throughput • DB puts, gets, scans and cache metrics
  • 12. © 2014 MapR Technologies 12 Project Spyglass – Monitoring All You Care About Node/Infrastructure Monitoring • Global Aggregates (Average, Min, Max) Charts (e.g. CPU, Disk utilization) • Per-node charts (e.g. I/O Throughput by disk) • MFS read/writes and throughput • DB puts, gets, scans and cache metrics Cluster Space Utilization Monitoring • Cluster wide storage utilization • Storage Utilization Trend • Utilization per volume and per accountable entity (data, volume, snapshot and total size)
  • 13. © 2014 MapR Technologies 13 Project Spyglass – Monitoring All You Care About Node/Infrastructure Monitoring • Global Aggregates (Average, Min, Max) Charts (e.g. CPU, Disk utilization) • Per-node charts (e.g. I/O Throughput by disk) • MFS read/writes and throughput • DB puts, gets, scans and cache metrics Cluster Space Utilization Monitoring • Cluster wide storage utilization • Storage Utilization Trend • Utilization per volume and per accountable entity (data, volume, snapshot and total size) YARN/MR Application Monitoring • Global YARN trend graphs • Containers - Pending, Active • vCores & RAM - Allocated & Used • Per Queue charts - containers, vCores, RAM
  • 14. © 2014 MapR Technologies 14 Project Spyglass – Monitoring All You Care About Node/Infrastructure Monitoring • Global Aggregates (Average, Min, Max) Charts (e.g. CPU, Disk utilization) • Per-node charts (e.g. I/O Throughput by disk) • MFS read/writes and throughput • DB puts, gets, scans and cache metrics Cluster Space Utilization Monitoring • Cluster wide storage utilization • Storage Utilization Trend • Utilization per volume and per accountable entity (data, volume, snapshot and total size) YARN/MR Application Monitoring • Global YARN trend graphs • Containers - Pending, Active • vCores & RAM - Allocated & Used • Per Queue charts - containers, vCores, RAM Service Daemon Monitoring • Per-service charts with for (CPU Usage by type, Memory) • Centralized, searchable logs • MapR core and ecosystem services (includes YARN, Drill and Spark)
  • 15. © 2016 MapR Technologies 15 Customizable Dashboards for Visualizing Metrics Log Analytics
  • 16. © 2016 MapR Technologies 16 Destination to Learn and Collaborate Blog about topics and ideas Share code snippets and dashboards View demos, tutorials, and videos Engage in use case discussion/development
  • 17. © 2016 MapR Technologies 17 Dashboards are defined with JSON and easy to export and import in Grafana and Kibana Extend/Integrate using REST API The Exchange
  • 18. © 2016 MapR Technologies 18 Dashboards can be viewed on mobile devices.
  • 19. © 2016 MapR Technologies 19 Summary ● Data collection and storage infrastructure (packaged and supported) ○ Collection/storage of metrics & logs across node, storage, services ● Visualization dashboard (Driven via community) ○ Sample dashboards for Grafana & Kibana 5.2 - Spyglass 1.0 GA CUSTOMIZABLE, shareable and mobile-ready dashboards CONVERGED monitoring with deep search EXTENSIBLE and easy to integrate with REST API
  • 20. © 2016 MapR Technologies 20© 2016 MapR Technologies MapR Streams
  • 21. © 2016 MapR Technologies 21 MapR Streams: Enabling Continuous Data Processing To enable continuous, globally scalable streaming of event data, allowing developers to create real-time applications that their business can depend on. Converged Continuous Global
  • 22. © 2016 MapR Technologies 22 MapR Streams: Publish-subscribe Event Streaming System for Big Data Producers publish billions of messages/sec to a topic in a stream. Guaranteed, immediate delivery to all consumers. Standard real-time API (Kafka). Integrates with Spark Streaming, Storm, Apex, and Flink Direct data access (OJAI API) from analytics frameworks. To pi c Stream Producers Remote sites and consumers Batch analytics Topic Replication Consumers Consumers Available in the Enterprise Edition Only
  • 23. © 2016 MapR Technologies 23 MapR Streams: Building Faster and Simpler Apps Simpler and Faster Architecture • Converged platform with file storage and database reduces data movement, data latency, hardware cost, and administration cost • Event streaming and stream processing in the same cluster enables faster processing • Unified security framework with files and database tables reduces administration cost around setting up and enforcing security policies • Multi-tenant - topic isolation, quotas, data placement control allows multiple isolated streaming applications to run on the same cluster reducing hardware cost and data movement
  • 24. © 2016 MapR Technologies 24 Scalable. • Ingest more events to enable faster insights • Hold on to events longer to enable deeper insights • Develop app once and apply to short & long-term data (i.e. run analysis on 15-days data AND 1-year data using same application) MapR Streams: Building Faster and Simpler Apps
  • 25. © 2016 MapR Technologies 25© 2016 MapR Technologies MapR-DB JSON document database and application development with OJAI
  • 26. © 2016 MapR Technologies 26 Open Source OJAI API for JSON-Based Applications Open JSON Application Interface (OJAI) Databases Streams MapR-Client File Systems {JSON} MapR-Client
  • 27. © 2016 MapR Technologies 27 Familiar JSON Paradigm – Similar API Constructs MapR-DB Document record = Json.newDocument() .set("firstName", "John") .set("lastName", "Doe") .set("age", 50); table.insert("jdoe", record); MongoDB BasicDBObject doc = new BasicDBObject ("firstName", "John") .append("lastName", "Doe") .append("age", 50); coll.insert(doc);
  • 28. © 2016 MapR Technologies 28 JSON: Easy Variation with Documents { "_id" : "rp-prod132546", "name" : "Marvel T2 Athena”, "brand" : "Pinarello", "category" : "bike", "type" : "Road Bike”, "price" : 2949.99, "size" : "55cm", "wheel_size" : "700c", "frameset" : { "frame" : "Carbon Toryaca", "fork" : "Onda 2V C" }, "groupset" : { "chainset" : "Camp. Athena 50/34", "brake" : "Camp." }, "wheelset" : { "wheels" : "Camp. Zonda", "tyres" : "Vittoria Pro" } } { "_id" : "rp-prod106702", "name" : " Ultegra SPD-SL 6800”, "brand" : "Shimano", "category" : "pedals", "type" : "Components, "price" : 112.99, "features" : [ "Low profile design increases ...", "Supplied with floating SH11 cleats", "Weight: 260g (pair)" ] } { "_id" : "rp-prod113104", "name" : "Bianchi Pride Jersey SS15”, "brand" : "Nalini", "category" : "Jersey", "type" : "Clothing, "price" : 76.99, "features" : [ "100% Polyester", "3/4 hidden zip", "3 rear pocket" ], "color" : "black" } jerseypedalbike
  • 29. © 2016 MapR Technologies 29 Product Catalog - RDBMS To get a single product“Entity Value Attribute” pattern SELECT * FROM ( SELECT ce.sku, ea.attribute_id, ea.attribute_code, CASE ea.backend_type WHEN 'varchar' THEN ce_varchar.value WHEN 'int' THEN ce_int.value WHEN 'text' THEN ce_text.value WHEN 'decimal' THEN ce_decimal.value WHEN 'datetime' THEN ce_datetime.value ELSE ea.backend_type END AS value, ea.is_required AS required FROM catalog_product_entity AS ce LEFT JOIN eav_attribute AS ea ON ce.entity_type_id = ea.entity_type_id LEFT JOIN catalog_product_entity_varchar AS ce_varchar ON ce.entity_id = ce_varchar.entity_id AND ea.attribute_id = ce_varchar.attribute_id AND ea.backend_type = 'varchar' LEFT JOIN catalog_product_entity_text AS ce_text ON ce.entity_id = ce_text.entity_id AND ea.attribute_id = ce_text.attribute_id AND ea.backend_type = 'text' LEFT JOIN catalog_product_entity_decimal AS ce_decimal ON ce.entity_id = ce_decimal.entity_id AND ea.attribute_id = ce_decimal.attribute_id AND ea.backend_type = 'decimal' LEFT JOIN catalog_product_entity_datetime AS ce_datetime ON ce.entity_id = ce_datetime.entity_id AND ea.attribute_id = ce_datetime.attribute_id AND ea.backend_type = 'datetime' WHERE ce.sku = ‘rp-prod132546’ ) AS tab WHERE tab.value != ’’;
  • 30. © 2016 MapR Technologies 30 Store the product “as a business object” To get a single product { "_id" : "rp-prod132546", "name" : "Marvel T2 Athena”, "brand" : "Pinarello", "category" : "bike", "type" : "Road Bike”, "price" : 2949.99, "size" : "55cm", "wheel_size" : "700c", "frameset" : { "frame" : "Carbon Toryaca", "fork" : "Onda 2V C" }, "groupset" : { "chainset" : "Camp. Athena 50/34", "brake" : "Camp." }, "wheelset" : { "wheels" : "Camp. Zonda", "tyres" : "Vittoria Pro" } } products .findById(“rp-prod132546”) Product Catalog - NoSQL/Document
  • 31. © 2016 MapR Technologies 31 Native JSON Support in MapR-DB { order_num: 5555, products: [ { product_id: 348752, quantity: 1, unit_price: 149.99, total_price: 149.99 }, { product_id: 439322, quantity: 1, unit_price: 99.99, total_price: 99.99 }, { product_id: 953923, quantity: 1, unit_price: 49.99, total_price: 49.99 }, ] } Reads/writes at element level • Granular disk reads/writes • Less network traffic • Higher concurrency Any new elements added on demand • No predefined schemas • Easy to store evolving data Not all NoSQL databases treat JSON as a native data type.
  • 32. © 2016 MapR Technologies 32 Leverage the Column Family Construct (Optional) / {a: {a1: {b1: "v1", b2: [ {c1: "v1", c2: "v2"} ] }, a2: { e1: "v1", e2: <inline jpg> } } } Column Family 1 Column Family 2 Control layout for faster data access Different TTL requirements Separate Table Replication settings Specific data placement policies Efficient ACEs
  • 33. © 2016 MapR Technologies 33 Fine Grained Security for JSON Documents { “fname”: “John”, “lname”: “Doe”, “address”: “111 Main St.”, “city”: “San Jose”, “state”: “CA”, “zip”: “95134”, “credit_cards”: [ {“issuer”: “Visa”, “number”: “4444555566667777”}, {“issuer”: “MasterCard”, “number”: “5555666677778888”} ] } Entire document Element: “fname” Array: “credit_cards” Sub-element in array element: “credit_cards[*].number” Specify different permissions levels within the document.
  • 34. © 2016 MapR Technologies 34 Comprehensive Data Type Support for MapR-DB • NULL • Boolean • String • Map • Array • Float, Double • Binary • Byte, Short, Int, Long • Date • Decimal • Interval • Time • Timestamp Examples: { “sample_int”: {"$numberLong”: 2147483647}, “sample_date”: {“$dateDay”: “2016-02-22”}, “sample_decimal”:{“$decimal”: “1234567890.23456789”}, “sample_time”: {“$time”: “10:26:12.487”}, “sample_timestamp”: {“$date”: “2016-02-22T10:26:12.487+Z”} }
  • 35. © 2016 MapR Technologies 35© 2016 MapR Technologies Data Security with Access Control Expressions
  • 36. © 2016 MapR Technologies 36 File ACEs – Key Features Intuitive Inheritance Subdirectories and files inherit perms from parent directory Whole-Volume ACEs Volume-level filter – useful in multitenant environments. Roles Arbitrary grouping of users according to your business needs High Performance No performance hit Boolean Operators Allowing for ultra fine-grain permissions AUTHORIZATION
  • 37. © 2016 MapR Technologies 37 File ACEs: Whole Volume ACE Example Whole-Volume ACE r: group:finance Jane grants read access to Bob. File: /finance/final_report.csv r: user:bob Bob cannot read the file /finance/final_report.csv because the whole-volume ACE is set to allow read-access to finance only. Jane (Finance) Bob (Developer) Whole-Volume ACE AUTHORIZATION
  • 38. © 2016 MapR Technologies 38 POSIX ACLs vs ACEs r : user:sally | (group:dev_team & group:managers) Access Control Lists MapR Access Control Expressions AUTHORIZATION Which one is easier to set and understand? Which one allows for higher granularity?
  • 39. © 2016 MapR Technologies 39 MapR Has ACEs for Files and MapR-DB Records Example: user:mary | (group:admins & group:VP) & user:!bob Permissions on files, tables, column families, columns, JSON documents and sub-documents Use Access Control Expressions (ACEs) to set granular permissions. AUTHORIZATION
  • 40. © 2016 MapR Technologies 40© 2016 MapR Technologies Ecosystem Updates
  • 41. © 2016 MapR Technologies 41 5.2 Ecosystem Support These are the only component version changes in MEP 1.0 from 5.2 release date and all of these have been out for 5.1 already. Eco on 5.1 today MEP 1.0 on 5.2 Component Released with 5.1 Subsequently released for 5.1 Drill 1.4 1.6 1.6 Spark 1.5.2 1.6.1 1.6.1 (2.0 in dev preview) Impala 2.2.0 2.5 2.5 Storm 0.10.0 0.10.1 0.10.1 Mahout 0.11.2 0.12.2 0.12.2
  • 42. © 2016 MapR Technologies 42 Converging SQL and JSON with Apache Drill 1.6 • Flexible and operational analytics on NoSQL – MapR-DB plugin allows analysts to perform SQL queries directly on JSON data in MapR-DB tables – Pushdown capabilities provide optimal interactive experience • Enhanced query performance – Provides better query performance via partition pruning, metadata caching and other optimizations – Delivers up to 10-60X performance gains in query planning compared to the previous releases of Drill • Better memory management – Delivers greater stability and scale which enables customers to run not only larger but also more SQL workloads on a MapR cluster • Improved integration with visualization tools like Tableau – Introduces client impersonation for end-to-end security from the visualization tool to data in Hadoop. – Enhanced SQL Window functions
  • 43. © 2016 MapR Technologies 43 What’s New in Spark 2.0? • Structured Streaming with Spark SQL – The ability to perform interactive queries against live streaming data. – Output can now be aggregated in a stream for continuous applications. – Pre-computation of analytics in a continuous fashion can occur as the data is generated • Whole Stage Code-gen – Provided by the second-generation Tungsten engine. – Eliminates the need for multiple JVM calls by flattening SQL queries into one single function evaluated as bytecode at runtime. • Dataframe API’s – Runs on the same engine as SparkSQL. – Allows access to data from a variety of different data sources. – Can run database-like operations or allow for passing in custom code.
  • 44. © 2016 MapR Technologies 44© 2016 MapR Technologies Upgrade to the Latest MapR Converged Community Edition
  • 45. © 2016 MapR Technologies 45 Select an Upgrade Method Takes advantage of high-availability features Offline Installer Time Complexity Rolling Manual Rolling Scripted Offline Manual Cluster offline during upgrade
  • 46. © 2016 MapR Technologies 46 Community Edition and Rolling Upgrades • Expect interruptions to cluster operations when nodes running the only copy of a service (for example, CLDB) are upgraded • Minimize cluster access • With 10 or fewer nodes, offline upgrade probably makes the most sense Offline Installer Rolling Manual Rolling Scripted Offline Manual
  • 47. © 2016 MapR Technologies 47 Supported Upgrade Methods From Version Offline Installer Offline Manual Rolling Manual Rolling Scripted 3.x 4.0 4.1 5.0 5.1 * Supported for clusters that were installed using the MapR Installer. This is the only method that also upgrades ecosystem components.
  • 48. © 2016 MapR Technologies 48 High-Level Overview 2 Prepare 1 Plan! Upgrade 3
  • 49. © 2016 MapR Technologies 49 Plan: Determine What to Include MapR Core Ecosystem components not at supported MEP MapR clients New features ? ?
  • 50. © 2016 MapR Technologies 50 Plan: Develop a Test Plan • Run tests before and after each upgrade step – Compare results • Test basic functionality – Verify cluster access and volumes – Use maprcli, hadoop fs, MCS • Test jobs and queries – Based on the components you use
  • 51. © 2016 MapR Technologies 51 Plan: Create an Upgrade Schedule What needs to happen after the upgrade? What can be done days ahead? What needs to happen the day of the upgrade? What can be done weeks ahead?
  • 52. © 2016 MapR Technologies 52 Prepare: Weeks Ahead • Review Release Notes • Verify node specifications – Update the JDK if needed • Upgrade on a test cluster – Document surprises – Prepare configuration files Weeks Ahead Critical!Critical!
  • 53. © 2016 MapR Technologies 53 Prepare: Days Ahead • Download the installer, packages, etc. • Run tests and record results • Back up critical data Days Ahead
  • 54. © 2016 MapR Technologies 54 Prepare: Day of Upgrade • Verify cluster health and clear alarms • Empty job queue/terminate jobs • Stop cross-cluster operations – Volume mirroring – Table replication
  • 55. © 2016 MapR Technologies 55 Upgrade Order 1. MapR core 2. Ecosystem components • Upgraded manually, unless using MapR Installer 3. MapR clients 4. Enable new features
  • 56. © 2016 MapR Technologies 56 Upgrade MapR Core Component Includes MapReduce binaries MapR Core Webserver maprcli command binaries, MCS, REST API Other services New features, performance enhancements (varies by release)
  • 57. © 2016 MapR Technologies 57 Upgrade MapR Core: Config Files New default configuration files created: Active Configuration Files (do not change during upgrade) New Configuration Files (added with upgrade) /opt/mapr/conf /opt/mapr/conf.new /opt/mapr/conf/conf.d /opt/mapr/conf.d.new /opt/mapr/hadoop/hadoop-<ver>/conf opt/mapr/hadoop/hadoop-<ver>/conf.new
  • 58. © 2016 MapR Technologies 58 Upgrade MapR Core: Config Files New default configuration files created: Active Configuration Files (do not change during upgrade) New Configuration Files (added with upgrade) /opt/mapr/conf /opt/mapr/conf.new /opt/mapr/conf/conf.d /opt/mapr/conf.d.new /opt/mapr/hadoop/hadoop-<ver>/conf opt/mapr/hadoop/hadoop-<ver>/conf.new Important! Merge required changes into active configuration files
  • 59. © 2016 MapR Technologies 59 Upgrade MapR Core: Hadoop Common Version 1. New Hadoop directory created at: /opt/mapr/hadoop/hadoop-<version> 2. Existing Hadoop directory moved to: /opt/mapr/hadoop/OLD_HADOOP_VERSIONS 3. Links updated for new version: /opt/mapr/lib/*.jar 4. Paths in service configuration files updated: /opt/mapr/conf/conf.d/warden.<service name>.conf
  • 60. © 2016 MapR Technologies 60 Upgrade MapR Core: Post-Upgrade Tasks • If upgrading from 5.0 or earlier, copy new license file into place on each node: cp /opt/mapr/conf.new/BaseLicense.txt /opt/mapr/conf/ • After a manual (rolling, or offline) upgrade, update Hadoop configuration file with new version: /opt/mapr/conf/hadoop_version • Resume cross-cluster operations – Volume mirroring – Table replication
  • 61. © 2016 MapR Technologies 61 Upgrade Ecosystem Components • Follow pre- and post-upgrade steps in documentation • As of MapR 5.2, must upgrade to ecosystem components that belong to the same MapR Ecosystem Pack (MEP) http://maprdocs.mapr.com/home/InteropMatrix/r_MEP_52.html
  • 62. © 2016 MapR Technologies 62 Upgrade MapR Clients MapR Client (Windows, Mac, Linux) Cluster hadoop fs –ls / maprcli volume list
  • 63. © 2016 MapR Technologies 63 Upgrade MapR POSIX Clients • Loopback POSIX client • FUSE-based POSIX client – FUSE-based new in MapR 5.1 • Recommend: upgrade to FUSE-based POSIX client MapR POSIX Client (Linux only)
  • 64. © 2016 MapR Technologies 64 Upgrading from MapR 3.x • To run MapReduce v1 jobs, change the default MapReduce mode or submit them with the appropriate command • May need to recompile MapReduce jobs • May need to add YARN services to cluster http://maprdocs.mapr.com/home/UpgradeGuide/RunningMRjobsYarn.html
  • 65. © 2016 MapR Technologies 65 Other Upgrade Considerations • Mirroring between clusters – Volumes must be mirrored to a cluster at the same, or higher, revision – Upgrade the destination cluster first! – Consider disabling mirror operations during the upgrades, to avoid alarms and maximize available bandwidth • Table replication between clusters – Clusters involved in table replication can be at different versions
  • 66. © 2016 MapR Technologies 66 Q&AEngage with us! • Spyglass Initiative o https://www.mapr.com/products/spyglass-initiative • Try out MapR Streams and MapR-DB in the free MapR Community Edition o https://www.mapr.com/products/hadoop-download • Try out MapR Streams and MapR-DB in the MapR Sandbox (virtual machine) o https://www.mapr.com/products/mapr-sandbox-hadoop