SlideShare uma empresa Scribd logo
1 de 21
Baixar para ler offline
Cloudera	
  Manager	
  –	
  API’s	
  &	
  
Extensibility	
  	
  
Patrick	
  Angeles,	
  Director	
  Field	
  Technical	
  Services	
  
December	
  2013	
  
	
  

1

CONFIDENTIAL	
  -­‐	
  RESTRICTED	
  
Cloudera	
  Manager	
  
End-­‐to-­‐End	
  AdministraHon	
  for	
  CDH	
  

1
Monitor	
  
2
Diagnose	
  
3
Integrate	
  
4
Manage	
  

Easily	
  deploy,	
  configure	
  &	
  opHmize	
  clusters	
  

Maintain	
  a	
  central	
  view	
  of	
  all	
  acHvity	
  

Easily	
  idenHfy	
  and	
  resolve	
  issues	
  

Use	
  Cloudera	
  Manager	
  with	
  exisHng	
  tools	
  

2

©2013	
  Cloudera,	
  Inc.	
  All	
  Rights	
  Reserved.	
  
IntegraHng	
  with	
  your	
  IT	
  Mgmt	
  tools	
  
Datacenter	
  Opera*ons	
  

Various	
  op*ons	
  of	
  integra*ng	
  Cloudera	
  Manager	
  into	
  your	
  exis*ng	
  
Installa;on,	
  
Datacenter	
  Opera*ons/Tools	
  Monitoring	
  
Deployment	
  
Aler;ng	
  
Tools	
  
tools	
  
Tools	
  
e.g.	
  Orion,	
  	
  
•  Cloudera	
  Manager	
  API	
  
e.g.	
  Chef,	
  
e.g	
  Nagios,	
  
Tivoli,	
  BMC	
  
Puppet	
  etc.	
  
SNMP	
  etc.	
  
•  Introduced	
  in	
  CM4	
  (June	
  2etc.	
  
012)	
  
	
  
•  Installa*on	
  &	
  deployment	
  
•  Monitoring	
  
•  SNMP	
  Alerts	
  
•  Introduced	
  in	
  CM4.5	
  (Feb	
  2013)	
  
•  Hadoop	
  Opera*ons	
  
And	
  more…	
  
Cloudera	
  
•  Monitoring	
  ‘tsquery’	
  (Feb	
  2013)	
  
Manager	
  
•  User-­‐defined	
  triggers/alarms	
  (new	
  for	
  C5!)	
  
•  Service	
  extensibility	
  (new	
  for	
  C5!)	
  

3

©2013	
  Cloudera,	
  Inc.	
  All	
  Rights	
  Reserved.	
  
Cloudera	
  Manager	
  (CM)	
  API	
  
	
  
• 

• 

	
  
	
  

4

API	
  access	
  was	
  a	
  new	
  feature	
  introduced	
  in	
  Cloudera	
  Manager	
  4.0,	
  providing	
  programmaHc	
  access	
  to	
  
cluster	
  operaHons	
  (such	
  as	
  configuraHon	
  and	
  restart)	
  and	
  monitoring	
  informaHon	
  (such	
  as	
  health	
  and	
  
metrics).	
  	
  
The	
  CM	
  API	
  is	
  an	
  HTTP	
  REST	
  API,	
  using	
  JSON	
  serializaHon.	
  The	
  API	
  is	
  served	
  on	
  the	
  same	
  host	
  and	
  port	
  as	
  
the	
  CM	
  web	
  UI,	
  and	
  does	
  not	
  require	
  an	
  extra	
  process	
  or	
  extra	
  configuraHon.	
  API	
  users	
  have	
  the	
  same	
  
privileges	
  as	
  they	
  do	
  in	
  the	
  web	
  UI	
  world.	
  
•  Docs	
  &	
  Examples	
  
hZp://cloudera.github.io/cm_api/	
  
hZps://github.com/cloudera/cm_api	
  
•  Java/Python	
  clients	
  
hZp://blog.cloudera.com/blog/2013/05/how-­‐to-­‐
automate-­‐your-­‐hadoop-­‐cluster-­‐from-­‐java/	
  
	
  
	
  

©2013Cloudera,	
  Inc.	
  All	
  Rights	
  Reserved.	
  
Examples	
  of	
  integraHon	
  with	
  CM	
  API	
  
• 

Installa;on	
  &	
  Deployment	
  
• 
• 
• 

Chef	
  
Puppet	
  
Dell	
  Crowbar	
  
• 

• 

hZp://blog.cloudera.com/blog/2013/08/how-­‐to-­‐deploy-­‐hadoop-­‐clusters-­‐automaHcally-­‐with-­‐dell-­‐crowbar-­‐and-­‐cloudera-­‐manager/

StackIQ	
  
• 

	
  

hZp://web.stackiq.com/blog/bid/312064/StackIQ-­‐Cluster-­‐Manager-­‐now-­‐integrated-­‐with-­‐Cloudera	
  

• 

• 

• 

WANdisco	
  –	
  non-­‐stop	
  NN	
  setup	
  

Several	
  other	
  customers/partners	
  leveraging	
  the	
  API’s	
  as	
  part	
  of	
  	
  their	
  install	
  &	
  deployment	
  	
  
process	
  

Monitoring	
  &	
  Aler;ng	
  
• 
• 

Oracle	
  Enterprise	
  Manager	
  (via	
  Big	
  Data	
  Appliance)	
  
Nagios	
  
• 
• 

• 

hZps://github.com/cloudera/cm_api/tree/master/nagios	
  
hZps://github.com/harisekhon/nagios-­‐plugins/blob/master/
check_hadoop_cloudera_manager_metrics.pl	
  

SNMP	
  alerts	
  integraHon	
  with	
  IBM	
  Netcool	
  

Develop	
  &	
  Contribute	
  your	
  plug-­‐in’s	
  using	
  Cloudera	
  
Manager	
  API	
  	
  
5

©2013	
  Cloudera,	
  Inc.	
  All	
  Rights	
  Reserved.	
  
Cloudera	
  Manager	
  –	
  Monitoring	
  via	
  ‘tsquery’	
  
• 

Introduced	
  as	
  part	
  of	
  CM4.5	
  	
  release	
  (Feb	
  2013)	
  

• 

Great	
  way	
  to	
  add	
  interesHng	
  	
  charts	
  (above	
  &	
  beyond	
  what	
  is	
  provided	
  by	
  default)	
  	
  and	
  monitor	
  
metrics	
  that	
  are	
  relevant	
  to	
  your	
  clusters	
  

• 

• 

The	
  tsquery	
  language	
   s	
  used	
  to	
  
	
  Manager	
  Hme-­‐series	
  diata	
  store	
   specify	
  statements	
  for	
  retrieving	
  Hme-­‐series	
  data	
  from	
  the	
  Cloudera	
  
	
  
Example:	
  How	
  do	
  I	
  compare	
  all	
  disk	
  IO	
  for	
  all	
  the	
  DataNodes	
  that	
  belong	
  to	
  a	
  specific	
  HDFS	
  service?	
  
select	
  bytes_read,	
  bytes_wriZen	
  where	
  roleType=DATANODE	
  and	
  serviceName=hdfs1	
  

• 

Retrieved	
  Hme-­‐series	
  data	
  can	
  be	
  ploZed	
  via	
  various	
  opHons	
  –	
  line,	
  bar,	
  scaZer,	
  	
  heat	
  maps,	
  table	
  list	
  
etc.	
  

• 

Extending	
  this	
  concept	
  to	
  create	
  user-­‐defined	
  triggers/alarms	
  (new	
  for	
  C5!).	
  
	
  
More	
  details	
  
•  hZp://www.cloudera.com/content/cloudera-­‐content/cloudera-­‐docs/CM5/latest/Cloudera-­‐
Manager-­‐DiagnosHcs-­‐Guide/cm5dg_chart_Hme_series_data.html	
  

• 

6

©2013	
  Cloudera,	
  Inc.	
  All	
  Rights	
  Reserved.	
  
Examples	
  of	
  Cloudera	
  Manager	
  ‘tsquery’	
  
Example1:	
  How	
  do	
  I	
  track	
  the	
  
	
  
aggregate	
  Cluster	
  Disk	
  IO?	
  
	
  
select	
  dt0(read_bytes_disk_sum),	
  
dt0(write_bytes_disk_sum)	
  where	
  
category	
  =	
  CLUSTER	
  and	
  clusterId	
  =	
  
$CLUSTERID	
  
Example2:	
  How	
  do	
  I	
  compare	
  CPU	
  
usage	
  across	
  hosts?	
  

select	
  dt0(total_cpu_user)	
  /	
  getHostFact(numCores,	
  1)	
  *	
  100,	
  
dt0(total_cpu_system)	
  /	
  getHostFact(numCores,	
  1)	
  *	
  100,	
  
dt0(total_cpu_nice)	
  /	
  getHostFact(numCores,	
  1)	
  *	
  100,	
  
dt0(total_cpu_iowait)	
  /	
  getHostFact(numCores,	
  1)	
  *	
  100,	
  
dt0(total_cpu_irq)	
  /	
  getHostFact(numCores,	
  1)	
  *	
  100,	
  
dt0(total_cpu_so`_irq)	
  /	
  getHostFact(numCores,	
  1)	
  *	
  100	
  

Create	
  &	
  Contribute	
  your	
  ‘tsqueries’!	
  
hZps://github.com/cloudera/cm_charHng_scrapbook	
  
7

©2013	
  Cloudera,	
  Inc.	
  All	
  Rights	
  Reserved.	
  
Cloudera	
  Manager	
  –	
  Service	
  Extensibility	
  
• 

Introduced	
  in	
  C5	
  
• 

SHll	
  in	
  Beta!	
  

• 

Some	
  aspects	
  (espcially	
  Parcel	
  mgmt)	
  available	
  in	
  CM4.x	
  

• 

Example:	
  CollaboraHon	
  with	
  Syncsort	
  to	
  deploy	
  DMX-­‐h	
  libraries	
  

• 

Single	
  management	
  console	
  for	
  CDH,	
  non-­‐CDH	
  services	
  and	
  ISV	
  
applicaHons	
  

• 

Similar	
  look	
  and	
  feel	
  as	
  exisHng	
  services	
  

• 

Easy	
  to	
  write	
  (Java-­‐free!)	
  

• 

Flexible	
  

• 

Independent	
  release	
  cycle	
  
©2013Cloudera, Inc. All Rights Reserved.
Analogy	
  from	
  OperaHng	
  Systems	
  (OS)	
  world	
  
	
  
	
  
ISV’s	
  view	
  of	
  OS	
  	
  

Systems	
  Management	
  
Package	
  
Mgmt	
  

Process/	
  
Resource	
  
Mgmt	
  

Security	
  
Mgmt	
  

Core	
  OS	
  kernel	
  

9

©2013Cloudera,	
  Inc.	
  All	
  Rights	
  Reserved.	
  

Data	
  
Access	
  
Mgmt	
  
Bringing	
  ISV	
  Apps	
  to	
  CDH	
  
	
  
	
  

ISV’s	
  view	
  of	
  Hadoop	
  	
  

Cloudera	
  Manager	
  
Parcels	
  

Resource	
  	
  
Mgmt	
  

Security	
  
Mgmt	
  

CDK	
  API’s	
  

Core	
  Hadoop/CDH	
  kernel	
  

10

©2013Cloudera,	
  Inc.	
  All	
  Rights	
  Reserved.	
  
IntegraHng	
  into	
  the	
  Cloudera	
  Product	
  Porpolio	
  
Features	
  

Examples	
  

-­‐	
  Ability	
  to	
  easily	
  package	
  and	
  distribute	
  binaries/
jars	
  via	
  “Parcels”	
  

-­‐ InformaHca	
  
-­‐ Syncsort	
  

Resource	
  
Mgmt	
  

-­‐	
  Ability	
  to	
  deploy	
  applicaHons	
  as	
  stand-­‐alone	
  
processes	
  	
  or	
  via	
  YARN*	
  on	
  the	
  Hadoop	
  grid	
  
-­‐	
  Resource	
  isolaHon	
  of	
  cluster	
  resources	
  	
  	
  

-­‐ SAS	
  
-­‐ 0xData	
  
-­‐ Accumulo	
  

Security	
  
Mgmt	
  

Cloudera	
  Manager	
  

Descrip;on	
  

Package	
  
Mgmt	
  

ISV’s	
  

-­‐  Support	
  for	
  Kerberos	
  Mgmt	
  
-­‐  Role	
  bases	
  access	
  control	
  for	
  Tables/Views	
  in	
  
Hive/Impala	
  via	
  Sentry	
  

Data	
  Access	
  
Mgmt	
  

-­‐	
  HDFS	
  and	
  HBase	
  API	
  abstracHon	
  and	
  
simplificaHon	
  

Systems	
  Mgmt	
  
Manage	
  
Monitor	
  

Non-­‐CDH	
  Apps…	
  
Accumulo,	
  
Spark,	
  Giraph	
  
etc.	
  

-­‐ Deploy	
  and	
  upgrade	
  (rolling)	
  services	
  and	
  pkgs	
  
-­‐ Manage	
  configuraHons	
  
-­‐ ProacHve	
  health	
  checks	
  
-­‐ Track	
  resource	
  uHlizaHon	
  	
  
-­‐ Custom	
  metrics	
  charts	
  

Diagnose	
  

-­‐ Distributed	
  log	
  collecHon	
  and	
  searching	
  
-­‐ Tag	
  and	
  track	
  key	
  events	
  

Integrate	
  

-­‐ Access	
  operaHonal	
  tools	
  via	
  API	
  
-­‐ Surface	
  overall	
  cluster	
  metrics	
  to	
  ISV	
  dashboard	
  	
  

*	
  Support	
  for	
  YARN	
  planned	
  as	
  part	
  of	
  CM5.x	
  in	
  FY14	
  

11

©2013Cloudera,	
  Inc.	
  All	
  Rights	
  Reserved.	
  
So..	
  How	
  does	
  it	
  work?	
  
	
  
•  A	
  JSON	
  file	
  that	
  describes	
  of	
  your	
  service	
  
•  Set	
  of	
  control	
  scripts	
  
•  Packaged	
  as	
  a	
  JAR	
  file	
  
•  As	
  promised,	
  Java-­‐free	
  

©2013Cloudera, Inc. All Rights Reserved.
Example:	
  Cloudera	
  Manager	
  Extensions	
  -­‐	
  Spark	
  

	
  

	
  

©2013Cloudera, Inc. All Rights Reserved.
 

Cloudera	
  Manager	
  Extensions	
  
	
  

©2013Cloudera, Inc. All Rights Reserved.
Cloudera	
  Manager	
  Extensions:	
  Spark	
  

©2013Cloudera, Inc. All Rights Reserved.
Cloudera	
  Manager	
  Extensions:	
  Spark	
  

©2013Cloudera, Inc. All Rights Reserved.
Cloudera	
  Manager	
  Extensions:	
  Spark	
  

©2013Cloudera, Inc. All Rights Reserved.
The	
  Code	
  
	
  

	
  
#!/bin/bash	
  

name	
  :	
  “spark”,	
  
roles	
  :	
  [{	
  
	
  	
  	
  	
  	
  name	
  :	
  "master",	
  

CMD=$1	
  
MASTER_PORT=<read	
  in	
  from	
  ./params.proper;es>	
  

	
  	
  	
  	
  	
  startRunner	
  :	
  {	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  program	
  :	
  "scripts/control.sh",	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  args	
  :	
  [	
  	
  "start_master",	
  	
  

case	
  $CMD	
  in	
  
	
  	
  (start_master)	
  
	
  	
  exec	
  $SPARK_HOME/scripts/spark-­‐start.sh	
  master"	
  

	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "./params.proper;es"]	
  
	
  	
  	
  	
  	
  	
  	
  },	
  

	
  	
  	
  	
  ;;	
  

	
  	
  	
  	
  	
  	
  	
  parameters	
  :	
  [{	
  

	
  	
  	
  (*)	
  
	
  	
  	
  	
  echo	
  "$;mestamp	
  Don't	
  understand	
  [$CMD]"	
  

	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  name	
  :	
  "master_port",	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  type	
  :	
  "port",	
  

	
  	
  	
  	
  ;;	
  
esac	
  

	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  default	
  :	
  7077	
  
	
  	
  	
  	
  	
  	
  	
  	
  }],	
  
	
  	
  	
  	
  	
  	
  	
  configWriter	
  :	
  {	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  generators	
  :	
  [{	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  filename	
  :	
  "params.proper;es"	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  }]	
  
}]	
  

©2013Cloudera, Inc. All Rights Reserved.
Next	
  Steps	
  
•  DocumentaHon	
  &	
  SDK	
  as	
  part	
  of	
  C5	
  Beta2	
  
or	
  later	
  (definitely	
  before	
  GA!)	
  
•  Working	
  with	
  select	
  ISV’s	
  (SAS,	
  Syncsort,	
  
0xData	
  etc.)	
  as	
  part	
  of	
  Beta	
  to	
  further	
  fine-­‐
tune	
  this	
  feature	
  
	
  

Develop	
  &	
  Contribute	
  your	
  	
  Cloudera	
  Manager	
  service	
  
extensibility	
  plug-­‐in’s	
  !	
  
©2013Cloudera, Inc. All Rights Reserved.
Horizontal Extension

Security
ISV’s

0xData

SAS

Syncsort

Informatica

Revolution

API

Ops Apps
Capacity
Mgr

Service Extensibility

Vertical Extension

Vision	
  of	
  CM	
  Extensibility	
  

SLA Mgr

Cost
Optimizer

CDH

CM
SNMP API

Oracle
OEM

20

Nagios

Dell

Chef/
Puppet

©2012Cloudera,	
  Inc.	
  All	
  Rights	
  Reserved.	
  

Accumulo

Spark

Giraph
Q&A	
  

©2013Cloudera, Inc. All Rights Reserved.

Mais conteúdo relacionado

Mais procurados

Apache Accumulo Overview
Apache Accumulo OverviewApache Accumulo Overview
Apache Accumulo OverviewBill Havanki
 
Why Your Apache Spark Job is Failing
Why Your Apache Spark Job is FailingWhy Your Apache Spark Job is Failing
Why Your Apache Spark Job is FailingCloudera, Inc.
 
Getting Apache Spark Customers to Production
Getting Apache Spark Customers to ProductionGetting Apache Spark Customers to Production
Getting Apache Spark Customers to ProductionCloudera, Inc.
 
Hello OpenStack, Meet Hadoop
Hello OpenStack, Meet HadoopHello OpenStack, Meet Hadoop
Hello OpenStack, Meet HadoopDataWorks Summit
 
Docker based Hadoop provisioning - Hadoop Summit 2014
Docker based Hadoop provisioning - Hadoop Summit 2014 Docker based Hadoop provisioning - Hadoop Summit 2014
Docker based Hadoop provisioning - Hadoop Summit 2014 Janos Matyas
 
The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...
The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...
The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...spinningmatt
 
Savanna: Hadoop on OpenStack
Savanna: Hadoop on OpenStackSavanna: Hadoop on OpenStack
Savanna: Hadoop on OpenStackMirantis
 
DataStax | Deploy DataStax Enterprise Clusters with OpsCenter (LCM) (Manikand...
DataStax | Deploy DataStax Enterprise Clusters with OpsCenter (LCM) (Manikand...DataStax | Deploy DataStax Enterprise Clusters with OpsCenter (LCM) (Manikand...
DataStax | Deploy DataStax Enterprise Clusters with OpsCenter (LCM) (Manikand...DataStax
 
Hadoop Cluster on Docker Containers
Hadoop Cluster on Docker ContainersHadoop Cluster on Docker Containers
Hadoop Cluster on Docker Containerspranav_joshi
 
Hortonworks Setup & Configuration on Azure
Hortonworks Setup & Configuration on AzureHortonworks Setup & Configuration on Azure
Hortonworks Setup & Configuration on AzureAnita Luthra
 
Dev ops for big data cluster management tools
Dev ops for big data  cluster management toolsDev ops for big data  cluster management tools
Dev ops for big data cluster management toolsRan Silberman
 
Resource Management in Impala - StampedeCon 2016
Resource Management in Impala - StampedeCon 2016Resource Management in Impala - StampedeCon 2016
Resource Management in Impala - StampedeCon 2016StampedeCon
 
Sanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticiansSanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticiansPeter Clapham
 
Hadoop and OpenStack - Hadoop Summit San Jose 2014
Hadoop and OpenStack - Hadoop Summit San Jose 2014Hadoop and OpenStack - Hadoop Summit San Jose 2014
Hadoop and OpenStack - Hadoop Summit San Jose 2014spinningmatt
 
分散DB Apache Kuduのアーキテクチャ DBの性能と一貫性を両立させる仕組み 「HybridTime」とは
分散DB Apache KuduのアーキテクチャDBの性能と一貫性を両立させる仕組み「HybridTime」とは分散DB Apache KuduのアーキテクチャDBの性能と一貫性を両立させる仕組み「HybridTime」とは
分散DB Apache Kuduのアーキテクチャ DBの性能と一貫性を両立させる仕組み 「HybridTime」とはCloudera Japan
 
Hadoop on Docker
Hadoop on DockerHadoop on Docker
Hadoop on DockerRakesh Saha
 

Mais procurados (20)

Apache Accumulo Overview
Apache Accumulo OverviewApache Accumulo Overview
Apache Accumulo Overview
 
YARN
YARNYARN
YARN
 
Why Your Apache Spark Job is Failing
Why Your Apache Spark Job is FailingWhy Your Apache Spark Job is Failing
Why Your Apache Spark Job is Failing
 
Getting Apache Spark Customers to Production
Getting Apache Spark Customers to ProductionGetting Apache Spark Customers to Production
Getting Apache Spark Customers to Production
 
Hello OpenStack, Meet Hadoop
Hello OpenStack, Meet HadoopHello OpenStack, Meet Hadoop
Hello OpenStack, Meet Hadoop
 
DR_PRESENT 1
DR_PRESENT 1DR_PRESENT 1
DR_PRESENT 1
 
Docker based Hadoop provisioning - Hadoop Summit 2014
Docker based Hadoop provisioning - Hadoop Summit 2014 Docker based Hadoop provisioning - Hadoop Summit 2014
Docker based Hadoop provisioning - Hadoop Summit 2014
 
Ansible Automation - Enterprise Use Cases | Juncheng Anthony Lin
Ansible Automation - Enterprise Use Cases | Juncheng Anthony LinAnsible Automation - Enterprise Use Cases | Juncheng Anthony Lin
Ansible Automation - Enterprise Use Cases | Juncheng Anthony Lin
 
The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...
The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...
The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...
 
Savanna: Hadoop on OpenStack
Savanna: Hadoop on OpenStackSavanna: Hadoop on OpenStack
Savanna: Hadoop on OpenStack
 
DataStax | Deploy DataStax Enterprise Clusters with OpsCenter (LCM) (Manikand...
DataStax | Deploy DataStax Enterprise Clusters with OpsCenter (LCM) (Manikand...DataStax | Deploy DataStax Enterprise Clusters with OpsCenter (LCM) (Manikand...
DataStax | Deploy DataStax Enterprise Clusters with OpsCenter (LCM) (Manikand...
 
Hadoop Cluster on Docker Containers
Hadoop Cluster on Docker ContainersHadoop Cluster on Docker Containers
Hadoop Cluster on Docker Containers
 
Hadoop and OpenStack
Hadoop and OpenStackHadoop and OpenStack
Hadoop and OpenStack
 
Hortonworks Setup & Configuration on Azure
Hortonworks Setup & Configuration on AzureHortonworks Setup & Configuration on Azure
Hortonworks Setup & Configuration on Azure
 
Dev ops for big data cluster management tools
Dev ops for big data  cluster management toolsDev ops for big data  cluster management tools
Dev ops for big data cluster management tools
 
Resource Management in Impala - StampedeCon 2016
Resource Management in Impala - StampedeCon 2016Resource Management in Impala - StampedeCon 2016
Resource Management in Impala - StampedeCon 2016
 
Sanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticiansSanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticians
 
Hadoop and OpenStack - Hadoop Summit San Jose 2014
Hadoop and OpenStack - Hadoop Summit San Jose 2014Hadoop and OpenStack - Hadoop Summit San Jose 2014
Hadoop and OpenStack - Hadoop Summit San Jose 2014
 
分散DB Apache Kuduのアーキテクチャ DBの性能と一貫性を両立させる仕組み 「HybridTime」とは
分散DB Apache KuduのアーキテクチャDBの性能と一貫性を両立させる仕組み「HybridTime」とは分散DB Apache KuduのアーキテクチャDBの性能と一貫性を両立させる仕組み「HybridTime」とは
分散DB Apache Kuduのアーキテクチャ DBの性能と一貫性を両立させる仕組み 「HybridTime」とは
 
Hadoop on Docker
Hadoop on DockerHadoop on Docker
Hadoop on Docker
 

Destaque

Cloudera hadoop installation
Cloudera hadoop installationCloudera hadoop installation
Cloudera hadoop installationSumitra Pundlik
 
Single node hadoop cluster installation
Single node hadoop cluster installation Single node hadoop cluster installation
Single node hadoop cluster installation Mahantesh Angadi
 
Webinar: Productionizing Hadoop: Lessons Learned - 20101208
Webinar: Productionizing Hadoop: Lessons Learned - 20101208Webinar: Productionizing Hadoop: Lessons Learned - 20101208
Webinar: Productionizing Hadoop: Lessons Learned - 20101208Cloudera, Inc.
 
What the Enterprise Requires - Usability
What the Enterprise Requires - UsabilityWhat the Enterprise Requires - Usability
What the Enterprise Requires - UsabilityCloudera, Inc.
 
AnalyzingMovieData and Business Intelligence
AnalyzingMovieData and Business IntelligenceAnalyzingMovieData and Business Intelligence
AnalyzingMovieData and Business IntelligenceJUNWEI GUAN
 
One Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data MeetupOne Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data MeetupAndrei Savu
 
Unit testing Agile OpenSpace
Unit testing Agile OpenSpaceUnit testing Agile OpenSpace
Unit testing Agile OpenSpaceAndrei Savu
 
[@IndeedEng] Large scale interactive analytics with Imhotep
[@IndeedEng] Large scale interactive analytics with Imhotep[@IndeedEng] Large scale interactive analytics with Imhotep
[@IndeedEng] Large scale interactive analytics with Imhotepindeedeng
 
Directed Acyclic Graph
Directed Acyclic Graph Directed Acyclic Graph
Directed Acyclic Graph AJAL A J
 
Apache Accumulo and Cloudera
Apache Accumulo and ClouderaApache Accumulo and Cloudera
Apache Accumulo and ClouderaJoey Echeverria
 
CDH5最新情報 #cwt2013
CDH5最新情報 #cwt2013CDH5最新情報 #cwt2013
CDH5最新情報 #cwt2013Cloudera Japan
 
Recommendation Engine using Apache Mahout
Recommendation Engine using Apache MahoutRecommendation Engine using Apache Mahout
Recommendation Engine using Apache MahoutAmbarish Hazarnis
 
Introducing Cloudera Director at Big Data Bash
Introducing Cloudera Director at Big Data BashIntroducing Cloudera Director at Big Data Bash
Introducing Cloudera Director at Big Data BashAndrei Savu
 
A beginners guide to Cloudera Hadoop
A beginners guide to Cloudera HadoopA beginners guide to Cloudera Hadoop
A beginners guide to Cloudera HadoopDavid Yahalom
 
Hadoop Operations for Production Systems (Strata NYC)
Hadoop Operations for Production Systems (Strata NYC)Hadoop Operations for Production Systems (Strata NYC)
Hadoop Operations for Production Systems (Strata NYC)Kathleen Ting
 
Cloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera Director: Unlock the Full Potential of Hadoop in the CloudCloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera Director: Unlock the Full Potential of Hadoop in the CloudCloudera, Inc.
 
Samsung’s First 90-Days Building a Next-Generation Analytics Platform
Samsung’s First 90-Days Building a Next-Generation Analytics PlatformSamsung’s First 90-Days Building a Next-Generation Analytics Platform
Samsung’s First 90-Days Building a Next-Generation Analytics PlatformCloudera, Inc.
 
Cloudera Manager 5 (hadoop運用) #cwt2013
Cloudera Manager 5 (hadoop運用)  #cwt2013Cloudera Manager 5 (hadoop運用)  #cwt2013
Cloudera Manager 5 (hadoop運用) #cwt2013Cloudera Japan
 

Destaque (20)

Cloudera hadoop installation
Cloudera hadoop installationCloudera hadoop installation
Cloudera hadoop installation
 
Single node hadoop cluster installation
Single node hadoop cluster installation Single node hadoop cluster installation
Single node hadoop cluster installation
 
Webinar: Productionizing Hadoop: Lessons Learned - 20101208
Webinar: Productionizing Hadoop: Lessons Learned - 20101208Webinar: Productionizing Hadoop: Lessons Learned - 20101208
Webinar: Productionizing Hadoop: Lessons Learned - 20101208
 
What the Enterprise Requires - Usability
What the Enterprise Requires - UsabilityWhat the Enterprise Requires - Usability
What the Enterprise Requires - Usability
 
AnalyzingMovieData and Business Intelligence
AnalyzingMovieData and Business IntelligenceAnalyzingMovieData and Business Intelligence
AnalyzingMovieData and Business Intelligence
 
One Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data MeetupOne Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data Meetup
 
Unit testing Agile OpenSpace
Unit testing Agile OpenSpaceUnit testing Agile OpenSpace
Unit testing Agile OpenSpace
 
[@IndeedEng] Large scale interactive analytics with Imhotep
[@IndeedEng] Large scale interactive analytics with Imhotep[@IndeedEng] Large scale interactive analytics with Imhotep
[@IndeedEng] Large scale interactive analytics with Imhotep
 
Directed Acyclic Graph
Directed Acyclic Graph Directed Acyclic Graph
Directed Acyclic Graph
 
Apache Accumulo and Cloudera
Apache Accumulo and ClouderaApache Accumulo and Cloudera
Apache Accumulo and Cloudera
 
CDH5最新情報 #cwt2013
CDH5最新情報 #cwt2013CDH5最新情報 #cwt2013
CDH5最新情報 #cwt2013
 
Recommendation Engine using Apache Mahout
Recommendation Engine using Apache MahoutRecommendation Engine using Apache Mahout
Recommendation Engine using Apache Mahout
 
YARN High Availability
YARN High AvailabilityYARN High Availability
YARN High Availability
 
Inside Flume
Inside FlumeInside Flume
Inside Flume
 
Introducing Cloudera Director at Big Data Bash
Introducing Cloudera Director at Big Data BashIntroducing Cloudera Director at Big Data Bash
Introducing Cloudera Director at Big Data Bash
 
A beginners guide to Cloudera Hadoop
A beginners guide to Cloudera HadoopA beginners guide to Cloudera Hadoop
A beginners guide to Cloudera Hadoop
 
Hadoop Operations for Production Systems (Strata NYC)
Hadoop Operations for Production Systems (Strata NYC)Hadoop Operations for Production Systems (Strata NYC)
Hadoop Operations for Production Systems (Strata NYC)
 
Cloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera Director: Unlock the Full Potential of Hadoop in the CloudCloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera Director: Unlock the Full Potential of Hadoop in the Cloud
 
Samsung’s First 90-Days Building a Next-Generation Analytics Platform
Samsung’s First 90-Days Building a Next-Generation Analytics PlatformSamsung’s First 90-Days Building a Next-Generation Analytics Platform
Samsung’s First 90-Days Building a Next-Generation Analytics Platform
 
Cloudera Manager 5 (hadoop運用) #cwt2013
Cloudera Manager 5 (hadoop運用)  #cwt2013Cloudera Manager 5 (hadoop運用)  #cwt2013
Cloudera Manager 5 (hadoop運用) #cwt2013
 

Semelhante a Extending and Automating Cloudera Manager via API

Pa cloudera manager-api's_extensibility_v2
Pa   cloudera manager-api's_extensibility_v2Pa   cloudera manager-api's_extensibility_v2
Pa cloudera manager-api's_extensibility_v2ClouderaUserGroups
 
Apache Druid Auto Scale-out/in for Streaming Data Ingestion on Kubernetes
Apache Druid Auto Scale-out/in for Streaming Data Ingestion on KubernetesApache Druid Auto Scale-out/in for Streaming Data Ingestion on Kubernetes
Apache Druid Auto Scale-out/in for Streaming Data Ingestion on KubernetesDataWorks Summit
 
Halifax DevOps - Meet-up - July.19 2017
Halifax DevOps - Meet-up - July.19 2017Halifax DevOps - Meet-up - July.19 2017
Halifax DevOps - Meet-up - July.19 2017Kyle Bassett
 
Cloudera GoDataFest Deploying Cloudera in the Cloud
Cloudera GoDataFest Deploying Cloudera in the CloudCloudera GoDataFest Deploying Cloudera in the Cloud
Cloudera GoDataFest Deploying Cloudera in the CloudGoDataDriven
 
Prometheus and Docker (Docker Galway, November 2015)
Prometheus and Docker (Docker Galway, November 2015)Prometheus and Docker (Docker Galway, November 2015)
Prometheus and Docker (Docker Galway, November 2015)Brian Brazil
 
Best Practices For Workflow
Best Practices For WorkflowBest Practices For Workflow
Best Practices For WorkflowTimothy Spann
 
Reusable, composable, battle-tested Terraform modules
Reusable, composable, battle-tested Terraform modulesReusable, composable, battle-tested Terraform modules
Reusable, composable, battle-tested Terraform modulesYevgeniy Brikman
 
CloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlue
CloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlueCloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlue
CloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlueShapeBlue
 
大数据数据治理及数据安全
大数据数据治理及数据安全大数据数据治理及数据安全
大数据数据治理及数据安全Jianwei Li
 
How Big Data Can Enable Analytics from the Cloud (Technical Workshop)
How Big Data Can Enable Analytics from the Cloud (Technical Workshop)How Big Data Can Enable Analytics from the Cloud (Technical Workshop)
How Big Data Can Enable Analytics from the Cloud (Technical Workshop)Cloudera, Inc.
 
Apache Eagle in Action
Apache Eagle in ActionApache Eagle in Action
Apache Eagle in ActionHao Chen
 
Apache Eagle at Hadoop Summit 2016 San Jose
Apache Eagle at Hadoop Summit 2016 San JoseApache Eagle at Hadoop Summit 2016 San Jose
Apache Eagle at Hadoop Summit 2016 San JoseHao Chen
 
Masterless Puppet Using AWS S3 Buckets and IAM Roles
Masterless Puppet Using AWS S3 Buckets and IAM RolesMasterless Puppet Using AWS S3 Buckets and IAM Roles
Masterless Puppet Using AWS S3 Buckets and IAM RolesMalcolm Duncanson, CISSP
 
Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...
Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...
Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...Databricks
 
What's new in Docker - InfraKit - Docker Meetup Berlin 2016
What's new in Docker - InfraKit - Docker Meetup Berlin 2016What's new in Docker - InfraKit - Docker Meetup Berlin 2016
What's new in Docker - InfraKit - Docker Meetup Berlin 2016Patrick Chanezon
 
Big Data Day LA 2016/ NoSQL track - Apache Kudu: Fast Analytics on Fast Data,...
Big Data Day LA 2016/ NoSQL track - Apache Kudu: Fast Analytics on Fast Data,...Big Data Day LA 2016/ NoSQL track - Apache Kudu: Fast Analytics on Fast Data,...
Big Data Day LA 2016/ NoSQL track - Apache Kudu: Fast Analytics on Fast Data,...Data Con LA
 
Infrastructure Considerations : Design : "webops"
Infrastructure Considerations : Design : "webops"Infrastructure Considerations : Design : "webops"
Infrastructure Considerations : Design : "webops"Piyush Kumar
 

Semelhante a Extending and Automating Cloudera Manager via API (20)

Pa cloudera manager-api's_extensibility_v2
Pa   cloudera manager-api's_extensibility_v2Pa   cloudera manager-api's_extensibility_v2
Pa cloudera manager-api's_extensibility_v2
 
Apache Druid Auto Scale-out/in for Streaming Data Ingestion on Kubernetes
Apache Druid Auto Scale-out/in for Streaming Data Ingestion on KubernetesApache Druid Auto Scale-out/in for Streaming Data Ingestion on Kubernetes
Apache Druid Auto Scale-out/in for Streaming Data Ingestion on Kubernetes
 
Halifax DevOps - Meet-up - July.19 2017
Halifax DevOps - Meet-up - July.19 2017Halifax DevOps - Meet-up - July.19 2017
Halifax DevOps - Meet-up - July.19 2017
 
Cloudera GoDataFest Deploying Cloudera in the Cloud
Cloudera GoDataFest Deploying Cloudera in the CloudCloudera GoDataFest Deploying Cloudera in the Cloud
Cloudera GoDataFest Deploying Cloudera in the Cloud
 
Prometheus and Docker (Docker Galway, November 2015)
Prometheus and Docker (Docker Galway, November 2015)Prometheus and Docker (Docker Galway, November 2015)
Prometheus and Docker (Docker Galway, November 2015)
 
Best Practices For Workflow
Best Practices For WorkflowBest Practices For Workflow
Best Practices For Workflow
 
Reusable, composable, battle-tested Terraform modules
Reusable, composable, battle-tested Terraform modulesReusable, composable, battle-tested Terraform modules
Reusable, composable, battle-tested Terraform modules
 
CloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlue
CloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlueCloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlue
CloudStack Tooling Ecosystem – Kiran Chavala, ShapeBlue
 
大数据数据治理及数据安全
大数据数据治理及数据安全大数据数据治理及数据安全
大数据数据治理及数据安全
 
How Big Data Can Enable Analytics from the Cloud (Technical Workshop)
How Big Data Can Enable Analytics from the Cloud (Technical Workshop)How Big Data Can Enable Analytics from the Cloud (Technical Workshop)
How Big Data Can Enable Analytics from the Cloud (Technical Workshop)
 
Apache Eagle in Action
Apache Eagle in ActionApache Eagle in Action
Apache Eagle in Action
 
Apache Eagle: Secure Hadoop in Real Time
Apache Eagle: Secure Hadoop in Real TimeApache Eagle: Secure Hadoop in Real Time
Apache Eagle: Secure Hadoop in Real Time
 
Apache Eagle at Hadoop Summit 2016 San Jose
Apache Eagle at Hadoop Summit 2016 San JoseApache Eagle at Hadoop Summit 2016 San Jose
Apache Eagle at Hadoop Summit 2016 San Jose
 
Simplified Cluster Operation and Troubleshooting
Simplified Cluster Operation and TroubleshootingSimplified Cluster Operation and Troubleshooting
Simplified Cluster Operation and Troubleshooting
 
Simplified Cluster Operation & Troubleshooting
Simplified Cluster Operation & TroubleshootingSimplified Cluster Operation & Troubleshooting
Simplified Cluster Operation & Troubleshooting
 
Masterless Puppet Using AWS S3 Buckets and IAM Roles
Masterless Puppet Using AWS S3 Buckets and IAM RolesMasterless Puppet Using AWS S3 Buckets and IAM Roles
Masterless Puppet Using AWS S3 Buckets and IAM Roles
 
Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...
Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...
Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...
 
What's new in Docker - InfraKit - Docker Meetup Berlin 2016
What's new in Docker - InfraKit - Docker Meetup Berlin 2016What's new in Docker - InfraKit - Docker Meetup Berlin 2016
What's new in Docker - InfraKit - Docker Meetup Berlin 2016
 
Big Data Day LA 2016/ NoSQL track - Apache Kudu: Fast Analytics on Fast Data,...
Big Data Day LA 2016/ NoSQL track - Apache Kudu: Fast Analytics on Fast Data,...Big Data Day LA 2016/ NoSQL track - Apache Kudu: Fast Analytics on Fast Data,...
Big Data Day LA 2016/ NoSQL track - Apache Kudu: Fast Analytics on Fast Data,...
 
Infrastructure Considerations : Design : "webops"
Infrastructure Considerations : Design : "webops"Infrastructure Considerations : Design : "webops"
Infrastructure Considerations : Design : "webops"
 

Último

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 

Último (20)

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 

Extending and Automating Cloudera Manager via API

  • 1. Cloudera  Manager  –  API’s  &   Extensibility     Patrick  Angeles,  Director  Field  Technical  Services   December  2013     1 CONFIDENTIAL  -­‐  RESTRICTED  
  • 2. Cloudera  Manager   End-­‐to-­‐End  AdministraHon  for  CDH   1 Monitor   2 Diagnose   3 Integrate   4 Manage   Easily  deploy,  configure  &  opHmize  clusters   Maintain  a  central  view  of  all  acHvity   Easily  idenHfy  and  resolve  issues   Use  Cloudera  Manager  with  exisHng  tools   2 ©2013  Cloudera,  Inc.  All  Rights  Reserved.  
  • 3. IntegraHng  with  your  IT  Mgmt  tools   Datacenter  Opera*ons   Various  op*ons  of  integra*ng  Cloudera  Manager  into  your  exis*ng   Installa;on,   Datacenter  Opera*ons/Tools  Monitoring   Deployment   Aler;ng   Tools   tools   Tools   e.g.  Orion,     •  Cloudera  Manager  API   e.g.  Chef,   e.g  Nagios,   Tivoli,  BMC   Puppet  etc.   SNMP  etc.   •  Introduced  in  CM4  (June  2etc.   012)     •  Installa*on  &  deployment   •  Monitoring   •  SNMP  Alerts   •  Introduced  in  CM4.5  (Feb  2013)   •  Hadoop  Opera*ons   And  more…   Cloudera   •  Monitoring  ‘tsquery’  (Feb  2013)   Manager   •  User-­‐defined  triggers/alarms  (new  for  C5!)   •  Service  extensibility  (new  for  C5!)   3 ©2013  Cloudera,  Inc.  All  Rights  Reserved.  
  • 4. Cloudera  Manager  (CM)  API     •  •      4 API  access  was  a  new  feature  introduced  in  Cloudera  Manager  4.0,  providing  programmaHc  access  to   cluster  operaHons  (such  as  configuraHon  and  restart)  and  monitoring  informaHon  (such  as  health  and   metrics).     The  CM  API  is  an  HTTP  REST  API,  using  JSON  serializaHon.  The  API  is  served  on  the  same  host  and  port  as   the  CM  web  UI,  and  does  not  require  an  extra  process  or  extra  configuraHon.  API  users  have  the  same   privileges  as  they  do  in  the  web  UI  world.   •  Docs  &  Examples   hZp://cloudera.github.io/cm_api/   hZps://github.com/cloudera/cm_api   •  Java/Python  clients   hZp://blog.cloudera.com/blog/2013/05/how-­‐to-­‐ automate-­‐your-­‐hadoop-­‐cluster-­‐from-­‐java/       ©2013Cloudera,  Inc.  All  Rights  Reserved.  
  • 5. Examples  of  integraHon  with  CM  API   •  Installa;on  &  Deployment   •  •  •  Chef   Puppet   Dell  Crowbar   •  •  hZp://blog.cloudera.com/blog/2013/08/how-­‐to-­‐deploy-­‐hadoop-­‐clusters-­‐automaHcally-­‐with-­‐dell-­‐crowbar-­‐and-­‐cloudera-­‐manager/ StackIQ   •    hZp://web.stackiq.com/blog/bid/312064/StackIQ-­‐Cluster-­‐Manager-­‐now-­‐integrated-­‐with-­‐Cloudera   •  •  •  WANdisco  –  non-­‐stop  NN  setup   Several  other  customers/partners  leveraging  the  API’s  as  part  of    their  install  &  deployment     process   Monitoring  &  Aler;ng   •  •  Oracle  Enterprise  Manager  (via  Big  Data  Appliance)   Nagios   •  •  •  hZps://github.com/cloudera/cm_api/tree/master/nagios   hZps://github.com/harisekhon/nagios-­‐plugins/blob/master/ check_hadoop_cloudera_manager_metrics.pl   SNMP  alerts  integraHon  with  IBM  Netcool   Develop  &  Contribute  your  plug-­‐in’s  using  Cloudera   Manager  API     5 ©2013  Cloudera,  Inc.  All  Rights  Reserved.  
  • 6. Cloudera  Manager  –  Monitoring  via  ‘tsquery’   •  Introduced  as  part  of  CM4.5    release  (Feb  2013)   •  Great  way  to  add  interesHng    charts  (above  &  beyond  what  is  provided  by  default)    and  monitor   metrics  that  are  relevant  to  your  clusters   •  •  The  tsquery  language   s  used  to    Manager  Hme-­‐series  diata  store   specify  statements  for  retrieving  Hme-­‐series  data  from  the  Cloudera     Example:  How  do  I  compare  all  disk  IO  for  all  the  DataNodes  that  belong  to  a  specific  HDFS  service?   select  bytes_read,  bytes_wriZen  where  roleType=DATANODE  and  serviceName=hdfs1   •  Retrieved  Hme-­‐series  data  can  be  ploZed  via  various  opHons  –  line,  bar,  scaZer,    heat  maps,  table  list   etc.   •  Extending  this  concept  to  create  user-­‐defined  triggers/alarms  (new  for  C5!).     More  details   •  hZp://www.cloudera.com/content/cloudera-­‐content/cloudera-­‐docs/CM5/latest/Cloudera-­‐ Manager-­‐DiagnosHcs-­‐Guide/cm5dg_chart_Hme_series_data.html   •  6 ©2013  Cloudera,  Inc.  All  Rights  Reserved.  
  • 7. Examples  of  Cloudera  Manager  ‘tsquery’   Example1:  How  do  I  track  the     aggregate  Cluster  Disk  IO?     select  dt0(read_bytes_disk_sum),   dt0(write_bytes_disk_sum)  where   category  =  CLUSTER  and  clusterId  =   $CLUSTERID   Example2:  How  do  I  compare  CPU   usage  across  hosts?   select  dt0(total_cpu_user)  /  getHostFact(numCores,  1)  *  100,   dt0(total_cpu_system)  /  getHostFact(numCores,  1)  *  100,   dt0(total_cpu_nice)  /  getHostFact(numCores,  1)  *  100,   dt0(total_cpu_iowait)  /  getHostFact(numCores,  1)  *  100,   dt0(total_cpu_irq)  /  getHostFact(numCores,  1)  *  100,   dt0(total_cpu_so`_irq)  /  getHostFact(numCores,  1)  *  100   Create  &  Contribute  your  ‘tsqueries’!   hZps://github.com/cloudera/cm_charHng_scrapbook   7 ©2013  Cloudera,  Inc.  All  Rights  Reserved.  
  • 8. Cloudera  Manager  –  Service  Extensibility   •  Introduced  in  C5   •  SHll  in  Beta!   •  Some  aspects  (espcially  Parcel  mgmt)  available  in  CM4.x   •  Example:  CollaboraHon  with  Syncsort  to  deploy  DMX-­‐h  libraries   •  Single  management  console  for  CDH,  non-­‐CDH  services  and  ISV   applicaHons   •  Similar  look  and  feel  as  exisHng  services   •  Easy  to  write  (Java-­‐free!)   •  Flexible   •  Independent  release  cycle   ©2013Cloudera, Inc. All Rights Reserved.
  • 9. Analogy  from  OperaHng  Systems  (OS)  world       ISV’s  view  of  OS     Systems  Management   Package   Mgmt   Process/   Resource   Mgmt   Security   Mgmt   Core  OS  kernel   9 ©2013Cloudera,  Inc.  All  Rights  Reserved.   Data   Access   Mgmt  
  • 10. Bringing  ISV  Apps  to  CDH       ISV’s  view  of  Hadoop     Cloudera  Manager   Parcels   Resource     Mgmt   Security   Mgmt   CDK  API’s   Core  Hadoop/CDH  kernel   10 ©2013Cloudera,  Inc.  All  Rights  Reserved.  
  • 11. IntegraHng  into  the  Cloudera  Product  Porpolio   Features   Examples   -­‐  Ability  to  easily  package  and  distribute  binaries/ jars  via  “Parcels”   -­‐ InformaHca   -­‐ Syncsort   Resource   Mgmt   -­‐  Ability  to  deploy  applicaHons  as  stand-­‐alone   processes    or  via  YARN*  on  the  Hadoop  grid   -­‐  Resource  isolaHon  of  cluster  resources       -­‐ SAS   -­‐ 0xData   -­‐ Accumulo   Security   Mgmt   Cloudera  Manager   Descrip;on   Package   Mgmt   ISV’s   -­‐  Support  for  Kerberos  Mgmt   -­‐  Role  bases  access  control  for  Tables/Views  in   Hive/Impala  via  Sentry   Data  Access   Mgmt   -­‐  HDFS  and  HBase  API  abstracHon  and   simplificaHon   Systems  Mgmt   Manage   Monitor   Non-­‐CDH  Apps…   Accumulo,   Spark,  Giraph   etc.   -­‐ Deploy  and  upgrade  (rolling)  services  and  pkgs   -­‐ Manage  configuraHons   -­‐ ProacHve  health  checks   -­‐ Track  resource  uHlizaHon     -­‐ Custom  metrics  charts   Diagnose   -­‐ Distributed  log  collecHon  and  searching   -­‐ Tag  and  track  key  events   Integrate   -­‐ Access  operaHonal  tools  via  API   -­‐ Surface  overall  cluster  metrics  to  ISV  dashboard     *  Support  for  YARN  planned  as  part  of  CM5.x  in  FY14   11 ©2013Cloudera,  Inc.  All  Rights  Reserved.  
  • 12. So..  How  does  it  work?     •  A  JSON  file  that  describes  of  your  service   •  Set  of  control  scripts   •  Packaged  as  a  JAR  file   •  As  promised,  Java-­‐free   ©2013Cloudera, Inc. All Rights Reserved.
  • 13. Example:  Cloudera  Manager  Extensions  -­‐  Spark       ©2013Cloudera, Inc. All Rights Reserved.
  • 14.   Cloudera  Manager  Extensions     ©2013Cloudera, Inc. All Rights Reserved.
  • 15. Cloudera  Manager  Extensions:  Spark   ©2013Cloudera, Inc. All Rights Reserved.
  • 16. Cloudera  Manager  Extensions:  Spark   ©2013Cloudera, Inc. All Rights Reserved.
  • 17. Cloudera  Manager  Extensions:  Spark   ©2013Cloudera, Inc. All Rights Reserved.
  • 18. The  Code       #!/bin/bash   name  :  “spark”,   roles  :  [{            name  :  "master",   CMD=$1   MASTER_PORT=<read  in  from  ./params.proper;es>            startRunner  :  {                    program  :  "scripts/control.sh",                    args  :  [    "start_master",     case  $CMD  in      (start_master)      exec  $SPARK_HOME/scripts/spark-­‐start.sh  master"                                                  "./params.proper;es"]                },          ;;                parameters  :  [{        (*)          echo  "$;mestamp  Don't  understand  [$CMD]"                      name  :  "master_port",                      type  :  "port",          ;;   esac                      default  :  7077                  }],                configWriter  :  {                      generators  :  [{                            filename  :  "params.proper;es"                      }]   }]   ©2013Cloudera, Inc. All Rights Reserved.
  • 19. Next  Steps   •  DocumentaHon  &  SDK  as  part  of  C5  Beta2   or  later  (definitely  before  GA!)   •  Working  with  select  ISV’s  (SAS,  Syncsort,   0xData  etc.)  as  part  of  Beta  to  further  fine-­‐ tune  this  feature     Develop  &  Contribute  your    Cloudera  Manager  service   extensibility  plug-­‐in’s  !   ©2013Cloudera, Inc. All Rights Reserved.
  • 20. Horizontal Extension Security ISV’s 0xData SAS Syncsort Informatica Revolution API Ops Apps Capacity Mgr Service Extensibility Vertical Extension Vision  of  CM  Extensibility   SLA Mgr Cost Optimizer CDH CM SNMP API Oracle OEM 20 Nagios Dell Chef/ Puppet ©2012Cloudera,  Inc.  All  Rights  Reserved.   Accumulo Spark Giraph
  • 21. Q&A   ©2013Cloudera, Inc. All Rights Reserved.