2. Globalcode – Open4education
Introduction
• 25 years working in software development
• TDC 2011 – SOA track
• TDC 2012 – iOS and Analysis 2.0 tracks
• TDC 2013 – Scala track
• TDC 2014 – .NET Architecture, Big Data, and HPC tracks
5. Globalcode – Open4education
Big Data
Big Data is
… like teenage sex:
• everyone talks about it;
• nobody really knows how to do it;
• everyone thinks their friends are doing it;
• everyone says they are doing it.
7. Globalcode – Open4education
Big Data
• In the last two years, we created 90% of all the data available in the
world.
• Today we generate close to 15 petabytes in a single day, which is
equivalent to the sum of every word spoken since the beginning of time.
Source: A nova era da computação (The New Era of Computing)
Freddy Vaquero – VP of Systems and Technology, IBM Brasil
http://www.ibm.com/midmarket/br/pt/articles_nova_era_computacao.html
11. Globalcode – Open4education
Big Data
Cloudera Distributions
Cloudera Enterprise
Designed specifically for mission-critical environments, Cloudera Enterprise
includes CDH, the world’s most popular open source Hadoop-based platform, as
well as advanced system management and data management tools plus dedicated
support and community advocacy from our world-class team of Hadoop developers
and experts. Cloudera is your partner on the path to big data.
Source: http://www.cloudera.com/content/cloudera/en/products-and-services/cloudera-enterprise.html
12. Globalcode – Open4education
Big Data
Cloudera Express
The Best Way to Get Started with Hadoop
Cloudera Express is a free download that combines CDH, Cloudera’s 100% open
source and enterprise-ready distribution of Apache Hadoop with Cloudera Manager,
which provides robust cluster management capabilities like automated deployment,
centralized administration, monitoring, and diagnostic tools.
Source: http://www.cloudera.com/content/cloudera/en/products-and-services/cloudera-express.html
13. Globalcode – Open4education
Big Data
Cloudera Manager
End-to-End Administration for Hadoop
Cloudera Manager is the industry’s first and most sophisticated management
application for Apache Hadoop and the enterprise data hub. Cloudera Manager
sets the standard for enterprise deployment by delivering granular visibility into and
control over every part of the data hub — empowering operators to improve
performance, enhance quality of service, increase compliance and reduce
administrative costs.
Source: http://www.cloudera.com/content/cloudera/en/products-and-services/cloudera-enterprise/cloudera-manager.html
15. Globalcode – Open4education
Big Data
CDH
100% Open Source Distribution including Apache Hadoop
CDH is the world’s most complete, tested, and popular distribution of Apache
Hadoop and related projects. CDH is 100% Apache-licensed open source and is
the only Hadoop solution to offer unified batch processing, interactive SQL, and
interactive search, and role-based access controls. More enterprises have
downloaded CDH than all other such distributions combined.
Source: http://www.cloudera.com/content/cloudera/en/products-and-services/cdh.html
16. Globalcode – Open4education
Big Data
MapR Distributions
M3 Standard Edition
MapR M3 is a complete distribution for Apache™ Hadoop®
MapR M3 Standard Edition is a free and complete distribution for Apache™
Hadoop® that includes Apache HBase™, Apache Pig, Apache Hive, Apache Mahout,
Cascading, Apache Sqoop, Apache Flume, and more. MapR M3 provides the
capabilities for entry-level Hadoop users to develop Big Data applications using
the complete Apache Hadoop stack while providing easy management, seamless
interoperability and high performance.
Source: http://www.mapr.com/sites/default/files/mapr020_datasheet_m3.pdf
17. Globalcode – Open4education
Big Data
M5 Enterprise Edition
MapR M5 is a complete enterprise-grade distribution for Apache™ Hadoop®
MapR M5 includes Apache HBase™, Apache Pig, Apache Hive, Apache Mahout,
Cascading, Apache Sqoop, Apache Flume and more. MapR M5 not only provides
advanced high availability (HA) and data protection features such as Self
Healing HA, JobTracker HA, Snapshots and Mirroring, but also enables
unprecedented Hadoop access and management capabilities through industry
standard interfaces such as NFS and ODBC. MapR M5, available on a
subscription basis, is fully supported for the most demanding mission-critical
deployments.
Source: http://www.mapr.com/sites/default/files/mapr020_datasheet_m5.pdf
18. Globalcode – Open4education
Big Data
M7 Enterprise DataBase Edition
MapR M7 offers all the powerful features of MapR M5 Enterprise Edition, and also
includes Apache projects such as Apache HBase, Apache Pig, Apache Hive™,
Apache Mahout™, Cascading, Apache Sqoop™, Apache Flume™, and more.
Source: http://www.mapr.com/sites/default/files/mapr_datasheet_m7.pdf
19. Globalcode – Open4education
Big Data
MapR Sandbox
The MapR Sandbox for Hadoop provides tutorials, demo applications, and browser-
based user interfaces to let developers and administrators get started quickly with
Hadoop. It is a fully functional Hadoop cluster running in a virtual machine.
You can try our Sandbox now - it is completely free and available as a VMware or
VirtualBox VM.
Source: http://www.mapr.com/products/mapr-sandbox-hadoop
20. Globalcode – Open4education
Big Data
Hortonworks Data Platform
Architected, developed and built completely in the open, Hortonworks Data Platform
(HDP) is designed to meet the changing needs of enterprise data processing.
HDP is fundamentally versatile, providing linear, scalable storage and compute
across a wide range of access methods, from batch and interactive to real time,
search and streaming. It includes a comprehensive set of the essential data
capabilities required by the modern enterprise across governance, integration,
security and operations.
Source: http://br.hortonworks.com/
21. Globalcode – Open4education
Big Data
Hortonworks Data SandBox
The easiest way to get started with Enterprise Hadoop
Sandbox is a personal, portable Hadoop environment that comes with a dozen
interactive Hadoop tutorials. Sandbox includes many of the most exciting
developments from the latest HDP distribution, packaged up in a virtual
environment that you can get up and running in 15 minutes!
Source: http://br.hortonworks.com/products/hortonworks-sandbox/
22. Globalcode – Open4education
Big Data
Main Big Data Players
• IBM – InfoSphere – Cloudera
• Oracle – Oracle Big Data – Cloudera
• EMC – Greenplum – MapR
• Teradata – Hortonworks
• Microsoft – HDInsight – Hortonworks