Personal Information
Organização/Local de trabalho
Hyderabad Area, India India
Cargo
Big Data Engineer
Setor
Technology / Software / Internet
Sobre
I am a passionate Big Data Engineer who loves building large scale real time data processing pipelines.
Current working as Big Data Engineer developing large scale real time data processing pipelines in Microsoft Azure Cloud using Cloudera CDH tools that includes Hadoop, Spark, Kudu, kafka, Flume, HDFS, YARN, Azure SQL Data warehouse, Azure SQL Database etc.
Prior to this, worked on Data Warehouse migration from Oracle to Hadoop ecosystem on Azure using Spark as ETL, Oozie as Scheduler and Hive as target data warehouse with data in Parquet format. Sqoop to load historical data. Responsible for designing and implementing scalable and robust platform.
Prior to that, worked as Data Wareho...
Gostaram
(27)11 Principles of Applied Analytics
Georgian
•
Há 7 anos
Upgrade Without the Headache: Best Practices for Upgrading Hadoop in Production
Cloudera, Inc.
•
Há 9 anos
Fast Data Analytics with Spark and Python
Benjamin Bengfort
•
Há 9 anos
Intro to Spark development
Spark Summit
•
Há 8 anos
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San Jose 2015
Databricks
•
Há 9 anos
Spark rdd part 2
Kiran Krishna
•
Há 7 anos
IBM Spark Meetup - RDD & Spark Basics
Satya Narayan
•
Há 8 anos
Custom Applications with Spark's RDD: Spark Summit East talk by Tejas Patil
Spark Summit
•
Há 7 anos
Top 5 Mistakes to Avoid When Writing Apache Spark Applications
Cloudera, Inc.
•
Há 8 anos
(BDT309) Data Science & Best Practices for Apache Spark on Amazon EMR
Amazon Web Services
•
Há 8 anos
Real time Analytics with Apache Kafka and Apache Spark
Rahul Jain
•
Há 9 anos
Trends for Big Data and Apache Spark in 2017 by Matei Zaharia
Spark Summit
•
Há 7 anos
Deep Dive: Memory Management in Apache Spark
Databricks
•
Há 7 anos
Install Apache Hadoop for Development/Production
IMC Institute
•
Há 7 anos
Apache Spark & Hadoop : Train-the-trainer
IMC Institute
•
Há 7 anos
Apache Spark Architecture
Alexey Grishchenko
•
Há 8 anos
Scala for dummies
Javier Santos Paniego
•
Há 9 anos
Introduction to spark
Javier Santos Paniego
•
Há 8 anos
Distributed computing with spark
Javier Santos Paniego
•
Há 9 anos
Spark after Dark by Chris Fregly of Databricks
Data Con LA
•
Há 9 anos
Spark after Dark by Chris Fregly of Databricks
Data Con LA
•
Há 9 anos
Apache Spark Introduction and Resilient Distributed Dataset basics and deep dive
Sachin Aggarwal
•
Há 8 anos
Not Your Father's Database by Vida Ha
Spark Summit
•
Há 8 anos
IndexedRDD: Efficeint Fine-Grained Updates for RDD's-(Ankur Dave, UC Berkeley)
Spark Summit
•
Há 8 anos
Real-Time Event & Stream Processing on MS Azure
Khalid Salama
•
Há 7 anos
Enterprise Cloud Data Platforms - with Microsoft Azure
Khalid Salama
•
Há 7 anos
Large scale ETL with Hadoop
OReillyStrata
•
Há 11 anos
Personal Information
Organização/Local de trabalho
Hyderabad Area, India India
Cargo
Big Data Engineer
Setor
Technology / Software / Internet
Sobre
I am a passionate Big Data Engineer who loves building large scale real time data processing pipelines.
Current working as Big Data Engineer developing large scale real time data processing pipelines in Microsoft Azure Cloud using Cloudera CDH tools that includes Hadoop, Spark, Kudu, kafka, Flume, HDFS, YARN, Azure SQL Data warehouse, Azure SQL Database etc.
Prior to this, worked on Data Warehouse migration from Oracle to Hadoop ecosystem on Azure using Spark as ETL, Oozie as Scheduler and Hive as target data warehouse with data in Parquet format. Sqoop to load historical data. Responsible for designing and implementing scalable and robust platform.
Prior to that, worked as Data Wareho...