Personal Information
Organização/Local de trabalho
London, United Kingdom United Kingdom
Cargo
Data Science and Big Data
Setor
Technology / Software / Internet
Sobre
Problem Solver. Python/Hadoop Coder. I have done end to end work involving development, administration and Data Science in Big Data.
I have set up Hadoop clusters, built ETL pipelines by writing MapReduce/Spark code and have worked on data science problems. I have used a variety of technologies including Spark, Hive, Pig, HBase, R, etc.
I look at Big Data everyday and use map reduce features of Hadoop to solve big data problems and extract useful information from them. I have done expert work in search quality by analyzing millions of queries searched by users everyday.
Here are some Data Science problems I have worked on solving so far
1) Understand the relationships between users wh...
Marcadores
newbie
pycon
python
programming
pycon2010
Ver mais
Apresentações
(2)Documentos
(1)Gostaram
(24)Netezza Architecture and Administration
Braja Krishna Das
•
Há 7 anos
Netezza Deep Dives
Rush Shah
•
Há 7 anos
Notes from Coursera Deep Learning courses by Andrew Ng
Tess Ferrandez
•
Há 6 anos
Developing Real-Time Data Pipelines with Apache Kafka
Joe Stein
•
Há 8 anos
Scala - The Simple Parts, SFScala presentation
Martin Odersky
•
Há 9 anos
Pragmatic Real-World Scala (short version)
Jonas Bonér
•
Há 15 anos
Scala Data Pipelines @ Spotify
Neville Li
•
Há 8 anos
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Xavier Amatriain
•
Há 9 anos
Hive tuning
Michael Zhang
•
Há 10 anos
Spark SQL Deep Dive @ Melbourne Spark Meetup
Databricks
•
Há 8 anos
Spark Summit East 2015 Advanced Devops Student Slides
Databricks
•
Há 9 anos
DTCC '14 Spark Runtime Internals
Cheng Lian
•
Há 10 anos
Tuning and Debugging in Apache Spark
Databricks
•
Há 9 anos
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San Jose 2015
Databricks
•
Há 9 anos
Why Scala Is Taking Over the Big Data World
Dean Wampler
•
Há 9 anos
storm at twitter
Krishna Gade
•
Há 10 anos
Collaborative Filtering with Spark
Chris Johnson
•
Há 9 anos
DataFu @ ApacheCon 2014
William Vaughan
•
Há 10 anos
Tiny Batches, in the wine: Shiny New Bits in Spark Streaming
Paco Nathan
•
Há 9 anos
Hadoop World 2011: Advanced HBase Schema Design - Lars George, Cloudera
Cloudera, Inc.
•
Há 12 anos
HBase schema design Big Data TechCon Boston
amansk
•
Há 11 anos
HBaseCon 2012 | HBase Schema Design - Ian Varley, Salesforce
Cloudera, Inc.
•
Há 11 anos
The 21 Coolest Internet Of Things Gadgets
Bernard Marr
•
Há 9 anos
Personal Information
Organização/Local de trabalho
London, United Kingdom United Kingdom
Cargo
Data Science and Big Data
Setor
Technology / Software / Internet
Sobre
Problem Solver. Python/Hadoop Coder. I have done end to end work involving development, administration and Data Science in Big Data.
I have set up Hadoop clusters, built ETL pipelines by writing MapReduce/Spark code and have worked on data science problems. I have used a variety of technologies including Spark, Hive, Pig, HBase, R, etc.
I look at Big Data everyday and use map reduce features of Hadoop to solve big data problems and extract useful information from them. I have done expert work in search quality by analyzing millions of queries searched by users everyday.
Here are some Data Science problems I have worked on solving so far
1) Understand the relationships between users wh...
Marcadores
newbie
pycon
python
programming
pycon2010
Ver mais