Personal Information
Organização/Local de trabalho
San Francisco Bay Area, QC United States
Cargo
Data scientist at Stitch Fix
Setor
Retail
Sobre
Data paranoid, failed entrepreneur, ex stock trader, father, Canadian in US, Shanghainese.
Programming since 13 (QBasic in DOS on a 386 PC with a 5' floppy disk). Once studied Physics then went to Canada to learn more on business. Built a company then got hit by financial crisis. Got married and moved to US. Moved to Silicon Valley with wife as she got a job there.
Love freedom and enjoy all the randomness in life.
Highest Kaggle rank: 1076th / 300k https://www.kaggle.com/piggybox
http://stackoverflow.com/users/2102764/piggybox
https://github.com/piggybox
Marcadores
database
time-series
functional programming
inventory
spark redshift data-engineering spark-summit
spark
redshift
data quality
data cleansing
machine learning
etl
data munging
data wrangling
Ver mais
Apresentações
(4)Gostaram
(7)Kubernetes on AWS at Zalando: Failures & Learnings - DevOps NRW
Henning Jacobs
•
Há 6 anos
Top 5 Mistakes to Avoid When Writing Apache Spark Applications
Cloudera, Inc.
•
Há 8 anos
(BDT303) Running Spark and Presto on the Netflix Big Data Platform
Amazon Web Services
•
Há 8 anos
Spark shuffle introduction
colorant
•
Há 9 anos
Streaming SQL
Julian Hyde
•
Há 8 anos
Choosing an HDFS data storage format- Avro vs. Parquet and more - StampedeCon 2015
StampedeCon
•
Há 8 anos
Effective testing for spark programs Strata NY 2015
Holden Karau
•
Há 8 anos
Personal Information
Organização/Local de trabalho
San Francisco Bay Area, QC United States
Cargo
Data scientist at Stitch Fix
Setor
Retail
Sobre
Data paranoid, failed entrepreneur, ex stock trader, father, Canadian in US, Shanghainese.
Programming since 13 (QBasic in DOS on a 386 PC with a 5' floppy disk). Once studied Physics then went to Canada to learn more on business. Built a company then got hit by financial crisis. Got married and moved to US. Moved to Silicon Valley with wife as she got a job there.
Love freedom and enjoy all the randomness in life.
Highest Kaggle rank: 1076th / 300k https://www.kaggle.com/piggybox
http://stackoverflow.com/users/2102764/piggybox
https://github.com/piggybox
Marcadores
database
time-series
functional programming
inventory
spark redshift data-engineering spark-summit
spark
redshift
data quality
data cleansing
machine learning
etl
data munging
data wrangling
Ver mais