Personal Information
Organization / Workplace
Sebastopol, CA United States
Job title
Evil Mad Scientist
Industry
Technology / Software / Internet
Website
derwen.ai/paco
About
Known as a "player/coach", with core expertise in data science, natural language processing, machine learning, cloud computing; 35+ years tech industry experience, ranging from Bell Labs to early-stage start-ups. Co-chair Rev. Advisor for Amplify Partners, Deep Learning Analytics, Primer, Data Spartan, Recognai. Recent roles: Director, Learning Group @ O'Reilly Media; Director, Community Evangelism @ Databricks and Apache Spark. Cited in 2015 as one of the Top 30 People in Big Data and Analytics by Innovation Enterprise.
Tags
big data
data science
machine learning
hadoop
cascading
spark
mesos
scalding
cascalog
nlp
python
jupyter
scala
use cases
enterprise data workflows
ai
textrank
streaming
twitter
cluster computing
open data
pmml
aws
cloud computing
text analytics
r
active learning
graph algorithms
approximation algorithms
case studies
ipython notebook
functional programming
management
human-in-the-loop
learning
docker
mesosphere
clojure
o'reilly media
publishing
real-time analytics
sql
knime
advanced math
distributed systems
google
predictive modeling
java
disambiguation
ontology
open source
scikit-learn
chicago
history
apache hadoop
analytics
networkx
datasketch
spacy
deep learning
content discovery
media
video
computable content
inverted classroom
education
graphx
community
certification
mooc
graph queries
abstract algebra
datacenter computing
marathon
linux
low latency
graph theory
airbnb
linux containers
isolation
borg
mathematics
statistics
portland
sas
ansi sql
palo alto
mapreduce
algorithms
enterprise
redis
gephi
business strategy
social media
knowledge graph
search
learning experiences
nike
nginx
kaltura
best practices
literate programming
summarization
standards
pfa
accountability
governance
avro
recommender systems
social context
kubernetes
learning curve
continuous learning
computational thinking
philosophy
parquet
thebe
json
oscon
notebooks
brazil
sao paulo
qcon
iot
paco nathan
pagerank
probabilistic data structures
system architecture
business
stanford
functio
cluster scheduling
quasar
probabilistic programming
chronos
cgroups
omega
mbrace
augustus
julia
mlbase
summingbird
titan
genetic programming
metascale
sears
chug
virtualization
university of chicago
ensembles
kdd
hadoop summit
windows azure
texas
pattern language
predictive models
optimization
tdd
optiq
application layer
enterprise architecture
splunk
bigdata
tf-idf
data analysis
pentaho
imvu
continuous deployment
emr
enron
infochimps
datameer
Presentations (73)
Likes (118)
Building an open metadata and governance ecosystem
Connected Data World
•
4 years ago
When Privacy Scales - Intelligent Product Design under GDPR
Amanda Casari
•
4 years ago
Data science apps powered by Jupyter Notebooks
Natalino Busa
•
5 years ago
Learning to learn Model Behavior: How to use "human-in-the-loop" to explain decisions.
IDEAS - Int'l Data Engineering and Science Association
•
5 years ago
Data Science with Human in the Loop @Faculty of Science #Leiden University
Lora Aroyo
•
5 years ago
Making fashion recommendations with human-in-the-loop machine learning
Brad Klingenberg
•
6 years ago
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud Prevention
DataWorks Summit
•
5 years ago
Active Learning and Human-in-the-Loop
CrowdFlower
•
6 years ago
Managing and Versioning Machine Learning Models in Python
Simon Frid
•
6 years ago
WTF - Why the Future Is Up to Us - pptx version
Tim O'Reilly
•
6 years ago
Container Ship - How to reduce effect on Climate and Pollution
Glenn Klith Andersen
•
13 years ago
SKIL - Dl4j in the wild meetup
Adam Gibson
•
6 years ago
Anomaly Detection in Deep Learning (Updated)
Adam Gibson
•
6 years ago
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production Environments
Robert Grossman
•
6 years ago
Non-exhaustive, Overlapping K-means
David Gleich
•
7 years ago
Dimensionality Reduction of Genomic Variation with Big Data Genomics ADAM & Spark MLLib/ML & SparkR
Deborah Siegel
•
7 years ago
Lecture 1 introduction To The Course: The Flipped Classroom
Marina Santini
•
8 years ago
Up and Running with Twitter Bootstrap: Refresh Boston, January 2013
Jen Kramer
•
10 years ago
Designing Reactive Systems with Akka
Thomas Lockney
•
7 years ago
Sparkling pandas Letting Pandas Roam - PyData Seattle 2015
Holden Karau
•
7 years ago
How to Hire Data Scientists
Galvanize
•
7 years ago
Spark Meetup @ Netflix, 05/19/2015
Yves Raimond
•
7 years ago
Distributed machine learning 101 using apache spark from the browser
Andy Petrella
•
7 years ago
Spark Summit 2015 Highlights in Tweets
Gerard Maas
•
7 years ago
How to use Parquet as a basis for ETL and analytics (Hadoop Summit San Jose 2015)
Julien Le Dem
•
7 years ago
Hadoop Summit 2015: Performance Optimization at Scale, Lessons Learned at Twitter (Alex Levenson)
Alex Levenson
•
7 years ago
Lambda Architecture with Spark Streaming, Kafka, Cassandra, Akka, Scala
Helena Edelson
•
7 years ago
Apache spark meetup
Israel Gaytan
•
7 years ago
Scala Days San Francisco
Martin Odersky
•
7 years ago
Building and Deploying Application to Apache Mesos
Joe Stein
•
7 years ago
Big data apache spark + scala
Juantomás García Molina
•
7 years ago
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San Jose 2015
Databricks
•
7 years ago
Introducing DataFrames in Spark for Large Scale Data Science
Databricks
•
7 years ago
Why Spark?
Álvaro Agea Herradón
•
8 years ago
7+1 myths of the new os
Alexis Richardson
•
8 years ago
Introduction to Spark
Li Ming Tsai
•
8 years ago
Realtime Data Analysis Patterns
Mikio L. Braun
•
8 years ago
Spark Streaming with Cassandra
Jacek Lewandowski
•
8 years ago
Monoids, Store, and Dependency Injection - Abstractions for Spark Streaming Jobs
Ryan Weald
•
9 years ago
Paris Data Geek - Spark Streaming
Djamel Zouaoui
•
8 years ago
R, Data Wrangling & Kaggle Data Science Competitions
Krishna Sankar
•
8 years ago
Introduccion a Apache Spark
Gustavo Arjones
•
8 years ago
Apache Spark Briefing
Thomas W. Dinsmore
•
9 years ago
Doing-the-impossible
Ted Dunning
•
8 years ago
The Hitchhiker's Guide to Machine Learning with Python & Apache Spark
Krishna Sankar
•
8 years ago
Kinesis and Spark Streaming - Advanced AWS Meetup - August 2014
Chris Fregly
•
8 years ago
Managing Cassandra at Scale by Al Tobey
DataStax Academy
•
8 years ago
Technological Revolutions and Cultural Revolutions: OSCON 2014
Tim O'Reilly
•
8 years ago
A Survey of Probabilistic Data Structures - StampedeCon 2012
StampedeCon
•
10 years ago
Introduction to Apache Mesos
Joe Stein
•
8 years ago
Anti-differentiating approximation algorithms: A case study with min-cuts, spectral, and flow
David Gleich
•
8 years ago
Recent Developments in Spark MLlib and Beyond
Xiangrui Meng
•
8 years ago
R + Storm Moneyball - Realtime Advanced Statistics - Hadoop Summit - San Jose
Allen Day, PhD
•
8 years ago
Cascading User Group Meet
Vinoth Kannan
•
8 years ago
HBase Data Types
Nick Dimiduk
•
8 years ago
Large-Scale Machine Learning with Apache Spark
DB Tsai
•
8 years ago
Building Data Science Teams, Abbreviated
Allen Day, PhD
•
8 years ago
Let Spark Fly: Advantages and Use Cases for Spark on Hadoop
MapR Technologies
•
8 years ago
Spark at Twitter - Seattle Spark Meetup, April 2014
Sriram Krishnan
•
8 years ago
Genomics Crash Course for Data Engineers
Allen Day, PhD
•
8 years ago
Productionalizing Spark Streaming
Ryan Weald
•
9 years ago
Possible Visions for Mahout 1.0
Ted Dunning
•
8 years ago
Whitepaper: Agricultural Systems + Data Outlook 2Q14
The Data Guild
•
8 years ago
LA HUG - Ted Dunning 2012-09-25
MapR Technologies
•
10 years ago
NextGen BigData Workloads in NextGen Sequencing - 20140402 - Phoenix - TGEN
Allen Day, PhD
•
8 years ago
Data Science Folk Knowledge
Krishna Sankar
•
8 years ago
Introduction to Apache Mesos
tomasbart
•
8 years ago
Data Wrangling For Kaggle Data Science Competitions
Krishna Sankar
•
8 years ago
Reactive Reatime Big Data with Open Source Lambda Architecture - TechCampVN 2014
Trieu Nguyen
•
8 years ago
Got Chaos? Extracting Business Intelligence from Email with Natural Language Processing and Dynamic Graph Analysis
Digital Reasoning
•
9 years ago
Micro Servers in Big Data
Aater Suleman
•
9 years ago
Fast matrix primitives for ranking, link-prediction and more
David Gleich
•
9 years ago
Adversarial Analytics - 2013 Strata & Hadoop World Talk
Robert Grossman
•
9 years ago
Evolution of The Twitter Stack
Chris Aniszczyk
•
9 years ago
Semantically coherent functional linear data structures
Jack Fox
•
9 years ago
Mesos
Anis Nasir
•
10 years ago
SQL Now! How Optiq brings the best of SQL to NoSQL data.
Julian Hyde
•
9 years ago
Personalized PageRank based community detection
David Gleich
•
9 years ago
Why Docker
dotCloud
•
9 years ago
Hadoop on-mesos
Henry Cai 蔡明航
•
9 years ago
Functional linear data structures in f#
Jack Fox
•
9 years ago
Data Science with Hadoop - A primer
Ofer Mendelevitch
•
9 years ago
HUG August 2010: Mesos
Hadoop User Group
•
12 years ago
PRISM seed-stage Investor Deck
David Coallier
•
9 years ago
A dynamical system for PageRank with time-dependent teleportation
David Gleich
•
9 years ago
Agile analytics applications on hadoop
Russell Jurney
•
9 years ago
Skills, Reputation, and Search
Peter Skomoroch
•
9 years ago
Sparse matrix computations in MapReduce
David Gleich
•
9 years ago
Functional programming for optimization problems in Big Data
Paco Nathan
•
9 years ago
Visualize Big Graph Data
Mathieu Bastian
•
9 years ago
Data Day Texas 2013
Matthias Broecheler
•
9 years ago
Why clojure
Thomas Goossens
•
9 years ago
Incorporating Regularity into Models of Noncontractual Customer-Firm Relationships
Michael Platzer
•
13 years ago
Netflix and Open Source
Adrian Cockcroft
•
9 years ago
Microlearning: a strategy for ongoing professional development
eLearning Papers
•
12 years ago
LinkedIn Data Products
Vitaly Gordon
•
9 years ago
Drill / SQL / Optiq
Julian Hyde
•
9 years ago
Scalding
Mario Pastorelli
•
10 years ago
Scalable and Flexible Machine Learning With Scala @ LinkedIn
Vitaly Gordon
•
9 years ago
Enterprise Data Workflows with Cascading
Paco Nathan
•
10 years ago
Optiq: a SQL front-end for everything
Julian Hyde
•
10 years ago
Ember.js for SFHTML5
Anthony Bull
•
10 years ago
Scalding: Twitter's Scala DSL for Hadoop/Cascading
johnynek
•
10 years ago
Cultural Algorithm - Genetic Algorithms - Related Techniques
Daniel Condurachi
•
14 years ago
An agile approach to knowledge discovery on web log data
Paul Lam
•
10 years ago
Couchbase Performance Benchmarking
Renat Khasanshyn
•
10 years ago
Digitizing the World
DataWorks Summit
•
10 years ago
Cascading
nathanmarz
•
12 years ago
10+ Deploys Per Day: Dev and Ops Cooperation at Flickr
John Allspaw
•
13 years ago
Gephi Plugin Developer Workshop
Gephi Consortium
•
11 years ago
Building Distributed Systems With Riak and Riak Core
Andy Gross
•
12 years ago
AWS Customer Presentation - Headcase Humanufacturing
Amazon Web Services
•
15 years ago