Our pitch at the Data-Driven NYC meetup on September 17th (http://datadrivennyc.com).
Speaking about data scientists' pain points and how Dataiku Data Science Studio can help them be more than Data Cleaners and Data Leak Fixers!
6. How can we HELP DATA SCIENTISTS to FOCUS on the REAL PROBLEMS?
7. Pain points
• Data preparation is time-consuming
• Machine learning is hard to understand
• Insights and models (almost) never reach production
8. Data Science Studio
• A democratic & ready-to-use Data Science Studio to start innovating with data!
Ready to Use Data Science Platform
Common playground for innovation
Accessible Statistics & Machine Learning for everyone
Handle real-life data
9. Data Science Studio
Visual and Interactive Data Preparation: For Data Cleaners
Guided Machine Learning: For non Machine Learning Experts
Production ready: For Data Leak Fixers
11. Visual Data Preparation
• Interactive UI with instant feedback and suggestions
• Reversibility of the script, data integrity
• Exploration of data: quick analysis, facets
• Cleansing: missing values, outliers, parsing
• Enrichment: GeoIP, Holidays, joins
• Production-ready: integration within a flow
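To make the cleansing and reversibility bullets concrete, here is a minimal sketch in plain Python. It is not Data Science Studio's API: the sample rows, the step functions (`parse_dates`, `fill_missing`, `clip_outliers`), the `run_script` helper and the "3 x median" outlier rule are all hypothetical, chosen only to show a preparation script as an ordered list of steps that never modifies the original data.

```python
from datetime import datetime
from statistics import median

# Hypothetical sample rows; None marks a missing value.
RAW_ROWS = [
    {"date": "2013-09-17", "amount": 10.0},
    {"date": "2013-09-18", "amount": None},
    {"date": "2013-09-19", "amount": 12.0},
    {"date": "2013-09-20", "amount": 900.0},
]


def parse_dates(rows):
    """Parsing: turn ISO date strings into date objects."""
    return [
        {**r, "date": datetime.strptime(r["date"], "%Y-%m-%d").date()}
        for r in rows
    ]


def fill_missing(rows):
    """Missing values: replace None amounts with the median of known amounts."""
    known = [r["amount"] for r in rows if r["amount"] is not None]
    fill = median(known)
    return [
        {**r, "amount": fill if r["amount"] is None else r["amount"]}
        for r in rows
    ]


def clip_outliers(rows, factor=3.0):
    """Outliers: cap amounts above factor x median (a deliberately simple rule)."""
    cap = factor * median(r["amount"] for r in rows)
    return [{**r, "amount": min(r["amount"], cap)} for r in rows]


def run_script(rows, steps):
    """Apply the steps in order; each step returns new rows, so the input stays intact."""
    for step in steps:
        rows = step(rows)
    return rows


# The "script" is just an ordered list of steps.
SCRIPT = [parse_dates, fill_missing, clip_outliers]
CLEAN_ROWS = run_script(RAW_ROWS, SCRIPT)
```

Because every step builds new rows instead of editing the input, undoing a step amounts to dropping it from `SCRIPT` and running the script again on `RAW_ROWS`, which is one simple way to get the reversibility and data integrity the slide mentions.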
14. Data Science Studio: benefits
• Real-time and interactive
– Transformation effects can be previewed in real time
• Transparent and traceable
– Keep the full history of your data transformation logic and model designs
• Easy access to machine learning
– Get started with our app templates, bootstrap your model and feature selection, then go further!
• Scalable and Production Ready
– Apply your recipes on your cluster on terabytes of data
15. Dataiku at a glance
• Founded in 2013 by Data and Search Engine veterans
• From “data” and “haïku”
“data can be big
solution would be small
feel the hot wind”
• 1 goal: make Data Science accessible to anyone!
Contact: marc.batty@dataiku.com - @battymarc - github.com/dataiku