Data Science and Goodhart's Law

Domino Data Lab

Kyle Polich, Principal Data Scientist at DataScience.com, presented these slides in his talk at Data Science Pop Up LA on September 14, 2016.

Data Science and
Goodhart’s Law
Kyle Polich
Data Science, Inc.

Goodhart’s Law
When a measure becomes a target, it ceases to be a good measure

Sales Rep Compensation Example
• Base pay + variable commission
• For monthly sales under 50k, commission = 3%
• For monthly sales of 50-99k, commission = 5%
• For monthly sales of 100k+, commission = 7%
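
A minimal sketch of why these tiers invite gaming. It assumes the tier rate applies to the whole month's total (the slide does not say whether rates are marginal) and that anything below 100k falls in the 5% tier. The function name `commission` and the sample figures are made up for illustration.

```python
def commission(monthly_sales: float) -> float:
    """Commission for one month, assuming the tier rate applies to all sales."""
    if monthly_sales < 50_000:
        rate_percent = 3
    elif monthly_sales < 100_000:
        rate_percent = 5
    else:
        rate_percent = 7
    return monthly_sales * rate_percent / 100


# The same 120k of total sales, booked two different ways (hypothetical figures).
even_split = commission(60_000) + commission(60_000)
shifted = commission(20_000) + commission(100_000)

print(f"even split: {even_split:.0f}, shifted into one month: {shifted:.0f}")
```

Under these assumptions a rep earns more by pushing deals into a single month, even though total sales are unchanged. The measure (monthly sales) has become a target.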

Some Examples
• Spam filtering arms race
• Search engine ranking
• Clearing cookies to get better airline prices
• Keep account open to manipulate FICO score
• Retail discounting/couponing strategies
• Bidding in AdTech marketplaces

Measuring with Cross Validation
Cross Validation
• You should be doing this anyway!
• Set production performance expectation
• Measure post deployment
• Total deviation =
deviation due to overfit
+ deviation due to incomplete training
+ deviation due to Goodhart’s Law
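
The comparison above can be sketched as follows. This is an illustration only: the "model" is a trivial mean predictor, the data are invented, and the helper names (`fit_mean`, `mae`, `cross_val_mae`) are hypothetical. It measures the total deviation; it does not split it into the three terms.

```python
from statistics import mean


def fit_mean(ys):
    """'Train' a trivial model: always predict the training mean."""
    return mean(ys)


def mae(prediction, ys):
    """Mean absolute error of a constant prediction."""
    return mean(abs(y - prediction) for y in ys)


def cross_val_mae(ys, k=5):
    """Average held-out MAE over k folds (fold i holds out every k-th item)."""
    scores = []
    for i in range(k):
        held_out = ys[i::k]
        training = [y for j, y in enumerate(ys) if j % k != i]
        scores.append(mae(fit_mean(training), held_out))
    return mean(scores)


# Hypothetical data: history used for training, then post-deployment observations.
history = [10, 12, 11, 13, 12, 10, 11, 13, 12, 11]
post_deploy = [15, 16, 14, 17, 15]

expected_mae = cross_val_mae(history)               # expectation set before release
observed_mae = mae(fit_mean(history), post_deploy)  # measured after release
total_deviation = observed_mae - expected_mae

print(f"expected {expected_mae:.3f}, observed {observed_mae:.3f}, gap {total_deviation:.3f}")
```

Attributing the gap to overfit, incomplete training, or Goodhart's Law takes further experiments; the number alone only says that production differs from the expectation.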

Measuring via Homogeneity Assumption
Can you train a model to accurately
predict the date at which the observation was created?
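
One way to try this, sketched under simplifying assumptions: instead of predicting the exact date, the code predicts whether an observation comes from the earlier or later half of the data, using a nearest-centroid rule on a single feature. The data are synthetic and the function name `date_predictability` is hypothetical.

```python
import random
from statistics import mean


def date_predictability(values):
    """Accuracy of guessing 'early vs. late' from the feature value alone.

    `values` must be ordered by creation time. Even-indexed rows train a
    nearest-centroid classifier; odd-indexed rows are held out for scoring.
    """
    half = len(values) // 2
    labeled = [(v, i >= half) for i, v in enumerate(values)]  # True = late
    train, test = labeled[0::2], labeled[1::2]
    early_center = mean(v for v, late in train if not late)
    late_center = mean(v for v, late in train if late)
    hits = sum(
        (abs(v - late_center) < abs(v - early_center)) == late
        for v, late in test
    )
    return hits / len(test)


rng = random.Random(0)  # fixed seed so the sketch is repeatable
stable = [rng.gauss(0, 1) for _ in range(2000)]
shifted = [rng.gauss(0, 1) for _ in range(1000)] + [rng.gauss(3, 1) for _ in range(1000)]

stable_acc = date_predictability(stable)    # homogeneous data: close to chance
shifted_acc = date_predictability(shifted)  # changed data: well above chance

print(f"stable: {stable_acc:.2f}, shifted: {shifted_acc:.2f}")
```

If held-out accuracy sits near 50%, the homogeneity assumption looks plausible for that feature. Accuracy well above chance means the observations carry a time signature, so something changed.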

Measuring Drift

Measuring Drift
Typical failure from a web application release

Measuring Drift
Possible failure from a web application release
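
The slides show drift as plots rather than as a method. As one possible way to put a number on it, the sketch below computes a two-sample Kolmogorov-Smirnov statistic between a window before a release and a window after it. The choice of statistic, the function name `ks_statistic`, and the data are assumptions, not taken from the talk.

```python
def ks_statistic(reference, recent):
    """Largest gap between the two empirical CDFs (0 = identical, 1 = disjoint)."""
    largest_gap = 0.0
    for x in sorted(set(reference) | set(recent)):
        ref_cdf = sum(v <= x for v in reference) / len(reference)
        rec_cdf = sum(v <= x for v in recent) / len(recent)
        largest_gap = max(largest_gap, abs(ref_cdf - rec_cdf))
    return largest_gap


# Hypothetical daily values of one model input, before and after a release.
before_release = [0.9, 1.0, 1.1, 1.0, 0.95, 1.05, 1.0, 0.98]
after_release = [1.6, 1.7, 1.5, 1.65, 1.55, 1.7, 1.6, 1.58]

drift = ks_statistic(before_release, after_release)
no_drift = ks_statistic(before_release, before_release)

print(f"after release: {drift:.2f}, same window: {no_drift:.2f}")
```

A sudden step like this matches an obvious release failure. A slow slide in the statistic over many windows is harder to notice, so it helps to track the value over time rather than check it once.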

Dealing with it
• Detection is key
• Experimentation is required
• Agile methods for model deployment

Causal Impact
• An approach to estimating
the causal effect of a
designed intervention on a
time series.
• Predicts counterfactual
(how response likely would
have evolved absent the
intervention)
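
A minimal sketch of the counterfactual idea. Tools for this, such as the CausalImpact R package, fit Bayesian structural time-series models. The code below swaps in a plain least-squares fit against a control series that the intervention is assumed not to affect. The series and the names `fit_line` and `estimated_effect` are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def estimated_effect(control, response, intervention_index):
    """Mean gap between observed and counterfactual response after the intervention."""
    slope, intercept = fit_line(
        control[:intervention_index], response[:intervention_index]
    )
    gaps = [
        y - (slope * x + intercept)  # observed minus predicted counterfactual
        for x, y in zip(control[intervention_index:], response[intervention_index:])
    ]
    return sum(gaps) / len(gaps)


# Hypothetical series: response follows 2 * control + 1, then an intervention
# at index 6 adds 5 to every later value.
control = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
response = [2 * x + 1 for x in control[:6]] + [2 * x + 1 + 5 for x in control[6:]]

effect = estimated_effect(control, response, intervention_index=6)
print(f"estimated effect: {effect:.2f}")
```

The pre-intervention fit stands in for "how the response likely would have evolved". The estimate is only as good as the assumption that the control series was untouched by the intervention.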

Self-Fulfilling Prophecies
• Beware!
• Case study: lead qualification
– Try to predict leads that will close
– Relearn the bias of your training

Fast Iterations
• Outside the normal software lifecycle (SWLC) release cycle
– State updates
– Parameter tuning
• Run experiments

Explanatory power
• Goodhart’s law will often manifest on only a
subset of (possibly significant) instances.
• Model interpretability for affected instances is
key

Interpretable Models

Interpretable Models

Model Interpretability
“Why Should I Trust You?”: Explaining the Predictions of Any Classifier
Ribeiro, Singh, Guestrin
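
The paper cited on this slide introduces LIME, which explains a single prediction by fitting a simple model to the classifier's behavior near that instance. The sketch below is a heavily simplified illustration of that idea, not the LIME library: it perturbs the instance with Gaussian noise, weights samples by proximity, and fits a weighted linear surrogate. `black_box`, `local_linear_explanation`, and all parameter values are hypothetical.

```python
import math
import random


def solve(a, b):
    """Solve the linear system a @ x = b by Gauss-Jordan elimination with pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def local_linear_explanation(predict, instance, n_samples=500, scale=0.5, seed=0):
    """Fit a proximity-weighted linear surrogate to `predict` around `instance`.

    Returns [intercept, weight_1, ..., weight_d]; the weights are the local explanation.
    """
    rng = random.Random(seed)
    d = len(instance)
    xtwx = [[0.0] * (d + 1) for _ in range(d + 1)]
    xtwy = [0.0] * (d + 1)
    for _ in range(n_samples):
        point = [v + rng.gauss(0, scale) for v in instance]
        dist_sq = sum((p - v) ** 2 for p, v in zip(point, instance))
        weight = math.exp(-dist_sq / (2 * scale ** 2))  # closer samples count more
        row = [1.0] + point
        target = predict(point)
        for i in range(d + 1):
            xtwy[i] += weight * row[i] * target
            for j in range(d + 1):
                xtwx[i][j] += weight * row[i] * row[j]
    return solve(xtwx, xtwy)  # weighted least squares via the normal equations


def black_box(x):
    """Stand-in for an opaque model (hypothetical)."""
    return 3 * x[0] - 2 * x[1] + 1


intercept, w0, w1 = local_linear_explanation(black_box, [1.0, 2.0])
print(f"local weights: {w0:.3f}, {w1:.3f}")
```

Because the stand-in model is itself linear, the surrogate should recover its coefficients almost exactly. For a real classifier the weights describe behavior only near the chosen instance, which is what matters when Goodhart's Law affects a subset of cases.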

Summary
• Goodhart’s law: When a measure becomes a target, it ceases to be
a good measure
• As a data scientist, if your work is meaningful, you will encounter it
• Try to measure it in the data
• Work on explanatory models to mitigate
• Don’t let the average case blind you

DataScience
facebook.com/datascience
@DataSkeptic
@datascienceinc
linkedin.com/company/datascience-inc
(310) 579 - 6200

More Related Content

What's hot

Ideas are great - no doubt - but what do you do once you have an entire backlog of ideas? Prioritization is a critical part when building an efficient, impactful testing program. Creating a framework to make smarter choices and thinking deeply about the key factors will make your program successful in the long term. In this session we will talk about how to create a strong prioritization process, how to keep it running, and how to constantly benchmark and optimize your process.

[CXL Live 16] What to Test Next - Prioritizing Your Tests by Pauline Marol

[CXL Live 16] What to Test Next - Prioritizing Your Tests by Pauline Marol

[CXL Live 16] What to Test Next - Prioritizing Your Tests by Pauline Marol

[CXL Live 16] The Grand Unified Theory of Conversion Optimization by John Ekman

[CXL Live 16] The Grand Unified Theory of Conversion Optimization by John Ekman

[CXL Live 16] The Grand Unified Theory of Conversion Optimization by John Ekman

Be A Great Product Leader (Slack 2017)

Be A Great Product Leader (Slack 2017)

Be A Great Product Leader (Slack 2017)

Using data to create intrinsic motivation and a growth mindset

Using data to create intrinsic motivation and a growth mindset

Using data to create intrinsic motivation and a growth mindset

Vendasta Technologies

[CXL Live 16] "Best Practices" or "Common Practices" - Which Is It? by Justin...

[CXL Live 16] "Best Practices" or "Common Practices" - Which Is It? by Justin...

[CXL Live 16] "Best Practices" or "Common Practices" - Which Is It? by Justin...

How to Integrate Customer Discovery

How to Integrate Customer Discovery

How to Integrate Customer Discovery

Deep Customer Research...The Heart Of Innovation - Richard Young and Diana Ad...

Deep Customer Research...The Heart Of Innovation - Richard Young and Diana Ad...

Deep Customer Research...The Heart Of Innovation - Richard Young and Diana Ad...

The Business of Execution (Infographic)

The Business of Execution (Infographic)

The Business of Execution (Infographic)

Nicolas Visiers - User Experience Testing

Nicolas Visiers - User Experience Testing

Nicolas Visiers - User Experience Testing

There’s a thing called “time to value” – it’s how long it takes a team to uncover and actualise value from a product. It’s a hard problem for most software products, because they aren’t architected and designed to solve the “time to value” problem. It’s usually an afterthought. Building onboarding experiences that may or may not improve the customer experience can be both costly and time consuming, especially in enterprise software solutions – so how do you know that what you build will really add value? Data, research or just building things in silos won’t solve the problem. Often too much data or research can make things worse by paralysing teams into inaction, or worse they just start building something, anything without understanding the impact it will have to the experience. Working with large scale enterprise products with millions of customers, and navigating through long roadmaps can be a tough place to try and build fast growth into a product. It is hard to apply startup thinking when you need to care and value the experience that millions of customers have with your software each and everyday. But in order to survive and continually grow, you need to find a way. Atlassian approached and solved this problem by leveraging a combination of growth hacking, user research, data analytics and A/B testing at scale to dramatically increase customer engagement with our products. I’ll describe the variety of approaches we started with and how we learned which ones to pursue and which ones to discard. The design and growth hacking teams worked together to pull off some pretty amazingly fast ways to modify and test variations of an enterprise product experience — without interfering with the product team. Finally, I’ll show how to design and centralise improved onboarding experiences that can be scaled across all of your products.

Data informed design - UX Australia august 2015

Data informed design - UX Australia august 2015

Data informed design - UX Australia august 2015

Alastair Simpson

[Elite Camp 2016] Craig Sullivan - Elite Camp Summary Session

[Elite Camp 2016] Craig Sullivan - Elite Camp Summary Session

[Elite Camp 2016] Craig Sullivan - Elite Camp Summary Session

Quantitative measurement is the key to scaling businesses, processes, and products and making them better. It sounds easy: just pick a number, and improve it. It's not. Choosing a metric is an exploration of a many-dimensional space with no map and no guide. Until now. This talk will teach you the science of choosing metrics that will guide you to building a better product. When you attend this webinar, you will: Learn how applying data is as much about questions as it is about answers Discover the trade-offs between different metrics Understand systematic ways to measure your product funnel Learn how to isolate the most important metrics to maximize your product impact

Test & Learn: The Alchemy & Science of Product Metrics - Choosing Metrics Tha...

Test & Learn: The Alchemy & Science of Product Metrics - Choosing Metrics Tha...

Test & Learn: The Alchemy & Science of Product Metrics - Choosing Metrics Tha...

How HubSpot Launches Products - ProductCamp Boston

How HubSpot Launches Products - ProductCamp Boston

How HubSpot Launches Products - ProductCamp Boston

UX Research - simpler than you thought

UX Research - simpler than you thought

UX Research - simpler than you thought

Peep Laja, CEO, ConversionXL - How to Turn Data into Insights & Customers

Peep Laja, CEO, ConversionXL - How to Turn Data into Insights & Customers

Peep Laja, CEO, ConversionXL - How to Turn Data into Insights & Customers

Most optimizers aren’t big fans of the traditional redesign cycle. They promote a continuous optimizing process. And that’s fine. But sometimes websites just suck donkey balls. And a new website is a must. Also: new companies need new websites too. So how can we stop those clients from making shitty websites? Which research methods and tools can you use in which stage of the development cycle? And how do you this?

[Elite Camp 2016] Karl Gilis - How to Make Sure Your New Website Won’t Be a F...

[Elite Camp 2016] Karl Gilis - How to Make Sure Your New Website Won’t Be a F...

[Elite Camp 2016] Karl Gilis - How to Make Sure Your New Website Won’t Be a F...

Arash Arabi - A guide to multi-organisational distributed scrum

Arash Arabi - A guide to multi-organisational distributed scrum

Arash Arabi - A guide to multi-organisational distributed scrum

Scrum Australia Pty Ltd

Learnings from startups

Learnings from startups

Learnings from startups

Intro To Lean Startup (8 Oct 2015)

Intro To Lean Startup (8 Oct 2015)

Intro To Lean Startup (8 Oct 2015)

No matter the product you sell or the service you offer, your priority is improving user experience. You might associate "UX" with websites or tech tools, but the experience your prospects and clients are having right now are key to their decisions on whether or not to stick around. Churn or engagement? Loss or retention? Survey your target audience to better understand how to make their user experience even better.

How to Survey Your Target Audience's User Experience

How to Survey Your Target Audience's User Experience

How to Survey Your Target Audience's User Experience

What's hot (20)

[CXL Live 16] What to Test Next - Prioritizing Your Tests by Pauline Marol

[CXL Live 16] What to Test Next - Prioritizing Your Tests by Pauline Marol

[CXL Live 16] What to Test Next - Prioritizing Your Tests by Pauline Marol

[CXL Live 16] The Grand Unified Theory of Conversion Optimization by John Ekman

[CXL Live 16] The Grand Unified Theory of Conversion Optimization by John Ekman

[CXL Live 16] The Grand Unified Theory of Conversion Optimization by John Ekman

Be A Great Product Leader (Slack 2017)

Be A Great Product Leader (Slack 2017)

Be A Great Product Leader (Slack 2017)

Using data to create intrinsic motivation and a growth mindset

Using data to create intrinsic motivation and a growth mindset

Using data to create intrinsic motivation and a growth mindset

[CXL Live 16] "Best Practices" or "Common Practices" - Which Is It? by Justin...

[CXL Live 16] "Best Practices" or "Common Practices" - Which Is It? by Justin...

[CXL Live 16] "Best Practices" or "Common Practices" - Which Is It? by Justin...

How to Integrate Customer Discovery

How to Integrate Customer Discovery

How to Integrate Customer Discovery

Deep Customer Research...The Heart Of Innovation - Richard Young and Diana Ad...

Deep Customer Research...The Heart Of Innovation - Richard Young and Diana Ad...

Deep Customer Research...The Heart Of Innovation - Richard Young and Diana Ad...

The Business of Execution (Infographic)

The Business of Execution (Infographic)

The Business of Execution (Infographic)

Nicolas Visiers - User Experience Testing

Nicolas Visiers - User Experience Testing

Nicolas Visiers - User Experience Testing

Data informed design - UX Australia august 2015

Data informed design - UX Australia august 2015

Data informed design - UX Australia august 2015

[Elite Camp 2016] Craig Sullivan - Elite Camp Summary Session

[Elite Camp 2016] Craig Sullivan - Elite Camp Summary Session

[Elite Camp 2016] Craig Sullivan - Elite Camp Summary Session

Test & Learn: The Alchemy & Science of Product Metrics - Choosing Metrics Tha...

Test & Learn: The Alchemy & Science of Product Metrics - Choosing Metrics Tha...

Test & Learn: The Alchemy & Science of Product Metrics - Choosing Metrics Tha...

How HubSpot Launches Products - ProductCamp Boston

How HubSpot Launches Products - ProductCamp Boston

How HubSpot Launches Products - ProductCamp Boston

UX Research - simpler than you thought

UX Research - simpler than you thought

UX Research - simpler than you thought

Peep Laja, CEO, ConversionXL - How to Turn Data into Insights & Customers

Peep Laja, CEO, ConversionXL - How to Turn Data into Insights & Customers

Peep Laja, CEO, ConversionXL - How to Turn Data into Insights & Customers

[Elite Camp 2016] Karl Gilis - How to Make Sure Your New Website Won’t Be a F...

[Elite Camp 2016] Karl Gilis - How to Make Sure Your New Website Won’t Be a F...

[Elite Camp 2016] Karl Gilis - How to Make Sure Your New Website Won’t Be a F...

Arash Arabi - A guide to multi-organisational distributed scrum

Arash Arabi - A guide to multi-organisational distributed scrum

Arash Arabi - A guide to multi-organisational distributed scrum

Learnings from startups

Learnings from startups

Learnings from startups

Intro To Lean Startup (8 Oct 2015)

Intro To Lean Startup (8 Oct 2015)

Intro To Lean Startup (8 Oct 2015)

How to Survey Your Target Audience's User Experience

How to Survey Your Target Audience's User Experience

How to Survey Your Target Audience's User Experience

Viewers also liked

A Tour of the Data Science Process, a Case Study Using Movie Industry Data

A Tour of the Data Science Process, a Case Study Using Movie Industry Data

A Tour of the Data Science Process, a Case Study Using Movie Industry Data

Domino Data Lab

Success Through an Actionable Data Science Stack

Success Through an Actionable Data Science Stack

Success Through an Actionable Data Science Stack

Domino Data Lab

Capturing the Mirage: Machine Learning in Media and Entertainment Industries

Capturing the Mirage: Machine Learning in Media and Entertainment Industries

Capturing the Mirage: Machine Learning in Media and Entertainment Industries

Domino Data Lab

by William Whipple Neely Director of Data Science at Electronic Arts Data scientists and analysts write code, sometimes a lot of code, so we are also software developers as much as model builders and algorithm creators. This talk is about the challenges a team of data scientists and analysts face when trying to scale their work, to make their work repeatable and testable. I’ll talk about how our data science team is leveling-up their skills as software developers, the challenges we’ve faced and the strategies that are helping.

Data Scientists Are Analysts Are Also Software Engineers

Data Scientists Are Analysts Are Also Software Engineers

Data Scientists Are Analysts Are Also Software Engineers

Domino Data Lab

by Paco Nathan Director, Learning Group at O’Reilly Media This talk will present: * the system architecture based on Jupyter as middleware, plus Thebe, Docker, Mesos, Nginx, etc. * data analytics and project experiences based on delivering _computable content_ at scale * supporting theory for this pedagogical approach, including Knuth’s _Literate Programming_ * media production techniques that use the video as _subtext_ We will also consider the use of notebooks (Jupyter and others) in an organizational context: how do notebooks help teams share and learn? what impact might notebooks have on developer collaboration that is currently focused on IDEs? The resulting medium provides highly effective tooling for a data-centric organization.

Computable content: Notebooks, containers, and data-centric organizational le...

Computable content: Notebooks, containers, and data-centric organizational le...

Computable content: Notebooks, containers, and data-centric organizational le...

Domino Data Lab

by Noelle Sio Saldana Principal Data Scientist at Pivotal The success of a Data Science project is not simply the model fit or the accuracy of its predictions; it is whether those models are being leveraged to make smarter business decisions. Over the past few years, Pivotal’s Data Scientists have experimented with software development methods practiced and taught by their Pivotal Labs counterparts in engineering, design and product management. By reframing Data Science as building software and products instead of research, we found that we reaped similar benefits: shorter and more productive iterations, and clients who actually used the models that we built and skills we taught long after we left. In this talk, we discuss how we have successfully (and maybe not as successfully) borrowed principles from practices like Lean and Agile to Data Science. Topics include: Minimum Viable Product Models Build-Measure-Learn instead of a silver bullet Pair programming Scrums and retrospectives Practicing empathy instead of elitism

Lean Data Science

Lean Data Science

Lean Data Science

Domino Data Lab

Sentiment Analysis of Film-Related Messages on Social Media

Sentiment Analysis of Film-Related Messages on Social Media

Sentiment Analysis of Film-Related Messages on Social Media

Domino Data Lab

by Hristo Spassimirov Paskov Founder and CEO, ThinkFast Mathematical Intelligence Corporations Intel Software Innovator for Artificial Intelligence Machine learning has revolutionized the technological landscape and its success has inspired the collection of vast amounts of data aimed at answering ever deeper questions and solving increasingly harder problems. Continuing this success critically relies on the existence of machine learning paradigms that can perform sophisticated analyses at the data scales required by modern data sets and that reduce development cycle times by improving ease of use. The evolution of machine learning paradigms shows a marked trend toward better addressing these desiderata and a convergence toward paradigms that blend “smooth” modeling techniques classically attributed to statistics with “combinatorial” elements traditionally studied in computer science. These modern learning paradigms pose a new set of challenges that, when properly addressed, open an unexpected wealth of possibilities. I will discuss how ThinkFast is solving these challenges with fundamental advances in optimization that promote the interpretation of machine learning as a more classical database technology. These advances allow us to scale a variety of techniques to unprecedented data scales using commodity hardware. They also provide surprising insights into how modern techniques learn about data, including a characterization of the limits of what they can learn, and ultimately allow us to devise new, more powerful techniques that do not suffer from these limitations.

ThinkFast: Scaling Machine Learning to Modern Demands

ThinkFast: Scaling Machine Learning to Modern Demands

ThinkFast: Scaling Machine Learning to Modern Demands

Domino Data Lab

by Szilard Pafka Chief Scientist at Epoch Szilard studied Physics in the 90s in Budapest and has obtained a PhD by using statistical methods to analyze the risk of financial portfolios. Next he has worked in finance quantifying and managing market risk. A decade ago he moved to California to become the Chief Scientist of a credit card processing company doing what now is called data science (data munging, analysis, modeling, visualization, machine learning etc). He is the founder/organizer of several data science meetups in Santa Monica, and he is also a visiting professor at CEU in Budapest, where he teaches data science in the Masters in Business Analytics program. While extracting business value from data has been performed by practitioners for decades, the last several years have seen an unprecedented amount of hype in this field. This hype has created not only unrealistic expectations in results, but also glamour in the usage of the newest tools assumably capable of extraordinary feats. In this talk I will apply the much needed methods of critical thinking and quantitative measurements (that data scientists are supposed to use daily in solving problems for their companies) to assess the capabilities of the most widely used software tools for data science. I will discuss in details two such analyses, one concerning the size of datasets used for analytics and the other one regarding the performance of machine learning software used for supervised learning.

No-Bullshit Data Science

No-Bullshit Data Science

No-Bullshit Data Science

Domino Data Lab

Open Data for Social Good

Open Data for Social Good

Open Data for Social Good

Domino Data Lab

Realtime Learning: Using Triggers to Know What the ?$# is Going On

Realtime Learning: Using Triggers to Know What the ?$# is Going On

Realtime Learning: Using Triggers to Know What the ?$# is Going On

Domino Data Lab

Machine Learning at Netflix

Machine Learning at Netflix

Machine Learning at Netflix

Domino Data Lab

The Right Question

The Right Question

The Right Question

Domino Data Lab

Challenges of Predicting User Engagement

Challenges of Predicting User Engagement

Challenges of Predicting User Engagement

Domino Data Lab

How to Improve agile team efficiency

How to Improve agile team efficiency

How to Improve agile team efficiency

Agile software development practices are based on a set of values and principles described in the Agile Manifesto. As change agents for Agile transformation, we rely on these to help get the message across. There is another layer below principles, a set of scientific models that can help explain why the principleswork and strengthen the Agile message for some audiences. These are described in this presentation.

Systems Concepts for Agile Practitioners

Systems Concepts for Agile Practitioners

Systems Concepts for Agile Practitioners

Tune up your data science process

Tune up your data science process

Tune up your data science process

Benjamin Skrainka

Analysis, data & process modeling

Analysis, data & process modeling

Analysis, data & process modeling

Cross border - off-shoring and outsourcing privacy sensitive data

Cross border - off-shoring and outsourcing privacy sensitive data

Cross border - off-shoring and outsourcing privacy sensitive data

Data science training in hyderabad

Data science training in hyderabad

Data science training in hyderabad

Kelly Technologies

Viewers also liked (20)

A Tour of the Data Science Process, a Case Study Using Movie Industry Data

A Tour of the Data Science Process, a Case Study Using Movie Industry Data

A Tour of the Data Science Process, a Case Study Using Movie Industry Data

Success Through an Actionable Data Science Stack

Success Through an Actionable Data Science Stack

Success Through an Actionable Data Science Stack

Capturing the Mirage: Machine Learning in Media and Entertainment Industries

Capturing the Mirage: Machine Learning in Media and Entertainment Industries

Capturing the Mirage: Machine Learning in Media and Entertainment Industries

Data Scientists Are Analysts Are Also Software Engineers

Data Scientists Are Analysts Are Also Software Engineers

Data Scientists Are Analysts Are Also Software Engineers

Computable content: Notebooks, containers, and data-centric organizational le...

Computable content: Notebooks, containers, and data-centric organizational le...

Computable content: Notebooks, containers, and data-centric organizational le...

Lean Data Science

Lean Data Science

Lean Data Science

Sentiment Analysis of Film-Related Messages on Social Media

Sentiment Analysis of Film-Related Messages on Social Media

Sentiment Analysis of Film-Related Messages on Social Media

ThinkFast: Scaling Machine Learning to Modern Demands

ThinkFast: Scaling Machine Learning to Modern Demands

ThinkFast: Scaling Machine Learning to Modern Demands

No-Bullshit Data Science

No-Bullshit Data Science

No-Bullshit Data Science

Open Data for Social Good

Open Data for Social Good

Open Data for Social Good

Realtime Learning: Using Triggers to Know What the ?$# is Going On

Realtime Learning: Using Triggers to Know What the ?$# is Going On

Realtime Learning: Using Triggers to Know What the ?$# is Going On

Machine Learning at Netflix

Machine Learning at Netflix

Machine Learning at Netflix

The Right Question

The Right Question

The Right Question

Challenges of Predicting User Engagement

Challenges of Predicting User Engagement

Challenges of Predicting User Engagement

How to Improve agile team efficiency

How to Improve agile team efficiency

How to Improve agile team efficiency

Systems Concepts for Agile Practitioners

Systems Concepts for Agile Practitioners

Systems Concepts for Agile Practitioners

Tune up your data science process

Tune up your data science process

Tune up your data science process

Analysis, data & process modeling

Analysis, data & process modeling

Analysis, data & process modeling

Cross border - off-shoring and outsourcing privacy sensitive data

Cross border - off-shoring and outsourcing privacy sensitive data

Cross border - off-shoring and outsourcing privacy sensitive data

Data science training in hyderabad

Data science training in hyderabad

Data science training in hyderabad

Similar to Data Science and Goodhart's Law

The Finishing Line

The Finishing Line

The Finishing Line

Oban International

How to design powerful experiments - Ying Zhang

How to design powerful experiments - Ying Zhang

How to design powerful experiments - Ying Zhang

Product Anonymous

Stochastic Modeling - Financial Reporting

Stochastic Modeling - Financial Reporting

Stochastic Modeling - Financial Reporting

Barga Galvanize Sept 2015

Barga Galvanize Sept 2015

Barga Galvanize Sept 2015

Nancy's webinar

Nancy's webinar

Nancy's webinar

Tale of Two Tests

Tale of Two Tests

Tale of Two Tests

Are you ready for Data science? A 12 point test

Are you ready for Data science? A 12 point test

Are you ready for Data science? A 12 point test

Short overview of decision analysis in project management; project decision analysis workflow; introduction to psychology of judgement and decision-making in project management. For more information how to perform schedule risk analysis using RiskyProject software please visit Intaver Institute web site: http://www.intaver.com. About Intaver Institute. Intaver Institute Inc. develops project risk management and project risk analysis software. Intaver's flagship product is RiskyProject: project risk management software. RiskyProject integrates with Microsoft Project, Oracle Primavera, other project management software or can run standalone. RiskyProject comes in three configurations: RiskyProject Lite, RiskyProject Professional, and RiskyProject Enterprise.

Introduction to Project Decision Analysis

Introduction to Project Decision Analysis

Introduction to Project Decision Analysis

Intaver Insititute

Monte Carlo and Schedule Risk Analysis

Monte Carlo and Schedule Risk Analysis

Monte Carlo and Schedule Risk Analysis

Intaver Insititute

Product development is inherently risky. While lean and agile methods are praised for supporting rapid feedback from customers through experiments and continuous iteration, teams could do a lot better at prioritizing using basic modeling techniques from finance. This talk will focus on quantitative risk modeling when developing new products or services that do not have a well understood product/market fit scenario. Using modeling approaches like Monte Carlo simulations and Cost of Delay scenarios, combined with qualitative tools like the Lean Canvas and Value Dynamics, we will explore how lean innovation teams can bring scientific rigor back into their process.

Stop Flying Blind! Quantifying Risk with Monte Carlo Simulation

Stop Flying Blind! Quantifying Risk with Monte Carlo Simulation

Stop Flying Blind! Quantifying Risk with Monte Carlo Simulation

Presentations - Zarget CRO meetup 2017

Presentations - Zarget CRO meetup 2017

Presentations - Zarget CRO meetup 2017

Webinar: Experimentation & Product Management by Indeed Product Lead

Webinar: Experimentation & Product Management by Indeed Product Lead

Webinar: Experimentation & Product Management by Indeed Product Lead

MonetizingStatistics

MonetizingStatistics

MonetizingStatistics

Models ABC

Sergey Sviridenko

When Will This Be Done?

When Will This Be Done?

When Will This Be Done?

Training manual - customer development

Training manual - customer development

Training manual - customer development

Becoming data-driven requires analytics to be embedded throughout the organization in different functional areas and different operational processes. But how do you provide more and more people with the ability to run any analytics on any data anywhere– without breaking the bank? In this session, you’ll see real-world examples of Dell customers who have successfully embedded analytics across processes and operations to drive innovation.We will also demonstrate how embedding analytics enables faster innovation and improves collaboration between data scientists, business analysts, and business stakeholders, leading to a competitive advantage.

If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...

If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...

If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...

Quality

AI in the Real World: Challenges, and Risks and how to handle them?

AI in the Real World: Challenges, and Risks and how to handle them?

AI in the Real World: Challenges, and Risks and how to handle them?

Leveraging Gap Analysis for Continuous Improvement

Leveraging Gap Analysis for Continuous Improvement

Leveraging Gap Analysis for Continuous Improvement

Recently uploaded (20)

Ransomware_Q4_2023. The report. [EN].pdf

Ransomware_Q4_2023. The report. [EN].pdf

Ransomware_Q4_2023. The report. [EN].pdf

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...

Boost Fertility New Invention Ups Success Rates.pdf

Boost Fertility New Invention Ups Success Rates.pdf

Boost Fertility New Invention Ups Success Rates.pdf

Architecting Cloud Native Applications

Architecting Cloud Native Applications

Architecting Cloud Native Applications

Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu

Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu

Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...

AWS Community Day CPH - Three problems of Terraform

AWS Community Day CPH - Three problems of Terraform

AWS Community Day CPH - Three problems of Terraform

Manulife - Insurer Transformation Award 2024

Manulife - Insurer Transformation Award 2024

Manulife - Insurer Transformation Award 2024

MINDCTI Revenue Release Quarter One 2024

MINDCTI Revenue Release Quarter One 2024

MINDCTI Revenue Release Quarter One 2024

Exploring the Future Potential of AI-Enabled Smartphone Processors

Exploring the Future Potential of AI-Enabled Smartphone Processors

Exploring the Future Potential of AI-Enabled Smartphone Processors

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood

AXA XL - Insurer Innovation Award Americas 2024

AXA XL - Insurer Innovation Award Americas 2024

AXA XL - Insurer Innovation Award Americas 2024

Data Science and Goodhart's Law

1. Data Science and Goodhart’s Law. Kyle Polich, Data Science, Inc.

2. Goodhart’s Law: When a measure becomes a target, it ceases to be a good measure.

3. Sales Rep Compensation Example
• Base pay + variable commission
• For monthly <50k, commission = 3%
• For monthly 50-99k, commission = 5%
• For monthly 100k+, commission = 7%
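To make the incentive concrete, here is a minimal Python sketch of the tiers on this slide. It assumes the rate applies as a flat percentage to the whole month's total and that the tiers break at exactly 50,000 and 100,000; the slide does not say either way. The function name `commission` is illustrative.

```python
def commission(monthly_sales: int) -> int:
    """Commission in whole dollars, assuming a flat rate on the month's total."""
    if monthly_sales < 50_000:
        rate_pct = 3
    elif monthly_sales < 100_000:
        rate_pct = 5
    else:
        rate_pct = 7
    return monthly_sales * rate_pct // 100


# Same 150k of sales over two months, booked two different ways.
even = commission(75_000) + commission(75_000)
shifted = commission(50_000) + commission(100_000)
print(even, shifted)
```

Under these assumptions a rep earns more by shifting deals between months than by selling evenly, even though total sales are unchanged. Monthly sales became a target, so they stop being a clean measure of selling activity.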

4. Some Examples
• Spam filtering arms race
• Search engine ranking
• Clearing cookies to get better airline prices
• Keep account open to manipulate FICO score
• Retail discounting/couponing strategies
• Bidding in AdTech marketplaces

5. Measuring with Cross Validation
• You should be doing this anyway!
• Set production performance expectation
• Measure post deployment
• Total deviation = deviation due to overfit + deviation due to incomplete training + deviation due to Goodhart’s Law
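A rough sketch of the comparison this slide describes: use the cross-validation fold scores as the production expectation, then check how far the post-deployment score falls from it. The function name `deviation_report`, the example scores, and the two-standard-deviation cutoff are all assumptions for illustration. The check only flags a gap; it cannot split the gap into the three sources listed above.

```python
from statistics import mean, stdev


def deviation_report(cv_scores, production_score, k=2.0):
    """Compare a post-deployment score with the cross-validated expectation."""
    expected = mean(cv_scores)          # production performance expectation
    spread = stdev(cv_scores)           # fold-to-fold variation
    deviation = expected - production_score
    return {
        "expected": expected,
        "deviation": deviation,
        # Flag drops larger than k times the fold-to-fold spread (arbitrary cutoff).
        "unexpected": deviation > k * spread,
    }


cv_scores = [0.80, 0.82, 0.78, 0.81, 0.79]  # made-up fold accuracies
report = deviation_report(cv_scores, production_score=0.70)
print(report)
```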

6. Measuring via Homogeneity Assumption: Can you train a model to accurately predict the date at which the observation was created?
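One way to try this question, sketched in Python under simplifying assumptions: instead of predicting the exact date, a one-feature threshold classifier guesses whether an observation comes from an early or a late period. Held-out accuracy near 0.5 suggests the two periods look alike; accuracy well above 0.5 suggests the data has changed. The function name and the synthetic data are illustrative, and a real check would also have to rule out seasonal effects.

```python
from statistics import mean


def period_separability(early, late):
    """Held-out accuracy of a threshold classifier that guesses the period."""
    half_e, half_l = len(early) // 2, len(late) // 2
    train_e, test_e = early[:half_e], early[half_e:]
    train_l, test_l = late[:half_l], late[half_l:]

    # "Train": put the threshold midway between the two period means.
    threshold = (mean(train_e) + mean(train_l)) / 2
    late_is_higher = mean(train_l) >= mean(train_e)

    def predict_late(x):
        return (x > threshold) == late_is_higher

    correct = sum(not predict_late(x) for x in test_e)
    correct += sum(predict_late(x) for x in test_l)
    return correct / (len(test_e) + len(test_l))


early = [i % 10 for i in range(100)]   # synthetic feature values
stable = list(early)                   # later period, same distribution
drifted = [x + 6 for x in early]       # later period, shifted upward

print(period_separability(early, stable))
print(period_separability(early, drifted))
```

On this synthetic data the stable case scores 0.5 (chance) and the shifted case scores 0.8.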

7. Measuring Drift

8. Measuring Drift: Typical failure from a web application release

9. Measuring Drift: Possible failure from a web application release

10. Dealing with it
• Detection is key
• Experimentation is required
• Agile methods for model deployment

11. Causal Impact
• An approach to estimating the causal effect of a designed intervention on a time series.
• Predicts counterfactual (how response likely would have evolved absent the intervention)
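The counterfactual idea can be sketched with a deliberately simple stand-in: fit an ordinary least-squares line between a control series and the response before the intervention, project it forward, and subtract. This is not the model behind Causal Impact itself. The helper names, the control series, and the numbers are all assumptions for illustration.

```python
from statistics import mean


def fit_line(x, y):
    """Ordinary least-squares slope and intercept for y ~ x."""
    mx, my = mean(x), mean(y)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sum(
        (a - mx) ** 2 for a in x
    )
    return slope, my - slope * mx


def estimated_effect(control, response, launch_index):
    """Actual minus counterfactual for each point after the intervention."""
    slope, intercept = fit_line(control[:launch_index], response[:launch_index])
    # Counterfactual: what the pre-period relationship predicts post-launch.
    counterfactual = [slope * c + intercept for c in control[launch_index:]]
    return [a - cf for a, cf in zip(response[launch_index:], counterfactual)]


# Made-up series: the response tracks the control until index 5,
# then sits about 5 units higher after the intervention.
control = [10, 12, 11, 13, 12, 14, 13, 15]
response = [21, 25, 23, 27, 25, 34, 32, 36]
effects = estimated_effect(control, response, launch_index=5)
print([round(e, 2) for e in effects])
```

The estimated effect is about 5 for each post-launch point, which is the lift built into the made-up data. The sketch depends on the control series not being affected by the intervention.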

12. Self Fulfilling Prophecies
• Beware!
• Case study: lead qualification
– Try to predict leads that will close
– Relearn the bias of your training

13. Fast Iterations
• Outside normal SWLC release cycle
– State updates
– Parameter tuning
• Run experiments

14. Explanatory power
• Goodhart’s law will often manifest on only a subset of (possibly significant) instances.
• Model interpretability for affected instances is key

15. Interpretable Models

16. Interpretable Models

17. Model Interpretability: “Why Should I Trust You?”: Explaining the Predictions of Any Classifier (Ribeiro, Singh, Guestrin)

18. Summary
• Goodhart’s law: When a measure becomes a target, it ceases to be a good measure
• As a data scientist, if your work is meaningful, you will encounter it
• Try to measure it in the data
• Work on explanatory models to mitigate
• Don’t let the average case blind you

19. DataScience
facebook.com/datascience
@DataSkeptic
@datascienceinc
linkedin.com/company/datascience-inc
(310) 579 - 6200

Editor's Notes

• Define homogeneity assumption
• Be careful of detecting a seasonal effect
• Be careful of your covariate selection