SlideShare uma empresa Scribd logo
1 de 32
1© 2017 Cloudera, Inc. All rights reserved.
Digital Government
Data + Government Isn’t Enough
Trey Causey
Senior Product Manager, Socrata
@treycausey
#wrangleconf | Powered by Cloudera
2© 2017 Cloudera, Inc. All rights reserved.
3© 2017 Cloudera, Inc. All rights reserved.
4© 2017 Cloudera, Inc. All rights reserved.
5© 2017 Cloudera, Inc. All rights reserved.
6© 2017 Cloudera, Inc. All rights reserved.
7© 2017 Cloudera, Inc. All rights reserved.
Adoption and compliance
8© 2017 Cloudera, Inc. All rights reserved.
9© 2017 Cloudera, Inc. All rights reserved.
10© 2017 Cloudera, Inc. All rights reserved.
FOIA requests down
50%.
Requests for officer-
involved shooting data to
zero.
11© 2017 Cloudera, Inc. All rights reserved.
Open data success stories
12© 2017 Cloudera, Inc. All rights reserved.
NYC
Transparency
13© 2017 Cloudera, Inc. All rights reserved.
14© 2017 Cloudera, Inc. All rights reserved.
15© 2017 Cloudera, Inc. All rights reserved.
New Orleans
Safety & efficiency
16© 2017 Cloudera, Inc. All rights reserved.
“Nothing we did required big data or fancy machines or big tech investments. We are a city of less
than 400,000 and we are strained for resources. If we can do it, anyone can do it.”
- Oliver Wise, Director, Office of Performance & Analytics
17© 2017 Cloudera, Inc. All rights reserved.
18© 2017 Cloudera, Inc. All rights reserved.
Jackson, MS
Transparency &
accountability
19© 2017 Cloudera, Inc. All rights reserved.
2014 2016
20© 2017 Cloudera, Inc. All rights reserved.
21© 2017 Cloudera, Inc. All rights reserved.
22© 2017 Cloudera, Inc. All rights reserved.
Data isn’t enough
23© 2017 Cloudera, Inc. All rights reserved.
Compliance != compelling.
24© 2017 Cloudera, Inc. All rights reserved.
Data-informed decision-
making needs (open) data.
25© 2017 Cloudera, Inc. All rights reserved.
Data science in
government is a whole
different ball game.
26© 2017 Cloudera, Inc. All rights reserved.
All eyes are on you.
27© 2017 Cloudera, Inc. All rights reserved.
28© 2017 Cloudera, Inc. All rights reserved.
Data is locked up in legacy
systems.
29© 2017 Cloudera, Inc. All rights reserved.
Tooling is complicated and
data is (very) messy.
30© 2017 Cloudera, Inc. All rights reserved.
Your audience is everyone.
31© 2017 Cloudera, Inc. All rights reserved.
data
+ mission
+ strategy
= successful open data
Thank you!
@treycausey

Mais conteúdo relacionado

Mais procurados

How Virtual Reality and Machine Learning Are Powering the New Age of Network ...
How Virtual Reality and Machine Learning Are Powering the New Age of Network ...How Virtual Reality and Machine Learning Are Powering the New Age of Network ...
How Virtual Reality and Machine Learning Are Powering the New Age of Network ...
DataStax
 
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop
Dr. Wilfred Lin (Ph.D.)
 

Mais procurados (20)

Webinar - Case Study: ProtectWise enhances network security with DataStax alw...
Webinar - Case Study: ProtectWise enhances network security with DataStax alw...Webinar - Case Study: ProtectWise enhances network security with DataStax alw...
Webinar - Case Study: ProtectWise enhances network security with DataStax alw...
 
The Journey to Success with Big Data
The Journey to Success with Big DataThe Journey to Success with Big Data
The Journey to Success with Big Data
 
The Big Picture: Real-time Data is Defining Intelligent Offers
The Big Picture: Real-time Data is Defining Intelligent OffersThe Big Picture: Real-time Data is Defining Intelligent Offers
The Big Picture: Real-time Data is Defining Intelligent Offers
 
Data Science in Enterprise
Data Science in EnterpriseData Science in Enterprise
Data Science in Enterprise
 
Case study: Hadoop as ELT for Leading US Retailer - Happiest Minds
Case study: Hadoop as ELT for Leading US Retailer - Happiest MindsCase study: Hadoop as ELT for Leading US Retailer - Happiest Minds
Case study: Hadoop as ELT for Leading US Retailer - Happiest Minds
 
Optimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataOptimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big Data
 
Rethink Analytics with an Enterprise Data Hub
Rethink Analytics with an Enterprise Data HubRethink Analytics with an Enterprise Data Hub
Rethink Analytics with an Enterprise Data Hub
 
Multi Cloud Data Integration- Retail
Multi Cloud Data Integration- RetailMulti Cloud Data Integration- Retail
Multi Cloud Data Integration- Retail
 
IoT-Enabled Predictive Maintenance
IoT-Enabled Predictive MaintenanceIoT-Enabled Predictive Maintenance
IoT-Enabled Predictive Maintenance
 
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
Webinar  |  Aligning GDPR Requirements with Today's Hybrid Cloud RealitiesWebinar  |  Aligning GDPR Requirements with Today's Hybrid Cloud Realities
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
 
How Virtual Reality and Machine Learning Are Powering the New Age of Network ...
How Virtual Reality and Machine Learning Are Powering the New Age of Network ...How Virtual Reality and Machine Learning Are Powering the New Age of Network ...
How Virtual Reality and Machine Learning Are Powering the New Age of Network ...
 
How Cloudera SDX can aid GDPR compliance 6.21.18
How Cloudera SDX can aid GDPR compliance 6.21.18How Cloudera SDX can aid GDPR compliance 6.21.18
How Cloudera SDX can aid GDPR compliance 6.21.18
 
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Power Innovation with Geo-Distributed Data Management in Hybrid CloudHow to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Protecting health and life science organizations from breaches and ransomware
Protecting health and life science organizations from breaches and ransomwareProtecting health and life science organizations from breaches and ransomware
Protecting health and life science organizations from breaches and ransomware
 
Building Data Science Teams: A Moneyball Approach
Building Data Science Teams: A Moneyball ApproachBuilding Data Science Teams: A Moneyball Approach
Building Data Science Teams: A Moneyball Approach
 
Multi Cloud Data Integration- Manufacturing Industry
Multi Cloud Data Integration- Manufacturing IndustryMulti Cloud Data Integration- Manufacturing Industry
Multi Cloud Data Integration- Manufacturing Industry
 
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop
 
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
 
Building Data Science Teams
Building Data Science TeamsBuilding Data Science Teams
Building Data Science Teams
 

Semelhante a Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017

graymatter-pentaho-consulting-services-.pdf
graymatter-pentaho-consulting-services-.pdfgraymatter-pentaho-consulting-services-.pdf
graymatter-pentaho-consulting-services-.pdf
GrayMatter Software Services
 

Semelhante a Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017 (20)

Getting Started with Data Governance? Use Process Models!
Getting Started with Data Governance? Use Process Models!Getting Started with Data Governance? Use Process Models!
Getting Started with Data Governance? Use Process Models!
 
Garantice la continuidad de su negocio Damian Prieto
Garantice la continuidad de su negocio   Damian PrietoGarantice la continuidad de su negocio   Damian Prieto
Garantice la continuidad de su negocio Damian Prieto
 
Geek Sync | Tackling Key GDPR Challenges with Data Modeling and Governance
Geek Sync | Tackling Key GDPR Challenges with Data Modeling and GovernanceGeek Sync | Tackling Key GDPR Challenges with Data Modeling and Governance
Geek Sync | Tackling Key GDPR Challenges with Data Modeling and Governance
 
BDNA joins Flexera
BDNA joins FlexeraBDNA joins Flexera
BDNA joins Flexera
 
Denodo DataFest 2017: Company Leadership from Data Leadership
Denodo DataFest 2017: Company Leadership from Data LeadershipDenodo DataFest 2017: Company Leadership from Data Leadership
Denodo DataFest 2017: Company Leadership from Data Leadership
 
A Data Privacy & Security Year in Review: Top 10 Trends and Predictions
A Data Privacy & Security Year in Review: Top 10 Trends and PredictionsA Data Privacy & Security Year in Review: Top 10 Trends and Predictions
A Data Privacy & Security Year in Review: Top 10 Trends and Predictions
 
Data & Analytic Innovations: 5 lessons from our customers
Data & Analytic Innovations: 5 lessons from our customersData & Analytic Innovations: 5 lessons from our customers
Data & Analytic Innovations: 5 lessons from our customers
 
Data Analytics in Cyber Security
Data Analytics in Cyber SecurityData Analytics in Cyber Security
Data Analytics in Cyber Security
 
Data Analytics in Cyber Security
Data Analytics in Cyber Security Data Analytics in Cyber Security
Data Analytics in Cyber Security
 
10 Step Guide to Cloud Security - 10th Magnitude - CSS Dallas Azure
10 Step Guide to Cloud Security - 10th Magnitude - CSS Dallas Azure10 Step Guide to Cloud Security - 10th Magnitude - CSS Dallas Azure
10 Step Guide to Cloud Security - 10th Magnitude - CSS Dallas Azure
 
Observability in DevOps with Modern Distributed Applications
Observability in DevOps with Modern Distributed ApplicationsObservability in DevOps with Modern Distributed Applications
Observability in DevOps with Modern Distributed Applications
 
Keynote: Art of the Possible - Moore
Keynote: Art of the Possible - MooreKeynote: Art of the Possible - Moore
Keynote: Art of the Possible - Moore
 
From Rogue One to Rebel Alliance: Building Developers into Security Champions
From Rogue One to Rebel Alliance: Building Developers into Security Champions From Rogue One to Rebel Alliance: Building Developers into Security Champions
From Rogue One to Rebel Alliance: Building Developers into Security Champions
 
Session 2.6 semantic data governance for regulatory compliance
Session 2.6   semantic data governance for regulatory complianceSession 2.6   semantic data governance for regulatory compliance
Session 2.6 semantic data governance for regulatory compliance
 
Creating an EDGE - Enterprise Data Governance Experience
Creating an EDGE - Enterprise Data Governance ExperienceCreating an EDGE - Enterprise Data Governance Experience
Creating an EDGE - Enterprise Data Governance Experience
 
Benchmarking Your GDPR Compliance: Will You Make the Grade? [TrustArc Webinar...
Benchmarking Your GDPR Compliance: Will You Make the Grade? [TrustArc Webinar...Benchmarking Your GDPR Compliance: Will You Make the Grade? [TrustArc Webinar...
Benchmarking Your GDPR Compliance: Will You Make the Grade? [TrustArc Webinar...
 
graymatter-pentaho-consulting-services-.pdf
graymatter-pentaho-consulting-services-.pdfgraymatter-pentaho-consulting-services-.pdf
graymatter-pentaho-consulting-services-.pdf
 
Veritas Solution Day 2017, France, keynote by Mike Palmer
Veritas Solution Day 2017, France, keynote by Mike PalmerVeritas Solution Day 2017, France, keynote by Mike Palmer
Veritas Solution Day 2017, France, keynote by Mike Palmer
 
Why Your Approach To Data Governance Needs a Major Update
Why Your Approach To Data Governance Needs a Major UpdateWhy Your Approach To Data Governance Needs a Major Update
Why Your Approach To Data Governance Needs a Major Update
 
5 Steps to Prepare for Digital Transformation & Real-Time Analytics
5 Steps to Prepare for Digital Transformation & Real-Time Analytics 5 Steps to Prepare for Digital Transformation & Real-Time Analytics
5 Steps to Prepare for Digital Transformation & Real-Time Analytics
 

Mais de Cloudera, Inc.

Mais de Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 
Cloudera SDX
Cloudera SDXCloudera SDX
Cloudera SDX
 

Último

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Último (20)

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 

Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017

Notas do Editor

  1. We're SaaS business providing cloud-based solutions for data-driven government. Data-as-a-service platform and cloud applications for government agencies  We make government data discoverable, reusable, and actionable I’m a social scientist turned data scientist turned product manager, so I think a lot about:  how humans generate data and how that data gets encoded, and then how that encoded data gets turned into models. Here I am at our annual employee summit – I was in the middle of talking to Dave Doyle, the City of Seattle open data program manager, who had just given closing remarks, and didn’t realize a group photo was forming. Or, as a coworker said to me after seeing this photo “man you must really love that laptop" I own several of our backend services –- all of the way that our customers ingress data onto our platform – as well as a chunk of our machine learning infrastructure.
  2. What do we build to enable open data? Open data portals – - web interfacet to a cloud-based data-as-a-service platform that lets agencies provide open data to their communities with built-in APIs, search, etc. making the data discoverable and resuable.  - this is probably what most people think of when they think of open data - lots of cities, counties, and states, some NGOs and federal agencies as well
  3. Open data powers applications as well that help agencies plan and communicate with stakeholders: - here’s the City of Seattle’s open budget  - Lets anyone explore the budget in nitty gritty detail, interactively
  4. Becoming more popular – performance programs - lets agencies provide transparency and accountability for the goals they've set for programs and initiatives - think of it as a public dashboard with KPIs that anyone can check in on - Budgets and performance dashboards are changes in the consumption experience for the raw data that open data portals host - represent the overall maturation of open data, as we move from data for data's sake to solving specific problems with open data and putting open data in the path of government work, rather than a destination where data lives
  5. - Even Steve Ballmer is getting in on the action Side project to help Americans understand the flow of money in government No LA Clippers salary cap explorer – but he’ll have 21 million or so freed up without Chris Paul in the upcoming season. However, USA facts is having the same problems with engagement that I'll talk about later in this talk – at the national governor's association last week, he said after a big flurry of publicity, they're only getting about 4000 visitors a day. Just putting data up for people to consume often doesn't produce a ton of engagement
  6. What’s the state of open data in 2017?
  7. - Broke the rules of giving a talk by discussing open data so far without really talking about what it is and what we mean by it Open data as both an idea and a practice really picked up steam during the Obama Administration – but the concept has been around for a while. So what do we mean? Data can be open in a couple of ways –  machine readable, available programmatically – this means data in widely accepted formats like CSV, JSON, XML, not locked up in a PDF or stored in physical copies that require a FOIA to get at It also means APIs that allow programmatic retrieval, and to enable developers to build applications with the data – if you've used a non-city-created transit application to find out when your bus/train is coming, you've benefited from this kind of open data. permissively licensed for reuse This is obviously a stickier issue – as licensing always is – and differs from agency to agency, but there is a generally agreed upon idea that open data should be available for reuse by anyone that wants to use it (at the very least non-commercially) but often this means for commercial reuse as well. Some examples of data reuse in applications include restaurant inspections in Yelp reviews or data used by Zillow for estimating housing value. 
  8. Sunlight Foundation, a non-profit dedicated to making governments accountable and transparent, has published guidelines for open data, many of which have been adopted by government agencies in creating their open data polices. Over the past 10 years or so, we've seen a number of municipalities create policies or legislation mandating that any data that can be made open be made open. The implementation and wording differs from agency to agency, but a growing number. Open Data Policies dot org – branched off from sunlight -- lists 101 current open data policies at the city, county, and state levels Federal level: DATA Act: Digital Accountability and Transparency Act (2014) – how does the government spend its money?
  9. This widespread adoption has produced real results for many agencies. The city of Chicago saw their freedom of information act requests drop by 50% when they launched their open data portal. Chicago is a real leader in open data – constantly pushing us – and open sourcing a lot of their work for other governments to use. They recently relaunched their open data portal (data.cityofchicago.org) to great success. The Dallas PD saw requests for data on officer-involved shootings drop to zero (dallasopendata.com) when they started releasing that data on their open data portal. Fulfilling requests for records and freedom of information act requests are resource-intensive. They’re usually time-bound and failure to comply carries a penalty.
  10. Beyond time saving and request-fulfilling efficiency, Where are we with open data? What are some of the success stories? Hopefully the data scientists in the crowd will allow me a bit of selection on the dependent variable for a moment.
  11. NYC has one of the most active open data programs in the country, thanks to a combination of a very talented staff and a legislated primary goal for all city agencies to share open data. The Department of Information Technology and Telecommunications (best acronym -- DoITT) and MODA (the Mayor’s Office of Data Analytics) manage a tremendously complicated program with great success – acting as central hubs for many of the city's agencies and their data. NYC also has stringent retention policies, so a lot of moving pieces to manage. A favorite dataset of mine is the NYC tree census of all trees in NYC – whenever we're testing out geospatial features, we usually use that dataset as one of our testers. 
  12. - Probably the most common thing many people think of when they think of Open Data – citizens serving as independent watchdogs, finding inefficiencies, injustices, and just plain mistakes. “Transparency” Ben Wellington is a quantitative analyst (who runs the popular blog I Quant NY) at Two Sigma – which if you’ve been to a recent PyData or SciPy conference will be familiar did this analysis using NYC open data.  Something worth over $33,000 in this picture – and it’s not the Mercedes.   Using NYC Open Data, found 84 tickets over a 4.5-month period -- $33,000 a year in fines, not including towing fees. One block over, another hydrant generating 24,000 a year – over 55,000 a year on two blocks
  13. Open data for the same reason you crowdsource things Can’t think of all possible questions to ask, instead, rely on motivated individuals to ask them Of course, this cost the city 55,000 a year! This has a bit of a bizarre twist, as there’s some confusion over whether these cars were parked legally or not. Using Google Maps – there is a protected bike lane between the cars and the fire hydrant.
  14. New Orleans is a real leader in performance management – a city with a number of well-known and lesser-known problems that really embraced tying its decision-making to open data.
  15. - NOLA combined data from the American Housing Survey, American Community Survey (both from the US Census) and FD admin data Office of Performance & Analytics identified homes most at-risk for fire Distributed 8,000 smoke detectors, reducing fire deaths & injuries A family of 11 later that year escaped a house fire after a smoke alarm went off at 3am – that had been installed as part of this program Also increased operational efficiency of FD by modeling where fires are most likely to occur
  16. Blight is a well-known problem in New Orleans, which has experienced natural disasters and social crises over the past century – homes left abandoned, boarded up in a city that is always at risk of being reclaimed by the earth.  Blight is problematic for a number of reasons – crime, pest infestation, public safety, and that lot is blocking new development / revenue collection / depressing home values elsewhere.  - Blight affected up to ¼ of all residential addresses in NOLA after Katrina Formed BlightSTAT – cross-department task force to reduce blight using data Decreased blight by 30% especially impressive when benchmarked against peer cities where abandoment rates are climbing cut response time from initial inspection to a hearing in half – over 3 months reduction
  17. - Jackson, MS – like many cities and like New Orleans, plagued by aging infrastructure - the kind of infrastructure that sees schoolbuses fall into sinkholes TWICE in three years – these are different buses on different streets. - more than half of the city's bridges in dire need of repairs Last year, one of these bridges targeted for repairs totally collapsed due to flooding – after closing, thankfully, but an example of the critical state of affairs. Estimates to repair Jackson's infrastructure range from 750 million to 1 billion dollars – that's 5-6 times the annual revenue of the city all told. City has one lever it can turn to generate revenue --  1 percent sales tax voted in, with revenue reserved for capital projects
  18. Jackson knew that the only way to keep the public's trust was to be open and transparent about how this revenue was being used and how it was tracking to meet its goals This slide, which you saw earlier, is actually part of Jackson's performance program – called JackStats Of particular interest given the bus-sinkhole problem is Operation Orange Cone By using 311data to more efficiently dispatch repair crews and identify problem areas, Filled over 69,000 potholes in 2 years, a 60% increase in pothole-filling compared to before Operation Orange Cone.  Some of these complaints dated to 2010! AND also decreased 311 calls
  19. Now residents can track the progress of Operation Orange Cone in a number of places and see how the project is doing on time and on budget, as well as getting up-to-date information about what streets are planned for resurfacing. Operation Orange Cone was supposed to be a two-week pilot program, but has run continuously for 2 years now.
  20. From the agency’s perspective, you need a strategy for success. Open data programs are most successful when they see high engagement with constituents and residents. Engagement is key – the mayor/governor wants to see that these open data programs are providing value. One way they do that is via constituent engagement. Checking the open data box doesn’t drive eyeballs. Need to figure out what problem you're solving and who you're solving it for. (Selfishly, I might say that open data programs need a good product manager but that’s probably another talk). Just releasing open data for civic hackers only helps a small portion of the population. When Jackson, MS needed to fix its roads to keep buses from falling into sinkholes, they made the data part of the planning process, the decisions that were made, and the reporting on the actions that resulted from those decisions. Putting a CSV on an open data portal doesn’t drive engagement, and is at worst a good way to have stale, out-of-date data that no one trusts or relies on. Each of the previous success stories began with some problem to be solved or goal to be accomplished – that’s what makes open data compelling. Data is only part of the story – what happens with the data is the magic of open data.
  21. Data-informed decision-making (credit to Greg Reda at PyData Seattle) needs data. For that data to be effective, that data needs to be up to date and authoritative. Government information workers need to be able to trust that data and integrate it into their workflows. “Open data portals” are a red herring – it shouldn’t be where old CSVs go to retire – it should be where government workers look for data they need to do their job.
  22. For every data scientist with a phd working at the NSA or the Census, there’s an analyst working at the department of public works in a city that has to work on 20 projects at once. They’re working with limited resources, limited time, and a public that often doesn’t care a whole lot about how busy they are. Public service isn’t just a catchphrase – there’s real service here.
  23. Your work is out there for the public to see. When your audience is everyone, you have to show your work. This means opening your data and explaining your models.  Sometimes your work is going to be featured in the local news, and there will often be press releases about it.  On the other hand, your work is going to be out there for everyone to see! I bet a lot of you don't get to talk about your work.
  24. Over 30,000 emails published for anyone to read. Released every Friday. This used to be a FOIA request that the local news outlets would make every week, now they just release them automatically to save them the trouble. Complex workflow that is half-automated, half-manual (can't release PII, constituent information, etc.).
  25. Government moves slowly – and upgrade cycles are no exception. Getting data from one agency to another is a real challenge. Talking to an analyst at a major west coast city – people drive around different lots in the city to verify permitting status, then literally fill out forms by hand, which are then delivered to a central office for data entry at a later date. By the time they’re digitized, who knows what has changed? Budgets run by department, but problems span departments – there’s no “department of homelessness”, but the problem requires action across agencies. Each agency has its own budget, and may use a different database or ERP system to track their data. It’s not as simple as a JOIN.
  26. You can’t just use that new library you read about on Hacker News into production – IT departments vary wildly in their permissiveness, and changes to infrastructure are extremely difficult. CIOs or CDOs may make software decisions for entire departments. You’re probably not going to be using Spark to build a predictive machine learning model that runs on a cluster and scales to millions of predictions a minute. Some agencies allow users to submit data via forms – and it’s messy. Different languages, profanity, and then some. I was speaking to an open data program manager from a city in the south and he was telling me that when they were cleaning up this data, retention laws meant that they had to go in and indicate where data had been changed from its original form. You think data scientists in the private sector complain about how hard it is to clean their data…
  27. Accessibility is a compliance matter – data you release and websites you host it on have to be accessible to people with disabilities, and there’s a shifting landscape of standards. You have no idea who will be accessing your data – it’s open! – and you can’t assume that they’ll be subject matter experts. Metadata, documentation are key – but of course they are time consuming. On the other hand – your audience is everyone! You get to work on things that affect all sorts of people in very real ways. Your predictive models have real impact – like saving lives through fire detector placement.
  28. For every data scientist with a phd working at the NSA or the Census, there’s an analyst working at the department of public works in a city that has to work on 20 projects at once. They’re working with limited resources, limited time, and a public that often doesn’t care a whole lot about how busy they are. Public service isn’t just a catchphrase – there’s real service here.