SlideShare uma empresa Scribd logo
1 de 46
Exploring Data Preparation and Visualization
           Tools for Urban Forestry



                  340 N 12th St, Suite 402
                  Philadelphia, PA 19107
                       215.925.2600
                    info@azavea.com
              www.azavea.com/opentreemap
About Us


    Deborah Boyer
    OpenTreeMap Project Manager
    dboyer@azavea.com
    215.701.7506




   Jeremy Heffner
   Product Manager
   jheffner@azavea.com
   215.701.7712
About Azavea

• Founded in 2000
• B Corporation
• 30+ people
• Based in Philadelphia
   – Boston office
• Geospatial + web + mobile
   – Software development
   – Spatial analysis services
   – User experience
Agenda


• The Ideal: Gathering Organized, Perfect Data

• The Reality: Cleaning and Preparing Your Data

• Adding Context

• Exploring, Preparing and Sharing Data Visualizations

• Questions
Gathering Data
An open source tree data management system
for collaborative, geography enabled urban tree inventory
Main Features

• Search and Explore Tree
  Data

• View Ecosystem Benefits

• Add New Trees

• Edit and Update Trees

• Upload Tree Photos

• Track Stewardship Activities
Data Quality Checks

• Remove duplicate
  trees during data
  upload

• Tree watch list

• Drop down lists

• User groups

• Reputation points
Cleaning and Preparing Data
Data Cleaning: Your Questions



• At what point in the data maintenance
  process do you find yourself cleaning data?



• Are there ways that you would like to
  improve the workflow?
Cleaning & Preparing Data

• Making sense of data starts at the point of collection
   – Define what you want to measure / track
      • Clearly define schema and fields
          – Have a shared meaning for values
          – Data validation on entry

   – Collect your data
   – Examine results
      • Are there common mistakes you could prevent?
      • Are there different interpretations of fields?

   – Close the feedback loop & iterate
Cleaning & Preparing Data

• Common data quality issues
   – Combined fields
      • Address: “340 N 12th St, Suite 402 , Philadelphia, PA 19107”

   – Invalid entries
      • ZIP code: 1234 (length check, is number)
      • Age: 204 (reasonable range check, is number)

   – Format variations
      • State: PA vs. Pennsylvania (drop down or scrubbing rules)

   – Duplicates
      • CRM: John Smith with old and new addresses
Cleaning & Preparing Data




    Not a reasonable option
What does this have to do with trees?

• We track things - tree inventories, potential planting
  sites, community groups, people who requested
  trees, etc .

• Data comes from lots of places - web forms,
  collected by various staff, submitted by community
  groups.

• None of it matches.

• Good data makes our lives easier.
Cleaning & Preparing Data

• Tools to clean tabular data
   – Excel (or open source equivalent)
      • Pros:
          – Broad features
          – Widely utilized / common skill
          – Formulas / sorting / flexible

      • Cons:
          – Doesn’t understand record concept
          – Mass changes can be tedious
Cleaning & Preparing Data

• Tools to clean tabular data
   – DataWrangler
      • http://vis.stanford.edu/wrangler/
      • Pros:
          – Focused on transforming data into relational format
          – Live previews

      • Cons:
          – Alpha quality version
          – Data size limits / online tool
          – Can be difficult to figure out what set of transforms are needed
Cleaning & Preparing Data

• Tools to clean tabular data
   – Google Refine
      • http://code.google.com/p/google-refine/
      • Pros:
          – Understands record concept
          – Formulas / Facets
          – Undo capability
          – Windows / Mac / Linux

      • Cons:
          – There is a learning curve
          – Unusual type of app
                Âť Download, unzip, run exe file, access through browser
Demo
Assembling Data and Building Context
Context: Your Questions

• What challenges have you faced putting your data
  in context?


• Are you struggling to identify what “context” means
  for your organization?


• Do you know what data you’d like to use, but have
  trouble finding it?
Your Data in Context

• Your data is essential!
• But it is more meaningful in context…
   – Ratios & rates
       • Service level
       • Market penetration

   – Indicators & trends
       • How you compare

   – Targeting
       • Key demographics                 Juice Analytics


       • Custom summaries
What does this have to do with trees?



• Trees don’t exist in a vacuum.

• Contextual data = more effective outreach.

• More info gives you new insights.
Making Sense of the Census

• American FactFinder
• http://factfinder2.census.gov
   – Decennial Census
      • Every 10 years
      • Full population survey
      • Just 10 questions

   – American Community Survey (ACS)
      • Monthly sample
      • Aggregated over different time periods (1-, 3- and 5-year)
      • Extremely detailed questions
      • Subject to sampling error
FactFinder Frustrations
Helpers: Social Explorer

• http://www.socialexplorer.com/

• Data Dictionary
   –   Survey
   –   Dataset
   –   Table
   –   Variable
   –   Formula
   –   Population
Helpers: Social Explorer

• Background
  – Key Terms
  – Collection Methodology
  – Uses & applications
Helpers: ACS Alchemist

•   https://github.com/azavea/acs-alchemist 
•   Retrieval of block group-level data
•   Custom variable selection
•   Delivery in spatial data format ready for mapping




This tool was developed by Azavea in collaboration with Jerry Ratcliffe and Ralph Taylor of Temple
University Center for Security and Crime Science. This project was supported by Award No. 2010-DE-BX-
K004, awarded by the National Institute of Justice, Office of Justice Programs, U.S. Department of Justice.
Helpers: ACS Alchemist

As easy as 1-2-3
1.Create a document with your selected variables
Helpers: ACS Alchemist

As easy as 1-2-3
1.Create a document with your selected variables
2.Pick your geographies
Helpers: ACS Alchemist

As easy as 1-2-3
1.Create a document with your selected variables
2.Pick your geographies and geolevels
3.Retrieve your shapefiles
Other Sources

• Public data
   – Open Data Portals
      • Federal, state & local data

   – Political Data
      • Voter data
      • Legislative boundaries



• Commercial data
   – Population Projections
   – Consumer Data
Data Visualizations
Data Visualization: Your Questions

• Do you currently share data with your constituents?


• Where do you use data visualizations (e.g. annual
  report, embedded infographics, live data trackers)?


• Do you currently map your data?
What does this have to do with trees?

• Charts, graphs, maps, and photos help us
  tell a story.

• Show that trees are more than just leaves
  and branches.

• Explore the science without making
  people’s eyes glaze over.
Exploring Data

• Visualization tools
   – Tableau
       • http://www.tableausoftware.com/
       • Pros:
           – Flexible interface makes data exploration easy
           – Fast even on large data sets

       • Cons:
           – Easy to visualize something that doesn’t make sense to look at
           – Price (for desktop tool)
Demo
Exploring Data

• Visualization tools
   – GeoCommons (GeoIQ)
       • http://geocommons.com/
       • Pros:
           – Intuitive interface
           – Analysis tools
           – Geocoding for up to 5,000 records
           – Supports KML (Google Maps) import & export

       • Cons:
           – US-only geocoding
Exploring Data

• Desktop GIS: Proprietary
   – Esri ArcGIS
      • Pros:
          – Industry standard
          – Many tools
          – Extensive training materials
          – Customer support

      • Cons:
          – Windows only
          – Potentially expensive *


            *
Exploring Data

• Visualization tools
   – ArcGIS Explorer online
       • http://www.arcgis.com/explorer/
       • Pros:
           – Supports many data formats
           – Online digitizing
           – Integration with other Esri services
           – Presentation view / mobile app

       • Cons:
           – Can’t export geocoded results
           – Geocoding limited to 250 records
Demo
Exploring Data

• Desktop GIS: Open Source
– Quantum GIS (QGIS)
– GRASS
– uDig
         • Pros:
             – Free
             – Multi-platform (Windows, Mac OS, Linux)

         • Cons:
             – Limited functionality (for advanced users)
             – Community-based support
Questions?
Contact Us


     Deborah Boyer
     OpenTreeMap Project Manager
     dboyer@azavea.com
     215.701.7506




    Jeremy Heffner
    Product Manager
    jheffner@azavea.com
    215.701.7712
Exploring Data Preparation and Visualization
           Tools for Urban Forestry



                  340 N 12th St, Suite 402
                  Philadelphia, PA 19107
                       215.925.2600
                    info@azavea.com
              www.azavea.com/opentreemap

Mais conteĂşdo relacionado

Mais procurados

Growing Your Urban Forest: Using the OpenTreeMap Bulk Uploader
Growing Your Urban Forest: Using the OpenTreeMap Bulk UploaderGrowing Your Urban Forest: Using the OpenTreeMap Bulk Uploader
Growing Your Urban Forest: Using the OpenTreeMap Bulk UploaderAzavea
 
Getting Started with OpenTreeMap Cloud
Getting Started with OpenTreeMap CloudGetting Started with OpenTreeMap Cloud
Getting Started with OpenTreeMap CloudAzavea
 
November 12, 2014 Webinar: Hackers, Beer Geeks, and Arborly Love - Reaching o...
November 12, 2014 Webinar: Hackers, Beer Geeks, and Arborly Love - Reaching o...November 12, 2014 Webinar: Hackers, Beer Geeks, and Arborly Love - Reaching o...
November 12, 2014 Webinar: Hackers, Beer Geeks, and Arborly Love - Reaching o...Azavea
 
Using New Tools to Analyze and Plan Your Urban Forest
Using New Tools to Analyze and Plan Your Urban Forest Using New Tools to Analyze and Plan Your Urban Forest
Using New Tools to Analyze and Plan Your Urban Forest Azavea
 
Sample presentation
Sample presentationSample presentation
Sample presentationcle Thompson
 
Sample presentation
Sample presentationSample presentation
Sample presentationsalazark
 
Custom Tools for Urban Forestry Nonprofits and Outreach July 12, 2016
Custom Tools for Urban Forestry Nonprofits and Outreach July 12, 2016Custom Tools for Urban Forestry Nonprofits and Outreach July 12, 2016
Custom Tools for Urban Forestry Nonprofits and Outreach July 12, 2016Chris Peiffer
 
Libby Hemphill, "Elected Officials and Social Media"
Libby Hemphill, "Elected Officials and Social Media"Libby Hemphill, "Elected Officials and Social Media"
Libby Hemphill, "Elected Officials and Social Media"summersocialwebshop
 

Mais procurados (10)

Growing Your Urban Forest: Using the OpenTreeMap Bulk Uploader
Growing Your Urban Forest: Using the OpenTreeMap Bulk UploaderGrowing Your Urban Forest: Using the OpenTreeMap Bulk Uploader
Growing Your Urban Forest: Using the OpenTreeMap Bulk Uploader
 
Getting Started with OpenTreeMap Cloud
Getting Started with OpenTreeMap CloudGetting Started with OpenTreeMap Cloud
Getting Started with OpenTreeMap Cloud
 
November 12, 2014 Webinar: Hackers, Beer Geeks, and Arborly Love - Reaching o...
November 12, 2014 Webinar: Hackers, Beer Geeks, and Arborly Love - Reaching o...November 12, 2014 Webinar: Hackers, Beer Geeks, and Arborly Love - Reaching o...
November 12, 2014 Webinar: Hackers, Beer Geeks, and Arborly Love - Reaching o...
 
Using New Tools to Analyze and Plan Your Urban Forest
Using New Tools to Analyze and Plan Your Urban Forest Using New Tools to Analyze and Plan Your Urban Forest
Using New Tools to Analyze and Plan Your Urban Forest
 
Sample presentation
Sample presentationSample presentation
Sample presentation
 
Sample presentation
Sample presentationSample presentation
Sample presentation
 
Custom Tools for Urban Forestry Nonprofits and Outreach July 12, 2016
Custom Tools for Urban Forestry Nonprofits and Outreach July 12, 2016Custom Tools for Urban Forestry Nonprofits and Outreach July 12, 2016
Custom Tools for Urban Forestry Nonprofits and Outreach July 12, 2016
 
CAPS GUIDELINES
CAPS GUIDELINESCAPS GUIDELINES
CAPS GUIDELINES
 
Lee rainie
Lee rainieLee rainie
Lee rainie
 
Libby Hemphill, "Elected Officials and Social Media"
Libby Hemphill, "Elected Officials and Social Media"Libby Hemphill, "Elected Officials and Social Media"
Libby Hemphill, "Elected Officials and Social Media"
 

Destaque

Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist SoftServe
 
Reinventing the Modern Information Pipeline: Paxata and MapR
Reinventing the Modern Information Pipeline: Paxata and MapRReinventing the Modern Information Pipeline: Paxata and MapR
Reinventing the Modern Information Pipeline: Paxata and MapRLilia Gutnik
 
Business Research Methods. data collection preparation and analysis
Business Research Methods. data collection preparation and analysisBusiness Research Methods. data collection preparation and analysis
Business Research Methods. data collection preparation and analysisAhsan Khan Eco (Superior College)
 
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine LearningData Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine LearningKai Wähner
 
Data Preparation and Processing
Data Preparation and ProcessingData Preparation and Processing
Data Preparation and ProcessingMehul Gondaliya
 

Destaque (6)

Data Preparation for Data Science
Data Preparation for Data ScienceData Preparation for Data Science
Data Preparation for Data Science
 
Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist
 
Reinventing the Modern Information Pipeline: Paxata and MapR
Reinventing the Modern Information Pipeline: Paxata and MapRReinventing the Modern Information Pipeline: Paxata and MapR
Reinventing the Modern Information Pipeline: Paxata and MapR
 
Business Research Methods. data collection preparation and analysis
Business Research Methods. data collection preparation and analysisBusiness Research Methods. data collection preparation and analysis
Business Research Methods. data collection preparation and analysis
 
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine LearningData Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
 
Data Preparation and Processing
Data Preparation and ProcessingData Preparation and Processing
Data Preparation and Processing
 

Semelhante a Exploring Data Preparation and Visualization Tools for Urban Forestry

NTEN Webinar - Data Cleaning and Visualization Tools for Nonprofits
NTEN Webinar - Data Cleaning and Visualization Tools for NonprofitsNTEN Webinar - Data Cleaning and Visualization Tools for Nonprofits
NTEN Webinar - Data Cleaning and Visualization Tools for NonprofitsAzavea
 
Exploring Collaborative Tree Inventory with OpenTreeMap
Exploring Collaborative Tree Inventory with OpenTreeMapExploring Collaborative Tree Inventory with OpenTreeMap
Exploring Collaborative Tree Inventory with OpenTreeMapAzavea
 
DataUp at ACRL 2013
DataUp at ACRL 2013DataUp at ACRL 2013
DataUp at ACRL 2013Carly Strasser
 
LoCloud Geolocation enrichment tools, Siri Slettvag, Asplan Viak Internet (Av...
LoCloud Geolocation enrichment tools, Siri Slettvag, Asplan Viak Internet (Av...LoCloud Geolocation enrichment tools, Siri Slettvag, Asplan Viak Internet (Av...
LoCloud Geolocation enrichment tools, Siri Slettvag, Asplan Viak Internet (Av...locloud
 
Implimenting and Mitigating Change with all of this Newfangled Technology
Implimenting and Mitigating Change with all of this Newfangled TechnologyImplimenting and Mitigating Change with all of this Newfangled Technology
Implimenting and Mitigating Change with all of this Newfangled TechnologyIndiana Online Users Group
 
Open Metrics for Open Repositories at OR2012
Open Metrics for Open Repositories at OR2012Open Metrics for Open Repositories at OR2012
Open Metrics for Open Repositories at OR2012Nick Sheppard
 
Bren - UCSB - Spooky spreadsheets
Bren - UCSB - Spooky spreadsheetsBren - UCSB - Spooky spreadsheets
Bren - UCSB - Spooky spreadsheetsCarly Strasser
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...SEAD
 
Software Programs for Data Analysis
Software Programs for Data AnalysisSoftware Programs for Data Analysis
Software Programs for Data Analysisunmgrc
 
Scoping call presentation_rev_4_mary
Scoping call presentation_rev_4_maryScoping call presentation_rev_4_mary
Scoping call presentation_rev_4_maryrhochambeau32
 
A Data Scientist Perspective on Data Curation in the Digital Era
A Data Scientist Perspective on Data Curation in the Digital EraA Data Scientist Perspective on Data Curation in the Digital Era
A Data Scientist Perspective on Data Curation in the Digital EraVicki Ferrini
 
Open Data Presentation
Open Data PresentationOpen Data Presentation
Open Data PresentationSaviour Sanders
 
Large scale computing
Large scale computing Large scale computing
Large scale computing Bhupesh Bansal
 
Ordering the chaos: Creating websites with imperfect data
Ordering the chaos: Creating websites with imperfect dataOrdering the chaos: Creating websites with imperfect data
Ordering the chaos: Creating websites with imperfect dataAndy Stretton
 
FAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM
 
Sp meetup 17 slidedeck
Sp meetup 17 slidedeckSp meetup 17 slidedeck
Sp meetup 17 slidedeckRic Centre
 
Open data for_resilience
Open data for_resilienceOpen data for_resilience
Open data for_resilienceDami Sonoiki
 
CTOs Perspective on Adding Geospatial and Location-based Information
CTOs Perspective on Adding Geospatial and Location-based InformationCTOs Perspective on Adding Geospatial and Location-based Information
CTOs Perspective on Adding Geospatial and Location-based InformationBradley Brown
 

Semelhante a Exploring Data Preparation and Visualization Tools for Urban Forestry (20)

NTEN Webinar - Data Cleaning and Visualization Tools for Nonprofits
NTEN Webinar - Data Cleaning and Visualization Tools for NonprofitsNTEN Webinar - Data Cleaning and Visualization Tools for Nonprofits
NTEN Webinar - Data Cleaning and Visualization Tools for Nonprofits
 
Exploring Collaborative Tree Inventory with OpenTreeMap
Exploring Collaborative Tree Inventory with OpenTreeMapExploring Collaborative Tree Inventory with OpenTreeMap
Exploring Collaborative Tree Inventory with OpenTreeMap
 
DataUp at ACRL 2013
DataUp at ACRL 2013DataUp at ACRL 2013
DataUp at ACRL 2013
 
LoCloud Geolocation enrichment tools, Siri Slettvag, Asplan Viak Internet (Av...
LoCloud Geolocation enrichment tools, Siri Slettvag, Asplan Viak Internet (Av...LoCloud Geolocation enrichment tools, Siri Slettvag, Asplan Viak Internet (Av...
LoCloud Geolocation enrichment tools, Siri Slettvag, Asplan Viak Internet (Av...
 
Implimenting and Mitigating Change with all of this Newfangled Technology
Implimenting and Mitigating Change with all of this Newfangled TechnologyImplimenting and Mitigating Change with all of this Newfangled Technology
Implimenting and Mitigating Change with all of this Newfangled Technology
 
Open Metrics for Open Repositories at OR2012
Open Metrics for Open Repositories at OR2012Open Metrics for Open Repositories at OR2012
Open Metrics for Open Repositories at OR2012
 
Bren - UCSB - Spooky spreadsheets
Bren - UCSB - Spooky spreadsheetsBren - UCSB - Spooky spreadsheets
Bren - UCSB - Spooky spreadsheets
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
 
Software Programs for Data Analysis
Software Programs for Data AnalysisSoftware Programs for Data Analysis
Software Programs for Data Analysis
 
Scoping call presentation_rev_4_mary
Scoping call presentation_rev_4_maryScoping call presentation_rev_4_mary
Scoping call presentation_rev_4_mary
 
A Data Scientist Perspective on Data Curation in the Digital Era
A Data Scientist Perspective on Data Curation in the Digital EraA Data Scientist Perspective on Data Curation in the Digital Era
A Data Scientist Perspective on Data Curation in the Digital Era
 
Hawaii Pacific GIS Conference 2012: LiDAR for Intrastructure and Terrian Mapp...
Hawaii Pacific GIS Conference 2012: LiDAR for Intrastructure and Terrian Mapp...Hawaii Pacific GIS Conference 2012: LiDAR for Intrastructure and Terrian Mapp...
Hawaii Pacific GIS Conference 2012: LiDAR for Intrastructure and Terrian Mapp...
 
About Scanning and Metadata Standards - NEMO 2010
About Scanning and Metadata Standards - NEMO 2010About Scanning and Metadata Standards - NEMO 2010
About Scanning and Metadata Standards - NEMO 2010
 
Open Data Presentation
Open Data PresentationOpen Data Presentation
Open Data Presentation
 
Large scale computing
Large scale computing Large scale computing
Large scale computing
 
Ordering the chaos: Creating websites with imperfect data
Ordering the chaos: Creating websites with imperfect dataOrdering the chaos: Creating websites with imperfect data
Ordering the chaos: Creating websites with imperfect data
 
FAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech Proposals
 
Sp meetup 17 slidedeck
Sp meetup 17 slidedeckSp meetup 17 slidedeck
Sp meetup 17 slidedeck
 
Open data for_resilience
Open data for_resilienceOpen data for_resilience
Open data for_resilience
 
CTOs Perspective on Adding Geospatial and Location-based Information
CTOs Perspective on Adding Geospatial and Location-based InformationCTOs Perspective on Adding Geospatial and Location-based Information
CTOs Perspective on Adding Geospatial and Location-based Information
 

Mais de Azavea

7 misconceptions about predictive policing webinar
7 misconceptions about predictive policing webinar7 misconceptions about predictive policing webinar
7 misconceptions about predictive policing webinarAzavea
 
Forecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San JoseForecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San JoseAzavea
 
HunchLab 2.0 Getting Started
HunchLab 2.0 Getting StartedHunchLab 2.0 Getting Started
HunchLab 2.0 Getting StartedAzavea
 
Is it a Package or a Wrapper? Designing, Documenting, and Distributing a Pyth...
Is it a Package or a Wrapper? Designing, Documenting, and Distributing a Pyth...Is it a Package or a Wrapper? Designing, Documenting, and Distributing a Pyth...
Is it a Package or a Wrapper? Designing, Documenting, and Distributing a Pyth...Azavea
 
Your New Partners: Understanding Civic Hackathons, Why You Should be Involved...
Your New Partners: Understanding Civic Hackathons, Why You Should be Involved...Your New Partners: Understanding Civic Hackathons, Why You Should be Involved...
Your New Partners: Understanding Civic Hackathons, Why You Should be Involved...Azavea
 
HunchLab 2.0 Predictive Missions: Under the Hood
HunchLab 2.0 Predictive Missions: Under the HoodHunchLab 2.0 Predictive Missions: Under the Hood
HunchLab 2.0 Predictive Missions: Under the HoodAzavea
 
HunchLab 2.0 Preview Webinar - Place
HunchLab 2.0 Preview Webinar - PlaceHunchLab 2.0 Preview Webinar - Place
HunchLab 2.0 Preview Webinar - PlaceAzavea
 
Five Technology Trends Every Nonprofit Needs to Know
Five Technology Trends Every Nonprofit Needs to KnowFive Technology Trends Every Nonprofit Needs to Know
Five Technology Trends Every Nonprofit Needs to KnowAzavea
 
Data Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) DataData Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) DataAzavea
 
Fed Geo Day - Applying GeoTrellis at the US Army Corps
Fed Geo Day - Applying GeoTrellis at the US Army CorpsFed Geo Day - Applying GeoTrellis at the US Army Corps
Fed Geo Day - Applying GeoTrellis at the US Army CorpsAzavea
 
Fed Geo Day - GeoTrellis Intro
Fed Geo Day - GeoTrellis IntroFed Geo Day - GeoTrellis Intro
Fed Geo Day - GeoTrellis IntroAzavea
 
Fed Geo Day 2013 - Azavea Intro
Fed Geo Day 2013 - Azavea Intro Fed Geo Day 2013 - Azavea Intro
Fed Geo Day 2013 - Azavea Intro Azavea
 
Modeling Count-based Raster Data with ArcGIS and R
Modeling Count-based Raster Data with ArcGIS and RModeling Count-based Raster Data with ArcGIS and R
Modeling Count-based Raster Data with ArcGIS and RAzavea
 
How to Conquer your Post-Election Data Chaos with the Cicero API
How to Conquer your Post-Election Data Chaos with the Cicero APIHow to Conquer your Post-Election Data Chaos with the Cicero API
How to Conquer your Post-Election Data Chaos with the Cicero APIAzavea
 
Exploring Mobile Technology with OpenTreeMap Mobile
Exploring Mobile Technology with OpenTreeMap MobileExploring Mobile Technology with OpenTreeMap Mobile
Exploring Mobile Technology with OpenTreeMap MobileAzavea
 
10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime Analysis10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime AnalysisAzavea
 

Mais de Azavea (16)

7 misconceptions about predictive policing webinar
7 misconceptions about predictive policing webinar7 misconceptions about predictive policing webinar
7 misconceptions about predictive policing webinar
 
Forecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San JoseForecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
 
HunchLab 2.0 Getting Started
HunchLab 2.0 Getting StartedHunchLab 2.0 Getting Started
HunchLab 2.0 Getting Started
 
Is it a Package or a Wrapper? Designing, Documenting, and Distributing a Pyth...
Is it a Package or a Wrapper? Designing, Documenting, and Distributing a Pyth...Is it a Package or a Wrapper? Designing, Documenting, and Distributing a Pyth...
Is it a Package or a Wrapper? Designing, Documenting, and Distributing a Pyth...
 
Your New Partners: Understanding Civic Hackathons, Why You Should be Involved...
Your New Partners: Understanding Civic Hackathons, Why You Should be Involved...Your New Partners: Understanding Civic Hackathons, Why You Should be Involved...
Your New Partners: Understanding Civic Hackathons, Why You Should be Involved...
 
HunchLab 2.0 Predictive Missions: Under the Hood
HunchLab 2.0 Predictive Missions: Under the HoodHunchLab 2.0 Predictive Missions: Under the Hood
HunchLab 2.0 Predictive Missions: Under the Hood
 
HunchLab 2.0 Preview Webinar - Place
HunchLab 2.0 Preview Webinar - PlaceHunchLab 2.0 Preview Webinar - Place
HunchLab 2.0 Preview Webinar - Place
 
Five Technology Trends Every Nonprofit Needs to Know
Five Technology Trends Every Nonprofit Needs to KnowFive Technology Trends Every Nonprofit Needs to Know
Five Technology Trends Every Nonprofit Needs to Know
 
Data Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) DataData Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) Data
 
Fed Geo Day - Applying GeoTrellis at the US Army Corps
Fed Geo Day - Applying GeoTrellis at the US Army CorpsFed Geo Day - Applying GeoTrellis at the US Army Corps
Fed Geo Day - Applying GeoTrellis at the US Army Corps
 
Fed Geo Day - GeoTrellis Intro
Fed Geo Day - GeoTrellis IntroFed Geo Day - GeoTrellis Intro
Fed Geo Day - GeoTrellis Intro
 
Fed Geo Day 2013 - Azavea Intro
Fed Geo Day 2013 - Azavea Intro Fed Geo Day 2013 - Azavea Intro
Fed Geo Day 2013 - Azavea Intro
 
Modeling Count-based Raster Data with ArcGIS and R
Modeling Count-based Raster Data with ArcGIS and RModeling Count-based Raster Data with ArcGIS and R
Modeling Count-based Raster Data with ArcGIS and R
 
How to Conquer your Post-Election Data Chaos with the Cicero API
How to Conquer your Post-Election Data Chaos with the Cicero APIHow to Conquer your Post-Election Data Chaos with the Cicero API
How to Conquer your Post-Election Data Chaos with the Cicero API
 
Exploring Mobile Technology with OpenTreeMap Mobile
Exploring Mobile Technology with OpenTreeMap MobileExploring Mobile Technology with OpenTreeMap Mobile
Exploring Mobile Technology with OpenTreeMap Mobile
 
10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime Analysis10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime Analysis
 

Último

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 

Último (20)

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 

Exploring Data Preparation and Visualization Tools for Urban Forestry

  • 1. Exploring Data Preparation and Visualization Tools for Urban Forestry 340 N 12th St, Suite 402 Philadelphia, PA 19107 215.925.2600 info@azavea.com www.azavea.com/opentreemap
  • 2. About Us Deborah Boyer OpenTreeMap Project Manager dboyer@azavea.com 215.701.7506 Jeremy Heffner Product Manager jheffner@azavea.com 215.701.7712
  • 3. About Azavea • Founded in 2000 • B Corporation • 30+ people • Based in Philadelphia – Boston office • Geospatial + web + mobile – Software development – Spatial analysis services – User experience
  • 4. Agenda • The Ideal: Gathering Organized, Perfect Data • The Reality: Cleaning and Preparing Your Data • Adding Context • Exploring, Preparing and Sharing Data Visualizations • Questions
  • 6. An open source tree data management system for collaborative, geography enabled urban tree inventory
  • 7. Main Features • Search and Explore Tree Data • View Ecosystem Benefits • Add New Trees • Edit and Update Trees • Upload Tree Photos • Track Stewardship Activities
  • 8.
  • 9. Data Quality Checks • Remove duplicate trees during data upload • Tree watch list • Drop down lists • User groups • Reputation points
  • 11. Data Cleaning: Your Questions • At what point in the data maintenance process do you find yourself cleaning data? • Are there ways that you would like to improve the workflow?
  • 12.
  • 13. Cleaning & Preparing Data • Making sense of data starts at the point of collection – Define what you want to measure / track • Clearly define schema and fields – Have a shared meaning for values – Data validation on entry – Collect your data – Examine results • Are there common mistakes you could prevent? • Are there different interpretations of fields? – Close the feedback loop & iterate
  • 14. Cleaning & Preparing Data • Common data quality issues – Combined fields • Address: “340 N 12th St, Suite 402 , Philadelphia, PA 19107” – Invalid entries • ZIP code: 1234 (length check, is number) • Age: 204 (reasonable range check, is number) – Format variations • State: PA vs. Pennsylvania (drop down or scrubbing rules) – Duplicates • CRM: John Smith with old and new addresses
  • 15. Cleaning & Preparing Data Not a reasonable option
  • 16. What does this have to do with trees? • We track things - tree inventories, potential planting sites, community groups, people who requested trees, etc . • Data comes from lots of places - web forms, collected by various staff, submitted by community groups. • None of it matches. • Good data makes our lives easier.
  • 17. Cleaning & Preparing Data • Tools to clean tabular data – Excel (or open source equivalent) • Pros: – Broad features – Widely utilized / common skill – Formulas / sorting / flexible • Cons: – Doesn’t understand record concept – Mass changes can be tedious
  • 18. Cleaning & Preparing Data • Tools to clean tabular data – DataWrangler • http://vis.stanford.edu/wrangler/ • Pros: – Focused on transforming data into relational format – Live previews • Cons: – Alpha quality version – Data size limits / online tool – Can be difficult to figure out what set of transforms are needed
  • 19. Cleaning & Preparing Data • Tools to clean tabular data – Google Refine • http://code.google.com/p/google-refine/ • Pros: – Understands record concept – Formulas / Facets – Undo capability – Windows / Mac / Linux • Cons: – There is a learning curve – Unusual type of app Âť Download, unzip, run exe file, access through browser
  • 20. Demo
  • 21. Assembling Data and Building Context
  • 22. Context: Your Questions • What challenges have you faced putting your data in context? • Are you struggling to identify what “context” means for your organization? • Do you know what data you’d like to use, but have trouble finding it?
  • 23. Your Data in Context • Your data is essential! • But it is more meaningful in context… – Ratios & rates • Service level • Market penetration – Indicators & trends • How you compare – Targeting • Key demographics Juice Analytics • Custom summaries
  • 24. What does this have to do with trees? • Trees don’t exist in a vacuum. • Contextual data = more effective outreach. • More info gives you new insights.
  • 25. Making Sense of the Census • American FactFinder • http://factfinder2.census.gov – Decennial Census • Every 10 years • Full population survey • Just 10 questions – American Community Survey (ACS) • Monthly sample • Aggregated over different time periods (1-, 3- and 5-year) • Extremely detailed questions • Subject to sampling error
  • 27. Helpers: Social Explorer • http://www.socialexplorer.com/ • Data Dictionary – Survey – Dataset – Table – Variable – Formula – Population
  • 28. Helpers: Social Explorer • Background – Key Terms – Collection Methodology – Uses & applications
  • 29. Helpers: ACS Alchemist • https://github.com/azavea/acs-alchemist  • Retrieval of block group-level data • Custom variable selection • Delivery in spatial data format ready for mapping This tool was developed by Azavea in collaboration with Jerry Ratcliffe and Ralph Taylor of Temple University Center for Security and Crime Science. This project was supported by Award No. 2010-DE-BX- K004, awarded by the National Institute of Justice, Office of Justice Programs, U.S. Department of Justice.
  • 30. Helpers: ACS Alchemist As easy as 1-2-3 1.Create a document with your selected variables
  • 31. Helpers: ACS Alchemist As easy as 1-2-3 1.Create a document with your selected variables 2.Pick your geographies
  • 32. Helpers: ACS Alchemist As easy as 1-2-3 1.Create a document with your selected variables 2.Pick your geographies and geolevels 3.Retrieve your shapefiles
  • 33. Other Sources • Public data – Open Data Portals • Federal, state & local data – Political Data • Voter data • Legislative boundaries • Commercial data – Population Projections – Consumer Data
  • 35. Data Visualization: Your Questions • Do you currently share data with your constituents? • Where do you use data visualizations (e.g. annual report, embedded infographics, live data trackers)? • Do you currently map your data?
  • 36. What does this have to do with trees? • Charts, graphs, maps, and photos help us tell a story. • Show that trees are more than just leaves and branches. • Explore the science without making people’s eyes glaze over.
  • 37. Exploring Data • Visualization tools – Tableau • http://www.tableausoftware.com/ • Pros: – Flexible interface makes data exploration easy – Fast even on large data sets • Cons: – Easy to visualize something that doesn’t make sense to look at – Price (for desktop tool)
  • 38. Demo
  • 39. Exploring Data • Visualization tools – GeoCommons (GeoIQ) • http://geocommons.com/ • Pros: – Intuitive interface – Analysis tools – Geocoding for up to 5,000 records – Supports KML (Google Maps) import & export • Cons: – US-only geocoding
  • 40. Exploring Data • Desktop GIS: Proprietary – Esri ArcGIS • Pros: – Industry standard – Many tools – Extensive training materials – Customer support • Cons: – Windows only – Potentially expensive * *
  • 41. Exploring Data • Visualization tools – ArcGIS Explorer online • http://www.arcgis.com/explorer/ • Pros: – Supports many data formats – Online digitizing – Integration with other Esri services – Presentation view / mobile app • Cons: – Can’t export geocoded results – Geocoding limited to 250 records
  • 42. Demo
  • 43. Exploring Data • Desktop GIS: Open Source – Quantum GIS (QGIS) – GRASS – uDig • Pros: – Free – Multi-platform (Windows, Mac OS, Linux) • Cons: – Limited functionality (for advanced users) – Community-based support
  • 45. Contact Us Deborah Boyer OpenTreeMap Project Manager dboyer@azavea.com 215.701.7506 Jeremy Heffner Product Manager jheffner@azavea.com 215.701.7712
  • 46. Exploring Data Preparation and Visualization Tools for Urban Forestry 340 N 12th St, Suite 402 Philadelphia, PA 19107 215.925.2600 info@azavea.com www.azavea.com/opentreemap