SlideShare a Scribd company logo
1 of 25
Breathing life
into a travel survey
Roland Martin
Our client operates a large campus site
in California. There is a limit to the
number of vehicles that can enter their
site at any given time.
Every year, they carry out a travel
survey, with 20 prizes, including a grand
prize of $2,000.
Around 90% of site users responded.
The survey
1. Please select the answer that applies to you.
• I work on the main campus
• I work at an off-campus site
• None of the above
If “none of the above is selected” then skip to end of survey.
2. How do you usually commute to campus?
• Drive alone in a passenger automobile
• Bicycle
• Carpool (two or more people)
• Campus Shuttle (not as a transfer from another mode)
• Motorcycle
• Public transit (e.g. bus)
• Walk
Raw data
Required output
• A database (PostgreSQL / PostGIS)
• Access via desktop GIS (ArcGIS / QGIS)
• Possibly some kind of dashboard…
• Insight
Cleaning up the non-geographic data
• A lot of questions are not fully expressed in column headers
• Several column headers are repeated
• Not all questions are answered
• Some responses are “Other”, with the response in a second column
• Some questions allow multiple responses, so responses are collected
into multiple columns
• A lot of questions are not of interest for this study
Step 1: extract the schema
ResponseID SchemaID
ExternalDataReference q1
Affiliation q2
Staff class/Class level q3
CAC q5
Home lat q6
Home lon q7
FPC q8
Please select the answer that applies to you. q9
Although you are not eligible for this survey, we thank you for / your time and interest. q10
Where on campus is your primary work site? Please click on one location. - X q11
Where on campus is your primary work site? Please click on one location. - Y q12
Which of the following is your off-campus work / site?(Note: Consider your commute to/from / this loc... q13
Which of the following is your off-campus work / site?(Note: Consider your commute to/from / this loc...-TEXT q14
From what city and zip code do you typically begin your / commute? / If you choose to, please also p...-City q16
From what city and zip code do you typically begin your / commute? / If you choose to, please also p...-Zip q17
From what city and zip code do you typically begin your / commute? / If you choose to, please also p...-Cross street 1 q18
From what city and zip code do you typically begin your / commute? / If you choose to, please also p...-Cross street 2 q19
How do you usually commute to campus/work? (i.e., / What is the primary mode you use during your ty... q20
How do you usually commute to campus/work? (i.e., / What is the primary mode you use during your ty...-TEXT q21
Which transit system do you use for the longest distance of your / commute? q22
Which transit system do you use for the longest distance of your / commute?-TEXT q23
How do you usually get to the bus stop or train station from / your home? q24
How do you usually get to the bus stop or train station from / your home?-TEXT q25
Step 1: extract the schema
Step 2: make JSON
{
"4":"A1",
"5":"No",
"6":"No",
"7":31.2345,
"8":-120.1234,
"10":"None of the above",
"11":1,
"Zip":90210,
"City":"Santa Claus",
"State":"CA",
"Staff_Class":"A1"
}
Step 3: populate database
Step 4: set up database with more columns
alter table survey_data
alter column raw_survey_data type jsonb using raw_survey_data::jsonb
, add column survey_date date
, add column response_id text
, add column affiliation text
, add column staff_class_level text
, add column work_location_description text
, add column geom_work geometry
, add column commute_start_city text
, add column commute_start_zip text
, add column commute_mode_primary text
, add column commute_mode_transit text
, add column commute_mode_transit_access_home text
, add column commute_mode_transit_access_work text
, add column typical_station_home text
, add column typical_station_work text
, add column commute_regularity text
, add column commute_mode_primary_regularity text
, add column typical_work_time_arrive text
, add column typical_work_time_depart text
Step 5: populate data columns
update survey_data
set
survey_date = '2016-01-05'
, response_id = raw_survey_data ->> '1'
, external_data_reference = (raw_survey_data ->> '2')::integer
, affiliation = raw_survey_data ->> '3'
, staff_class_level = raw_survey_data ->> '4'
, work_location_description = raw_survey_data ->> '10'
, geom_work = null
, live_on_campus = (raw_survey_data ->> '16')::boolean
, commute_start_city = raw_survey_data ->> '17'
, commute_start_zip = raw_survey_data ->> '18'
, commute_mode_primary = case when
upper(raw_survey_data ->> '21')
not like 'OTHER%'
then
raw_survey_data ->> '21'
else
concat(raw_survey_data ->> '21', ' ',
raw_survey_data ->> '22')
end
Cleaning up the non-geographic data
A lot of questions are not fully expressed in
column headers Schema loaded separately into database
Several column headers are repeated
Not all questions are answered
PostgreSQL’s JSON capabilities used to
reduce data volumes while still retaining
raw data
Some responses are “Other”, with the
response in a second column Questions automatically and manually
assessed to ensure data integritySome questions allow multiple responses,
so responses are collected into multiple
columns
A lot of questions are not of interest for this
study
Some questions are ignored for this study
The story so far…
Transport Scotland: http://www.transport.gov.scot/report/j9425-10.htm
Two questions were very creatively
posed: where do you work on campus,
and where do you park?
Where do you work on campus?
Linford Nursing: http://www.linfield.edu/portland/about-portland/location/campus-map.html
580 194
502 270
41,616,668,701,171,800 3,360,666,809,082,030
504,5 3,298,699,951,171,870
563 250
4,951,040,344,238,280 3,744,800,109,863,280
486,5 3,238,699,951,171,870
507 3,296,800,231,933,590
487,5 3,748,699,951,171,870
463,5 2,688,699,951,171,870
414,5 2,808,699,951,171,870
584 196
46,261,907,958,984,300 2,562,952,575,683,590
Step 1: create links table for main map
Linford Nursing: http://www.linfield.edu/portland/about-portland/location/campus-map.html, map data © 2016 Google
Step 2: create links table for inset map
Linford Nursing: http://www.linfield.edu/portland/about-portland/location/campus-map.html
225.22 -86.66 -14960212.79 4051860.95
153.83 -205.26 -14960403.95 4051498.46
52.06 -91.52 -14960759.31 4051780.99
48.24 -95.43 -14960771.54 4051771.35
58.99 -97.74 -14960735.30 4051764.56
53.99 -101.68 -14960749.27 4051750.63
91.17 -61.12 -14960584.69 4051890.29
151.37 -213.02 -14960403.08 4051498.46
153.84 -205.21 -14960400.46 4051508.46
167.14 -152.50 -14960349.38 4051670.98
166.24 -158.44 -14960368.15 4051650.62
170.70 -146.17 -14960338.90 4051692.41
181.56 -158.92 -14960314.89 4051669.55
179.37 -161.54 -14960324.50 4051649.55
211.50 -94.75 -14960229.33 4051838.50
256.58 -53.58 -14960082.20 4051964.59
250.41 -46.34 -14960093.99 4051972.45
262.27 -48.50 -14960078.71 4051970.30
254.56 -42.21 -14960088.32 4051977.80
369.27 -29.80 -14959562.69 4052246.41
367.87 -17.80 -14959562.69 4052256.77
362.14 -24.05 -14959567.93 4052250.70
373.59 -22.93 -14959557.45 4052252.48
36.34 -247.21 -14960790.33 4051387.24
51.50 -209.42 -14960743.18 4051481.06
Step 3: create a mask
Linford Nursing: http://www.linfield.edu/portland/about-portland/location/campus-map.html
Main map
Inset map
Step 4: fun with FME
Step 5: fun with QGIS
Linford Nursing: http://www.linfield.edu/portland/about-portland/location/campus-map.html
Interactive
Dashboard
Thank you!
roland.martin@arup.com

More Related Content

Similar to Breathing Life into a Travel Survey

Transportation planning
Transportation planningTransportation planning
Transportation planningNikolaopi2
 
Euro30 2019 - Benchmarking tree approaches on street data
Euro30 2019 - Benchmarking tree approaches on street dataEuro30 2019 - Benchmarking tree approaches on street data
Euro30 2019 - Benchmarking tree approaches on street dataFabion Kauker
 
Exercise type detection
Exercise type detectionExercise type detection
Exercise type detectionGoDataDriven
 
ML MODULE 1_slideshare.pdf
ML MODULE 1_slideshare.pdfML MODULE 1_slideshare.pdf
ML MODULE 1_slideshare.pdfShiwani Gupta
 
Clickstream data with spark
Clickstream data with sparkClickstream data with spark
Clickstream data with sparkMarissa Saunders
 
Logistics Data Analyst Internship RRD
Logistics Data Analyst Internship RRDLogistics Data Analyst Internship RRD
Logistics Data Analyst Internship RRDKatie Harvey
 
Automated essay scoring: an introduction to grading essays with NLP and AI
Automated essay scoring: an introduction to grading essays with NLP and AIAutomated essay scoring: an introduction to grading essays with NLP and AI
Automated essay scoring: an introduction to grading essays with NLP and AINathan Thompson
 

Similar to Breathing Life into a Travel Survey (7)

Transportation planning
Transportation planningTransportation planning
Transportation planning
 
Euro30 2019 - Benchmarking tree approaches on street data
Euro30 2019 - Benchmarking tree approaches on street dataEuro30 2019 - Benchmarking tree approaches on street data
Euro30 2019 - Benchmarking tree approaches on street data
 
Exercise type detection
Exercise type detectionExercise type detection
Exercise type detection
 
ML MODULE 1_slideshare.pdf
ML MODULE 1_slideshare.pdfML MODULE 1_slideshare.pdf
ML MODULE 1_slideshare.pdf
 
Clickstream data with spark
Clickstream data with sparkClickstream data with spark
Clickstream data with spark
 
Logistics Data Analyst Internship RRD
Logistics Data Analyst Internship RRDLogistics Data Analyst Internship RRD
Logistics Data Analyst Internship RRD
 
Automated essay scoring: an introduction to grading essays with NLP and AI
Automated essay scoring: an introduction to grading essays with NLP and AIAutomated essay scoring: an introduction to grading essays with NLP and AI
Automated essay scoring: an introduction to grading essays with NLP and AI
 

More from Safe Software

Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsSafe Software
 
The Critical Role of Spatial Data in Today's Data Ecosystem
The Critical Role of Spatial Data in Today's Data EcosystemThe Critical Role of Spatial Data in Today's Data Ecosystem
The Critical Role of Spatial Data in Today's Data EcosystemSafe Software
 
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial DataCloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial DataSafe Software
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration WorkflowsIgniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration WorkflowsSafe Software
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightSafe Software
 
Mastering MicroStation DGN: How to Integrate CAD and GIS
Mastering MicroStation DGN: How to Integrate CAD and GISMastering MicroStation DGN: How to Integrate CAD and GIS
Mastering MicroStation DGN: How to Integrate CAD and GISSafe Software
 
Geospatial Synergy: Amplifying Efficiency with FME & Esri
Geospatial Synergy: Amplifying Efficiency with FME & EsriGeospatial Synergy: Amplifying Efficiency with FME & Esri
Geospatial Synergy: Amplifying Efficiency with FME & EsriSafe Software
 
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdfIntroducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdfSafe Software
 
Breaking Barriers & Leveraging the Latest Developments in AI Technology
Breaking Barriers & Leveraging the Latest Developments in AI TechnologyBreaking Barriers & Leveraging the Latest Developments in AI Technology
Breaking Barriers & Leveraging the Latest Developments in AI TechnologySafe Software
 
Best Practices to Navigating Data and Application Integration for the Enterpr...
Best Practices to Navigating Data and Application Integration for the Enterpr...Best Practices to Navigating Data and Application Integration for the Enterpr...
Best Practices to Navigating Data and Application Integration for the Enterpr...Safe Software
 
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial DataCloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial DataSafe Software
 
New Year's Fireside Chat with Safe Software’s Founders
New Year's Fireside Chat with Safe Software’s FoundersNew Year's Fireside Chat with Safe Software’s Founders
New Year's Fireside Chat with Safe Software’s FoundersSafe Software
 
Taking Off with FME: Elevating Airport Operations to New Heights
Taking Off with FME: Elevating Airport Operations to New HeightsTaking Off with FME: Elevating Airport Operations to New Heights
Taking Off with FME: Elevating Airport Operations to New HeightsSafe Software
 
Initiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance StrategyInitiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance StrategySafe Software
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Safe Software
 
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...Safe Software
 
Mastering DevOps-Driven Data Integration with FME
Mastering DevOps-Driven Data Integration with FMEMastering DevOps-Driven Data Integration with FME
Mastering DevOps-Driven Data Integration with FMESafe Software
 
Identification of landscape features eligible for EU subsidy - Analysis of la...
Identification of landscape features eligible for EU subsidy - Analysis of la...Identification of landscape features eligible for EU subsidy - Analysis of la...
Identification of landscape features eligible for EU subsidy - Analysis of la...Safe Software
 
Utilizing FME as an API Test Framework
Utilizing FME as an API Test Framework Utilizing FME as an API Test Framework
Utilizing FME as an API Test Framework Safe Software
 
Bringing Open Data Integration to the (SWECO) People
Bringing Open Data Integration to the (SWECO) PeopleBringing Open Data Integration to the (SWECO) People
Bringing Open Data Integration to the (SWECO) PeopleSafe Software
 

More from Safe Software (20)

Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data Streams
 
The Critical Role of Spatial Data in Today's Data Ecosystem
The Critical Role of Spatial Data in Today's Data EcosystemThe Critical Role of Spatial Data in Today's Data Ecosystem
The Critical Role of Spatial Data in Today's Data Ecosystem
 
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial DataCloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration WorkflowsIgniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
Mastering MicroStation DGN: How to Integrate CAD and GIS
Mastering MicroStation DGN: How to Integrate CAD and GISMastering MicroStation DGN: How to Integrate CAD and GIS
Mastering MicroStation DGN: How to Integrate CAD and GIS
 
Geospatial Synergy: Amplifying Efficiency with FME & Esri
Geospatial Synergy: Amplifying Efficiency with FME & EsriGeospatial Synergy: Amplifying Efficiency with FME & Esri
Geospatial Synergy: Amplifying Efficiency with FME & Esri
 
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdfIntroducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
 
Breaking Barriers & Leveraging the Latest Developments in AI Technology
Breaking Barriers & Leveraging the Latest Developments in AI TechnologyBreaking Barriers & Leveraging the Latest Developments in AI Technology
Breaking Barriers & Leveraging the Latest Developments in AI Technology
 
Best Practices to Navigating Data and Application Integration for the Enterpr...
Best Practices to Navigating Data and Application Integration for the Enterpr...Best Practices to Navigating Data and Application Integration for the Enterpr...
Best Practices to Navigating Data and Application Integration for the Enterpr...
 
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial DataCloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
 
New Year's Fireside Chat with Safe Software’s Founders
New Year's Fireside Chat with Safe Software’s FoundersNew Year's Fireside Chat with Safe Software’s Founders
New Year's Fireside Chat with Safe Software’s Founders
 
Taking Off with FME: Elevating Airport Operations to New Heights
Taking Off with FME: Elevating Airport Operations to New HeightsTaking Off with FME: Elevating Airport Operations to New Heights
Taking Off with FME: Elevating Airport Operations to New Heights
 
Initiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance StrategyInitiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance Strategy
 
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
 
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
Geospatial Synergy: Amplifying Efficiency with FME & Esri ft. Peak Guest Spea...
 
Mastering DevOps-Driven Data Integration with FME
Mastering DevOps-Driven Data Integration with FMEMastering DevOps-Driven Data Integration with FME
Mastering DevOps-Driven Data Integration with FME
 
Identification of landscape features eligible for EU subsidy - Analysis of la...
Identification of landscape features eligible for EU subsidy - Analysis of la...Identification of landscape features eligible for EU subsidy - Analysis of la...
Identification of landscape features eligible for EU subsidy - Analysis of la...
 
Utilizing FME as an API Test Framework
Utilizing FME as an API Test Framework Utilizing FME as an API Test Framework
Utilizing FME as an API Test Framework
 
Bringing Open Data Integration to the (SWECO) People
Bringing Open Data Integration to the (SWECO) PeopleBringing Open Data Integration to the (SWECO) People
Bringing Open Data Integration to the (SWECO) People
 

Recently uploaded

Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 

Recently uploaded (20)

Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 

Breathing Life into a Travel Survey

  • 1. Breathing life into a travel survey Roland Martin
  • 2. Our client operates a large campus site in California. There is a limit to the number of vehicles that can enter their site at any given time.
  • 3. Every year, they carry out a travel survey, with 20 prizes, including a grand prize of $2,000.
  • 4. Around 90% of site users responded.
  • 5. The survey 1. Please select the answer that applies to you. • I work on the main campus • I work at an off-campus site • None of the above If “none of the above is selected” then skip to end of survey. 2. How do you usually commute to campus? • Drive alone in a passenger automobile • Bicycle • Carpool (two or more people) • Campus Shuttle (not as a transfer from another mode) • Motorcycle • Public transit (e.g. bus) • Walk
  • 7. Required output • A database (PostgreSQL / PostGIS) • Access via desktop GIS (ArcGIS / QGIS) • Possibly some kind of dashboard… • Insight
  • 8. Cleaning up the non-geographic data • A lot of questions are not fully expressed in column headers • Several column headers are repeated • Not all questions are answered • Some responses are “Other”, with the response in a second column • Some questions allow multiple responses, so responses are collected into multiple columns • A lot of questions are not of interest for this study
  • 9. Step 1: extract the schema ResponseID SchemaID ExternalDataReference q1 Affiliation q2 Staff class/Class level q3 CAC q5 Home lat q6 Home lon q7 FPC q8 Please select the answer that applies to you. q9 Although you are not eligible for this survey, we thank you for / your time and interest. q10 Where on campus is your primary work site? Please click on one location. - X q11 Where on campus is your primary work site? Please click on one location. - Y q12 Which of the following is your off-campus work / site?(Note: Consider your commute to/from / this loc... q13 Which of the following is your off-campus work / site?(Note: Consider your commute to/from / this loc...-TEXT q14 From what city and zip code do you typically begin your / commute? / If you choose to, please also p...-City q16 From what city and zip code do you typically begin your / commute? / If you choose to, please also p...-Zip q17 From what city and zip code do you typically begin your / commute? / If you choose to, please also p...-Cross street 1 q18 From what city and zip code do you typically begin your / commute? / If you choose to, please also p...-Cross street 2 q19 How do you usually commute to campus/work? (i.e., / What is the primary mode you use during your ty... q20 How do you usually commute to campus/work? (i.e., / What is the primary mode you use during your ty...-TEXT q21 Which transit system do you use for the longest distance of your / commute? q22 Which transit system do you use for the longest distance of your / commute?-TEXT q23 How do you usually get to the bus stop or train station from / your home? q24 How do you usually get to the bus stop or train station from / your home?-TEXT q25
  • 10. Step 1: extract the schema
  • 11. Step 2: make JSON { "4":"A1", "5":"No", "6":"No", "7":31.2345, "8":-120.1234, "10":"None of the above", "11":1, "Zip":90210, "City":"Santa Claus", "State":"CA", "Staff_Class":"A1" }
  • 12. Step 3: populate database
  • 13. Step 4: set up database with more columns alter table survey_data alter column raw_survey_data type jsonb using raw_survey_data::jsonb , add column survey_date date , add column response_id text , add column affiliation text , add column staff_class_level text , add column work_location_description text , add column geom_work geometry , add column commute_start_city text , add column commute_start_zip text , add column commute_mode_primary text , add column commute_mode_transit text , add column commute_mode_transit_access_home text , add column commute_mode_transit_access_work text , add column typical_station_home text , add column typical_station_work text , add column commute_regularity text , add column commute_mode_primary_regularity text , add column typical_work_time_arrive text , add column typical_work_time_depart text
  • 14. Step 5: populate data columns update survey_data set survey_date = '2016-01-05' , response_id = raw_survey_data ->> '1' , external_data_reference = (raw_survey_data ->> '2')::integer , affiliation = raw_survey_data ->> '3' , staff_class_level = raw_survey_data ->> '4' , work_location_description = raw_survey_data ->> '10' , geom_work = null , live_on_campus = (raw_survey_data ->> '16')::boolean , commute_start_city = raw_survey_data ->> '17' , commute_start_zip = raw_survey_data ->> '18' , commute_mode_primary = case when upper(raw_survey_data ->> '21') not like 'OTHER%' then raw_survey_data ->> '21' else concat(raw_survey_data ->> '21', ' ', raw_survey_data ->> '22') end
  • 15. Cleaning up the non-geographic data A lot of questions are not fully expressed in column headers Schema loaded separately into database Several column headers are repeated Not all questions are answered PostgreSQL’s JSON capabilities used to reduce data volumes while still retaining raw data Some responses are “Other”, with the response in a second column Questions automatically and manually assessed to ensure data integritySome questions allow multiple responses, so responses are collected into multiple columns A lot of questions are not of interest for this study Some questions are ignored for this study
  • 16. The story so far… Transport Scotland: http://www.transport.gov.scot/report/j9425-10.htm
  • 17. Two questions were very creatively posed: where do you work on campus, and where do you park?
  • 18. Where do you work on campus? Linford Nursing: http://www.linfield.edu/portland/about-portland/location/campus-map.html 580 194 502 270 41,616,668,701,171,800 3,360,666,809,082,030 504,5 3,298,699,951,171,870 563 250 4,951,040,344,238,280 3,744,800,109,863,280 486,5 3,238,699,951,171,870 507 3,296,800,231,933,590 487,5 3,748,699,951,171,870 463,5 2,688,699,951,171,870 414,5 2,808,699,951,171,870 584 196 46,261,907,958,984,300 2,562,952,575,683,590
  • 19. Step 1: create links table for main map Linford Nursing: http://www.linfield.edu/portland/about-portland/location/campus-map.html, map data © 2016 Google
  • 20. Step 2: create links table for inset map Linford Nursing: http://www.linfield.edu/portland/about-portland/location/campus-map.html 225.22 -86.66 -14960212.79 4051860.95 153.83 -205.26 -14960403.95 4051498.46 52.06 -91.52 -14960759.31 4051780.99 48.24 -95.43 -14960771.54 4051771.35 58.99 -97.74 -14960735.30 4051764.56 53.99 -101.68 -14960749.27 4051750.63 91.17 -61.12 -14960584.69 4051890.29 151.37 -213.02 -14960403.08 4051498.46 153.84 -205.21 -14960400.46 4051508.46 167.14 -152.50 -14960349.38 4051670.98 166.24 -158.44 -14960368.15 4051650.62 170.70 -146.17 -14960338.90 4051692.41 181.56 -158.92 -14960314.89 4051669.55 179.37 -161.54 -14960324.50 4051649.55 211.50 -94.75 -14960229.33 4051838.50 256.58 -53.58 -14960082.20 4051964.59 250.41 -46.34 -14960093.99 4051972.45 262.27 -48.50 -14960078.71 4051970.30 254.56 -42.21 -14960088.32 4051977.80 369.27 -29.80 -14959562.69 4052246.41 367.87 -17.80 -14959562.69 4052256.77 362.14 -24.05 -14959567.93 4052250.70 373.59 -22.93 -14959557.45 4052252.48 36.34 -247.21 -14960790.33 4051387.24 51.50 -209.42 -14960743.18 4051481.06
  • 21. Step 3: create a mask Linford Nursing: http://www.linfield.edu/portland/about-portland/location/campus-map.html Main map Inset map
  • 22. Step 4: fun with FME
  • 23. Step 5: fun with QGIS Linford Nursing: http://www.linfield.edu/portland/about-portland/location/campus-map.html