SlideShare uma empresa Scribd logo
1 de 16
Baixar para ler offline
Agenda
★ Introduction & Context
★ What? - Research Question
★ Data Description
★ Exploratory Data Analysis
★ Data Cleaning and Curation
★ Regression Analysis
★ Conclusion
WHYthis project
8,277,829
$14.3 billion
property crime offenses
in the nation in 2014.
estimated losses caused by
property crimes in 2014.
Source: https://www.fbi.gov/about-us/cjis/ucr/crime-in-the-u.s/2014/crime-in-the-u.s.-2014/offenses-known-to-law-enforcement/property-crime
WHATare we doing
Does crime in a neighborhood
affect the home values over time?
RESEARCH QUESTION
DATAdescription
★ 1.14 million records
★ Timeline 2010-15
★ 19 variables
★ 46 crime types
★ Time-series data
★ Dimensions: 5692 x 88
★ Year, Month, ZHVI,
Neighborhood, Region
Seattle 911 Response Zillow Data Research
EXPLORATORY
Data Analysis
EXPLORATORY
Data Analysis
DATACleaning & Curation Methods
Getting rid of unwanted records and columns
Reverse Geocoding Latitudes & Longitudes ~ High
Scale Conversion using Shapefiles in R
Data Transformation ~ Tidy Data (Method applied
Hadley Wickham - Journal of Statistical Software,
vol. 59, 2014.)
Data Aggregation by Year, Month & Neighborhood
Merging the clean datasets by Aggregated Matching
Ready for
Regression Analysis
DATACleaning - R Packages used
gpclib - General Polygon Clipping Library for R
rgeos - Interface to Geometry Engine - Open Source
(GEOS)
maptools - Reading / Handling Spatial Objects
rgdal - Bindings for the Geospatial Data Abstraction
Library
spatialEco - Spatial Analysis and Modelling
tidyr dplyrggplot2
Over 300 lines of R code written for the
project:
DATACleaning & Curation numbers
1.14 Million records narrowed down to 229K values
Reverse geocoded 229K different Latitude-
Longitude combinations and mapped them to 82
neighborhood zones
Zillow data transformed from dim [5692 x 88] to
[20592 x 4] dataframes (Reduce & Transpose)
229K values aggregated to 5904 rows (5 Years, 60
Months, 82 Neighborhoods, 5 Crime types)
Final Merged Dataset to append the Zillow Home
Value to each of the corresponding aggregated 911
value
Ready for
Regression Analysis
Data Analysis
Phew, Finally
Predicting home values as a function of
property crimes
WHAT IS OUR MODEL?
Crimes aggregated by month, year and
neighborhood with corresponding value
of house during the same period and
neighborhood
✔ Linear Relationship between
data
✔ Nearly normally-distributed
residuals
✔ Constant variability
✔ Independent observations
Data Analysis
Collinearity
Data Analysis
Multiple Regression
home.value = β0
+ β1
* Assault.Count + β2
* Arrest.Count + β3
* Burglary.Count + β4
* Disturbance.Count + β5
* Robbery.Count
Estimate Pr (>|t|) 95% Confidence Interval
(Intercept) 469028.28 < 2e-16 461599.3, 476457.2
Assault.Count 1401.19 5.18e-09 931.7, 1870.6
Arrest.Count -2278.03 < 2e-16 -2531.6, -2024.4
Burglary.Count -114.53 0.2973 -329.9, 100.87
Disturbance.Count 56.38 0.0273 6.3, 106.4
Robbery.Count -2413.51 < 2e-16 -2876.40, -1950.6
Multiple R-squared: 0.1106
Assault Count $ 1401.19
Arrest Count $ 2278.03
Burglary Count $ 114.53
Disturbance Count $ 56.38
Robbery Count $ 2413.51
Data Analysis
Model Interpretation
Data Analysis
Special Inferences
Since 2010, 4471 of total 17523 assault
cases reported in Downtown Seattle
Disturbance complaints have less or no correlation
with time or neighborhood ~ evenly distributed
SeattlersliketoParty!
Burglary - Unlawful entry to a structure to commit theft or a felony. In
order for burglary to take place, a victim does not have to be present.
Robbery - Take something from someone that has value by utilizing
intimidation, force or threat. In order for robbery to take place, a
victim must be present at the scene
You are less likely to be
arrested if you stay in
an expensive house!
Other Factors include:
■ Distance from Workplace,
UW and SeaTac Airport.
■ Access to Transportation &
Departmental Stores.
■ History of neighborhood
and past data.
Project Scope
Limitations and Key Learnings
Key Learnings
✔ Data Cleaning
✔ R Programming
✔ Open-ended
questions
✔ Data Science
Thank you!

Mais conteúdo relacionado

Semelhante a Home Value Patrol

Analyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis CountyAnalyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis CountyCOGS Presentations
 
Forecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San JoseForecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San JoseAzavea
 
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...Azavea
 
10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime Analysis10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime AnalysisAzavea
 
Data Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) DataData Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) DataAzavea
 

Semelhante a Home Value Patrol (6)

Analyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis CountyAnalyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis County
 
Forecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San JoseForecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
 
HackerOne_Report
HackerOne_ReportHackerOne_Report
HackerOne_Report
 
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
 
10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime Analysis10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime Analysis
 
Data Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) DataData Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) Data
 

Último

April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 

Último (20)

April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 

Home Value Patrol

  • 1.
  • 2. Agenda ★ Introduction & Context ★ What? - Research Question ★ Data Description ★ Exploratory Data Analysis ★ Data Cleaning and Curation ★ Regression Analysis ★ Conclusion
  • 3. WHYthis project 8,277,829 $14.3 billion property crime offenses in the nation in 2014. estimated losses caused by property crimes in 2014. Source: https://www.fbi.gov/about-us/cjis/ucr/crime-in-the-u.s/2014/crime-in-the-u.s.-2014/offenses-known-to-law-enforcement/property-crime
  • 4. WHATare we doing Does crime in a neighborhood affect the home values over time? RESEARCH QUESTION
  • 5. DATAdescription ★ 1.14 million records ★ Timeline 2010-15 ★ 19 variables ★ 46 crime types ★ Time-series data ★ Dimensions: 5692 x 88 ★ Year, Month, ZHVI, Neighborhood, Region Seattle 911 Response Zillow Data Research
  • 8. DATACleaning & Curation Methods Getting rid of unwanted records and columns Reverse Geocoding Latitudes & Longitudes ~ High Scale Conversion using Shapefiles in R Data Transformation ~ Tidy Data (Method applied Hadley Wickham - Journal of Statistical Software, vol. 59, 2014.) Data Aggregation by Year, Month & Neighborhood Merging the clean datasets by Aggregated Matching Ready for Regression Analysis
  • 9. DATACleaning - R Packages used gpclib - General Polygon Clipping Library for R rgeos - Interface to Geometry Engine - Open Source (GEOS) maptools - Reading / Handling Spatial Objects rgdal - Bindings for the Geospatial Data Abstraction Library spatialEco - Spatial Analysis and Modelling tidyr dplyrggplot2 Over 300 lines of R code written for the project:
  • 10. DATACleaning & Curation numbers 1.14 Million records narrowed down to 229K values Reverse geocoded 229K different Latitude- Longitude combinations and mapped them to 82 neighborhood zones Zillow data transformed from dim [5692 x 88] to [20592 x 4] dataframes (Reduce & Transpose) 229K values aggregated to 5904 rows (5 Years, 60 Months, 82 Neighborhoods, 5 Crime types) Final Merged Dataset to append the Zillow Home Value to each of the corresponding aggregated 911 value Ready for Regression Analysis
  • 11. Data Analysis Phew, Finally Predicting home values as a function of property crimes WHAT IS OUR MODEL? Crimes aggregated by month, year and neighborhood with corresponding value of house during the same period and neighborhood
  • 12. ✔ Linear Relationship between data ✔ Nearly normally-distributed residuals ✔ Constant variability ✔ Independent observations Data Analysis Collinearity
  • 13. Data Analysis Multiple Regression home.value = β0 + β1 * Assault.Count + β2 * Arrest.Count + β3 * Burglary.Count + β4 * Disturbance.Count + β5 * Robbery.Count Estimate Pr (>|t|) 95% Confidence Interval (Intercept) 469028.28 < 2e-16 461599.3, 476457.2 Assault.Count 1401.19 5.18e-09 931.7, 1870.6 Arrest.Count -2278.03 < 2e-16 -2531.6, -2024.4 Burglary.Count -114.53 0.2973 -329.9, 100.87 Disturbance.Count 56.38 0.0273 6.3, 106.4 Robbery.Count -2413.51 < 2e-16 -2876.40, -1950.6 Multiple R-squared: 0.1106
  • 14. Assault Count $ 1401.19 Arrest Count $ 2278.03 Burglary Count $ 114.53 Disturbance Count $ 56.38 Robbery Count $ 2413.51 Data Analysis Model Interpretation
  • 15. Data Analysis Special Inferences Since 2010, 4471 of total 17523 assault cases reported in Downtown Seattle Disturbance complaints have less or no correlation with time or neighborhood ~ evenly distributed SeattlersliketoParty! Burglary - Unlawful entry to a structure to commit theft or a felony. In order for burglary to take place, a victim does not have to be present. Robbery - Take something from someone that has value by utilizing intimidation, force or threat. In order for robbery to take place, a victim must be present at the scene You are less likely to be arrested if you stay in an expensive house!
  • 16. Other Factors include: ■ Distance from Workplace, UW and SeaTac Airport. ■ Access to Transportation & Departmental Stores. ■ History of neighborhood and past data. Project Scope Limitations and Key Learnings Key Learnings ✔ Data Cleaning ✔ R Programming ✔ Open-ended questions ✔ Data Science Thank you!