SlideShare uma empresa Scribd logo
1 de 76
Baixar para ler offline
Click to edit Master title style
Click to edit Master subtitle style
1
How were these made?
Turning dirty data into something usable at NCEI
Jared Rennie and Scott Stevens
SAMSI Undergraduate Workshop
October 24th, 2017
2
Outline
• Introduction
• Overview of NCEI / CICS
• How did these get made?
– Temperature Extremes
– Rainfall Estimates
• Questions
3
What We’re Taught
Name How Tall are You (inches)?
Adam 66.2
Barbara 63.4
Clint 69.5
Debra 65.3
Easton 71.5
Frances 68.5
George 64.2
Hilda 61.2
Idris 74.5
4
What We Get
Name How Tall are You (inches)?
Adam 66 inches
Barbara 63.4287
Clint 176.5
Debra
Easton 41.5
Frances 68.5
George 5’4”
Hilda 61.2
Idris 210 lbs
5
What We Get
Name How Tall are You (inches)?
Adam 66 inches Added on some extra units
Barbara 63.4287 Has too much confidence in her
ruler’s precision
Clint 176.5 Gave it in cm instead of inches
Debra Just didn’t answer
Easton 41.5 Made a typo
Frances 68.5
George 5’4” Ignored the instructions
Hilda 61.2
Idris 210 lbs Answered a different question
entirely
6
Yesterday’s Demo
7
Yesterday’s Demo
8
Another Example
9
Another Example
10
TL;DR
• A lot of work goes into our data before most
scientists ever even see it
• Most of the “controversy” regarding climate
change isn’t around the analysis, but the data
• What we start with is messy and of little
value to most scientists, so we have to make
it more user-friendly
– And we have to carefully document how
11
Expectation vs Reality
12
Expectation vs Reality
13
Expectation vs Reality
14
National Centers for Environmental information
• Located in downtown Asheville, NC.
• Holds over 30 Petabytes of weather and
climate data from all over the world.
15
Cooperative Institute for Climate and Satellites
• Research institution working with NCEI.
• Made up of meteorologists, software
developers, and science communicators.
• Vision
– Inspire cutting-edge research and collaboration.
– Advance understanding of the current and
future state of the climate.
– Engage with business, academia, government,
and the public to enhance decision making.
16
CICS-NC Highlights
National Climate Assessment
17
CICS-NC Highlights
NEXRAD Reprocessing
18
CICS-NC Highlights
Satellite Based Products
19
That’s great… but
• These are the final products.
• A lot of hard work goes into generating these
“pretty images.”
– Math, statistics, meteorology, physics, orbital
mechanics, instrumentation, engineering, computer
programming, visualization, communication.
• And most importantly…
20
Documentation!
21
Goal of this talk
• Go into great, statistical detail how the following
images were made.
22
How did this get made?
Jared
23
Surface Temperature: The Beginnings
• The instrumental record of temperature
has its roots in the development of
universal temperature scales in the 18th
century.
– Monthly mean temperature series for De
Bilt, Netherlands extends from 1706 to
the present.
– Several other long European series exist
going back over 200 years.
• Throughout the 1800s measurements
expanded across other continents.
– National Meteorological and Hydrological
Services (NMHS) around the world have
operated networks to support weather
and climate observations since the late
19th
Century.
24
Surface Temperature: The Beginnings
• In the 1980’s and 1990’s, major efforts were
made to collect observations and create
consolidated global datasets.
– Global Historical Climatology Network – Monthly
(GHCN-M)
• Focus shifted to daily data in the 2000’s
– Global Historical Climatology Network – Daily
(GHCN-D)
25
Surface Temperature: The Beginnings
• GHCN-M
– In-Situ data for 7,280 stations in 226 countries.
– Monthly maximum, minimum, and average
temperature.
– Automated ingest, qc, and corrected for bias.
• GHCN-D
– Integrated database of daily climate summaries.
– TMAX, TMIN, PRCP, SNOW, SNWD, others.
– 100,000 stations globally (~30,000 temperature).
26
Surface Temperature: The Beginnings
• GHCN, along with other datasets, are the
foundation for understanding trends in global
surface temperature.
Land Land + Ocean
NCEI GHCN-M /
GHCN-D
NOAAGlobalTemp
UK Met
Office
CRUTEM HADCRUT
NASA GISS GISS
27
GHCN Flow Diagram
28
How did this get made?
1. Collect weather stations from around the globe.
2. Merge similar stations together.
3. Run through quality control.
4. Perform homogenization.
5. Grid station data.
6. Create departure from average.
29
How did this get made?
1. Collect weather stations from around the globe.
2. Merge similar stations together.
3. Run through quality control.
4. Perform homogenization.
5. Grid station data.
6. Create departure from average.
30
Data sources
31
Data sources
32
Data sources
33
How did this get made?
1. Collect weather stations from around the globe.
2. Merge similar stations together.
3. Run through quality control.
4. Perform homogenization.
5. Grid station data.
6. Create departure from average.
34
Merge Program
• Iterative process between the first source, and all
candidate sources.
• Compare each candidate station to all the target
stations, and determine one of the following three
possibilities:
– Candidate station should merge with target station.
– Candidate station is unique, and should be added
as a new station in the target source.
– Not enough information, station is withheld.
Reference: Rennie et al. (2014)
35
Merge Program
• Metadata Tests
– Probability of station match is calculated based
upon three criteria:
• Geographic distance
• Elevation difference
• Station name similarity using Jaccard Index
– Intersection divided by Union
– Final weighted probability is calculated, and if it
passes this test, moves on to second test, otherwise
it is withheld.
Reference: Rennie et al. (2014)
36
Merge Program
• Data Tests
– Compare overlapping data using Index of
Agreement.
– From IA, calculate probability of station match and
probability of station uniqueness.
– Used to determine final fate of station.
Reference: Rennie et al. (2014)
37
Number of Stations
38
Number of Stations
39
How did this get made?
1. Collect weather stations from around the globe.
2. Merge similar stations together.
3. Run through quality control.
4. Perform homogenization.
5. Grid station data.
6. Create departure from average.
40
Quality Control Checks
Flag Name Description
E Inter-station Duplicate Check
Identify different stations that have same 12 monthly
values for a given year.
I Internal Consistency Check Looks for cases where TMIN > TMAX.
L Isolated Value Check Value has 18 months of missing data on either side.
O Climatological Outlier Check Monthly value is >= 5 standard deviations from mean.
S Spatial Consistency Check
Any monthly value between 2.5 and 5 standard
deviations are compared against closest neighbors within
500 km. Z-scores are used to determine if failed or not.
W Month-over-month Check
Data ingested from non-US countries via satellite are
flagged if data is duplicated from previous months.
Reference: Lawrimore et al. (2011)
41
How did this get made?
1. Collect weather stations from around the globe.
2. Merge similar stations together.
3. Run through quality control.
4. Perform homogenization.
5. Grid station data.
6. Create departure from average.
42
Homogenization
• Looks for non-climatic shifts in temperature.
• Examples include
– Station moves
– Instrument changes
– Errors in observation
• Applied to monthly data using the Pairwise
Homogeneity Algorithm (PHA).
• Daily homogenization exists, but in limited
fashion.
43
Homogenization
• Pairwise Homogeneity Algorithm (PHA)
– Identifies 100 nearest neighbors, calculates
anomalies, and differences are made.
– Statistically significant shifts in mean are identified
as change points.
• Uses the Standard Normal Homogeneity Test (SNHT).
Reference: Menne and Williams (2009)
BEFORE
PHA
Neighbors
Station
AFTER
PHA
Neighbors
Station
46
Homogenization
• Homogenization Example: Central Park
“As is clearly evident, adjustments made the dust bowl period cooler, while post 1995 had no adjustments applied. This
results in a temperature trend that is steeper because the past is cooler than the present. The only problem is that it isn’t
what the data actually recorded then.
I think maybe we need to coin a new term for NOAA NCDC – ‘dust bowl deniers’. Yes it appears there is man made
warming underway but the men are in Asheville, North Carolina at NOAA’s National Climatic Data Center.”
Newfoundland, NJ (47 km away)
Flemington, NJ (80 km away)
Belvidere, NJ (94 km away)
Albany, NY (219 km away)
51
Homogenization
• Metadata is important to help identify items
such as station moves, instrumentation
changes, etc.
• Sometimes unknown.
• Historical Observing Metadata Repository.
– HOMR
– Equipment was updated in 1995 with better
calibrated temperature sensor.
1995-06-27 Update: CHANGE NWS MANAGER
1995-06-27 Update: 10. CHG EQUIP, OBSVR, ADDR, PHONE, LAT/LON
53
How did this get made?
1. Collect weather stations from around the globe.
2. Merge similar stations together.
3. Run through quality control.
4. Perform homogenization.
5. Grid station data.
6. Create departure from average.
54
Gridding Method
55
Gridding Method
• Thin-Plate Smoothing Spline Method.
• Uses Australian National University Splines
(ANUSPLIN) package.
Variable
(Temperature)
lat/lon Smoothing
Parameter
Penalty
Function
Reference: Vose et al. (2014)
56
How did this get made?
1. Collect weather stations from around the globe.
2. Merge similar stations together.
3. Run through quality control.
4. Perform homogenization.
5. Grid station data.
6. Create departure from average.
57
What is an Anomaly
• Departure from a reference value.
– Normalizes the data.
• Diagnostic tool that provides a big picture
overview of temperatures compared to a
reference value.
• Removes regional influences such as
mountains, deserts.
Average Temperature recorded on 20171001
1981-2010 Climatological Normal for Oct 1
Average Temperature Anomaly for 20171001
61
Now you know!
62
Now you know!
63
How did this get made?
Scott
64
How much did it rain here?
● How do we get from
this to a rainfall
estimate?
● What potential errors
or roadblocks can you
see?
65
Radar Sees Everything
Resolution poor at higher
distances from radar
Probably migrating geese
Actually closer to
Wilmington radar. They
probably disagree.
66
Get a second opinion...
KRAX - Raleigh KLTX - Wilmington
67
Get ALL the opinions and merge them
68
What’s Even Falling?
69
That’s Only the Lowest Level
70
Clean Out the Birds
71
Ok, so how much is falling?
72
What about the whole last hour?
73
What do the gauges say?
74
The finished product
75
Before and After
Before After
Spatial Extent 250 km for each radar Nationwide
Time Resolution
About six minutes for each radar,
not synced with each other at all
2 minutes
Coordinates and Resolution Polar; Widens with distance Cartesian Lat/Lon Grid
Vertical Resolution ~ 19 vertical tilts Single integrated estimate
Variable Available Radar Reflectivity (dbZ) Rainrate (mm/hr)
Data Quality
Contaminated with birds and
windfarms and stuff
Not contaminated with birds and
windfarms and stuff
76
Thank You!
jared@cicsnc.org scott@cicsnc.org

Mais conteúdo relacionado

Mais procurados

TIME INTEGRATION OF EVAPOTRANSPIRATION USING A TWO SOURCE SURFACE ENERGY BALA...
TIME INTEGRATION OF EVAPOTRANSPIRATION USING A TWO SOURCE SURFACE ENERGY BALA...TIME INTEGRATION OF EVAPOTRANSPIRATION USING A TWO SOURCE SURFACE ENERGY BALA...
TIME INTEGRATION OF EVAPOTRANSPIRATION USING A TWO SOURCE SURFACE ENERGY BALA...
Ramesh Dhungel
 
Comparison of Latent Heat Flux Using Aerodynamic Methods and Using the Penman...
Comparison of Latent Heat Flux Using Aerodynamic Methods and Using the Penman...Comparison of Latent Heat Flux Using Aerodynamic Methods and Using the Penman...
Comparison of Latent Heat Flux Using Aerodynamic Methods and Using the Penman...
Ramesh Dhungel
 
Capstone Poster Design 4-23
Capstone Poster Design 4-23Capstone Poster Design 4-23
Capstone Poster Design 4-23
Trevor Bengtsson
 
Proposal Presentation - Morphology of cyclonic storms in the South Pacific Re...
Proposal Presentation - Morphology of cyclonic storms in the South Pacific Re...Proposal Presentation - Morphology of cyclonic storms in the South Pacific Re...
Proposal Presentation - Morphology of cyclonic storms in the South Pacific Re...
Ashneel Chandra
 
FR3.L10.4: SMOS SOIL MOISTURE VALUES EVALUATION OVER SAHELIAN AREA
FR3.L10.4: SMOS SOIL MOISTURE VALUES EVALUATION OVER SAHELIAN AREAFR3.L10.4: SMOS SOIL MOISTURE VALUES EVALUATION OVER SAHELIAN AREA
FR3.L10.4: SMOS SOIL MOISTURE VALUES EVALUATION OVER SAHELIAN AREA
grssieee
 

Mais procurados (20)

Follow the steps of a meteorologist
Follow the steps of a meteorologistFollow the steps of a meteorologist
Follow the steps of a meteorologist
 
Surface temperatures on_titan_during_northern_winter_and_spring
Surface temperatures on_titan_during_northern_winter_and_springSurface temperatures on_titan_during_northern_winter_and_spring
Surface temperatures on_titan_during_northern_winter_and_spring
 
Final Report
Final ReportFinal Report
Final Report
 
0507 Event Analysis 051101 Event Seminar2
0507 Event Analysis 051101 Event Seminar20507 Event Analysis 051101 Event Seminar2
0507 Event Analysis 051101 Event Seminar2
 
Optimization and Statistical Learning in Geophysical Applications
Optimization and Statistical Learning in Geophysical Applications Optimization and Statistical Learning in Geophysical Applications
Optimization and Statistical Learning in Geophysical Applications
 
Emulating GCM projections by pattern scaling: performance and unforced climat...
Emulating GCM projections by pattern scaling: performance and unforced climat...Emulating GCM projections by pattern scaling: performance and unforced climat...
Emulating GCM projections by pattern scaling: performance and unforced climat...
 
Watson, Andrew: Substantially larger estimates of global ocean-atmosphere flu...
Watson, Andrew: Substantially larger estimates of global ocean-atmosphere flu...Watson, Andrew: Substantially larger estimates of global ocean-atmosphere flu...
Watson, Andrew: Substantially larger estimates of global ocean-atmosphere flu...
 
Numerical Weather Prediction
Numerical Weather PredictionNumerical Weather Prediction
Numerical Weather Prediction
 
Simulating Weather: Numerical Weather Prediction as Computational Simulation
Simulating Weather: Numerical Weather Prediction as Computational SimulationSimulating Weather: Numerical Weather Prediction as Computational Simulation
Simulating Weather: Numerical Weather Prediction as Computational Simulation
 
Weather forecasting
Weather forecastingWeather forecasting
Weather forecasting
 
Öncel Akademi: İstatistiksel Sismoloji
Öncel Akademi: İstatistiksel SismolojiÖncel Akademi: İstatistiksel Sismoloji
Öncel Akademi: İstatistiksel Sismoloji
 
TIME INTEGRATION OF EVAPOTRANSPIRATION USING A TWO SOURCE SURFACE ENERGY BALA...
TIME INTEGRATION OF EVAPOTRANSPIRATION USING A TWO SOURCE SURFACE ENERGY BALA...TIME INTEGRATION OF EVAPOTRANSPIRATION USING A TWO SOURCE SURFACE ENERGY BALA...
TIME INTEGRATION OF EVAPOTRANSPIRATION USING A TWO SOURCE SURFACE ENERGY BALA...
 
Comparison of Latent Heat Flux Using Aerodynamic Methods and Using the Penman...
Comparison of Latent Heat Flux Using Aerodynamic Methods and Using the Penman...Comparison of Latent Heat Flux Using Aerodynamic Methods and Using the Penman...
Comparison of Latent Heat Flux Using Aerodynamic Methods and Using the Penman...
 
2010 AMS Poster
2010 AMS Poster2010 AMS Poster
2010 AMS Poster
 
Record Setting: The Origins of Extreme Hail on 19 March 2018 during VORTEX-SE
Record Setting: The Origins of Extreme Hail on 19 March 2018 during VORTEX-SERecord Setting: The Origins of Extreme Hail on 19 March 2018 during VORTEX-SE
Record Setting: The Origins of Extreme Hail on 19 March 2018 during VORTEX-SE
 
USDA/GIS Update - Data, Products & Services
USDA/GIS Update - Data, Products & ServicesUSDA/GIS Update - Data, Products & Services
USDA/GIS Update - Data, Products & Services
 
Capstone Poster Design 4-23
Capstone Poster Design 4-23Capstone Poster Design 4-23
Capstone Poster Design 4-23
 
Climate Extremes Workshop - The Dependence Between Extreme Precipitation and...
Climate Extremes Workshop -  The Dependence Between Extreme Precipitation and...Climate Extremes Workshop -  The Dependence Between Extreme Precipitation and...
Climate Extremes Workshop - The Dependence Between Extreme Precipitation and...
 
Proposal Presentation - Morphology of cyclonic storms in the South Pacific Re...
Proposal Presentation - Morphology of cyclonic storms in the South Pacific Re...Proposal Presentation - Morphology of cyclonic storms in the South Pacific Re...
Proposal Presentation - Morphology of cyclonic storms in the South Pacific Re...
 
FR3.L10.4: SMOS SOIL MOISTURE VALUES EVALUATION OVER SAHELIAN AREA
FR3.L10.4: SMOS SOIL MOISTURE VALUES EVALUATION OVER SAHELIAN AREAFR3.L10.4: SMOS SOIL MOISTURE VALUES EVALUATION OVER SAHELIAN AREA
FR3.L10.4: SMOS SOIL MOISTURE VALUES EVALUATION OVER SAHELIAN AREA
 

Destaque

Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
The Statistical and Applied Mathematical Sciences Institute
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
The Statistical and Applied Mathematical Sciences Institute
 

Destaque (20)

CLIM Undergraduate Workshop: Introduction to Spatial Data Analysis with R - M...
CLIM Undergraduate Workshop: Introduction to Spatial Data Analysis with R - M...CLIM Undergraduate Workshop: Introduction to Spatial Data Analysis with R - M...
CLIM Undergraduate Workshop: Introduction to Spatial Data Analysis with R - M...
 
CLIM Undergraduate Workshop: Extreme Value Analysis for Climate Research - Wh...
CLIM Undergraduate Workshop: Extreme Value Analysis for Climate Research - Wh...CLIM Undergraduate Workshop: Extreme Value Analysis for Climate Research - Wh...
CLIM Undergraduate Workshop: Extreme Value Analysis for Climate Research - Wh...
 
CLIM Undergraduate Workshop: Tutorial on R Software - Huang Huang, Oct 23, 2017
CLIM Undergraduate Workshop: Tutorial on R Software - Huang Huang, Oct 23, 2017CLIM Undergraduate Workshop: Tutorial on R Software - Huang Huang, Oct 23, 2017
CLIM Undergraduate Workshop: Tutorial on R Software - Huang Huang, Oct 23, 2017
 
CLIM Undergraduate Workshop: Statistical Development and challenges for Paleo...
CLIM Undergraduate Workshop: Statistical Development and challenges for Paleo...CLIM Undergraduate Workshop: Statistical Development and challenges for Paleo...
CLIM Undergraduate Workshop: Statistical Development and challenges for Paleo...
 
CLIM Undergraduate Workshop: (Attachment) Performing Extreme Value Analysis (...
CLIM Undergraduate Workshop: (Attachment) Performing Extreme Value Analysis (...CLIM Undergraduate Workshop: (Attachment) Performing Extreme Value Analysis (...
CLIM Undergraduate Workshop: (Attachment) Performing Extreme Value Analysis (...
 
CLIM Undergraduate Workshop: Undergraduate Workshop Introduction - Elvan Ceyh...
CLIM Undergraduate Workshop: Undergraduate Workshop Introduction - Elvan Ceyh...CLIM Undergraduate Workshop: Undergraduate Workshop Introduction - Elvan Ceyh...
CLIM Undergraduate Workshop: Undergraduate Workshop Introduction - Elvan Ceyh...
 
Summer Program on Transportation Statistics, Dynamic Modeling of Transportati...
Summer Program on Transportation Statistics, Dynamic Modeling of Transportati...Summer Program on Transportation Statistics, Dynamic Modeling of Transportati...
Summer Program on Transportation Statistics, Dynamic Modeling of Transportati...
 
Summer Program on Transportation Statistics, Why Highway Crashes Have Recurri...
Summer Program on Transportation Statistics, Why Highway Crashes Have Recurri...Summer Program on Transportation Statistics, Why Highway Crashes Have Recurri...
Summer Program on Transportation Statistics, Why Highway Crashes Have Recurri...
 
Summer Program on Transportation Statistics, Assessing Crash Risk for Highly ...
Summer Program on Transportation Statistics, Assessing Crash Risk for Highly ...Summer Program on Transportation Statistics, Assessing Crash Risk for Highly ...
Summer Program on Transportation Statistics, Assessing Crash Risk for Highly ...
 
Summer Program on Transportation Statistics, Statistical Challenges for Advan...
Summer Program on Transportation Statistics, Statistical Challenges for Advan...Summer Program on Transportation Statistics, Statistical Challenges for Advan...
Summer Program on Transportation Statistics, Statistical Challenges for Advan...
 
Summer Program on Transportation Statistics, What about the Driver in Driver...
Summer Program on Transportation Statistics, What about the Driver in  Driver...Summer Program on Transportation Statistics, What about the Driver in  Driver...
Summer Program on Transportation Statistics, What about the Driver in Driver...
 
CLIM Fall 2017 Course: Statistics for Climate Research, Detection & Attributi...
CLIM Fall 2017 Course: Statistics for Climate Research, Detection & Attributi...CLIM Fall 2017 Course: Statistics for Climate Research, Detection & Attributi...
CLIM Fall 2017 Course: Statistics for Climate Research, Detection & Attributi...
 
CLIM Fall 2017 Course: Statistics for Climate Research, Climate Informatics -...
CLIM Fall 2017 Course: Statistics for Climate Research, Climate Informatics -...CLIM Fall 2017 Course: Statistics for Climate Research, Climate Informatics -...
CLIM Fall 2017 Course: Statistics for Climate Research, Climate Informatics -...
 
CLIM Fall 2017 Course: Statistics for Climate Research, Geostats for Large Da...
CLIM Fall 2017 Course: Statistics for Climate Research, Geostats for Large Da...CLIM Fall 2017 Course: Statistics for Climate Research, Geostats for Large Da...
CLIM Fall 2017 Course: Statistics for Climate Research, Geostats for Large Da...
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...
CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...
CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
CLIM Fall 2017 Course: Statistics for Climate Research, Nonstationary Covaria...
CLIM Fall 2017 Course: Statistics for Climate Research, Nonstationary Covaria...CLIM Fall 2017 Course: Statistics for Climate Research, Nonstationary Covaria...
CLIM Fall 2017 Course: Statistics for Climate Research, Nonstationary Covaria...
 
CLIM Fall 2017 Course: Statistics for Climate Research, Estimating Curves and...
CLIM Fall 2017 Course: Statistics for Climate Research, Estimating Curves and...CLIM Fall 2017 Course: Statistics for Climate Research, Estimating Curves and...
CLIM Fall 2017 Course: Statistics for Climate Research, Estimating Curves and...
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 

Semelhante a CLIM Undergraduate Workshop: How was this Made?: Making Dirty Data into Something Usable at NCEI - Stevens & Rennie, Oct 24, 2017

Koomeyoncomputingtrends v2
Koomeyoncomputingtrends v2Koomeyoncomputingtrends v2
Koomeyoncomputingtrends v2
Jonathan Koomey
 
Sampling-SDM2012_Jun
Sampling-SDM2012_JunSampling-SDM2012_Jun
Sampling-SDM2012_Jun
MDO_Lab
 

Semelhante a CLIM Undergraduate Workshop: How was this Made?: Making Dirty Data into Something Usable at NCEI - Stevens & Rennie, Oct 24, 2017 (20)

CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...
CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...
CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...
 
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
 
Learning new climate science by thinking creatively with machine learning
Learning new climate science by thinking creatively with machine learningLearning new climate science by thinking creatively with machine learning
Learning new climate science by thinking creatively with machine learning
 
EcoTas13 BradEvans e-MAST
EcoTas13 BradEvans e-MASTEcoTas13 BradEvans e-MAST
EcoTas13 BradEvans e-MAST
 
Convective Heat Transfer Measurements at the Martian Surface
Convective Heat Transfer Measurements at the Martian SurfaceConvective Heat Transfer Measurements at the Martian Surface
Convective Heat Transfer Measurements at the Martian Surface
 
CliffStat.ppt
CliffStat.pptCliffStat.ppt
CliffStat.ppt
 
Francisco J. Doblas-Big Data y cambio climático
Francisco J. Doblas-Big Data y cambio climáticoFrancisco J. Doblas-Big Data y cambio climático
Francisco J. Doblas-Big Data y cambio climático
 
Richardson phenocam ACEAS 2014
Richardson phenocam ACEAS 2014Richardson phenocam ACEAS 2014
Richardson phenocam ACEAS 2014
 
Extending the modern record back in time using proxy data
Extending the modern record back in time using proxy dataExtending the modern record back in time using proxy data
Extending the modern record back in time using proxy data
 
CLIM: Transition Workshop - Extremes and Networks: a review of activities -...
CLIM: Transition Workshop - Extremes and  Networks: a  review of activities -...CLIM: Transition Workshop - Extremes and  Networks: a  review of activities -...
CLIM: Transition Workshop - Extremes and Networks: a review of activities -...
 
Weather jun02
Weather jun02Weather jun02
Weather jun02
 
Koomeyoncomputingtrends v2
Koomeyoncomputingtrends v2Koomeyoncomputingtrends v2
Koomeyoncomputingtrends v2
 
PLOTCON NYC: Custom Colormaps for Your Field
PLOTCON NYC: Custom Colormaps for Your FieldPLOTCON NYC: Custom Colormaps for Your Field
PLOTCON NYC: Custom Colormaps for Your Field
 
Sampling-SDM2012_Jun
Sampling-SDM2012_JunSampling-SDM2012_Jun
Sampling-SDM2012_Jun
 
Mauro Sulis
Mauro SulisMauro Sulis
Mauro Sulis
 
Advanced DOE with Minitab (presentation in Costa Rica)
Advanced DOE with Minitab (presentation in Costa Rica)Advanced DOE with Minitab (presentation in Costa Rica)
Advanced DOE with Minitab (presentation in Costa Rica)
 
Using explainable machine learning to evaluate climate change projections
Using explainable machine learning to evaluate climate change projectionsUsing explainable machine learning to evaluate climate change projections
Using explainable machine learning to evaluate climate change projections
 
Andrews, Arlyn: Towards an International Reference Network for Greenhouse Gases
Andrews, Arlyn: Towards an International Reference Network for Greenhouse GasesAndrews, Arlyn: Towards an International Reference Network for Greenhouse Gases
Andrews, Arlyn: Towards an International Reference Network for Greenhouse Gases
 
Big data
Big dataBig data
Big data
 
Towards climate-smart decision support systems in the Philippines
Towards climate-smart decision support systems in the PhilippinesTowards climate-smart decision support systems in the Philippines
Towards climate-smart decision support systems in the Philippines
 

Mais de The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
The Statistical and Applied Mathematical Sciences Institute
 
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
The Statistical and Applied Mathematical Sciences Institute
 
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
The Statistical and Applied Mathematical Sciences Institute
 
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
The Statistical and Applied Mathematical Sciences Institute
 
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
The Statistical and Applied Mathematical Sciences Institute
 
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
The Statistical and Applied Mathematical Sciences Institute
 

Mais de The Statistical and Applied Mathematical Sciences Institute (20)

Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
 
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
 
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
 
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
 
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
 
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
 
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
 
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
 
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
 
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
 
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
 
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
 
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
 
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
 
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
 
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
 
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
 
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
 
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
 
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
 

Último

Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
Silpa
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Silpa
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
MohamedFarag457087
 
Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.
Silpa
 

Último (20)

Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsTransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.
 

CLIM Undergraduate Workshop: How was this Made?: Making Dirty Data into Something Usable at NCEI - Stevens & Rennie, Oct 24, 2017

  • 1. Click to edit Master title style Click to edit Master subtitle style 1 How were these made? Turning dirty data into something usable at NCEI Jared Rennie and Scott Stevens SAMSI Undergraduate Workshop October 24th, 2017
  • 2. 2 Outline • Introduction • Overview of NCEI / CICS • How did these get made? – Temperature Extremes – Rainfall Estimates • Questions
  • 3. 3 What We’re Taught Name How Tall are You (inches)? Adam 66.2 Barbara 63.4 Clint 69.5 Debra 65.3 Easton 71.5 Frances 68.5 George 64.2 Hilda 61.2 Idris 74.5
  • 4. 4 What We Get Name How Tall are You (inches)? Adam 66 inches Barbara 63.4287 Clint 176.5 Debra Easton 41.5 Frances 68.5 George 5’4” Hilda 61.2 Idris 210 lbs
  • 5. 5 What We Get Name How Tall are You (inches)? Adam 66 inches Added on some extra units Barbara 63.4287 Has too much confidence in her ruler’s precision Clint 176.5 Gave it in cm instead of inches Debra Just didn’t answer Easton 41.5 Made a typo Frances 68.5 George 5’4” Ignored the instructions Hilda 61.2 Idris 210 lbs Answered a different question entirely
  • 10. 10 TL;DR • A lot of work goes into our data before most scientists ever even see it • Most of the “controversy” regarding climate change isn’t around the analysis, but the data • What we start with is messy and of little value to most scientists, so we have to make it more user-friendly – And we have to carefully document how
  • 14. 14 National Centers for Environmental information • Located in downtown Asheville, NC. • Holds over 30 Petabytes of weather and climate data from all over the world.
  • 15. 15 Cooperative Institute for Climate and Satellites • Research institution working with NCEI. • Made up of meteorologists, software developers, and science communicators. • Vision – Inspire cutting-edge research and collaboration. – Advance understanding of the current and future state of the climate. – Engage with business, academia, government, and the public to enhance decision making.
  • 19. 19 That’s great… but • These are the final products. • A lot of hard work goes into generating these “pretty images.” – Math, statistics, meteorology, physics, orbital mechanics, instrumentation, engineering, computer programming, visualization, communication. • And most importantly…
  • 21. 21 Goal of this talk • Go into great, statistical detail how the following images were made.
  • 22. 22 How did this get made? Jared
  • 23. 23 Surface Temperature: The Beginnings • The instrumental record of temperature has its roots in the development of universal temperature scales in the 18th century. – Monthly mean temperature series for De Bilt, Netherlands extends from 1706 to the present. – Several other long European series exist going back over 200 years. • Throughout the 1800s measurements expanded across other continents. – National Meteorological and Hydrological Services (NMHS) around the world have operated networks to support weather and climate observations since the late 19th Century.
  • 24. 24 Surface Temperature: The Beginnings • In the 1980’s and 1990’s, major efforts were made to collect observations and create consolidated global datasets. – Global Historical Climatology Network – Monthly (GHCN-M) • Focus shifted to daily data in the 2000’s – Global Historical Climatology Network – Daily (GHCN-D)
  • 25. 25 Surface Temperature: The Beginnings • GHCN-M – In-Situ data for 7,280 stations in 226 countries. – Monthly maximum, minimum, and average temperature. – Automated ingest, qc, and corrected for bias. • GHCN-D – Integrated database of daily climate summaries. – TMAX, TMIN, PRCP, SNOW, SNWD, others. – 100,000 stations globally (~30,000 temperature).
  • 26. 26 Surface Temperature: The Beginnings • GHCN, along with other datasets, are the foundation for understanding trends in global surface temperature. Land Land + Ocean NCEI GHCN-M / GHCN-D NOAAGlobalTemp UK Met Office CRUTEM HADCRUT NASA GISS GISS
  • 28. 28 How did this get made? 1. Collect weather stations from around the globe. 2. Merge similar stations together. 3. Run through quality control. 4. Perform homogenization. 5. Grid station data. 6. Create departure from average.
  • 29. 29 How did this get made? 1. Collect weather stations from around the globe. 2. Merge similar stations together. 3. Run through quality control. 4. Perform homogenization. 5. Grid station data. 6. Create departure from average.
  • 33. 33 How did this get made? 1. Collect weather stations from around the globe. 2. Merge similar stations together. 3. Run through quality control. 4. Perform homogenization. 5. Grid station data. 6. Create departure from average.
  • 34. 34 Merge Program • Iterative process between the first source, and all candidate sources. • Compare each candidate station to all the target stations, and determine one of the following three possibilities: – Candidate station should merge with target station. – Candidate station is unique, and should be added as a new station in the target source. – Not enough information, station is withheld. Reference: Rennie et al. (2014)
  • 35. 35 Merge Program • Metadata Tests – Probability of station match is calculated based upon three criteria: • Geographic distance • Elevation difference • Station name similarity using Jaccard Index – Intersection divided by Union – Final weighted probability is calculated, and if it passes this test, moves on to second test, otherwise it is withheld. Reference: Rennie et al. (2014)
  • 36. 36 Merge Program • Data Tests – Compare overlapping data using Index of Agreement. – From IA, calculate probability of station match and probability of station uniqueness. – Used to determine final fate of station. Reference: Rennie et al. (2014)
  • 39. 39 How did this get made? 1. Collect weather stations from around the globe. 2. Merge similar stations together. 3. Run through quality control. 4. Perform homogenization. 5. Grid station data. 6. Create departure from average.
  • 40. 40 Quality Control Checks Flag Name Description E Inter-station Duplicate Check Identify different stations that have same 12 monthly values for a given year. I Internal Consistency Check Looks for cases where TMIN > TMAX. L Isolated Value Check Value has 18 months of missing data on either side. O Climatological Outlier Check Monthly value is >= 5 standard deviations from mean. S Spatial Consistency Check Any monthly value between 2.5 and 5 standard deviations are compared against closest neighbors within 500 km. Z-scores are used to determine if failed or not. W Month-over-month Check Data ingested from non-US countries via satellite are flagged if data is duplicated from previous months. Reference: Lawrimore et al. (2011)
  • 41. 41 How did this get made? 1. Collect weather stations from around the globe. 2. Merge similar stations together. 3. Run through quality control. 4. Perform homogenization. 5. Grid station data. 6. Create departure from average.
  • 42. 42 Homogenization • Looks for non-climatic shifts in temperature. • Examples include – Station moves – Instrument changes – Errors in observation • Applied to monthly data using the Pairwise Homogeneity Algorithm (PHA). • Daily homogenization exists, but in limited fashion.
  • 43. 43 Homogenization • Pairwise Homogeneity Algorithm (PHA) – Identifies 100 nearest neighbors, calculates anomalies, and differences are made. – Statistically significant shifts in mean are identified as change points. • Uses the Standard Normal Homogeneity Test (SNHT). Reference: Menne and Williams (2009)
  • 46. 46 Homogenization • Homogenization Example: Central Park “As is clearly evident, adjustments made the dust bowl period cooler, while post 1995 had no adjustments applied. This results in a temperature trend that is steeper because the past is cooler than the present. The only problem is that it isn’t what the data actually recorded then. I think maybe we need to coin a new term for NOAA NCDC – ‘dust bowl deniers’. Yes it appears there is man made warming underway but the men are in Asheville, North Carolina at NOAA’s National Climatic Data Center.”
  • 48. Flemington, NJ (80 km away)
  • 49. Belvidere, NJ (94 km away)
  • 50. Albany, NY (219 km away)
  • 51. 51 Homogenization • Metadata is important to help identify items such as station moves, instrumentation changes, etc. • Sometimes unknown. • Historical Observing Metadata Repository. – HOMR – Equipment was updated in 1995 with better calibrated temperature sensor. 1995-06-27 Update: CHANGE NWS MANAGER 1995-06-27 Update: 10. CHG EQUIP, OBSVR, ADDR, PHONE, LAT/LON
  • 52.
  • 53. 53 How did this get made? 1. Collect weather stations from around the globe. 2. Merge similar stations together. 3. Run through quality control. 4. Perform homogenization. 5. Grid station data. 6. Create departure from average.
  • 55. 55 Gridding Method • Thin-Plate Smoothing Spline Method. • Uses Australian National University Splines (ANUSPLIN) package. Variable (Temperature) lat/lon Smoothing Parameter Penalty Function Reference: Vose et al. (2014)
  • 56. 56 How did this get made? 1. Collect weather stations from around the globe. 2. Merge similar stations together. 3. Run through quality control. 4. Perform homogenization. 5. Grid station data. 6. Create departure from average.
  • 57. 57 What is an Anomaly • Departure from a reference value. – Normalizes the data. • Diagnostic tool that provides a big picture overview of temperatures compared to a reference value. • Removes regional influences such as mountains, deserts.
  • 63. 63 How did this get made? Scott
  • 64. 64 How much did it rain here? ● How do we get from this to a rainfall estimate? ● What potential errors or roadblocks can you see?
  • 65. 65 Radar Sees Everything Resolution poor at higher distances from radar Probably migrating geese Actually closer to Wilmington radar. They probably disagree.
  • 66. 66 Get a second opinion... KRAX - Raleigh KLTX - Wilmington
  • 67. 67 Get ALL the opinions and merge them
  • 69. 69 That’s Only the Lowest Level
  • 71. 71 Ok, so how much is falling?
  • 72. 72 What about the whole last hour?
  • 73. 73 What do the gauges say?
  • 75. 75 Before and After Before After Spatial Extent 250 km for each radar Nationwide Time Resolution About six minutes for each radar, not synced with each other at all 2 minutes Coordinates and Resolution Polar; Widens with distance Cartesian Lat/Lon Grid Vertical Resolution ~ 19 vertical tilts Single integrated estimate Variable Available Radar Reflectivity (dbZ) Rainrate (mm/hr) Data Quality Contaminated with birds and windfarms and stuff Not contaminated with birds and windfarms and stuff