Ecoinformatics International Technical Collaboration Partnership
International Web Meeting - Linked Open Data and Environmental Information
Day 1 – December 6, 2010

Geospatial Topic – Dave Smith


December 6, 2010



Dave Smith
USEPA/OEI/OIC/IESD/ISSB
smith.davidg@epa.gov
202-566-0797


Document Change History
  Revision     Date                            Author                  Description
 1.0          12/6/2010       David G. Smith                        Initial Version



FRS as a Linked Open Data Pilot - Background

EPA maintains a database of facilities, the Facility Registry System (FRS), which is aggregated from a
variety of sources: 32 federal databases (mostly EPA, along with a few others such as the Energy
Information Administration) and 57 state and tribal databases. Information about each facility is
conflated from these sources and includes the facility name; geographic location (spatial feature type
such as point or polygon, latitude, longitude, coordinate reference system, and collection metadata);
physical and mailing addresses; points of contact; activities conducted at the location (via North
American Industry Classification System, NAICS, codes and those of its predecessor, the Standard
Industrial Classification, SIC); and any associated program identifiers, permit numbers, and other
related items.
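
As a rough illustration of the record described above, the conflated facility information could be modeled along the following lines. This is a minimal sketch only: the class names, field names, and sample values are hypothetical and are not taken from the actual FRS schema.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Location:
    """Geographic location of a facility, with collection metadata."""
    feature_type: str                 # e.g. "point" or "polygon"
    latitude: float
    longitude: float
    crs: str                          # coordinate reference system, e.g. "NAD83"
    collection_method: Optional[str] = None


@dataclass
class FacilityRecord:
    """Hypothetical conflated facility record; field names are illustrative."""
    registry_id: str
    name: str
    location: Location
    physical_address: str
    mailing_address: Optional[str] = None
    contacts: List[str] = field(default_factory=list)
    naics_codes: List[str] = field(default_factory=list)
    sic_codes: List[str] = field(default_factory=list)
    # Maps a source program or system name to the identifier it assigned.
    program_ids: Dict[str, str] = field(default_factory=dict)


# Made-up sample values, for illustration only.
example = FacilityRecord(
    registry_id="EXAMPLE-0001",
    name="Example Quarry",
    location=Location("point", 38.90, -77.03, "NAD83", "address matching"),
    physical_address="100 Example Rd, Anytown, ST",
    naics_codes=["212312"],
    program_ids={"EXAMPLE_PROGRAM": "XYZ123"},
)
```

Keeping the program identifiers in a mapping reflects the fact that one facility may be known to several source systems under different identifiers.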

This database in turn serves as a geospatial foundation for some of EPA’s reporting and mapping tools,
such as Envirofacts and MyEnvironment, allowing parametric data and reports from a variety of programs
to be linked to facilities.

Currently this integration is done by traditional means, i.e. Relational Database Management
System queries. Web services and APIs are limited, so integration opportunities are
generally limited to what we can do within the Agency.
EcoInformatics – Geospatial Discussion
                              November 11, 2010                          December 6, 2010


Opportunity

Linked Open Data approaches offer an opportunity to publish this facilities data framework so that it
supports analysis across other agencies as well, for example against Occupational Safety and Health
Administration (OSHA) or Mine Safety and Health Administration (MSHA) enforcement histories, or
against offshore platforms in Bureau of Ocean Energy Management, Regulation and Enforcement (BOEMRE)
data. Other cross-cutting, government-wide approaches become possible as more Linked Open Data assets
become available.

Initial Efforts

EPA is still in the planning stages. We have published some initial FRS data as RDF via Data.gov;
however, we are now working to iteratively refine our Linked Open Data (LOD) publishing approach
through a “cookbook” that we hope to apply to a number of EPA datasets. The cookbook will establish a
framework of consistent methodologies and approaches for publishing Linked Open Data agencywide.
Part of this will be to leverage existing agency investments in metadata, data dictionaries,
terminologies and ontologies toward further contextualizing agency data assets.

For FRS, we hope to contextualize the various facets of the data, e.g. corporate/organizational entity,
points of contact, activities and other aspects.
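
To make the idea of publishing a facility as RDF concrete, the sketch below writes a few facts about one facility as N-Triples using only the Python standard library. It is an illustration under stated assumptions, not the cookbook approach itself: the base URI, the vocabulary terms (Facility, naicsCode), and the sample facility are all hypothetical; only the rdf:type and rdfs:label URIs are standard.

```python
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
BASE = "http://example.org/frs/"   # hypothetical base URI, not a real EPA URI
VOCAB = BASE + "vocab#"            # hypothetical vocabulary namespace


def literal(value: str) -> str:
    """Quote a plain string literal, escaping backslashes, quotes and newlines."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
    return f'"{escaped}"'


def facility_triples(registry_id: str, name: str, naics_codes: list) -> list:
    """Return N-Triples lines for one facility.

    Assumes registry_id contains only characters that are safe in a URI.
    """
    subject = f"<{BASE}facility/{registry_id}>"
    lines = [
        f"{subject} <{RDF_TYPE}> <{VOCAB}Facility> .",
        f"{subject} <{RDFS_LABEL}> {literal(name)} .",
    ]
    for code in naics_codes:
        lines.append(f"{subject} <{VOCAB}naicsCode> {literal(code)} .")
    return lines


# Made-up sample facility, for illustration only.
triples = facility_triples("EXAMPLE-0001", 'Example "North" Quarry', ["212312"])
print("\n".join(triples))
```

Giving each facility a stable URI is what would let other datasets refer to the same facility; the remaining facets (organizational entity, points of contact, activities) would be added as further triples about that URI.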

Geospatial Enablement

There are multiple aspects to geo-enablement via Linked Open Data. One is how to represent features
in a manner that works for mapping: points, lines, polygons and associated topologies, their
coordinates, and metadata describing such things as coordinate reference systems and locational
accuracy estimates.

For the geospatial feature component of FRS, we hope to look at current Open Geospatial Consortium
(OGC) standards and efforts, such as the GeoSemantics SWG, as well as emergent GeoSPARQL efforts, and
to collaborate with the Spatial Ontology Community of Practice (SOCOP). We will need to examine the
most effective means of representing features, such as GeoRSS, along with current coordinate reference
systems (e.g. NAD83), toward interoperability and geospatial analysis.
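
One candidate representation is a Well-Known Text (WKT) literal that carries its coordinate reference system with it, in the style of GeoSPARQL's wktLiteral datatype. The sketch below formats a facility point that way. It is a hedged illustration, not a chosen design: the function name is hypothetical, the sample coordinates are made up, and the latitude-first axis order is an assumption based on the EPSG:4269 (geographic NAD83) definition that should be checked against whichever specification is finally adopted.

```python
GEO = "http://www.opengis.net/ont/geosparql#"
# EPSG:4269 is the geographic NAD83 coordinate reference system.
NAD83_CRS = "http://www.opengis.net/def/crs/EPSG/0/4269"


def wkt_point_literal(lat: float, lon: float, crs_uri: str = NAD83_CRS) -> str:
    """Format a point as a typed WKT literal prefixed with its CRS URI.

    Assumption: coordinates are written latitude first, following the
    axis order defined for EPSG:4269.
    """
    if not -90.0 <= lat <= 90.0:
        raise ValueError(f"latitude out of range: {lat}")
    if not -180.0 <= lon <= 180.0:
        raise ValueError(f"longitude out of range: {lon}")
    return f'"<{crs_uri}> POINT({lat} {lon})"^^<{GEO}wktLiteral>'


# Made-up coordinates, for illustration only.
print(wkt_point_literal(38.9, -77.03))
```

Stating the coordinate reference system explicitly in each literal avoids the ambiguity that arises when a consumer silently assumes a different datum.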

Another aspect deals with the geography of interest: relating the facility attribute ontology to the
surrounding terrain ontology to provide context. For example, if we are dealing with a mining
facility, can one relate the facility interest to other datasets such as geology, stratigraphy, and
other mining-related data?

This may require some tuning in how we collect and model data. For example, most of our data has
historically been program-specific, and some of these subtler nuances are currently reachable only
through imperfect derivation based on things like NAICS codes.
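
The kind of derivation described above can be sketched as a simple prefix lookup on NAICS codes. This is an illustration of the idea and of its limits, not an existing EPA process: the mapping table and function name are hypothetical, and the only fact relied on is that NAICS subsector 212 covers mining (except oil and gas).

```python
# Hypothetical mapping from NAICS code prefixes to thematic datasets that may
# be relevant to facilities in that industry.
RELATED_DATASETS_BY_NAICS_PREFIX = {
    "212": ["geology", "stratigraphy"],   # subsector 212: mining (except oil and gas)
}


def related_datasets(naics_codes):
    """Return candidate related datasets for a facility, in first-seen order.

    The result is only as good as the NAICS codes on the record, which is
    why the derivation is imperfect.
    """
    found = []
    for code in naics_codes:
        for prefix, datasets in RELATED_DATASETS_BY_NAICS_PREFIX.items():
            if code.startswith(prefix):
                for dataset in datasets:
                    if dataset not in found:
                        found.append(dataset)
    return found


print(related_datasets(["212312"]))   # a code within subsector 212
print(related_datasets(["325110"]))   # a code outside the mapping
```

A facility whose mining activity is not reflected in its recorded NAICS codes would be missed entirely, which illustrates why modeling the facility interest explicitly may be preferable to inferring it.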

Next Steps

We hope to collaborate with our counterparts in other agencies on best practices and lessons learned.
In the case of EPA’s Facility Registry System, there are direct, tangible, and implementable pieces that
we can put into motion, and there is an opportunity to develop a more robust Linked Open Data approach,
an effort that has already kicked off.



