SlideShare uma empresa Scribd logo
1 de 13
Baixar para ler offline
Geocoding news at the source
Gerd Kamp

dpa-infocom GmbH
Overview

Motivation
# News always happen in a spatio-temporal context
        # you want to attach that context as metadata to the news
# The illustration of news via maps is common practice since ages
        # but typically different from putting pins into maps


Current Status
# started evangelizing within dpa in Q2/06
# geocoding our regional wires since 11/07
# geocoding places of stories as well as places in stories
# manual process
        # with support systems integrated into the editorial systems


© 2008 Gerd Kamp                                                       2
Locations of news stories / Semantics (current status)

A scope of a news story
# is a geoname that is part of a (official administrative) hierarchical partition of a
  defined geographic extent,
# representing the largest area wrt. the above hierarchy where this story is deemed
  relevant (by an editor)


A variant of scopes are legal scopes


Assigning geographic areas of relevance is something editors have been doing for ages
# National wire vs. regional wire
# Front section vs. local section



© 2008 Gerd Kamp                                                                        3
Locations of news stories / Semantics (current status)

A locus of a news story
# is a geoname that is part of a set of geonames for a defined geographic extent
# representing the smallest area wrt. the above set where (the) events of this story are
  happening / have happened / are going to happen


A place of production of a news story
# is either a geoname or address or lat/lon




© 2008 Gerd Kamp                                                                           4
Location within news

A location in a (news) story is a location directly or indirectly mentioned in the
news story itself
# typically not geographic names but rather addresses, street segments, blocks, or
  POIs
# not all geographic entities are necessarily identified
        # relevance
        # ranking




© 2008 Gerd Kamp                                                                     5
Geonames

A geographic name
# is a name applied to a geographic feature. It is the proper name, specific term, or
  expression by which a particular geographic entity is, or was, known. A geographic
  entity is any relatively permanent part of the natural or manmade landscape or
  seascape that has recognizable identity within a particular cultural context.
# A geographic name, then, may refer to any place, feature, or area on the Earth's
  surface, or to a related group of similar places, features, or areas.


# Typically there are national bodies defining geonames
        # U.S. Board for Geographic Names
        # Ständiger Ausschuss für Geographische Namen
        # New players are entering the game (e.g. Geonames, YahooLocation Platform)


© 2008 Gerd Kamp                                                                       6
Hierarchical partition (current draft definition)

A hierarchical partition of scopes of a geographic extent e is a directed acyclic
graph (DAG) with the following properties:
#    There is a single source s_top (the top level scope) with a geographic extent being
     coterminous with the geographic extent (using coterminous as having matching
     boundaries interpretation
# every scope has a property denoting its level in the hierarchy with the top level scope
  having the level 1
# for any given point p in e there is at least one corresponding scope s_point at some
  level in the DAG
# for every scope that has more than one successor the geographic extent of set of
  successors is coterminous with the geographic extent of this scope
# for every scope that has more than one predecessor the geographic extent of set of
  predecessors is coterminous with the geographic extent of this scope


© 2008 Gerd Kamp                                                                            7
Example

A story about legislation in a state is assigned a statewide scope (although the dateline
is the state capitol)




© 2008 Gerd Kamp                                                                            8
Example

A story about an accident within A with a driver coming from B




© 2008 Gerd Kamp                                                 9
Example (News Industry Text Format - NITF)

<nitf xmlns:georss=quot;http://www.georss.org/georssquot;>
<head>
<title>Bayern München II schlägt Karlsruhe 3:1</title>
<location class=quot;scopequot;>
<region region-code=quot;09184000quot; code-source=quot;AGSquot;>München
 <georss:point>11.5725580365 48.1379548096</georss:point>
</region>
<state state-code=quot;09000000quot; code-source=quot;AGSquot;>Bayern 
 <georss:point>11.5725580365 48.1379548096</georss:point>
</state>
<country iso-cc=quot;DEUquot;>Deutschland</country>
</location>
<location class=quot;scopequot;>
<city city-code=quot;09162000quot; code-source=quot;AGSquot;>München 
 <georss:point>11.5725580365 48.1379548096</georss:point>
</city>
<state state-code=quot;09000000quot; code-source=quot;AGSquot;>Bayern
 <georss:point>11.5725580365 48.1379548096</georss:point>
</state>
<country iso-cc=quot;DEUquot;>Deutschland</country>
</location>
<location class=quot;scopequot;>
<city city-code=quot;08212000quot; code-source=quot;AGSquot;>Karlsruhe 
 <georss:point>8.40437796821 49.0092142029</georss:point>
</city>
<state state-code=quot;08000000quot; code-source=quot;AGSquot;>Baden-Württemberg 
 <georss:point>9.17871582656 48.7750805322</
georss:point>
</state>
<country iso-cc=quot;DEUquot;>Deutschland</country>


© 2008 Gerd Kamp                                                                                                     10
Example NITF (cont‘d)

<location class=quot;addressquot;> Grünwalder Stadion, Grünwalder Straße, München, Germany 
     <georss:point>11.566936 48.101078</georss:point>
<city>München</city>
<region>München</region>
<state>Bayern</state>
<country iso-cc=quot;DEUquot;>Deutschland</country>
</location>




© 2008 Gerd Kamp                                                                       11
Next steps / To Do

Gathering feedback
Evangelizing within main stream media organizations
How to represent best in GeoRSS , KML, ...
# multiple locations
# locations of different types and classes
Working toward a generally available ontology of geonames / a framework for
describing ontology
Investigatiing connections to /applications from
# computational geometry
# qualitative spatial reasoning
to geoname based graphs / ontologies



© 2008 Gerd Kamp                                                              12
More Info

gkamp@acm.org
http://relations.ka2.de/tag/goingplaces




© 2008 Gerd Kamp                          13

Mais conteúdo relacionado

Destaque

Map it- Geocoding and maps for local media (IFRA GoLocal, Oct. 2010)
Map it- Geocoding and maps for local media (IFRA GoLocal, Oct. 2010)Map it- Geocoding and maps for local media (IFRA GoLocal, Oct. 2010)
Map it- Geocoding and maps for local media (IFRA GoLocal, Oct. 2010)
gkamp
 
Presentation on Net4Freedom, State Secretary Hanna Hellquist
Presentation on Net4Freedom, State Secretary Hanna HellquistPresentation on Net4Freedom, State Secretary Hanna Hellquist
Presentation on Net4Freedom, State Secretary Hanna Hellquist
Carl Wettermark
 
Iria A Todo El Mundo
Iria A Todo El MundoIria A Todo El Mundo
Iria A Todo El Mundo
guest8d485e
 
Slide Show Medaka1
Slide Show Medaka1Slide Show Medaka1
Slide Show Medaka1
guest124c20
 
La Costola 2
La Costola 2La Costola 2
La Costola 2
missgh
 
Regional News in Times of iPad, Twitter & Co. (Cassini Convention, Nov. 2010)
Regional News in Times of iPad, Twitter & Co. (Cassini Convention, Nov. 2010)Regional News in Times of iPad, Twitter & Co. (Cassini Convention, Nov. 2010)
Regional News in Times of iPad, Twitter & Co. (Cassini Convention, Nov. 2010)
gkamp
 
China Trip 09 Lyrics
China Trip 09 LyricsChina Trip 09 Lyrics
China Trip 09 Lyrics
Wu Lǎoshī
 

Destaque (20)

Sosial lytting frokostseminar
Sosial lytting frokostseminarSosial lytting frokostseminar
Sosial lytting frokostseminar
 
Map it- Geocoding and maps for local media (IFRA GoLocal, Oct. 2010)
Map it- Geocoding and maps for local media (IFRA GoLocal, Oct. 2010)Map it- Geocoding and maps for local media (IFRA GoLocal, Oct. 2010)
Map it- Geocoding and maps for local media (IFRA GoLocal, Oct. 2010)
 
Cloud - The Backbone of IoT
Cloud - The Backbone of IoTCloud - The Backbone of IoT
Cloud - The Backbone of IoT
 
Presentation on Net4Freedom, State Secretary Hanna Hellquist
Presentation on Net4Freedom, State Secretary Hanna HellquistPresentation on Net4Freedom, State Secretary Hanna Hellquist
Presentation on Net4Freedom, State Secretary Hanna Hellquist
 
Lad brugerne gøre arbejdet
Lad brugerne gøre arbejdetLad brugerne gøre arbejdet
Lad brugerne gøre arbejdet
 
Prøv dig frem eller specificer dig ihjel
Prøv dig frem eller specificer dig ihjelPrøv dig frem eller specificer dig ihjel
Prøv dig frem eller specificer dig ihjel
 
Personal communications for all your facets
Personal communications for all your facetsPersonal communications for all your facets
Personal communications for all your facets
 
Iria A Todo El Mundo
Iria A Todo El MundoIria A Todo El Mundo
Iria A Todo El Mundo
 
Do it on purpose!
Do it on purpose!Do it on purpose!
Do it on purpose!
 
Slide Show Medaka1
Slide Show Medaka1Slide Show Medaka1
Slide Show Medaka1
 
Creuna designthinking
Creuna designthinkingCreuna designthinking
Creuna designthinking
 
Tænk mobilt fra starten
Tænk mobilt fra startenTænk mobilt fra starten
Tænk mobilt fra starten
 
Creuna hackathon - pictures
Creuna hackathon -  picturesCreuna hackathon -  pictures
Creuna hackathon - pictures
 
Sketching
SketchingSketching
Sketching
 
La Costola 2
La Costola 2La Costola 2
La Costola 2
 
Regional News in Times of iPad, Twitter & Co. (Cassini Convention, Nov. 2010)
Regional News in Times of iPad, Twitter & Co. (Cassini Convention, Nov. 2010)Regional News in Times of iPad, Twitter & Co. (Cassini Convention, Nov. 2010)
Regional News in Times of iPad, Twitter & Co. (Cassini Convention, Nov. 2010)
 
Zapataz Combinadas
Zapataz CombinadasZapataz Combinadas
Zapataz Combinadas
 
China Trip 09 Lyrics
China Trip 09 LyricsChina Trip 09 Lyrics
China Trip 09 Lyrics
 
NETTSTEDET DITT – SKJERP FOKUS, ØK VERDIEN
NETTSTEDET DITT – SKJERP FOKUS, ØK VERDIENNETTSTEDET DITT – SKJERP FOKUS, ØK VERDIEN
NETTSTEDET DITT – SKJERP FOKUS, ØK VERDIEN
 
Mee
MeeMee
Mee
 

Último

Último (20)

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 

Geocoding news at the source

  • 1. Geocoding news at the source Gerd Kamp dpa-infocom GmbH
  • 2. Overview Motivation # News always happen in a spatio-temporal context # you want to attach that context as metadata to the news # The illustration of news via maps is common practice since ages # but typically different from putting pins into maps Current Status # started evangelizing within dpa in Q2/06 # geocoding our regional wires since 11/07 # geocoding places of stories as well as places in stories # manual process # with support systems integrated into the editorial systems © 2008 Gerd Kamp 2
  • 3. Locations of news stories / Semantics (current status) A scope of a news story # is a geoname that is part of a (official administrative) hierarchical partition of a defined geographic extent, # representing the largest area wrt. the above hierarchy where this story is deemed relevant (by an editor) A variant of scopes are legal scopes Assigning geographic areas of relevance is something editors have been doing for ages # National wire vs. regional wire # Front section vs. local section © 2008 Gerd Kamp 3
  • 4. Locations of news stories / Semantics (current status) A locus of a news story # is a geoname that is part of a set of geonames for a defined geographic extent # representing the smallest area wrt. the above set where (the) events of this story are happening / have happened / are going to happen A place of production of a news story # is either a geoname or address or lat/lon © 2008 Gerd Kamp 4
  • 5. Location within news A location in a (news) story is a location directly or indirectly mentioned in the news story itself # typically not geographic names but rather addresses, street segments, blocks, or POIs # not all geographic entities are necessarily identified # relevance # ranking © 2008 Gerd Kamp 5
  • 6. Geonames A geographic name # is a name applied to a geographic feature. It is the proper name, specific term, or expression by which a particular geographic entity is, or was, known. A geographic entity is any relatively permanent part of the natural or manmade landscape or seascape that has recognizable identity within a particular cultural context. # A geographic name, then, may refer to any place, feature, or area on the Earth's surface, or to a related group of similar places, features, or areas. # Typically there are national bodies defining geonames # U.S. Board for Geographic Names # Ständiger Ausschuss für Geographische Namen # New players are entering the game (e.g. Geonames, YahooLocation Platform) © 2008 Gerd Kamp 6
  • 7. Hierarchical partition (current draft definition) A hierarchical partition of scopes of a geographic extent e is a directed acyclic graph (DAG) with the following properties: # There is a single source s_top (the top level scope) with a geographic extent being coterminous with the geographic extent (using coterminous as having matching boundaries interpretation # every scope has a property denoting its level in the hierarchy with the top level scope having the level 1 # for any given point p in e there is at least one corresponding scope s_point at some level in the DAG # for every scope that has more than one successor the geographic extent of set of successors is coterminous with the geographic extent of this scope # for every scope that has more than one predecessor the geographic extent of set of predecessors is coterminous with the geographic extent of this scope © 2008 Gerd Kamp 7
  • 8. Example A story about legislation in a state is assigned a statewide scope (although the dateline is the state capitol) © 2008 Gerd Kamp 8
  • 9. Example A story about an accident within A with a driver coming from B © 2008 Gerd Kamp 9
  • 10. Example (News Industry Text Format - NITF) <nitf xmlns:georss=quot;http://www.georss.org/georssquot;> <head> <title>Bayern München II schlägt Karlsruhe 3:1</title> <location class=quot;scopequot;> <region region-code=quot;09184000quot; code-source=quot;AGSquot;>München <georss:point>11.5725580365 48.1379548096</georss:point> </region> <state state-code=quot;09000000quot; code-source=quot;AGSquot;>Bayern <georss:point>11.5725580365 48.1379548096</georss:point> </state> <country iso-cc=quot;DEUquot;>Deutschland</country> </location> <location class=quot;scopequot;> <city city-code=quot;09162000quot; code-source=quot;AGSquot;>München <georss:point>11.5725580365 48.1379548096</georss:point> </city> <state state-code=quot;09000000quot; code-source=quot;AGSquot;>Bayern <georss:point>11.5725580365 48.1379548096</georss:point> </state> <country iso-cc=quot;DEUquot;>Deutschland</country> </location> <location class=quot;scopequot;> <city city-code=quot;08212000quot; code-source=quot;AGSquot;>Karlsruhe <georss:point>8.40437796821 49.0092142029</georss:point> </city> <state state-code=quot;08000000quot; code-source=quot;AGSquot;>Baden-Württemberg <georss:point>9.17871582656 48.7750805322</ georss:point> </state> <country iso-cc=quot;DEUquot;>Deutschland</country> © 2008 Gerd Kamp 10
  • 11. Example NITF (cont‘d) <location class=quot;addressquot;> Grünwalder Stadion, Grünwalder Straße, München, Germany <georss:point>11.566936 48.101078</georss:point> <city>München</city> <region>München</region> <state>Bayern</state> <country iso-cc=quot;DEUquot;>Deutschland</country> </location> © 2008 Gerd Kamp 11
  • 12. Next steps / To Do Gathering feedback Evangelizing within main stream media organizations How to represent best in GeoRSS , KML, ... # multiple locations # locations of different types and classes Working toward a generally available ontology of geonames / a framework for describing ontology Investigatiing connections to /applications from # computational geometry # qualitative spatial reasoning to geoname based graphs / ontologies © 2008 Gerd Kamp 12