Address Day
what next after the Address Wars
Jeni Tennison - @JeniT
5 March 2015
https://openaddressesuk.org
@openaddressesuk
In economics, a public good is a good that is
both non-excludable and non-rivalrous in
that individuals cannot be effectively
excluded from use and where use by one
individual does not reduce availability to
others.
Wikipedia - Public good
"Tompkins Square Park Central Knoll" by David Shankbone - (CC BY-SA 3.0) via Wikimedia Commons
open
data
public
good
sum of what
everyone
would pay
what it costs
to maintain
When should a good be public?
Address data should be open data
● National Information Infrastructure
● Not just for posting mail...
○ geocoding for route finding
○ associating people with areas
○ classification for targeting interventions
○ linking datasets together
● Denmark has taken this step
○ 1000% increase in use of address data
○ costs = €0.2M - benefits = €14M
Current real life problems
● startup wanting to build an application
○ prohibitive costs
○ prohibitive licensing complexity
● SME with a geodemographic product
○ prohibitive costs
○ limiting customer base & growth
● New build owners
○ 3 months to register to vote, order pizza
Funding public goods
● Government via taxation
● Collaboration bound by contract
● Cross-subsidy by selling other goods
● Voluntary effort
● Social norms
"The sale of the PAF with the Royal Mail was a mistake.
Public access to public sector data must never be sold or
given away again. This type of information, like census
information and many other data sets, is very expensive
to collect and collate into useable form, but it also has
huge potential value to the economy and society as a
whole if it is kept as an open, public good."
Bernard Jenkin, Chair of Public Administration Select Committee
Hypothesis 1: the maintenance of open address
data can only be effectively funded through
taxation
Hypothesis 2: it is possible to build and maintain
a sustainable open address database using
collaboration, cross-subsidy and voluntary effort
Goals
● Free, openly licensed, up-to-date bulk
downloads of addresses
● Freemium services over that data
○ eg validation, auto-completion, geocoding
● 100% open source, collaboratively
maintained
● Initial ~£400k investment from government
○ compared with £25M annual cost maintaining PAF
Eventual Architecture
“Definitive” UK address list
- where the address data is safe to use
- where each record has confidence and provenance
Bulk
- Download
- Upload
APIs
- Add
- Sort
- Validate
- Search
URLs
- Linked data
- Extensibility
Service Providers
Aggregators, digital, telecoms, public sector, distribution, academics, manufacturers etc
Services
- Websites,
Users
Value
Revenue for sustainability
This takes time
Large datasets and inference to tackle the bulk of the challenge
“80/20” rule
Ongoing, collaborative maintenance
Targeted work. Low-volume records to fill existing gaps in available datasets
NB: dates are “just for fun”
Approaches
1. Load open datasets containing addresses
2. Build out crowdsourcing mechanisms
3. Use inference to fill gaps
and throughout:
● keep track of provenance
● keep track of confidence
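One way to picture "keep track of provenance" and "keep track of confidence" is a record that carries both alongside the address itself. The sketch below is only an illustration: the class names, field names and the 0.0 to 1.0 confidence scale are assumptions, not the Open Addresses schema.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List


@dataclass
class Provenance:
    """Where a piece of evidence came from (hypothetical field names)."""
    source: str        # e.g. "Companies House" or a contributor id
    method: str        # "loaded", "crowdsourced" or "inferred"
    recorded_on: date


@dataclass
class AddressRecord:
    """An address plus the evidence and confidence attached to it."""
    street: str
    town: str
    postcode: str
    confidence: float  # assumed scale: 0.0 (no trust) to 1.0 (full trust)
    provenance: List[Provenance] = field(default_factory=list)

    def add_evidence(self, prov: Provenance) -> None:
        """Attach another source that supports this address."""
        self.provenance.append(prov)


record = AddressRecord("St James Square", "Cheltenham", "GL50 3PR", 0.5)
record.add_evidence(Provenance("Companies House", "loaded", date(2015, 3, 5)))
```

Keeping provenance as a list rather than a single value lets the three approaches (loading, crowdsourcing, inference) each add evidence to the same address.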
Loading datasets
Third Party IPR
Possibly infected if validated
against PAF or AddressBase
⇒ most Government “open”
data is infected
A few not:
● Companies House
● err...
Platform for loading bulk data
Originally developed for OpenCorporates
Sandboxed environment for running scripts
Motivating crowdsourcing
Bulk
- Download
- Upload
APIs
- Add
- Sort
- Validate
- Search
URLs
- Linked data
- Extensibility
Value
Building Blocks
- towns, postcodes, streets
- used to parse data and provide
confidence in the address list
- links between towns, postcodes
and streets are learned from
addresses
Authoritative and definitive UK
address list
- where the address data is safe to
use
- where each record has
confidence and provenance
Revenue for sustainability
Address parsing service
● Turn free-text addresses into building blocks
● Can be used with data containing third party IPR
● Optional “contribute” option
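As a rough illustration of turning a free-text address into building blocks, the sketch below splits a comma-separated address into premises, street, town and postcode. It is not the Open Addresses parser: the function name, the simplified postcode pattern and the "last part is the town" rule are all assumptions made for the example.

```python
import re

# Simplified UK postcode pattern (an assumption; it ignores special cases).
POSTCODE_RE = re.compile(
    r"\b([A-Z]{1,2}\d[A-Z\d]?)\s*(\d[A-Z]{2})\b", re.IGNORECASE
)


def parse_address(text):
    """Split a comma-separated free-text address into rough building blocks."""
    postcode = None
    match = POSTCODE_RE.search(text)
    if match:
        postcode = f"{match.group(1).upper()} {match.group(2).upper()}"
        # Remove the postcode so it is not mistaken for the town.
        text = text[:match.start()] + text[match.end():]
    parts = [p.strip() for p in text.split(",") if p.strip()]
    # Naive assumption: last remaining part is the town, the one before it the street.
    town = parts[-1] if parts else None
    street = parts[-2] if len(parts) >= 2 else None
    premises = ", ".join(parts[:-2]) or None
    return {"premises": premises, "street": street, "town": town, "postcode": postcode}


parsed = parse_address("St James House, St James Square, Cheltenham, GL50 3PR")
```

A real service would check each block against known towns, streets and postcodes rather than rely on position alone.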
Inference
Fogralea, ZE1 0SE (© Open Addresses Ltd.)
[Figure: Fogralea, ZE1 0SE, showing house numbers 7 to 29 (odd) and 6 to 28 (even)]
What about nos. 1 to 4? Same postcode? We cannot know!
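The Fogralea example can be read as gap-filling: if some house numbers on a street are known to share a postcode, the numbers between them probably do too, while numbers outside the known range stay unknown. The sketch below assumes that reading; the function name and the rule of treating odd and even numbers separately are illustrative choices, not the deck's stated algorithm.

```python
def infer_between(known_numbers):
    """Infer house numbers lying between known ones of the same parity.

    Numbers below the lowest or above the highest known number are never
    guessed, which mirrors "What about nos. 1 to 4? We cannot know!".
    """
    known = set(known_numbers)
    inferred = set()
    for parity in (0, 1):
        same_side = sorted(n for n in known if n % 2 == parity)
        if len(same_side) < 2:
            continue  # nothing to interpolate between
        full_run = range(same_side[0], same_side[-1] + 1, 2)
        inferred.update(n for n in full_run if n not in known)
    return sorted(inferred)


# Assume only the end houses on each side of Fogralea are known.
gaps = infer_between({7, 29, 6, 28})
```

Inferred addresses would presumably carry lower confidence than observed ones and a provenance entry marking them as inferred.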
Enabling collaborative maintenance
St James House, St James Square, Cheltenham, GL50 3PR
7, St James Square, Cheltenham, GL50 3PT
St James North 1, St James Square, Cheltenham, GL50 3PR
St James North 3, St James Square, Cheltenham, GL50 3PR
3, St James Square, Cheltenham, GL50 3PR
St James House, St James Square, Cheltenham Spa, GL50 3PR
St James North 1, St James Square, Cheltenham, GL50 3PR
St James Place, Jessop Avenue, Cheltenham, GL50 3PR
St James House, St James Square, Cheltenham, GL50 3PR
Apt. 3, St James Place, Jessop Avenue, Cheltenham, GL50 3PR
56, Cheltenham Road, London, SE15 3AR
Calculating confidence
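A list like this can be reduced to postcode sector and town pairs, which is where Count and Total figures for each sector could come from. The sketch below is an assumption about that counting step, using a handful of town and postcode pairs taken from the list; the function names are hypothetical.

```python
from collections import Counter

# (town, postcode) pairs taken from the address list above.
addresses = [
    ("Cheltenham", "GL50 3PR"),
    ("Cheltenham", "GL50 3PT"),
    ("Cheltenham Spa", "GL50 3PR"),
    ("London", "SE15 3AR"),
]


def sector(postcode):
    """Postcode sector: the outward code plus the first digit of the inward code."""
    outward, inward = postcode.split()
    return f"{outward} {inward[0]}"


def sector_town_counts(pairs):
    """Return {(sector, town): (count, total addresses seen in that sector)}."""
    counts = Counter((sector(pc), town.upper()) for town, pc in pairs)
    totals = Counter()
    for (sec, _town), n in counts.items():
        totals[sec] += n
    return {key: (n, totals[key[0]]) for key, n in counts.items()}


table = sector_town_counts(addresses)
```

Here "Cheltenham Spa" shows up as a minority spelling within sector GL50 3, the kind of disagreement a confidence score is meant to surface.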
Calculating confidence
Sector   Town             Count   Total   Confidence
...
HD3 4    HUDDERSFIELD        66      66       87.71%
...
DG8 6    NEWTON STEWART      11      12       65.69%
DG8 6    STRANRAER            1      12        0.00%
DG8 7    NEWTON STEWART       1       1        0.00%
...
W3 6     LONDON             196     196       92.96%
...
CH44 4   WALLASEY            23      29       76.06%
CH44 4   WIRRAL               6      29        8.22%
This postcode/town association is right but confidence is low because of the low count
This postcode/town association is incorrect
Another correct postcode/town association, but with a higher count
This is what happens when post towns are re-organised; Wirral is now split into Birkenhead, Wallasey, Wirral and Prenton
This is what a correct postcode/town association looks like
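The deck does not give the formula behind the Confidence column. As a stand-in that behaves in a similar way (high agreement with many observations scores well, a single observation scores poorly), the sketch below uses the Wilson score lower bound for a proportion. This is a swapped-in technique, not the Open Addresses method, and it will not reproduce the percentages in the table.

```python
import math


def wilson_lower_bound(count, total, z=1.96):
    """Lower bound of the Wilson score interval for count/total.

    z=1.96 corresponds to a 95% interval; this value is an assumption.
    """
    if total == 0:
        return 0.0
    p = count / total
    denom = 1 + z * z / total
    centre = p + z * z / (2 * total)
    margin = z * math.sqrt(p * (1 - p) / total + z * z / (4 * total * total))
    return (centre - margin) / denom


# Same (count, total) pairs as rows in the table; the scores will differ from it.
scores = {
    "W3 6 LONDON": wilson_lower_bound(196, 196),
    "HD3 4 HUDDERSFIELD": wilson_lower_bound(66, 66),
    "DG8 7 NEWTON STEWART": wilson_lower_bound(1, 1),
    "DG8 6 STRANRAER": wilson_lower_bound(1, 12),
}
```

Any scoring rule with the same shape would do for illustration; the point is that agreement alone is not enough and the number of observations has to count as well.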
Provenance
Summary
● Built most of the supporting platform
○ parsing free text / messy addresses
○ collaborative loading of data
○ providing downloads, search & URL identity
○ recording provenance & assigning confidence
○ using inference to fill in gaps
● We have low numbers of addresses currently
○ but the right mechanisms to add more
○ and many potential partners
What next?
● Building the platform
● Building the community of collaborators
● Building services to aid cross-subsidy
● Increasing quantity & quality of addresses
● Can anyone else reuse the technology?
● Can anyone else reuse the approach?
Any Questions?
@JeniT - jeni.tennison@openaddressesuk.org
https://openaddressesuk.org
info@openaddressesuk.org
@openaddressesuk
Open Addresses Ltd. is a new company being set
up to create and maintain an address database
for the UK that will be made available to the
public as Open Data. It will facilitate the
collaborative maintenance of the address
database with various stakeholders from the UK
Government, industry and non-profit.
Offices
Where?

BCS Address Day - Open Addresses
