4. bad for official sources
bad for users
often adds little value
bad for data quality
5. example proprietary data quality issues
problem: cause
data accuracy: data is re-keyed; few eyeballs; often little downside to lying
gaps in data: high (& often duplicated) cost of data entry; limited to payers
lack of granularity: legacy systems/data models hard to re-engineer in closed world
errors go uncorrected: few feedback mechanisms
black box/no provenance: can’t reveal (sometimes dubious) sources; limits usefulness/trust
isolated: proprietary IDs are internal identifiers & are barriers to sharing & improved data quality
6. in a data-driven world we need company data…
with provenance
with clarity
in a form that can be combined with other data
7. Mission: every public company data item in the world matched to the company
• Legal Entities: the critical underpinning of the corporate world
• De-siloing data from official corporate registers, and other government data, especially regulatory
• Linking critical (and previously obscure) datasets
• Line of sight to primary data: transparency and trust
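One way to picture "matched to the company" with "line of sight to primary data" is the minimal sketch below. It assumes a legal entity can be keyed on a jurisdiction code plus the register's own company number, and that each data item carries its source. The names `EntityKey`, `DataItem`, `make_item` and `group_by_entity`, and all of the sample values, are hypothetical and for illustration only; they are not taken from the slides.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, Iterable, List


@dataclass(frozen=True)
class EntityKey:
    """Hypothetical open identifier: jurisdiction code + register number."""
    jurisdiction: str
    company_number: str


@dataclass(frozen=True)
class DataItem:
    """One data item, matched to a legal entity and carrying its provenance."""
    entity: EntityKey
    kind: str
    value: str
    source: str      # provenance: which primary source the item came from
    retrieved: str   # ISO date the item was retrieved


def make_item(jurisdiction: str, company_number: str, kind: str,
              value: str, source: str, retrieved: str) -> DataItem:
    # Normalise the identifier so the same company matches across datasets
    # (assumed rule: trim whitespace, lower-case jurisdiction, upper-case number).
    key = EntityKey(jurisdiction.strip().lower(), company_number.strip().upper())
    return DataItem(key, kind, value, source, retrieved)


def group_by_entity(items: Iterable[DataItem]) -> Dict[EntityKey, List[DataItem]]:
    # Combine items from different datasets under the legal entity they describe.
    grouped: Dict[EntityKey, List[DataItem]] = defaultdict(list)
    for item in items:
        grouped[item.entity].append(item)
    return dict(grouped)


# Made-up sample records from two made-up sources.
items = [
    make_item("GB", " 01234567", "status", "active", "company register", "2015-01-01"),
    make_item("gb", "01234567 ", "notice", "winding-up order", "official gazette", "2015-01-10"),
    make_item("gb", "07654321", "status", "active", "company register", "2015-01-01"),
]
grouped = group_by_entity(items)
```

Because the key is built from public register identifiers rather than an internal ID, items from separate datasets land under the same entity, and each one still says where it came from.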
8.
9.
10. Why?
• Utility - or rather a lack of…
Gazettes predate the web
Published in unstructured, unclassified, digitally antagonistic formats…
undiscoverable, unsearchable, barely browsable, let alone comparable!
11. What?
• Official public notices
Critical company events: liquidation, dissolution, winding-up orders, mergers, AGMs etc.
Signals: often precede changes in company registers by up to 4 weeks