The document discusses the history and development of open data portals and platforms. It notes the launch of early government open data portals like Data.gov in 2009 and the growth of the open data ecosystem. It then describes the open source CKAN platform and how the DKAN distribution was created to bring open data capabilities to the Drupal content management system, allowing governments already using Drupal to more easily publish and manage open data. The document provides an overview of DKAN's features and ongoing development areas.
15. • No vendor lock-in / choice of consultants / ability to build in-house capacity
• Collaborate w/ our peers (White House)
• Security transparency (US DoD is a major consumer for this reason)
• Open-source platforms often pay more heed to open formats and standards (e.g.: DCAT, RDFa, OData, JSON vs Shapefiles, PDF, etc.)
• Innovation: healthy open-source projects can aggregate more engineering effort than proprietary alternatives and propagate great new extensions faster
• Freedom of Hosting Options: consume as a cloud-hosted service today, change our mind and host in-house tomorrow, etc.
Why Open-Source Matters...
18. With DKAN Distro, Drupal Itself Now Also Becoming a Public Sector Data Management System (“DMS”)
19. • MATURE: >1 million sites (2% of all sites), 3,718 code commits/wk, 6,388 issue comments/wk
• IN-HOUSE SKILLS: 24% of .gov sites
• EXTENSIBLE: 18,489 Modules, 1,512 Themes, 21,009 Contributors
• FISMA-Certified Cloud Hosting Options
• INTEGRATES easily w/ public websites: lots of de facto data is already published as content
Why Drupal?
31. Open Data is Just “Sharing Your Files”
• Datasets are collections of resources, with some descriptive metadata
• Resources are just files. They can be any kind of file, but often they are CSV files, spreadsheets or some other kind of tabular data file.
• Organizations create datasets and upload resources.
• Data consumers can browse datasets and sometimes see visualizations of resources.
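The dataset/resource relationship described on this slide can be sketched as a small data model. This is only an illustration, not DKAN's or CKAN's actual schema: the class names, field names, the list of tabular formats, and the example.org URLs are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical set of formats treated as "tabular" for this sketch.
TABULAR_FORMATS = {"csv", "tsv", "xls", "xlsx"}


@dataclass
class Resource:
    """A resource is just a file: a name, a format, and where to get it."""
    name: str
    format: str
    url: str

    def is_tabular(self) -> bool:
        return self.format.lower() in TABULAR_FORMATS


@dataclass
class Dataset:
    """A dataset is a collection of resources plus descriptive metadata."""
    title: str
    description: str
    organization: str  # the organization that created the dataset
    tags: List[str] = field(default_factory=list)
    resources: List[Resource] = field(default_factory=list)

    def tabular_resources(self) -> List[Resource]:
        # Only tabular resources are candidates for previews or charts.
        return [r for r in self.resources if r.is_tabular()]


# An organization creates a dataset and uploads resources to it.
budget = Dataset(
    title="City Budget",
    description="Adopted budget by department.",
    organization="Example City Finance Office",
    tags=["budget", "finance"],
)
budget.resources.append(
    Resource("Budget table", "CSV", "https://example.org/budget.csv"))
budget.resources.append(
    Resource("Budget report", "PDF", "https://example.org/budget.pdf"))

tabular_names = [r.name for r in budget.tabular_resources()]
```

Here only the CSV resource ends up in `tabular_names`, which mirrors the point that consumers can browse every dataset but only sometimes see a visualization.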
32. DKAN
• Fully functional data portal housing datasets, Solr search, accessible via JSON and RDF; CSV or XML files uploaded through Drupal, stored in *SQL, visualized through Recline.js
• Seeks to replicate CKAN 2.0 functionality, design, standards, & API
• Reuses CKAN components wherever possible (e.g.: Recline.js)
• Built with support and input from the Open Knowledge Foundation
• Fully open project, with code on Drupal.org/project/DKAN
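The "CSV uploaded, stored in SQL, accessible via JSON" flow can be sketched roughly as follows. This is not DKAN's implementation (DKAN is a Drupal distribution); Python's standard `csv`, `sqlite3` and `json` modules stand in for the upload, datastore and API layers, and the function names, table name and sample data are hypothetical.

```python
import csv
import io
import json
import sqlite3


def load_csv_into_table(conn, table, csv_text):
    """Create a table from the CSV header and insert every data row.

    Sketch only: the table and column names are interpolated into SQL,
    so they must come from a trusted source and contain no double quotes.
    """
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)
    columns = ", ".join(f'"{name}" TEXT' for name in header)
    conn.execute(f'CREATE TABLE "{table}" ({columns})')
    placeholders = ", ".join("?" for _ in header)
    rows = list(reader)
    conn.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', rows)
    return len(rows)


def rows_as_json(conn, table):
    """Return all rows of the table as a JSON array of objects."""
    cursor = conn.execute(f'SELECT * FROM "{table}"')
    names = [col[0] for col in cursor.description]
    return json.dumps([dict(zip(names, row)) for row in cursor.fetchall()])


# Hypothetical uploaded file contents.
SAMPLE_CSV = "department,amount\nParks,1200\nLibraries,800\n"

conn = sqlite3.connect(":memory:")
inserted = load_csv_into_table(conn, "budget", SAMPLE_CSV)
payload = rows_as_json(conn, "budget")
```

Every column is stored as text here to keep the sketch short; a real datastore would normally infer or declare column types.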
45. • Adding feedback on datasets, other social features
• Support for additional file types
• Adding DKAN_DataSet & DKAN_DataStore modules to other Distros
• Refactoring DKAN_DataStore to align with new US Project Open Data, new OpenData module
• Offering enterprise support & hosted SaaS DKAN
Ongoing Development
46. Recipes
• add /data.html & /data.json pages to existing Drupal site with new Open Data Module? (sandbox project)
• add data management & publishing features to a Drupal site with DKAN Data Set & DKAN Data Store Modules
• deploy new Open Data Catalog / Portal with the DKAN Distribution
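As a rough idea of what a /data.json page contains, the sketch below serializes a list of datasets into a JSON catalog. The key names ("title", "description", "keyword", "publisher", "distribution", "accessURL", "format") are loosely modeled on the Project Open Data metadata vocabulary and should be treated as assumptions to verify against the current schema; the function name, sample dataset and example.org URL are hypothetical, and this is not the Open Data module's code.

```python
import json


def build_catalog(datasets):
    """Serialize dataset records into a data.json-style catalog string.

    Key names are illustrative; check them against the schema the
    catalog is meant to conform to before publishing anything.
    """
    catalog = []
    for ds in datasets:
        catalog.append({
            "title": ds["title"],
            "description": ds["description"],
            "keyword": ds.get("tags", []),
            "publisher": ds["organization"],
            "distribution": [
                {"accessURL": res["url"], "format": res["format"]}
                for res in ds.get("resources", [])
            ],
        })
    return json.dumps(catalog, indent=2)


# Hypothetical dataset record, as it might be read from the site's database.
sample_datasets = [
    {
        "title": "City Budget",
        "description": "Adopted budget by department.",
        "organization": "Example City Finance Office",
        "tags": ["budget", "finance"],
        "resources": [
            {"url": "https://example.org/budget.csv", "format": "csv"},
        ],
    },
]

catalog_json = build_catalog(sample_datasets)
```

Serving `catalog_json` at /data.json gives harvesters a single machine-readable listing of everything the site publishes.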