www.adequate.at
Workshop on Quality Assessment and
Improvements on Open Data (Portals)
opendata.ch conference, 14.6.2016, 12:45 - 14:00 CEST
Lausanne, Casino de Montbenon, Allée Ernest-Ansermet 3
Slides published CC-BY AT 3.0
Jürgen Umbrich
Vienna University of
Economics and Business
juergen.umbrich@wu.ac.at
Johann Höchtl
Donau-Universität Krems
johann.hoechtl@donau-uni.ac.at
Martin Kaltenböck
Semantic Web Company
m.kaltenboeck@semantic-web.at
Agenda
Time Session Remarks
20’ incl q&a Welcome & Introduction
● WS Objectives, Agenda & WS Team
● Participants
● The ADEQUATe project: basics, objectives, status & outlook
Martin Kaltenböck (SWC)
20’ incl q&a Results of Requirements Elicitation, DQ Metrics and Interaction items
● What do the users want?
● What are the most “important” ones? What are metrics specifically targeting
openness?
● Why data portal quality interaction items with end users and what do we
plan to do in ADEQUATe?
Johann Höchtl (DUK)
20’ incl q&a Best Practice & the ADEQUATe OD Framework
● Data & CSV on the web working group recommendations (W3C)
● AD Framework: architecture & components
Jürgen Umbrich (WU)
15’ open discussion Interactive & open discussion on DQ issues:
● Requirements for DQ in Open Data
● What is in place or planned for DQ
Moderated by the WS Team
FFG Project
http://www.adequate.at
The ADEQUATe project is funded by the Austrian Federal Ministry for Transport, Innovation and Technology under the RTI programme "ICT of the Future" and is administered by the Austrian Research Promotion Agency (FFG) [project number: 849982].
What is ADEQUATe?
ADEQUATe Open Data: Analytics & Data Enrichment to improve the QUAliTy of Open Data builds on two observations:
● An increasing amount of Open Data is becoming available as an important resource for emerging businesses.
● Integrating such open, freely re-usable data sources into organisations’ data warehouse and data management systems is seen as a key success factor for competitive advantage in a data-driven economy.
The project identifies crucial issues that have to be tackled to fully exploit the value of open data and to integrate it efficiently with other data sources:
● the overall quality issues with metadata and the data itself
● the lack of interoperability between data sources
The project's approach is to address these points at an early stage, when the open data is freshly provided by governmental organisations or others.
✓ 3 Partners:
1. Semantic Web Company
2. Danube University Krems
3. Vienna University of Economics and Business
✓ 30-month project duration, Oct. 2015 - March 2018
✓ 2 Use Case Partners: data.gv.at & opendataportal.at
✓ Objective: Improvement of Data Quality through:
○ Quality Assessment and Monitoring
○ Automatic Algorithms
○ Making use of Linked Data principles
○ Improvements of the data by the user (community)
Project Structure & Schedule
WP1 - Requirements & Specification
WP2 - Quality Improvement & Monitoring Framework
WP3 - Algorithms & Tools for Quality Improvements
WP4 - Data Linkage
WP5 - Community driven Quality Improvements
WP6 - Use Case Integration
WP7 - Project Management & Dissemination
Outlook & Timing of Results
M30 (03/2018): Evaluation, Refinements, Improvements
M21 (06/2017): Quality improvements; Use case connection
M15 (12/2016): Quality monitoring framework; Data linkage
M10 (07/2016): Architecture Blueprint
M9 (06/2016): Quality metrics; Requirements
Concrete Outputs & Outlook
✓ End of June 2016: 3 Deliverables
○ State of the Art
○ Requirements Elicitation
○ Quality Metrics
✓ End of July 2016: 1 Deliverable
○ Architecture Blueprint
○ All components specified
✓ End of 2016: ADEQUATe Framework - 1st release
○ Assessment & Monitoring Framework
○ Data Quality Algorithms & Tools
○ Linked Data Mechanisms
○ 1st set of user driven Mechanisms
✓ Early 2017: Dock onto ODP & data.gv.at
Requirements Elicitation, DQ metrics and Interaction items
Results of Requirements Elicitation
Contents and Formats
○ I would really prefer to have the data themselves consistent. [...] metadata does not
match; standards regarding the representation of their content
○ It would be really great if we could shift somehow to UTF-8
○ meta data for CSV files were incomplete [...] header for CSV was missing
○ no static identifiers for objects in data sets. This in turn leads to problems if you want
to track changes related to these objects over time
Communication
○ central communication point for exchanging experiences and issues
○ Meta data should be written in English language
Reliability
○ Servers are restarted every day [...] hosted data becomes unavailable
DQ metrics (1)
Completeness
● Metadata completeness: How many (mandatory) metadata keys have values?
● Table completeness: How many (CSV) cells have non-null values?
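Both completeness metrics can be read as simple ratios. The following is a minimal sketch, not the ADEQUATe implementation: the function names, the choice of mandatory keys, and the set of null markers are assumptions made for illustration.

```python
import csv
import io


def metadata_completeness(metadata, mandatory_keys):
    """Share of mandatory keys that carry a non-empty value (0.0 to 1.0)."""
    if not mandatory_keys:
        return 1.0
    empty_values = (None, "", [], {})
    filled = sum(1 for key in mandatory_keys if metadata.get(key) not in empty_values)
    return filled / len(mandatory_keys)


def table_completeness(csv_text, null_markers=("", "NA", "null")):
    """Share of data cells (header row excluded) that are not a null marker."""
    rows = list(csv.reader(io.StringIO(csv_text)))[1:]
    cells = [cell.strip() for row in rows for cell in row]
    if not cells:
        return 1.0
    return sum(1 for cell in cells if cell not in null_markers) / len(cells)


# Hypothetical metadata record and CSV content, for illustration only.
example_metadata = {"title": "Trees", "license": "", "publisher": "City"}
example_keys = ["title", "license", "publisher", "modified"]
example_csv = "id,name\n1,a\n2,\n"
```

Which keys count as mandatory depends on the portal's metadata schema, so the key list is passed in rather than hard-coded.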
Timeliness
● Tau of Data: How “outdated” are datasets based on the promised update
frequency
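The slide does not give a formula for this metric. As a rough sketch, one could assume a plain ratio: time elapsed since the last update divided by the promised update interval, so that values above 1 indicate an overdue dataset. The function name and this definition are assumptions, not the project's definition.

```python
from datetime import datetime, timedelta


def staleness(last_modified, update_interval, now):
    """Elapsed time since the last update, in units of the promised interval.

    A value above 1.0 means the dataset is overdue under this assumed definition.
    """
    if update_interval <= timedelta(0):
        raise ValueError("update_interval must be positive")
    return (now - last_modified) / update_interval
```

For example, a dataset promised monthly (30 days) and last modified 60 days ago would score 2.0.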
DQ metrics (2)
Machine readability
● Regularity of CSV-files (CSV-Lint), RDF, ...
● Structural consistency - variations in structure of CSV files
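One simple form of structural check is to look for records whose number of fields differs from the rest of the file. The sketch below is an illustration under that assumption; it is not CSV-Lint and the function name is hypothetical.

```python
import csv
import io
from collections import Counter


def irregular_records(csv_text, delimiter=","):
    """Return 1-based record numbers whose field count differs from the most common one."""
    reader = csv.reader(io.StringIO(csv_text), delimiter=delimiter)
    records = [(number, row) for number, row in enumerate(reader, start=1) if row]
    if not records:
        return []
    # The most frequent width is taken as the expected table width.
    expected_width = Counter(len(row) for _, row in records).most_common(1)[0][0]
    return [number for number, row in records if len(row) != expected_width]
```

Blank lines are skipped rather than reported; whether they should count as irregular is a design choice left open here.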
Openness
● Open formats - no well-defined definition of what constitutes an open format
● Open licenses - opendefinition.org seems to have them all covered
Persistence
DQ Metrics - Persistence?
ADEQUATe: 11 Dimensions & 46 Metrics
Contributors to DQ improvement
Publishers, Community, Algorithms & Linked Data
Contributors to DQ Improvements (1/2)
● Providers
○ Correctness and Completeness of Data and Metadata
○ SLAs governing availability
○ Readiness for feedback, discussion and interaction
● Algorithms
○ Automated improvements
■ Availability checks and reporting
■ Missing information, outliers
■ Check of format (valid UTF-8?) and size
■ Data format conversions: CSV → CSV on the web specification
○ Semi-automated Improvements and Enhancements
■ Identification of related data sets
■ Mapping of (data) attributes, ...
● Interaction with the Data Community
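Of the automated checks listed above, the encoding check is the easiest to illustrate. A minimal sketch, assuming the file content is already available as bytes; the function name is hypothetical.

```python
def utf8_error(data):
    """Return None if the bytes are valid UTF-8, else a short description of the first problem."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as exc:
        return f"invalid byte at offset {exc.start}: {exc.reason}"
    return None
```

A name such as "Höchtl" saved as Latin-1 fails this check at the byte for "ö", which is a typical way mixed encodings show up in published CSV files.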
Interaction: Data Community
● Control the results of automated enhancements
○ Interlinking
○ format conversions
○ encodings
● Correct mistakes and report mistakes
● Data enrichment and transformations
https://open.wien.gv.at/site/riesenbaum-in-wien-entdeckt/#more-87184
Interaction: Forking: Identify - Improve - Share
[Figure: three copies of a two-row table. One copy reads (1, 47, 11) and (2, 48, 15); in the other two copies a single cell differs: (2, 48, 151) and (2, 47, 15).]
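The "identify" step of forking can be sketched as a cell-by-cell comparison between a fork and the table it was forked from. The values below are the ones shown on the slide; which copy is the original is an assumption, and the function name is hypothetical.

```python
def cell_diff(original, fork):
    """List (row, column, old, new) for every cell that differs between two equally shaped tables."""
    same_shape = len(original) == len(fork) and all(
        len(row_a) == len(row_b) for row_a, row_b in zip(original, fork)
    )
    if not same_shape:
        raise ValueError("tables must have the same shape")
    return [
        (r, c, old, new)
        for r, (row_a, row_b) in enumerate(zip(original, fork))
        for c, (old, new) in enumerate(zip(row_a, row_b))
        if old != new
    ]


# Assumed roles: the first table is the original, the other two are forks.
original = [[1, 47, 11], [2, 48, 15]]
fork_a = [[1, 47, 11], [2, 48, 151]]
fork_b = [[1, 47, 11], [2, 47, 15]]
```

Row and column indices are 0-based. A real merge step would also need to handle added or removed rows, which this sketch deliberately rejects.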
Making results tangible
https://github.com/antontarasenko/gpq/blob/master/notebooks/contracts_intro.ipynb
Government Procurement Queries project
US Government contracts 2000 - 2016 (USAspending.gov)
The ADEQUATe OD Framework & publishing CSVs for humans and machines
The ADEQUATe Framework
● The ADEQUATe framework offers:
○ quality assessment and monitoring
○ a set of data quality improvement algorithms
○ a set of algorithms to create and maintain a knowledge graph and “link” data into this graph
■ Think about shared identifiers for addresses, companies, departments, parties, ...
○ community involvement ( e.g., data editors, feedback loops, forking & merging)
● Main objectives:
○ all developed components will be Open Source (see the ADEQUATe GitHub repo)
○ components should be usable as standalone components
■ Use only what you need
● Core Components
1. Data monitoring
2. Knowledge Vault
3. Quality Assessment
4. Quality Improvement
5. Data Linkage
6. Community Improvement
7. UI, API & User authentication
[Figure: architecture diagram. Labels: Users; Clients (data.gv.at, ODP); Authentication / Load Balancing / UI; Public API catalog; Orchestration / API; (Meta)Data Monitor; Knowledge Vault; Quality Assessment; Quality Improvement; Linkage; Community Improvement. Legend: RESTful API, Component, Data.]
W3C CSV on the Web & ADEQUATe
One core feature of ADEQUATe will be the use of the CSV on the Web metadata standard, which makes it possible to:
➢ describe CSV files
○ used dialect & encoding
○ table & column descriptions (with language tags)
○ data types and value ranges for columns
➢ add semantics to it
○ primary & foreign key, URIs, entity types, ...
➢ validate CSV files against a predefined schema
➢ specify the transformation
○ CSV -> JSON or RDF
W3C CSV on the Web: Metadata standard
W3C CSV on the Web: Example (JSON-LD) 1/3
{
"@context": ["http://www.w3.org/ns/csvw", {"@language": "en"}],
"url": "http://data.mumok.at/exhibition.csv",
"dc:title": "Exhibitions for objects from the mumok collection",
"dcat:keyword": ["art", "museum", "exhibition"],
"dc:publisher": {
"schema:name": "mumok - museum moderner kunst stiftung ludwig wien",
"schema:url": {"@id": "http://www.mumok.at"}
},
"dc:license": {"@id": "https://creativecommons.org/licenses/by/3.0/at/legalcode"},
"dc:modified": {"@value": "2015-07-04", "@type": "xsd:date"},
….
W3C CSV on the Web: Example (JSON-LD) 2/3
"dialect": {
"encoding": "utf-8", "lineTerminators": ["\r\n", "\n"],
"quoteChar": "\"", "doubleQuote": true,
"skipRows": 0, "commentPrefix": "#",
"header": true, "headerRowCount": 1,
"delimiter": ",",
"skipColumns": 0,
"skipBlankRows": false,
"skipInitialSpace": false,
"trim": false
},
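To show how a consumer might use such a dialect description, the sketch below maps a few of its properties onto Python's standard `csv` reader. The mapping, the function name, and the sample input are assumptions for illustration; it covers only `skipRows`, `commentPrefix`, `delimiter`, `quoteChar`, `doubleQuote`, `skipInitialSpace` and `header`, and it is not a conforming CSV on the Web parser.

```python
import csv


def read_with_dialect(text, dialect):
    """Parse CSV text using a subset of a CSVW-style dialect dict; returns (header, rows)."""
    lines = text.splitlines(keepends=True)[dialect.get("skipRows", 0):]
    prefix = dialect.get("commentPrefix")
    if prefix:
        # Simplification: a quoted multi-line field starting with the prefix would be dropped too.
        lines = [line for line in lines if not line.startswith(prefix)]
    reader = csv.reader(
        lines,
        delimiter=dialect.get("delimiter", ","),
        quotechar=dialect.get("quoteChar", '"'),
        doublequote=dialect.get("doubleQuote", True),
        skipinitialspace=dialect.get("skipInitialSpace", False),
    )
    rows = list(reader)
    if dialect.get("header", True) and rows:
        return rows[0], rows[1:]
    return [], rows


# Dialect values taken from the slide; the CSV text itself is made up.
slide_dialect = {
    "delimiter": ",", "quoteChar": "\"", "doubleQuote": True,
    "skipRows": 0, "commentPrefix": "#", "header": True, "skipInitialSpace": False,
}
sample_text = '# exported file\nexhibition_id,city\n1,"Wien, Austria"\n'
```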
W3C CSV on the Web: Example (JSON-LD) 3/3
"tableSchema": {
"columns": [{
"name": "exhibition_id",
"titles": "Exhibition Identifier",
"dc:description": "A unique identifier for the exhibition.",
"datatype": "integer",
"required": true
}, {
"name": "city",
"titles": "City",
"dc:description": "The city in which the exhibition took place (no language defined, mostly in German).",
"datatype": "string"
}
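A column description like the one above can drive a simple validation pass. The following sketch checks only the `required` flag and the `integer` datatype for rows given as dictionaries (for example from `csv.DictReader`); the function name and sample rows are hypothetical, and a full validator would cover many more datatypes and constraints.

```python
import re

# Subset of the column definitions shown on the slide.
table_schema = {
    "columns": [
        {"name": "exhibition_id", "datatype": "integer", "required": True},
        {"name": "city", "datatype": "string"},
    ]
}


def validate_rows(rows, schema):
    """Return a list of human-readable error messages; an empty list means no problems found."""
    errors = []
    for row_number, row in enumerate(rows, start=1):
        for column in schema["columns"]:
            name = column["name"]
            value = (row.get(name) or "").strip()
            if not value:
                if column.get("required", False):
                    errors.append(f"row {row_number}: '{name}' is required")
                continue
            if column.get("datatype") == "integer" and not re.fullmatch(r"[+-]?[0-9]+", value):
                errors.append(f"row {row_number}: '{name}' is not an integer: {value!r}")
    return errors


# Made-up rows for illustration.
sample_rows = [
    {"exhibition_id": "12", "city": "Wien"},
    {"exhibition_id": "", "city": "Graz"},
    {"exhibition_id": "x7", "city": ""},
]
```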
W3C CSV on the Web: Discovery
● Registered content type: application/csvm+json
● 3 discovery mechanisms
○ File extension
■ http://data.mumok.at/exhibition.csv -> http://data.mumok.at/exhibition.csv-metadata.json
○ Well-known location
■ /.well-known/csvm
○ LINK HTTP Header
» curl -I http://data.mumok.at/exhibition.csv
HTTP/1.1 200 OK
Date: Thu, 26 Nov 2015 22:18:47 GMT
Server: Apache/2.2.22 (Debian)
….
Content-Length: 112723
Content-Type: text/csv; charset=utf-8; header=present
Link: </exhibition.csv-metadata.json>; rel=describedBy; type=application/csvm+json
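The three discovery mechanisms can be sketched as a function that lists the places a client might look for metadata, given the CSV URL and an optional `Link` header value. The function name and the order of candidates are assumptions; the Link-header parsing is deliberately simple and does not handle every valid header form.

```python
import re
from urllib.parse import urljoin


def candidate_metadata_urls(csv_url, link_header=None):
    """Return de-duplicated candidate metadata locations, in the order they were found."""
    candidates = []
    if link_header:
        # Each link looks like: <target>; rel=...; type=...
        for target, params in re.findall(r"<([^>]*)>([^,]*)", link_header):
            if re.search(r'rel\s*=\s*"?describedby"?', params, re.IGNORECASE):
                candidates.append(urljoin(csv_url, target))
    # File-extension convention: append "-metadata.json" to the CSV URL.
    candidates.append(csv_url + "-metadata.json")
    # Site-wide well-known location.
    candidates.append(urljoin(csv_url, "/.well-known/csvm"))
    return list(dict.fromkeys(candidates))


slide_link_header = "</exhibition.csv-metadata.json>;rel=describedBy; type=application/csvm+json"
```

With the header from the `curl` output above, the Link target and the file-extension convention resolve to the same URL, so it appears only once in the result.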
CSV on the Web Summary
● Don’t publish CSV on the Web only for humans; publish it for machines too
○ e.g., avoid raw Excel exports
● RFC 4180
● Encoding
○ Use UTF-8, don’t mix encodings
● File extension: .csv
● Content-type: text/csv
Optional, but a big improvement:
● Ideally, publish CSV metadata alongside your CSV file
● Avoid acronyms or coded values (e.g., sex=1,2,3)
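On the publishing side, the recommendations above roughly translate into: one header row, comma-separated fields, CRLF line endings in the style of RFC 4180, and UTF-8 bytes. A minimal sketch using Python's standard `csv` module; the function name and sample data are made up.

```python
import csv
import io


def to_csv_bytes(header, rows):
    """Serialise a header and rows as comma-separated text with CRLF line ends, encoded as UTF-8."""
    buffer = io.StringIO(newline="")
    writer = csv.writer(
        buffer,
        delimiter=",",
        quotechar='"',
        quoting=csv.QUOTE_MINIMAL,  # quote only fields that need it
        lineterminator="\r\n",
    )
    writer.writerow(header)
    writer.writerows(rows)
    return buffer.getvalue().encode("utf-8")


# Spelled-out values instead of codes such as 1, 2, 3.
sample_bytes = to_csv_bytes(["name", "sex"], [["Höchtl", "male"]])
```

The result would then be served with the `.csv` extension and `Content-Type: text/csv; charset=utf-8`.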
● CSV URLs
● CSVs link to other CSVs
● CSVs link to other resources
● RDF and JSON conversion
REFERENCES
● CSV on the Web Working Group
● CSV on the Web Community Group
● CSV on the Web Github Repository
● Tabular Data on the Web - An Introduction to CSV on the Web (Slides)
● Implementing CSV on the Web (Gregg Kellogg)
Announcements & Pointers
@adequate_od
17-19 May 2017
Danube University Krems
30.8.-02.09.2016, Helsinki
Contact
Jürgen Umbrich
Vienna University of Economics and Business
Juergen.umbrich @ wu.ac.at
Short CV:https://www.wu.ac.at/en/infobiz/team/umbrich/
Johann Höchtl
Donau-Universität Krems
Johann.hoechtl @ donau-uni.ac.at
Short CV: https://at.linkedin.com/in/johannhoechtl
http://adequate.at/ http://vienna.theodi.org
Martin Kaltenböck
Semantic Web Company
m.kaltenboeck@semantic-web.at
Short CV: https://www.linkedin.com/in/martinkaltenboeck

Mais conteúdo relacionado

Mais procurados

Grant Funding Programme
Grant Funding ProgrammeGrant Funding Programme
Grant Funding ProgrammeJisc RDM
 
Text mining and machine learning
Text mining and machine learningText mining and machine learning
Text mining and machine learningJisc RDM
 
Demonstration of the 4C cost comparison tool
Demonstration of the 4C cost comparison toolDemonstration of the 4C cost comparison tool
Demonstration of the 4C cost comparison toolJisc RDM
 
Recognising data sharing
Recognising data sharingRecognising data sharing
Recognising data sharingJisc RDM
 
Show me the money - the long path to a sustainable RDM Facility
Show me the money - the long path to a sustainable RDM FacilityShow me the money - the long path to a sustainable RDM Facility
Show me the money - the long path to a sustainable RDM FacilityJisc RDM
 
A discovery service for UK research data
A discovery service for UK research dataA discovery service for UK research data
A discovery service for UK research dataJisc RDM
 
Jisc support for equipment sharing - update for S-Lab Rothamsted conference J...
Jisc support for equipment sharing - update for S-Lab Rothamsted conference J...Jisc support for equipment sharing - update for S-Lab Rothamsted conference J...
Jisc support for equipment sharing - update for S-Lab Rothamsted conference J...Martin Hamilton
 
Bristol's Research Data Service - Debra Hiom - Jisc Digital Festival 2014
Bristol's Research Data Service - Debra Hiom - Jisc Digital Festival 2014Bristol's Research Data Service - Debra Hiom - Jisc Digital Festival 2014
Bristol's Research Data Service - Debra Hiom - Jisc Digital Festival 2014Jisc
 
DMPOnline by Sarah Jones
DMPOnline by Sarah JonesDMPOnline by Sarah Jones
DMPOnline by Sarah JonesJisc RDM
 
Research data spring: filling in the digital preservation gap
Research data spring: filling in the digital preservation gapResearch data spring: filling in the digital preservation gap
Research data spring: filling in the digital preservation gapJisc RDM
 
Data sharing in the Netherlands
Data sharing in the NetherlandsData sharing in the Netherlands
Data sharing in the NetherlandsJisc RDM
 
Business cases and costs RDN
Business cases and costs RDNBusiness cases and costs RDN
Business cases and costs RDNJisc RDM
 
From Box to Hydra via Archivematica
From Box to Hydra via ArchivematicaFrom Box to Hydra via Archivematica
From Box to Hydra via ArchivematicaJisc RDM
 
Research data spring - Jisc Digital Festival 2015
Research data spring - Jisc Digital Festival 2015Research data spring - Jisc Digital Festival 2015
Research data spring - Jisc Digital Festival 2015Jisc
 
Business case and cost modelling for an end-to-end RDM service
Business case and cost modelling for an end-to-end RDM serviceBusiness case and cost modelling for an end-to-end RDM service
Business case and cost modelling for an end-to-end RDM serviceJisc RDM
 
Jisc Research Data Discovery Service Project
Jisc Research Data Discovery Service ProjectJisc Research Data Discovery Service Project
Jisc Research Data Discovery Service ProjectJisc RDM
 
Journal research data policy update
Journal research data policy updateJournal research data policy update
Journal research data policy updateJisc RDM
 
Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016Jisc RDM
 
Jisc Research Data Shared Service - Spring Update
Jisc Research Data Shared Service - Spring UpdateJisc Research Data Shared Service - Spring Update
Jisc Research Data Shared Service - Spring UpdateJisc RDM
 

Mais procurados (20)

Grant Funding Programme
Grant Funding ProgrammeGrant Funding Programme
Grant Funding Programme
 
Text mining and machine learning
Text mining and machine learningText mining and machine learning
Text mining and machine learning
 
Demonstration of the 4C cost comparison tool
Demonstration of the 4C cost comparison toolDemonstration of the 4C cost comparison tool
Demonstration of the 4C cost comparison tool
 
Recognising data sharing
Recognising data sharingRecognising data sharing
Recognising data sharing
 
Show me the money - the long path to a sustainable RDM Facility
Show me the money - the long path to a sustainable RDM FacilityShow me the money - the long path to a sustainable RDM Facility
Show me the money - the long path to a sustainable RDM Facility
 
A discovery service for UK research data
A discovery service for UK research dataA discovery service for UK research data
A discovery service for UK research data
 
Jisc support for equipment sharing - update for S-Lab Rothamsted conference J...
Jisc support for equipment sharing - update for S-Lab Rothamsted conference J...Jisc support for equipment sharing - update for S-Lab Rothamsted conference J...
Jisc support for equipment sharing - update for S-Lab Rothamsted conference J...
 
Bristol's Research Data Service - Debra Hiom - Jisc Digital Festival 2014
Bristol's Research Data Service - Debra Hiom - Jisc Digital Festival 2014Bristol's Research Data Service - Debra Hiom - Jisc Digital Festival 2014
Bristol's Research Data Service - Debra Hiom - Jisc Digital Festival 2014
 
DMPOnline by Sarah Jones
DMPOnline by Sarah JonesDMPOnline by Sarah Jones
DMPOnline by Sarah Jones
 
RDA UK
RDA UKRDA UK
RDA UK
 
Research data spring: filling in the digital preservation gap
Research data spring: filling in the digital preservation gapResearch data spring: filling in the digital preservation gap
Research data spring: filling in the digital preservation gap
 
Data sharing in the Netherlands
Data sharing in the NetherlandsData sharing in the Netherlands
Data sharing in the Netherlands
 
Business cases and costs RDN
Business cases and costs RDNBusiness cases and costs RDN
Business cases and costs RDN
 
From Box to Hydra via Archivematica
From Box to Hydra via ArchivematicaFrom Box to Hydra via Archivematica
From Box to Hydra via Archivematica
 
Research data spring - Jisc Digital Festival 2015
Research data spring - Jisc Digital Festival 2015Research data spring - Jisc Digital Festival 2015
Research data spring - Jisc Digital Festival 2015
 
Business case and cost modelling for an end-to-end RDM service
Business case and cost modelling for an end-to-end RDM serviceBusiness case and cost modelling for an end-to-end RDM service
Business case and cost modelling for an end-to-end RDM service
 
Jisc Research Data Discovery Service Project
Jisc Research Data Discovery Service ProjectJisc Research Data Discovery Service Project
Jisc Research Data Discovery Service Project
 
Journal research data policy update
Journal research data policy updateJournal research data policy update
Journal research data policy update
 
Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016
 
Jisc Research Data Shared Service - Spring Update
Jisc Research Data Shared Service - Spring UpdateJisc Research Data Shared Service - Spring Update
Jisc Research Data Shared Service - Spring Update
 

Destaque

Cendris Data Quality Management
Cendris Data Quality ManagementCendris Data Quality Management
Cendris Data Quality ManagementJan Hendrik Fleury
 
Open Source Power Quality Data Visualization
Open Source Power Quality Data VisualizationOpen Source Power Quality Data Visualization
Open Source Power Quality Data VisualizationGrid Protection Alliance
 
Open government data and the psi directive et
Open government data and the psi directive etOpen government data and the psi directive et
Open government data and the psi directive etOpen Data Support
 
The PSI Directive and Open Government Data
The PSI Directive and Open Government DataThe PSI Directive and Open Government Data
The PSI Directive and Open Government DataOpen Data Support
 
Quality Metrics for Linked Open Data
Quality Metrics for  Linked Open Data Quality Metrics for  Linked Open Data
Quality Metrics for Linked Open Data ebrahim_bagheri
 
Eduvision - Big data voor de Overheid
Eduvision - Big data voor de OverheidEduvision - Big data voor de Overheid
Eduvision - Big data voor de OverheidEduvision Opleidingen
 
Smart City - Pilot in Amsterdam
Smart City - Pilot in Amsterdam Smart City - Pilot in Amsterdam
Smart City - Pilot in Amsterdam KPN IoT
 
Jovana Pistek and Christian van der Kooi - Open government data workshop - BO...
Jovana Pistek and Christian van der Kooi - Open government data workshop - BO...Jovana Pistek and Christian van der Kooi - Open government data workshop - BO...
Jovana Pistek and Christian van der Kooi - Open government data workshop - BO...BOBCATSSS 2017
 
An introduction to open data
An introduction to open dataAn introduction to open data
An introduction to open dataSally Lait
 

Destaque (10)

Cendris Data Quality Management
Cendris Data Quality ManagementCendris Data Quality Management
Cendris Data Quality Management
 
Open Source Power Quality Data Visualization
Open Source Power Quality Data VisualizationOpen Source Power Quality Data Visualization
Open Source Power Quality Data Visualization
 
Open government data and the psi directive et
Open government data and the psi directive etOpen government data and the psi directive et
Open government data and the psi directive et
 
The PSI Directive and Open Government Data
The PSI Directive and Open Government DataThe PSI Directive and Open Government Data
The PSI Directive and Open Government Data
 
Quality Metrics for Linked Open Data
Quality Metrics for  Linked Open Data Quality Metrics for  Linked Open Data
Quality Metrics for Linked Open Data
 
Eduvision - Big data voor de Overheid
Eduvision - Big data voor de OverheidEduvision - Big data voor de Overheid
Eduvision - Big data voor de Overheid
 
Smart City - Pilot in Amsterdam
Smart City - Pilot in Amsterdam Smart City - Pilot in Amsterdam
Smart City - Pilot in Amsterdam
 
Jovana Pistek and Christian van der Kooi - Open government data workshop - BO...
Jovana Pistek and Christian van der Kooi - Open government data workshop - BO...Jovana Pistek and Christian van der Kooi - Open government data workshop - BO...
Jovana Pistek and Christian van der Kooi - Open government data workshop - BO...
 
Open data quality
Open data qualityOpen data quality
Open data quality
 
An introduction to open data
An introduction to open dataAn introduction to open data
An introduction to open data
 

Semelhante a Presentation ADEQUATe Project: Workshop on Quality Assessment and Improvements in Open Data (Catalogues)

Semantically enhanced quality assurance in the jurion business use case
Semantically enhanced quality assurance in the jurion  business use caseSemantically enhanced quality assurance in the jurion  business use case
Semantically enhanced quality assurance in the jurion business use caseDimitris Kontokostas
 
ADEQUATe beta Launch
ADEQUATe beta LaunchADEQUATe beta Launch
ADEQUATe beta LaunchStadt Wien
 
Institutionalising open data quality - Processes Standards, Tools
Institutionalising open data quality - Processes Standards, ToolsInstitutionalising open data quality - Processes Standards, Tools
Institutionalising open data quality - Processes Standards, ToolsJohann Höchtl
 
ALIGNED Data Curation Methods and Tools
ALIGNED Data Curation Methods and ToolsALIGNED Data Curation Methods and Tools
ALIGNED Data Curation Methods and ToolsAlignedProject
 
Improving Business Performance Through Big Data Benchmarking, Todor Ivanov, B...
Improving Business Performance Through Big Data Benchmarking, Todor Ivanov, B...Improving Business Performance Through Big Data Benchmarking, Todor Ivanov, B...
Improving Business Performance Through Big Data Benchmarking, Todor Ivanov, B...DataBench
 
Hitting the Road towards a Greater Digital Destination: Evaluating and Testin...
Hitting the Road towards a Greater Digital Destination: Evaluating and Testin...Hitting the Road towards a Greater Digital Destination: Evaluating and Testin...
Hitting the Road towards a Greater Digital Destination: Evaluating and Testin...Rachel Vacek
 
How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018ARDC
 
When Data Visualizations and Data Imports Just Don’t Work
When Data Visualizations and Data Imports Just Don’t WorkWhen Data Visualizations and Data Imports Just Don’t Work
When Data Visualizations and Data Imports Just Don’t WorkJim Kaplan CIA CFE
 
Data Ops at TripActions
Data Ops at TripActionsData Ops at TripActions
Data Ops at TripActionsRob Winters
 
Team Data Science Process Presentation (TDSP), Aug 29, 2017
Team Data Science Process Presentation (TDSP), Aug 29, 2017Team Data Science Process Presentation (TDSP), Aug 29, 2017
Team Data Science Process Presentation (TDSP), Aug 29, 2017Debraj GuhaThakurta
 
What types of data sources does Tableau support.docx
What types of data sources does Tableau support.docxWhat types of data sources does Tableau support.docx
What types of data sources does Tableau support.docxTechnogeeks
 
AnalytiX DS - Master Deck
AnalytiX DS - Master DeckAnalytiX DS - Master Deck
AnalytiX DS - Master DeckAnalytiX DS
 
Power BI vs Tableau vs Cognos: A Data Analytics Research
Power BI vs Tableau vs Cognos: A Data Analytics ResearchPower BI vs Tableau vs Cognos: A Data Analytics Research
Power BI vs Tableau vs Cognos: A Data Analytics ResearchLuciano Vilas Boas
 
Easy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance ProfessionalEasy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance ProfessionalMartin Kaltenböck
 
Emerging PM Tools Webinar
Emerging PM Tools WebinarEmerging PM Tools Webinar
Emerging PM Tools WebinarLivio Paradiso
 
Enhancing educational data quality in heterogeneous learning contexts using p...
Enhancing educational data quality in heterogeneous learning contexts using p...Enhancing educational data quality in heterogeneous learning contexts using p...
Enhancing educational data quality in heterogeneous learning contexts using p...Alex Rayón Jerez
 
Big Data projects.pdf
Big Data projects.pdfBig Data projects.pdf
Big Data projects.pdfssuserf0a206
 
Koneksys Presentation March 2021
Koneksys Presentation March 2021Koneksys Presentation March 2021
Koneksys Presentation March 2021Axel Reichwein
 

Semelhante a Presentation ADEQUATe Project: Workshop on Quality Assessment and Improvements in Open Data (Catalogues) (20)

Semantically enhanced quality assurance in the jurion business use case
Semantically enhanced quality assurance in the jurion  business use caseSemantically enhanced quality assurance in the jurion  business use case
Semantically enhanced quality assurance in the jurion business use case
 
Sebastian Hellmann
Sebastian HellmannSebastian Hellmann
Sebastian Hellmann
 
ADEQUATe beta Launch
ADEQUATe beta LaunchADEQUATe beta Launch
ADEQUATe beta Launch
 
Institutionalising open data quality - Processes Standards, Tools
Institutionalising open data quality - Processes Standards, ToolsInstitutionalising open data quality - Processes Standards, Tools
Institutionalising open data quality - Processes Standards, Tools
 
ALIGNED Data Curation Methods and Tools
ALIGNED Data Curation Methods and ToolsALIGNED Data Curation Methods and Tools
ALIGNED Data Curation Methods and Tools
 
Improving Business Performance Through Big Data Benchmarking, Todor Ivanov, B...
Improving Business Performance Through Big Data Benchmarking, Todor Ivanov, B...Improving Business Performance Through Big Data Benchmarking, Todor Ivanov, B...
Improving Business Performance Through Big Data Benchmarking, Todor Ivanov, B...
 
Hitting the Road towards a Greater Digital Destination: Evaluating and Testin...
Hitting the Road towards a Greater Digital Destination: Evaluating and Testin...Hitting the Road towards a Greater Digital Destination: Evaluating and Testin...
Hitting the Road towards a Greater Digital Destination: Evaluating and Testin...
 
How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018
 
When Data Visualizations and Data Imports Just Don’t Work
When Data Visualizations and Data Imports Just Don’t WorkWhen Data Visualizations and Data Imports Just Don’t Work
When Data Visualizations and Data Imports Just Don’t Work
 
Data Ops at TripActions
Data Ops at TripActionsData Ops at TripActions
Data Ops at TripActions
 
Team Data Science Process Presentation (TDSP), Aug 29, 2017
Team Data Science Process Presentation (TDSP), Aug 29, 2017Team Data Science Process Presentation (TDSP), Aug 29, 2017
Team Data Science Process Presentation (TDSP), Aug 29, 2017
 
What types of data sources does Tableau support.docx
What types of data sources does Tableau support.docxWhat types of data sources does Tableau support.docx
What types of data sources does Tableau support.docx
 
AnalytiX DS - Master Deck
AnalytiX DS - Master DeckAnalytiX DS - Master Deck
AnalytiX DS - Master Deck
 
Power BI vs Tableau vs Cognos: A Data Analytics Research
Power BI vs Tableau vs Cognos: A Data Analytics ResearchPower BI vs Tableau vs Cognos: A Data Analytics Research
Power BI vs Tableau vs Cognos: A Data Analytics Research
 
Easy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance ProfessionalEasy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance Professional
 
Emerging PM Tools Webinar
Emerging PM Tools WebinarEmerging PM Tools Webinar
Emerging PM Tools Webinar
 
Enhancing educational data quality in heterogeneous learning contexts using p...
Enhancing educational data quality in heterogeneous learning contexts using p...Enhancing educational data quality in heterogeneous learning contexts using p...
Enhancing educational data quality in heterogeneous learning contexts using p...
 
Big Data projects.pdf
Big Data projects.pdfBig Data projects.pdf
Big Data projects.pdf
 
Koneksys Presentation March 2021
Koneksys Presentation March 2021Koneksys Presentation March 2021
Koneksys Presentation March 2021
 
Research Paper
Research PaperResearch Paper
Research Paper
 

Presentation ADEQUATe Project: Workshop on Quality Assessment and Improvements in Open Data (Catalogues)

  • 1. www.adequate.at
      Workshop on Quality Assessment and Improvements on Open Data (Portals)
      opendata.ch conference, 14.6.2016, 12:45 - 14:00 CEST
      Lausanne, Casino de Montbenon, Allée Ernest-Ansermet 3
      Slides published CC-BY AT 3.0
      Jürgen Umbrich, Vienna University of Economics and Business, juergen.umbrich@wu.ac.at
      Johann Höchtl, Donau-Universität Krems, johann.hoechtl@donau-uni.ac.at
      Martin Kaltenböck, Semantic Web Company, m.kaltenboeck@semantic-web.at
  • 2. Agenda (Time | Session | Remarks)
      ● 20’ incl. Q&A | Welcome & Introduction | Martin Kaltenböck (SWC)
          ○ WS Objectives, Agenda & WS Team
          ○ Participants
          ○ The ADEQUATe project: basics, objectives, status & outlook
      ● 20’ incl. Q&A | Results of Requirements Elicitation, DQ Metrics and Interaction Items | Johann Höchtl (DUK)
          ○ What do the users want?
          ○ Which metrics are the most “important” ones? Which metrics specifically target openness?
          ○ Why do data portals need quality interaction items with end users, and what do we plan to do in ADEQUATe?
      ● 20’ incl. Q&A | Best Practice & the ADEQUATe OD Framework | Jürgen Umbrich (WU)
          ○ Data & CSV on the Web working group recommendations (W3C)
          ○ AD Framework: architecture & components
      ● 15’ open discussion | Interactive & open discussion on DQ issues | Moderated by the WS Team
          ○ Requirements for DQ in Open Data
          ○ What is in place or planned for DQ
  • 3. FFG Project, http://www.adequate.at
      The ADEQUATe project is funded by the Austrian Federal Ministry for Transport, Innovation and Technology (Bundesministerium für Verkehr, Innovation und Technologie) within the RTI programme “IKT der Zukunft” (ICT of the Future) and is administered by the Austrian Research Promotion Agency (Österreichische Forschungsförderungsgesellschaft, FFG) [project number: 849982].
  • 4. What is ADEQUATe?
      ADEQUATe Open Data (Analytics & Data Enrichment to improve the QUAliTy of Open Data) builds on two observations: an increasing amount of Open Data is becoming available as an important resource for emerging businesses, and the integration of such open, freely re-usable data sources into organisations’ data warehouse and data management systems is seen as a key success factor for competitive advantage in a data-driven economy.
      The project identifies crucial issues that have to be tackled to fully exploit the value of open data and to integrate it efficiently with other data sources:
      ● the overall quality issues with metadata and the data itself
      ● the lack of interoperability between data sources
      The project's approach is to address these points at an early stage, when the open data is freshly provided by governmental organisations or others.
  • 6. What is ADEQUATe?
      ✓ 3 Partners:
          1. Semantic Web Company
          2. Danube University Krems
          3. University of Economics Vienna
      ✓ 30 months project duration, Oct. 2015 - March 2018
      ✓ 2 Use Case Partners: data.gv.at & opendataportal.at
      ✓ Objective: Improvement of Data Quality through:
          ○ Quality Assessment and Monitoring
          ○ Automatic Algorithms
          ○ Making use of Linked Data principles
          ○ Improvements of the data by the user (community)
  • 7. Project Structure & Schedule
      ● WP1 - Requirements & Specification
      ● WP2 - Quality Improvement & Monitoring Framework
      ● WP3 - Algorithms & Tools for Quality Improvements
      ● WP4 - Data Linkage
      ● WP5 - Community driven Quality Improvements
      ● WP6 - Use Case Integration
      ● WP7 - Project Management & Dissemination
  • 8. Outlook & Timing of Results
      ● M9 (06/2016): Quality metrics, Requirements
      ● M10 (07/2016): Architecture Blueprint
      ● M15 (12/2016): Quality monitoring framework, Data linkage
      ● M21 (06/2017): Quality improvements, Use case connection
      ● M30 (03/2018): Evaluation, Refinements, Improvements
  • 9. Concrete Outputs & Outlook
      ✓ End of June 2016: 3 Deliverables
          ○ State of the Art
          ○ Requirements Elicitation
          ○ Quality Metrics
      ✓ End of July 2016: 1 Deliverable
          ○ Architecture Blueprint
          ○ All components specified
      ✓ End of 2016: ADEQUATe Framework - 1st release
          ○ Assessment & Monitoring Framework
          ○ Data Quality Algorithms & Tools
          ○ Linked Data Mechanisms
          ○ 1st set of user driven Mechanisms
      ✓ Early 2017: Dock onto ODP & data.gv.at
  • 12. Results of Requirements Elicitation: Contents and Formats
      ○ I would really prefer to have the data themselves consistent. [...] metadata does not match; standards regarding the representation of their content
      ○ It would be really great if we could shift somehow to UTF-8
      ○ meta data for CSV files were incomplete [...] header for CSV was missing
      ○ no static identifiers for objects in data sets. This in turn leads to problems if you want to track changes related to these objects over time
  • 13. Results of Requirements Elicitation: Communication and Reliability
      Communication
      ○ central communication point for exchanging experiences and issues
      ○ Meta data should be written in English language
      Reliability
      ○ Servers are restarted every day [...] hosted data becomes unavailable
  • 14. DQ metrics (1)
      Completeness
      ● Metadata completeness: How many (mandatory) metadata keys have values?
      ● Table completeness: How many (CSV) cells have non-null values?
      Timeliness
      ● Tau of Data: How “outdated” are datasets, based on the promised update frequency?
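The two completeness metrics above can be sketched in a few lines of Python. This is only an illustration: the key names, the sample data and the decision to treat empty strings as null are assumptions, not definitions taken from the ADEQUATe project.

```python
import csv
import io

def metadata_completeness(metadata, mandatory_keys):
    """Share of mandatory keys that carry a non-empty value."""
    if not mandatory_keys:
        return 1.0
    filled = sum(1 for key in mandatory_keys
                 if str(metadata.get(key) or "").strip())
    return filled / len(mandatory_keys)

def table_completeness(csv_text):
    """Share of data cells (header row excluded) that are non-empty."""
    rows = list(csv.reader(io.StringIO(csv_text)))[1:]
    cells = [cell for row in rows for cell in row]
    if not cells:
        return 1.0
    return sum(1 for cell in cells if cell.strip()) / len(cells)

# Hypothetical sample inputs, made up for this sketch
sample_metadata = {"title": "Exhibitions", "license": "", "publisher": "mumok"}
sample_csv = "exhibition_id,city\n1,Wien\n2,\n"

meta_score = metadata_completeness(
    sample_metadata, ["title", "license", "publisher", "modified"])
table_score = table_completeness(sample_csv)
```

Both functions return a value between 0 and 1, which makes scores comparable across datasets of different size.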
  • 15. DQ metrics (2)
      Machine readability
      ● Regularity of CSV files (CSV-Lint), RDF, ...
      ● Structural consistency - variations in the structure of CSV files
      Openness
      ● Open formats - there is no well-defined definition of what constitutes an open format
      ● Open licenses - opendefinitions.org seems to have them all covered
      Persistence
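One simple way to look at structural consistency is to count how many columns each row has. The sketch below is a hypothetical check written for illustration; it is not how CSV-Lint works internally, and the sample data is made up.

```python
import csv
import io
from collections import Counter

def column_count_profile(csv_text, delimiter=","):
    """Count how many non-blank rows have each number of columns."""
    reader = csv.reader(io.StringIO(csv_text), delimiter=delimiter)
    return Counter(len(row) for row in reader if row)

def is_structurally_regular(csv_text, delimiter=","):
    """True if every non-blank row has the same number of columns."""
    return len(column_count_profile(csv_text, delimiter)) <= 1

# Hypothetical samples: one regular file, one with ragged rows
regular = "id,city\n1,Wien\n2,Krems\n"
ragged = "id,city\n1,Wien,extra\n2\n"
```

The profile itself can be reported to the publisher, since it shows which row widths occur and how often.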
  • 16. DQ Metrics - Persistence?
  • 18. Contributors to DQ improvement: Publishers, Community, Algorithms & Linked Data
  • 19. Contributors to DQ Improvements (1/2)
      ● Providers
          ○ Correctness and completeness of data and metadata
          ○ SLAs governing availability
          ○ Readiness for feedback, discussion and interaction
      ● Algorithms
          ○ Automated improvements
              ■ Availability checks and reporting
              ■ Missing information, outliers
              ■ Check of format (valid UTF-8?), size
              ■ Data format conversions: CSV → CSV on the Web specification
          ○ Semi-automated improvements and enhancements
              ■ Identification of related data sets
              ■ Mapping of (data) attributes, ...
      ● Interaction with the Data Community
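The automated format check mentioned above ("valid UTF-8?") can be as small as an attempted strict decode. The following is a minimal sketch with made-up sample bytes; a real pipeline would more likely read the file in chunks.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the byte string decodes as strict UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False

# The same text in two encodings: only the first is valid UTF-8
utf8_bytes = "Österreich".encode("utf-8")
latin1_bytes = "Österreich".encode("latin-1")
```

A file that fails this check was most likely exported in a legacy encoding such as Latin-1 or Windows-1252 and needs conversion before publishing.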
  • 20. Interaction: Data Community
      ● Control the results of automated enhancements
          ○ Interlinking
          ○ Format conversions
          ○ Encodings
      ● Correct mistakes and report mistakes
      ● Data enrichment and transformations
  • 22. Interaction: Forking: Identify - Improve - Share [figure]
  • 25. The ADEQUATe OD Framework & publishing CSVs for humans and machines
  • 26. The ADEQUATe Framework
      ● The ADEQUATe framework offers:
          ○ quality assessment and monitoring
          ○ a set of data quality improvement algorithms
          ○ a set of algorithms to create and maintain a knowledge graph and to “link” data into this graph
              ■ Think about shared identifiers for addresses, companies, departments, parties, ...
          ○ community involvement (e.g., data editors, feedback loops, forking & merging)
      ● Main objectives:
          ○ all developed components will be Open Source (see the ADEQUATe Github Repo)
          ○ components should be usable as standalone components
              ■ Use only what you need
  • 27. The ADEQUATe Framework
      ● Core Components
          1. Data monitoring
          2. Knowledge Vault
          3. Quality Assessment
          4. Quality Improvement
          5. Data Linkage
          6. Community Improvement
          7. UI, API & User authentication
      [Architecture diagram. Labels: Users, Clients; Authentication / Load Balancing / UI; Public API; Orchestration / API; (Meta)Data Monitor; Knowledge Vault; Quality Assessment; Quality Improvement; Linkage; Community Improvement; catalog, data.gv.at, ODP. Legend: RESTful API, Component, Data]
  • 28. W3C CSV on the Web & ADEQUATe
      One core feature of ADEQUATe will be the use of the CSV on the Web metadata standard, which makes it possible to:
      ➢ describe CSV files
          ○ dialect & encoding used
          ○ table & column descriptions (with language tags)
          ○ data types and value ranges for columns
      ➢ add semantics to them
          ○ primary & foreign keys, URIs, entity types, ...
      ➢ validate CSV files against a predefined schema
      ➢ specify the transformation
          ○ CSV -> JSON or RDF
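To make "validate CSV files against a predefined schema" more concrete, here is a simplified Python sketch. It is not a conforming CSV on the Web validator: it only understands the `integer` and `string` datatypes and the `required` flag, and it matches CSV headers against the column `name` rather than `titles`, which is a simplification chosen for this example. The sample data is made up.

```python
import csv
import io

def _is_integer(value):
    """True if the cell text parses as a Python int."""
    try:
        int(value)
        return True
    except ValueError:
        return False

# Assumption: only these two datatype names are handled in this sketch
DATATYPE_CHECKS = {"integer": _is_integer, "string": lambda value: True}

def validate_rows(csv_text, table_schema):
    """Return a list of (row_number, column_name, problem) tuples."""
    problems = []
    reader = csv.DictReader(io.StringIO(csv_text))
    for row_number, row in enumerate(reader, start=2):  # row 1 is the header
        for column in table_schema["columns"]:
            name = column["name"]
            value = (row.get(name) or "").strip()
            if not value:
                if column.get("required", False):
                    problems.append((row_number, name, "missing required value"))
                continue
            check = DATATYPE_CHECKS.get(column.get("datatype", "string"))
            if check is not None and not check(value):
                problems.append(
                    (row_number, name, "not a valid " + column["datatype"]))
    return problems

schema = {"columns": [
    {"name": "exhibition_id", "datatype": "integer", "required": True},
    {"name": "city", "datatype": "string"},
]}
sample = "exhibition_id,city\n1,Wien\nabc,Krems\n,Linz\n"
problems = validate_rows(sample, schema)
```

Returning a list of problems instead of raising on the first error lets a portal show publishers every issue in one report.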
  • 29. W3C CSV on the Web: Metadata standard [figure]
  • 30. W3C CSV on the Web: Metadata standard [figure]
  • 31. W3C CSV on the Web: Metadata standard [figure]
  • 32. W3C CSV on the Web: Example (JSON-LD) 1/3
      {
        "@context": ["http://www.w3.org/ns/csvw", {"@language": "en"}],
        "url": "http://data.mumok.at/exhibition.csv",
        "dc:title": "Exhibitions for objects from the mumok collection",
        "dcat:keyword": ["art", "museum", "exhibition"],
        "dc:publisher": {
          "schema:name": "mumok - museum moderner kunst stiftung ludwig wien",
          "schema:url": {"@id": "http://www.mumok.at"}
        },
        "dc:license": {"@id": "https://creativecommons.org/licenses/by/3.0/at/legalcode"},
        "dc:modified": {"@value": "2015-07-04", "@type": "xsd:date"},
        …
  • 33. W3C CSV on the Web: Example (JSON-LD) 2/3
        "dialect": {
          "encoding": "utf-8",
          "lineTerminators": ["\r\n", "\n"],
          "quoteChar": "\"",
          "doubleQuote": true,
          "skipRows": 0,
          "commentPrefix": "#",
          "header": true,
          "headerRowCount": 1,
          "delimiter": ",",
          "skipColumns": 0,
          "skipBlankRows": false,
          "skipInitialSpace": false,
          "trim": false
        },
  • 34. W3C CSV on the Web: Example (JSON-LD) 3/3
        "tableSchema": {
          "columns": [{
            "name": "exhibition_id",
            "titles": "Exhibition Identifier",
            "dc:description": "A unique identifier for the exhibition.",
            "datatype": "integer",
            "required": true
          }, {
            "name": "city",
            "titles": "City",
            "dc:description": "The city in which the exhibition took place (no language defined, mostly in German).",
            "datatype": "string"
          }
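The three fragments above belong to one metadata document. As a rough sketch, a reduced version of it can be built as a Python dictionary and serialised with the standard `json` module, which takes care of escaping the line terminators and the quote character. Only a subset of the properties from the slides is included here, and the parts elided on the slides are left out rather than guessed.

```python
import json

# Reduced version of the slide example; elided properties are not filled in
metadata = {
    "@context": ["http://www.w3.org/ns/csvw", {"@language": "en"}],
    "url": "http://data.mumok.at/exhibition.csv",
    "dc:title": "Exhibitions for objects from the mumok collection",
    "dialect": {
        "encoding": "utf-8",
        "lineTerminators": ["\r\n", "\n"],
        "quoteChar": "\"",
        "header": True,
        "delimiter": ",",
    },
    "tableSchema": {
        "columns": [
            {"name": "exhibition_id", "titles": "Exhibition Identifier",
             "datatype": "integer", "required": True},
            {"name": "city", "titles": "City", "datatype": "string"},
        ]
    },
}

# Serialise to JSON text and parse it back to confirm nothing is lost
serialized = json.dumps(metadata, indent=2, ensure_ascii=False)
round_tripped = json.loads(serialized)
```

Writing `serialized` to a file named after the CSV with the suffix `-metadata.json` would match the file naming shown in the discovery slide.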
  • 35. W3C CSV on the Web: Discovery
      ● Registered content type: application/csvm+json
      ● 3 discovery mechanisms
          ○ File extension
              ■ http://data.mumok.at/exhibition.csv -> http://data.mumok.at/exhibition.csv-metadata.json
          ○ Well-known location
              ■ /.well-known/csvm
          ○ Link HTTP header
      » curl -I http://data.mumok.at/exhibition.csv
      HTTP/1.1 200 OK
      Date: Thu, 26 Nov 2015 22:18:47 GMT
      Server: Apache/2.2.22 (Debian)
      …
      Content-Length: 112723
      Content-Type: text/csv; charset=utf-8; header=present
      Link: </exhibition.csv-metadata.json>;rel=describedBy; type=application/csvm+json
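The three discovery mechanisms can be sketched as small URL helpers. This is an illustration only: the function names are hypothetical, no network request is made, and the Link header parser handles just a single simple link value like the one in the curl output, not the full header grammar.

```python
import re
from urllib.parse import urljoin

def metadata_url_by_suffix(csv_url):
    """File-extension mechanism: append -metadata.json to the CSV URL."""
    return csv_url + "-metadata.json"

def well_known_url(csv_url):
    """Well-known location on the same host as the CSV file."""
    return urljoin(csv_url, "/.well-known/csvm")

def metadata_url_from_link_header(csv_url, link_header):
    """Return the absolute describedBy target of a Link header, or None."""
    match = re.match(r"\s*<([^>]*)>(.*)", link_header)
    if not match:
        return None
    target, params = match.groups()
    rel = re.search(r';\s*rel\s*=\s*"?([^";]+)"?', params, re.IGNORECASE)
    if rel and rel.group(1).strip().lower() == "describedby":
        return urljoin(csv_url, target)
    return None

csv_url = "http://data.mumok.at/exhibition.csv"
link = "</exhibition.csv-metadata.json>;rel=describedBy; type=application/csvm+json"
```

A client would typically try the Link header first, because it is the only mechanism where the server states the metadata location explicitly.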
  • 36. CSV on the Web Summary
      ● Don’t publish CSV on the Web only for humans (e.g., EXCEL exports); publish it for machines as well
      ● Follow RFC 4180
      ● Encoding
          ○ Use UTF-8, don’t mix encodings
      ● File extension: .csv
      ● Content-type: text/csv
      Optional, but a big improvement!
      ● Ideally, publish CSV metadata along with your CSV file
      ● Avoid acronyms or coded values (e.g., sex=1,2,3)
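Several of these recommendations can be followed with Python's standard `csv` module. The sketch below writes comma-separated rows with CRLF line endings and doubled quotes, in the spirit of RFC 4180, and encodes the result as UTF-8. The function name and the sample rows are made up for illustration.

```python
import csv
import io

def to_csv_bytes(header, rows):
    """Serialise rows as CSV text with CRLF line ends, encoded as UTF-8."""
    buffer = io.StringIO(newline="")
    writer = csv.writer(buffer, lineterminator="\r\n",
                        quoting=csv.QUOTE_MINIMAL)
    writer.writerow(header)
    writer.writerows(rows)
    return buffer.getvalue().encode("utf-8")

# Hypothetical rows; the second city contains a comma and quotes
data = to_csv_bytes(
    ["exhibition_id", "city"],
    [[1, "Wien"], [2, 'St. Pölten, "NÖ"']],
)
```

Fields containing the delimiter or quotes are quoted automatically, so consumers do not have to guess where a cell ends.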
  • 37. CSV on the Web Summary
      ● CSV URLs
      ● CSVs link to other CSVs
      ● CSVs link to other resources
      ● RDF and JSON conversion
      REFERENCES
      ● CSV on the Web Working Group
      ● CSV on the Web Community Group
      ● CSV on the Web Github Repository
      ● Tabular Data on the Web - A Introduction to CSV on the Web (Slides)
      ● Implementing CSV on the Web (Gregg Kellogg)
  • 38. Announcements & Pointers
      ● @adequate_od
      ● 17-19 May 2017, Danube University Krems
      ● 30.8.-02.09.2016, Helsinki
  • 39. Contact
      ● Jürgen Umbrich, Vienna University of Economics and Business, juergen.umbrich@wu.ac.at, Short CV: https://www.wu.ac.at/en/infobiz/team/umbrich/
      ● Johann Höchtl, Donau-Universität Krems, johann.hoechtl@donau-uni.ac.at, Short CV: https://at.linkedin.com/in/johannhoechtl
      ● Martin Kaltenböck, Semantic Web Company, m.kaltenboeck@semantic-web.at, Short CV: https://www.linkedin.com/in/martinkaltenboeck
      http://adequate.at/
      http://vienna.theodi.org