SlideShare uma empresa Scribd logo
1 de 44
Baixar para ler offline
Towards FAIRer Biological Knowledge Networks 

Using a Hybrid Linked Data 

and Graph Database approach
Harpenden, 5/6/2018



Marco Brandizi <marco.brandizi@rothamsted.ac.uk>
Find these slides on SlideShare
KnetMiner-inspired Artwork

by Hugo Dalton (hugodalton.com)
Can we do More with KnetMiner Data?
(and better)
Behind the Scenes
• Starting point: graph data model
• With concepts, relations between concepts hierarchies of concept classes and relation
types
• => There are standardised ways for it
• Make app development easier
• independent components on top of a unified data model
• clear separation between data access and apps
• Serve third-party applications, making their data access no different than ours
• Simplify the way we ingest data,
• ease conversions from multiple formats into unified model
• relax the high-memory requirements need (e.g., backing data store)
• prepare for scalability (e.g., cloud stores, big data stores)
Putting it on a Bigger Picture
The Semantic Web Way
• It’s for networked knowledge (semantic networks)
• Focuses on sharing via web technologies and
principles (eg, share resolvable URIs)
• Rich ‘schema’ language, already much used in life
sciences (i.e., ontologies, coming from frames and 1st-
order logics)
• protocol + a standard query language (SPARQL)
Modelling data with OWL:

Promises & Wishes
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
• On the messy web ocean?!
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
• On the messy web ocean?!
• What about performance?!
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
• On the messy web ocean?!
• What about performance?!
• Very formal semantics is not very easy:
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
• On the messy web ocean?!
• What about performance?!
• Very formal semantics is not very easy:
• SomeValuesFrom Restriction?!
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
• On the messy web ocean?!
• What about performance?!
• Very formal semantics is not very easy:
• SomeValuesFrom Restriction?!
• My blood sample derives from some Skolem human, not from
NCBI:HomoSapiens?
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
• On the messy web ocean?!
• What about performance?!
• Very formal semantics is not very easy:
• SomeValuesFrom Restriction?!
• My blood sample derives from some Skolem human, not from
NCBI:HomoSapiens?
• Ontologies defined for the whole world and then harmoniously and lovely shared
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
• On the messy web ocean?!
• What about performance?!
• Very formal semantics is not very easy:
• SomeValuesFrom Restriction?!
• My blood sample derives from some Skolem human, not from
NCBI:HomoSapiens?
• Ontologies defined for the whole world and then harmoniously and lovely shared
• Fairly reasonable: all those vocabularies are headaches, I don’t have expertise
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
• On the messy web ocean?!
• What about performance?!
• Very formal semantics is not very easy:
• SomeValuesFrom Restriction?!
• My blood sample derives from some Skolem human, not from
NCBI:HomoSapiens?
• Ontologies defined for the whole world and then harmoniously and lovely shared
• Fairly reasonable: all those vocabularies are headaches, I don’t have expertise
• Reasonable: I have a different point of view
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
• On the messy web ocean?!
• What about performance?!
• Very formal semantics is not very easy:
• SomeValuesFrom Restriction?!
• My blood sample derives from some Skolem human, not from
NCBI:HomoSapiens?
• Ontologies defined for the whole world and then harmoniously and lovely shared
• Fairly reasonable: all those vocabularies are headaches, I don’t have expertise
• Reasonable: I have a different point of view
• Not always reasonable: possibly, yours is complicated/wrong/stupid/idiotic/
worse
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
• On the messy web ocean?!
• What about performance?!
• Very formal semantics is not very easy:
• SomeValuesFrom Restriction?!
• My blood sample derives from some Skolem human, not from
NCBI:HomoSapiens?
• Ontologies defined for the whole world and then harmoniously and lovely shared
• Fairly reasonable: all those vocabularies are headaches, I don’t have expertise
• Reasonable: I have a different point of view
• Not always reasonable: possibly, yours is complicated/wrong/stupid/idiotic/
worse
• Likely not reasonable: Not Invented Here
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
• On the messy web ocean?!
• What about performance?!
• Very formal semantics is not very easy:
• SomeValuesFrom Restriction?!
• My blood sample derives from some Skolem human, not from
NCBI:HomoSapiens?
• Ontologies defined for the whole world and then harmoniously and lovely shared
• Fairly reasonable: all those vocabularies are headaches, I don’t have expertise
• Reasonable: I have a different point of view
• Not always reasonable: possibly, yours is complicated/wrong/stupid/idiotic/
worse
• Likely not reasonable: Not Invented Here
• No comment: if I reinvent it, I can publish it
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
• On the messy web ocean?!
• What about performance?!
• Very formal semantics is not very easy:
• SomeValuesFrom Restriction?!
• My blood sample derives from some Skolem human, not from
NCBI:HomoSapiens?
• Ontologies defined for the whole world and then harmoniously and lovely shared
• Fairly reasonable: all those vocabularies are headaches, I don’t have expertise
• Reasonable: I have a different point of view
• Not always reasonable: possibly, yours is complicated/wrong/stupid/idiotic/
worse
• Likely not reasonable: Not Invented Here
• No comment: if I reinvent it, I can publish it
• Just joking (maybe…): Your ontology is good, but I’d rather stab you on your back
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
• On the messy web ocean?!
• What about performance?!
• Very formal semantics is not very easy:
• SomeValuesFrom Restriction?!
• My blood sample derives from some Skolem human, not from
NCBI:HomoSapiens?
• Ontologies defined for the whole world and then harmoniously and lovely shared
• Fairly reasonable: all those vocabularies are headaches, I don’t have expertise
• Reasonable: I have a different point of view
• Not always reasonable: possibly, yours is complicated/wrong/stupid/idiotic/
worse
• Likely not reasonable: Not Invented Here
• No comment: if I reinvent it, I can publish it
• Just joking (maybe…): Your ontology is good, but I’d rather stab you on your back
In fact, they did this
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
• On the messy web ocean?!
• What about performance?!
• Very formal semantics is not very easy:
• SomeValuesFrom Restriction?!
• My blood sample derives from some Skolem human, not from
NCBI:HomoSapiens?
• Ontologies defined for the whole world and then harmoniously and lovely shared
• Fairly reasonable: all those vocabularies are headaches, I don’t have expertise
• Reasonable: I have a different point of view
• Not always reasonable: possibly, yours is complicated/wrong/stupid/idiotic/
worse
• Likely not reasonable: Not Invented Here
• No comment: if I reinvent it, I can publish it
• Just joking (maybe…): Your ontology is good, but I’d rather stab you on your back
In fact, they did this
Modelling data with OWL:

Promises & Wishes
• Rich semantics, but also very formal, so that powerful automated reasoning is possible
• On the messy web ocean?!
• What about performance?!
• Very formal semantics is not very easy:
• SomeValuesFrom Restriction?!
• My blood sample derives from some Skolem human, not from
NCBI:HomoSapiens?
• Ontologies defined for the whole world and then harmoniously and lovely shared
• Fairly reasonable: all those vocabularies are headaches, I don’t have expertise
• Reasonable: I have a different point of view
• Not always reasonable: possibly, yours is complicated/wrong/stupid/idiotic/
worse
• Likely not reasonable: Not Invented Here
• No comment: if I reinvent it, I can publish it
• Just joking (maybe…): Your ontology is good, but I’d rather stab you on your back
In fact, they did this
Simplifying Views in BioKNO
obo:GO_0030015
a owl:Class ;
rdfs:label "CCR4-NOT core complex"^^xsd:string ;
rdfs:subClassOf obo:GO_0044424, obo:GO_0044424,
[
a owl:Restriction ;
owl:onProperty <http://purl.obolibrary.org/obo/BFO_0000050> ; # 'part of'
owl:someValuesFrom obo:GO_0030014 # CCR4-NOT complex
] ;
oboInOwl:id "GO:0030015"^^xsd:string ;
obo:IAO_0000115 "The core of the CCR4-NOT complex. In Saccharomyces the CCR4-NOT...";
oboInOwl:hasOBONamespace "cellular_component"^^xsd:string .
Simplifying Views in BioKNO
obo:GO_0030015
a owl:Class ;
rdfs:label "CCR4-NOT core complex"^^xsd:string ;
rdfs:subClassOf obo:GO_0044424, obo:GO_0044424,
[
a owl:Restriction ;
owl:onProperty <http://purl.obolibrary.org/obo/BFO_0000050> ; # 'part of'
owl:someValuesFrom obo:GO_0030014 # CCR4-NOT complex
] ;
oboInOwl:id "GO:0030015"^^xsd:string ;
obo:IAO_0000115 "The core of the CCR4-NOT complex. In Saccharomyces the CCR4-NOT...";
oboInOwl:hasOBONamespace "cellular_component"^^xsd:string .
obo:GO_0030014 a bk:GeneOntologyTerm ;
dc:identifier obo:GO_0030014_acc ;
bk:is_a obo:GO_0044424 , obo:GO_0043234 ;
bk:prefName "CCR4-NOT complex" .
obo:GO_0030015 a bk:GeneOntologyTerm;
bk:prefName "CCR4-NOT core complex";
bk:is_a obo:GO_0044424, obo:GO_0043234 ;
bk:part_of obo:GO_0030014;
dc:identifier obo:GO_0030015_acc.
obo:GO_0044424 a bk:GeneOntologyTerm;
bk:prefName "intracellular part" ;
• OWL is simplified mixing classes with SKOS-style
concepts
• More suitable for less formal, more simple
taxonomies
• OWL-2 punning makes it consistent
The BioKNO Ontology

(and The rest of the World)
BioKNO External Ontologies Mapping Type
bk:Concept skos:Concept Subclass
bk:Relation
bk:relFrom
bk:relTypeRef
bk:relTo
rdf:Statement

rdf:subject
rdf:predicate
rdf:object
Subclass
Subproperties
(ie, mapping to RDF reified
statements)
bk:Path, bk:Participant, bk:Interaction, bk:Transport,
bk:Protein, bk:Gene
Classes with same names in BioPAX and SIO Equivalent Class
bk:participates_in
bk:has_participant
Relation Ontology (RO) properties with same names

biopax:participant (as sub-property)
Equivalent property
bk:produces
bk:produced_by
bk:consumes
bk:consumed_by
biopax:product (as sub-property)
RO properties with same names
Equivalent property
bk:regulates
bk:positively_regulates
bk:negatively_regulates
RO properties with same names Equivalent property
bk:is_a
bk:part_of, bk:has_part
bk:occurs_in, bk:co_occurs_with
skos:broader
Basic Formal Ontology (BFO)/RO properties with same
names
Equivalent property
bk:Publication schema:CreativeWork Subclass
bka:abstract
bka:title (also known as AbstractHeader)
bka:authors
dcterms:description
dcterms:title
dc:creator
Sub-property
The BioKNO Ontology
Putting it on a Bigger Picture
Putting it on a Bigger Picture
Accessing RDF through SPARQL
Accessing RDF through SPARQL
Accessing RDF through SPARQL
CONSTRUCT {
?protIri bk:expressed_by ?sampleIri.
?degRelIri
a bk:Relation;
bka:PVALUE ?pValue;
bk:evidence bkev:EXP; # Inferred from experiment
bk:relFrom ?protIri; # Details defined by UniProt info
bk:relTo ?sampleIri; # Details defined by sample_degs_2.tsv
bk:relTypeRef bk:expressed_by.
}
WHERE {
# Some IDs and IRIs to be defined above
BIND ( LCASE ( REPLACE ( ?Sample, ' ', '_' ) ) AS ?sampleId )
BIND ( IRI ( CONCAT ( STR ( bkr: ), ?Gene_Symbol ) ) AS ?protIri )
BIND ( IRI ( CONCAT ( STR ( bkr: ), 'degex_', ?sampleId ) ) AS ?sampleIri )
BIND ( IRI ( CONCAT ( STR ( bkr: ), 'degex_', ?sampleId, '_', LCASE ( ?Gene_Symbol ) ) )

AS ?degRelIri )
BIND ( xsd:double ( ?p_value ) AS ?pValue )
}
Extraction, Loading, Transformation

SPARQL/TARQL Example
SPARQL/RDF for ELT
• RDF-to-RDF translation via CONSTRUCT (or SPARUL)
• TARQL: Using SPARQL to RDF-Convert Tabular CSV Files
• RDF/XML can be transformed via XSL
• We have done it for bio-specific ontology definitions in Ondex
• Programmatic conversions
• Using RDF frameworks, eg, Jena, RDF4J (former Sesame), rdflib for
Python
• See also java2rdf (https://github.com/EBIBioSamples/java2rdf)
• We have used it for the Ondex->RDF converter
SPARQL/RDF for ELT
• RDF-to-RDF translation via CONSTRUCT (or SPARUL)
• TARQL: Using SPARQL to RDF-Convert Tabular CSV Files
• RDF/XML can be transformed via XSL
• We have done it for bio-specific ontology definitions in Ondex
• Programmatic conversions
• Using RDF frameworks, eg, Jena, RDF4J (former Sesame), rdflib for
Python
• See also java2rdf (https://github.com/EBIBioSamples/java2rdf)
• We have used it for the Ondex->RDF converter
Issues
https://lod-cloud.net/
Issues
https://lod-cloud.net/
• Still not so popular (especially in more commercial contexts)
• It’s (perceived as) difficult (in particular, SPARQL)
• Bad reputation
• Performance can still be an issue
• eg, optimising SPARQL can be hard
• Specific issues
• eg, I need contextualised/attribute-attached properties
• and I don’t fancy reified relations…
Another Graph Database World

Property Graphs
Neo4j on top Of RDF
Application to Semantic Motif Search
The rdf2neo Tool
https://github.com/Rothamsted/rdf2neo
Triple Stores vs Prop Graphs
Neo4j, Cypher DBs, Graph DBs Semantic Web/Triple Stores
Data xchg format
- No official one, just Cypher, 

Support for GraphML, RDF

+/- Focus on backing applications

+ Focus on data sharing standards

Data model
+ Relations with properties

- Metadata/schemas/ontologies management
- Relations cannot have properties (reification
required)

+ Metadata/schemas/ontologies as first citizen
and standardised OWL
Performance + complex graph traversals + Comparable in most cases
Query Language
+ Cypher is easier (eg, compact, implicit elems)?

- Expressivity issues (unions)

- No standard QL (but efforts in progress, eg,
OpenCypher)
- SPARQL is Harder? (URIs, namespaces,
verbosity)

+ SPARQL More expressive
Standardisation,
openness
+/- (TinkerPop is open, Neo4j isn’t)

+ Commercial support

+ More alive and up-to date (e.g., support for
Hadoop, nice Neo4j browser, easy installation)
+ Natively open, many open implementations

- Instability and many short-lived prototypes

- Advancements seems to be slowing down

+ Some nice open and commercial browser
(LODEStar,
Scalability,

big data
+/- Commercial support to clustering/clouds for
Neo4j

+ Open support in TinkerPop
+ Load Balancing/Cluster solutions, Commercial
Cloud support (eg GraphDB)

+ SPARQL Over TinkerPop (via SAIL inteface)
Bridging to RDF: JSON-LD
…
"@id": "bkr:TOB1",
"@type": "bk:Protein",
"prefName": "TOB1 Human",
"dcterms:identifier": "TOB1",
"is_annotated_by": "obo:GO_0030014",
"participates_in": {
"@id": "http://www.wikipathways.org/id1",
"@type": "bk:Pathway",
"evidence": "bkev:IMPD",
"prefName":

“Bone Morphogenic Protein (BMP) Signalling and Regulation"
}
}
{
"@context": {
"bk": "http://www.ondex.org/bioknet/terms/",
"bka": "http://www.ondex.org/bioknet/terms/attributes/",
"bkds": "http://www.ondex.org/bioknet/terms/dataSources/",
"bkev": "http://www.ondex.org/bioknet/terms/evidences/",
"bkr": "http://www.ondex.org/bioknet/resources/",
"dcterms": "http://purl.org/dc/terms/",
"obo": "http://purl.obolibrary.org/obo/",
"xsd": "http://www.w3.org/2001/XMLSchema#",
"@vocab": "http://www.ondex.org/bioknet/terms/",
"dcterms:identifier": { "@type": "xsd:string" },
"evidence": { "@type": “@id" }
},
…
KnetMiner UI Overview
Search Select Explore
Addressing FAIR
• Findable (or, Semantic Web is still useful)
• SPARQL endpoint
• which powers URI Resolution
• Dataset-level metadata (e.g., VoID)
• Mapping to Standard Ontologies
• Interested in contributing to existing standards

(e.g., Bioschemas)
• API/JSON-Schema formalisation
• Accessible
• Multiple access means (SPARQL, URIs, JSON APIs, Cypher)
• Triple Stores and Property Graphs are complementary, not
alternative
• Data dumps
• Interoperable (or, Sem Web is still useful)
• Unified model encoded in at least one common syntax (RDF)
• URIs are reused
• Mappings to ontologies
• Reusable
• All of the above, plus multiple interfaces under unified model
• Support to common graph languages (e.g., Cytoscape.js)
• Converters (e.g., our RDF conversion scripts/tools, rdf2neo)
• Open Data licences
Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and Graph Database approach

Mais conteúdo relacionado

Semelhante a Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and Graph Database approach

Oak Hill's Sr Projects Res Step By Step PPT#3
Oak Hill's Sr Projects Res Step By Step PPT#3Oak Hill's Sr Projects Res Step By Step PPT#3
Oak Hill's Sr Projects Res Step By Step PPT#3Jeremy Young
 
Open Creativity Scoring Tutorial
Open Creativity Scoring TutorialOpen Creativity Scoring Tutorial
Open Creativity Scoring TutorialDenisDumas2
 
Using the ebsc ohost research databases revised 2014
Using the ebsc ohost research databases revised 2014Using the ebsc ohost research databases revised 2014
Using the ebsc ohost research databases revised 2014Oak Hall School
 
UXSG2014 Lightning Talks - Selfish accessibility (Adrian Roselli)
UXSG2014 Lightning Talks - Selfish accessibility (Adrian Roselli)UXSG2014 Lightning Talks - Selfish accessibility (Adrian Roselli)
UXSG2014 Lightning Talks - Selfish accessibility (Adrian Roselli)ux singapore
 
Selfish Accessibility: WordCamp Buffalo 2014
Selfish Accessibility: WordCamp Buffalo 2014Selfish Accessibility: WordCamp Buffalo 2014
Selfish Accessibility: WordCamp Buffalo 2014Adrian Roselli
 
Write A Better FM - Ohio Linux 2011
Write A Better FM - Ohio Linux 2011Write A Better FM - Ohio Linux 2011
Write A Better FM - Ohio Linux 2011Rich Bowen
 
Selfish Accessibility: WordCamp Toronto 2014
Selfish Accessibility: WordCamp Toronto 2014Selfish Accessibility: WordCamp Toronto 2014
Selfish Accessibility: WordCamp Toronto 2014Adrian Roselli
 
A Dickens of A Keynote
A Dickens of A KeynoteA Dickens of A Keynote
A Dickens of A KeynoteJames Gray
 
Lipstick on a Pig: Integrated Library Systems
Lipstick on a Pig: Integrated Library SystemsLipstick on a Pig: Integrated Library Systems
Lipstick on a Pig: Integrated Library SystemsDorothea Salo
 
Selfish Accessibility: a11y Camp Toronto 2014
Selfish Accessibility: a11y Camp Toronto 2014Selfish Accessibility: a11y Camp Toronto 2014
Selfish Accessibility: a11y Camp Toronto 2014Adrian Roselli
 
Information Architecture Heuristics
Information Architecture HeuristicsInformation Architecture Heuristics
Information Architecture HeuristicsAbby Covert
 
“Selfish Accessibility” for Create Upstate 2016
“Selfish Accessibility” for Create Upstate 2016“Selfish Accessibility” for Create Upstate 2016
“Selfish Accessibility” for Create Upstate 2016Adrian Roselli
 
How to go from structureless to structured without losing your vibe
How to go from structureless to structured without losing your vibeHow to go from structureless to structured without losing your vibe
How to go from structureless to structured without losing your vibeCamille Fournier
 
Using the Ebscohost Research Databases final
Using the Ebscohost Research Databases final Using the Ebscohost Research Databases final
Using the Ebscohost Research Databases final Oak Hall School
 

Semelhante a Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and Graph Database approach (20)

Oak Hill's Sr Projects Res Step By Step PPT#3
Oak Hill's Sr Projects Res Step By Step PPT#3Oak Hill's Sr Projects Res Step By Step PPT#3
Oak Hill's Sr Projects Res Step By Step PPT#3
 
Library Linked Data
Library Linked DataLibrary Linked Data
Library Linked Data
 
Better Search Engine Testing
Better Search Engine TestingBetter Search Engine Testing
Better Search Engine Testing
 
Open Creativity Scoring Tutorial
Open Creativity Scoring TutorialOpen Creativity Scoring Tutorial
Open Creativity Scoring Tutorial
 
Using the ebsc ohost research databases revised 2014
Using the ebsc ohost research databases revised 2014Using the ebsc ohost research databases revised 2014
Using the ebsc ohost research databases revised 2014
 
UXSG2014 Lightning Talks - Selfish accessibility (Adrian Roselli)
UXSG2014 Lightning Talks - Selfish accessibility (Adrian Roselli)UXSG2014 Lightning Talks - Selfish accessibility (Adrian Roselli)
UXSG2014 Lightning Talks - Selfish accessibility (Adrian Roselli)
 
Selfish Accessibility: WordCamp Buffalo 2014
Selfish Accessibility: WordCamp Buffalo 2014Selfish Accessibility: WordCamp Buffalo 2014
Selfish Accessibility: WordCamp Buffalo 2014
 
Write A Better FM - Ohio Linux 2011
Write A Better FM - Ohio Linux 2011Write A Better FM - Ohio Linux 2011
Write A Better FM - Ohio Linux 2011
 
Selfish Accessibility: WordCamp Toronto 2014
Selfish Accessibility: WordCamp Toronto 2014Selfish Accessibility: WordCamp Toronto 2014
Selfish Accessibility: WordCamp Toronto 2014
 
Waves keynote2c
Waves keynote2cWaves keynote2c
Waves keynote2c
 
A Dickens of A Keynote
A Dickens of A KeynoteA Dickens of A Keynote
A Dickens of A Keynote
 
Lipstick on a Pig: Integrated Library Systems
Lipstick on a Pig: Integrated Library SystemsLipstick on a Pig: Integrated Library Systems
Lipstick on a Pig: Integrated Library Systems
 
Selfish Accessibility: a11y Camp Toronto 2014
Selfish Accessibility: a11y Camp Toronto 2014Selfish Accessibility: a11y Camp Toronto 2014
Selfish Accessibility: a11y Camp Toronto 2014
 
Information Architecture Heuristics
Information Architecture HeuristicsInformation Architecture Heuristics
Information Architecture Heuristics
 
“Selfish Accessibility” for Create Upstate 2016
“Selfish Accessibility” for Create Upstate 2016“Selfish Accessibility” for Create Upstate 2016
“Selfish Accessibility” for Create Upstate 2016
 
How to go from structureless to structured without losing your vibe
How to go from structureless to structured without losing your vibeHow to go from structureless to structured without losing your vibe
How to go from structureless to structured without losing your vibe
 
Metadata
MetadataMetadata
Metadata
 
Using the Ebscohost Research Databases final
Using the Ebscohost Research Databases final Using the Ebscohost Research Databases final
Using the Ebscohost Research Databases final
 
Contributions to the World of eScience from the Royal Society of Chemistry
Contributions to the World of eScience from the Royal Society of ChemistryContributions to the World of eScience from the Royal Society of Chemistry
Contributions to the World of eScience from the Royal Society of Chemistry
 
Beyond the paper CV and developing a scientific profile through social media,...
Beyond the paper CV and developing a scientific profile through social media,...Beyond the paper CV and developing a scientific profile through social media,...
Beyond the paper CV and developing a scientific profile through social media,...
 

Mais de Rothamsted Research, UK

FAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use CaseFAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use CaseRothamsted Research, UK
 
Interoperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use CasesInteroperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use CasesRothamsted Research, UK
 
AgriSchemas: Sharing Agrifood data with Bioschemas
AgriSchemas: Sharing Agrifood data with BioschemasAgriSchemas: Sharing Agrifood data with Bioschemas
AgriSchemas: Sharing Agrifood data with BioschemasRothamsted Research, UK
 
Publishing and Consuming FAIR Data A Case in the Agri-Food Domain
Publishing and Consuming FAIR DataA Case in the Agri-Food DomainPublishing and Consuming FAIR DataA Case in the Agri-Food Domain
Publishing and Consuming FAIR Data A Case in the Agri-Food DomainRothamsted Research, UK
 
AgriFood Data, Models, Standards, Tools, Use Cases
AgriFood Data, Models, Standards, Tools, Use CasesAgriFood Data, Models, Standards, Tools, Use Cases
AgriFood Data, Models, Standards, Tools, Use CasesRothamsted Research, UK
 
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...Rothamsted Research, UK
 
A Preliminary survey of RDF/Neo4j as backends for KnetMiner
A Preliminary survey of RDF/Neo4j as backends for KnetMinerA Preliminary survey of RDF/Neo4j as backends for KnetMiner
A Preliminary survey of RDF/Neo4j as backends for KnetMinerRothamsted Research, UK
 
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...Rothamsted Research, UK
 
graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...Rothamsted Research, UK
 
myEquivalents, aka a new cross-reference service
myEquivalents, aka a new cross-reference servicemyEquivalents, aka a new cross-reference service
myEquivalents, aka a new cross-reference serviceRothamsted Research, UK
 

Mais de Rothamsted Research, UK (20)

FAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use CaseFAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use Case
 
Interoperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use CasesInteroperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use Cases
 
AgriSchemas: Sharing Agrifood data with Bioschemas
AgriSchemas: Sharing Agrifood data with BioschemasAgriSchemas: Sharing Agrifood data with Bioschemas
AgriSchemas: Sharing Agrifood data with Bioschemas
 
Publishing and Consuming FAIR Data A Case in the Agri-Food Domain
Publishing and Consuming FAIR DataA Case in the Agri-Food DomainPublishing and Consuming FAIR DataA Case in the Agri-Food Domain
Publishing and Consuming FAIR Data A Case in the Agri-Food Domain
 
Continuos Integration @Knetminer
Continuos Integration @KnetminerContinuos Integration @Knetminer
Continuos Integration @Knetminer
 
Better Data for a Better World
Better Data for a Better WorldBetter Data for a Better World
Better Data for a Better World
 
AgriSchemas Progress Report
AgriSchemas Progress ReportAgriSchemas Progress Report
AgriSchemas Progress Report
 
AgriFood Data, Models, Standards, Tools, Use Cases
AgriFood Data, Models, Standards, Tools, Use CasesAgriFood Data, Models, Standards, Tools, Use Cases
AgriFood Data, Models, Standards, Tools, Use Cases
 
Notes about SWAT4LS 2018
Notes about SWAT4LS 2018Notes about SWAT4LS 2018
Notes about SWAT4LS 2018
 
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
 
Knetminer Backend Training, Nov 2018
Knetminer Backend Training, Nov 2018Knetminer Backend Training, Nov 2018
Knetminer Backend Training, Nov 2018
 
A Preliminary survey of RDF/Neo4j as backends for KnetMiner
A Preliminary survey of RDF/Neo4j as backends for KnetMinerA Preliminary survey of RDF/Neo4j as backends for KnetMiner
A Preliminary survey of RDF/Neo4j as backends for KnetMiner
 
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
 
graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...
 
Interoperable Open Data: Which Recipes?
Interoperable Open Data: Which Recipes?Interoperable Open Data: Which Recipes?
Interoperable Open Data: Which Recipes?
 
Linked Data with the EBI RDF Platform
Linked Data with the EBI RDF PlatformLinked Data with the EBI RDF Platform
Linked Data with the EBI RDF Platform
 
BioSD Linked Data: Lessons Learned
BioSD Linked Data: Lessons LearnedBioSD Linked Data: Lessons Learned
BioSD Linked Data: Lessons Learned
 
BioSD Tutorial 2014 Editition
BioSD Tutorial 2014 EdititionBioSD Tutorial 2014 Editition
BioSD Tutorial 2014 Editition
 
myEquivalents, aka a new cross-reference service
myEquivalents, aka a new cross-reference servicemyEquivalents, aka a new cross-reference service
myEquivalents, aka a new cross-reference service
 
Dev 2014 LOD tutorial
Dev 2014 LOD tutorialDev 2014 LOD tutorial
Dev 2014 LOD tutorial
 

Último

Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一F sss
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
Business Analytics using Microsoft Excel
Business Analytics using Microsoft ExcelBusiness Analytics using Microsoft Excel
Business Analytics using Microsoft Excelysmaelreyes
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 

Último (20)

Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Business Analytics using Microsoft Excel
Business Analytics using Microsoft ExcelBusiness Analytics using Microsoft Excel
Business Analytics using Microsoft Excel
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 

Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and Graph Database approach

  • 1. Towards FAIRer Biological Knowledge Networks 
 Using a Hybrid Linked Data 
 and Graph Database approach Harpenden, 5/6/2018
 
 Marco Brandizi <marco.brandizi@rothamsted.ac.uk> Find these slides on SlideShare KnetMiner-inspired Artwork
 by Hugo Dalton (hugodalton.com)
  • 2. Can we do More with KnetMiner Data? (and better)
  • 3. Behind the Scenes • Starting point: graph data model • With concepts, relations between concepts hierarchies of concept classes and relation types • => There are standardised ways for it • Make app development easier • independent components on top of a unified data model • clear separation between data access and apps • Serve third-party applications, making their data access no different than ours • Simplify the way we ingest data, • ease conversions from multiple formats into unified model • relax the high-memory requirements need (e.g., backing data store) • prepare for scalability (e.g., cloud stores, big data stores)
  • 4. Putting it on a Bigger Picture
  • 5. The Semantic Web Way • It’s for networked knowledge (semantic networks) • Focuses on sharing via web technologies and principles (eg, share resolvable URIs) • Rich ‘schema’ language, already much used in life sciences (i.e., ontologies, coming from frames and 1st- order logics) • protocol + a standard query language (SPARQL)
  • 6. Modelling data with OWL:
 Promises & Wishes
  • 7. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible
  • 8. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible • On the messy web ocean?!
  • 9. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible • On the messy web ocean?! • What about performance?!
  • 10. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible • On the messy web ocean?! • What about performance?! • Very formal semantics is not very easy:
  • 11. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible • On the messy web ocean?! • What about performance?! • Very formal semantics is not very easy: • SomeValuesFrom Restriction?!
  • 12. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible • On the messy web ocean?! • What about performance?! • Very formal semantics is not very easy: • SomeValuesFrom Restriction?! • My blood sample derives from some Skolem human, not from NCBI:HomoSapiens?
  • 13. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible • On the messy web ocean?! • What about performance?! • Very formal semantics is not very easy: • SomeValuesFrom Restriction?! • My blood sample derives from some Skolem human, not from NCBI:HomoSapiens? • Ontologies defined for the whole world and then harmoniously and lovely shared
  • 14. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible • On the messy web ocean?! • What about performance?! • Very formal semantics is not very easy: • SomeValuesFrom Restriction?! • My blood sample derives from some Skolem human, not from NCBI:HomoSapiens? • Ontologies defined for the whole world and then harmoniously and lovely shared • Fairly reasonable: all those vocabularies are headaches, I don’t have expertise
  • 15. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible • On the messy web ocean?! • What about performance?! • Very formal semantics is not very easy: • SomeValuesFrom Restriction?! • My blood sample derives from some Skolem human, not from NCBI:HomoSapiens? • Ontologies defined for the whole world and then harmoniously and lovely shared • Fairly reasonable: all those vocabularies are headaches, I don’t have expertise • Reasonable: I have a different point of view
  • 16. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible • On the messy web ocean?! • What about performance?! • Very formal semantics is not very easy: • SomeValuesFrom Restriction?! • My blood sample derives from some Skolem human, not from NCBI:HomoSapiens? • Ontologies defined for the whole world and then harmoniously and lovely shared • Fairly reasonable: all those vocabularies are headaches, I don’t have expertise • Reasonable: I have a different point of view • Not always reasonable: possibly, yours is complicated/wrong/stupid/idiotic/ worse
  • 17. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible • On the messy web ocean?! • What about performance?! • Very formal semantics is not very easy: • SomeValuesFrom Restriction?! • My blood sample derives from some Skolem human, not from NCBI:HomoSapiens? • Ontologies defined for the whole world and then harmoniously and lovely shared • Fairly reasonable: all those vocabularies are headaches, I don’t have expertise • Reasonable: I have a different point of view • Not always reasonable: possibly, yours is complicated/wrong/stupid/idiotic/ worse • Likely not reasonable: Not Invented Here
  • 18. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible • On the messy web ocean?! • What about performance?! • Very formal semantics is not very easy: • SomeValuesFrom Restriction?! • My blood sample derives from some Skolem human, not from NCBI:HomoSapiens? • Ontologies defined for the whole world and then harmoniously and lovely shared • Fairly reasonable: all those vocabularies are headaches, I don’t have expertise • Reasonable: I have a different point of view • Not always reasonable: possibly, yours is complicated/wrong/stupid/idiotic/ worse • Likely not reasonable: Not Invented Here • No comment: if I reinvent it, I can publish it
  • 19. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible • On the messy web ocean?! • What about performance?! • Very formal semantics is not very easy: • SomeValuesFrom Restriction?! • My blood sample derives from some Skolem human, not from NCBI:HomoSapiens? • Ontologies defined for the whole world and then harmoniously and lovely shared • Fairly reasonable: all those vocabularies are headaches, I don’t have expertise • Reasonable: I have a different point of view • Not always reasonable: possibly, yours is complicated/wrong/stupid/idiotic/ worse • Likely not reasonable: Not Invented Here • No comment: if I reinvent it, I can publish it • Just joking (maybe…): Your ontology is good, but I’d rather stab you on your back
  • 20. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible • On the messy web ocean?! • What about performance?! • Very formal semantics is not very easy: • SomeValuesFrom Restriction?! • My blood sample derives from some Skolem human, not from NCBI:HomoSapiens? • Ontologies defined for the whole world and then harmoniously and lovely shared • Fairly reasonable: all those vocabularies are headaches, I don’t have expertise • Reasonable: I have a different point of view • Not always reasonable: possibly, yours is complicated/wrong/stupid/idiotic/ worse • Likely not reasonable: Not Invented Here • No comment: if I reinvent it, I can publish it • Just joking (maybe…): Your ontology is good, but I’d rather stab you on your back In fact, they did this
  • 21. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible • On the messy web ocean?! • What about performance?! • Very formal semantics is not very easy: • SomeValuesFrom Restriction?! • My blood sample derives from some Skolem human, not from NCBI:HomoSapiens? • Ontologies defined for the whole world and then harmoniously and lovely shared • Fairly reasonable: all those vocabularies are headaches, I don’t have expertise • Reasonable: I have a different point of view • Not always reasonable: possibly, yours is complicated/wrong/stupid/idiotic/ worse • Likely not reasonable: Not Invented Here • No comment: if I reinvent it, I can publish it • Just joking (maybe…): Your ontology is good, but I’d rather stab you on your back In fact, they did this
  • 22. Modelling data with OWL:
 Promises & Wishes • Rich semantics, but also very formal, so that powerful automated reasoning is possible • On the messy web ocean?! • What about performance?! • Very formal semantics is not very easy: • SomeValuesFrom Restriction?! • My blood sample derives from some Skolem human, not from NCBI:HomoSapiens? • Ontologies defined for the whole world and then harmoniously and lovely shared • Fairly reasonable: all those vocabularies are headaches, I don’t have expertise • Reasonable: I have a different point of view • Not always reasonable: possibly, yours is complicated/wrong/stupid/idiotic/ worse • Likely not reasonable: Not Invented Here • No comment: if I reinvent it, I can publish it • Just joking (maybe…): Your ontology is good, but I’d rather stab you on your back In fact, they did this
  • 23. Simplifying Views in BioKNO obo:GO_0030015 a owl:Class ; rdfs:label "CCR4-NOT core complex"^^xsd:string ; rdfs:subClassOf obo:GO_0044424, obo:GO_0044424, [ a owl:Restriction ; owl:onProperty <http://purl.obolibrary.org/obo/BFO_0000050> ; # 'part of' owl:someValuesFrom obo:GO_0030014 # CCR4-NOT complex ] ; oboInOwl:id "GO:0030015"^^xsd:string ; obo:IAO_0000115 "The core of the CCR4-NOT complex. In Saccharomyces the CCR4-NOT..."; oboInOwl:hasOBONamespace "cellular_component"^^xsd:string .
  • 24. Simplifying Views in BioKNO obo:GO_0030015 a owl:Class ; rdfs:label "CCR4-NOT core complex"^^xsd:string ; rdfs:subClassOf obo:GO_0044424, obo:GO_0044424, [ a owl:Restriction ; owl:onProperty <http://purl.obolibrary.org/obo/BFO_0000050> ; # 'part of' owl:someValuesFrom obo:GO_0030014 # CCR4-NOT complex ] ; oboInOwl:id "GO:0030015"^^xsd:string ; obo:IAO_0000115 "The core of the CCR4-NOT complex. In Saccharomyces the CCR4-NOT..."; oboInOwl:hasOBONamespace "cellular_component"^^xsd:string . obo:GO_0030014 a bk:GeneOntologyTerm ; dc:identifier obo:GO_0030014_acc ; bk:is_a obo:GO_0044424 , obo:GO_0043234 ; bk:prefName "CCR4-NOT complex" . obo:GO_0030015 a bk:GeneOntologyTerm; bk:prefName "CCR4-NOT core complex"; bk:is_a obo:GO_0044424, obo:GO_0043234 ; bk:part_of obo:GO_0030014; dc:identifier obo:GO_0030015_acc. obo:GO_0044424 a bk:GeneOntologyTerm; bk:prefName "intracellular part" ; • OWL is simplified mixing classes with SKOS-style concepts • More suitable for less formal, more simple taxonomies • OWL-2 punning makes it consistent
  • 25. The BioKNO Ontology
 (and The rest of the World) BioKNO External Ontologies Mapping Type bk:Concept skos:Concept Subclass bk:Relation bk:relFrom bk:relTypeRef bk:relTo rdf:Statement
 rdf:subject rdf:predicate rdf:object Subclass Subproperties (ie, mapping to RDF reified statements) bk:Path, bk:Participant, bk:Interaction, bk:Transport, bk:Protein, bk:Gene Classes with same names in BioPAX and SIO Equivalent Class bk:participates_in bk:has_participant Relation Ontology (RO) properties with same names
 biopax:participant (as sub-property) Equivalent property bk:produces bk:produced_by bk:consumes bk:consumed_by biopax:product (as sub-property) RO properties with same names Equivalent property bk:regulates bk:positively_regulates bk:negatively_regulates RO properties with same names Equivalent property bk:is_a bk:part_of, bk:has_part bk:occurs_in, bk:co_occurs_with skos:broader Basic Formal Ontology (BFO)/RO properties with same names Equivalent property bk:Publication schema:CreativeWork Subclass bka:abstract bka:title (also known as AbstractHeader) bka:authors dcterms:description dcterms:title dc:creator Sub-property
  • 27. Putting it on a Bigger Picture
  • 28. Putting it on a Bigger Picture
  • 32. CONSTRUCT { ?protIri bk:expressed_by ?sampleIri. ?degRelIri a bk:Relation; bka:PVALUE ?pValue; bk:evidence bkev:EXP; # Inferred from experiment bk:relFrom ?protIri; # Details defined by UniProt info bk:relTo ?sampleIri; # Details defined by sample_degs_2.tsv bk:relTypeRef bk:expressed_by. } WHERE { # Some IDs and IRIs to be defined above BIND ( LCASE ( REPLACE ( ?Sample, ' ', '_' ) ) AS ?sampleId ) BIND ( IRI ( CONCAT ( STR ( bkr: ), ?Gene_Symbol ) ) AS ?protIri ) BIND ( IRI ( CONCAT ( STR ( bkr: ), 'degex_', ?sampleId ) ) AS ?sampleIri ) BIND ( IRI ( CONCAT ( STR ( bkr: ), 'degex_', ?sampleId, '_', LCASE ( ?Gene_Symbol ) ) )
 AS ?degRelIri ) BIND ( xsd:double ( ?p_value ) AS ?pValue ) } Extraction, Loading, Transformation
 SPARQL/TARQL Example
  • 33. SPARQL/RDF for ELT • RDF-to-RDF translation via CONSTRUCT (or SPARUL) • TARQL: Using SPARQL to RDF-Convert Tabular CSV Files • RDF/XML can be transformed via XSL • We have done it for bio-specific ontology definitions in Ondex • Programmatic conversions • Using RDF frameworks, eg, Jena, RDF4J (former Sesame), rdflib for Python • See also java2rdf (https://github.com/EBIBioSamples/java2rdf) • We have used it for the Ondex->RDF converter
  • 34. SPARQL/RDF for ELT • RDF-to-RDF translation via CONSTRUCT (or SPARUL) • TARQL: Using SPARQL to RDF-Convert Tabular CSV Files • RDF/XML can be transformed via XSL • We have done it for bio-specific ontology definitions in Ondex • Programmatic conversions • Using RDF frameworks, eg, Jena, RDF4J (former Sesame), rdflib for Python • See also java2rdf (https://github.com/EBIBioSamples/java2rdf) • We have used it for the Ondex->RDF converter
  • 36. Issues https://lod-cloud.net/ • Still not so popular (especially in more commercial contexts) • It’s (perceived as) difficult (in particular, SPARQL) • Bad reputation • Performance can still be an issue • eg, optimising SPARQL can be hard • Specific issues • eg, I need contextualised/attribute-attached properties • and I don’t fancy reified relations…
  • 37. Another Graph Database World
 Property Graphs
  • 38. Neo4j on top Of RDF
  • 39. Application to Semantic Motif Search
  • 41. Triple Stores vs Prop Graphs Neo4j, Cypher DBs, Graph DBs Semantic Web/Triple Stores Data xchg format - No official one, just Cypher, 
 Support for GraphML, RDF
 +/- Focus on backing applications + Focus on data sharing standards Data model + Relations with properties - Metadata/schemas/ontologies management - Relations cannot have properties (reification required) + Metadata/schemas/ontologies as first citizen and standardised OWL Performance + complex graph traversals + Comparable in most cases Query Language + Cypher is easier (eg, compact, implicit elems)?
 - Expressivity issues (unions) - No standard QL (but efforts in progress, eg, OpenCypher) - SPARQL is Harder? (URIs, namespaces, verbosity)
 + SPARQL More expressive Standardisation, openness +/- (TinkerPop is open, Neo4j isn’t) + Commercial support + More alive and up-to date (e.g., support for Hadoop, nice Neo4j browser, easy installation) + Natively open, many open implementations - Instability and many short-lived prototypes - Advancements seems to be slowing down + Some nice open and commercial browser (LODEStar, Scalability,
 big data +/- Commercial support to clustering/clouds for Neo4j
 + Open support in TinkerPop + Load Balancing/Cluster solutions, Commercial Cloud support (eg GraphDB)
 + SPARQL Over TinkerPop (via SAIL inteface)
  • 42. Bridging to RDF: JSON-LD … "@id": "bkr:TOB1", "@type": "bk:Protein", "prefName": "TOB1 Human", "dcterms:identifier": "TOB1", "is_annotated_by": "obo:GO_0030014", "participates_in": { "@id": "http://www.wikipathways.org/id1", "@type": "bk:Pathway", "evidence": "bkev:IMPD", "prefName":
 “Bone Morphogenic Protein (BMP) Signalling and Regulation" } } { "@context": { "bk": "http://www.ondex.org/bioknet/terms/", "bka": "http://www.ondex.org/bioknet/terms/attributes/", "bkds": "http://www.ondex.org/bioknet/terms/dataSources/", "bkev": "http://www.ondex.org/bioknet/terms/evidences/", "bkr": "http://www.ondex.org/bioknet/resources/", "dcterms": "http://purl.org/dc/terms/", "obo": "http://purl.obolibrary.org/obo/", "xsd": "http://www.w3.org/2001/XMLSchema#", "@vocab": "http://www.ondex.org/bioknet/terms/", "dcterms:identifier": { "@type": "xsd:string" }, "evidence": { "@type": “@id" } }, …
  • 43. KnetMiner UI Overview Search Select Explore Addressing FAIR • Findable (or, Semantic Web is still useful) • SPARQL endpoint • which powers URI Resolution • Dataset-level metadata (e.g., VoID) • Mapping to Standard Ontologies • Interested in contributing to existing standards
 (e.g., Bioschemas) • API/JSON-Schema formalisation • Accessible • Multiple access means (SPARQL, URIs, JSON APIs, Cypher) • Triple Stores and Property Graphs are complementary, not alternative • Data dumps • Interoperable (or, Sem Web is still useful) • Unified model encoded in at least one common syntax (RDF) • URIs are reused • Mappings to ontologies • Reusable • All of the above, plus multiple interfaces under unified model • Support to common graph languages (e.g., Cytoscape.js) • Converters (e.g., our RDF conversion scripts/tools, rdf2neo) • Open Data licences