A discovery platform for translational research
Núria Queralt Rosinach
Integrative Biomedical Informatics Group (IBI)
Research Programme on Biomedical Informatics (GRIB)
Hospital del Mar Research Institute (IMIM)
Pompeu Fabra University (UPF)
Barcelona
Usage Tutorial
Outline
• How can DisGeNET help your research?
• DisGeNET Discovery Platform Overview
• DisGeNET Linked Open Data
– Introduction
• RDF-LD Description: Data Model, VoID, Interlinking
• Implementation
• Accessibility
• Documentation
• Use Cases
– Querying the DisGeNET-RDF
• Hands-on
DisGeNET Tutorial, SWAT4LS 2015
How can DisGeNET help your research?
Big Questions 4 Big Data
[Diagram: Understanding Human Diseases. Labels: Genotype; Phenotype; Environment (lifestyle, chemicals, radiation, infections, clinical care intervention, …); Human Biology; Medical Sciences; PPI; DDI; Comorbidities]
Data sources shown:
• EMR, EHR, IoT; imaging; patient registries; clinical trials; epidemiologic studies; …
• Databases; literature; OMICS; animal models; …
Translational Research
[Diagram: Understanding Human Diseases. Labels: Genotype; Phenotype; Environment; Molecular; Patient]
Data sources shown:
• EMR, EHR, IoT; imaging; patient registries; clinical trials; epidemiologic studies; …
• Databases; literature; OMICS; animal models; …
Key in Translational Research:
• Decision-making
• Prevention
• Diagnosis
• Therapies
• Research Discovery
Access to Gene-Disease Associations
Mental retardation -?- SOX3
SOX3 disease annotations as returned by different resources:
• OMIM:300123; OMIM:312000
• ORPHA393; ORPHA90695; ORPHA3157; ORPHA79495; ORPHA67045
• Mental Retardation; Panhypopituitarism; 46,XX sex reversal 3
• MESH:C538613; MESH:C538613
• No Data
Lack of:
• Normalization
• Semantic integration
• Data model harmonization
• Unified access
http://www.disgenet.org/
• Piñero et al. DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes. Database (2015), Vol. 2015: article ID bav028
• Knowledge platform on human gene-disease associations (GDAs)
• Integrates information from expert-curated databases and from the literature (text mining)
• All disease areas
• Supporting evidence
• Analysis tools
Research Questions
[Diagram labels: Evidence; ANALYSIS; KNOWLEDGE DISCOVERY; ACTIONABLE INFORMATION]
• Which genes are associated with Marfan syndrome?
• Which disease genes have approved drugs annotated?
• Which disease genes have differential expression?
• Which disease genes share a pathway?
• Is there genetic variation related to the MECP2 and Rett Syndrome association?
• What evidence supports the association between the APP gene and Alzheimer Disease?
• Which genes and evidence support the comorbidity between Chronic Kidney Disease and Diabetes Mellitus, Type 2?
DisGeNET Discovery Platform Overview
DisGeNET Implementation
Gene-disease associations are gathered from:
• Biomedical databases
• Text mining (Bio-Entity Finder and Relation Extraction): http://ibi.imim.es/befree/
DisGeNET Sources (DisGeNET v3.0)
Source types: CURATED, PREDICTED, LITERATURE
Sources named on the slide: GAD, LHGDN
Data Integration
• Use of standards and controlled vocabularies
Gene-Disease association (EVIDENCE): Disease, Gene, Source, Score, Article, Sentence, SNP
Type of association: DisGeNET ONTOLOGY
Disease annotations: MeSH Class, UMLS STY
Gene annotations: Panther Class, Pathway, Protein
Score: $S = S_{curated} + S_{predicted} + S_{literature}$
• Aggregation of evidence
• Not related to text mining score
http://www.disgenet.org/web/DisGeNET/menu/dbinfo#score
DisGeNET Statistics (May 15th, 2015), DisGeNET v3.0, Annual Release
Source | Genes | Diseases | Associations
Curated | 7,878 | 6,761 | 26,522
Predicted | 2,557 | 2,003 | 9,536
Literature | 16,298 | 11,374 | 408,175
All | 17,181 | 14,619 | 429,111
82 %
Large volume of information unlocked by text mining the literature
Tools for exploration
Usage stats (Aug 2014 - Aug 2015):
• 12,040 users, 22,696 sessions (4:33 min/session)
• 14,494 downloads (database, Cytoscape plugin, RDF/Nanopubs)
• DisGeNET used in more than 20 publications, cited in more than 60 articles
(Onexus)
Web Interface
Example: Panhypopituitarism (UMLS:C0342376) and SOX3 (NCBI:6658)
Cytoscape Plugin
Example: Panhypopituitarism (UMLS:C0342376) and SOX3 (NCBI:6658)
Edge = EVIDENCE (source, PMID, type of relation)
DisGeNET Linked Open Data
DisGeNET as Linked Open Data
• RDF and trusty nanopublications
– URIs: RDF providers or
– SIO
– Use of standards (11 ontologies in NCBO)
• Metadata description (W3C HCLS)
• Interlinking
– Bio2RDF
– Linked Life Data
• Access
– Download Data Dump
– SPARQL Endpoint
– Faceted Browser
– Open PHACTS
• Nanopublication Network
• Open license
• Datahub
• Software
http://lod-cloud.net/; Aug 2014
DisGeNET-RDF
Data Model
• How to describe an association?
a) As a property: Gene (S) -- associated (P) --> Disease (O)
b) As a class: Gene (O) <-- (P) -- Association (S) -- (P) --> Disease (O)
Provenance and Evidence: RDF triples
Data Model
• Ontology-based integration
• DisGeNET Standards
– Shared IDs
– Standard ontologies
Gene (O) <-- (P) -- Association (S) -- (P) --> Disease (O)
Association rdf:type: DisGeNET Association Type Ontology
http://semanticscience.org/ontology/sio.owl
Data Model
• Semantic Annotation: Standard ontologies
Prefix | Namespace | Vocabulary
ncit | http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl# | NCI Thesaurus
sio | http://semanticscience.org/resource/ | SIO
up | http://purl.uniprot.org/core/ | UniProt
void | http://rdfs.org/ns/void# | VoID
foaf | http://xmlns.com/foaf/0.1/ | FOAF Vocabulary
dcterms | http://purl.org/dc/terms/ | DCMI Terms
rdf | http://www.w3.org/1999/02/22-rdf-syntax-ns# | RDF
rdfs | http://www.w3.org/2000/01/rdf-schema# | RDF Schema
xsd | http://www.w3.org/2001/XMLSchema# | XML Schema
owl | http://www.w3.org/2002/07/owl# | OWL
skos | http://www.w3.org/2004/02/skos/core# | SKOS
Groups highlighted on the slide: Biomedical entities, Relationships, RDF Structure, Metadata
Data Model
• URIs in DisGeNET: shared, cool & dereferenceable
– ID Normalization
– DisGeNET URIs (unique association attributes): http://rdf.disgenet.org/resource/entity/ID
– Stable URIs from primary data providers
– Identifiers.org: http://identifiers.org/data-collection-namespace/ID
– Gene-Disease Association :: DisGeNET ID
Entity | URI | Semantics
Gene-Disease Association | http://rdf.disgenet.org/resource/gda/DGNf5cb3969d75871f05a5d5f984f8dfc34 | sio:SIO_001122
PubMed article | http://identifiers.org/pubmed/9837812 | ncit:C47902
Source | http://rdf.disgenet.org/v3.0.0/void/uniprot-20150221 | dctypes:Dataset, dcat:Distribution
Score | http://rdf.disgenet.org/resource/gda/ncbigene:4728_umls:C0023264_association_DisGeNETScore | ncit:C25338
SNP | http://identifiers.org/dbsnp/rs28939679 | ncit:C18279
– Gene :: NCBI Gene ID
Entity | URI | Semantics
Gene | http://identifiers.org/ncbigene/4728 | ncit:C16612
HGNC Gene Symbol | http://identifiers.org/hgnc.symbol/NDUFS8 | ncit:C43568
Protein | http://identifiers.org/uniprot/O00217 | ncit:C17021
Panther Class | http://rdf.disgenet.org/resource/panther.classification/PC00211 | rdfs:Class
Pathway | http://identifiers.org/reactome/REACT_111217 | ncit:C20633
– Disease :: UMLS Concept Unique Identifier (CUI)
Entity | URI | Semantics
Disease | http://linkedlifedata.com/resource/umls/id/C0023264 | ncit:C7057
MeSH Class | http://rdf.imim.es/rh-mesh.owl#C18 | rdfs:Class
UMLS Semantic Type | http://biotop.googlecode.com/svn/trunk/umlssn.owl#T047 | rdfs:Class
Phenotype | http://purl.obolibrary.org/obo/HP_0004633 | sio:SIO_010056
Cross References | http://identifiers.org/vocab-namespace/ID | Human Disease Ontology, MeSH, OMIM, Orphanet, Decipher, NCIt, ICD9, Human Phenotype Ontology
Data Model
Example (Turtle): http://rdf.disgenet.org/download/v3.0.0/DisGeNET-RDF-Example.ttl
Metadata Dataset Description
DisGeNET-RDF VoID file (Vocabulary of Interlinked Datasets)
Dataset: DisGeNET-RDF
Subsets: Gene, Disease, Association, Disease Class, Pathway, SNP, STY, PubMed, Panther Class, Protein, HGNC Symbol
Primary sources: DisGeNET Database (DisGeNET); CTD, UniProt, ClinVar, MGD, BeFree
Interlinking
DisGeNET -- RDF link --> LOD cloud
Example: DisGeNET (Dataset1) -- skos:exactMatch --> UniProt (Dataset2)
Important in Federated Queries!
Interlinking
?s skos:exactMatch ?o
DisGeNET is linked to biomedical databases and disease terminologies: PubMed, UniProt, OMIM, NCBI Gene, Orphanet, UMLS, DBpedia, MeSH, dbSNP
DisGeNET as Linked Open Data
• Interlinking: 4,962,315 RDF links to RDF datasets in the LOD cloud
https://datahub.io/dataset/disgenet (more statistics)
Federated Query Support
• SPARQL 1.1: SERVICE <sparql endpoint> {}
[Diagram labels: Disease ID; Gene ID; GDA ID; skos:exactMatch]
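The SERVICE keyword can be sketched as follows. This is a minimal illustration, not one of the tutorial's own federated queries: the remote endpoint URL is a placeholder, and the assumption that the remote dataset exposes rdfs:label for the linked resources is ours. The local part reuses only terms shown in the slides (ncit:C7057 for diseases, skos:exactMatch for cross-dataset links).

```sparql
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
PREFIX ncit: <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#>

SELECT DISTINCT ?disease ?external ?externalLabel
WHERE {
  # Local part: DisGeNET diseases and the external resources they are linked to
  ?disease a ncit:C7057 ;
           skos:exactMatch ?external .
  # Remote part: evaluated at another SPARQL endpoint.
  # <http://example.org/sparql> is a placeholder, not a real service.
  SERVICE <http://example.org/sparql> {
    ?external rdfs:label ?externalLabel .
  }
}
LIMIT 20
```

The skos:exactMatch link is what makes the join possible: the same URI (?external) appears in both the local pattern and the remote one.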
Implementation
• DisGeNET RDF data, VoID dataset description, and six OWL ontologies loaded into the RDF Store
• Total number of triples: 24,882,432 (8.5 GB)
• SPARQL Endpoint; Faceted Browser; LODEStar: SPARQL + LD Browser
Hardware: 7.1.0
Usage Restrictions
• SPARQL: only SELECT, DESCRIBE, ASK, CONSTRUCT
• Performance optimization:
– Max # of rows per result
– Max query cost estimation time
– Max query execution time
Security: basic setup
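Of the four permitted query forms, the rest of this tutorial only uses SELECT. As a small sketch of another read-only form, the ASK query below checks whether a given gene takes part in any gene-disease association. It assumes the gene URI pattern and the sio:SIO_000628 predicate shown elsewhere in these slides; the gene chosen (ncbigene/4728) is simply the one used in the URI examples.

```sparql
PREFIX sio: <http://semanticscience.org/resource/>

# ASK returns a single boolean instead of a table of rows,
# so it is unaffected by the "max rows per result" limit.
ASK
FROM <http://rdf.disgenet.org>
WHERE {
  ?gda sio:SIO_000628 <http://identifiers.org/ncbigene/4728> .
}
```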
Accessibility
• Download: RDF dump + linksets
– http://rdf.disgenet.org/download/
• Faceted Browser
– http://rdf.disgenet.org/fct/
• SPARQL endpoint
– http://rdf.disgenet.org/sparql/
• EBI :: LODEStar SPARQL + Linked Data Browser
– http://rdf.disgenet.org/lodestar/sparql
• Open PHACTS APIs
– https://dev.openphacts.org/docs/1.5
Documentation
• Descriptions
• RDF Schema
• Points of access
• SPARQL query examples @: http://rdf.disgenet.org/
• Support @: support@disgenet.org
Querying the DisGeNET-RDF
SPARQL QUERIES
• Not easy
• RDF Schema-aware
• Performance issues
• Optimal queries: there is a trade-off between the time you spend analyzing and transforming the query and the performance gains of those transformations
• Technology-dependent
• Combining a lot of information decreases speed (and can make the system fail): better to work locally
• Other approaches in development
– Q/A based on natural language
– Linked Data Fragments
– ElasticSearch
Querying DisGeNET
• SPARQL Queries over DisGeNET data
– http://rdf.disgenet.org/sparql/
– http://rdf.disgenet.org/lodestar/sparql
• Contains all DisGeNET data
• Free access
• SPARQL 1.1 Standard
Data Model
Querying DisGeNET
• SPARQL Queries over DisGeNET data
• Minimal Resource Description Graph
– rdfs:label: name + identifier
– rdfs:comment: human-readable description
– dcterms:title: resource name
– dcterms:identifier: namespace:identifier
– void:inDataset: RDF subset provenance
  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
  PREFIX dcterms: <http://purl.org/dc/terms/>
  PREFIX void: <http://rdfs.org/ns/void#>
  SELECT DISTINCT *
  FROM <http://rdf.disgenet.org>
  WHERE {
    ?subject rdf:type ?type ;
             rdfs:label ?label ;
             rdfs:comment ?comment ;
             dcterms:identifier ?id ;
             dcterms:title ?title ;
             void:inDataset ?rdfSource .
  }
  LIMIT 100
Querying DisGeNET
• SPARQL Queries over DisGeNET data
• Gene-Disease Association Graph
  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX sio: <http://semanticscience.org/resource/>
  SELECT DISTINCT ?gda
  FROM <http://rdf.disgenet.org>
  WHERE {
    ?gda rdf:type sio:SIO_001122 .
  }
  LIMIT 100
• What is the sio:SIO_001122 class?
  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
  PREFIX sio: <http://semanticscience.org/resource/>
  SELECT DISTINCT ?gda ?type ?label
  FROM <http://rdf.disgenet.org>
  WHERE {
    ?gda rdf:type ?type .
    FILTER(?type = sio:SIO_001122)
    ?type rdfs:label ?label
  }
  LIMIT 100
• For each ?gda, show me the ?gene and the ?disease associated, and the ?typeOfAssociation
  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
  PREFIX sio: <http://semanticscience.org/resource/>
  PREFIX ncit: <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#>
  SELECT DISTINCT ?gda ?gene ?disease ?type ?label
  FROM <http://rdf.disgenet.org>
  WHERE {
    ?gda rdf:type ?type ;
         sio:SIO_000628 ?gene, ?disease .
    ?type rdfs:label ?label .
    ?gene a ncit:C16612 .
    ?disease a ncit:C7057
  }
  LIMIT 50
• For each ?gda, show me the ?gene and the ?disease associated, the ?paper, and the ?sentence description of the relationship in the paper
  PREFIX sio: <http://semanticscience.org/resource/>
  PREFIX ncit: <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#>
  PREFIX dcterms: <http://purl.org/dc/terms/>
  SELECT DISTINCT ?gda ?gene ?disease ?paper ?sentence
  FROM <http://rdf.disgenet.org>
  WHERE {
    ?gda sio:SIO_000628 ?gene, ?disease ;
         sio:SIO_000772 ?paper ;
         dcterms:description ?sentence .
    ?gene a ncit:C16612 .
    ?disease a ncit:C7057
  }
  LIMIT 50
• The same query, restricted to sentences that mention "syndrome" (case-insensitive):
  PREFIX sio: <http://semanticscience.org/resource/>
  PREFIX ncit: <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#>
  PREFIX dcterms: <http://purl.org/dc/terms/>
  SELECT DISTINCT ?gda ?gene ?disease ?paper ?sentence
  FROM <http://rdf.disgenet.org>
  WHERE {
    ?gda sio:SIO_000628 ?gene, ?disease ;
         sio:SIO_000772 ?paper ;
         dcterms:description ?sentence .
    FILTER(regex(str(?sentence), "syndrome", "i"))
    ?gene a ncit:C16612 .
    ?disease a ncit:C7057
  }
  LIMIT 50
• For each ?gda, show me the ?gene, ?disease, ?source, and the level of ?evidence of the association
  PREFIX wi: <http://purl.org/ontology/wi/core#>
  PREFIX sio: <http://semanticscience.org/resource/>
  PREFIX ncit: <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#>
  SELECT DISTINCT ?gda ?gene ?disease ?source ?evidence
  FROM <http://rdf.disgenet.org>
  WHERE {
    ?gda sio:SIO_000628 ?gene, ?disease ;
         sio:SIO_000253 ?source .
    ?gene a ncit:C16612 .
    ?disease a ncit:C7057 .
    ?source wi:evidence ?evidence
  }
  LIMIT 50
• For each gene-disease pair, show me the ?numberOfEvidences and the score ?scoreValue
  PREFIX sio: <http://semanticscience.org/resource/>
  PREFIX ncit: <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#>
  SELECT ?gene ?disease (COUNT(DISTINCT ?gda) AS ?numberOfEvidences) ?scoreValue
  FROM <http://rdf.disgenet.org>
  WHERE {
    ?gda sio:SIO_000628 ?gene, ?disease ;
         sio:SIO_000216 ?score .
    ?gene a ncit:C16612 .
    ?disease a ncit:C7057 .
    ?score sio:SIO_000300 ?scoreValue
  }
  GROUP BY ?gene ?disease ?scoreValue
  ORDER BY DESC(?numberOfEvidences) DESC(?scoreValue)
  LIMIT 50
• For each ?gda, show me the ?snp
  PREFIX sio: <http://semanticscience.org/resource/>
  PREFIX ncit: <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#>
  SELECT DISTINCT ?gda ?gene ?disease ?snp
  FROM <http://rdf.disgenet.org>
  WHERE {
    ?gda sio:SIO_000628 ?gene, ?disease ;
         sio:SIO_000001 ?snp .
    ?gene a ncit:C16612 .
    ?disease a ncit:C7057 .
  }
  LIMIT 50
• Go to the Web and understand and execute Q1.1-Q1.4
Querying DisGeNET
• SPARQL Queries over DisGeNET data
• Gene Graph
  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX ncit: <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#>
  SELECT DISTINCT ?gene
  FROM <http://rdf.disgenet.org>
  WHERE {
    ?gene rdf:type ncit:C16612 .
  }
  LIMIT 100
• For each ?gene show me:
– ?identifier, ?name, ?geneSymbol
– ?protein(s)
– ?panther class(es) and ?pantherclassname
– ?pathway(s) and ?pathwayname
• Go to the Web and understand/execute Q1.5
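A partial sketch of the gene exercise, covering only ?identifier and ?name. It assumes that genes carry the dcterms:identifier and dcterms:title properties listed in the Minimal Resource Description Graph. The predicates that link a gene to its symbol, proteins, Panther classes, and pathways are not given in these slides, so they are deliberately left out here; Q1.5 on the website is the reference query.

```sparql
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX ncit: <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#>

SELECT DISTINCT ?gene ?identifier ?name
FROM <http://rdf.disgenet.org>
WHERE {
  # ncit:C16612 is the class used for genes throughout the tutorial
  ?gene a ncit:C16612 ;
        dcterms:identifier ?identifier ;
        dcterms:title ?name .
}
LIMIT 50
```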
Querying DisGeNET
• SPARQL Queries over DisGeNET data
• Disease Graph
  PREFIX ncit: <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#>
  SELECT DISTINCT ?disease
  FROM <http://rdf.disgenet.org>
  WHERE {
    ?disease a ncit:C7057 .
  }
  LIMIT 100
• For the disease <http://linkedlifedata.com/resource/umls/id/C0596263> show me:
– the disease ?name, MeSH disease class ?label, and the umlsSTY ?title
– all cross-references to other disease terminologies
• Go to the Web and understand/execute Q1.6
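A partial sketch of the disease exercise, limited to the name and the cross-references. It assumes the disease name is stored in dcterms:title (as in the Minimal Resource Description Graph) and that cross-references are expressed with skos:exactMatch (as in the Interlinking slides). The predicates for the MeSH class and the UMLS semantic type are not shown in these slides and are omitted; Q1.6 on the website is the reference query.

```sparql
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX skos: <http://www.w3.org/2004/02/skos/core#>

SELECT DISTINCT ?name ?xref
FROM <http://rdf.disgenet.org>
WHERE {
  # Fix the disease of interest with VALUES instead of a FILTER
  VALUES ?disease { <http://linkedlifedata.com/resource/umls/id/C0596263> }
  ?disease dcterms:title ?name ;
           skos:exactMatch ?xref .
}
```

Binding the disease with VALUES lets the store look the resource up directly rather than scanning every disease and filtering afterwards.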
Querying DisGeNET
• SPARQL Queries over DisGeNET data
• Disease mapping to other ontologies
  PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
  SELECT DISTINCT ?disease
  FROM <http://rdf.disgenet.org>
  WHERE {
    ?disease skos:exactMatch ?ontology .
  }
[Diagram labels: Disease; ?link; ?ontology; COVERAGE]
Querying DisGeNET
• SPARQL Queries over DisGeNET data
• Ontology Walking queries
– Grouping of similar instances
– Filtering data
– Query data by classes
• Ontologies loaded in our RDF triple store: SIO, DO, ORDO, NCIT, HPO, and ECO (OWL)
  ?child rdfs:subClassOf+ ?parent
• Go to the Web and understand/execute Q1.7 and Q1.11
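The property path rdfs:subClassOf+ follows one or more subclass links, which is what "walking" the ontology means. A minimal sketch, assuming the SIO class hierarchy is loaded in the queried store as the slide states: it climbs from sio:SIO_001122 (the association class used in the earlier queries) to all of its ancestors. No FROM clause is given because the slides do not say which graph holds the ontologies.

```sparql
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX sio: <http://semanticscience.org/resource/>

SELECT DISTINCT ?ancestor ?label
WHERE {
  # "+" means one or more rdfs:subClassOf steps upwards
  sio:SIO_001122 rdfs:subClassOf+ ?ancestor .
  # Skip anonymous OWL restrictions, keep named classes only
  FILTER(isIRI(?ancestor))
  OPTIONAL { ?ancestor rdfs:label ?label }
}
```

Reversing the pattern (?child rdfs:subClassOf+ sio:SIO_001122) would walk downwards instead, which is the direction used for grouping instances under a common parent class.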
Querying DisGeNET
• SPARQL Queries over DisGeNET data
• Disease-Phenotype Association Graph (curated from HPO)
• Why this model?
  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX ncit: <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#>
  PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
  PREFIX sio: <http://semanticscience.org/resource/>
  SELECT ?disease (COUNT(DISTINCT ?hpdisease) AS ?hpdiseases) (COUNT(DISTINCT ?phenotype) AS ?phenotypes)
  WHERE {
    ?disease rdf:type ncit:C7057 .
    ?disease skos:exactMatch ?hpdisease .
    ?hpdisease sio:SIO_000341 ?phenotype .
  }
  GROUP BY ?disease
  ORDER BY DESC(?hpdiseases)
  LIMIT 100

  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX ncit: <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#>
  PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
  PREFIX sio: <http://semanticscience.org/resource/>
  SELECT ?disease ?hpdisease (COUNT(DISTINCT ?phenotype) AS ?phenotypes)
  WHERE {
    ?disease rdf:type ncit:C7057 .
    ?disease skos:exactMatch ?hpdisease .
    ?hpdisease sio:SIO_000341 ?phenotype .
    FILTER (?disease = <http://linkedlifedata.com/resource/umls/id/C3280766>)
  }
  GROUP BY ?disease ?hpdisease
• How many phenotypes are associated with Orphanet:209?
  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX ncit: <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#>
  PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
  PREFIX sio: <http://semanticscience.org/resource/>
  SELECT ?disease ?hpdisease (COUNT(DISTINCT ?phenotype) AS ?phenotypes)
  WHERE {
    ?disease rdf:type ncit:C7057 .
    ?disease skos:exactMatch ?hpdisease .
    ?hpdisease sio:SIO_000341 ?phenotype .
    FILTER (?hpdisease = <http://identifiers.org/orphanet/209>)
  }
  GROUP BY ?disease ?hpdisease
• How many diseases are associated with a phenotype?
  PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
  PREFIX ncit: <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#>
  PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
  PREFIX sio: <http://semanticscience.org/resource/>
  PREFIX dcterms: <http://purl.org/dc/terms/>
  SELECT ?phenotype ?phenotypeName (COUNT(DISTINCT ?disease) AS ?diseases)
  WHERE {
    ?hpdisease sio:SIO_000341 ?phenotype .
    ?phenotype dcterms:title ?phenotypeName .
    ?disease skos:exactMatch ?hpdisease .
    ?disease rdf:type ncit:C7057 ;
             dcterms:title ?diseaseName .
  }
  GROUP BY ?phenotype ?phenotypeName
  ORDER BY DESC(?diseases)
  LIMIT 100
• Go to the Web and understand/execute Q1.10 and Q1.12
Querying DisGeNET + LOD cloud
• Federated Queries: DisGeNET + external datasets
• Go to the Web and understand/execute the Federated Queries
Use Cases
• What genes are associated with Marfan syndrome?
• What evidence supports the association between the APP gene and Alzheimer Disease?
• What disease classes are associated with the APP gene?
• Which genes and evidence support the comorbidity between Chronic Kidney Disease and Diabetes Mellitus, Type 2?
• What SNPs are related to the MECP2 and Rett Syndrome association?
• Which diseases are associated with the post-translational modification type of association?
• What disease genes are targeted by compounds in ChEMBL?
• What disease genes have differential expression in Gene Expression Atlas?
• What disease genes are in WikiPathways?
• Find compounds (from ChEMBL) that target genes (from DisGeNET) that participate in the same pathway (from WikiPathways)
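The first use case can be sketched with the patterns introduced earlier. This is an illustrative query, not the tutorial's official solution: it assumes disease and gene names are stored in dcterms:title and matches the disease by name, because the slides do not give an identifier for Marfan syndrome.

```sparql
PREFIX sio: <http://semanticscience.org/resource/>
PREFIX ncit: <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#>
PREFIX dcterms: <http://purl.org/dc/terms/>

SELECT DISTINCT ?gene ?geneName ?disease ?diseaseName
FROM <http://rdf.disgenet.org>
WHERE {
  # Each association (?gda) refers to one gene and one disease
  ?gda sio:SIO_000628 ?gene, ?disease .
  ?gene a ncit:C16612 ;
        dcterms:title ?geneName .
  ?disease a ncit:C7057 ;
           dcterms:title ?diseaseName .
  # Name-based match; a known disease URI would be faster and more precise
  FILTER(regex(str(?diseaseName), "marfan", "i"))
}
LIMIT 50
```

A regex over all disease names is slow on a large store, which illustrates the performance trade-off mentioned in the "SPARQL QUERIES" slide; once the disease URI is known, binding it directly is the better choice.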
Acknowledgments
IBI Group
Alba Gutiérrez-Sacristán
Àlex Bravo
Janet Piñero
Núria Queralt Rosinach
Alexia Giannoula
Miguel A. Mayer
Laura I. Furlong
Ferran Sanz
Thanks for your attention!
Questions are welcome

 
HyQue: Evaluating scientific Hypotheses using semantic web technologies
HyQue: Evaluating scientific Hypotheses using semantic web technologiesHyQue: Evaluating scientific Hypotheses using semantic web technologies
HyQue: Evaluating scientific Hypotheses using semantic web technologiesMichel Dumontier
 
The FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyThe FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyFAIRDOM
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...William Gunn
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsCarole Goble
 

Semelhante a DisGeNET Tutorial SWAT4LS 2015-12-07 (20)

DisGeNET: A discovery platform for the dynamical exploration of human disease...
DisGeNET: A discovery platform for the dynamical exploration of human disease...DisGeNET: A discovery platform for the dynamical exploration of human disease...
DisGeNET: A discovery platform for the dynamical exploration of human disease...
 
EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017 EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017
 
Semantic Web for Health Care and Biomedical Informatics
Semantic Web for Health Care and Biomedical InformaticsSemantic Web for Health Care and Biomedical Informatics
Semantic Web for Health Care and Biomedical Informatics
 
Semantic Web Adoption
Semantic Web AdoptionSemantic Web Adoption
Semantic Web Adoption
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysis
 
DisGeNET: a discovery platform to support translational research and drug dis...
DisGeNET: a discovery platform to support translational research and drug dis...DisGeNET: a discovery platform to support translational research and drug dis...
DisGeNET: a discovery platform to support translational research and drug dis...
 
FAIRer Research
FAIRer ResearchFAIRer Research
FAIRer Research
 
UKON 2014
UKON 2014UKON 2014
UKON 2014
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future Perspectives
 
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesApollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
 
Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...
 
Towards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Towards a Simple, Standards-Compliant, and Generic Phylogenetic DatabaseTowards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Towards a Simple, Standards-Compliant, and Generic Phylogenetic Database
 
Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)
 
Use of open_linked_data_in_bioinformatics
Use of open_linked_data_in_bioinformaticsUse of open_linked_data_in_bioinformatics
Use of open_linked_data_in_bioinformatics
 
Semantic (Web) Technologies for Translational Research in Life Sciences
Semantic (Web) Technologies for Translational Research in Life SciencesSemantic (Web) Technologies for Translational Research in Life Sciences
Semantic (Web) Technologies for Translational Research in Life Sciences
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015
 
HyQue: Evaluating scientific Hypotheses using semantic web technologies
HyQue: Evaluating scientific Hypotheses using semantic web technologiesHyQue: Evaluating scientific Hypotheses using semantic web technologies
HyQue: Evaluating scientific Hypotheses using semantic web technologies
 
The FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyThe FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems Biology
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trends
 

DisGeNET Tutorial SWAT4LS 2015-12-07

  • 1. A discovery platform for translational research Núria Queralt Rosinach Integrative Biomedical Informatics Group (IBI) Research Programme on Biomedical Informatics (GRIB) Hospital del Mar Research Institute (IMIM) Pompeu Fabra University (UPF) Barcelona Usage Tutorial
  • 2. Outline • How can DisGeNET help your research? • DisGeNET Discovery Platform Overview • DisGeNET Linked Open Data – Introduction • RDF-LD Description: Data Model, VoID, Interlinking • Implementation • Accessibility • Documentation • Use Cases – Querying the DisGeNET-RDF • Hands-on (DisGeNET Tutorial, SWAT4LS 2015)
  • 3. How can DisGeNET help your research?
  • 4. Big Questions 4 Big Data Genotype Phenotype Environment (life-style, chemicals, radiation, infections, clinical care intervention, …) Human Biology Medical Sciences Understanding Human Diseases PPI DDI Comorbidities - EMR, EHR, IoT - Imaging - Patient registries - Clinical trials - Epidemiologic studies - … - Databases - Literature - OMICS - Animal models - …
  • 5. Translational Research Genotype Phenotype Environment Molecular Patient Understanding Human Diseases - EMR, EHR, IoT - Imaging - Patient registries - Clinical trials - Epidemiologic studies - … - Databases - Literature - OMICS - Animal models - … Key in Translational Research • Decision-making • Prevention • Diagnosis • Therapies • Research Discovery
  • 6. Access to Gene-Disease Associations: SOX3. Mental retardation -?- SOX3 • OMIM:300123; OMIM:312000 • ORPHA393; ORPHA90695; ORPHA3157; ORPHA79495; ORPHA67045 • Mental Retardation; Panhypopituitarism; 46,XX sex reversal 3 • MESH:C538613; MESH:C538613 • No Data
  • 7. Access to Gene-Disease Associations: SOX3. Mental retardation -?- SOX3 • OMIM:300123; OMIM:312000 • ORPHA393; ORPHA90695; ORPHA3157; ORPHA79495; ORPHA67045 • Mental Retardation; Panhypopituitarism; 46,XX sex reversal 3 • MESH:C538613; MESH:C538613 • No Data. Lack of: • Normalization • Semantic integration • Data model harmonization • Unified access
  • 8. http://www.disgenet.org/ • Piñero et al. DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes. Database (2015) Vol. 2015: article ID bav028 • Knowledge platform on human gene-disease associations (GDAs) • Integrates information from expert-curated databases and from the literature (text mining) • All disease areas • Supporting evidence • Analysis tools
  • 9. Research Questions ANALYSIS KNOWLEDGE DISCOVERY ACTIONABLE INFORMATION Evidence • Which genes are associated with Marfan syndrome? • Which disease genes have approved drugs annotated? • Which disease genes have differential expression? • Which disease genes share a pathway? • Is there genetic variation related to the MECP2 and Rett Syndrome association? • What evidence supports the association between the APP gene and Alzheimer Disease? • Which genes and evidence support the comorbidity between Chronic Kidney Disease and Diabetes Mellitus, Type 2?
  • 11. DisGeNET Implementation • Biomedical databases • Text mining: Bio-Entity Finder and Relation Extraction, http://ibi.imim.es/befree/ • Gene-disease associations
  • 12. DisGeNET Sources CURATED PREDICTED LITERATURE GAD LHGDN DisGeNET v3.0
  • 13. Data Integration • Use of standards and controlled vocabularies Gene-Disease Disease Gene Source Score Article Sentence SNP MeSH Class UMLS STY Panther Class Pathway Protein S = S_curated + S_predicted + S_literature EVIDENCE DisGeNET ONTOLOGY
  • 14. Data Integration • Use of standards and controlled vocabularies Gene-Disease Disease Gene Source Score Article Sentence SNP MeSH Class UMLS STY Panther Class Pathway Protein S = S_curated + S_predicted + S_literature EVIDENCE DisGeNET ONTOLOGY Type of association
  • 15. Data Integration • Use of standards and controlled vocabularies Gene-Disease Disease Gene Source Score Article Sentence SNP MeSH Class UMLS STY Panther Class Pathway Protein EVIDENCE DisGeNET ONTOLOGY S = S_curated + S_predicted + S_literature (aggregation of evidence; not related to text mining score) http://www.disgenet.org/web/DisGeNET/menu/dbinfo#score
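
The additive score on slides 13 to 15 can be sketched in a few lines of Python. This is only an illustration of the sum S = S_curated + S_predicted + S_literature; the class name `EvidenceScore` and the component values are hypothetical, and how each component is actually computed is described at the score URL cited on slide 15, not here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EvidenceScore:
    """Hypothetical container for the three score components of one GDA."""
    curated: float
    predicted: float
    literature: float

    def total(self) -> float:
        # S = S_curated + S_predicted + S_literature (slides 13 to 15)
        return self.curated + self.predicted + self.literature


# Made-up component values, chosen only to show the aggregation.
example = EvidenceScore(curated=0.5, predicted=0.25, literature=0.125)
print(example.total())
```

Keeping the three components separate until the final sum makes it easy to see which kind of evidence contributes most to a given association.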
  • 16. DisGeNET Statistics (May 15th, 2015), DisGeNET v3.0 Annual Release. Source: genes, diseases, associations
      – Curated: 7,878 genes, 6,761 diseases, 26,522 associations
      – Predicted: 2,557 genes, 2,003 diseases, 9,536 associations
      – Literature: 16,298 genes, 11,374 diseases, 408,175 associations
      – All: 17,181 genes, 14,619 diseases, 429,111 associations
    82 %: large volume of information unlocked by text mining the literature
  • 17. Tools for exploration. Usage stats (Aug 2014 to Aug 2015): • 12,040 users, 22,696 sessions (4:33 min/session) • 14,494 downloads (database, Cytoscape plugin, RDF/Nanopubs) • DisGeNET used in 20+ publications, cited in 60+ articles (Onexus)
  • 20. DisGeNET Linked Open Data
  • 21. DisGeNET as Linked Open Data • RDF and trusty nanopublications – URIs: RDF providers or – SIO – Use of standards (11 ontologies in NCBO) • Metadata description (W3C HCLS) • Interlinking • Bio2RDF • Linked Life Data • Access • Download Data Dump • SPARQL Endpoint • Faceted Browser • Open PHACTS • Nanopublication Network • Open license • Datahub • Software http://lod-cloud.net/; Aug 2014
  • 23. Data Model • How to describe an association? a) As a property b) As a class Gene associated Disease S P O Gene Association Disease PO SP O
  • 24. Data Model • How to describe an association? a) As a property b) As a class Gene associated Disease S P O Gene Association Disease PO SP O
  • 25. Data Model • How to describe an association? a) As a property b) As a class Gene associated Disease S P O Gene Association Disease PO SP O Provenance and Evidence RDF triples
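
The two modelling options on slides 23 to 25 can be contrasted with plain subject-predicate-object tuples. The sketch below is illustrative only: every `http://example.org/` URI and predicate name is a hypothetical placeholder, not the DisGeNET schema. It shows why modelling the association as a class leaves room for provenance and evidence triples.

```python
from typing import List, Tuple

Triple = Tuple[str, str, str]

EX = "http://example.org/"          # hypothetical namespace for this sketch
GENE = EX + "gene/G1"
DISEASE = EX + "disease/D1"
ASSOC = EX + "association/A1"
ARTICLE = EX + "article/P1"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# a) Association as a property: one triple, nothing to hang evidence on.
as_property: List[Triple] = [
    (GENE, EX + "associatedWith", DISEASE),
]

# b) Association as a class: the association is a resource of its own,
#    so provenance and evidence become ordinary triples about it.
as_class: List[Triple] = [
    (ASSOC, RDF_TYPE, EX + "GeneDiseaseAssociation"),
    (ASSOC, EX + "refersTo", GENE),
    (ASSOC, EX + "refersTo", DISEASE),
    (ASSOC, EX + "hasEvidence", ARTICLE),
]


def objects_of(subject: str, predicate: str, triples: List[Triple]) -> List[str]:
    """Return every object linked to `subject` through `predicate`."""
    return [o for s, p, o in triples if s == subject and p == predicate]


print(objects_of(ASSOC, EX + "hasEvidence", as_class))
```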
  • 26. Data Model • Ontology-based integration • DisGeNET Standards – Shared IDs – Standard ontologies Gene Association Disease PO SP O http://semanticscience.org/ontology/sio.owl DisGeNET Association Type Ontology rdf:type
  • 27. Data Model • Semantic Annotation: Standard ontologies (prefix: namespace, vocabulary)
      – ncit: http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl# (NCI Thesaurus)
      – sio: http://semanticscience.org/resource/ (SIO)
      – up: http://purl.uniprot.org/core/ (UniProt)
      – void: http://rdfs.org/ns/void# (VoID)
      – foaf: http://xmlns.com/foaf/0.1/ (FOAF Vocabulary)
      – dcterms: http://purl.org/dc/terms/ (DCMI Terms)
      – rdf: http://www.w3.org/1999/02/22-rdf-syntax-ns# (RDF)
      – rdfs: http://www.w3.org/2000/01/rdf-schema# (RDF Schema)
      – xsd: http://www.w3.org/2001/XMLSchema# (XML Schema)
      – owl: http://www.w3.org/2002/07/owl# (OWL)
      – skos: http://www.w3.org/2004/02/skos/core# (SKOS)
  • 28. Data Model • Semantic Annotation: Standard ontologies (same prefix table as slide 27), with the label: RDF Structure
  • 29. Data Model • Semantic Annotation: Standard ontologies (same prefix table as slide 27), with the labels: Biomedical entities, Relationships, RDF Structure
  • 30. Data Model • Semantic Annotation: Standard ontologies (same prefix table as slide 27), with the labels: Biomedical entities, Relationships, RDF Structure, Metadata
  • 31. Data Model • URIs in DisGeNET: shared, cool & dereferenceable – ID Normalization – DisGeNET URIs: http://rdf.disgenet.org/resource/entity/ID – Stable URIs from primary data providers – Identifiers.org: http://identifiers.org/data-collection-namespace/ID Unique association attributes
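
The two URI templates on slide 31 can be expressed as small helper functions. This is a minimal sketch that assumes the templates are filled by plain string substitution; the function names `disgenet_uri` and `identifiers_uri` and the validation rule are hypothetical, not part of any DisGeNET tooling.

```python
DISGENET_BASE = "http://rdf.disgenet.org/resource"
IDENTIFIERS_BASE = "http://identifiers.org"


def _check(part: str) -> str:
    """Reject empty parts and parts that would break the URI path (assumed rule)."""
    if not part or any(ch.isspace() or ch == "/" for ch in part):
        raise ValueError(f"invalid URI component: {part!r}")
    return part


def disgenet_uri(entity: str, local_id: str) -> str:
    # Template from the slide: http://rdf.disgenet.org/resource/entity/ID
    return f"{DISGENET_BASE}/{_check(entity)}/{_check(local_id)}"


def identifiers_uri(namespace: str, local_id: str) -> str:
    # Template from the slide: http://identifiers.org/data-collection-namespace/ID
    return f"{IDENTIFIERS_BASE}/{_check(namespace)}/{_check(local_id)}"


print(identifiers_uri("ncbigene", "4728"))
print(disgenet_uri("gda", "DGNf5cb3969d75871f05a5d5f984f8dfc34"))
```

The two printed URIs match the gene and gene-disease association examples listed on slides 32 and 33.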
  • 32. Data Model • URIs in DisGeNET: shared, cool & dereferenceable – ID Normalization – Gene-Disease Association::DisGeNET ID (entity: URI, semantics)
      – Gene-Disease Association: http://rdf.disgenet.org/resource/gda/DGNf5cb3969d75871f05a5d5f984f8dfc34 (sio:SIO_001122)
      – PubMed article: http://identifiers.org/pubmed/9837812 (ncit:C47902)
      – Source: http://rdf.disgenet.org/v3.0.0/void/uniprot-20150221 (dctypes:Dataset, dcat:Distribution)
      – Score: http://rdf.disgenet.org/resource/gda/ncbigene:4728_umls:C0023264_association_DisGeNET Score (ncit:C25338)
      – SNP: http://identifiers.org/dbsnp/rs28939679 (ncit:C18279)
  • 33. Data Model • URIs in DisGeNET: shared, cool & dereferenceable – ID Normalization – Gene::NCBI Gene ID (entity: URI, semantics)
      – Gene: http://identifiers.org/ncbigene/4728 (ncit:C16612)
      – HGNC Gene Symbol: http://identifiers.org/hgnc.symbol/NDUFS8 (ncit:C43568)
      – Protein: http://identifiers.org/uniprot/O00217 (ncit:C17021)
      – Panther Class: http://rdf.disgenet.org/resource/panther.classification/PC00211 (rdfs:Class)
      – Pathway: http://identifiers.org/reactome/REACT_111217 (ncit:C20633)
  • 34. Data Model • URIs in DisGeNET: shared, cool & dereferenceable – ID Normalization – Disease::UMLS Concept Unique Identifier (CUI) (entity: URI, semantics)
      – Disease: http://linkedlifedata.com/resource/umls/id/C0023264 (ncit:C7057)
      – MeSH Class: http://rdf.imim.es/rh-mesh.owl#C18 (rdfs:Class)
      – UMLS Semantic Type: http://biotop.googlecode.com/svn/trunk/umlssn.owl#T047 (rdfs:Class)
      – Phenotype: http://purl.obolibrary.org/obo/HP_0004633 (sio:SIO_010056)
      – Cross References: http://identifiers.org/vocab-namespace/ID (Human Disease Ontology, MeSH, OMIM, Orphanet, Decipher, NCIt, ICD9, Human Phenotype Ontology)
  • 37. Metadata Dataset Description DisGeNET-RDF VoID file (Vocabulary of Interlinked Datasets) DisGeNET-RDF Gene Disease Association Disease Class Pathway Dataset subsets Primary sources DisGeNET Database DisGeNET CTD UniProt ClinVar MGD BeFree Pathway SNP STY PubMed Panther Class Protein HGNC Symbol
  • 38. Interlinking DisGeNET -- RDF link -> LOD cloud DisGeNET UniProt skos:exactMatch Dataset1 Dataset2 Important in Federated Queries! DisGeNET- Tutorial 38SWAT4LS 2015
  • 39. Interlinking ?s skos:exactMatch ?o DisGeNET PubMed UniProt OMIM NCBI Gene Orphanet UMLS DBpedia MeSH dbSNP Biomedical Databases and Disease Terminologies DisGeNET- Tutorial 39SWAT4LS 2015
  • 40. DisGeNET as Linked Open Data • Interlinking: 4,962,315 RDF links to RDF datasets in the LOD cloud • https://datahub.io/dataset/disgenet (more statistics)
  • 41. Federated Query Support • SPARQL 1.1: SERVICE <sparql endpoint> {} • (Diagram: Disease ID, Gene ID, and GDA ID linked to external datasets via skos:exactMatch.)
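A federated query wraps the part of the pattern that lives in another dataset in a `SERVICE <endpoint> { ... }` block. The sketch below only assembles such a query as text; `federated_query` is a hypothetical helper, `http://example.org/sparql` is a placeholder rather than a real endpoint, and the triple patterns are illustrative.

```python
def federated_query(local_pattern: str, service_endpoint: str,
                    remote_pattern: str, limit: int = 10) -> str:
    """Assemble a SPARQL 1.1 query with one SERVICE block (hypothetical helper)."""
    return (
        "PREFIX skos: <http://www.w3.org/2004/02/skos/core#>\n"
        "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
        "SELECT * WHERE {\n"
        f"  {local_pattern}\n"                      # evaluated on the local store
        f"  SERVICE <{service_endpoint}> {{\n"      # evaluated on the remote endpoint
        f"    {remote_pattern}\n"
        "  }\n"
        f"}} LIMIT {limit}"
    )


# Placeholder endpoint and illustrative patterns: the skos:exactMatch link
# found locally is the join key for the remote lookup.
query = federated_query(
    local_pattern="?disease skos:exactMatch ?xref .",
    service_endpoint="http://example.org/sparql",
    remote_pattern="?xref rdfs:label ?label .",
)
print(query)
```

The shared variable (`?xref` here) is what joins the two datasets, which is why the skos:exactMatch links matter for federation.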
  • 42. Implementation • DisGeNET RDF data, VoID dataset description, and six OWL ontologies loaded into the RDF store • Total number of triples: 24,882,432 (8.5G) • SPARQL Endpoint • Faceted Browser • LODEStar: SPARQL + LD Browser • Hardware: 7.1.0 • Usage restrictions • SPARQL: only SELECT, DESCRIBE, ASK, CONSTRUCT • performance opt: max # of rows per result, max query cost estimation time, max query execution time • Security: basic setup
  • 43. Accessibility • Download: RDF dump + linksets – http://rdf.disgenet.org/download/ • Faceted Browser – http://rdf.disgenet.org/fct/ • SPARQL endpoint – http://rdf.disgenet.org/sparql/ • EBI::LODEStar SPARQL + Linked Data Browser – http://rdf.disgenet.org/lodestar/sparql • Open PHACTS APIs – https://dev.openphacts.org/docs/1.5
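The SPARQL endpoint listed above can also be queried from a script. The following is a minimal sketch, assuming the endpoint follows the standard SPARQL 1.1 Protocol (query passed as the `query` URL parameter, JSON results requested through the `Accept` header). `build_request` is a hypothetical helper, and nothing is sent over the network until the request is opened.

```python
from urllib.parse import parse_qs, urlencode, urlsplit
from urllib.request import Request

# Endpoint URL as given on the Accessibility slide; it may no longer be live.
ENDPOINT = "http://rdf.disgenet.org/sparql/"


def build_request(query: str, endpoint: str = ENDPOINT) -> Request:
    """Prepare (but do not send) a SPARQL Protocol GET request."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


req = build_request("SELECT * WHERE { ?s ?p ?o } LIMIT 1")

# Round-trip check: the query survives URL encoding unchanged.
decoded = parse_qs(urlsplit(req.full_url).query)["query"][0]

# To execute (requires network access and a running endpoint):
#   from urllib.request import urlopen
#   with urlopen(req) as response:
#       body = response.read()
```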
  • 44. Documentation • Descriptions • RDF Schema • Points of access • SPARQL query examples @: http://rdf.disgenet.org/ • Support @: support@disgenet.org
  • 45. Querying the DisGeNET-RDF
  • 46. SPARQL QUERIES • Not easy • RDF Schema-aware • Performance issues • Optimal queries: there is a trade-off between the time you spend analyzing and transforming the query and the performance gains of those transformations • Technology-dependent • Joining a lot of information slows queries down (and can make the system fail): better to run them locally • Other approaches under development • Q/A based on natural language • Linked Data Fragments • ElasticSearch
  • 47. Querying DisGeNET • SPARQL Queries over DisGeNET data • http://rdf.disgenet.org/sparql/ • http://rdf.disgenet.org/lodestar/sparql • Contains all DisGeNET data • Free access • SPARQL 1.1 Standard
  • 49. Querying DisGeNET • SPARQL Queries over DisGeNET data • Minimal Resource Description Graph • rdfs:label: name + identifier • rdfs:comment: human-readable description • dcterms:title: resource name • dcterms:identifier: namespace:identifier • void:inDataset: RDF subset provenance
  • 50. Querying DisGeNET • SPARQL Queries over DisGeNET data • Minimal Resource Description Graph
      SELECT DISTINCT *
      FROM <http://rdf.disgenet.org>
      WHERE {
        ?subject rdf:type ?type ;
                 rdfs:label ?label ;
                 rdfs:comment ?comment ;
                 dcterms:identifier ?id ;
                 dcterms:title ?title ;
                 void:inDataset ?rdfSource .
      }
      LIMIT 100
      (Diagram: ?subject linked by rdf:type to ?type, and to ?label, ?comment, ?id, ?title, and ?rdfSource.)
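When a query like this one is sent with `Accept: application/sparql-results+json`, the answer comes back in the W3C SPARQL JSON results layout (`head.vars` plus `results.bindings`). The sketch below flattens that layout into plain rows. `bindings_to_rows` is a hypothetical helper and `SAMPLE` is a made-up response written for illustration, not real DisGeNET output.

```python
import json

# Made-up response in the standard SPARQL 1.1 JSON results layout.
SAMPLE = json.loads("""
{
  "head": {"vars": ["subject", "label", "comment"]},
  "results": {"bindings": [
    {"subject": {"type": "uri", "value": "http://identifiers.org/ncbigene/4728"},
     "label": {"type": "literal", "value": "example label"}}
  ]}
}
""")


def bindings_to_rows(result: dict) -> list:
    """Turn SPARQL JSON bindings into a list of {variable: value} dicts."""
    variables = result["head"]["vars"]
    rows = []
    for binding in result["results"]["bindings"]:
        # A variable left unbound by the query is simply absent from the binding.
        rows.append({var: binding[var]["value"] if var in binding else None
                     for var in variables})
    return rows


rows = bindings_to_rows(SAMPLE)
```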
  • 51. Querying DisGeNET • SPARQL Queries over DisGeNET data • Minimal Resource Description Graph
  • 52. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene-Disease Association Graph
  • 53. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene-Disease Association Graph
      SELECT DISTINCT ?gda
      FROM <http://rdf.disgenet.org>
      WHERE {
        ?gda rdf:type sio:SIO_001122 .
      }
      LIMIT 100
      (Diagram: ?gda linked by rdf:type to sio:SIO_001122.)
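The queries in these slides use prefixes such as `sio:` and `ncit:` without declaring them, which works only where the endpoint predefines those namespaces. A rough sketch of how the declarations could be added before sending a query elsewhere follows. `with_prefixes` is a hypothetical helper, and the namespace IRIs are the commonly published ones, listed here as an assumption to be checked against the DisGeNET-RDF documentation.

```python
import re

# Commonly published namespace IRIs (assumption: verify against the
# DisGeNET-RDF schema documentation before relying on them).
PREFIXES = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "skos": "http://www.w3.org/2004/02/skos/core#",
    "dcterms": "http://purl.org/dc/terms/",
    "void": "http://rdfs.org/ns/void#",
    "sio": "http://semanticscience.org/resource/",
    "ncit": "http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#",
}


def with_prefixes(query: str) -> str:
    """Prepend PREFIX declarations for every known prefix used in the query."""
    # The lookbehind avoids matching inside longer names or IRIs.
    used = [p for p in PREFIXES if re.search(rf"(?<![\w:]){p}:", query)]
    header = "".join(f"PREFIX {p}: <{PREFIXES[p]}>\n" for p in used)
    return header + query


full_query = with_prefixes(
    "SELECT DISTINCT ?gda FROM <http://rdf.disgenet.org> "
    "WHERE { ?gda rdf:type sio:SIO_001122 . } LIMIT 100"
)
```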
  • 55. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene-Disease Association Graph • What is the sio:SIO_001122 class?
  • 56. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene-Disease Association Graph • What is the sio:SIO_001122 class?
      SELECT DISTINCT ?gda ?type ?label
      FROM <http://rdf.disgenet.org>
      WHERE {
        ?gda rdf:type ?type .
        FILTER(?type = sio:SIO_001122)
        ?type rdfs:label ?label
      }
      LIMIT 100
  • 57. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene-Disease Association Graph • For each ?gda, show me the associated ?gene and ?disease, and the ?typeOfAssociation
  • 58. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene-Disease Association Graph • For each ?gda, show me the associated ?gene and ?disease, and the ?typeOfAssociation
      SELECT DISTINCT ?gda ?gene ?disease ?type ?label
      FROM <http://rdf.disgenet.org>
      WHERE {
        ?gda rdf:type ?type ;
             sio:SIO_000628 ?gene, ?disease .
        ?type rdfs:label ?label .
        ?gene a ncit:C16612 .
        ?disease a ncit:C7057
      }
      LIMIT 50
  • 59. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene-Disease Association Graph • For each ?gda, show me the associated ?gene and ?disease, the ?paper, and the ?sentence describing the relationship in the paper
  • 60. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene-Disease Association Graph • For each ?gda, show me the associated ?gene and ?disease, the ?paper, and the ?sentence describing the relationship in the paper
      SELECT DISTINCT ?gda ?gene ?disease ?paper ?sentence
      FROM <http://rdf.disgenet.org>
      WHERE {
        ?gda sio:SIO_000628 ?gene, ?disease ;
             sio:SIO_000772 ?paper ;
             dcterms:description ?sentence .
        ?gene a ncit:C16612 .
        ?disease a ncit:C7057
      }
      LIMIT 50
  • 61. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene-Disease Association Graph • For each ?gda, show me the associated ?gene and ?disease, the ?paper, and the ?sentence describing the relationship in the paper
      SELECT DISTINCT ?gda ?gene ?disease ?paper ?sentence
      FROM <http://rdf.disgenet.org>
      WHERE {
        ?gda sio:SIO_000628 ?gene, ?disease ;
             sio:SIO_000772 ?paper ;
             dcterms:description ?sentence .
        FILTER(regex(str(?sentence), "syndrome", "i"))
        ?gene a ncit:C16612 .
        ?disease a ncit:C7057
      }
      LIMIT 50
  • 62. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene-Disease Association Graph • For each ?gda, show me the ?gene, ?disease, ?source, and the level of ?evidence of the association
  • 63. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene-Disease Association Graph • For each ?gda, show me the ?gene, ?disease, ?source, and the level of ?evidence of the association
      PREFIX wi: <http://purl.org/ontology/wi/core#>
      SELECT DISTINCT ?gda ?gene ?disease ?source ?evidence
      FROM <http://rdf.disgenet.org>
      WHERE {
        ?gda sio:SIO_000628 ?gene, ?disease ;
             sio:SIO_000253 ?source .
        ?gene a ncit:C16612 .
        ?disease a ncit:C7057 .
        ?source wi:evidence ?evidence
      }
      LIMIT 50
  • 64. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene-Disease Association Graph • For each gene-disease pair, show me the ?number of evidences and the score ?value
  • 65. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene-Disease Association Graph • For each gene-disease pair, show me the ?number of evidences and the score ?value
      SELECT DISTINCT ?gene ?disease (COUNT(DISTINCT ?gda) AS ?numberOfEvidences) ?scoreValue
      FROM <http://rdf.disgenet.org>
      WHERE {
        ?gda sio:SIO_000628 ?gene, ?disease ;
             sio:SIO_000216 ?score .
        ?gene a ncit:C16612 .
        ?disease a ncit:C7057 .
        ?score sio:SIO_000300 ?scoreValue
      }
      GROUP BY ?gene ?disease ?scoreValue
      ORDER BY DESC(?numberOfEvidences) DESC(?scoreValue)
      LIMIT 50
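What the count(DISTINCT ?gda) aggregate and the two DESC sort keys compute can be reproduced on a handful of rows. This is only an illustration of the aggregation logic: the rows are made-up stand-ins for query bindings, and `count_evidences` is a hypothetical helper.

```python
from collections import defaultdict

# Made-up (gda, gene, disease, score) rows standing in for query bindings.
ROWS = [
    ("gda1", "geneA", "diseaseX", 0.4),
    ("gda2", "geneA", "diseaseX", 0.4),
    ("gda2", "geneA", "diseaseX", 0.4),  # repeated binding, counted once
    ("gda3", "geneB", "diseaseY", 0.9),
]


def count_evidences(rows):
    """Count distinct GDAs per (gene, disease, score), sorted like the query."""
    groups = defaultdict(set)
    for gda, gene, disease, score in rows:
        groups[(gene, disease, score)].add(gda)  # a set gives DISTINCT semantics
    summary = [(gene, disease, len(gdas), score)
               for (gene, disease, score), gdas in groups.items()]
    # Highest evidence count first, then highest score.
    summary.sort(key=lambda row: (-row[2], -row[3]))
    return summary


summary = count_evidences(ROWS)
```

Because the score is part of the grouping key, a pair with several distinct score values would appear once per value.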
  • 66. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene-Disease Association Graph • For each ?gda, show me the ?snp
  • 67. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene-Disease Association Graph • For each ?gda, show me the ?snp • Go to the Web, then understand and execute Q1.1-Q1.4
      SELECT DISTINCT ?gda ?gene ?disease ?snp
      FROM <http://rdf.disgenet.org>
      WHERE {
        ?gda sio:SIO_000628 ?gene, ?disease ;
             sio:SIO_000001 ?snp .
        ?gene a ncit:C16612 .
        ?disease a ncit:C7057 .
      }
      LIMIT 50
  • 68. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene Graph
  • 69. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene Graph
      SELECT DISTINCT ?gene
      FROM <http://rdf.disgenet.org>
      WHERE {
        ?gene rdf:type ncit:C16612 .
      }
      LIMIT 100
      (Diagram: ?gene linked by rdf:type to Gene.)
  • 71. Querying DisGeNET • SPARQL Queries over DisGeNET data • Gene Graph • For each ?gene show me: • ?identifier, ?name, ?geneSymbol • ?protein(s) • ?panther class(es) and ?panther class name • ?pathway(s) and ?pathway name • Go to the Web and understand/execute Q1.5
  • 72. Querying DisGeNET • SPARQL Queries over DisGeNET data • Disease Graph
  • 73. Querying DisGeNET • SPARQL Queries over DisGeNET data • Disease Graph
      SELECT DISTINCT ?disease
      FROM <http://rdf.disgenet.org>
      WHERE {
        ?disease a ncit:C7057 .
      }
      LIMIT 100
      (Diagram: ?disease linked by rdf:type to Disease.)
  • 75. Querying DisGeNET • SPARQL Queries over DisGeNET data • Disease Graph • For the disease <http://linkedlifedata.com/resource/umls/id/C0596263> show me: • the disease ?name, MeSH disease class ?label, and the UMLS STY ?title • all cross-references to other disease terminologies • Go to the Web and understand/execute Q1.6
  • 76. Querying DisGeNET • SPARQL Queries over DisGeNET data • Disease mapping to other ontologies
      SELECT DISTINCT ?disease
      FROM <http://rdf.disgenet.org>
      WHERE {
        ?disease skos:exactMatch ?ontology .
      }
      (Diagram: Disease linked by ?link to ?ontology; coverage.)
  • 77. Querying DisGeNET • SPARQL Queries over DisGeNET data • Ontology Walking queries • Grouping of similar instances • Filtering data • Query data by classes • Ontologies loaded in our RDF triple store: SIO, DO, ORDO, NCIT, HPO, and ECO (OWL) • Go to the Web and understand/execute Q1.7 and Q1.11 • ?child rdfs:subClassOf+ ?parent
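The property path `rdfs:subClassOf+` follows one or more subclass links, so `?parent` ranges over every ancestor of `?child`, not just the direct superclass. The sketch below mimics that traversal on a toy hierarchy. The class names are invented for illustration and `ancestors` is a hypothetical helper.

```python
# Invented toy hierarchy: each class maps to its direct superclasses.
SUBCLASS_OF = {
    "D": ["C"],
    "C": ["B"],
    "B": ["A"],
    "E": ["A"],
}


def ancestors(cls: str, edges: dict) -> set:
    """Return every class reachable through one or more subClassOf links."""
    seen = set()
    stack = list(edges.get(cls, []))
    while stack:
        parent = stack.pop()
        if parent not in seen:  # guards against cycles and repeated work
            seen.add(parent)
            stack.extend(edges.get(parent, []))
    return seen


d_ancestors = ancestors("D", SUBCLASS_OF)  # direct parent C plus B and A
```

As with `+` in SPARQL, the starting class itself is not included unless the hierarchy contains a cycle back to it.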
  • 78. Querying DisGeNET • SPARQL Queries over DisGeNET data • Disease-Phenotype Association Graph (curated from HPO)
  • 79. Querying DisGeNET • SPARQL Queries over DisGeNET data • Disease-Phenotype Association Graph (curated from HPO) • Why this model?
      SELECT DISTINCT ?disease (COUNT(DISTINCT ?hpdisease) AS ?hpdiseases) (COUNT(DISTINCT ?phenotype) AS ?phenotypes)
      WHERE {
        ?disease rdf:type ncit:C7057 .
        ?disease skos:exactMatch ?hpdisease .
        ?hpdisease sio:SIO_000341 ?phenotype .
      }
      GROUP BY ?disease
      ORDER BY DESC(?hpdiseases)
      LIMIT 100

      SELECT DISTINCT ?disease ?hpdisease (COUNT(DISTINCT ?phenotype) AS ?phenotypes)
      WHERE {
        ?disease rdf:type ncit:C7057 .
        ?disease skos:exactMatch ?hpdisease .
        ?hpdisease sio:SIO_000341 ?phenotype .
        FILTER (?disease = <http://linkedlifedata.com/resource/umls/id/C3280766>)
      }
      GROUP BY ?disease ?hpdisease
  • 80. Querying DisGeNET • SPARQL Queries over DisGeNET data • Disease-Phenotype Association Graph (curated from HPO) • How many phenotypes are associated with Orphanet:209?
  • 81. Querying DisGeNET • SPARQL Queries over DisGeNET data • Disease-Phenotype Association Graph (curated from HPO) • How many phenotypes are associated with Orphanet:209?
      SELECT DISTINCT ?disease ?hpdisease (COUNT(DISTINCT ?phenotype) AS ?phenotypes)
      WHERE {
        ?disease rdf:type ncit:C7057 .
        ?disease skos:exactMatch ?hpdisease .
        ?hpdisease sio:SIO_000341 ?phenotype .
        FILTER (?hpdisease = <http://identifiers.org/orphanet/209>)
      }
      GROUP BY ?disease ?hpdisease
  • 82. Querying DisGeNET • SPARQL Queries over DisGeNET data • Disease-Phenotype Association Graph (curated from HPO) • How many diseases are associated with a phenotype?
  • 83. Querying DisGeNET • SPARQL Queries over DisGeNET data • Disease-Phenotype Association Graph (curated from HPO) • How many diseases are associated with a phenotype? • Go to the Web and understand/execute Q1.10 and Q1.12
      SELECT DISTINCT ?phenotype ?phenotypeName (COUNT(DISTINCT ?disease) AS ?diseases)
      WHERE {
        ?hpdisease sio:SIO_000341 ?phenotype .
        ?phenotype dcterms:title ?phenotypeName .
        ?disease skos:exactMatch ?hpdisease .
        ?disease rdf:type ncit:C7057 ;
                 dcterms:title ?diseaseName .
      }
      GROUP BY ?phenotype ?phenotypeName
      ORDER BY DESC(?diseases)
      LIMIT 100
  • 84. Querying DisGeNET + LOD cloud • Federated Queries: DisGeNET + external datasets • Go to the Web and understand/execute the Federated Queries
  • 85. Use Cases • What genes are associated with Marfan syndrome? • What evidence supports the association between the APP gene and Alzheimer Disease? • What disease classes are associated with the APP gene? • Which genes and evidence support the comorbidity between Chronic Kidney Disease and Diabetes Mellitus, Type 2? • What SNPs are related to the association between MECP2 and Rett Syndrome? • Which diseases have associations of the post-translational modification type? • What disease genes are hit by compounds in ChEMBL? • What disease genes have differential expression in Gene Expression Atlas? • What disease genes are in WikiPathways? • Find compounds (from ChEMBL) that target genes (from DisGeNET) that participate in the same pathway (from WikiPathways)
  • 86. Acknowledgments • IBI Group: Alba Gutiérrez-Sacristán, Àlex Bravo, Janet Piñero, Núria Queralt Rosinach, Alexia Giannoula, Miguel A. Mayer, Laura I. Furlong, Ferran Sanz
  • 87. Thanks for your attention! Questions are welcome