SlideShare uma empresa Scribd logo
1 de 45
Introducing the KnetMiner Knowledge Graph:
things, not strings.
Keywan Hassani-Pak
Head of Bioinformatics
1 April 2019 – Rothamsted Seminar
Outline
Biological Knowledge Discovery
Unravelling the biological story behind complex traits
Genes are rarely single actors
To explain the metaphor
• Unravelling the roles of genes in complex trait genomics is often similar
to unmasking the heroes and villains in a plant biology who-dunnit
• Multiple genes are generally involved and play different roles. How they
interact reveals the plotline(s) in the trait story
• We build software and data resources to help automate the process of
explaining the plotlines and unravelling the story behind complex trait
biology
• We are creating computational tools that bring different types of
evidence together and then to weigh-up the competing story lines to
present the user with the most compelling ones.
Assembling the evidence – data is just the beginning!
Genetic diversity
Sequencing data
Phenotype data
Literature
Pathways
Ontologies
Better yields
NUE
Disease Resistance
Reference genomes
Gene expression
QTL
Data Information Knowledge Understanding
We need all of these components to come together to understand our traits
KnetMiner – Accelerating biological discovery
http://knetminer.rothamsted.ac.uk/
@KnetMiner
• Gene Networks
• Bio Databases
• Data Integration
• Text Mining
• Visualisation
• AI & Graphs
• RDF & Neo4j
• Java & JavaScript
How does KnetMiner work?
Computer says: “Try this”
The Rise of Graph Analytics
From Leonhard Euler to Google’s Knowledge Graphs
Seven Bridges of Königsberg - a historically notable problem in mathematics
Its resolution by Leonhard Euler in 1736 laid the foundations of graph theory
Leonhard Euler - 1736
I N T E R A C TO M E N E T WO R K G R A P H
Pathfinding
Finds the shortest path or
evaluates route availability
and quality
Centrality
Determines the
importance of distinct
nodes in the network
Community Detection
Evaluates how a group is
clustered or partitioned
Graph Algorithms
North WykeHarpenden
distance to
Rothamsted
Research
Research Institute
Site Site
miles: 230
What is a Knowledge Graph?
year: 1843
1843
Year
founded in
Knowledge Graph: things, not strings
KnetMiner Knowledge Graph
A Knowledge Graph for gene mining and biological discovery
RNA
SNP Phenotype
Variety
Trial
Soil
Protein
Crop
Insect
Disease
Microbe
Species
Gene
Pesticide
Publication
Function
associated
interacts
Weather
Fertiliser
Field
Manag
ement
Time
Germ
plasm
Meta
bolite
Weeds
Where do KnetMiner relations come from?
Some examples:
Example - GWAS data
http://plants.ensembl.org/biomart/martview
Example Arabidopsis
#SNP=66,816 | #Gene=27,502 | #Phenotype=107
… transform into a graph
(SNP)
(Trait)
associated
Using Ondex-Knet-Builder
https://github.com/Rothamsted/ondex-knet-builder
Add Biological interaction datasets
http://thebiogrid.org
(SNP)
(Tait)
associated
… add biological interactions
Towards FAIR KnetMiner Knowledge Graphs
• Green: Ondex plug-ins
• rdf2neo is a generic, non Ondex-specific rdf->Neo4j conversion tool
• Brandizi et al., IB-2018
(https://dx.doi.org/10.1515%2Fjib-2018-0023)
• Brandizi et al., SWAT4LS-2018
(https://doi.org/10.6084/m9.figshare.7314323.v1)
Programmatic Access via Graph Query Languages
MATCH
// branching via ‘|’
(prot:Protein) - [:produced_by|consumed_by] -> (:Reaction)
// variable-length chains
- [:part_of*1..3] -> (pway:Path)
RETURN
prot.name, pwy LIMIT 1000
// Very compact forms available:
MATCH (prot:Protein) - (pway:Path) RETURN pway
• RDF + OWL used as a standardised modelling/representation language (see BioKNO
ontology: github.com/Rothamsted/bioknet-onto)
• SPARQL available too, both having pros/cons (see our benchmark:
github.com/Rothamsted/graphdb-benchmarks)
• Cypher being used for KnetMiner queries (work in progress)
Open Source Code
client web-server
knet-builder
Deployment Model
Data Integration
Workflows
Database Service Model
Private data
Public data
Databases and graph
queries required by
KnetMiner
For a species or
domain of interest
Graph Queries
Using Knet-Builder tools
How to search and interpret so much information?
• Methods needed to evaluate millions of
relationships in knowledge graph, prioritize
genes and extract relevant subgraphs
• Interactive and exploratory tools needed to
enable knowledge discovery
• Interpretation should be the task of domain
experts i.e. biologists!
KnetMiner Use Cases
Visualising data connections promises faster discoveries
What does KnetMiner know
about your gene?
Reverse genetics applications: gene to phenotype
Wheat
Arabidopsis
What happens if we
knockout TT2 (R Myb) in
wheat?
With Andy Phillips
What KnetMiner knows about TT2 (R Myb)
TT2 (R Myb) on chromosome 3D in wheat is predicted (p-value=0.01) to regulate the
transcriptional activation of MFT according to data from the analysis of 850 RNA-seq
samples in wheat (Ramírez-González et al. 2018) using GENIE3 (Huynh-Thu et al. 2010).
The TT2 3B homoeolog is not predicted to regulate MFT, and the TT2 3A homoeolog is not
annotated in the latest version of the wheat genome.
MFT has been recently linked to grain germination [(Zong Y ; Li Q ): “Recent studies in
both Arabidopsis and wheat have uncovered a new role of MOTHER OF FT AND TFL1
(MFT) in seed germination”] and seed dormancy [(Nakamura S ): “Mapping analysis
showed that MFT on chromosome 3A (MFT-3A) colocalized with the seed dormancy
quantitative trait locus (QTL) QPhs.ocs-3A.”].
The MFT ortholog in Arabidopsis has a 3’ UTR variant that has been associated with (p-
value = 5.5x10-5) increased germination rate after 56 days of dry storage (Atwell et al.
2010).
DFW Nov 2018
Visualising connections can lead to new lines of inquiries
• Can grain colour and PHS be linked because R Myb targets the grain germination gene MFT?
• Do white grain varieties (R Myb mutants) have increased root hair density?
• Is there a link between root hair density and PHS?
What does KnetMiner know
about your trait?
Forward genetics applications: phenotype to gene
https://arapheno.1001genomes.org
https://aragwas.1001genomes.org
P-value: 7.7E-7
Distance: 285bp
Identity: 81%
PPI: Two-Hybrid
ortholog
Arabidopsis
Wheat
http://knetminer.rothamsted.ac.uk/Triticum_aestivum/
IRRI Germplasm Acquisition of early seedling
traits and image processing
GWAS
Gene discovery
related to seed
vigour
Guillaume Menard
Smita Kurup
Peter Eastmond
Kirsty Hassall
David Hughes
Colin Li
Direct Seeded Rice
Improve seedling
establishment,
emergence and
vigour
http://knetminer.rothamsted.ac.uk/Oryza_sativa/
PMID: 16377752
Future Work
Growing and Learning the Knowledge Graph
Growing the KnetMiner
Knowledge Graph
Work in progress
Future development
• Personalised search experience
• Save and share your networks
• Like and Dislike buttons on relations
• Personalised networks based on usage data
• Better knowledge visualisation
• Reduce information overload using network clustering
• Annotate nodes with quantitative user data
• Predictive network analysis
• Find research trends using publication and graph data
• Automatic story and hypotheses generation
Future vision – Automatic story generation
TT2 (R Myb) on chromosome 3D in wheat is predicted (p-value=0.01) to regulate the transcriptional activation of MFT
according to data from the analysis of 850 RNA-seq samples in wheat (Ramírez-González et al. 2018) using GENIE3
(Huynh-Thu et al. 2010). The TT2 3B homeologue is not predicted to regulate MFT, and the TT2 3A homeologue is not
annotated in the latest version of the wheat genome. MFT has been recently linked to grain germination [(Zong Y ; Li Q ):
“Recent studies in both Arabidopsis and wheat have uncovered a new role of MOTHER OF FT AND TFL1 (MFT) in seed
germination”] and seed dormancy [(Nakamura S ): “Mapping analysis showed that MFT on chromosome 3A (MFT-3A)
colocalized with the seed dormancy quantitative trait locus (QTL) QPhs.ocs-3A.”]. The MFT ortholog in Arabidopsis has a
3’ UTR variant that has been associated with (p-value = 5.5x10-5) increased germination rate after 56 days of dry
storage (Atwell et al. 2010).
• Well structured semantics of entities and
relationships in the network
• Maintaining provenance of derived
relations
• Confidence values on relations
• Supporting literature
KnetMiner Impact
• Over 1800 unique users last year
• 68% of users from non-UK countries
• KnetMiner part of 3 grants recently submitted to BBSRC (BBR, sLoLa)
• KnetMiner code is open-source; v3.0 released in Feb 2019
• Developers are starting to contribute to our open-source tools
• Resources are starting to link into KnetMiner (eg. WheatIS, T3, Ensembl)
• Requests to build KnetMiner for new species (eg. Sugarcane, honey bee)
• Invited to run training courses and workshops (eg. EBI, PAG, IB2018)
• KnetMiner used in two agrifood companies, several companies have
expressed interest
The right data in the right
format and in the right hands at
the right time, saves lives.
#us2ts2019 @hdeus
Acknowledgements (strings - for humans only)
Bioinformatics Lab
Ajit Singh
Marco Brandizi
Sandeep Amberkar
Emma Bailey
Dan Smith
David Hughes
Rob King
Colin Li
Chris Rawlings
William Brown (ITS)
Follow us on Twitter: @KnetMiner
Collaborators & Contributors
Richard Holland (NFVL)
Misha Kapushesky (Genestack)
Kevin Dialdestoro (Genestack)
Vasiliki Koutra (KCL)
Martin Castellote (INTA)
Philipp Bayer (UWA)
Jean-Luc Jannink (Cornell)
Clay Birkett (Cornell)
Cyril Pommier (INRA)
Ramil Mauleon (IRRI)
Jan Taubert (KWS)
Uwe Scholz (IPK)
Matthias Lange (IPK)
Kumar Saurabh Singh (Exeter)
Monika Mistry
DFW WP4 members
Data Contributors
Clay Birkett (Cornell)
Cristobal Uauy (JIC)
Philippa Borrill (JIC)
Andrea Bräutigam (Uni Bielefeld)
Philipp Bayer (UWA)
Ramil Mauleon (IRRI)
Funding
Designing Future Wheat (BBSRC)
Pest Genomics Initiative
Users
Andy Phillips (RRes)
Rowan Mitchell (RRes)
Steve Hanley (RRes)
Richard Barker (NASA)
Acknowledgements (things - for machines &humans)
www.knetminer.org
Twitter: @KnetMiner

Mais conteúdo relacionado

Mais procurados

Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!adcobb
 
Bioinformatics Final Presentation
Bioinformatics Final PresentationBioinformatics Final Presentation
Bioinformatics Final PresentationShruthi Choudary
 
Bioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of NatureBioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of NatureRobert Cormia
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to BioinformaticsLeighton Pritchard
 
B.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformaticsB.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformaticsRai University
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchAnshika Bansal
 
Career oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsCareer oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsShikha Thakur
 
Careers in bioinformatics
Careers in bioinformaticsCareers in bioinformatics
Careers in bioinformaticsentranzz123
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to BioinformaticsDenis C. Bauer
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformaticsbiinoida
 
Careers in bioinformatics, Scope, Skills and Jobs
Careers in bioinformatics, Scope, Skills and JobsCareers in bioinformatics, Scope, Skills and Jobs
Careers in bioinformatics, Scope, Skills and JobsM Abdullah Chaudhry
 
Bioinformatrics and it's application
Bioinformatrics and it's applicationBioinformatrics and it's application
Bioinformatrics and it's applicationApurbo Datta
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformaticsAtai Rabby
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformaticsphilmaweb
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSMSCW Mysore
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsElena Sügis
 

Mais procurados (20)

Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!
 
Bioinformatics Final Presentation
Bioinformatics Final PresentationBioinformatics Final Presentation
Bioinformatics Final Presentation
 
Bioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of NatureBioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of Nature
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
B.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformaticsB.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformatics
 
Bioinformatics on internet
Bioinformatics on internetBioinformatics on internet
Bioinformatics on internet
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences research
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Career oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsCareer oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of Bioinformatics
 
Careers in bioinformatics
Careers in bioinformaticsCareers in bioinformatics
Careers in bioinformatics
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Careers in bioinformatics, Scope, Skills and Jobs
Careers in bioinformatics, Scope, Skills and JobsCareers in bioinformatics, Scope, Skills and Jobs
Careers in bioinformatics, Scope, Skills and Jobs
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics Software
Bioinformatics SoftwareBioinformatics Software
Bioinformatics Software
 
Bioinformatrics and it's application
Bioinformatrics and it's applicationBioinformatrics and it's application
Bioinformatrics and it's application
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformatics
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICS
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in Bioinformatics
 

Semelhante a Introducing the KnetMiner Knowledge Graph: things, not strings

Bioinformatics group presentation
Bioinformatics group presentationBioinformatics group presentation
Bioinformatics group presentationNaeem Ahmed
 
Bioinformatics group presentation
Bioinformatics group presentationBioinformatics group presentation
Bioinformatics group presentationNaeem Ahmed
 
GRM 2011: Approaches, resources and tools for rice gene discovery and breeding
GRM 2011: Approaches, resources and tools for rice gene discovery and breedingGRM 2011: Approaches, resources and tools for rice gene discovery and breeding
GRM 2011: Approaches, resources and tools for rice gene discovery and breedingCGIAR Generation Challenge Programme
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsAmna Jalil
 
Thesis def
Thesis defThesis def
Thesis defJay Vyas
 
Bioinformatics issues and challanges presentation at s p college
Bioinformatics  issues and challanges  presentation at s p collegeBioinformatics  issues and challanges  presentation at s p college
Bioinformatics issues and challanges presentation at s p collegeSKUASTKashmir
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSsandeshGM
 
Key Note Speaker on “Knowledge Management in Plant Genetic Resources: Impact ...
Key Note Speaker on “Knowledge Management in Plant Genetic Resources: Impact ...Key Note Speaker on “Knowledge Management in Plant Genetic Resources: Impact ...
Key Note Speaker on “Knowledge Management in Plant Genetic Resources: Impact ...FAO
 
A systematic approach to Genotype-Phenotype correlations
A systematic approach to Genotype-Phenotype correlationsA systematic approach to Genotype-Phenotype correlations
A systematic approach to Genotype-Phenotype correlationsfisherp
 
Bioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptxBioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptxxRowlet
 
Genomics platform for agriculture-CAT lecture
Genomics platform for agriculture-CAT lectureGenomics platform for agriculture-CAT lecture
Genomics platform for agriculture-CAT lectureSenthil Natesan
 
Genetic mapping of behaviour and gene expression in the chicken
Genetic mapping of behaviour and gene expression in the chickenGenetic mapping of behaviour and gene expression in the chicken
Genetic mapping of behaviour and gene expression in the chickenMartin Johnsson
 
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...rahulmonikasharma
 

Semelhante a Introducing the KnetMiner Knowledge Graph: things, not strings (20)

Bioinformatics group presentation
Bioinformatics group presentationBioinformatics group presentation
Bioinformatics group presentation
 
Bioinformatics group presentation
Bioinformatics group presentationBioinformatics group presentation
Bioinformatics group presentation
 
GRM 2011: Approaches, resources and tools for rice gene discovery and breeding
GRM 2011: Approaches, resources and tools for rice gene discovery and breedingGRM 2011: Approaches, resources and tools for rice gene discovery and breeding
GRM 2011: Approaches, resources and tools for rice gene discovery and breeding
 
Bioinformatics seminar
Bioinformatics seminarBioinformatics seminar
Bioinformatics seminar
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Thesis def
Thesis defThesis def
Thesis def
 
Bioinformatics issues and challanges presentation at s p college
Bioinformatics  issues and challanges  presentation at s p collegeBioinformatics  issues and challanges  presentation at s p college
Bioinformatics issues and challanges presentation at s p college
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICS
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Key Note Speaker on “Knowledge Management in Plant Genetic Resources: Impact ...
Key Note Speaker on “Knowledge Management in Plant Genetic Resources: Impact ...Key Note Speaker on “Knowledge Management in Plant Genetic Resources: Impact ...
Key Note Speaker on “Knowledge Management in Plant Genetic Resources: Impact ...
 
A systematic approach to Genotype-Phenotype correlations
A systematic approach to Genotype-Phenotype correlationsA systematic approach to Genotype-Phenotype correlations
A systematic approach to Genotype-Phenotype correlations
 
Bioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptxBioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptx
 
2016 bergen-sars
2016 bergen-sars2016 bergen-sars
2016 bergen-sars
 
Genomics platform for agriculture-CAT lecture
Genomics platform for agriculture-CAT lectureGenomics platform for agriculture-CAT lecture
Genomics platform for agriculture-CAT lecture
 
KnetMiner Overview Oct 2017
KnetMiner Overview Oct 2017KnetMiner Overview Oct 2017
KnetMiner Overview Oct 2017
 
Cloud bioinformatics 2
Cloud bioinformatics 2Cloud bioinformatics 2
Cloud bioinformatics 2
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Genetic mapping of behaviour and gene expression in the chicken
Genetic mapping of behaviour and gene expression in the chickenGenetic mapping of behaviour and gene expression in the chicken
Genetic mapping of behaviour and gene expression in the chicken
 
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
 
Genome project.pdf
Genome project.pdfGenome project.pdf
Genome project.pdf
 

Último

BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...amitlee9823
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...amitlee9823
 
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...amitlee9823
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...karishmasinghjnh
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...amitlee9823
 

Último (20)

Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
 
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
 

Introducing the KnetMiner Knowledge Graph: things, not strings

  • 1. Introducing the KnetMiner Knowledge Graph: things, not strings. Keywan Hassani-Pak Head of Bioinformatics 1 April 2019 – Rothamsted Seminar
  • 3. Biological Knowledge Discovery Unravelling the biological story behind complex traits
  • 4. Genes are rarely single actors To explain the metaphor • Unravelling the roles of genes in complex trait genomics is often similar to unmasking the heroes and villains in a plant biology who-dunnit • Multiple genes are generally involved and play different roles. How they interact reveals the plotline(s) in the trait story • We build software and data resources to help automate the process of explaining the plotlines and unravelling the story behind complex trait biology • We are creating computational tools that bring different types of evidence together and then to weigh-up the competing story lines to present the user with the most compelling ones.
  • 5. Assembling the evidence – data is just the beginning! Genetic diversity Sequencing data Phenotype data Literature Pathways Ontologies Better yields NUE Disease Resistance Reference genomes Gene expression QTL Data Information Knowledge Understanding We need all of these components to come together to understand our traits
  • 6. KnetMiner – Accelerating biological discovery http://knetminer.rothamsted.ac.uk/ @KnetMiner • Gene Networks • Bio Databases • Data Integration • Text Mining • Visualisation • AI & Graphs • RDF & Neo4j • Java & JavaScript
  • 7. How does KnetMiner work? Computer says: “Try this”
  • 8. The Rise of Graph Analytics From Leonhard Euler to Google’s Knowledge Graphs
  • 9. Seven Bridges of Königsberg - a historically notable problem in mathematics Its resolution by Leonhard Euler in 1736 laid the foundations of graph theory Leonhard Euler - 1736
  • 10. I N T E R A C TO M E N E T WO R K G R A P H
  • 11. Pathfinding Finds the shortest path or evaluates route availability and quality Centrality Determines the importance of distinct nodes in the network Community Detection Evaluates how a group is clustered or partitioned Graph Algorithms
  • 12. North WykeHarpenden distance to Rothamsted Research Research Institute Site Site miles: 230 What is a Knowledge Graph? year: 1843 1843 Year founded in
  • 14.
  • 15. KnetMiner Knowledge Graph A Knowledge Graph for gene mining and biological discovery
  • 17. Where do KnetMiner relations come from? Some examples:
  • 18. Example - GWAS data http://plants.ensembl.org/biomart/martview Example Arabidopsis #SNP=66,816 | #Gene=27,502 | #Phenotype=107
  • 19. … transform into a graph (SNP) (Trait) associated Using Ondex-Knet-Builder https://github.com/Rothamsted/ondex-knet-builder
  • 20. Add Biological interaction datasets http://thebiogrid.org
  • 22. Towards FAIR KnetMiner Knowledge Graphs • Green: Ondex plug-ins • rdf2neo is a generic, non Ondex-specific rdf->Neo4j conversion tool • Brandizi et al., IB-2018 (https://dx.doi.org/10.1515%2Fjib-2018-0023) • Brandizi et al., SWAT4LS-2018 (https://doi.org/10.6084/m9.figshare.7314323.v1)
  • 23. Programmatic Access via Graph Query Languages MATCH // branching via ‘|’ (prot:Protein) - [:produced_by|consumed_by] -> (:Reaction) // variable-length chains - [:part_of*1..3] -> (pway:Path) RETURN prot.name, pwy LIMIT 1000 // Very compact forms available: MATCH (prot:Protein) - (pway:Path) RETURN pway • RDF + OWL used as a standardised modelling/representation language (see BioKNO ontology: github.com/Rothamsted/bioknet-onto) • SPARQL available too, both having pros/cons (see our benchmark: github.com/Rothamsted/graphdb-benchmarks) • Cypher being used for KnetMiner queries (work in progress)
  • 24. Open Source Code client web-server knet-builder Deployment Model Data Integration Workflows Database Service Model Private data Public data Databases and graph queries required by KnetMiner For a species or domain of interest Graph Queries Using Knet-Builder tools
  • 25. How to search and interpret so much information? • Methods needed to evaluate millions of relationships in knowledge graph, prioritize genes and extract relevant subgraphs • Interactive and exploratory tools needed to enable knowledge discovery • Interpretation should be the task of domain experts i.e. biologists!
  • 26. KnetMiner Use Cases Visualising data connections promises faster discoveries
  • 27. What does KnetMiner know about your gene? Reverse genetics applications: gene to phenotype
  • 28.
  • 29. Wheat Arabidopsis What happens if we knockout TT2 (R Myb) in wheat? With Andy Phillips
  • 30. What KnetMiner knows about TT2 (R Myb) TT2 (R Myb) on chromosome 3D in wheat is predicted (p-value=0.01) to regulate the transcriptional activation of MFT according to data from the analysis of 850 RNA-seq samples in wheat (Ramírez-González et al. 2018) using GENIE3 (Huynh-Thu et al. 2010). The TT2 3B homoeolog is not predicted to regulate MFT, and the TT2 3A homoeolog is not annotated in the latest version of the wheat genome. MFT has been recently linked to grain germination [(Zong Y ; Li Q ): “Recent studies in both Arabidopsis and wheat have uncovered a new role of MOTHER OF FT AND TFL1 (MFT) in seed germination”] and seed dormancy [(Nakamura S ): “Mapping analysis showed that MFT on chromosome 3A (MFT-3A) colocalized with the seed dormancy quantitative trait locus (QTL) QPhs.ocs-3A.”]. The MFT ortholog in Arabidopsis has a 3’ UTR variant that has been associated with (p- value = 5.5x10-5) increased germination rate after 56 days of dry storage (Atwell et al. 2010).
  • 31. DFW Nov 2018 Visualising connections can lead to new lines of inquiries • Can grain colour and PHS be linked because R Myb targets the grain germination gene MFT? • Do white grain varieties (R Myb mutants) have increased root hair density? • Is there a link between root hair density and PHS?
  • 32. What does KnetMiner know about your trait? Forward genetics applications: phenotype to gene
  • 34. https://aragwas.1001genomes.org P-value: 7.7E-7 Distance: 285bp Identity: 81% PPI: Two-Hybrid ortholog Arabidopsis Wheat http://knetminer.rothamsted.ac.uk/Triticum_aestivum/
  • 35. IRRI Germplasm Acquisition of early seedling traits and image processing GWAS Gene discovery related to seed vigour Guillaume Menard Smita Kurup Peter Eastmond Kirsty Hassall David Hughes Colin Li Direct Seeded Rice Improve seedling establishment, emergence and vigour http://knetminer.rothamsted.ac.uk/Oryza_sativa/
  • 37. Future Work Growing and Learning the Knowledge Graph
  • 40. Future development • Personalised search experience • Save and share your networks • Like and Dislike buttons on relations • Personalised networks based on usage data • Better knowledge visualisation • Reduce information overload using network clustering • Annotate nodes with quantitative user data • Predictive network analysis • Find research trends using publication and graph data • Automatic story and hypotheses generation
  • 41. Future vision – Automatic story generation TT2 (R Myb) on chromosome 3D in wheat is predicted (p-value=0.01) to regulate the transcriptional activation of MFT according to data from the analysis of 850 RNA-seq samples in wheat (Ramírez-González et al. 2018) using GENIE3 (Huynh-Thu et al. 2010). The TT2 3B homeologue is not predicted to regulate MFT, and the TT2 3A homeologue is not annotated in the latest version of the wheat genome. MFT has been recently linked to grain germination [(Zong Y ; Li Q ): “Recent studies in both Arabidopsis and wheat have uncovered a new role of MOTHER OF FT AND TFL1 (MFT) in seed germination”] and seed dormancy [(Nakamura S ): “Mapping analysis showed that MFT on chromosome 3A (MFT-3A) colocalized with the seed dormancy quantitative trait locus (QTL) QPhs.ocs-3A.”]. The MFT ortholog in Arabidopsis has a 3’ UTR variant that has been associated with (p-value = 5.5x10-5) increased germination rate after 56 days of dry storage (Atwell et al. 2010). • Well structured semantics of entities and relationships in the network • Maintaining provenance of derived relations • Confidence values on relations • Supporting literature
  • 42. KnetMiner Impact • Over 1800 unique users last year • 68% of users from non-UK countries • KnetMiner part of 3 grants recently submitted to BBSRC (BBR, sLoLa) • KnetMiner code is open-source; v3.0 released in Feb 2019 • Developers are starting to contribute to our open-source tools • Resources are starting to link into KnetMiner (eg. WheatIS, T3, Ensembl) • Requests to build KnetMiner for new species (eg. Sugarcane, honey bee) • Invited to run training courses and workshops (eg. EBI, PAG, IB2018) • KnetMiner used in two agrifood companies, several companies have expressed interest
  • 43. The right data in the right format and in the right hands at the right time, saves lives. #us2ts2019 @hdeus
  • 44. Acknowledgements (strings - for humans only) Bioinformatics Lab Ajit Singh Marco Brandizi Sandeep Amberkar Emma Bailey Dan Smith David Hughes Rob King Colin Li Chris Rawlings William Brown (ITS) Follow us on Twitter: @KnetMiner Collaborators & Contributors Richard Holland (NFVL) Misha Kapushesky (Genestack) Kevin Dialdestoro (Genestack) Vasiliki Koutra (KCL) Martin Castellote (INTA) Philipp Bayer (UWA) Jean-Luc Jannink (Cornell) Clay Birkett (Cornell) Cyril Pommier (INRA) Ramil Mauleon (IRRI) Jan Taubert (KWS) Uwe Scholz (IPK) Matthias Lange (IPK) Kumar Saurabh Singh (Exeter) Monika Mistry DFW WP4 members Data Contributors Clay Birkett (Cornell) Cristobal Uauy (JIC) Philippa Borrill (JIC) Andrea Bräutigam (Uni Bielefeld) Philipp Bayer (UWA) Ramil Mauleon (IRRI) Funding Designing Future Wheat (BBSRC) Pest Genomics Initiative Users Andy Phillips (RRes) Rowan Mitchell (RRes) Steve Hanley (RRes) Richard Barker (NASA)
  • 45. Acknowledgements (things - for machines &humans) www.knetminer.org Twitter: @KnetMiner

Notas do Editor

  1. Good afternoon. Thank you for the kind introduction. Coincidently, I’ve been with Rothamsted 11-years today. I joined the BAB department in 2008, and now as Head of Bioinformatics, my team and I have worked and collaborated with many of you. Researchers at Rothamsted and around the world are working to push the boundaries of human knowledge and tackle key challenges like sustainable agriculture, climate change and food security. You would think they have access to the best available tools to help them in their quest for knowledge. In reality the opposite is often true: the research tools at our disposal are only substandard - finding and accessing relevant information can be extremely tedious. In this seminar, I am going to present how we developed an intelligent data model, known as the KnetMiner Knowledge Graph, that helps you as a researcher to discover new information quickly and easily.
  2. My talk is structured in 5 sections What do we understand with biological knowledge discovery? What are graphs and how are they useful? How do we build the KnetMiner knowledge graph? Finally I will show some use cases and discuss future work.
  3. There are many types of biological knowledge discovery, e.g. Drug discovery, trait discovery or gene discovery. Today I will mostly focus on gene discovery, so what is gene discovery?
  4. Let me use the metaphor “Genes are rarely single actors” to explain what we mean with gene discovery.
  5. Assembling the evidence is a tedious job for humans. Machines can help but face challenges. Let’s review the type of evidences needed to go from data to understanding. Clifford Stoll, an American Astronomer, said: “Data is not information, [...]” Data comes in many formats Excel spreadsheets, FASTA, GFF, Tabular, XML and you name it. FAIR. Information is in many locations and interconnected Over 1600 biological databases (NAR Database Issue 2019) Knowledge can be buried in free text or in structured form (like ontologies) Over 29 million articles (PubMed) - Over 750 biomedical ontologies (BioPortal) Ontologies are structured keyword lists that life scientists have created to define and connect concepts in their subject areas. Understanding needs intelligence Human vs artificial If all this layers are structured such that machines can understand what we mean (FAIR), then machines can play a crucial role in helping us to understand traits.
  6. KnetMiner, with a silent "K" and standing for Knowledge Network Miner, is a web-app that makes biological knowledge discovery faster and fun. It is used by researchers at Rothamsted and around the world, 68% of users are from outside of UK Ajit Singh works on the front-end and usability of KnetMiner. Marco Brandizi works on the back-end technologies and interoperability of KnetMiner.
  7. KnetMiner is NOT a black box. It is powered by a large integrated knowledge graph and graph analytics algorithms to search and visualise knowledge.
  8. Graph analytics has a history dating back to 1736, when Leonhard Euler solved the “Seven Bridges of Königsberg” problem.
  9. The problem asked whether it was possible to visit all four areas of a city, connected by seven bridges, while only crossing each bridge once. With the insight that only the connections themselves were relevant, Euler set the groundwork for graph theory and its mathematics. Euler modelled the problem as a graph of 4 x nodes and 7 x edges Euler proved that in any graph such walks are only possible if exactly zero or two nodes have an odd number of edges. The four areas (nodes) of Königsberg however all had odd number of bridges (edges). So it wasn’t possible! But graph analytics did not catch on immediately. Two hundred years would pass before the first graph textbook was published in 1936. In the late 1960s and 1970s, network science and applied graph analytics really began to emerge. In the last few years, there’s been an explosion of interest in graph technologies. In 2017, a survey (Forrester data) indicated that “69% of enterprises have or plan to implement graph databases within the next 12 months.” Demand is accelerating based on a need to better understand real-world networks and forecast their behaviours, which is resulting in many new graph-based solutions.
  10. Networks are a representation, a tool to understand complex systems and the complex connections inherent in today’s data. For example, you can represent how protein or social systems work by thinking about interactions between pairs of proteins or people. By analyzing the structure of this representation, we answer questions and make predictions about how the system works or how individuals behave within it. In this sense, network science is a set of technical tools applicable to nearly any domain, and graphs are the mathematical models used to perform analysis.
  11. Most graph algorithms can be classified in three major groups: pathfinding, centrality and community detection KnetMiner uses algorithms from all three categories to… Find the shortest path or evaluate route availability and quality (Pathfinding) Determine the importance of distinct nodes in the network (Centrality) Visualise graphs based on how a group is clustered or partitioned (Community detections)
  12. Knowledge graphs are special type of graph. They aim to structure what is known about the world. Nodes and relations have a type (Institute, Site, based in, distance to) Relations are based on known facts (nano-publications) Properties (miles, year) There are often different ways to model the real-world: example year. Always ask your knowledge engineer!
  13. The knowledge graph encodes relationships between entities as an explicit relationship in a database, and not just in a sentence on a document, essentially making it machine readable. KGs makes information that much easier to discover and enable world-aware inferences. Google is building their own KG, let’s take a look how it’s used. https://paul4innovating.com/2018/05/15/the-arrival-and-potential-of-knowledge-graphs-into-our-world/simple-enough-for-knowledge-graphs/
  14. The Knowledge Graph can help you make some unexpected discoveries. You might learn a new fact or new connection that prompts a whole new line of inquiry. Do you know where Matt Groening, the creator of the Simpsons, got the idea for Homer, Marge and Lisa’s names? It’s a bit of a surprise. All of these are linked in the graph. It’s not just a catalog of objects; it also models all these inter-relationships. It’s the intelligence between these different entities that’s the key.
  15. I am now going to introduce the KnetMiner KG – explain what information it captures and how we build it.
  16. What types of entities do we need for evidence-based gene discovery? Gene, RNA, Protein, Metabolite, SNP, Phenotype, Publication: These are some examples of things we capture in our KG. A KG can always be extended with other types of entities. The intelligence between these different entities is the key.
  17. Where do KnetMiner relations come from? This is taken from the wheat knetminer release notes, relations come from: Public databases Pre-publication collaborative datasets Extracted from the literature (pubmed abstracts) Generated by KnetMiner itself
  18. So how do we convert datasets into a graph? Explain columns The relationship between the columns is hard to understand for machines.
  19. But when we transform the table into a graph of things and their explicit relationships, the data becomes more meaningful to machines. And connections are easier to “see” for humans.
  20. If we go and take a second dataset, like PPI from BioGrid…
  21. We can overlay it over the graph and build a more comprehensive biological picture
  22. KnetMiner KGs can have millions of nodes and edges. How do we store it and make it accessible? We have made important progress towards making the KG FAIR using new technologies to replace the existing Ondex backend Neo4j graph database with Cypher graph query language Semantic Web (RDF) knowledgebase with SPARQL endpoint
  23. Graph Query Languages allow us to query the knowledge programmatically and build intelligent search tools that accelerate discovery. If you like to know more about this work, please read our recent publications.
  24. In simple terms, once we have the KG, methods are needed to evaluate…
  25. Hypotheses validation and generation
  26. Mining in public databases
  27. No statistically significant hits (genes) in AraGWAS But biologically significant path in the knowledge graph exist
  28. Analyse your private GWAS/QTL data
  29. By Helena Deus
  30. Acknowledgements: Various collaborators who have worked with us on KnetMiner, including contributing partners from science, academia and industry.
  31. Acknowledgements: Various collaborators who have worked with us on KnetMiner, including contributing partners from science, academia and industry. Note: Keywan (the “K”)