SlideShare uma empresa Scribd logo
1 de 21
KUPKB: Sharing, Connecting and
 Exposing Kidney and Urinary
Knowledge using RDF and OWL




              www.kupkb.org

        Julie Klein & Simon Jupp
      Bio-health informatics group
        University of Manchester
The problem domain

Thousands of studies have been conducted by the kidney research community



          On different species
                                            human   mouse



          On different materials

                                            urine   tissue    cell



         • On different biological levels

                                            gene    protein



      Large diversity  Integration of the knowldege is complex
Where does the data go?
      Bespoke kidney laboratory databases
                                             Research Papers




            Generalist databases




Scattered, hidden in figures, coming in different formats
                Most of the data is lost!
The Kidney and Urinary Pathway Knowledge Base:

                                        SHARE AND CONNECT

The iKUP Browser:

                                        EXPOSE


                        www.kupkb.org
Stucture

 Populous
                           Experimental data


KUP Ontology
  (schema)
                                  RightField




               RDF triple store


                                               iKUP Browser
                KUP Knowledge Base
Ontologies provide the schema
                  What has been observed, where and when?



      Mouse anatomy                                             Experimental factors
         ontology

                                     Gene Ontology


                                                                  Animal model
      Cell type ontology
                                                                Disease ontology




             We needed to connect these reference ontologies.
Creation of a specialized Kidney and Urinary Pathway Ontology (KUPO)
                           http://www.e-lico.org/public/kupo/
Ontologies by stealth
                 The domain experts are the experts so get them build it
                                                    Biological
                             Cells        Anatomy
                                                    processes(
                            (CTO)          (MAO)
                                                       GO)

Spreadsheet
  OPPL Scripts




 Ontology

                 Populous generates simple Excel based templates
                          http://www.e-lico.eu/populous.html
Describing/Collecting experimental data
Gathering good meta-data AND data again by stealth using RightField




                      Content of the meta-data cells is constraint to
                      the relevant set of KUPO terms




                  http://www.sysmo-db.org/rightfield
Describing/Collecting experimental data
Gathering good meta-data AND data again by stealth using RightField




                      Content of the meta-data cells is constraint to
                      the relevant set of KUPO terms
Mashing it all together



 Kidney and Urinary Pathway Ontology                          Experimental data
~1800 classes (~40,000 after imports closure)         220 KUP experiments integrated


                                        Owl reasoning




              RDF triple store
               ~35M triples
                                         KUP Knowledge Base
SPARQLing results
Make it all RDF/OWL and expose a SPARQL endpoint…
                                       …then we are done right?

   We can now ask queries that span several databases
   We can exploit OWL semantics for intelligent answers


     BUT!
 Easy to use application…
                                       …this is what the biologist really want
The iKUP browser




Built as an easy-to-use and light Google Web Toolkit application
To expose data from the KUPKB
Doing some biology
1. A biological question         2. No answer with classical tools
Can calreticulin be associated   Search in Pubmed and Google does
to the development of human      not return any relevant result!
kidney disease?



3. Querying the KUPKB




4. Validation in the wet-lab     5. Publish an innovative result
KUPKB in silico result           Accepted for publication in the FASEB J!
confirmed.
Reusing and Building


Ontologies provide the schema                        Experimental data


                                Owl reasoning




       RDF triple store
                                KUP Knowledge Base
Reusing and Building


    Ontologies provide the schema                          Experimental data
Kidney and Urinary Pathway Ontology                 Annotations, homogenization
   Tool to facilitate building of onto.            Tool to facilitate data annotation
                                        Owl reasoning




            RDF triple store                                     iKUP Browser
                                     KUP Knowledge Base
What next


 User study and evaluation experiments ongoing with
  Manchester Web Ergonomics Lab

 Application to other biological domains
    Change the domain model in the ontologies and we can construct any
     organ knowledge base in this way
    Already interests in gut, liver, heart and metabolic diseases
Acknowledgments
•   Simon Jupp

•   Stuart Owen, Matthew Horridge, Katy Wolstencroft and Carole Goble @
    University of Manchester for RightField

•   Joost Schanstra, Panagiotis Moulos, Jean-Loup Bascands @ Renal Fibrosis
    Lab, Toulouse, France

•   Aristidis Charonis, Bénédicte Buffin-Meyer, Myriem Fernandez for the CALR
    example

•   e-LICO FP7 project and EuroKUP

•   Robert Stevens, ontology development, University of Manchester

    Open Source License: GNU Lesser General Public License
    Code: http://code.google.com/p/kupkb-dev/
Thank you for listening…




www.kupk b .or
Some rough stats…
• 195 KUP experiments integrated
• KUPKB RDF store ~35M triples
• KUPK Ontology ~1800 classes. ~40,000 after imports closure



Architecture
• Sesame and BigOWLIM for the RDF store
• Web site developed with Google web toolkit
• OWL API and HermiT reasoner for classification and faceted browsing
Summary
   The KUPKB RDF store is a mashup of biological knowledge relating to the
    KUP domain

   Ontologies provide the schema and a consistent data annotation mechanism

   We expose this knowledge base through a simple web interface that real
    biologists can use, the iKUP

   iKUP and KUPKB provides a faster mechanism for the biologist to survey the
    data in biological publications and helps the hypothesis generation process.

   It is a testament to the tools and APIs that such applications are now being
    delivered at relatively low cost

Mais conteúdo relacionado

Mais procurados

Biological Database Systems
Biological Database SystemsBiological Database Systems
Biological Database Systems
Denis Shestakov
 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical Sciences
Connected Data World
 
Biodatabases 101220022654-phpapp02
Biodatabases 101220022654-phpapp02Biodatabases 101220022654-phpapp02
Biodatabases 101220022654-phpapp02
Sreekanth Gali
 
Toast 2015 qiime_talk
Toast 2015 qiime_talkToast 2015 qiime_talk
Toast 2015 qiime_talk
TOASTworkshop
 
Java Introductie
Java IntroductieJava Introductie
Java Introductie
mbruggen
 

Mais procurados (19)

Article
ArticleArticle
Article
 
Introduction to 16S Analysis with NGS - BMR Genomics
Introduction to 16S Analysis with NGS - BMR GenomicsIntroduction to 16S Analysis with NGS - BMR Genomics
Introduction to 16S Analysis with NGS - BMR Genomics
 
Flash introduction to Qiime2 -- 16S Amplicon analysis
Flash introduction to Qiime2 -- 16S Amplicon analysisFlash introduction to Qiime2 -- 16S Amplicon analysis
Flash introduction to Qiime2 -- 16S Amplicon analysis
 
Representation of kidney structures in Uberon
Representation of kidney structures in UberonRepresentation of kidney structures in Uberon
Representation of kidney structures in Uberon
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomics
 
Prosite
PrositeProsite
Prosite
 
Biological Database Systems
Biological Database SystemsBiological Database Systems
Biological Database Systems
 
2014 bangkok-talk
2014 bangkok-talk2014 bangkok-talk
2014 bangkok-talk
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2
 
Rishi
RishiRishi
Rishi
 
BHL @ #TDWG09 - with discussion
BHL @ #TDWG09 - with discussionBHL @ #TDWG09 - with discussion
BHL @ #TDWG09 - with discussion
 
Use of data
Use of dataUse of data
Use of data
 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical Sciences
 
Ensembl plants hsf_d_bolser_2012
Ensembl plants hsf_d_bolser_2012Ensembl plants hsf_d_bolser_2012
Ensembl plants hsf_d_bolser_2012
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Biodatabases 101220022654-phpapp02
Biodatabases 101220022654-phpapp02Biodatabases 101220022654-phpapp02
Biodatabases 101220022654-phpapp02
 
Toast 2015 qiime_talk
Toast 2015 qiime_talkToast 2015 qiime_talk
Toast 2015 qiime_talk
 
WikiPathways: how open source and open data can make omics technology more us...
WikiPathways: how open source and open data can make omics technology more us...WikiPathways: how open source and open data can make omics technology more us...
WikiPathways: how open source and open data can make omics technology more us...
 
Java Introductie
Java IntroductieJava Introductie
Java Introductie
 

Destaque

Destaque (8)

S Cheng - eagle-i: development and expansion of a scientific resource discove...
S Cheng - eagle-i: development and expansion of a scientific resource discove...S Cheng - eagle-i: development and expansion of a scientific resource discove...
S Cheng - eagle-i: development and expansion of a scientific resource discove...
 
D Baker - Galaxy Update
D Baker - Galaxy UpdateD Baker - Galaxy Update
D Baker - Galaxy Update
 
B Kinoshita - Creating biology pipelines with BioUno
B Kinoshita - Creating biology pipelines with BioUnoB Kinoshita - Creating biology pipelines with BioUno
B Kinoshita - Creating biology pipelines with BioUno
 
D Robinson - Using HDF5 to work with large quantities of rich biological data
D Robinson - Using HDF5 to work with large quantities of rich biological dataD Robinson - Using HDF5 to work with large quantities of rich biological data
D Robinson - Using HDF5 to work with large quantities of rich biological data
 
A Kalderimis - InterMine: Embeddable datamining components
A Kalderimis - InterMine: Embeddable datamining componentsA Kalderimis - InterMine: Embeddable datamining components
A Kalderimis - InterMine: Embeddable datamining components
 
M Gumbel - SCABIO: a framework for bioinformatics algorithms in Scala
M Gumbel - SCABIO: a framework for bioinformatics algorithms in ScalaM Gumbel - SCABIO: a framework for bioinformatics algorithms in Scala
M Gumbel - SCABIO: a framework for bioinformatics algorithms in Scala
 
VIZBI 2014 - Visualizing Genomic Variation
VIZBI 2014 - Visualizing Genomic VariationVIZBI 2014 - Visualizing Genomic Variation
VIZBI 2014 - Visualizing Genomic Variation
 
Wolstencroft K - Workflows on the Cloud: scaling for national service
Wolstencroft K - Workflows on the Cloud: scaling for national serviceWolstencroft K - Workflows on the Cloud: scaling for national service
Wolstencroft K - Workflows on the Cloud: scaling for national service
 

Semelhante a J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledge using RDF and OWL

Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014
Monica Munoz-Torres
 
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
Neo4j
 
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesApollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Monica Munoz-Torres
 

Semelhante a J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledge using RDF and OWL (20)

Reverse-and forward-engineering specificity of carbohydrate-processing enzymes
Reverse-and forward-engineering specificity of carbohydrate-processing enzymesReverse-and forward-engineering specificity of carbohydrate-processing enzymes
Reverse-and forward-engineering specificity of carbohydrate-processing enzymes
 
Connecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics InstituteConnecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics Institute
 
Facilitating semantic alignment.-biohackathon-jupp
Facilitating semantic alignment.-biohackathon-juppFacilitating semantic alignment.-biohackathon-jupp
Facilitating semantic alignment.-biohackathon-jupp
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven Research
 
The Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in BiologyThe Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in Biology
 
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
 
Towards Reproducible Science: a few building blocks from my personal experience
Towards Reproducible Science: a few building blocks from my personal experienceTowards Reproducible Science: a few building blocks from my personal experience
Towards Reproducible Science: a few building blocks from my personal experience
 
Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.
 
Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014
 
Ontologies neo4j-graph-workshop-berlin
Ontologies neo4j-graph-workshop-berlinOntologies neo4j-graph-workshop-berlin
Ontologies neo4j-graph-workshop-berlin
 
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
 
The Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersThe Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics Researchers
 
Collaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of LifeCollaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of Life
 
Introduction to EOL.org for scientists
Introduction to EOL.org for scientistsIntroduction to EOL.org for scientists
Introduction to EOL.org for scientists
 
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesApollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
 
Una estrategia para la integración de ontologías, servicios web y PLN en el a...
Una estrategia para la integración de ontologías, servicios web y PLN en el a...Una estrategia para la integración de ontologías, servicios web y PLN en el a...
Una estrategia para la integración de ontologías, servicios web y PLN en el a...
 
Protein Database
Protein DatabaseProtein Database
Protein Database
 
Bio solr building a better search for bioinformatics
Bio solr   building a better search for bioinformaticsBio solr   building a better search for bioinformatics
Bio solr building a better search for bioinformatics
 
Introduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyIntroduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental Biology
 
Introduction to Apollo - i5k Research Community – Calanoida (copepod)
Introduction to Apollo - i5k Research Community – Calanoida (copepod)Introduction to Apollo - i5k Research Community – Calanoida (copepod)
Introduction to Apollo - i5k Research Community – Calanoida (copepod)
 

Mais de Jan Aerts

Visual Analytics in Omics - why, what, how?
Visual Analytics in Omics - why, what, how?Visual Analytics in Omics - why, what, how?
Visual Analytics in Omics - why, what, how?
Jan Aerts
 
Visual Analytics talk at ISMB2013
Visual Analytics talk at ISMB2013Visual Analytics talk at ISMB2013
Visual Analytics talk at ISMB2013
Jan Aerts
 
Visualizing the Structural Variome (VMLS-Eurovis 2013)
Visualizing the Structural Variome (VMLS-Eurovis 2013)Visualizing the Structural Variome (VMLS-Eurovis 2013)
Visualizing the Structural Variome (VMLS-Eurovis 2013)
Jan Aerts
 

Mais de Jan Aerts (20)

Visual Analytics in Omics - why, what, how?
Visual Analytics in Omics - why, what, how?Visual Analytics in Omics - why, what, how?
Visual Analytics in Omics - why, what, how?
 
Visual Analytics in Omics: why, what, how?
Visual Analytics in Omics: why, what, how?Visual Analytics in Omics: why, what, how?
Visual Analytics in Omics: why, what, how?
 
Visual Analytics talk at ISMB2013
Visual Analytics talk at ISMB2013Visual Analytics talk at ISMB2013
Visual Analytics talk at ISMB2013
 
Visualizing the Structural Variome (VMLS-Eurovis 2013)
Visualizing the Structural Variome (VMLS-Eurovis 2013)Visualizing the Structural Variome (VMLS-Eurovis 2013)
Visualizing the Structural Variome (VMLS-Eurovis 2013)
 
Humanizing Data Analysis
Humanizing Data AnalysisHumanizing Data Analysis
Humanizing Data Analysis
 
Intro to data visualization
Intro to data visualizationIntro to data visualization
Intro to data visualization
 
L Fu - Dao: a novel programming language for bioinformatics
L Fu - Dao: a novel programming language for bioinformaticsL Fu - Dao: a novel programming language for bioinformatics
L Fu - Dao: a novel programming language for bioinformatics
 
J Wang - bioKepler: a comprehensive bioinformatics scientific workflow module...
J Wang - bioKepler: a comprehensive bioinformatics scientific workflow module...J Wang - bioKepler: a comprehensive bioinformatics scientific workflow module...
J Wang - bioKepler: a comprehensive bioinformatics scientific workflow module...
 
S Cain - GMOD in the cloud
S Cain - GMOD in the cloudS Cain - GMOD in the cloud
S Cain - GMOD in the cloud
 
B Temperton - The Bioinformatics Testing Consortium
B Temperton - The Bioinformatics Testing ConsortiumB Temperton - The Bioinformatics Testing Consortium
B Temperton - The Bioinformatics Testing Consortium
 
J Goecks - The Galaxy Visual Analysis Framework
J Goecks - The Galaxy Visual Analysis FrameworkJ Goecks - The Galaxy Visual Analysis Framework
J Goecks - The Galaxy Visual Analysis Framework
 
S Cain - GMOD in the cloud
S Cain - GMOD in the cloudS Cain - GMOD in the cloud
S Cain - GMOD in the cloud
 
B Chapman - Toolkit for variation comparison and analysis
B Chapman - Toolkit for variation comparison and analysisB Chapman - Toolkit for variation comparison and analysis
B Chapman - Toolkit for variation comparison and analysis
 
P Rocca-Serra - The open source ISA metadata tracking framework: from data cu...
P Rocca-Serra - The open source ISA metadata tracking framework: from data cu...P Rocca-Serra - The open source ISA metadata tracking framework: from data cu...
P Rocca-Serra - The open source ISA metadata tracking framework: from data cu...
 
A Kanterakis - PyPedia: a python crowdsourcing development environment for bi...
A Kanterakis - PyPedia: a python crowdsourcing development environment for bi...A Kanterakis - PyPedia: a python crowdsourcing development environment for bi...
A Kanterakis - PyPedia: a python crowdsourcing development environment for bi...
 
E Afgan - Zero to a bioinformatics analysis platform in four minutes
E Afgan - Zero to a bioinformatics analysis platform in four minutesE Afgan - Zero to a bioinformatics analysis platform in four minutes
E Afgan - Zero to a bioinformatics analysis platform in four minutes
 
M Reich - GenomeSpace
M Reich - GenomeSpaceM Reich - GenomeSpace
M Reich - GenomeSpace
 
CT Brown - Doing next-gen sequencing analysis in the cloud
CT Brown - Doing next-gen sequencing analysis in the cloudCT Brown - Doing next-gen sequencing analysis in the cloud
CT Brown - Doing next-gen sequencing analysis in the cloud
 
L Forer - Cloudgene: an execution platform for MapReduce programs in public a...
L Forer - Cloudgene: an execution platform for MapReduce programs in public a...L Forer - Cloudgene: an execution platform for MapReduce programs in public a...
L Forer - Cloudgene: an execution platform for MapReduce programs in public a...
 
Holland R - Pistoia Alliance Sequence Squeeze
Holland R - Pistoia Alliance Sequence SqueezeHolland R - Pistoia Alliance Sequence Squeeze
Holland R - Pistoia Alliance Sequence Squeeze
 

Último

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 

Último (20)

Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 

J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledge using RDF and OWL

  • 1. KUPKB: Sharing, Connecting and Exposing Kidney and Urinary Knowledge using RDF and OWL www.kupkb.org Julie Klein & Simon Jupp Bio-health informatics group University of Manchester
  • 2. The problem domain Thousands of studies have been conducted by the kidney research community  On different species human mouse  On different materials urine tissue cell • On different biological levels gene protein Large diversity  Integration of the knowldege is complex
  • 3. Where does the data go? Bespoke kidney laboratory databases Research Papers Generalist databases Scattered, hidden in figures, coming in different formats Most of the data is lost!
  • 4. The Kidney and Urinary Pathway Knowledge Base: SHARE AND CONNECT The iKUP Browser: EXPOSE www.kupkb.org
  • 5. Stucture Populous Experimental data KUP Ontology (schema) RightField RDF triple store iKUP Browser KUP Knowledge Base
  • 6. Ontologies provide the schema What has been observed, where and when? Mouse anatomy Experimental factors ontology Gene Ontology Animal model Cell type ontology Disease ontology We needed to connect these reference ontologies. Creation of a specialized Kidney and Urinary Pathway Ontology (KUPO) http://www.e-lico.org/public/kupo/
  • 7. Ontologies by stealth The domain experts are the experts so get them build it Biological Cells Anatomy processes( (CTO) (MAO) GO) Spreadsheet OPPL Scripts Ontology Populous generates simple Excel based templates http://www.e-lico.eu/populous.html
  • 8. Describing/Collecting experimental data Gathering good meta-data AND data again by stealth using RightField Content of the meta-data cells is constraint to the relevant set of KUPO terms http://www.sysmo-db.org/rightfield
  • 9. Describing/Collecting experimental data Gathering good meta-data AND data again by stealth using RightField Content of the meta-data cells is constraint to the relevant set of KUPO terms
  • 10. Mashing it all together Kidney and Urinary Pathway Ontology Experimental data ~1800 classes (~40,000 after imports closure) 220 KUP experiments integrated Owl reasoning RDF triple store ~35M triples KUP Knowledge Base
  • 11. SPARQLing results Make it all RDF/OWL and expose a SPARQL endpoint… …then we are done right?  We can now ask queries that span several databases  We can exploit OWL semantics for intelligent answers BUT!  Easy to use application… …this is what the biologist really want
  • 12. The iKUP browser Built as an easy-to-use and light Google Web Toolkit application
  • 13. To expose data from the KUPKB
  • 14. Doing some biology 1. A biological question 2. No answer with classical tools Can calreticulin be associated Search in Pubmed and Google does to the development of human not return any relevant result! kidney disease? 3. Querying the KUPKB 4. Validation in the wet-lab 5. Publish an innovative result KUPKB in silico result Accepted for publication in the FASEB J! confirmed.
  • 15. Reusing and Building Ontologies provide the schema Experimental data Owl reasoning RDF triple store KUP Knowledge Base
  • 16. Reusing and Building Ontologies provide the schema Experimental data Kidney and Urinary Pathway Ontology Annotations, homogenization Tool to facilitate building of onto. Tool to facilitate data annotation Owl reasoning RDF triple store iKUP Browser KUP Knowledge Base
  • 17. What next  User study and evaluation experiments ongoing with Manchester Web Ergonomics Lab  Application to other biological domains  Change the domain model in the ontologies and we can construct any organ knowledge base in this way  Already interests in gut, liver, heart and metabolic diseases
  • 18. Acknowledgments • Simon Jupp • Stuart Owen, Matthew Horridge, Katy Wolstencroft and Carole Goble @ University of Manchester for RightField • Joost Schanstra, Panagiotis Moulos, Jean-Loup Bascands @ Renal Fibrosis Lab, Toulouse, France • Aristidis Charonis, Bénédicte Buffin-Meyer, Myriem Fernandez for the CALR example • e-LICO FP7 project and EuroKUP • Robert Stevens, ontology development, University of Manchester Open Source License: GNU Lesser General Public License Code: http://code.google.com/p/kupkb-dev/
  • 19. Thank you for listening… www.kupk b .or
  • 20. Some rough stats… • 195 KUP experiments integrated • KUPKB RDF store ~35M triples • KUPK Ontology ~1800 classes. ~40,000 after imports closure Architecture • Sesame and BigOWLIM for the RDF store • Web site developed with Google web toolkit • OWL API and HermiT reasoner for classification and faceted browsing
  • 21. Summary  The KUPKB RDF store is a mashup of biological knowledge relating to the KUP domain  Ontologies provide the schema and a consistent data annotation mechanism  We expose this knowledge base through a simple web interface that real biologists can use, the iKUP  iKUP and KUPKB provides a faster mechanism for the biologist to survey the data in biological publications and helps the hypothesis generation process.  It is a testament to the tools and APIs that such applications are now being delivered at relatively low cost

Notas do Editor

  1. Renal physiology Human urinary protein map Renal pathophysiology Biomarker discovery
  2. Animate
  3. Animate
  4. Animate
  5. Animate