SlideShare a Scribd company logo
1 of 20
Download to read offline
ISA project , ISA tools and
RDF conversion efforts
Philippe Rocca-Serra
Oxford University, Oxford, UK
HCLS Scientific Discourse call, November, 29th 2010
proccaserra@gmail.com
mage-tab | pride-ml | sra-xml | others
	

	

 	

 	

 ISA infrastructure overview
A focus on standards...
comply to
common
standards
Not just microarray data....
Telling more about experimental design
repeated measurements, sample sizes....
IISAcreator: a tool for reporting studies
structured reports...declaring variables
Making sense: ontology term tagging
ISAcreator Configurator provides configurations to ISAcreator...
These configurations tell ISAcreator what is the minimum
amount of information needed to describe experiments.
ISAcreator is packaged with a default set of configurations,
however you can create your own...
MIAPE
XML
MIAME
XML
MIMS
XML
MIENS
XML
MIGS
XML
convert to different formats for submission to public repositories,
e.g. MAGE-TAB (for ArrayExpress), PRIDE-ML (for PRIDE) or SRA-XML (for ENA/NCBI)
ISA Converter: parsing ISA-TAB
documents ->Conversion to Objects
Why an RDF conversion?
• Interest in federated queries
• Harvard collaborators (S. Das,T.
Clark,W Hide, O Hoffmann)
Why are we doing this?
• Experiments where transcription profiling and
metaboliting profiling  and liver injury in rodent
• Experiments funded by UK BBSRC
• Experiments performed by an organization located
in the Netherlands
• Experiments performed on rodent where there are
at least 3 biological replicates per treatment groups
• Experiments performed by persons belonging to
John smith group.
RDF conversion: the plan
• Initial focus on representation of
experimental design
• treatment, perturbation
• response variable
• Later on, focus on molecular dimension
• rely on biordf preliminary work on
gene expression (generilized solution)
RDF conversion: resources
• Identifying Existing Ontological Resources
• dc, skos for document metadata
• foaf, foafCorp, vcard for Person/Contact
• bibo, cito, fabio for Publication references.
• swan experiment, obi for material processing, data
production & analysis
RDF conversion: snippet
RDF
conversion:
Experimental
graph
Credit: Sudeshna Das,Tim Clark, HCLS Sci-Disc,
November 2010
protocol
planned process
transcription profiling
measurement datum
transcript
abundance*
MOE430_2 design*
planning
labeled cRNA
image
Affymetrix
has_specific_output
has_specific_input
is_about
utilises instrument
is manufacturer of
total RNA
collecting specimen from organism
blood specimen
liver specimen
skeletal muscle specimen*
gonadal adipose tissue specimen*
total RNA extraction
intraperitoneal
administration
Rattus norvegicus
treated subject*
labeling
has_specific_output
has_specific_input
has_specific_output
nucleic acid hybridization
has_specific_input
strain
chemical compound
has_specific_output
has_specific_input
has_specific_output
has_specific_input
independent
variable
specification
dependent variable specification
biotin
label role
duration of exposure
DNA microarray
feature extraction
data
transformation
specimen role anatomical entity
factorial design
treated organism*
metabolite concentration*
metabolite profiling utilises instrument Instrument
5 mm inverse geometry 1H/broadband probe
NMR assay
bearer_of
derives_from
is_a
has_specific_input
realizes
has_part
has_specific_output
utilises instrument
image acquisition /scanning
free induction
decay
spectrum*
utilises instrument
has_specific_input
has_specific_output
organism
bearer_of
hybridized microarray slide*
transcription measurement function
inheres_inrealizes
concretizes
is_about
is_about
is_about
is_about
is_about
chemical mixture
treated role ?
bearer_of
has_part
study design
has_part
is_about
measuring function
intensity of magnetic field
number of acquisition
extraction
phenol phase
supernatant*
GCRMA
normalization*
has_specific_input
has_specific_output
is_a
orotic acid*
DMSO
is_a
Wistar rat*; Kyoto rat*
is_a
1 day post injection*
14 days post injection*
is_a
normalized data
set has_specific_output
has_specific_input
has_specific_output
is_duration_of
waiting
realizes/
concretizes
some
specification
Bruker BEST NMR system
has_part
has_specific_input
has_specific_
output
is_a
has_specific_input
has_specific_output
measured
expression level
Transcript
metabolite
is_about
oligonucleotide sequence
has_part
derives_from
is_proxy_for
manufacturing
RNA
is_a
is_about
transformed data
set
is_proxy_for
has_specific_output
realizes
complementary nucleotide probe role
bearer_of
is_about
realizes
inheres_in
3 days post injection*
isatab.sf.net
proccaserra@gmail.com
Acknowledgements
Susanna Sansone, Un. of Oxford
Eamonn Maguire, Un. of Oxford
SWAN-Data-Experiments working group
Sudeshna Das
Tim Clark
Stephane Corlosquet
HCLS working groups

More Related Content

Similar to Hcls sci disc-isa2rdf

Strata NYC 2015 - Supercharging R with Apache Spark
Strata NYC 2015 - Supercharging R with Apache SparkStrata NYC 2015 - Supercharging R with Apache Spark
Strata NYC 2015 - Supercharging R with Apache Spark
Databricks
 
Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08
Jian Qin
 
E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016
Sven Schlarb
 
Research Shared: researchobject.org
Research Shared: researchobject.orgResearch Shared: researchobject.org
Research Shared: researchobject.org
Norman Morrison
 

Similar to Hcls sci disc-isa2rdf (20)

FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
 
On the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream ProcessingOn the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream Processing
 
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
 
Strata NYC 2015 - Supercharging R with Apache Spark
Strata NYC 2015 - Supercharging R with Apache SparkStrata NYC 2015 - Supercharging R with Apache Spark
Strata NYC 2015 - Supercharging R with Apache Spark
 
Introduction to Galaxy and RNA-Seq
Introduction to Galaxy and RNA-SeqIntroduction to Galaxy and RNA-Seq
Introduction to Galaxy and RNA-Seq
 
Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08
 
COPO kick-off meeting
COPO kick-off meetingCOPO kick-off meeting
COPO kick-off meeting
 
Spark meetup TCHUG
Spark meetup TCHUGSpark meetup TCHUG
Spark meetup TCHUG
 
04 open source_tools
04 open source_tools04 open source_tools
04 open source_tools
 
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習 Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
 
Standarization in Proteomics: From raw data to metadata files
Standarization in Proteomics: From raw data to metadata filesStandarization in Proteomics: From raw data to metadata files
Standarization in Proteomics: From raw data to metadata files
 
E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016
 
Distributed messaging through Kafka
Distributed messaging through KafkaDistributed messaging through Kafka
Distributed messaging through Kafka
 
Research Shared: researchobject.org
Research Shared: researchobject.orgResearch Shared: researchobject.org
Research Shared: researchobject.org
 
Leveraging NLP and Deep Learning for Document Recommendations in the Cloud
Leveraging NLP and Deep Learning for Document Recommendations in the CloudLeveraging NLP and Deep Learning for Document Recommendations in the Cloud
Leveraging NLP and Deep Learning for Document Recommendations in the Cloud
 
Data Science with the Help of Metadata
Data Science with the Help of MetadataData Science with the Help of Metadata
Data Science with the Help of Metadata
 
NGS: Mapping and de novo assembly
NGS: Mapping and de novo assemblyNGS: Mapping and de novo assembly
NGS: Mapping and de novo assembly
 
BioSD Tutorial 2014 Editition
BioSD Tutorial 2014 EdititionBioSD Tutorial 2014 Editition
BioSD Tutorial 2014 Editition
 
Enabling exploratory data science with Spark and R
Enabling exploratory data science with Spark and REnabling exploratory data science with Spark and R
Enabling exploratory data science with Spark and R
 

Recently uploaded

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Recently uploaded (20)

Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 

Hcls sci disc-isa2rdf

  • 1. ISA project , ISA tools and RDF conversion efforts Philippe Rocca-Serra Oxford University, Oxford, UK HCLS Scientific Discourse call, November, 29th 2010 proccaserra@gmail.com
  • 2. mage-tab | pride-ml | sra-xml | others ISA infrastructure overview
  • 3. A focus on standards... comply to common standards
  • 5. Telling more about experimental design
  • 7. IISAcreator: a tool for reporting studies
  • 9. Making sense: ontology term tagging
  • 10.
  • 11. ISAcreator Configurator provides configurations to ISAcreator... These configurations tell ISAcreator what is the minimum amount of information needed to describe experiments. ISAcreator is packaged with a default set of configurations, however you can create your own... MIAPE XML MIAME XML MIMS XML MIENS XML MIGS XML
  • 12. convert to different formats for submission to public repositories, e.g. MAGE-TAB (for ArrayExpress), PRIDE-ML (for PRIDE) or SRA-XML (for ENA/NCBI) ISA Converter: parsing ISA-TAB documents ->Conversion to Objects
  • 13. Why an RDF conversion? • Interest in federated queries • Harvard collaborators (S. Das,T. Clark,W Hide, O Hoffmann)
  • 14. Why are we doing this? • Experiments where transcription profiling and metaboliting profiling  and liver injury in rodent • Experiments funded by UK BBSRC • Experiments performed by an organization located in the Netherlands • Experiments performed on rodent where there are at least 3 biological replicates per treatment groups • Experiments performed by persons belonging to John smith group.
  • 15. RDF conversion: the plan • Initial focus on representation of experimental design • treatment, perturbation • response variable • Later on, focus on molecular dimension • rely on biordf preliminary work on gene expression (generilized solution)
  • 16. RDF conversion: resources • Identifying Existing Ontological Resources • dc, skos for document metadata • foaf, foafCorp, vcard for Person/Contact • bibo, cito, fabio for Publication references. • swan experiment, obi for material processing, data production & analysis
  • 18. RDF conversion: Experimental graph Credit: Sudeshna Das,Tim Clark, HCLS Sci-Disc, November 2010
  • 19. protocol planned process transcription profiling measurement datum transcript abundance* MOE430_2 design* planning labeled cRNA image Affymetrix has_specific_output has_specific_input is_about utilises instrument is manufacturer of total RNA collecting specimen from organism blood specimen liver specimen skeletal muscle specimen* gonadal adipose tissue specimen* total RNA extraction intraperitoneal administration Rattus norvegicus treated subject* labeling has_specific_output has_specific_input has_specific_output nucleic acid hybridization has_specific_input strain chemical compound has_specific_output has_specific_input has_specific_output has_specific_input independent variable specification dependent variable specification biotin label role duration of exposure DNA microarray feature extraction data transformation specimen role anatomical entity factorial design treated organism* metabolite concentration* metabolite profiling utilises instrument Instrument 5 mm inverse geometry 1H/broadband probe NMR assay bearer_of derives_from is_a has_specific_input realizes has_part has_specific_output utilises instrument image acquisition /scanning free induction decay spectrum* utilises instrument has_specific_input has_specific_output organism bearer_of hybridized microarray slide* transcription measurement function inheres_inrealizes concretizes is_about is_about is_about is_about is_about chemical mixture treated role ? bearer_of has_part study design has_part is_about measuring function intensity of magnetic field number of acquisition extraction phenol phase supernatant* GCRMA normalization* has_specific_input has_specific_output is_a orotic acid* DMSO is_a Wistar rat*; Kyoto rat* is_a 1 day post injection* 14 days post injection* is_a normalized data set has_specific_output has_specific_input has_specific_output is_duration_of waiting realizes/ concretizes some specification Bruker BEST NMR system has_part has_specific_input has_specific_ output is_a has_specific_input has_specific_output measured expression level Transcript metabolite is_about oligonucleotide sequence has_part derives_from is_proxy_for manufacturing RNA is_a is_about transformed data set is_proxy_for has_specific_output realizes complementary nucleotide probe role bearer_of is_about realizes inheres_in 3 days post injection* isatab.sf.net proccaserra@gmail.com
  • 20. Acknowledgements Susanna Sansone, Un. of Oxford Eamonn Maguire, Un. of Oxford SWAN-Data-Experiments working group Sudeshna Das Tim Clark Stephane Corlosquet HCLS working groups