SlideShare uma empresa Scribd logo
1 de 17
DREAM Challenge
 Gustavo Stolovitzky, IBM Computational Biology Center
         Andrea Califano, Columbia University
DREAM

• DREAM is a Dialogue for Reverse Engineering
  Assessments and Methods.
• The main objective is to catalyze the interaction
  between experiment and theory in the area of
  cellular network inference and quantitative model
  building in systems biology
Challenges

• Network Topology and Parameter Inference Challenge
• Sage Bionetworks - DREAM Breast Cancer Prognosis C
• The DREAM-Phil Bowen ALS Prediction Prize4Life
• NCI-DREAM Drug Sensitivity Prediction Challenge
Project/Challenges

•   Network Topology and Parameter Inference challenge
    •   Develop/apply optimization methods including the selection of the
        most informative experiments, to accurately estimate parameters
        and predict outcomes of perturbations in Systems Biology models
        given
        •   In Model 1 the complete structure of the model (including
            expressions for the kinetic rate laws) for a gene regulatory
            network composed of 9 genes. Protein and mRNA are
            explicitly modeled.
        •   In Model 2 an incomplete structure of the model, with missing
            regulatory links, for a gene regulatory network composed of
            11 genes. Here, participants will also have to find the missing
            links. Only proteins are explicitly modeled.
DREAM Phil Bowen ALS Prediction Prize4Life


•   Goal
    •   Challenge is to predict the progression of disease in ALS patients based
        on the patient’s current disease status
    •   Specifically develop an approach to predict a given patient’s disease status
        within a year’s time based on 3 months of data
•   Data
    •   Includes demographics, medical and family history data, functional
        measures, vital signs, and lab data (blood
        chemistry/hematology/urinalysis) collected at multiple times
•   Disease progression will be calculated as the average change in (ALSFRS)
    Amyotrophic Lateral Sclerosis Functional Rating Scale over a year’s time
    from enrollment in a clinical trial
•   At the end of the challenge, the prediction submitted (based on 3 months of
    data) will be compared against the actual ALSFRS slope experienced by the
    patient over a year
Output
•   Improve disease prediction beyond the current capabilities by
    •   Developing more accurate (sensitive and specific) methods of
        predicting progression,
    •   Identifying markers (variables) that would enable a
        determination of expected future disease progression earlier
        on in the course of the disease
•   Validation
    •   Validate the model against a subset of patients that are neither
        part of the training set nor the final validation (test) set.
    •   Submit the actual code written in R language (“Validation
        Code”) and InnoCentive will run the code against the interim
        validation data set
NCI-DREAM Drug Sensitivity Prediction
           Challenge




• Use genomic information to build models capable of
  ranking the sensitivity of cancer cell lines to a set of
  small molecule compounds or their combinations
Sub Challenges
•   Sub Challenge 1
    •   Predict the sensitivity of breast cancer cell lines to previously
        untested compounds
        •   Model capable of ranking the sensitivity of 18 breast cancer
            cell lines to 31 compounds
        •   Challenge in this case is to link the drug effects to the
            underlying genetics of the 53 cell lines.
•   Sub Challenge 2
    •   Predicting compound combinations that have a synergistic
        effect in reducing viability of a DLBCL cell line
        •   Predict the activity of pairs of compounds in the DLBCL LY3
            cell line from expression profiles acquired after treatment
            of the cell line with each of 14 individual compounds
Sage Bionetworks - DREAM Breast Cancer Prognosis
                    Challenge
•   Background
    •   Molecular diagnostics for cancer therapeutic decision making are among
        the most promising applications of genomic technology
    •   Molecular profiles have proved particularly powerful in adding prognosis
        information to standard clinical practice in breast cancer
•   Trends emerging
    •   Genes defining predictive signatures of the same phenotype often do
        not overlap across studies
    •   Predictive signatures are not very robust
    •   No consensus regarding the most accurate signatures or computational
        methods for inferring predictive signatures
    •   No consensus regarding the added value of incorporating molecular
        data in addition to or instead of traditionally used clinical covariates
Goal/Challenge
•   Goal
    •   To assess the accuracy of computational models designed to predict
        breast cancer survival, based on clinical information about the patient's
        tumor as well as genome-wide molecular profiling data including gene
        expression and copy number profiles
•   Challenge
    •   Create a community-based effort to provide an unbiased assessment of
        models and methodologies for the prediction of breast cancer survival
    •   Common dataset will be provided to all participants, with a validation
        dataset held out for model evaluation
    •   Novel dataset will be generated at the end of the Challenge and used to
        provide a final, unbiased score for each model
DATA
•   Training data set from METABRIC cohort of 2000 breast cancer samples
    •   Include detailed clinical annotations
    •   10 median year survival time, gene expression and copy number data
•   Additional breast cancer datasets curated by Sage Bionetworks
    •   Can be use in the model development
•   Web based platform called Synapse
    •   Enable transparent reproducible model building and analysis
        workflows as well as sharing of data, tools and models with the
        challenge community
•   Validation dataset
    •   Derived from 300- 500 fresh frozen primary tumors with the same
        clinical annotations and survival data as the METABRIC cohort
DATA
•   Survival data 
    •   Survival data is loaded into R as a Surv object as defined in the R survival package.
    •   This object is simply a 2 column matrix with sample names on the rows and columns:
            •    time – time from diagnosis to last follow up.
            •    status – weather the patient was alive at last follow up time
•   Feature data
    •   Gene expression data.
        •   Performed on the Illumina HT 12v3 platform
        •   Loaded as Bioconductor ExpressionSet object
        •   Data normalized
•   Copy number data.
    •   Performed on the Affymetrix SNP 6.0 platform
    •   Loaded as Bioconductor ExpressionSet object
    •   Data normalized
•   Clinical covariates
    •   Loaded as a data.frame  object with features
Covariates Features
Covariates Features
Submission
•   Models built for this Challenge will be constructed using the
    R programming language and uploaded to a common platform
    (Synapse) provided by Sage Bionetworks
•   Models will be uploaded as R objects implementing a function
    called customPredict() that returns a vector of survival
    predictions when given a set of feature data as input
•   customPredict() function will be run by a validation script for
    each submitted model and resulting predictions will be scored
•   Phase 3 submissions must be accompanied by a write-up that
    includes a short description of the approach used in the final
    model
Scoring
•   Challenge models will be scored by calculating the
    concordance index between the predicted survival and
    the true survival information in the validation dataset
    (accounting for the censor variable indicating whether
    the patient was alive at last follow-up)
•   Final assessment of models and the determination of
    the best performer will be based on the concordance
    index of predictions on the test dataset in Phase 3 of
    the Challenge.
•   In addition, other scoring metrics will be considered
    depending on the suggestions of the community
    throughout the Challenge
Time Line
•   Deadline for submitting models for the Breast Cancer Prognosis Challenge
    is 5PM EST October 15th
•   Best performers will be announced at the DREAM 7 Conference taking
    place in San Francisco on November 12 to 16
•   Final assessment of all models in newly generated data
    •   Additional cohort of 350 breast cancer samples with archived fresh
        frozen tumor samples has been identified by Anne-Lise Borresen-Dale of
        Oslo University Hospital and a generous donation has been made by the
        Avon Foundation to obtain gene expression and copy number data on
        these samples
    •   Currently curation of the clinical records of this patient cohort to
        harmonize with the current METABRIC dataset and generation of the
        genomic profiling data for these samples is being carried on
    •   Aim is to generate these data by the November 12 DREAM conference

Mais conteúdo relacionado

Mais procurados

BRITEREU_finalposter
BRITEREU_finalposterBRITEREU_finalposter
BRITEREU_finalposter
Elsa Fecke
 
NetBioSIG2012 ugurdogrusoz-cbio
NetBioSIG2012 ugurdogrusoz-cbioNetBioSIG2012 ugurdogrusoz-cbio
NetBioSIG2012 ugurdogrusoz-cbio
Alexander Pico
 

Mais procurados (20)

Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Systems genetics approaches to understand complex traits
Systems genetics approaches to understand complex traitsSystems genetics approaches to understand complex traits
Systems genetics approaches to understand complex traits
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 
dream
dreamdream
dream
 
NetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David AmarNetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David Amar
 
AI in Bioinformatics
AI in BioinformaticsAI in Bioinformatics
AI in Bioinformatics
 
BRITEREU_finalposter
BRITEREU_finalposterBRITEREU_finalposter
BRITEREU_finalposter
 
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
 
Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management
 
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
 
NetBioSIG2012 ugurdogrusoz-cbio
NetBioSIG2012 ugurdogrusoz-cbioNetBioSIG2012 ugurdogrusoz-cbio
NetBioSIG2012 ugurdogrusoz-cbio
 
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
 
Digital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineDigital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision Medicine
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
Computational biology
Computational biologyComputational biology
Computational biology
 
iOmics
iOmicsiOmics
iOmics
 
computer simulation in pharmacokinetics and pharmacodynamics
 computer simulation in pharmacokinetics and pharmacodynamics computer simulation in pharmacokinetics and pharmacodynamics
computer simulation in pharmacokinetics and pharmacodynamics
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
 
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA SequenceEfficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
 

Destaque

HAP - Measuring Effectiveness in international development
HAP - Measuring Effectiveness in international developmentHAP - Measuring Effectiveness in international development
HAP - Measuring Effectiveness in international development
NIDOS
 
2014-DREAM-Certificates-OS
2014-DREAM-Certificates-OS2014-DREAM-Certificates-OS
2014-DREAM-Certificates-OS
Tatiana Karelina
 

Destaque (10)

H-Maps: An Efficient Approach for Graphical Visualization and Navigation of T...
H-Maps: An Efficient Approach for Graphical Visualization and Navigation of T...H-Maps: An Efficient Approach for Graphical Visualization and Navigation of T...
H-Maps: An Efficient Approach for Graphical Visualization and Navigation of T...
 
Creating Topic Maps Ontologies for Space Experiments
Creating Topic Maps Ontologies for Space ExperimentsCreating Topic Maps Ontologies for Space Experiments
Creating Topic Maps Ontologies for Space Experiments
 
Computational and Neural Engineering Journal - SciDocPublishers
Computational and Neural Engineering Journal - SciDocPublishersComputational and Neural Engineering Journal - SciDocPublishers
Computational and Neural Engineering Journal - SciDocPublishers
 
HAP - Measuring Effectiveness in international development
HAP - Measuring Effectiveness in international developmentHAP - Measuring Effectiveness in international development
HAP - Measuring Effectiveness in international development
 
Creating a new language to support open innovation
Creating a new language to support open innovationCreating a new language to support open innovation
Creating a new language to support open innovation
 
current evidence based treatment for gynecologic cancer
current evidence based treatment for gynecologic cancercurrent evidence based treatment for gynecologic cancer
current evidence based treatment for gynecologic cancer
 
Large scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biologyLarge scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biology
 
01 hap4 4-space - Hourly Analysis Program - Carrier
01 hap4 4-space - Hourly Analysis Program - Carrier01 hap4 4-space - Hourly Analysis Program - Carrier
01 hap4 4-space - Hourly Analysis Program - Carrier
 
2014-DREAM-Certificates-OS
2014-DREAM-Certificates-OS2014-DREAM-Certificates-OS
2014-DREAM-Certificates-OS
 
Computational Approaches to Systems Biology
Computational Approaches to Systems BiologyComputational Approaches to Systems Biology
Computational Approaches to Systems Biology
 

Semelhante a DREAM Challenge

coad_machine_learning
coad_machine_learningcoad_machine_learning
coad_machine_learning
Ford Sleeman
 
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
DataScienceConferenc1
 

Semelhante a DREAM Challenge (20)

Bioinformatics-R program의 실례
Bioinformatics-R program의 실례Bioinformatics-R program의 실례
Bioinformatics-R program의 실례
 
Predicting Life Expectancy of Hepatitis B Patients
Predicting Life Expectancy of Hepatitis B PatientsPredicting Life Expectancy of Hepatitis B Patients
Predicting Life Expectancy of Hepatitis B Patients
 
BFRG AI Investor Aug 2023
BFRG AI Investor Aug 2023BFRG AI Investor Aug 2023
BFRG AI Investor Aug 2023
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
 
Cancer Care using Quahog Health Decision System
Cancer Care using Quahog Health Decision SystemCancer Care using Quahog Health Decision System
Cancer Care using Quahog Health Decision System
 
coad_machine_learning
coad_machine_learningcoad_machine_learning
coad_machine_learning
 
Datamining in BreastCancer.pptx
Datamining in BreastCancer.pptxDatamining in BreastCancer.pptx
Datamining in BreastCancer.pptx
 
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
 
COMPUTER SIMULATIONS IN PHARMACOKINETICS AND PHARMACODYNAMICS
COMPUTER SIMULATIONS INPHARMACOKINETICS ANDPHARMACODYNAMICSCOMPUTER SIMULATIONS INPHARMACOKINETICS ANDPHARMACODYNAMICS
COMPUTER SIMULATIONS IN PHARMACOKINETICS AND PHARMACODYNAMICS
 
Pathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer SurveillancePathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer Surveillance
 
Artificial Intelligence in pathology
Artificial Intelligence in pathologyArtificial Intelligence in pathology
Artificial Intelligence in pathology
 
DataMining Techniques in BreastCancer.pptx
DataMining Techniques in BreastCancer.pptxDataMining Techniques in BreastCancer.pptx
DataMining Techniques in BreastCancer.pptx
 
computer simulations.pptx
computer simulations.pptxcomputer simulations.pptx
computer simulations.pptx
 
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekGenomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
 
Lisa_Grimm
Lisa_GrimmLisa_Grimm
Lisa_Grimm
 
PSN for Precision Medicine
PSN for Precision MedicinePSN for Precision Medicine
PSN for Precision Medicine
 
cloudapplications.pptx
cloudapplications.pptxcloudapplications.pptx
cloudapplications.pptx
 
Cloud applications - Protein Structure Predication and gene expression data...
Cloud applications - Protein Structure Predication  and  gene expression data...Cloud applications - Protein Structure Predication  and  gene expression data...
Cloud applications - Protein Structure Predication and gene expression data...
 
Bioinformatics tools for development, analysis, and preclinical testing of in...
Bioinformatics tools for development, analysis, and preclinical testing of in...Bioinformatics tools for development, analysis, and preclinical testing of in...
Bioinformatics tools for development, analysis, and preclinical testing of in...
 
16
1616
16
 

Último

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 

Último (20)

Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 

DREAM Challenge

  • 1. DREAM Challenge Gustavo Stolovitzky, IBM Computational Biology Center Andrea Califano, Columbia University
  • 2. DREAM • DREAM is a Dialogue for Reverse Engineering Assessments and Methods. • The main objective is to catalyze the interaction between experiment and theory in the area of cellular network inference and quantitative model building in systems biology
  • 3. Challenges • Network Topology and Parameter Inference Challenge • Sage Bionetworks - DREAM Breast Cancer Prognosis C • The DREAM-Phil Bowen ALS Prediction Prize4Life • NCI-DREAM Drug Sensitivity Prediction Challenge
  • 4. Project/Challenges • Network Topology and Parameter Inference challenge • Develop/apply optimization methods including the selection of the most informative experiments, to accurately estimate parameters and predict outcomes of perturbations in Systems Biology models given • In Model 1 the complete structure of the model (including expressions for the kinetic rate laws) for a gene regulatory network composed of 9 genes. Protein and mRNA are explicitly modeled. • In Model 2 an incomplete structure of the model, with missing regulatory links, for a gene regulatory network composed of 11 genes. Here, participants will also have to find the missing links. Only proteins are explicitly modeled.
  • 5. DREAM Phil Bowen ALS Prediction Prize4Life • Goal • Challenge is to predict the progression of disease in ALS patients based on the patient’s current disease status • Specifically develop an approach to predict a given patient’s disease status within a year’s time based on 3 months of data • Data • Includes demographics, medical and family history data, functional measures, vital signs, and lab data (blood chemistry/hematology/urinalysis) collected at multiple times • Disease progression will be calculated as the average change in (ALSFRS) Amyotrophic Lateral Sclerosis Functional Rating Scale over a year’s time from enrollment in a clinical trial • At the end of the challenge, the prediction submitted (based on 3 months of data) will be compared against the actual ALSFRS slope experienced by the patient over a year
  • 6. Output • Improve disease prediction beyond the current capabilities by • Developing more accurate (sensitive and specific) methods of predicting progression, • Identifying markers (variables) that would enable a determination of expected future disease progression earlier on in the course of the disease • Validation • Validate the model against a subset of patients that are neither part of the training set nor the final validation (test) set. • Submit the actual code written in R language (“Validation Code”) and InnoCentive will run the code against the interim validation data set
  • 7. NCI-DREAM Drug Sensitivity Prediction Challenge • Use genomic information to build models capable of ranking the sensitivity of cancer cell lines to a set of small molecule compounds or their combinations
  • 8. Sub Challenges • Sub Challenge 1 • Predict the sensitivity of breast cancer cell lines to previously untested compounds • Model capable of ranking the sensitivity of 18 breast cancer cell lines to 31 compounds • Challenge in this case is to link the drug effects to the underlying genetics of the 53 cell lines. • Sub Challenge 2 • Predicting compound combinations that have a synergistic effect in reducing viability of a DLBCL cell line • Predict the activity of pairs of compounds in the DLBCL LY3 cell line from expression profiles acquired after treatment of the cell line with each of 14 individual compounds
  • 9. Sage Bionetworks - DREAM Breast Cancer Prognosis Challenge • Background • Molecular diagnostics for cancer therapeutic decision making are among the most promising applications of genomic technology • Molecular profiles have proved particularly powerful in adding prognosis information to standard clinical practice in breast cancer • Trends emerging • Genes defining predictive signatures of the same phenotype often do not overlap across studies • Predictive signatures are not very robust • No consensus regarding the most accurate signatures or computational methods for inferring predictive signatures • No consensus regarding the added value of incorporating molecular data in addition to or instead of traditionally used clinical covariates
  • 10. Goal/Challenge • Goal • To assess the accuracy of computational models designed to predict breast cancer survival, based on clinical information about the patient's tumor as well as genome-wide molecular profiling data including gene expression and copy number profiles • Challenge • Create a community-based effort to provide an unbiased assessment of models and methodologies for the prediction of breast cancer survival • Common dataset will be provided to all participants, with a validation dataset held out for model evaluation • Novel dataset will be generated at the end of the Challenge and used to provide a final, unbiased score for each model
  • 11. DATA • Training data set from METABRIC cohort of 2000 breast cancer samples • Include detailed clinical annotations • 10 median year survival time, gene expression and copy number data • Additional breast cancer datasets curated by Sage Bionetworks • Can be use in the model development • Web based platform called Synapse • Enable transparent reproducible model building and analysis workflows as well as sharing of data, tools and models with the challenge community • Validation dataset • Derived from 300- 500 fresh frozen primary tumors with the same clinical annotations and survival data as the METABRIC cohort
  • 12. DATA • Survival data  • Survival data is loaded into R as a Surv object as defined in the R survival package. • This object is simply a 2 column matrix with sample names on the rows and columns: • time – time from diagnosis to last follow up. • status – weather the patient was alive at last follow up time • Feature data • Gene expression data. • Performed on the Illumina HT 12v3 platform • Loaded as Bioconductor ExpressionSet object • Data normalized • Copy number data. • Performed on the Affymetrix SNP 6.0 platform • Loaded as Bioconductor ExpressionSet object • Data normalized • Clinical covariates • Loaded as a data.frame  object with features
  • 15. Submission • Models built for this Challenge will be constructed using the R programming language and uploaded to a common platform (Synapse) provided by Sage Bionetworks • Models will be uploaded as R objects implementing a function called customPredict() that returns a vector of survival predictions when given a set of feature data as input • customPredict() function will be run by a validation script for each submitted model and resulting predictions will be scored • Phase 3 submissions must be accompanied by a write-up that includes a short description of the approach used in the final model
  • 16. Scoring • Challenge models will be scored by calculating the concordance index between the predicted survival and the true survival information in the validation dataset (accounting for the censor variable indicating whether the patient was alive at last follow-up) • Final assessment of models and the determination of the best performer will be based on the concordance index of predictions on the test dataset in Phase 3 of the Challenge. • In addition, other scoring metrics will be considered depending on the suggestions of the community throughout the Challenge
  • 17. Time Line • Deadline for submitting models for the Breast Cancer Prognosis Challenge is 5PM EST October 15th • Best performers will be announced at the DREAM 7 Conference taking place in San Francisco on November 12 to 16 • Final assessment of all models in newly generated data • Additional cohort of 350 breast cancer samples with archived fresh frozen tumor samples has been identified by Anne-Lise Borresen-Dale of Oslo University Hospital and a generous donation has been made by the Avon Foundation to obtain gene expression and copy number data on these samples • Currently curation of the clinical records of this patient cohort to harmonize with the current METABRIC dataset and generation of the genomic profiling data for these samples is being carried on • Aim is to generate these data by the November 12 DREAM conference

Notas do Editor

  1. reverse engineering of gene networks has been on the identification of causal interaction topologies between genes. how can we decide between models having very similar topologies and how do we characterize the actual kinetics of these networks in a way that accurately reflects the causal relationships implied in the proposed topology?
  2. Amyotrophic Lateral Sclerosis (ALS)–also known as Lou Gehrig’s disease (in the US) or Motor Neurone disease (outside the US)–is a fatal neurological disease causing death of the nerve cells in the brain and spinal cord which control voluntary muscle movements.  In the early stages of the disease, it is currently very difficult to determine whether a given patient will experience slow or fast disease progression. lack of specific and reliable predictors.
  3. challenge 1 - genomic characterization of 53 cell lines;GI50 concentrations for 31 compounds on 35 cell lines; 18 cell lines for which GI 50 concentrations are not given DLBCL - diffuse large B cell lymphoma cell lines
  4. covariate - may be of direct interest or it may be a confounding or interacting variable.
  5. METABRIC - Molecular taxonomy of Breast Cancer International Consortium