SlideShare uma empresa Scribd logo
1 de 38
Baixar para ler offline
Single Cell Sequencing
for Drug Discovery:
Applications and
Challenges
Sarah Middleton
Computational Biologist, GSK
Image: Quanta Magazine
Single Cell Sequencing
for Drug Discovery:
Applications and
Challenges
Sarah Middleton
Computational Biologist, GSK
Image: Quanta Magazine
Target validation
• Perturbation ‘omics analysis (e.g. CRISPR)
• In vitro / in vivo expression analysis
Target identification
• Genetic disease association (GWAS)
• Disease vs healthy gene expression
• Pathway/network analysis
Pre-clinical development
• Biomarker prediction
• Animal model data analysis
• Patient stratification
Post-market
research
• Drug repurposing
• Combination therapy
Clinical trials
• Patient stratification
• Medical image analysis
Lead optimization &
Candidate selection
• Mechanism of action
• Pharmacodynamics
Compound screening
• Tractability prediction
• Assay development/analysis
• Rational design
Images adapted from: www.researchamerica.org; Owens, B. (2012) Nature
Computational biology in pharma R&D
Target validation
• Perturbation ‘omics analysis (e.g. CRISPR)
• In vitro / in vivo expression analysis
Target identification
• Genetic disease association (GWAS)
• Disease vs healthy gene expression
• Pathway/network analysis
Pre-clinical development
• Biomarker prediction
• Animal model data analysis
• Patient stratification
Post-market
research
• Drug repurposing
• Combination therapy
Clinical trials
• Patient stratification
• Medical image analysis
Lead optimization &
Candidate selection
• Mechanism of action
• Pharmacodynamics
Compound screening
• Tractability prediction
• Assay development/analysis
• Rational design
Images adapted from: www.researchamerica.org; Owens, B. (2012) Nature
Computational biology in pharma R&D
‘Omics data:
• Genome (DNA-seq)
• Transcriptome (RNA-seq;
aka “gene expression”)
• Proteome (mass spec)
• Metabolome, interactome, ...
Gene expression
analysis
(RNA-seq)
Image adapted from: Grun et al. (2015) Nature
“Single cell”
Gene expression
analysis
(RNA-seq)
Image adapted from: Grun et al. (2015) Nature
Millions
of cells
“Bulk”
“Single cell”
Gene expression
analysis
(RNA-seq)
Image adapted from: Grun et al. (2015) Nature
Millions
of cells
“Bulk”
“Single cell”
Cell 1 Cell 2 Cell 3
Image adapted from: Owens, B. (2012) Nature
RNA
RNA
RNA
Image adapted from: Owens, B. (2012) Nature
Reverse Transcription:
~50% capture efficiency
Several other steps:
80-90% efficiency each
Overall: 10-20%
RNA molecules
(~100-300k/cell)
RNA
RNA
RNA
Single cell application: mechanisms of therapy-resistant cancer
Image: Valdes-More et al. (2018) Frontiers in Immunology
Single cell application: mechanisms of therapy-resistant cancer
Image: Valdes-More et al. (2018) Frontiers in Immunology
Cancer patients
Responders Non-responders
Immunotherapy
(activates immune response
to fight the cancer)
Collect tumor
single cells
before and after
treatment
Single cell application: mechanisms of therapy-resistant cancer
Image: Valdes-More et al. (2018) Frontiers in Immunology
Cancer patients
Responders Non-responders
Immunotherapy
(activates immune response
to fight the cancer)
Collect tumor
single cells
before and after
treatment
Pre-treatment cells
Identify predictive
markers of response
Post-treatment cells
Identify mechanisms of
immunotherapy
resistance
Single cell application: mechanisms of therapy-resistant cancer
Image: Valdes-More et al. (2018) Frontiers in Immunology
CDK4/6 inhibitors as potential combination therapy to improve response
Jerby-Arnon et al. (2018) Cell
Single cell application: Disease-specific cell types in Alzheimer’s
Keren-Shaul et al. (2017) Cell
Single cell application: Disease-specific cell types in Alzheimer’s
Keren-Shaul et al. (2017) Cell
Single cell application: Disease-specific cell types in Alzheimer’s
Keren-Shaul et al. (2017) Cell
Single cell application: Disease-specific cell types in Alzheimer’s
Keren-Shaul et al. (2017) Cell
Regular
microglia
Disease-specific microglia
Activated; digest
aggregates, plaques
Negative
feedback
(checkpoints)
Scaling up single cell analysis
Images: Angerer et al. (2017) Curr Op in Sys Biol; Broad Inst; darwinian-medicine.com
1.3 million neurons
from mouse brain
Scaling up single cell analysis
Goal: create a map of all cells in the human body
Images: Angerer et al. (2017) Curr Op in Sys Biol; Broad Inst; darwinian-medicine.com
1.3 million neurons
from mouse brain
Several trillion cells; hundreds of cell types
Scaling up single cell analysis
Goal: create a map of all cells in the human body
Images: Angerer et al. (2017) Curr Op in Sys Biol; Broad Inst; darwinian-medicine.com
1.3 million neurons
from mouse brain
Human microbiome
100 trillion microbes/person
Several trillion cells; hundreds of cell types
Scaling up single cell analysis
Wolf et al. (2018) Genome Biology
HDF5 files for large ‘omics data
n > p
n = observations (cells, 100k-1mil+)
p = features (genes, 20k)
https://towardsdatascience.com/deep-learning-for-single-cell-biology-935d45064438; Lin et al. (2017) Nucleic Acids Research; Shaham et al. (2017) Bioinformatics
Autoencoder (AE) for non-linear
dimensionality reduction
PCA AE
Big data has advantages! Neural networks for cell type identification
Residual neural networks for batch effect correction
The missing data problem
Genes
Cells
Truth
Observed
Undercounting & “dropout”
Color = # molecules
van Dijk et al. (2018) Cell
Reverse Transcription:
~50% capture efficiency
Several other steps:
80-90% efficiency each
Overall: 10-20%
RNA molecules
(~100-300k/cell) Dropout
The missing data problem
1. Model missing information
2. Imputation
van Dijk et al. (2018) Cell; Arisdakessian et al. (2018) bioRxiv
Missing data makes cell subtype identification difficult
Lambrechts et al. (2018) Nat Medicine; Russ et al. (2013) Frontiers in Genetics
Coarse-grained types separate well...
tsne1
tsne2
Missing data makes cell subtype identification difficult
Lambrechts et al. (2018) Nat Medicine; Russ et al. (2013) Frontiers in Genetics
Coarse-grained types separate well...
tsne1
tsne2
Missing data makes cell subtype identification difficult
Lambrechts et al. (2018) Nat Medicine; Russ et al. (2013) Frontiers in Genetics
Coarse-grained types separate well...
...But finer subdivisions can be murky
tsne1
tsne2
tsne1
tsne2
T cells
Hirahara (2016) Int Immunol; Lloyd and Hessel (2010) Nat Rev Immunol
Hirahara (2016) Int Immunol; Lloyd and Hessel (2010) Nat Rev Immunol
T cell subtypes in Asthma
Example: CD4+ T cells from lung
Russ et al. (2013) Frontiers in Genetics
Lung biopsy Blood
CD4+
T cells
CD8+
T cells
CD45+
T cells
Healthy Asthma
T regulatory cells (“Tregs”)
Immune-suppressive functions
 Difference in Treg abundance between healthy and asthma?
 Difference in Treg gene expression between healthy and asthma?
Example: CD4+ T cells from lung
Cells belonging to each donor
tsne1
tsne2
Result: can’t confidently identify sub-types of CD4 T cells
Two problems:
1. Marker only detected in a few cells (likely due to dropouts)
2. Clustering is driven by donor (likely due to batch effects)
Treg marker expression
tsne1
tsne2
Simple approach to improve cell type identification
1. Aggregated info gives a clearer picture
• Individual gene expression is noisy & prone
to dropout
• Combining signal from multiple genes can be
more reliable
2. Biologically relevant genes are better for
clustering
• Variation less dominated by technical/batch
effects
Two concepts that help:
Summed expr of 21
Treg-related genes
Treg
markers
Simple approach to improve cell type identification
A better feature space for cell type
identification
1. Define a set of axes to measure similarity of
each cell to various cell types or functional
states
2. Place cells into the space according to their
scores for each axis & identify clusters
Axis 1: “Treg-ness” = FOXP3 + CTLA4 + IL10 + ...
Axis 4: “Exhaustion” = PDCD1 + CD244 + CD160 + ...
Axis 3: “G2/M Phase” = CDK1 + CCNA2 + CCB1 + ...
Axis 2: “Th1-ness” = TBX21 + CXCR3 + CCR5 + ...
...
Th1-ness
Treg-ness
Exhaustion
“Marker space”
Improved cell type identification
Clusters not driven by patient!
Before
After
tSNE of marker space
tsne1
tsne2
Improved cell type identification
Th1-nessTreg-nessTh17 / Th22-ness
Terminal differentiationAnti-inflammatoryType II Interferon Response
Now we can compare treatment groups to identify:
• Changes in cell type composition
• Changes gene expression within a cell type
Before
After
tSNE of marker space
tsne1
tsne2
Future directions
Cell type,
functional state
Image: Broad Inst

Mais conteúdo relacionado

Mais procurados

Liquid biopsy: Overcome Challenges of Circulating DNA with Automated and Stan...
Liquid biopsy: Overcome Challenges of Circulating DNA with Automated and Stan...Liquid biopsy: Overcome Challenges of Circulating DNA with Automated and Stan...
Liquid biopsy: Overcome Challenges of Circulating DNA with Automated and Stan...
QIAGEN
 

Mais procurados (20)

Liquid biopsy
Liquid biopsyLiquid biopsy
Liquid biopsy
 
DNA Sequencing from Single Cell
DNA Sequencing from Single CellDNA Sequencing from Single Cell
DNA Sequencing from Single Cell
 
Applications of Single Cell Analysis
Applications of Single  Cell AnalysisApplications of Single  Cell Analysis
Applications of Single Cell Analysis
 
Introduction to Single-cell RNA-seq
Introduction to Single-cell RNA-seqIntroduction to Single-cell RNA-seq
Introduction to Single-cell RNA-seq
 
Liquid Biopsy
Liquid BiopsyLiquid Biopsy
Liquid Biopsy
 
Liquid biopsy
Liquid biopsyLiquid biopsy
Liquid biopsy
 
Single cell RNA sequencing; Methods and applications
Single cell RNA sequencing; Methods and applicationsSingle cell RNA sequencing; Methods and applications
Single cell RNA sequencing; Methods and applications
 
Cish
CishCish
Cish
 
Multi Omics Approach in Medicine
Multi Omics Approach in MedicineMulti Omics Approach in Medicine
Multi Omics Approach in Medicine
 
In silico structure prediction
In silico structure predictionIn silico structure prediction
In silico structure prediction
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysis
 
Circulating tumor cells
Circulating tumor cellsCirculating tumor cells
Circulating tumor cells
 
Gene expression profiling
Gene expression profilingGene expression profiling
Gene expression profiling
 
The collection and processing of hematopoietic
The collection and processing of hematopoieticThe collection and processing of hematopoietic
The collection and processing of hematopoietic
 
Circulating tumor cells in crc
Circulating tumor cells in crcCirculating tumor cells in crc
Circulating tumor cells in crc
 
CTCs - Circulating Tumor Cells
CTCs - Circulating Tumor CellsCTCs - Circulating Tumor Cells
CTCs - Circulating Tumor Cells
 
Bioinformatics .pptx
Bioinformatics .pptxBioinformatics .pptx
Bioinformatics .pptx
 
Liquid biopsy: Overcome Challenges of Circulating DNA with Automated and Stan...
Liquid biopsy: Overcome Challenges of Circulating DNA with Automated and Stan...Liquid biopsy: Overcome Challenges of Circulating DNA with Automated and Stan...
Liquid biopsy: Overcome Challenges of Circulating DNA with Automated and Stan...
 
ChipSeq Data Analysis
ChipSeq Data AnalysisChipSeq Data Analysis
ChipSeq Data Analysis
 
PD-L1 testing
PD-L1 testingPD-L1 testing
PD-L1 testing
 

Semelhante a Single-Cell Sequencing for Drug Discovery: Applications and Challenges

Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
Ian Foster
 

Semelhante a Single-Cell Sequencing for Drug Discovery: Applications and Challenges (20)

Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management
 
Big Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesBig Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use Cases
 
Challenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical researchChallenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical research
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
PSB 2018 presentation
PSB 2018 presentationPSB 2018 presentation
PSB 2018 presentation
 
Digital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineDigital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision Medicine
 
Zen and the Art of Data Science Maintenance
Zen and the Art of Data Science MaintenanceZen and the Art of Data Science Maintenance
Zen and the Art of Data Science Maintenance
 
Mason abrf single_cell_2017
Mason abrf single_cell_2017Mason abrf single_cell_2017
Mason abrf single_cell_2017
 
Visual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient StratificationVisual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient Stratification
 
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
 
Prof. Mark Coles (Oxford University) - Data-driven systems medicine
Prof. Mark Coles (Oxford University) - Data-driven systems medicineProf. Mark Coles (Oxford University) - Data-driven systems medicine
Prof. Mark Coles (Oxford University) - Data-driven systems medicine
 
Slides for st judes
Slides for st judesSlides for st judes
Slides for st judes
 
Bioinformatics-R program의 실례
Bioinformatics-R program의 실례Bioinformatics-R program의 실례
Bioinformatics-R program의 실례
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416
 
NTU-2019
NTU-2019NTU-2019
NTU-2019
 
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
 
AI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersAI and whole slide imaging biomarkers
AI and whole slide imaging biomarkers
 
Translating next generation sequencing to practice
Translating next generation sequencing to practiceTranslating next generation sequencing to practice
Translating next generation sequencing to practice
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 

Mais de inside-BigData.com

Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
inside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
inside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
inside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
inside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
inside-BigData.com
 

Mais de inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Último

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 

Single-Cell Sequencing for Drug Discovery: Applications and Challenges

  • 1. Single Cell Sequencing for Drug Discovery: Applications and Challenges Sarah Middleton Computational Biologist, GSK Image: Quanta Magazine
  • 2.
  • 3. Single Cell Sequencing for Drug Discovery: Applications and Challenges Sarah Middleton Computational Biologist, GSK Image: Quanta Magazine
  • 4. Target validation • Perturbation ‘omics analysis (e.g. CRISPR) • In vitro / in vivo expression analysis Target identification • Genetic disease association (GWAS) • Disease vs healthy gene expression • Pathway/network analysis Pre-clinical development • Biomarker prediction • Animal model data analysis • Patient stratification Post-market research • Drug repurposing • Combination therapy Clinical trials • Patient stratification • Medical image analysis Lead optimization & Candidate selection • Mechanism of action • Pharmacodynamics Compound screening • Tractability prediction • Assay development/analysis • Rational design Images adapted from: www.researchamerica.org; Owens, B. (2012) Nature Computational biology in pharma R&D
  • 5. Target validation • Perturbation ‘omics analysis (e.g. CRISPR) • In vitro / in vivo expression analysis Target identification • Genetic disease association (GWAS) • Disease vs healthy gene expression • Pathway/network analysis Pre-clinical development • Biomarker prediction • Animal model data analysis • Patient stratification Post-market research • Drug repurposing • Combination therapy Clinical trials • Patient stratification • Medical image analysis Lead optimization & Candidate selection • Mechanism of action • Pharmacodynamics Compound screening • Tractability prediction • Assay development/analysis • Rational design Images adapted from: www.researchamerica.org; Owens, B. (2012) Nature Computational biology in pharma R&D ‘Omics data: • Genome (DNA-seq) • Transcriptome (RNA-seq; aka “gene expression”) • Proteome (mass spec) • Metabolome, interactome, ...
  • 6. Gene expression analysis (RNA-seq) Image adapted from: Grun et al. (2015) Nature “Single cell”
  • 7. Gene expression analysis (RNA-seq) Image adapted from: Grun et al. (2015) Nature Millions of cells “Bulk” “Single cell”
  • 8. Gene expression analysis (RNA-seq) Image adapted from: Grun et al. (2015) Nature Millions of cells “Bulk” “Single cell” Cell 1 Cell 2 Cell 3
  • 9. Image adapted from: Owens, B. (2012) Nature RNA RNA RNA
  • 10. Image adapted from: Owens, B. (2012) Nature Reverse Transcription: ~50% capture efficiency Several other steps: 80-90% efficiency each Overall: 10-20% RNA molecules (~100-300k/cell) RNA RNA RNA
  • 11. Single cell application: mechanisms of therapy-resistant cancer Image: Valdes-More et al. (2018) Frontiers in Immunology
  • 12. Single cell application: mechanisms of therapy-resistant cancer Image: Valdes-More et al. (2018) Frontiers in Immunology Cancer patients Responders Non-responders Immunotherapy (activates immune response to fight the cancer) Collect tumor single cells before and after treatment
  • 13. Single cell application: mechanisms of therapy-resistant cancer Image: Valdes-More et al. (2018) Frontiers in Immunology Cancer patients Responders Non-responders Immunotherapy (activates immune response to fight the cancer) Collect tumor single cells before and after treatment Pre-treatment cells Identify predictive markers of response Post-treatment cells Identify mechanisms of immunotherapy resistance
  • 14. Single cell application: mechanisms of therapy-resistant cancer Image: Valdes-More et al. (2018) Frontiers in Immunology CDK4/6 inhibitors as potential combination therapy to improve response Jerby-Arnon et al. (2018) Cell
  • 15. Single cell application: Disease-specific cell types in Alzheimer’s Keren-Shaul et al. (2017) Cell
  • 16. Single cell application: Disease-specific cell types in Alzheimer’s Keren-Shaul et al. (2017) Cell
  • 17. Single cell application: Disease-specific cell types in Alzheimer’s Keren-Shaul et al. (2017) Cell
  • 18. Single cell application: Disease-specific cell types in Alzheimer’s Keren-Shaul et al. (2017) Cell Regular microglia Disease-specific microglia Activated; digest aggregates, plaques Negative feedback (checkpoints)
  • 19. Scaling up single cell analysis Images: Angerer et al. (2017) Curr Op in Sys Biol; Broad Inst; darwinian-medicine.com 1.3 million neurons from mouse brain
  • 20. Scaling up single cell analysis Goal: create a map of all cells in the human body Images: Angerer et al. (2017) Curr Op in Sys Biol; Broad Inst; darwinian-medicine.com 1.3 million neurons from mouse brain Several trillion cells; hundreds of cell types
  • 21. Scaling up single cell analysis Goal: create a map of all cells in the human body Images: Angerer et al. (2017) Curr Op in Sys Biol; Broad Inst; darwinian-medicine.com 1.3 million neurons from mouse brain Human microbiome 100 trillion microbes/person Several trillion cells; hundreds of cell types
  • 22. Scaling up single cell analysis Wolf et al. (2018) Genome Biology HDF5 files for large ‘omics data
  • 23. n > p n = observations (cells, 100k-1mil+) p = features (genes, 20k)
  • 24. https://towardsdatascience.com/deep-learning-for-single-cell-biology-935d45064438; Lin et al. (2017) Nucleic Acids Research; Shaham et al. (2017) Bioinformatics Autoencoder (AE) for non-linear dimensionality reduction PCA AE Big data has advantages! Neural networks for cell type identification Residual neural networks for batch effect correction
  • 25. The missing data problem Genes Cells Truth Observed Undercounting & “dropout” Color = # molecules van Dijk et al. (2018) Cell Reverse Transcription: ~50% capture efficiency Several other steps: 80-90% efficiency each Overall: 10-20% RNA molecules (~100-300k/cell) Dropout
  • 26. The missing data problem 1. Model missing information 2. Imputation van Dijk et al. (2018) Cell; Arisdakessian et al. (2018) bioRxiv
  • 27. Missing data makes cell subtype identification difficult Lambrechts et al. (2018) Nat Medicine; Russ et al. (2013) Frontiers in Genetics Coarse-grained types separate well... tsne1 tsne2
  • 28. Missing data makes cell subtype identification difficult Lambrechts et al. (2018) Nat Medicine; Russ et al. (2013) Frontiers in Genetics Coarse-grained types separate well... tsne1 tsne2
  • 29. Missing data makes cell subtype identification difficult Lambrechts et al. (2018) Nat Medicine; Russ et al. (2013) Frontiers in Genetics Coarse-grained types separate well... ...But finer subdivisions can be murky tsne1 tsne2 tsne1 tsne2 T cells
  • 30. Hirahara (2016) Int Immunol; Lloyd and Hessel (2010) Nat Rev Immunol
  • 31. Hirahara (2016) Int Immunol; Lloyd and Hessel (2010) Nat Rev Immunol T cell subtypes in Asthma
  • 32. Example: CD4+ T cells from lung Russ et al. (2013) Frontiers in Genetics Lung biopsy Blood CD4+ T cells CD8+ T cells CD45+ T cells Healthy Asthma T regulatory cells (“Tregs”) Immune-suppressive functions  Difference in Treg abundance between healthy and asthma?  Difference in Treg gene expression between healthy and asthma?
  • 33. Example: CD4+ T cells from lung Cells belonging to each donor tsne1 tsne2 Result: can’t confidently identify sub-types of CD4 T cells Two problems: 1. Marker only detected in a few cells (likely due to dropouts) 2. Clustering is driven by donor (likely due to batch effects) Treg marker expression tsne1 tsne2
  • 34. Simple approach to improve cell type identification 1. Aggregated info gives a clearer picture • Individual gene expression is noisy & prone to dropout • Combining signal from multiple genes can be more reliable 2. Biologically relevant genes are better for clustering • Variation less dominated by technical/batch effects Two concepts that help: Summed expr of 21 Treg-related genes Treg markers
  • 35. Simple approach to improve cell type identification A better feature space for cell type identification 1. Define a set of axes to measure similarity of each cell to various cell types or functional states 2. Place cells into the space according to their scores for each axis & identify clusters Axis 1: “Treg-ness” = FOXP3 + CTLA4 + IL10 + ... Axis 4: “Exhaustion” = PDCD1 + CD244 + CD160 + ... Axis 3: “G2/M Phase” = CDK1 + CCNA2 + CCB1 + ... Axis 2: “Th1-ness” = TBX21 + CXCR3 + CCR5 + ... ... Th1-ness Treg-ness Exhaustion “Marker space”
  • 36. Improved cell type identification Clusters not driven by patient! Before After tSNE of marker space tsne1 tsne2
  • 37. Improved cell type identification Th1-nessTreg-nessTh17 / Th22-ness Terminal differentiationAnti-inflammatoryType II Interferon Response Now we can compare treatment groups to identify: • Changes in cell type composition • Changes gene expression within a cell type Before After tSNE of marker space tsne1 tsne2
  • 38. Future directions Cell type, functional state Image: Broad Inst