Unlocking the Power of FAIR Data Sharing with
ImmPort
Sanchita Bhattacharya
Science Program Lead and Outreach Coordinator, ImmPort
Bioinformatics Project Leader
Bakar Computational Health Sciences Institute
University of California, San Francisco
dkNET Webinar Series, April 12, 2024
ImmPort Team
UCSF
Atul Butte, PI
Sanchita Bhattacharya
Reuben Sarwal
Immune System Sciences
Steven H. Kleinstein
ICF
Srinivas Chepuri
Karen Ketchum
Matthew Strub
Olivier Toujas-Bernate
Alicia Williamson
Peraton
Morgan Crafts
Emma Afferton
Sanjiv Desai
John Campbell
Zhiping Gu
Kate Hypes
Jaya Kannan
Ruth Monteiro
Elizabeth Thomson
Zullinel Trilla-Flores
Vilma Thomas
Sammi Smith
Bryan Walters
Shujia Zhou
Funding Support
National Institute of Allergy and
Infectious Diseases (NIAID)
National Institutes of Health (NIH)
Health and Human Services (HHS)
Contract #: HHSN316201200036W
NIAID
Anupama Gururaj
Quan Chen
Dawei Lin
Outline
• ImmPort – An Overview
• Secondary Data Reuse – Case Studies
Molecular Portraits of the Immune System
Yu et al., Current Opinion in Systems Biology, 2019
Clinical Trial Life Cycle: When to Share Data
Clinical trial milestones:
1. Trial design & registration
2. Participant enrollment
3. Study completion or termination
4. Publication
5. Regulatory application? (5A: yes; 5B: no)
When to share:
• At trial registration: data sharing plan, registration elements
• 12 months after study completion: summary-level results, lay summaries
• 6 months after publication: post-publication data package
• 18 months after study completion: full data package
• 18 months after product abandonment OR 30 days after regulatory approval: post-regulatory data package
Key: metadata, individual participant data, summary data
Sharing Clinical Trial Data: Maximizing Benefits, Minimizing Risk.
http://www.iom.edu/Reports/2015/
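The sharing intervals in this timeline are simple date arithmetic. The sketch below is a minimal illustration, assuming the reading of the figure in which summary-level results are due 12 months after study completion, the full data package 18 months after study completion, and the post-publication data package 6 months after publication. The function names and the example dates are hypothetical and do not come from the report.

```python
import calendar
from datetime import date


def add_months(d, months):
    """Shift a date by whole calendar months, clamping the day to the month's length."""
    index = d.month - 1 + months
    year, month = d.year + index // 12, index % 12 + 1
    day = min(d.day, calendar.monthrange(year, month)[1])
    return date(year, month, day)


def sharing_deadlines(study_completion, publication=None):
    """Map each shared item to its latest sharing date (intervals assumed from the slide)."""
    deadlines = {
        "summary-level results and lay summaries": add_months(study_completion, 12),
        "full data package": add_months(study_completion, 18),
    }
    if publication is not None:
        deadlines["post-publication data package"] = add_months(publication, 6)
    return deadlines


# Hypothetical trial dates, for illustration only.
example = sharing_deadlines(date(2024, 1, 31), publication=date(2024, 8, 31))
for item, due in example.items():
    print(f"{due.isoformat()}  {item}")
```

The post-regulatory data package is left out because its deadline depends on a regulatory decision rather than on a fixed study date.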
Opportunities and Challenges in Democratizing Clinical Research Datasets
Bhattacharya et al., Front Immunology, 2021, PMID: 33936065
The ImmPort data portal was developed to collect and share research and clinical trial data from NIAID/DAIT-funded researchers
ImmPort.org
ImmPort
Ecosystem
Immunophenotyping Assessment in a COVID-19 Cohort (IMPACC)
Serological Sciences Network (SeroNet)
Multisystem Inflammatory Syndrome in Children (MIS-C)
Impact of Initial Influenza Exposure on Immunity in Infants (U01)
Atopic Dermatitis Research Network (ADRN)
Population Genetics Analysis Program
Protective Immunity for Special Populations
HLA Region Genomics in Immune-mediated Diseases
Modeling Immunity for Biodefense
Reagent Development for Innate Immune Receptors
Adjuvant Development Program
Immunity in Neonates and Infants
Asthma and Allergic Diseases Cooperative Research Centers
HLA and KIR Region Genomics in Immune-Mediated Diseases
Cooperative Study Group for Autoimmune Disease Prevention
Immunobiology of Xenotransplantation
Centers for Medical Countermeasures against Radiation Consortium
Inner City Asthma Consortium
Systems Approach to Immunity and Inflammation
Innate Immune Receptors and Adjuvant Discovery Program
Maintenance of Macaque Specific Pathogen-Free Breeding Colonies
Non-human Primate Transplantation Tolerance Cooperative Study Group
Consortium for Food Allergy Research
Development of Sample Sparing Assays for Monitoring Immune Responses (U24)
Asthma and Allergic Diseases Clinical Research Consortium (AADCRC)
The Clinical Islet Transplantation (CIT) Consortium
Autoimmunity Centers of Excellence (ACE)
Clinical Trials in Organ Transplantation (CTOC)
Human Immunology Project Consortium (HIPC)
Collaborative Influenza Vaccine Innovation Centers (CIVICS)
Centers for Research in Emerging and Infectious Diseases (CREID)
Cooperative Centers on Human Immunology
A Multidisciplinary Approach to Study Vaccine-elicited Immunity and Efficacy Against Malaria (MVIE)
ImmPort Shares Data from Major NIAID-funded
Programs and External Organizations
20 Years of
FAIR Data
Sharing
Data Submission Process Promotes FAIR Data
Major Steps in Data Submission for Data Submitters:
• The Study Registration Wizard (SRW) kick-starts the data upload process and captures initial metadata associated with the study
• Data Submission Templates capture associated data and metadata based on study design
Submission templates incorporate controlled vocabulary terms from clinical and research ontologies.
Data Model
Adherence to FAIR principles increases the visibility of your data!
Findable
https://immport.org/shared/search
ImmPort Search – Cohort Discovery Tool (CDT) Additional Repositories and Search Engines
ImmPort Shared Data Browser (Cohort Discovery Tool)
• ImmPort currently shares over 900 studies spanning a range of research areas, species, and assay types, including data from 181 clinical trials.
https://immport.org/shared/search
Accessible
• ImmPort study metadata (CDT Search) is
browsable without login
• Registration and acceptance of Data Use
Agreement is required to upload or download data
• Registration is free, simple, and immediate
ImmPort Registration & Login
https://www.immport.org/auth/login
ImmPort Application Programming Interfaces (APIs)
• ImmPort offers several APIs with detailed documentation for use
https://docs.immport.org/apidocumentation/
Interoperability with Other Resources
• To further interoperability,
ImmPort data is being mapped to
Fast Healthcare Interoperability
Resources (FHIR) format
• Users can explore ImmPort data in
FHIR format using the ImmPort
HAPI FHIR server
https://fhir.immport.org/
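As a rough illustration of what exploring data in FHIR format involves, the sketch below builds a standard FHIR search URL and pulls the resources out of a searchset Bundle. The base path `/fhir/`, the choice of the `ResearchStudy` resource type, and the example Bundle are assumptions made for illustration; consult the server at the address above for the actual base URL and the resource types it exposes. No network request is made.

```python
import json
from urllib.parse import urlencode, urljoin

# Assumption: hypothetical REST base path; the real one may differ.
FHIR_BASE = "https://fhir.immport.org/fhir/"


def search_url(resource_type, **params):
    """Build a FHIR search URL of the form [base]/[type]?name=value."""
    url = urljoin(FHIR_BASE, resource_type)
    query = urlencode(sorted(params.items()))
    return f"{url}?{query}" if query else url


def resources_from_bundle(bundle):
    """Return the resources carried in a FHIR Bundle's entry list."""
    if bundle.get("resourceType") != "Bundle":
        raise ValueError("expected a FHIR Bundle")
    return [entry["resource"] for entry in bundle.get("entry", [])]


# Hand-written Bundle for illustration; this is not real ImmPort data.
example_bundle = json.loads("""
{
  "resourceType": "Bundle",
  "type": "searchset",
  "entry": [
    {"resource": {"resourceType": "ResearchStudy", "id": "example-1",
                  "title": "Hypothetical study"}}
  ]
}
""")

print(search_url("ResearchStudy", _count=5))
for resource in resources_from_bundle(example_bundle):
    print(resource["resourceType"], resource["id"])
```

A real client would fetch the URL, parse the JSON response, and follow the Bundle's paging links; those steps are omitted here.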
• ImmPort subject and sample metadata can be mapped to
GEO subject metadata, creating a larger dataset for studies
that have data in both repositories
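The mapping between ImmPort and GEO metadata amounts to a join on a shared identifier. The sketch below assumes that each ImmPort sample record carries a GEO sample accession; the field names (`geo_accession`, `gsm`) and all records are hypothetical and serve only to show the shape of the join.

```python
def link_samples(immport_samples, geo_samples):
    """Merge ImmPort sample records with GEO records that share an accession."""
    geo_by_accession = {g["gsm"]: g for g in geo_samples}
    linked = []
    for sample in immport_samples:
        geo = geo_by_accession.get(sample.get("geo_accession"))
        if geo is None:
            continue  # sample has no counterpart in GEO
        # Prefix GEO fields so they cannot overwrite ImmPort fields.
        extra = {f"geo_{key}": value for key, value in geo.items() if key != "gsm"}
        linked.append({**sample, **extra})
    return linked


# Hypothetical records, for illustration only.
immport_samples = [
    {"subject": "SUB-A", "sample": "SAMPLE-1", "geo_accession": "GSM-X"},
    {"subject": "SUB-B", "sample": "SAMPLE-2", "geo_accession": None},
]
geo_samples = [{"gsm": "GSM-X", "tissue": "PBMC", "age": 34}]

for row in link_samples(immport_samples, geo_samples):
    print(row)
```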
Benefits of Open-Access Immunological Data
Reproducibility
Re-Analyze
Repurpose
Hulsen et al, 2019
Bhattacharya et al., ImmPort, toward repurposing of open access immunological assay data for translational and clinical research.Sci Data. 2018 Feb 27;5:180015.
PMID: 29485622
Nasrallah et al, 2015
Crowdsourcing: Influenza Vaccination Cohorts in ImmPort Database
Data Reuse
10kimmunomes.org
• Large, diverse, cleaned reference dataset
for human immunology
• Interactive data visualization
• Custom control cohorts and standardized
data download
Data available in the 10,000 Immunomes Project
Total Samples: 42117
Total Distinct Subjects: 10344

Measurement                           Subjects
Secreted Proteins                     4835
  ELISA                               4035
  Multiplex ELISA                     1286
Virus Titer                           3609
  Virus Neutralization Titer          2265
  HAI Titer                           1344
Clinical Lab Tests                    2639
  Complete Blood Count                1684
  Comprehensive Metabolic Panel       664
  Fasting Lipid Profile               664
Questionnaire                         1422
Cytometry                             1415
  Flow Cytometry (PBMC)               907
  CyTOF (PBMC)                        583
  Flow Cytometry (Whole Blood)        164
HLA Type                              1093
Gene Expression Array                 476
  Whole Blood                         311
  PBMC                                165
Standardized pipeline for data cleaning and harmonization
85 Studies, 10,344 Subjects, 42,000+ Samples → Standardized Data

CyTOF and Flow Cytometry
- Automatically find positive and negative populations with MetaCyto
- Assign Standardized Cell Subset Names
- Segregate Sample Types
- Batch Correct
- Validate against gold-standard hand-gated populations

Gene Expression
- RMA Background Correct
- Quantile Normalize
- Log2 Normalize
- Assign Probes to Entrez IDs
- Segregate Sample Types
- Combine data based on Entrez IDs
- Batch Correct with ComBat
- Assign HUGO Gene Names

Secreted Proteins
- Standardize Units
- Standardize Protein Names
- Segregate Sample Types
- Correct for Dilution Factor
- Batch Correct

Others (7 Assay Types)
- Standardize Units
- Standardize Names
- Segregate Sample Types
- Batch Correct where Needed
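Two of the gene expression steps named in this pipeline, quantile normalization and the log2 transform, can be sketched in a few lines. This is a minimal, dependency-free illustration of the general technique and not the project's actual code; the function names and the small example matrix are made up, and ties are broken by gene order, which is a simplification.

```python
import math


def quantile_normalize(matrix):
    """Give every sample (column) the same distribution of values.

    matrix[g][s] is the expression of gene g in sample s.
    """
    n_genes, n_samples = len(matrix), len(matrix[0])
    # Sort each sample's values, then average across samples at each rank.
    sorted_cols = [sorted(row[s] for row in matrix) for s in range(n_samples)]
    rank_means = [sum(col[r] for col in sorted_cols) / n_samples
                  for r in range(n_genes)]
    normalized = [[0.0] * n_samples for _ in range(n_genes)]
    for s in range(n_samples):
        # Genes of this sample ordered from lowest to highest value.
        order = sorted(range(n_genes), key=lambda g: matrix[g][s])
        for rank, g in enumerate(order):
            normalized[g][s] = rank_means[rank]
    return normalized


def log2_transform(matrix):
    """Apply log2 to every (strictly positive) value."""
    return [[math.log2(value) for value in row] for row in matrix]


# Made-up intensities: 3 genes x 2 samples.
raw = [[2.0, 8.0],
       [4.0, 32.0],
       [16.0, 64.0]]
normalized = quantile_normalize(raw)
print(normalized)
print(log2_transform(normalized))
```

After normalization both samples share the same set of values, so remaining differences between samples reflect rank changes rather than overall intensity shifts.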
10KImmunomes.org
Docker image available
Example of AI-ready ImmPort Data: Re-analysis of 10K Immunomes
CyTOF Data Using GPT-4
https://onlinelibrary.wiley.com/doi/10.1111/cea.14452
ChatGPT
Prompt:
ChatGPT
Response:
AI can analyze large scale cytometry datasets with ease,
even adjusting for confounding variables
• Age-associated differences in cell types
• Age- and gender-associated effects on cytokines
Additional ChatGPT Response:
ImmPort powered AI-Ready Datasets Coming Soon!!
A convolutional neural network (CNN) for cytometry data
[Figure: CNN architecture. Inputs: CyTOF data (cells × markers) and non-cytometry data. Components: convolution layers, pooling layer, merge layer, dense layers. Output: predictions Y (model output).]
Interpreting the model with Local Interpretable Model-agnostic Explanations (LIME):
Step 1: Up-sample each cell in the data
Step 2: Calculate the changes in model output (ΔY)
Step 3: Identify cells associated with high ΔY
[Figure: original vs. modified data, with marker thresholds (e.g., Marker1 < 2.6) separating cells with low ΔY from cells with high ΔY.]
A robust and interpretable end-to-end deep learning model for
cytometry data
Hu et al., PNAS 2020
Goal: to diagnose latent cytomegalovirus (CMV) infection in healthy
individuals
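The three interpretation steps from this slide (up-sample each cell, measure the change in model output ΔY, keep the cells with high ΔY) can be sketched with a toy model. Everything below is an illustrative assumption: the forward pass is a hand-rolled stand-in (one per-cell filter, mean pooling, one sigmoid output), the weights are untrained, and the data are made up. It is not the published model or its interpretation code.

```python
import math


def forward(cells, filt, bias, w_out, b_out):
    """Toy forward pass: per-cell filter -> ReLU -> mean pooling -> sigmoid."""
    # "Convolution": the same filter is applied to every cell's marker vector.
    activations = [max(0.0, sum(f * m for f, m in zip(filt, cell)) + bias)
                   for cell in cells]
    pooled = sum(activations) / len(activations)  # pooling across cells
    return 1.0 / (1.0 + math.exp(-(w_out * pooled + b_out)))  # dense output


def delta_y_per_cell(cells, copies, **params):
    """Steps 1 and 2: up-sample each cell in turn and record the change ΔY."""
    baseline = forward(cells, **params)
    deltas = []
    for cell in cells:
        modified = cells + [cell] * copies  # Step 1: extra copies of one cell
        deltas.append(forward(modified, **params) - baseline)  # Step 2: ΔY
    return deltas


def top_cells(deltas, k):
    """Step 3: indices of the k cells with the largest |ΔY|."""
    ranked = sorted(range(len(deltas)), key=lambda i: abs(deltas[i]), reverse=True)
    return ranked[:k]


# Hypothetical sample: 4 cells x 2 markers; weights are illustrative, not trained.
cells = [[0.1, 0.2], [0.2, 0.1], [3.0, 0.3], [0.1, 0.1]]
params = dict(filt=[1.0, 0.0], bias=-1.0, w_out=2.0, b_out=-1.0)
deltas = delta_y_per_cell(cells, copies=3, **params)
print(top_cells(deltas, k=1))
```

In this toy setup only the third cell has high expression of the first marker, so duplicating it raises the output while duplicating any other cell lowers it.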
Visualizing Open-Access Living Donor Transplant Data
Chen J et al., JAMA Netw Open. 2019
ImmPort Data Reuse by the Scientific Community
https://www.nature.com/articles/s41586-021-03791-x
ImmPort Data Reuse
https://onlinelibrary.wiley.com/doi/10.1111/cea.14452
ImmuneSpace
HIPC’s ImmuneSpace extends ImmPort,
providing access to additional data (e.g.,
standardized gene expression matrices) and
web-based R tools for data accession,
analysis, and reporting.
Studies in the Immune Signatures Data
Resource are archived through the Shared
Data Portal on ImmPort and ImmuneSpace
repositories and may be updated over time.
https://immunespace.org
Education : Analysis Tutorial
Take Home Messages
• Open-access immunological studies are a valuable resource for testing new in silico hypotheses
and gaining novel insights, and a productive starting point for informing the design of future
experiments
• Holistic approach to analyzing clinical research data
• 10,000 Immunomes Project: a framework for growing a diverse human immunology reference
from ImmPort, a publicly available resource of subject-level immunology data.
• Allows us to learn from the features and candidates we already know.
• Enables us to explore new factors to be discovered.
• A deep convolutional neural network model can accurately diagnose latent cytomegalovirus
(CMV) infection in healthy individuals.
• Expanded uses of crowdsourcing in immunology will allow for more efficient large-scale data
collection and analysis. It will also involve, inspire, educate, and engage the community in a
variety of meaningful ways.
Embrace open-access datasets!
Ways to Stay Updated on ImmPort Activities
https://www.linkedin.com/company/immport/
ImmPort_Helpdesk@immport.org
https://docs.immport.org/home/newsletter/
Monthly Newsletter
ImmPort Office Hours
• ImmPort holds open office hours
sessions on the first Thursday of
each month from 2 PM – 3 PM ET
• Office Hours are a great
opportunity to discuss your
questions directly with the
ImmPort team and learn more
about ImmPort
• All user levels are welcome,
whether new to ImmPort or an
experienced user
https://docs.immport.org/home/events/
Visit the ImmPort Events page to add
ImmPort Office Hours to your calendar
https://www.focisnet.org/education/big-data-in-immunology/
Sanchita.Bhattacharya@ucsf.edu
@sanchitab
Bakar Institute of Computational Health Sciences, UCSF
Thanks!

  • 2. ImmPort Team. UCSF: Atul Butte (PI), Sanchita Bhattacharya, Reuben Sarwal. Immune System Sciences: Steven H. Kleinstein. ICF: Srinivas Chepuri, Karen Ketchum, Matthew Strub, Olivier Toujas-Bernate, Alicia Williamson. Peraton: Morgan Crafts, Emma Afferton, Sanjiv Desai, John Campbell, Zhiping Gu, Kate Hypes, Jaya Kannan, Ruth Monteiro, Elizabeth Thomson, Zullinel Trilla-Flores, Vilma Thomas, Sammi Smith, Bryan Walters, Shujia Zhou. NIAID: Anupama Gururaj, Quan Chen, Dawei Lin. Funding Support: National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH), Health and Human Services (HHS), Contract #: HHSN316201200036W
  • 3. Outline • ImmPort – An Overview • Secondary Data Reuse – Case Studies
  • 4. Molecular Portraits of Immune System. Yu et al., Current Opinion in Systems Biology, 2019
  • 5. Clinical Trial Life Cycle: When to Share Data. [Figure: clinical trial milestones and data-sharing timeline. Milestones: (1) trial design & registration, (2) participant enrollment, (3) study completion or termination, (4) publication, (5) regulatory application? (5A yes, 5B no). Key: metadata, individual participant data, summary data. Sharing points: at trial registration (data sharing plan, registration elements); 12 months after study completion (summary-level results, lay summaries); 18 months after study completion (full data package); 6 months after publication (post-publication data package); 18 months after product abandonment or 30 days after regulatory approval (post-regulatory data package).] Source: Sharing Clinical Trial Data: Maximizing Benefits, Minimizing Risk. http://www.iom.edu/Reports/2015/
  • 6. Opportunities and Challenges in Democratizing Clinical Research Datasets. Bhattacharya et al., Front Immunology, 2021, PMID: 33936065
  • 7. ImmPort Ecosystem. The ImmPort data portal was developed to collect and share research and clinical trials data from NIAID/DAIT funded researchers. ImmPort.org
  • 8. ImmPort Shares Data from Major NIAID-funded Programs and External Organizations (20 Years of FAIR Data Sharing): Immunophenotyping Assessment in a COVID-19 Cohort (IMPACC); Serological Sciences Network (SeroNet); Multisystem Inflammatory Syndrome in Children (MIS-C); Impact of Initial Influenza Exposure on Immunity in Infants (U01); Atopic Dermatitis Research Network (ADRN); Population Genetics Analysis Program; Protective Immunity for Special Populations; HLA Region Genomics in Immune-mediated Diseases; Modeling Immunity for Biodefense; Reagent Development for Innate Immune Receptors; Adjuvant Development Program; Immunity in Neonates and Infants; Asthma and Allergic Diseases Cooperative Research Centers; HLA and KIR Region Genomics in Immune-Mediated Diseases; Cooperative Study Group for Autoimmune Disease Prevention; Immunobiology of Xenotransplantation; Centers for Medical Countermeasures against Radiation Consortium; Inner City Asthma Consortium; Systems Approach to Immunity and Inflammation; Innate Immune Receptors and Adjuvant Discovery Program; Maintenance of Macaque Specific Pathogen-Free Breeding Colonies; Non-human Primate Transplantation Tolerance Cooperative Study Group; Consortium for Food Allergy Research; Development of Sample Sparing Assays for Monitoring Immune Responses (U24); Asthma and Allergic Diseases Clinical Research Consortium (AADCRC); The Clinical Islet Transplantation (CIT) Consortium; Autoimmunity Centers of Excellence (ACE); Clinical Trials in Organ Transplantation (CTOC); Human Immunology Project Consortium (HIPC); Collaborative Influenza Vaccine Innovation Centers (CIVICS); Centers for Research in Emerging and Infectious Diseases (CREID); Cooperative Centers on Human Immunology; A Multidisciplinary Approach to Study Vaccine-elicited Immunity and Efficacy Against Malaria (MVIE)
  • 9. Data Submission Process Promotes FAIR Data. Major Steps in Data Submission for Data Submitters: • The Study Registration Wizard (SRW) kick-starts the data upload process and captures initial metadata associated with the study. • Data Submission Templates capture associated data and metadata based on study design. • Submission templates incorporate controlled vocabulary terms from clinical and research ontologies.
  • 11. Adherence to FAIR principles increases the visibility of your data! Findable https://immport.org/shared/search ImmPort Search – Cohort Discovery Tool (CDT) Additional Repositories and Search Engines
  • 12. ImmPort Shared Data Browser (Cohort Discovery Tool) • ImmPort currently shares over 900 studies spanning a range of research areas, species, and assay types, including data from 181 clinical trials. https://immport.org/shared/search
  • 13.
  • 14. Accessible • ImmPort study metadata (CDT Search) is browsable without login • Registration and acceptance of the Data Use Agreement are required to upload or download data • Registration is free, simple, and immediate. ImmPort Registration & Login: https://www.immport.org/auth/login ImmPort Application Programming Interfaces (APIs) • ImmPort offers several APIs with detailed documentation: https://docs.immport.org/apidocumentation/
  • 15. Interoperability with Other Resources • To further interoperability, ImmPort data is being mapped to Fast Healthcare Interoperability Resources (FHIR) format • Users can explore ImmPort data in FHIR format using the ImmPort HAPI FHIR server https://fhir.immport.org/ • ImmPort subject and sample metadata can be mapped to GEO subject metadata, creating a larger dataset for studies that have data in both repositories
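As a rough illustration of what exploring FHIR-formatted data involves, the sketch below builds a standard FHIR search URL and unpacks a search Bundle. It is a minimal sketch under stated assumptions: the slides give only https://fhir.immport.org/, so the `/fhir/` base path, the choice of the `ResearchStudy` resource type, and the sample Bundle are hypothetical placeholders rather than confirmed details of the ImmPort server. No network request is made.

```python
from urllib.parse import urlencode, urljoin

# Assumption: the exact FHIR base path is not given in the slides;
# "/fhir/" is a hypothetical placeholder.
FHIR_BASE = "https://fhir.immport.org/fhir/"


def build_search_url(base, resource_type, **params):
    """Build a FHIR search URL of the form [base]/[type]?[parameters]."""
    url = urljoin(base, resource_type)
    query = urlencode(sorted(params.items()))
    return f"{url}?{query}" if query else url


def resources_from_bundle(bundle):
    """Return the resources carried in a FHIR search Bundle (a parsed JSON dict)."""
    if bundle.get("resourceType") != "Bundle":
        raise ValueError("expected a FHIR Bundle")
    return [e["resource"] for e in bundle.get("entry", []) if "resource" in e]


# Hypothetical Bundle, shaped like a FHIR "searchset" response.
sample_bundle = {
    "resourceType": "Bundle",
    "type": "searchset",
    "entry": [{"resource": {"resourceType": "ResearchStudy", "id": "example-1"}}],
}

url = build_search_url(FHIR_BASE, "ResearchStudy", _count=5)
studies = resources_from_bundle(sample_bundle)
print(url)
print([s["id"] for s in studies])
```

`_count` is a standard FHIR search parameter that limits page size; which resource types and parameters the ImmPort server actually supports should be checked against its own capability statement.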
  • 16. Benefits of Open-Access Immunological Data: Reproducibility, Re-Analyze, Repurpose. Hulsen et al., 2019; Nasrallah et al., 2015; Bhattacharya et al., ImmPort, toward repurposing of open access immunological assay data for translational and clinical research. Sci Data. 2018 Feb 27;5:180015. PMID: 29485622
  • 17. Data Reuse. Crowdsourcing: Influenza Vaccination Cohorts in ImmPort Database
  • 18. 10kimmunomes.org • Large, diverse, cleaned reference dataset for human immunology • Interactive data visualization • Custom control cohorts and standardized data download
  • 19. Data available in the 10,000 Immunomes Project: 85 studies, 10,344 distinct subjects, 42,117 total samples (standardized data).
    Subjects per measurement type:
    - Secreted Proteins: 4835 (ELISA 4035; Multiplex ELISA 1286)
    - Virus Titer: 3609 (Virus Neutralization Titer 2265; HAI Titer 1344)
    - Clinical Lab Tests: 2639 (Complete Blood Count 1684; Comprehensive Metabolic Panel 664; Fasting Lipid Profile 664)
    - Questionnaire: 1422
    - Cytometry: 1415 (Flow Cytometry (PBMC) 907; CyTOF (PBMC) 583; Flow Cytometry (Whole Blood) 164)
    - HLA Type: 1093
    - Gene Expression Array: 476 (Whole Blood 311; PBMC 165)
    Standardized pipeline for data cleaning and harmonization:
    - CyTOF and Flow Cytometry: automatically find positive and negative populations with MetaCyto; assign standardized cell subset names; segregate sample types; batch correct; validate against gold-standard hand-gated populations
    - Gene Expression: RMA background correct; quantile normalize; log2 normalize; assign probes to Entrez IDs; segregate sample types; combine data based on Entrez IDs; batch correct with ComBat; assign HUGO gene names
    - Secreted Proteins: standardize units; standardize protein names; segregate sample types; correct for dilution factor; batch correct
    - Others (7 assay types): standardize units; standardize names; segregate sample types; batch correct where needed
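To make two of the gene expression steps listed above concrete, here is a minimal pure-Python sketch of quantile normalization followed by a log2 transform. It is an illustration only, not the 10,000 Immunomes pipeline itself: the function names, the pseudocount of 1, and the simple tie handling (ties broken by position rather than averaged) are assumptions made for this example.

```python
import math


def quantile_normalize(samples):
    """Quantile-normalize samples, given as one list of feature values per sample."""
    n = len(samples[0])
    if any(len(s) != n for s in samples):
        raise ValueError("all samples must have the same number of features")
    # Reference distribution: mean of the k-th smallest value across samples.
    sorted_samples = [sorted(s) for s in samples]
    reference = [sum(s[k] for s in sorted_samples) / len(samples) for k in range(n)]
    normalized = []
    for s in samples:
        # Rank each feature within its sample (ties broken by position).
        order = sorted(range(n), key=lambda i: s[i])
        out = [0.0] * n
        for rank, i in enumerate(order):
            out[i] = reference[rank]
        normalized.append(out)
    return normalized


def log2_transform(samples, pseudocount=1.0):
    """Apply log2(value + pseudocount) to every value (pseudocount is an assumption)."""
    return [[math.log2(v + pseudocount) for v in s] for s in samples]


# Two toy samples with three probes each (made-up intensities).
raw = [[5.0, 2.0, 3.0], [4.0, 1.0, 6.0]]
qn = quantile_normalize(raw)
print(qn)
print(log2_transform(qn))
```

After quantile normalization every sample shares the same set of values, so remaining differences between samples reflect only the rank order of probes within each sample.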
  • 21. Example of AI-ready ImmPort Data: Re-analysis of 10K Immunomes CyTOF Data Using GPT4 https://onlinelibrary.wiley.com/doi/10.1111/cea.14452 ChatGPT Prompt: ChatGPT Response: AI can analyze large scale cytometry datasets with ease, even adjusting for confounding variables • Age-associated differences in cell types • Age- and gender-associated effects on cytokines Additional ChatGPT Response:
  • 22. ImmPort powered AI-Ready Datasets Coming Soon!!
  • 23. A robust and interpretable end-to-end deep learning model for cytometry data (Hu et al., PNAS 2020). Goal: to diagnose latent cytomegalovirus (CMV) in healthy individuals. A convolutional neural network (CNN) for cytometry data. [Figure labels: CyTOF data (cells × markers), non-cytometry data, convolution layers, pooling layer, merge layer, dense layers, output predictions (Y, model output).] Explanations (LIME): Step 1: up-sample each cell in the data. Step 2: calculate the changes in model output (ΔY) between the original and modified data. Step 3: identify cells associated with high ΔY.
  • 24. Visualizing Open-Access Living Donor Transplant Data. Chen J et al., JAMA Netw Open. 2019
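The three explanation steps above can be sketched in a few lines. This is a toy illustration under loud assumptions: `toy_model` is a hypothetical stand-in for the trained CNN (it just reports the fraction of cells whose first marker exceeds an arbitrary threshold), and "up-sampling" is read here as appending extra copies of one cell to the sample. It does not reproduce the method of Hu et al.

```python
def toy_model(cells):
    """Hypothetical stand-in for the trained CNN: fraction of cells with marker 0 > 2.6."""
    return sum(1 for c in cells if c[0] > 2.6) / len(cells)


def delta_y_per_cell(cells, model, copies=4):
    """Steps 1 and 2: up-sample each cell in turn and record the change in model output."""
    base = model(cells)
    deltas = []
    for cell in cells:
        modified = cells + [cell] * copies  # original data plus extra copies of one cell
        deltas.append(model(modified) - base)
    return deltas


def cells_with_high_delta(deltas, k=1):
    """Step 3: indices of the k cells with the largest absolute change in output."""
    return sorted(range(len(deltas)), key=lambda i: abs(deltas[i]), reverse=True)[:k]


# Four toy cells with one marker each (made-up values).
cells = [[3.0], [1.0], [1.0], [1.0]]
deltas = delta_y_per_cell(cells, toy_model)
print(deltas)
print(cells_with_high_delta(deltas))
```

The design point is that the model is treated as a black box: only its output before and after the perturbation is compared, so the same loop would apply to any sample-level predictor.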
  • 25. ImmPort Data Reuse by the Scientific Community https://www.nature.com/articles/s41586-021-03791-x
  • 27. ImmuneSpace HIPC’s ImmuneSpace extends ImmPort, providing access to additional data (e.g., standardized gene expression matrices) and web-based R tools for data accession, analysis, and reporting. Studies in the Immune Signatures Data Resource are archived through the Shared Data Portal on ImmPort and ImmuneSpace repositories and may be updated over time. https://immunespace.org
  • 29. Take Home Messages • Open-access immunological studies are a valuable resource for testing new in silico hypotheses and gaining novel insights, and a productive starting point for informing the design of future experiments • Holistic approach to analyzing clinical research data • The 10,000 Immunomes Project is a framework for growing a diverse human immunology reference from ImmPort, a publicly available resource of subject-level immunology data. • It allows us to learn from the features and candidates we already know. • It enables us to explore new factors yet to be discovered. • A deep convolutional neural network model can accurately diagnose latent cytomegalovirus (CMV) in healthy individuals. • Expanded uses of crowdsourcing in immunology will allow for more efficient large-scale data collection and analysis. It will also involve, inspire, educate, and engage the community in a variety of meaningful ways. Embrace open-access datasets!
  • 30. Ways to Stay Updated on ImmPort Activities https://www.linkedin.com/company/immport/ ImmPort_Helpdesk@immport.org https://docs.immport.org/home/newsletter/ Monthly Newsletter
  • 31. ImmPort Office Hours • ImmPort holds open office hours sessions on the first Thursday of each month from 2 PM – 3 PM ET • Office Hours are a great opportunity to discuss your questions directly with the ImmPort team and learn more about ImmPort • All user levels are welcome, whether new to ImmPort or an experienced user https://docs.immport.org/home/events/ Visit the ImmPort Events page to add ImmPort Office Hours to your calendar
  • 33. Thanks! Sanchita.Bhattacharya@ucsf.edu @sanchitab Bakar Institute of Computational Health Sciences, UCSF