SlideShare uma empresa Scribd logo
1 de 24
Digital biomarkers for preventive
personalised healthcare
Paolo Missier
Newcastle University School of Computing
The Team
Prof. Paolo Missier PI, Newcastle University (PI)
Prof. Michael Catt, Newcastle University and Closed Loop Medicine, Cambridge (CO-I)
Dr. Jaume Bacardit, Newcastle University (CO-I)
Key contributors:
Dr. Ossama Alshabrawy (PhD student, now Lecturer at Northumbria University)
Ben Lam, PhD student
Dr. Jacek Cala, Sr. Research Associate
In collaboration with the IMI DIRECT Consortium
https://www.imi.europa.eu/projects-results/project-factsheets/direct
Diabetes research on patient stratification
Data-Driven, Personalised, Predictive, Preventive, Participatory Medicine (D2P4)
Part I:
The role of physical activity monitoring to support Type II Diabetes studies
Can we learn useful representations for a person’s daily activities from accelerometry?
Part II:
Generating synthetic physical activity data
How do we simulate plausible physical activity patterns and why?
Data-Driven, Personalised, Predictive, Preventive, Participatory Medicine (D2P4)
Part I:
The role of physical activity monitoring to support Type II Diabetes studies
Can we learn useful representations for a person’s daily activities from accelerometry?
Main contributors:
Dr. Ossama Alshabrawy (PhD student, now Lecturer at Northumbria University)
Benjamin Lam, PhD student
Activity traces archive from the UK Biobank
Filter:
Accelerometry study?
103,712
Split criteria:
Type 2 Diabetes?
At baseline: 2,755
Through EHR analysis: 1,321
Total: 4,076
Non-Diabetes
99,636
Filter:
EHR data available?
19,852
502, 664
All UK Biobank participants:
Filter:
QC on activity traces
3,103
Positives:
T2D vs Norm-0
Physical Impairment analysis
Severe impairment
1,666
No impairment
8,463
T2D vs Norm-2
Is there enough signal in the traces to
segregate T2D from Norm?
Extracting High Level Activity Features (HLAF)
feature extraction 60 features / day  aggregated to week
(*)
(*) https://github.com/activityMonitoring/biobankAccelerometerAnalysis
Selected results: Clustering
Selected results: classification
Negatives: HLAF SDL HLAF+SDL
Norm-0 Norm-2 Norm-0 Norm-2 Norm-0 Norm-2
RF .80 .68 .83 .78 .86 .77
LR .79 .70 .83 .78 .86 .78
XGB .78 .66 .80 .74 .85 .75
Lam B, Catt M, Cassidy S, Bacardit J, Darke P, Butterfield S, Alshabrawy O, Trenell M, Missier P, Using Wearable Activity Trackers to Predict Type 2 Diabetes: Machine Learning–
Based Cross-sectional Study of the UK Biobank Accelerometer Cohort -- JMIR Diabetes, Vol 6 no1. 19/3/2021:23364
SDL: Socio-Demographic and Lifestyle variables
Lessons learnt
• Signal is weak and noisy when used in the contex of a complex metabolic disease
• “Controls” may actually be physically impaired and this is hard to determine
• UK Biobank had no QC protocol, ”a random week in life” provides poor indicators
Are we mapping raw traces to the best possible feature space?
Learning embedded representation spaces
DIRECT
DB
• ~3,000 individuals total
• Follow-ups at 18 36, 48 months
Representation
learning
Embedded
feature space
LSTM Autoencoder
Covariates,
Outcomes (eg Insulin sensitivity)
Classification
Clustering
Cluster
interpretation
Autoencoder Architecture
LSTM Autoencoder
Final reconstruction loss: 0.46
(early termination, 9/150 epochs to prevent overfitting)
Clustering in the high level and embedded spaces
Embedded features High-level features
K-means
Hierarchical
Affinity Propagation
Spectral clustering
Embedded features High-level features
Embedded features High-level features Embedded features High-level features
Clusters quality
Silhouette Calinski-Harabrasz Davies-Bouldin
Affinity Propagation 0.634 2220.021 0.895
Spectral 0.677 2600.836 0.839
DBSCAN 0.274 73.642 1.808
Hierarchical 0.466 2292.27 0.879
K-means 0.482 2617.19 0.839
Silhouette: Bounded between 0 and 1 (Closer to 1, the better)
Calinski-Harabrasz: Unbounded (The higher the score, the better)
Davies-Bouldin (Not well suited to density methods): Bounded between 0 and 1 (Closer to 0, the better)
Many of the other cluster validity indices require knowledge of the ground truth labels, so this is not suitable for this study
Cluster interpretation: clinical and activity variables
Logistic regression AdaBoost classifier
Random forest classifier XGBoost
2 clusters  binary classification: are the clinical variables good predictors for the clusters?
percent time
light-tasks daily
percent time
sedentary daily
avg num hrs
asleep daily
avg daily
MET level
0.009 0.3 0.005 0.01
Significant p-values from t-tests
Distribution of physical variables
Data-Driven, Personalised, Predictive, Preventive, Participatory Medicine (D2P4)
Part II:
Generating synthetic physical activity data
How do we simulate plausible physical activity patterns and why?
Main contributor: Dr. Jacek Cala, Sr. Research Associate
Motivation
From the ESPRC Healthcare Technologies Grand Challenges (*)
“[Design] An intelligent 'companion' that is fully aware of an individual's healthcare history and
experience, empowering them to self-manage their health and care by providing directly relevant
feedback, information and advice.”
(*) https://epsrc.ukri.org/research/ourportfolio/themes/healthcaretechnologies/strategy/grandchallenges/
Scoping this down…
How do we design an AI agent that
- Knows our (wellness, fitness, health) goals
- Understands our current state through physical activity monitoring
- Can suggest personalised interventions to achieve our goals
Idea: Reinforcement learning
Longitudinal and profile-specific data scarcity
The Good: Annotated sensor data are widely available and useful to train an AI agent
The Bad: Difficult to find / create protocols where:
• Participants are followed for any length of time
 no longitudinal dimension (months, years)
• Responses to interventions can be observed
• Activity traces are available for specific conditions, pathologies, patient groups...
A little puppetry
Approach:
1. Use 24x7 traces to:
• Learn to generate new synthetic traces for a
catalogue A1… An of activities
• Model unfolding daily activity patterns
2. Simulate:
Generate syntraces and combine them into
controlled plausible daily patterns
Limited to basic activity types
- Sedentary, Light tasks,
Moderate, Vigorous
- Sleep
Goal:
to simulate a variety of physical activity patterns that unfold in time
- Realistic
- Useful in practice to boost existing training sets
Learn: Generating synthetic activity traces
(*) https://github.com/activityMonitoring/biobankAccelerometerAnalysis
Training:
- Traces: UK Biobank / 24x7 / 27 individuals
- HAR: Oxford accelerometry analysis tool (*)
- Traces broken down by (predicted) activity type
- A separate model trained for each activity type
- Notes: sleep excluded, traces trimmed to limit
training times
raw data
(subject-1) preprocess
126-dimensional
feature vectors
classified
activity trace
classify
split vectors
by activity
walking feature
vectors
sleep feature
vectors
moderate
feature vectors
...
train low-level
sleep model
train low-level
moderate model
train low-level
walking model
...
walking
model
moderate
model
sleep
model
...
raw data
(subject-1) preprocess
126-dimensional
feature vectors
classified
activity trace
classify
split vectors
by activity
walking feature
vectors
sleep feature
vectors
moderate
feature vectors
raw data
(subject-1) preprocess
126-dimensional
feature vectors
classified
activity trace
classify
split vectors
by activity
walking feature
vectors
sleep feature
vectors
moderate
feature vectors
...
Approach: Generative Neural Networks
BasicGAN from Synthetic Data Vault:
https://sdv.dev/
Preliminary results
Validation: Oxford activity classifier used as discriminator
186 synthetic traces
• Walking activity easiest to simulate: 120 correctly classified
• Moderate activity hardest (196) – only 4 correctly classified
Problem:
some of the correctly classified traces look unrealistic
Model and simulate: whole-day activity profiles
Goal: to realistically
combine bouts of single activities into ”virtual days”
Approach: parametric multi-state modelling
• transition probability si  sj increases as more time
spent in si
Objective: Use real 24x7 sequences to learn:
- Realistic lengths of each activity bouts
- Activity transitions, eg walk  sit
Selected traces of 24-hour synthetic activity profiles generated by
the semi-Markov generalised gamma model
(a), (b) show plausible traces; (c), (d) less realistic
Summary and open research
Part I: The role of physical activity monitoring to support Type II Diabetes studies
- Single sensor, free-living, poor QC  weak and noisy signal
- Good clustering of patients but signal inadequate for specific outcomes eg insuline sensitivity
- Signal either stable over time or too noisy to track disease progression
Next: multi-sensor monitoring
Part II: Generating synthetic physical activity data
- Plausible activity patterns
Next: use syndata for training using reinforcement learning?
Leveraged resources and Future plans
New collaboration:
- Physical activity monitoring to support a study on “long covid”-induced frailty.
Consortium of 5 hospitals (Italy + Israel), about 300 patients. Funded by Gilead
Potential collaborations:
- Closed Loop Medicine, Digital Healthcare, Cambridge (through Prof. Catt)
- Fully-funded CDT PhD studentship aligned with the project since its inception (Ben Lam)
- New PhD student started October 2021 (Naif Alzahrani)
Key outputs
Publications:
• Lam B, Catt M, Cassidy S, Bacardit J, Darke P, Butterfield S, Alshabrawy O, Trenell M, Missier P, Using Wearable
Activity Trackers to Predict Type 2 Diabetes: Machine Learning–Based Cross-sectional Study of the UK Biobank
Accelerometer Cohort -- JMIR Diabetes, Vol 6 no1. 19/3/2021:23364
• Ferrari D, Milic J, Tonelli R, Ghinelli F, Meschiari M, et al. (2020) Machine learning in predicting respiratory failure in
patients with COVID-19 pneumonia—Challenges, strengths, and opportunities in a global health emergency.
PLOS ONE 15(11): e0239172. https://doi.org/10.1371/journal.pone.0239172
Invited Presentations:
- Data Science for (Health) Science: tales from a challenging front line, a talk given to the The School of Information
Sciences, Center for Informatics Research in Science and Scholarship, University of Illinois Urbana-Champaign, USA
(March 2021)
- Digital markers from physical activity traces to support research into type 2 diabetes, Talk given to the IMI DIRECT
consortium (April 2020)
- Prediction & prevention of age-related diseases through Machine Learning, Talk given to Newcastle BRC/NIHR group
(Jan 2020)
- Exploring the role of digital and genetic biomarkers to learn personalized predictive models of metabolic diseases, Talk
given at the Turing Health Programme workshop, Manchester March 2019

Mais conteúdo relacionado

Mais procurados

Introduction to Machine Learning and Texture Analysis for Lesion Characteriza...
Introduction to Machine Learning and Texture Analysis for Lesion Characteriza...Introduction to Machine Learning and Texture Analysis for Lesion Characteriza...
Introduction to Machine Learning and Texture Analysis for Lesion Characteriza...Kevin Mader
 
IRJET- Classifying Chest Pathology Images using Deep Learning Techniques
IRJET- Classifying Chest Pathology Images using Deep Learning TechniquesIRJET- Classifying Chest Pathology Images using Deep Learning Techniques
IRJET- Classifying Chest Pathology Images using Deep Learning TechniquesIRJET Journal
 
Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...
Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...
Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...Kevin Mader
 
A survey of deep learning approaches to medical applications
A survey of deep learning approaches to medical applicationsA survey of deep learning approaches to medical applications
A survey of deep learning approaches to medical applicationsJoseph Paul Cohen PhD
 
Recent advances of AI for medical imaging : Engineering perspectives
Recent advances of AI for medical imaging : Engineering perspectivesRecent advances of AI for medical imaging : Engineering perspectives
Recent advances of AI for medical imaging : Engineering perspectivesNamkug Kim
 
Comparison of Machine Learning Techniques for Identification of Disease
Comparison of Machine Learning Techniques for Identification of DiseaseComparison of Machine Learning Techniques for Identification of Disease
Comparison of Machine Learning Techniques for Identification of DiseaseIJCSIS Research Publications
 
Brainsci 10-00118
Brainsci 10-00118Brainsci 10-00118
Brainsci 10-00118imen jdey
 
IRJET- Plant Leaf Disease Detection using Image Processing
IRJET- Plant Leaf Disease Detection using Image ProcessingIRJET- Plant Leaf Disease Detection using Image Processing
IRJET- Plant Leaf Disease Detection using Image ProcessingIRJET Journal
 
A Novel Approach for Tomato Diseases Classification Based on Deep Convolution...
A Novel Approach for Tomato Diseases Classification Based on Deep Convolution...A Novel Approach for Tomato Diseases Classification Based on Deep Convolution...
A Novel Approach for Tomato Diseases Classification Based on Deep Convolution...Mohammad Shakirul islam
 
IRJET- Comparative Analysis of Video Processing Object Detection
IRJET- Comparative Analysis of Video Processing Object DetectionIRJET- Comparative Analysis of Video Processing Object Detection
IRJET- Comparative Analysis of Video Processing Object DetectionIRJET Journal
 
DENGUE DETECTION AND PREDICTION SYSTEM USING DATA MINING WITH FREQUENCY ANALYSIS
DENGUE DETECTION AND PREDICTION SYSTEM USING DATA MINING WITH FREQUENCY ANALYSISDENGUE DETECTION AND PREDICTION SYSTEM USING DATA MINING WITH FREQUENCY ANALYSIS
DENGUE DETECTION AND PREDICTION SYSTEM USING DATA MINING WITH FREQUENCY ANALYSIScsandit
 
2017 07 03_meetup_d
2017 07 03_meetup_d2017 07 03_meetup_d
2017 07 03_meetup_dDana Brophy
 

Mais procurados (20)

Introduction to Machine Learning and Texture Analysis for Lesion Characteriza...
Introduction to Machine Learning and Texture Analysis for Lesion Characteriza...Introduction to Machine Learning and Texture Analysis for Lesion Characteriza...
Introduction to Machine Learning and Texture Analysis for Lesion Characteriza...
 
IRJET- Classifying Chest Pathology Images using Deep Learning Techniques
IRJET- Classifying Chest Pathology Images using Deep Learning TechniquesIRJET- Classifying Chest Pathology Images using Deep Learning Techniques
IRJET- Classifying Chest Pathology Images using Deep Learning Techniques
 
Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...
Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...
Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...
 
Main
MainMain
Main
 
A survey of deep learning approaches to medical applications
A survey of deep learning approaches to medical applicationsA survey of deep learning approaches to medical applications
A survey of deep learning approaches to medical applications
 
Recent advances of AI for medical imaging : Engineering perspectives
Recent advances of AI for medical imaging : Engineering perspectivesRecent advances of AI for medical imaging : Engineering perspectives
Recent advances of AI for medical imaging : Engineering perspectives
 
Comparison of Machine Learning Techniques for Identification of Disease
Comparison of Machine Learning Techniques for Identification of DiseaseComparison of Machine Learning Techniques for Identification of Disease
Comparison of Machine Learning Techniques for Identification of Disease
 
Brainsci 10-00118
Brainsci 10-00118Brainsci 10-00118
Brainsci 10-00118
 
IRJET- Plant Leaf Disease Detection using Image Processing
IRJET- Plant Leaf Disease Detection using Image ProcessingIRJET- Plant Leaf Disease Detection using Image Processing
IRJET- Plant Leaf Disease Detection using Image Processing
 
A Novel Approach for Tomato Diseases Classification Based on Deep Convolution...
A Novel Approach for Tomato Diseases Classification Based on Deep Convolution...A Novel Approach for Tomato Diseases Classification Based on Deep Convolution...
A Novel Approach for Tomato Diseases Classification Based on Deep Convolution...
 
Reserach data
Reserach dataReserach data
Reserach data
 
IRJET- Comparative Analysis of Video Processing Object Detection
IRJET- Comparative Analysis of Video Processing Object DetectionIRJET- Comparative Analysis of Video Processing Object Detection
IRJET- Comparative Analysis of Video Processing Object Detection
 
Qt7355g8v8
Qt7355g8v8Qt7355g8v8
Qt7355g8v8
 
An Introduction to Biology with Computers
An Introduction to Biology with ComputersAn Introduction to Biology with Computers
An Introduction to Biology with Computers
 
Fulltext02
Fulltext02Fulltext02
Fulltext02
 
DENGUE DETECTION AND PREDICTION SYSTEM USING DATA MINING WITH FREQUENCY ANALYSIS
DENGUE DETECTION AND PREDICTION SYSTEM USING DATA MINING WITH FREQUENCY ANALYSISDENGUE DETECTION AND PREDICTION SYSTEM USING DATA MINING WITH FREQUENCY ANALYSIS
DENGUE DETECTION AND PREDICTION SYSTEM USING DATA MINING WITH FREQUENCY ANALYSIS
 
323462348
323462348323462348
323462348
 
research publication
research publicationresearch publication
research publication
 
Paper presentation report
Paper presentation reportPaper presentation report
Paper presentation report
 
2017 07 03_meetup_d
2017 07 03_meetup_d2017 07 03_meetup_d
2017 07 03_meetup_d
 

Semelhante a Digital biomarkers for preventive personalised healthcare

The Cold Start Problem and Per-Group Personalization in Real-Life Emotion Rec...
The Cold Start Problem and Per-Group Personalization in Real-Life Emotion Rec...The Cold Start Problem and Per-Group Personalization in Real-Life Emotion Rec...
The Cold Start Problem and Per-Group Personalization in Real-Life Emotion Rec...Maciej Behnke
 
Ontology-based Deep Learning for Human Behavior Prediction in Health Social N...
Ontology-based Deep Learning for Human Behavior Prediction in Health Social N...Ontology-based Deep Learning for Human Behavior Prediction in Health Social N...
Ontology-based Deep Learning for Human Behavior Prediction in Health Social N...NhatHai Phan
 
Activity recognition in health field
Activity recognition in health fieldActivity recognition in health field
Activity recognition in health fieldYogesh KM
 
Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...
Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...
Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...Paolo Missier
 
IoT, Big Data Healthcare Summit Western Canada - Wearables from Lab to Bedsid...
IoT, Big Data Healthcare Summit Western Canada - Wearables from Lab to Bedsid...IoT, Big Data Healthcare Summit Western Canada - Wearables from Lab to Bedsid...
IoT, Big Data Healthcare Summit Western Canada - Wearables from Lab to Bedsid...Ingunn Grip Fjær
 
Proposed Model for Chest Disease Prediction using Data Analytics
Proposed Model for Chest Disease Prediction using Data AnalyticsProposed Model for Chest Disease Prediction using Data Analytics
Proposed Model for Chest Disease Prediction using Data Analyticsvivatechijri
 
Citation Kristoffersson, A.; Lindén,
Citation Kristoffersson, A.; Lindén,Citation Kristoffersson, A.; Lindén,
Citation Kristoffersson, A.; Lindén,VinaOconner450
 
Visual Analytics for Healthcare - Panel at AMIA 2012 in Chicago
Visual Analytics for Healthcare - Panel at AMIA 2012 in ChicagoVisual Analytics for Healthcare - Panel at AMIA 2012 in Chicago
Visual Analytics for Healthcare - Panel at AMIA 2012 in ChicagoAdam Perer
 
Challenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical researchChallenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical researchFranciscoJAzuajeG
 
Personalized medicine
Personalized medicinePersonalized medicine
Personalized medicinecancerdrg
 
Bjoern Eskofier: Keynote at DSAI & TISHW 2016 Conference
Bjoern Eskofier: Keynote at DSAI & TISHW 2016 ConferenceBjoern Eskofier: Keynote at DSAI & TISHW 2016 Conference
Bjoern Eskofier: Keynote at DSAI & TISHW 2016 ConferenceBjoern Eskofier
 
Sport kit
Sport kitSport kit
Sport kitalchis
 
Sensor Based Ambient Assisted Living
Sensor Based Ambient Assisted LivingSensor Based Ambient Assisted Living
Sensor Based Ambient Assisted LivingYiannis Kompatsiaris
 
Univ of Miami CTSI: Citizen science seminar; Oct 2014
Univ of Miami CTSI: Citizen science seminar; Oct 2014Univ of Miami CTSI: Citizen science seminar; Oct 2014
Univ of Miami CTSI: Citizen science seminar; Oct 2014Richard Bookman
 
Biochip Informatics Technology For Electronic & Communication Engineering
Biochip Informatics Technology For Electronic & Communication EngineeringBiochip Informatics Technology For Electronic & Communication Engineering
Biochip Informatics Technology For Electronic & Communication EngineeringNazakatHussain15
 
What Makes Transfer learning Work for Medical Images
What Makes Transfer learning Work for Medical Images What Makes Transfer learning Work for Medical Images
What Makes Transfer learning Work for Medical Images MithunjhaAnandakumar
 
Physical activity prediction using fitness data: Challenges and issues
Physical activity prediction using fitness data: Challenges and issuesPhysical activity prediction using fitness data: Challenges and issues
Physical activity prediction using fitness data: Challenges and issuesjournalBEEI
 
Exploiting NLP for Digital Disease Informatics
Exploiting NLP for Digital Disease InformaticsExploiting NLP for Digital Disease Informatics
Exploiting NLP for Digital Disease InformaticsNigel Collier
 

Semelhante a Digital biomarkers for preventive personalised healthcare (20)

The Cold Start Problem and Per-Group Personalization in Real-Life Emotion Rec...
The Cold Start Problem and Per-Group Personalization in Real-Life Emotion Rec...The Cold Start Problem and Per-Group Personalization in Real-Life Emotion Rec...
The Cold Start Problem and Per-Group Personalization in Real-Life Emotion Rec...
 
Ontology-based Deep Learning for Human Behavior Prediction in Health Social N...
Ontology-based Deep Learning for Human Behavior Prediction in Health Social N...Ontology-based Deep Learning for Human Behavior Prediction in Health Social N...
Ontology-based Deep Learning for Human Behavior Prediction in Health Social N...
 
Activity recognition in health field
Activity recognition in health fieldActivity recognition in health field
Activity recognition in health field
 
Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...
Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...
Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...
 
IoT, Big Data Healthcare Summit Western Canada - Wearables from Lab to Bedsid...
IoT, Big Data Healthcare Summit Western Canada - Wearables from Lab to Bedsid...IoT, Big Data Healthcare Summit Western Canada - Wearables from Lab to Bedsid...
IoT, Big Data Healthcare Summit Western Canada - Wearables from Lab to Bedsid...
 
Proposed Model for Chest Disease Prediction using Data Analytics
Proposed Model for Chest Disease Prediction using Data AnalyticsProposed Model for Chest Disease Prediction using Data Analytics
Proposed Model for Chest Disease Prediction using Data Analytics
 
Citation Kristoffersson, A.; Lindén,
Citation Kristoffersson, A.; Lindén,Citation Kristoffersson, A.; Lindén,
Citation Kristoffersson, A.; Lindén,
 
Visual Analytics for Healthcare - Panel at AMIA 2012 in Chicago
Visual Analytics for Healthcare - Panel at AMIA 2012 in ChicagoVisual Analytics for Healthcare - Panel at AMIA 2012 in Chicago
Visual Analytics for Healthcare - Panel at AMIA 2012 in Chicago
 
Challenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical researchChallenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical research
 
Personalized medicine
Personalized medicinePersonalized medicine
Personalized medicine
 
Bjoern Eskofier: Keynote at DSAI & TISHW 2016 Conference
Bjoern Eskofier: Keynote at DSAI & TISHW 2016 ConferenceBjoern Eskofier: Keynote at DSAI & TISHW 2016 Conference
Bjoern Eskofier: Keynote at DSAI & TISHW 2016 Conference
 
Sport kit
Sport kitSport kit
Sport kit
 
Sensor Based Ambient Assisted Living
Sensor Based Ambient Assisted LivingSensor Based Ambient Assisted Living
Sensor Based Ambient Assisted Living
 
Univ of Miami CTSI: Citizen science seminar; Oct 2014
Univ of Miami CTSI: Citizen science seminar; Oct 2014Univ of Miami CTSI: Citizen science seminar; Oct 2014
Univ of Miami CTSI: Citizen science seminar; Oct 2014
 
Summit2013 ho-jin choi - summit2013
Summit2013   ho-jin choi - summit2013Summit2013   ho-jin choi - summit2013
Summit2013 ho-jin choi - summit2013
 
Biochip Informatics Technology For Electronic & Communication Engineering
Biochip Informatics Technology For Electronic & Communication EngineeringBiochip Informatics Technology For Electronic & Communication Engineering
Biochip Informatics Technology For Electronic & Communication Engineering
 
Final_Presentation.pptx
Final_Presentation.pptxFinal_Presentation.pptx
Final_Presentation.pptx
 
What Makes Transfer learning Work for Medical Images
What Makes Transfer learning Work for Medical Images What Makes Transfer learning Work for Medical Images
What Makes Transfer learning Work for Medical Images
 
Physical activity prediction using fitness data: Challenges and issues
Physical activity prediction using fitness data: Challenges and issuesPhysical activity prediction using fitness data: Challenges and issues
Physical activity prediction using fitness data: Challenges and issues
 
Exploiting NLP for Digital Disease Informatics
Exploiting NLP for Digital Disease InformaticsExploiting NLP for Digital Disease Informatics
Exploiting NLP for Digital Disease Informatics
 

Mais de Paolo Missier

Towards explanations for Data-Centric AI using provenance records
Towards explanations for Data-Centric AI using provenance recordsTowards explanations for Data-Centric AI using provenance records
Towards explanations for Data-Centric AI using provenance recordsPaolo Missier
 
Interpretable and robust hospital readmission predictions from Electronic Hea...
Interpretable and robust hospital readmission predictions from Electronic Hea...Interpretable and robust hospital readmission predictions from Electronic Hea...
Interpretable and robust hospital readmission predictions from Electronic Hea...Paolo Missier
 
Data-centric AI and the convergence of data and model engineering: opportunit...
Data-centric AI and the convergence of data and model engineering:opportunit...Data-centric AI and the convergence of data and model engineering:opportunit...
Data-centric AI and the convergence of data and model engineering: opportunit...Paolo Missier
 
Realising the potential of Health Data Science: opportunities and challenges ...
Realising the potential of Health Data Science:opportunities and challenges ...Realising the potential of Health Data Science:opportunities and challenges ...
Realising the potential of Health Data Science: opportunities and challenges ...Paolo Missier
 
Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)
Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)
Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)Paolo Missier
 
A Data-centric perspective on Data-driven healthcare: a short overview
A Data-centric perspective on Data-driven healthcare: a short overviewA Data-centric perspective on Data-driven healthcare: a short overview
A Data-centric perspective on Data-driven healthcare: a short overviewPaolo Missier
 
Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...Paolo Missier
 
Tracking trajectories of multiple long-term conditions using dynamic patient...
Tracking trajectories of  multiple long-term conditions using dynamic patient...Tracking trajectories of  multiple long-term conditions using dynamic patient...
Tracking trajectories of multiple long-term conditions using dynamic patient...Paolo Missier
 
Data Provenance for Data Science
Data Provenance for Data ScienceData Provenance for Data Science
Data Provenance for Data SciencePaolo Missier
 
Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...Paolo Missier
 
Analytics of analytics pipelines: from optimising re-execution to general Dat...
Analytics of analytics pipelines:from optimising re-execution to general Dat...Analytics of analytics pipelines:from optimising re-execution to general Dat...
Analytics of analytics pipelines: from optimising re-execution to general Dat...Paolo Missier
 
ReComp: optimising the re-execution of analytics pipelines in response to cha...
ReComp: optimising the re-execution of analytics pipelines in response to cha...ReComp: optimising the re-execution of analytics pipelines in response to cha...
ReComp: optimising the re-execution of analytics pipelines in response to cha...Paolo Missier
 
ReComp, the complete story: an invited talk at Cardiff University
ReComp, the complete story:  an invited talk at Cardiff UniversityReComp, the complete story:  an invited talk at Cardiff University
ReComp, the complete story: an invited talk at Cardiff UniversityPaolo Missier
 
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...Paolo Missier
 
Decentralized, Trust-less Marketplace for Brokered IoT Data Trading using Blo...
Decentralized, Trust-less Marketplacefor Brokered IoT Data Tradingusing Blo...Decentralized, Trust-less Marketplacefor Brokered IoT Data Tradingusing Blo...
Decentralized, Trust-less Marketplace for Brokered IoT Data Trading using Blo...Paolo Missier
 
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...Paolo Missier
 
A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...
A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...
A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...Paolo Missier
 
Provenance Annotation and Analysis to Support Process Re-Computation
Provenance Annotation and Analysis to Support Process Re-ComputationProvenance Annotation and Analysis to Support Process Re-Computation
Provenance Annotation and Analysis to Support Process Re-ComputationPaolo Missier
 
Design and evaluation of a genomics variant analysis pipeline using GATK Spar...
Design and evaluation of a genomics variant analysis pipeline using GATK Spar...Design and evaluation of a genomics variant analysis pipeline using GATK Spar...
Design and evaluation of a genomics variant analysis pipeline using GATK Spar...Paolo Missier
 
Selective and incremental re-computation in reaction to changes: an exercise ...
Selective and incremental re-computation in reaction to changes: an exercise ...Selective and incremental re-computation in reaction to changes: an exercise ...
Selective and incremental re-computation in reaction to changes: an exercise ...Paolo Missier
 

Mais de Paolo Missier (20)

Towards explanations for Data-Centric AI using provenance records
Towards explanations for Data-Centric AI using provenance recordsTowards explanations for Data-Centric AI using provenance records
Towards explanations for Data-Centric AI using provenance records
 
Interpretable and robust hospital readmission predictions from Electronic Hea...
Interpretable and robust hospital readmission predictions from Electronic Hea...Interpretable and robust hospital readmission predictions from Electronic Hea...
Interpretable and robust hospital readmission predictions from Electronic Hea...
 
Data-centric AI and the convergence of data and model engineering: opportunit...
Data-centric AI and the convergence of data and model engineering:opportunit...Data-centric AI and the convergence of data and model engineering:opportunit...
Data-centric AI and the convergence of data and model engineering: opportunit...
 
Realising the potential of Health Data Science: opportunities and challenges ...
Realising the potential of Health Data Science:opportunities and challenges ...Realising the potential of Health Data Science:opportunities and challenges ...
Realising the potential of Health Data Science: opportunities and challenges ...
 
Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)
Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)
Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)
 
A Data-centric perspective on Data-driven healthcare: a short overview
A Data-centric perspective on Data-driven healthcare: a short overviewA Data-centric perspective on Data-driven healthcare: a short overview
A Data-centric perspective on Data-driven healthcare: a short overview
 
Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...
 
Tracking trajectories of multiple long-term conditions using dynamic patient...
Tracking trajectories of  multiple long-term conditions using dynamic patient...Tracking trajectories of  multiple long-term conditions using dynamic patient...
Tracking trajectories of multiple long-term conditions using dynamic patient...
 
Data Provenance for Data Science
Data Provenance for Data ScienceData Provenance for Data Science
Data Provenance for Data Science
 
Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...
 
Analytics of analytics pipelines: from optimising re-execution to general Dat...
Analytics of analytics pipelines:from optimising re-execution to general Dat...Analytics of analytics pipelines:from optimising re-execution to general Dat...
Analytics of analytics pipelines: from optimising re-execution to general Dat...
 
ReComp: optimising the re-execution of analytics pipelines in response to cha...
ReComp: optimising the re-execution of analytics pipelines in response to cha...ReComp: optimising the re-execution of analytics pipelines in response to cha...
ReComp: optimising the re-execution of analytics pipelines in response to cha...
 
ReComp, the complete story: an invited talk at Cardiff University
ReComp, the complete story:  an invited talk at Cardiff UniversityReComp, the complete story:  an invited talk at Cardiff University
ReComp, the complete story: an invited talk at Cardiff University
 
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
 
Decentralized, Trust-less Marketplace for Brokered IoT Data Trading using Blo...
Decentralized, Trust-less Marketplacefor Brokered IoT Data Tradingusing Blo...Decentralized, Trust-less Marketplacefor Brokered IoT Data Tradingusing Blo...
Decentralized, Trust-less Marketplace for Brokered IoT Data Trading using Blo...
 
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
 
A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...
A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...
A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter U...
 
Provenance Annotation and Analysis to Support Process Re-Computation
Provenance Annotation and Analysis to Support Process Re-ComputationProvenance Annotation and Analysis to Support Process Re-Computation
Provenance Annotation and Analysis to Support Process Re-Computation
 
Design and evaluation of a genomics variant analysis pipeline using GATK Spar...
Design and evaluation of a genomics variant analysis pipeline using GATK Spar...Design and evaluation of a genomics variant analysis pipeline using GATK Spar...
Design and evaluation of a genomics variant analysis pipeline using GATK Spar...
 
Selective and incremental re-computation in reaction to changes: an exercise ...
Selective and incremental re-computation in reaction to changes: an exercise ...Selective and incremental re-computation in reaction to changes: an exercise ...
Selective and incremental re-computation in reaction to changes: an exercise ...
 

Último

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 

Último (20)

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 

Digital biomarkers for preventive personalised healthcare

  • 1. Digital biomarkers for preventive personalised healthcare Paolo Missier Newcastle University School of Computing
  • 2. The Team Prof. Paolo Missier PI, Newcastle University (PI) Prof. Michael Catt, Newcastle University and Closed Loop Medicine, Cambridge (CO-I) Dr. Jaume Bacardit, Newcastle University (CO-I) Key contributors: Dr. Ossama Alshabrawy (PhD student, now Lecturer at Northumbria University) Ben Lam, PhD student Dr. Jacek Cala, Sr. Research Associate In collaboration with the IMI DIRECT Consortium https://www.imi.europa.eu/projects-results/project-factsheets/direct Diabetes research on patient stratification
  • 3. Data-Driven, Personalised, Predictive, Preventive, Participatory Medicine (D2P4) Part I: The role of physical activity monitoring to support Type II Diabetes studies Can we learn useful representations for a person’s daily activities from accelerometry? Part II: Generating synthetic physical activity data How do we simulate plausible physical activity patterns and why?
  • 4. Data-Driven, Personalised, Predictive, Preventive, Participatory Medicine (D2P4) Part I: The role of physical activity monitoring to support Type II Diabetes studies Can we learn useful representations for a person’s daily activities from accelerometry? Main contributors: Dr. Ossama Alshabrawy (PhD student, now Lecturer at Northumbria University) Benjamin Lam, PhD student
  • 5. Activity traces archive from the UK Biobank Filter: Accelerometry study? 103,712 Split criteria: Type 2 Diabetes? At baseline: 2,755 Through EHR analysis: 1,321 Total: 4,076 Non-Diabetes 99,636 Filter: EHR data available? 19,852 502, 664 All UK Biobank participants: Filter: QC on activity traces 3,103 Positives: T2D vs Norm-0 Physical Impairment analysis Severe impairment 1,666 No impairment 8,463 T2D vs Norm-2 Is there enough signal in the traces to segregate T2D from Norm?
  • 6. Extracting High Level Activity Features (HLAF) feature extraction 60 features / day  aggregated to week (*) (*) https://github.com/activityMonitoring/biobankAccelerometerAnalysis
  • 8. Selected results: classification Negatives: HLAF SDL HLAF+SDL Norm-0 Norm-2 Norm-0 Norm-2 Norm-0 Norm-2 RF .80 .68 .83 .78 .86 .77 LR .79 .70 .83 .78 .86 .78 XGB .78 .66 .80 .74 .85 .75 Lam B, Catt M, Cassidy S, Bacardit J, Darke P, Butterfield S, Alshabrawy O, Trenell M, Missier P, Using Wearable Activity Trackers to Predict Type 2 Diabetes: Machine Learning– Based Cross-sectional Study of the UK Biobank Accelerometer Cohort -- JMIR Diabetes, Vol 6 no1. 19/3/2021:23364 SDL: Socio-Demographic and Lifestyle variables
  • 9. Lessons learnt • Signal is weak and noisy when used in the contex of a complex metabolic disease • “Controls” may actually be physically impaired and this is hard to determine • UK Biobank had no QC protocol, ”a random week in life” provides poor indicators Are we mapping raw traces to the best possible feature space?
  • 10. Learning embedded representation spaces DIRECT DB • ~3,000 individuals total • Follow-ups at 18 36, 48 months Representation learning Embedded feature space LSTM Autoencoder Covariates, Outcomes (eg Insulin sensitivity) Classification Clustering Cluster interpretation
  • 11. Autoencoder Architecture LSTM Autoencoder Final reconstruction loss: 0.46 (early termination, 9/150 epochs to prevent overfitting)
  • 12. Clustering in the high level and embedded spaces Embedded features High-level features K-means Hierarchical Affinity Propagation Spectral clustering Embedded features High-level features Embedded features High-level features Embedded features High-level features
  • 13. Clusters quality Silhouette Calinski-Harabrasz Davies-Bouldin Affinity Propagation 0.634 2220.021 0.895 Spectral 0.677 2600.836 0.839 DBSCAN 0.274 73.642 1.808 Hierarchical 0.466 2292.27 0.879 K-means 0.482 2617.19 0.839 Silhouette: Bounded between 0 and 1 (Closer to 1, the better) Calinski-Harabrasz: Unbounded (The higher the score, the better) Davies-Bouldin (Not well suited to density methods): Bounded between 0 and 1 (Closer to 0, the better) Many of the other cluster validity indices require knowledge of the ground truth labels, so this is not suitable for this study
  • 14. Cluster interpretation: clinical and activity variables Logistic regression AdaBoost classifier Random forest classifier XGBoost 2 clusters  binary classification: are the clinical variables good predictors for the clusters? percent time light-tasks daily percent time sedentary daily avg num hrs asleep daily avg daily MET level 0.009 0.3 0.005 0.01 Significant p-values from t-tests Distribution of physical variables
  • 15. Data-Driven, Personalised, Predictive, Preventive, Participatory Medicine (D2P4) Part II: Generating synthetic physical activity data How do we simulate plausible physical activity patterns and why? Main contributor: Dr. Jacek Cala, Sr. Research Associate
  • 16. Motivation From the ESPRC Healthcare Technologies Grand Challenges (*) “[Design] An intelligent 'companion' that is fully aware of an individual's healthcare history and experience, empowering them to self-manage their health and care by providing directly relevant feedback, information and advice.” (*) https://epsrc.ukri.org/research/ourportfolio/themes/healthcaretechnologies/strategy/grandchallenges/ Scoping this down… How do we design an AI agent that - Knows our (wellness, fitness, health) goals - Understands our current state through physical activity monitoring - Can suggest personalised interventions to achieve our goals Idea: Reinforcement learning
  • 17. Longitudinal and profile-specific data scarcity The Good: Annotated sensor data are widely available and useful to train an AI agent The Bad: Difficult to find / create protocols where: • Participants are followed for any length of time  no longitudinal dimension (months, years) • Responses to interventions can be observed • Activity traces are available for specific conditions, pathologies, patient groups...
  • 18. A little puppetry Approach: 1. Use 24x7 traces to: • Learn to generate new synthetic traces for a catalogue A1… An of activities • Model unfolding daily activity patterns 2. Simulate: Generate syntraces and combine them into controlled plausible daily patterns Limited to basic activity types - Sedentary, Light tasks, Moderate, Vigorous - Sleep Goal: to simulate a variety of physical activity patterns that unfold in time - Realistic - Useful in practice to boost existing training sets
  • 19. Learn: Generating synthetic activity traces (*) https://github.com/activityMonitoring/biobankAccelerometerAnalysis Training: - Traces: UK Biobank / 24x7 / 27 individuals - HAR: Oxford accelerometry analysis tool (*) - Traces broken down by (predicted) activity type - A separate model trained for each activity type - Notes: sleep excluded, traces trimmed to limit training times raw data (subject-1) preprocess 126-dimensional feature vectors classified activity trace classify split vectors by activity walking feature vectors sleep feature vectors moderate feature vectors ... train low-level sleep model train low-level moderate model train low-level walking model ... walking model moderate model sleep model ... raw data (subject-1) preprocess 126-dimensional feature vectors classified activity trace classify split vectors by activity walking feature vectors sleep feature vectors moderate feature vectors raw data (subject-1) preprocess 126-dimensional feature vectors classified activity trace classify split vectors by activity walking feature vectors sleep feature vectors moderate feature vectors ... Approach: Generative Neural Networks BasicGAN from Synthetic Data Vault: https://sdv.dev/
  • 20. Preliminary results Validation: Oxford activity classifier used as discriminator 186 synthetic traces • Walking activity easiest to simulate: 120 correctly classified • Moderate activity hardest (196) – only 4 correctly classified Problem: some of the correctly classified traces look unrealistic
  • 21. Model and simulate: whole-day activity profiles Goal: to realistically combine bouts of single activities into ”virtual days” Approach: parametric multi-state modelling • transition probability si  sj increases as more time spent in si Objective: Use real 24x7 sequences to learn: - Realistic lengths of each activity bouts - Activity transitions, eg walk  sit Selected traces of 24-hour synthetic activity profiles generated by the semi-Markov generalised gamma model (a), (b) show plausible traces; (c), (d) less realistic
  • 22. Summary and open research Part I: The role of physical activity monitoring to support Type II Diabetes studies - Single sensor, free-living, poor QC  weak and noisy signal - Good clustering of patients but signal inadequate for specific outcomes eg insuline sensitivity - Signal either stable over time or too noisy to track disease progression Next: multi-sensor monitoring Part II: Generating synthetic physical activity data - Plausible activity patterns Next: use syndata for training using reinforcement learning?
  • 23. Leveraged resources and Future plans New collaboration: - Physical activity monitoring to support a study on “long covid”-induced frailty. Consortium of 5 hospitals (Italy + Israel), about 300 patients. Funded by Gilead Potential collaborations: - Closed Loop Medicine, Digital Healthcare, Cambridge (through Prof. Catt) - Fully-funded CDT PhD studentship aligned with the project since its inception (Ben Lam) - New PhD student started October 2021 (Naif Alzahrani)
  • 24. Key outputs Publications: • Lam B, Catt M, Cassidy S, Bacardit J, Darke P, Butterfield S, Alshabrawy O, Trenell M, Missier P, Using Wearable Activity Trackers to Predict Type 2 Diabetes: Machine Learning–Based Cross-sectional Study of the UK Biobank Accelerometer Cohort -- JMIR Diabetes, Vol 6 no1. 19/3/2021:23364 • Ferrari D, Milic J, Tonelli R, Ghinelli F, Meschiari M, et al. (2020) Machine learning in predicting respiratory failure in patients with COVID-19 pneumonia—Challenges, strengths, and opportunities in a global health emergency. PLOS ONE 15(11): e0239172. https://doi.org/10.1371/journal.pone.0239172 Invited Presentations: - Data Science for (Health) Science: tales from a challenging front line, a talk given to the The School of Information Sciences, Center for Informatics Research in Science and Scholarship, University of Illinois Urbana-Champaign, USA (March 2021) - Digital markers from physical activity traces to support research into type 2 diabetes, Talk given to the IMI DIRECT consortium (April 2020) - Prediction & prevention of age-related diseases through Machine Learning, Talk given to Newcastle BRC/NIHR group (Jan 2020) - Exploring the role of digital and genetic biomarkers to learn personalized predictive models of metabolic diseases, Talk given at the Turing Health Programme workshop, Manchester March 2019

Notas do Editor

  1. Aims: 1. To understand the role {potential, limitations} of physical activity monitoring to support the detection of complex metabolic diseases (Type II Diabetes) 2. To investigate the potential of synthetic activity traces to address the scarcity of longitudinal activity datasets for research Clustering within the space Learning classifiers for specific clinical outcomes
  2. Aims: 1. To understand the role {potential, limitations} of physical activity monitoring to support the detection of complex metabolic diseases (Type II Diabetes) 2. To investigate the potential of synthetic activity traces to address the scarcity of longitudinal activity datasets for research Clustering within the space Learning classifiers for specific clinical outcomes
  3. Spectral clustering is a technique with roots in graph theory, where the approach is used to identify communities of nodes in a graph based on the edges connecting them. The method is flexible and allows us to cluster non graph data as well. Spectral clustering uses information from the eigenvalues (spectrum) of the Laplacian built from a graph representation of the data set. 
  4. Aims: 1. To understand the role {potential, limitations} of physical activity monitoring to support the detection of complex metabolic diseases (Type II Diabetes) 2. To investigate the potential of synthetic activity traces to address the scarcity of longitudinal activity datasets for research Clustering within the space Learning classifiers for specific clinical outcomes