SlideShare uma empresa Scribd logo
1 de 39
Baixar para ler offline
ORGANIZING THE
INTERNET OF THINGS
ACTIONABLE INSIGHT THROUGH ONTOLOGIES
Boris Adryan
badryan@gmail.com
• Computational biologist
• Research group leader
• Advisor at
• 2015 Fellow of the
Who is
@BorisAdryan
• Why a biologist is interested in
large, unstructured data
• What wrong is with the IoT in its
current state
• How biologists deal with similar
problems
• Which academic concepts would
be useful in the IoT
WHAT TO EXPECT IN THE NEXT HOUR…
(including questions!)
• Why a biologist is interested
in large, unstructured data
• What wrong is with the IoT in its
current state
• How biologists deal with similar
problems
• Which academic concepts would
be useful in the IoT
WHAT TO EXPECT IN THE NEXT 10 MINUTES
DNA = storage of a blueprint
RNA = ‘active copy’ of DNA
protein = the building blocks
of cells and tissues
LIFE AS WE KNOW IT
transcription
translation
Gregor Johann Mendel,
exhibited in the Library at the NIMR
‣ Reading DNA information
‣ Determining “the sequence
of a gene” was a PhD in the
early 1980s
‣ Data processing was mainly
transcribing the observation
into a research paper
BIOLOGY THEN AND NOW
SEQUENCE INFORMATION
Sanger sequencing
ca. 1980
http://www.eplantscience.com
189,739,230,107 bases base pairs on 15th April 2015
(from 159,813,411,760 bases pairs in April 2015)
‣ We can sequence a human
genome in half a day
‣ Sequence databases grow
faster than storage capacity
‣ Data processing is the key
step in scientific
understanding
BIOLOGY THEN AND NOW
SEQUENCE INFORMATION
1990: automation
kilobases a day
2007: next-gen seq
megabases a day
2015: 1000s of
instruments world-wide
BIOLOGY THEN AND NOW
GENE ACTIVITY INFORMATION
‣ When are genes needed?
‣ Classical molecular biology
workflow, taking days…
‣ Data is semi-quantitative;
testing one gene at the time
Northern blot, ca. 1995
‣ High-throughput gene expression profiling
since mid-1990s
‣ Quantitative information for every gene in an
organism
‣ Key challenge is the graphical representation
and interpretation of the data
screenshot from
FlyBase, today
2
6 ATP
‣ Signal transduction and
metabolic pathways
‣ Characterisation of proteins
and substrates that mediate
chemical reactions
‣ Nobel prize material
BIOLOGY THEN AND NOW
BIOCHEMISTRY
‣ We know about 250k metabolites
‣ 100k protein structures
‣ on the order of 10k different
chemical reactions
BIOLOGY THEN AND NOW
BIOCHEMISTRY
“The Robot Scientist”
“small molecules”
(Organic & Biomolecular Chemistry Blog)
protein
(via the Protein Databank, www.pdb.org)
‣Everything is connected
‣ Big, noisy, often
unstructured data
‣ We are learning how biological
entities depend on each other
DNA > RNA > proteins
• Why a biologist is interested in
large, unstructured data
• What wrong is with the IoT
in its current state
• How biologists deal with similar
problems
• Which academic concepts would
be useful in the IoT
WHAT TO EXPECT IN THE NEXT 5 MINUTES
‣ Everything is connected
‣ Big, noisy, often
unstructured data
www.thingslearn.com
Analytics, context integration, machine learning
and predictive modelling for the IoT.
0 clean shirt left
+
washing machine estimates 97% of
your last pack of powder used
+
it’s Wednesday, 23:55
+
the last four Thursdays had a
morning business meeting
+
the car is parked 20 m from a shop
+
last retail activity: 8 sec ago
Send immediate text reminder
to pick up washing powder +
send tweet from @BorisHouse
“need identified” +
“notification appropriate”
Actionable insight.
From everything.
NO ANALYTICAL FLEXIBILITY IN M2M/IOT
Matt Hatton, Machina Research
The BLN IoT ‘14
Internet replaces wire
It’s all about the
context
M2M
consumer
IoT
defined I-P-O
like it’s 1975
context
context
context
context
context
context
context
Is this hot?
LIFE SCIENCE STRATEGIES
DON’T WORK IN THE IOT
- There are no commonly accepted
- ‘catalogue’ of things,
- ‘ontology’ of things,
- ‘data format’ of things,
- ‘meta data’ for things.
- Most businesses are driven by revenue, not
long-term strategic vision
- Service providers have no need to publish
- Data can be highly personal (cheap excuse)
unless they’re
Trojan Room
coffee pot -
ca. 1993
Oct. 1995
“The Internet of Things”
Kevin Ashton, ca. 1999
20 YEARS OF NON-CONVERGENT EVOLUTION
FIRST DATA POTENTIAL RECOGNISED TODAY’S REALITY
“ignorant coexistence”
➡ Commonly accepted platforms
and formats for data exchange
➡ Meta-data deposition is a must
➡ Infrastructure provides entry
point for computational
knowledge inference
“designed to ask questions”
• Why a biologist is interested in
large, unstructured data
• What wrong is with the IoT in its
current state
• How biologists deal with
similar problems
• Which academic concepts would
be useful in the IoT
WHAT TO EXPECT IN THE NEXT 10 MINUTES
Oct. 1995
TOWARDS MIAMI STANDARD AND
DATA REPOSITORIES
cf. IoT
Nov. 1993
MInimal Annotation for
MIcroarray Info
META DATA, SHARING AND
DATA REPOSITORIES
founded in Nov. 1999
But this is a complex and ambitious project, and is one of the biggest challenges that
bioinformatics has yet faced. Major difficulties stem from the detail required to describe the
conditions of an experiment, and the relative and imprecise nature of measurements of
expression levels.The potentially huge volume of data only adds to these difficulties.
Nature
Feb. 2000
“
“
Nov. 2000
Oct. 2002
Wide adoption as
requirement for
publication in
scientific journals
META DATA, SHARING AND
DATA REPOSITORIES
cf. IoT 2014
since 2003
http://en.wikipedia.org/wiki/Silo
THE LIFE SCIENCES FIXED THEIR
KNOWLEDGE REPRESENTATION PROBLEM
FORMALISING KNOWLEDGE
FORMALISING KNOWLEDGE
WITH GENE ONTOLOGY
CURRENT GOVERNMENT
INVESTMENTS INTO GENE
ONTOLOGY
NIH alone spent $44,616,906 on the
ontology structure since 2001
(I don’t have data for UK/EU spendings)
~100 full-time salaries for experts with
domain-specific knowledge
~40,000 terms
story
measurements
+ meta data
open, public repositories
human
curators
ontology
terms
community
PUBLISH OR PERISH
ok?
journal
informal exchange - no credit!
funders
assessment
The majority of this
infrastructure is paid for by
governments and charities
industry!
OUR PROBLEM IS KNOWLEDGE
DATA != INSIGHT
WITHOUT ORGANISING IT
• Why a biologist is interested in
large, unstructured data
• What wrong is with the IoT in its
current state
• How biologists deal with similar
problems
• Which academic concepts
would be useful in the IoT
WHAT TO EXPECT IN THE NEXT 10 MINUTES
measurements
+ meta data
storage &
provenance
human
curators
ontology
terms
user
PUBLISH OR YOU’RE NOT DOING IOT
ok?
Maybe the majority of this
infrastructure should be
paid for by governments?
company
cloud
device
registration
“ “
privileges
dataadded
value
WHAT IS AN ONTOLOGY?
used to establish conceptual
connection between entities
knowledge inference
finger
ontology structure
- body part
- limb
- arm
- hand
- thumb
- fingerontology rules
‣controlled vocabulary
‣clearly defined relationships
is a
is a
connects to
part of
with ontological reasoning, a computer can
infer that “finger is a body part”, although we
haven’t explicitly defined it that way
ARE PEOPLE NOT ALREADY USING
ONTOLOGIES IN THE IOT?
Semantic Sensor Network Ontology
“thermostat”
The idea is not new! Cf. extension of the semantic web
with the Semantic Sensor Network.
‣catalogs
‣conventions
http://www.w3.org/2005/Incubator/ssn/ssnx/ssn
ONTOLOGIES HAVE TO BE
PRAGMATIC COMPROMISES
Gene Ontology annotation
15 years of research
47 publications
100+ authors
50+ PhDs
15 direct annotations
~150 inferred annotations
THE THREE BRANCHES OF
Adapted from Anurag et al., Mol. BioSyst., 2012,8, 346-352
Localization:Where is an entity acting?
Function:What does the entity do?
Process:When is the entity needed?
inferences on “is a”
“part of”
“regulates”
“has part”
from geneontology.org from Ashburner et al., Nat Genet. 2000, 25(1):25-9.
GO AND CONTEXT
THE BRANCHES OF GO AND THE IOT
Localization: inside, (my?) home, living room
Function:
measures temperature
regulates temperature
interacts with user directly
interacts with user via app
Process:
regulation of temperature
measurement of ambient temperature
‘is proxy / is avatar’ for
presence
fire
ice age
A LAST WORD ON PRAGMATISM
“perfect” ontology
The SSN Ontology allows for
inference entirely on the basis
of its structure and annotation.
In reality, many parameters are
difficult to establish and the
effort to annotate things
outweighs the utility.
“crude” ontology
A simplified structure allows for
quick annotation even by non-
specialists.
The lack of details can lead to
clashes in the ontology =>
more smartness has to go into
software; more coding effort.
1 billlion
different things
1 milllion
use cases
0 clean shirt left
+
washing machine estimates 97% of
your last pack of powder used
+
it’s Wednesday, 23:55
+
the last four Thursdays had a
morning business meeting
+
the car is parked 20 m from a shop
+
last retail activity: 8 sec ago
Send immediate text reminder
to pick up washing powder +
send tweet from @BorisHouse
“need identified” +
“notification appropriate”
Actionable insight.
From everything.
“not home”
“buying”
credit card: “highly personal device” ~ alive and awake
3% left and
not pressed
“indicator of esteem”
Today’s biology is a
quantitative, data-
rich science.
Infrastructure for ‘big
data’ was driven by
academics.
Data is only useful if
it can be turned
into knowledge.
Understanding of data
requires ‘data about
the data’.
Meta-data should be
in a universally
understood format.
Ontologies provide
context.
Gene Ontology
(GO) is a de facto
standard.
Human curation is
key to GO.
Public funders and
industry contribute
significantly to GO.
Should governments
be involved in IoT?
GO is not a ‘one fits
all’, but has a few
useful concepts.
What does the thing
do? Thing function.
For what can the
thing be an avatar?
Thing process.
Where is the thing?
Thing localization.
@BorisAdryan

Mais conteúdo relacionado

Mais procurados

Fold For Covid
Fold For CovidFold For Covid
Fold For CovidBalena
 
Big Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research ActivityBig Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research ActivityAndry Alamsyah
 
AI In Cybersecurity – Challenges and Solutions
AI In Cybersecurity – Challenges and SolutionsAI In Cybersecurity – Challenges and Solutions
AI In Cybersecurity – Challenges and SolutionsZoneFox
 
Big data - What is It?
Big data - What is It?Big data - What is It?
Big data - What is It?Nicole Aidney
 
Data Science: Not Just For Big Data
Data Science: Not Just For Big DataData Science: Not Just For Big Data
Data Science: Not Just For Big DataRevolution Analytics
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsSri Ambati
 
SapientNitro Strata_presentation_upload
SapientNitro Strata_presentation_uploadSapientNitro Strata_presentation_upload
SapientNitro Strata_presentation_uploadOReillyStrata
 
AI & ML in Cyber Security - Why Algorithms Are Dangerous
AI & ML in Cyber Security - Why Algorithms Are DangerousAI & ML in Cyber Security - Why Algorithms Are Dangerous
AI & ML in Cyber Security - Why Algorithms Are DangerousRaffael Marty
 
Big Data and Computer Science Education
Big Data and Computer Science EducationBig Data and Computer Science Education
Big Data and Computer Science EducationJames Hendler
 
The Internet of Things, Productivity, and Employment
The Internet of Things, Productivity, and Employment The Internet of Things, Productivity, and Employment
The Internet of Things, Productivity, and Employment Alex Krause
 
THE INTERNET OF THINGS, PRODUCTIVITY AND EMPLOYMENT Boston 0915
THE INTERNET OF THINGS, PRODUCTIVITY AND EMPLOYMENT Boston 0915THE INTERNET OF THINGS, PRODUCTIVITY AND EMPLOYMENT Boston 0915
THE INTERNET OF THINGS, PRODUCTIVITY AND EMPLOYMENT Boston 0915Economic Strategy Institute
 
TopConf Linz, 02/02/2016
TopConf Linz, 02/02/2016TopConf Linz, 02/02/2016
TopConf Linz, 02/02/2016Boris Adryan
 
W-JAX Keynote - Big Data and Corporate Evolution
W-JAX Keynote - Big Data and Corporate EvolutionW-JAX Keynote - Big Data and Corporate Evolution
W-JAX Keynote - Big Data and Corporate Evolutionjstogdill
 
Big data, big opportunities
Big data, big opportunitiesBig data, big opportunities
Big data, big opportunitiesChouaieb NEMRI
 
National seminar on emergence of internet of things (io t) trends and challe...
National seminar on emergence of internet of things (io t)  trends and challe...National seminar on emergence of internet of things (io t)  trends and challe...
National seminar on emergence of internet of things (io t) trends and challe...Ajay Ohri
 
The Science of Data Science
The Science of Data Science The Science of Data Science
The Science of Data Science James Hendler
 
AI in the Real World: Challenges, and Risks and how to handle them?
AI in the Real World: Challenges, and Risks and how to handle them?AI in the Real World: Challenges, and Risks and how to handle them?
AI in the Real World: Challenges, and Risks and how to handle them?Srinath Perera
 

Mais procurados (20)

Fold For Covid
Fold For CovidFold For Covid
Fold For Covid
 
Big Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research ActivityBig Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research Activity
 
Artificial intelligence - Digital Readiness.
Artificial intelligence - Digital Readiness.Artificial intelligence - Digital Readiness.
Artificial intelligence - Digital Readiness.
 
AI In Cybersecurity – Challenges and Solutions
AI In Cybersecurity – Challenges and SolutionsAI In Cybersecurity – Challenges and Solutions
AI In Cybersecurity – Challenges and Solutions
 
Big data - What is It?
Big data - What is It?Big data - What is It?
Big data - What is It?
 
Data Science: Not Just For Big Data
Data Science: Not Just For Big DataData Science: Not Just For Big Data
Data Science: Not Just For Big Data
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data Scientists
 
SapientNitro Strata_presentation_upload
SapientNitro Strata_presentation_uploadSapientNitro Strata_presentation_upload
SapientNitro Strata_presentation_upload
 
AI & ML in Cyber Security - Why Algorithms Are Dangerous
AI & ML in Cyber Security - Why Algorithms Are DangerousAI & ML in Cyber Security - Why Algorithms Are Dangerous
AI & ML in Cyber Security - Why Algorithms Are Dangerous
 
Thingmonk 2015
Thingmonk 2015Thingmonk 2015
Thingmonk 2015
 
Big Data and Computer Science Education
Big Data and Computer Science EducationBig Data and Computer Science Education
Big Data and Computer Science Education
 
The Internet of Things, Productivity, and Employment
The Internet of Things, Productivity, and Employment The Internet of Things, Productivity, and Employment
The Internet of Things, Productivity, and Employment
 
Cyber security and AI
Cyber security and AICyber security and AI
Cyber security and AI
 
THE INTERNET OF THINGS, PRODUCTIVITY AND EMPLOYMENT Boston 0915
THE INTERNET OF THINGS, PRODUCTIVITY AND EMPLOYMENT Boston 0915THE INTERNET OF THINGS, PRODUCTIVITY AND EMPLOYMENT Boston 0915
THE INTERNET OF THINGS, PRODUCTIVITY AND EMPLOYMENT Boston 0915
 
TopConf Linz, 02/02/2016
TopConf Linz, 02/02/2016TopConf Linz, 02/02/2016
TopConf Linz, 02/02/2016
 
W-JAX Keynote - Big Data and Corporate Evolution
W-JAX Keynote - Big Data and Corporate EvolutionW-JAX Keynote - Big Data and Corporate Evolution
W-JAX Keynote - Big Data and Corporate Evolution
 
Big data, big opportunities
Big data, big opportunitiesBig data, big opportunities
Big data, big opportunities
 
National seminar on emergence of internet of things (io t) trends and challe...
National seminar on emergence of internet of things (io t)  trends and challe...National seminar on emergence of internet of things (io t)  trends and challe...
National seminar on emergence of internet of things (io t) trends and challe...
 
The Science of Data Science
The Science of Data Science The Science of Data Science
The Science of Data Science
 
AI in the Real World: Challenges, and Risks and how to handle them?
AI in the Real World: Challenges, and Risks and how to handle them?AI in the Real World: Challenges, and Risks and how to handle them?
AI in the Real World: Challenges, and Risks and how to handle them?
 

Destaque

Industry of Things World - Berlin 19-09-16
Industry of Things World - Berlin 19-09-16Industry of Things World - Berlin 19-09-16
Industry of Things World - Berlin 19-09-16Boris Adryan
 
EclipseCon France 2015 - Science Track
EclipseCon France 2015 - Science TrackEclipseCon France 2015 - Science Track
EclipseCon France 2015 - Science TrackBoris Adryan
 
Eclipse IoT - Day 0 of thingmonk 2016
Eclipse IoT - Day 0 of  thingmonk 2016Eclipse IoT - Day 0 of  thingmonk 2016
Eclipse IoT - Day 0 of thingmonk 2016Boris Adryan
 
Just because you can doesn't mean that you should - thingmonk 2016
Just because you can doesn't mean that you should - thingmonk 2016Just because you can doesn't mean that you should - thingmonk 2016
Just because you can doesn't mean that you should - thingmonk 2016Boris Adryan
 
Plattformen für das Internet der Dinge, solutions.hamburg, 05.09.16
Plattformen für das Internet der Dinge, solutions.hamburg, 05.09.16Plattformen für das Internet der Dinge, solutions.hamburg, 05.09.16
Plattformen für das Internet der Dinge, solutions.hamburg, 05.09.16Boris Adryan
 
Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16
Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16
Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16Boris Adryan
 
Eclipse IoT - ecosystem
Eclipse IoT - ecosystemEclipse IoT - ecosystem
Eclipse IoT - ecosystemBoris Adryan
 

Destaque (7)

Industry of Things World - Berlin 19-09-16
Industry of Things World - Berlin 19-09-16Industry of Things World - Berlin 19-09-16
Industry of Things World - Berlin 19-09-16
 
EclipseCon France 2015 - Science Track
EclipseCon France 2015 - Science TrackEclipseCon France 2015 - Science Track
EclipseCon France 2015 - Science Track
 
Eclipse IoT - Day 0 of thingmonk 2016
Eclipse IoT - Day 0 of  thingmonk 2016Eclipse IoT - Day 0 of  thingmonk 2016
Eclipse IoT - Day 0 of thingmonk 2016
 
Just because you can doesn't mean that you should - thingmonk 2016
Just because you can doesn't mean that you should - thingmonk 2016Just because you can doesn't mean that you should - thingmonk 2016
Just because you can doesn't mean that you should - thingmonk 2016
 
Plattformen für das Internet der Dinge, solutions.hamburg, 05.09.16
Plattformen für das Internet der Dinge, solutions.hamburg, 05.09.16Plattformen für das Internet der Dinge, solutions.hamburg, 05.09.16
Plattformen für das Internet der Dinge, solutions.hamburg, 05.09.16
 
Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16
Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16
Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16
 
Eclipse IoT - ecosystem
Eclipse IoT - ecosystemEclipse IoT - ecosystem
Eclipse IoT - ecosystem
 

Semelhante a Organizing the Internet of Things with Ontologies for Actionable Insight

High-Performance Networking Use Cases in Life Sciences
High-Performance Networking Use Cases in Life SciencesHigh-Performance Networking Use Cases in Life Sciences
High-Performance Networking Use Cases in Life SciencesAri Berman
 
How to win the 4th Industrial Revolution
How to win the 4th Industrial RevolutionHow to win the 4th Industrial Revolution
How to win the 4th Industrial RevolutionArlen Meyers, MD, MBA
 
2019 June 27 - Big data and data science
2019 June 27 - Big data and data science2019 June 27 - Big data and data science
2019 June 27 - Big data and data scienceFabio Stella
 
Defrosting the Digital Library: A survey of bibliographic tools for the next ...
Defrosting the Digital Library: A survey of bibliographic tools for the next ...Defrosting the Digital Library: A survey of bibliographic tools for the next ...
Defrosting the Digital Library: A survey of bibliographic tools for the next ...Duncan Hull
 
(Em)Powering Science: High-Performance Infrastructure in Biomedical Science
(Em)Powering Science: High-Performance Infrastructure in Biomedical Science(Em)Powering Science: High-Performance Infrastructure in Biomedical Science
(Em)Powering Science: High-Performance Infrastructure in Biomedical ScienceAri Berman
 
Informatics Transform : Re-engineering Libraries for the Data Decade
Informatics Transform : Re-engineering Libraries for the Data DecadeInformatics Transform : Re-engineering Libraries for the Data Decade
Informatics Transform : Re-engineering Libraries for the Data DecadeLiz Lyon
 
BIG DATA | How to explain it & how to use it for your career?
BIG DATA | How to explain it & how to use it for your career?BIG DATA | How to explain it & how to use it for your career?
BIG DATA | How to explain it & how to use it for your career?Tuan Yang
 
Business Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili SaghafiBusiness Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili SaghafiProfessor Lili Saghafi
 
Ontology for the Financial Services Industry
Ontology for the Financial Services IndustryOntology for the Financial Services Industry
Ontology for the Financial Services IndustryBarry Smith
 
[Webinar] The Internet of Things and the Coming Data Deluge
[Webinar] The Internet of Things and the Coming Data Deluge[Webinar] The Internet of Things and the Coming Data Deluge
[Webinar] The Internet of Things and the Coming Data DelugeInsightInnovation
 
Essay Writing Online. . Step-By-Step Guide to Essay Writing - ESL Buzz
Essay Writing Online. . Step-By-Step Guide to Essay Writing - ESL BuzzEssay Writing Online. . Step-By-Step Guide to Essay Writing - ESL Buzz
Essay Writing Online. . Step-By-Step Guide to Essay Writing - ESL BuzzKari Wilson
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to BioinformaticsLeighton Pritchard
 
Blogs Logs Pods: Smart Labs
Blogs Logs Pods: Smart LabsBlogs Logs Pods: Smart Labs
Blogs Logs Pods: Smart LabsJeremy Frey
 
The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960mare34
 
The Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsThe Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsDuncan Hull
 
Data has a gravity and is attracting decisions
Data has a gravity and is attracting decisionsData has a gravity and is attracting decisions
Data has a gravity and is attracting decisionsPietro Leo
 
Diag internet of things
Diag internet of thingsDiag internet of things
Diag internet of thingsPeter Dreyer
 
DMTM 2015 - 02 Data Mining
DMTM 2015 - 02 Data MiningDMTM 2015 - 02 Data Mining
DMTM 2015 - 02 Data MiningPier Luca Lanzi
 

Semelhante a Organizing the Internet of Things with Ontologies for Actionable Insight (20)

Boris IoT slides
Boris IoT slides Boris IoT slides
Boris IoT slides
 
High-Performance Networking Use Cases in Life Sciences
High-Performance Networking Use Cases in Life SciencesHigh-Performance Networking Use Cases in Life Sciences
High-Performance Networking Use Cases in Life Sciences
 
How to win the 4th Industrial Revolution
How to win the 4th Industrial RevolutionHow to win the 4th Industrial Revolution
How to win the 4th Industrial Revolution
 
Better Data for a Better World
Better Data for a Better WorldBetter Data for a Better World
Better Data for a Better World
 
2019 June 27 - Big data and data science
2019 June 27 - Big data and data science2019 June 27 - Big data and data science
2019 June 27 - Big data and data science
 
Defrosting the Digital Library: A survey of bibliographic tools for the next ...
Defrosting the Digital Library: A survey of bibliographic tools for the next ...Defrosting the Digital Library: A survey of bibliographic tools for the next ...
Defrosting the Digital Library: A survey of bibliographic tools for the next ...
 
(Em)Powering Science: High-Performance Infrastructure in Biomedical Science
(Em)Powering Science: High-Performance Infrastructure in Biomedical Science(Em)Powering Science: High-Performance Infrastructure in Biomedical Science
(Em)Powering Science: High-Performance Infrastructure in Biomedical Science
 
Informatics Transform : Re-engineering Libraries for the Data Decade
Informatics Transform : Re-engineering Libraries for the Data DecadeInformatics Transform : Re-engineering Libraries for the Data Decade
Informatics Transform : Re-engineering Libraries for the Data Decade
 
BIG DATA | How to explain it & how to use it for your career?
BIG DATA | How to explain it & how to use it for your career?BIG DATA | How to explain it & how to use it for your career?
BIG DATA | How to explain it & how to use it for your career?
 
Business Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili SaghafiBusiness Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili Saghafi
 
Ontology for the Financial Services Industry
Ontology for the Financial Services IndustryOntology for the Financial Services Industry
Ontology for the Financial Services Industry
 
[Webinar] The Internet of Things and the Coming Data Deluge
[Webinar] The Internet of Things and the Coming Data Deluge[Webinar] The Internet of Things and the Coming Data Deluge
[Webinar] The Internet of Things and the Coming Data Deluge
 
Essay Writing Online. . Step-By-Step Guide to Essay Writing - ESL Buzz
Essay Writing Online. . Step-By-Step Guide to Essay Writing - ESL BuzzEssay Writing Online. . Step-By-Step Guide to Essay Writing - ESL Buzz
Essay Writing Online. . Step-By-Step Guide to Essay Writing - ESL Buzz
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Blogs Logs Pods: Smart Labs
Blogs Logs Pods: Smart LabsBlogs Logs Pods: Smart Labs
Blogs Logs Pods: Smart Labs
 
The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960
 
The Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsThe Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of Bioinformatics
 
Data has a gravity and is attracting decisions
Data has a gravity and is attracting decisionsData has a gravity and is attracting decisions
Data has a gravity and is attracting decisions
 
Diag internet of things
Diag internet of thingsDiag internet of things
Diag internet of things
 
DMTM 2015 - 02 Data Mining
DMTM 2015 - 02 Data MiningDMTM 2015 - 02 Data Mining
DMTM 2015 - 02 Data Mining
 

Mais de Boris Adryan

Computational decision making
Computational decision makingComputational decision making
Computational decision makingBoris Adryan
 
Development and Deployment: The Human Factor
Development and Deployment: The Human FactorDevelopment and Deployment: The Human Factor
Development and Deployment: The Human FactorBoris Adryan
 
Zühlke Meetup - Mai 2017
Zühlke Meetup - Mai 2017Zühlke Meetup - Mai 2017
Zühlke Meetup - Mai 2017Boris Adryan
 
Node-RED and Minecraft - CamJam September 2015
Node-RED and Minecraft - CamJam September 2015Node-RED and Minecraft - CamJam September 2015
Node-RED and Minecraft - CamJam September 2015Boris Adryan
 
Node-RED workshop at IoT Toulouse
Node-RED workshop at IoT ToulouseNode-RED workshop at IoT Toulouse
Node-RED workshop at IoT ToulouseBoris Adryan
 
An introduction to workflow-based programming with Node-RED
An introduction to workflow-based programming with Node-REDAn introduction to workflow-based programming with Node-RED
An introduction to workflow-based programming with Node-REDBoris Adryan
 
Wiring the Internet of Things with Node-RED, @IoTConf talk, September '14
Wiring the Internet of Things with Node-RED, @IoTConf talk, September '14Wiring the Internet of Things with Node-RED, @IoTConf talk, September '14
Wiring the Internet of Things with Node-RED, @IoTConf talk, September '14Boris Adryan
 
Node-RED and getting started on the Internet of Things
Node-RED and getting started on the Internet of ThingsNode-RED and getting started on the Internet of Things
Node-RED and getting started on the Internet of ThingsBoris Adryan
 
Node-RED Interoperability Test
Node-RED Interoperability TestNode-RED Interoperability Test
Node-RED Interoperability TestBoris Adryan
 

Mais de Boris Adryan (9)

Computational decision making
Computational decision makingComputational decision making
Computational decision making
 
Development and Deployment: The Human Factor
Development and Deployment: The Human FactorDevelopment and Deployment: The Human Factor
Development and Deployment: The Human Factor
 
Zühlke Meetup - Mai 2017
Zühlke Meetup - Mai 2017Zühlke Meetup - Mai 2017
Zühlke Meetup - Mai 2017
 
Node-RED and Minecraft - CamJam September 2015
Node-RED and Minecraft - CamJam September 2015Node-RED and Minecraft - CamJam September 2015
Node-RED and Minecraft - CamJam September 2015
 
Node-RED workshop at IoT Toulouse
Node-RED workshop at IoT ToulouseNode-RED workshop at IoT Toulouse
Node-RED workshop at IoT Toulouse
 
An introduction to workflow-based programming with Node-RED
An introduction to workflow-based programming with Node-REDAn introduction to workflow-based programming with Node-RED
An introduction to workflow-based programming with Node-RED
 
Wiring the Internet of Things with Node-RED, @IoTConf talk, September '14
Wiring the Internet of Things with Node-RED, @IoTConf talk, September '14Wiring the Internet of Things with Node-RED, @IoTConf talk, September '14
Wiring the Internet of Things with Node-RED, @IoTConf talk, September '14
 
Node-RED and getting started on the Internet of Things
Node-RED and getting started on the Internet of ThingsNode-RED and getting started on the Internet of Things
Node-RED and getting started on the Internet of Things
 
Node-RED Interoperability Test
Node-RED Interoperability TestNode-RED Interoperability Test
Node-RED Interoperability Test
 

Último

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 

Último (20)

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 

Organizing the Internet of Things with Ontologies for Actionable Insight

  • 1. ORGANIZING THE INTERNET OF THINGS ACTIONABLE INSIGHT THROUGH ONTOLOGIES Boris Adryan badryan@gmail.com
  • 2. • Computational biologist • Research group leader • Advisor at • 2015 Fellow of the Who is @BorisAdryan
  • 3. • Why a biologist is interested in large, unstructured data • What wrong is with the IoT in its current state • How biologists deal with similar problems • Which academic concepts would be useful in the IoT WHAT TO EXPECT IN THE NEXT HOUR… (including questions!)
  • 4. • Why a biologist is interested in large, unstructured data • What wrong is with the IoT in its current state • How biologists deal with similar problems • Which academic concepts would be useful in the IoT WHAT TO EXPECT IN THE NEXT 10 MINUTES
  • 5. DNA = storage of a blueprint RNA = ‘active copy’ of DNA protein = the building blocks of cells and tissues LIFE AS WE KNOW IT transcription translation Gregor Johann Mendel, exhibited in the Library at the NIMR
  • 6. ‣ Reading DNA information ‣ Determining “the sequence of a gene” was a PhD in the early 1980s ‣ Data processing was mainly transcribing the observation into a research paper BIOLOGY THEN AND NOW SEQUENCE INFORMATION Sanger sequencing ca. 1980 http://www.eplantscience.com
  • 7. 189,739,230,107 bases base pairs on 15th April 2015 (from 159,813,411,760 bases pairs in April 2015) ‣ We can sequence a human genome in half a day ‣ Sequence databases grow faster than storage capacity ‣ Data processing is the key step in scientific understanding BIOLOGY THEN AND NOW SEQUENCE INFORMATION 1990: automation kilobases a day 2007: next-gen seq megabases a day 2015: 1000s of instruments world-wide
  • 8. BIOLOGY THEN AND NOW GENE ACTIVITY INFORMATION ‣ When are genes needed? ‣ Classical molecular biology workflow, taking days… ‣ Data is semi-quantitative; testing one gene at the time Northern blot, ca. 1995 ‣ High-throughput gene expression profiling since mid-1990s ‣ Quantitative information for every gene in an organism ‣ Key challenge is the graphical representation and interpretation of the data screenshot from FlyBase, today
  • 9. 2 6 ATP ‣ Signal transduction and metabolic pathways ‣ Characterisation of proteins and substrates that mediate chemical reactions ‣ Nobel prize material BIOLOGY THEN AND NOW BIOCHEMISTRY
  • 10. ‣ We know about 250k metabolites ‣ 100k protein structures ‣ on the order of 10k different chemical reactions BIOLOGY THEN AND NOW BIOCHEMISTRY “The Robot Scientist” “small molecules” (Organic & Biomolecular Chemistry Blog) protein (via the Protein Databank, www.pdb.org)
  • 11. ‣Everything is connected ‣ Big, noisy, often unstructured data ‣ We are learning how biological entities depend on each other DNA > RNA > proteins
  • 12. • Why a biologist is interested in large, unstructured data • What wrong is with the IoT in its current state • How biologists deal with similar problems • Which academic concepts would be useful in the IoT WHAT TO EXPECT IN THE NEXT 5 MINUTES
  • 13. ‣ Everything is connected ‣ Big, noisy, often unstructured data www.thingslearn.com Analytics, context integration, machine learning and predictive modelling for the IoT.
  • 14. 0 clean shirt left + washing machine estimates 97% of your last pack of powder used + it’s Wednesday, 23:55 + the last four Thursdays had a morning business meeting + the car is parked 20 m from a shop + last retail activity: 8 sec ago Send immediate text reminder to pick up washing powder + send tweet from @BorisHouse “need identified” + “notification appropriate” Actionable insight. From everything.
  • 15. NO ANALYTICAL FLEXIBILITY IN M2M/IOT Matt Hatton, Machina Research The BLN IoT ‘14 Internet replaces wire It’s all about the context M2M consumer IoT defined I-P-O like it’s 1975 context context context context context context context Is this hot?
  • 16. LIFE SCIENCE STRATEGIES DON’T WORK IN THE IOT - There are no commonly accepted - ‘catalogue’ of things, - ‘ontology’ of things, - ‘data format’ of things, - ‘meta data’ for things. - Most businesses are driven by revenue, not long-term strategic vision - Service providers have no need to publish - Data can be highly personal (cheap excuse) unless they’re
  • 17. Trojan Room coffee pot - ca. 1993 Oct. 1995 “The Internet of Things” Kevin Ashton, ca. 1999 20 YEARS OF NON-CONVERGENT EVOLUTION FIRST DATA POTENTIAL RECOGNISED TODAY’S REALITY “ignorant coexistence” ➡ Commonly accepted platforms and formats for data exchange ➡ Meta-data deposition is a must ➡ Infrastructure provides entry point for computational knowledge inference “designed to ask questions”
  • 18. • Why a biologist is interested in large, unstructured data • What wrong is with the IoT in its current state • How biologists deal with similar problems • Which academic concepts would be useful in the IoT WHAT TO EXPECT IN THE NEXT 10 MINUTES
  • 19. Oct. 1995 TOWARDS MIAMI STANDARD AND DATA REPOSITORIES cf. IoT Nov. 1993 MInimal Annotation for MIcroarray Info
  • 20. META DATA, SHARING AND DATA REPOSITORIES founded in Nov. 1999 But this is a complex and ambitious project, and is one of the biggest challenges that bioinformatics has yet faced. Major difficulties stem from the detail required to describe the conditions of an experiment, and the relative and imprecise nature of measurements of expression levels.The potentially huge volume of data only adds to these difficulties. Nature Feb. 2000 “ “ Nov. 2000 Oct. 2002 Wide adoption as requirement for publication in scientific journals
  • 21. META DATA, SHARING AND DATA REPOSITORIES cf. IoT 2014 since 2003 http://en.wikipedia.org/wiki/Silo
  • 22. THE LIFE SCIENCES FIXED THEIR KNOWLEDGE REPRESENTATION PROBLEM
  • 25. CURRENT GOVERNMENT INVESTMENTS INTO GENE ONTOLOGY NIH alone spent $44,616,906 on the ontology structure since 2001 (I don’t have data for UK/EU spendings) ~100 full-time salaries for experts with domain-specific knowledge ~40,000 terms
  • 26. story measurements + meta data open, public repositories human curators ontology terms community PUBLISH OR PERISH ok? journal informal exchange - no credit! funders assessment The majority of this infrastructure is paid for by governments and charities industry!
  • 27.
  • 28. OUR PROBLEM IS KNOWLEDGE DATA != INSIGHT WITHOUT ORGANISING IT
  • 29. • Why a biologist is interested in large, unstructured data • What wrong is with the IoT in its current state • How biologists deal with similar problems • Which academic concepts would be useful in the IoT WHAT TO EXPECT IN THE NEXT 10 MINUTES
  • 30. measurements + meta data storage & provenance human curators ontology terms user PUBLISH OR YOU’RE NOT DOING IOT ok? Maybe the majority of this infrastructure should be paid for by governments? company cloud device registration “ “ privileges dataadded value
  • 31. WHAT IS AN ONTOLOGY? used to establish conceptual connection between entities knowledge inference finger ontology structure - body part - limb - arm - hand - thumb - fingerontology rules ‣controlled vocabulary ‣clearly defined relationships is a is a connects to part of with ontological reasoning, a computer can infer that “finger is a body part”, although we haven’t explicitly defined it that way
  • 32. ARE PEOPLE NOT ALREADY USING ONTOLOGIES IN THE IOT? Semantic Sensor Network Ontology “thermostat” The idea is not new! Cf. extension of the semantic web with the Semantic Sensor Network. ‣catalogs ‣conventions http://www.w3.org/2005/Incubator/ssn/ssnx/ssn
  • 33. ONTOLOGIES HAVE TO BE PRAGMATIC COMPROMISES Gene Ontology annotation 15 years of research 47 publications 100+ authors 50+ PhDs 15 direct annotations ~150 inferred annotations
  • 34. THE THREE BRANCHES OF Adapted from Anurag et al., Mol. BioSyst., 2012,8, 346-352 Localization:Where is an entity acting? Function:What does the entity do? Process:When is the entity needed?
  • 35. inferences on “is a” “part of” “regulates” “has part” from geneontology.org from Ashburner et al., Nat Genet. 2000, 25(1):25-9. GO AND CONTEXT
  • 36. THE BRANCHES OF GO AND THE IOT Localization: inside, (my?) home, living room Function: measures temperature regulates temperature interacts with user directly interacts with user via app Process: regulation of temperature measurement of ambient temperature ‘is proxy / is avatar’ for presence fire ice age
  • 37. A LAST WORD ON PRAGMATISM “perfect” ontology The SSN Ontology allows for inference entirely on the basis of its structure and annotation. In reality, many parameters are difficult to establish and the effort to annotate things outweighs the utility. “crude” ontology A simplified structure allows for quick annotation even by non- specialists. The lack of details can lead to clashes in the ontology => more smartness has to go into software; more coding effort. 1 billlion different things 1 milllion use cases
  • 38. 0 clean shirt left + washing machine estimates 97% of your last pack of powder used + it’s Wednesday, 23:55 + the last four Thursdays had a morning business meeting + the car is parked 20 m from a shop + last retail activity: 8 sec ago Send immediate text reminder to pick up washing powder + send tweet from @BorisHouse “need identified” + “notification appropriate” Actionable insight. From everything. “not home” “buying” credit card: “highly personal device” ~ alive and awake 3% left and not pressed “indicator of esteem”
  • 39. Today’s biology is a quantitative, data- rich science. Infrastructure for ‘big data’ was driven by academics. Data is only useful if it can be turned into knowledge. Understanding of data requires ‘data about the data’. Meta-data should be in a universally understood format. Ontologies provide context. Gene Ontology (GO) is a de facto standard. Human curation is key to GO. Public funders and industry contribute significantly to GO. Should governments be involved in IoT? GO is not a ‘one fits all’, but has a few useful concepts. What does the thing do? Thing function. For what can the thing be an avatar? Thing process. Where is the thing? Thing localization. @BorisAdryan