MedChemica
BigData
‘What is that ALL about?’
Al Dossetter
al.dossetter@medchemica.com
MedChemica Limited
Macclesfield Sci Bar
25th April 2016
Big Data – ‘What is that all about?’
•  Introduction to Big Data
•  Examples from History
•  Big Data and science
•  MedChemica – advancing drug design
through actionable knowledge
About Us
Passionate about generating better decisions from data
Dr Andrew G. Leach
Technical Director
Liverpool John Moores
12 years experience
Applied computational
and medicinal chemistry
Dr Ed Griffen
Technical Director
21 years experience
Medicinal chemistry and
large scale statistical
analysis methods
Dr Al Dossetter
Managing Director
17 years Medicinal chemistry and
extensive cloud computing
experience
Dr Ali Griffen
Business Analyst
PhD Fungal Vascular wilt disease
21 years experience Team leader
bioscientist and biological data
curation
Dr Shane Montague
Lead Data Scientist
PhD Computer Science
13 years experience Data
science and information
security
Dr Jia Wu
Consultant Data Scientist
PhD Machine Learning
12 years experience in data
mining and machine learning.
Projects in finance, energy and
criminology.
Best Definition of Big Data
•  Any analysis of a data set that is too large to
do by hand
–  Requires computational techniques
–  Requires statistical techniques
•  Yields
–  Knowledge
-  Knowledge that can be counter intuitive
  It got ‘Big’ because:
-  the internet made a lot of data available very
quickly (often for free)
  It got interesting because:
-  Knowledge yields real benefits to the bottom line
-  Reduced costs or increased sales
  You the consumer benefit….
-  Cheaper goods, available on-line
-  Flights on time, trains on time, deliveries on time
Big Data
“The Revolution that will
change the world we live in”
•  Principles of Big Data
–  Use ALL of the Data
•  however noisy
–  Analyse in an unbiased way
–  “DO WHAT” it tells you
•  Do Not Worry About “WHY”
–  KEEP everything
•  ‘you never know what question you
want to ask’
The 4 Vs
•  Picture from Google or someone
•  What does it mean?
•  Mostly it is about using lots of computers
Most issues are sorted out by more CPUs, more drive
space, and better stats
It's actually been around quite a while…
•  It was genius to break the codes
•  The further genius was collating the data and reducing it so
that analysts could use it in a timely manner (volume /
velocity / veracity)
•  ….saved many, many lives on both sides
….and banking, finance and trading
What do Nappies and Beer have in common?
•  Analysis of shopping habits found these two things were bought together
•  Put them close together in the store and sell more
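A minimal sketch of how such a "bought together" signal could be found, assuming each shopping basket is just a list of item names. The baskets are invented for illustration, and `pair_lift` is a hypothetical helper, not the retailer's actual method. Lift above 1 means a pair turns up together more often than chance would suggest.

```python
from collections import Counter
from itertools import combinations


def pair_lift(baskets):
    """Return {(item_a, item_b): lift} for every pair of items seen together."""
    n = len(baskets)
    item_counts = Counter()
    pair_counts = Counter()
    for basket in baskets:
        items = sorted(set(basket))  # ignore duplicates within one basket
        item_counts.update(items)
        pair_counts.update(combinations(items, 2))
    return {
        pair: (count / n) / ((item_counts[pair[0]] / n) * (item_counts[pair[1]] / n))
        for pair, count in pair_counts.items()
    }


# Made-up baskets, for illustration only
baskets = [
    ["nappies", "beer", "crisps"],
    ["nappies", "beer"],
    ["bread", "milk"],
    ["nappies", "beer", "milk"],
    ["bread", "crisps"],
    ["milk", "bread", "beer"],
]
lift = pair_lift(baskets)
best_pair = max(lift, key=lift.get)  # the pair with the strongest association
```

With real data the same counting would run over millions of baskets, which is where the extra CPUs and drive space come in.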
UPS delivery service
•  Fitted sensors to all delivery
trucks and gathered data
•  Analysed data to detect
early engine issues BEFORE
breakdown
•  Therefore FIX early and
keep the van on the road
•  The customer benefits
because:
•  Deliveries on-time
•  Even larger dataset – high
degree of prediction on
delivery times
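One simple way to picture "detect early engine issues before breakdown" is to compare each new sensor reading against a baseline taken while the engine was healthy. The sketch below assumes that approach; the temperatures, the baseline size and the threshold are all invented for illustration and say nothing about what UPS actually measures.

```python
from statistics import mean, stdev


def early_warnings(readings, baseline_size=5, threshold=3.0):
    """Return indices of readings that drift far from the healthy baseline."""
    baseline = readings[:baseline_size]
    mu, sigma = mean(baseline), stdev(baseline)
    return [
        i
        for i, value in enumerate(readings[baseline_size:], start=baseline_size)
        if abs(value - mu) > threshold * sigma
    ]


# Hypothetical engine temperatures (degrees C); values invented for illustration
temps = [90.0, 91.0, 89.0, 90.0, 90.0, 91.0, 90.0, 96.0, 99.0]
flagged = early_warnings(temps)  # indices of readings worth a look in the workshop
```

The last two readings are flagged while the van is still running, which is the point: fix early and keep it on the road.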
Jet Engines – reliable service
•  Sensors on jet engines – monitored in flight
•  Similar to UPS
•  Therefore FIX early and keep the planes in the air
•  The customer benefits because:
•  Flights on time and reliable
Google translate
The Unreasonable Effectiveness of Data
“Because of a huge shared cognitive and cultural
context, linguistic expression can be highly ambiguous
and still often be understood correctly.”
•  https://en.wikipedia.org/wiki/File:Google_Translate_Icon.png
•  https://en.wikipedia.org/wiki/Google_Translate
•  https://www.youtube.com/watch?v=yvDCzhbjYWs
•  University of British Columbia Distinguished Lecture Series - Sept 23rd 2011
Groups or pairs of words are associated together on
websites around the internet
Statistical analysis of the frequency of pairing
Therefore this word (or group) probably translates into
that word
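A toy sketch of the pairing idea, assuming we already have sentences paired with their translations. It counts how often each source word appears alongside each target word and picks the strongest association using a Dice score. This is a deliberately simple stand-in chosen for illustration, not a description of how Google Translate works; `build_translator` and the four sentence pairs are hypothetical.

```python
from collections import Counter


def build_translator(sentence_pairs):
    """Learn word associations from (source sentence, target sentence) pairs."""
    src_counts, tgt_counts, pair_counts = Counter(), Counter(), Counter()
    for src_sentence, tgt_sentence in sentence_pairs:
        src_words = set(src_sentence.split())
        tgt_words = set(tgt_sentence.split())
        src_counts.update(src_words)
        tgt_counts.update(tgt_words)
        pair_counts.update((s, t) for s in src_words for t in tgt_words)

    def translate(word):
        # Dice score: high when the two words nearly always appear together
        candidates = {
            t: 2 * c / (src_counts[s] + tgt_counts[t])
            for (s, t), c in pair_counts.items()
            if s == word
        }
        return max(candidates, key=candidates.get) if candidates else None

    return translate


# Invented example sentences, for illustration only
pairs = [
    ("la maison", "the house"),
    ("la voiture", "the car"),
    ("maison bleue", "blue house"),
    ("voiture bleue", "blue car"),
]
translate = build_translator(pairs)
guess = translate("maison")
```

Nothing in the code knows any grammar; the answer comes purely from the frequency of pairing.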
What about science?
We need to be accurate (don’t we?)
•  Large Hadron Collider shows how we can gather a lot
of data very accurately
•  A large amount of data is needed to reduce the errors – very, very
big data
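Why more data reduces the error can be sketched with the standard error of a mean, which shrinks with the square root of the number of measurements. This assumes independent measurements with the same spread; the numbers are illustrative and are not Large Hadron Collider figures.

```python
import math


def standard_error(sigma, n):
    """Uncertainty on the mean of n independent measurements with spread sigma."""
    return sigma / math.sqrt(n)


def measurements_needed(sigma, target_error):
    """Smallest n for which the standard error falls to target_error."""
    return math.ceil((sigma / target_error) ** 2)


error_100 = standard_error(10.0, 100)       # 100 measurements
error_10000 = standard_error(10.0, 10_000)  # 100x the data, only 10x smaller error
needed = measurements_needed(10.0, 0.5)
```

Halving the error costs four times the data, so very accurate answers need very big data.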
The Life Science industry has woken up to Big Data
•  Human Genome
•  Biological systems
•  Kinome
•  Metabolomics
•  Proteomics
•  3D structural information (CCDC /
Protein Data Bank)
•  Literature and Patents (GVK Bio,
ChEMBL, Pubmed, PubChem)
•  Reaction informatics – what works,
what doesn’t
•  Document management
•  Regulatory submissions
Huge Opportunity in this area
What about life sciences?
•  Harder and harder to discover drugs
•  They have to work
•  They have to be safe
•  People want them cheaply
•  A description of the drug research and
development process
Company                    Ticker   Number of drugs approved   R&D Spending Per Drug ($Mil)   Total R&D Spending 1997-2011 ($Mil)
AstraZeneca                AZN      5                          11,790.93                      58,955
GlaxoSmithKline            GSK      10                         8,170.81                       81,708
Sanofi                     SNY      8                          7,909.26                       63,274
Pfizer Inc.                PFE      14                         7,727.03                       108,178
Roche Holding AG           RHHBY    11                         7,803.77                       85,841
Johnson & Johnson          JNJ      15                         5,885.65                       88,285
Eli Lilly & Co.            LLY      11                         4,577.04                       50,347
Abbott Laboratories        ABT      8                          4,496.21                       35,970
Merck & Co Inc             MRK      16                         4,209.99                       67,360
Bristol-Myers Squibb Co.   BMY      11                         4,152.26                       45,675
Novartis AG                NVS      21                         3,983.13                       83,646
Amgen Inc.                 AMGN     9                          3,692.14                       33,229
Sources: InnoThink Center For Research In Biomedical Innovation;
Thomson Reuters Fundamentals via FactSet Research Systems
The Truly Staggering Cost Of Inventing New Drugs
Matthew Herper - Forbes	
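The per-drug figure in the table is simply total R&D spending divided by the number of drugs approved. A short check using four rows copied from the table (the function name is ours); results agree with the printed per-drug column to within the rounding of the totals.

```python
# (company, drugs approved, total R&D spending 1997-2011 in $Mil), copied from the table
rows = [
    ("AstraZeneca", 5, 58955),
    ("GlaxoSmithKline", 10, 81708),
    ("Pfizer Inc.", 14, 108178),
    ("Amgen Inc.", 9, 33229),
]


def spend_per_drug(total_rd_musd, drugs_approved):
    """R&D spending per approved drug, in $Mil."""
    return total_rd_musd / drugs_approved


per_drug = {name: round(spend_per_drug(total, n), 2) for name, n, total in rows}
```

Each approved drug carries the cost of all the projects that failed along the way, which is why the figures run to billions of dollars.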

Drug failures later in development are mainly due to EFFICACY and SAFETY
Actual spending – LO (lead optimisation) projects are the biggest spend
Paul, S. M. et al How to improve R&D productivity: the pharmaceutical
industry’s grand challenge, Nat. Rev. Drug Discovery 2010, 9, 203
Snapshot of a medium-sized
company's R&D spend in one
year: $1.7 billion
For a period, large pharma set targets at each stage of the process – an
attrition model – which was unsuccessful and very wasteful
Better chemistry
Reduce the number
of projects
Chemistry influences success and speed
Methods that really work, new formulations
What Causes Attrition in Development?
•  PK: 7%
•  Lack of efficacy in man: 46%
•  Adverse effects in man: 17%
•  Animal toxicity: 16%
•  Commercial reasons: 7%
•  Miscellaneous: 7%
Many compounds fail in development through inadequate
pharmacokinetics / bioavailability and unacceptable
toxicological profiles in addition to lack of efficacy in man
Oral Dosing of Drugs
[Diagram labels: liver; kidneys; bladder; Dissolve; Cross Membranes; Metabolism; Avoid Excretion; BBB (Blood Brain Barrier); Target (maybe in the brain); Survive pH range 1.5-8]
•  Absorption
•  Distribution
•  Metabolism
•  Excretion
•  Toxicity
Better medicinal chemistry by sharing knowledge not data & structures
[Diagram: Roche, Genentech, AZ, Pharma 4 and Pharma 5 each run a rule finder on their own data. The rules go into the MedChemica Grand Rule Database, and each company gets the Grand Rule Database back for its own exploitation.]
•  >500 million pairs from companies
•  + 12 million from public data
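The idea of sharing knowledge rather than data and structures can be sketched as follows: each company summarises its own private pairs into rules, and only those summaries are pooled. This is an illustration under our own assumptions, not MedChemica's actual rule finder. The transform labels, the property changes and the function names `find_rules` and `merge_rules` are all hypothetical.

```python
from collections import defaultdict


def find_rules(pairs, min_pairs=2):
    """Local 'rule finder': summarise private pairs as transform -> (count, mean change)."""
    grouped = defaultdict(list)
    for transform, change in pairs:
        grouped[transform].append(change)
    # Rules backed by too few pairs are not shared
    return {
        t: (len(v), sum(v) / len(v))
        for t, v in grouped.items()
        if len(v) >= min_pairs
    }


def merge_rules(*rule_sets):
    """Pool rule summaries from several sources using count-weighted means."""
    totals = defaultdict(lambda: [0, 0.0])
    for rules in rule_sets:
        for transform, (count, mean_change) in rules.items():
            totals[transform][0] += count
            totals[transform][1] += count * mean_change
    return {t: (n, s / n) for t, (n, s) in totals.items()}


# Invented (transform, change in a measured property) pairs; never leave the company
company_a = [("H>>F", -0.5), ("H>>F", -0.25), ("CH3>>CF3", -1.0)]
company_b = [("H>>F", -0.75), ("H>>F", -0.5), ("H>>F", -0.5)]

# Only the summaries are combined into the shared rule set
grand_rules = merge_rules(find_rules(company_a), find_rules(company_b))
```

The pooled rule rests on five pairs instead of two or three, yet neither company has seen the other's compounds or measurements.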
  
…so what are
you going to
make next…?
Who is GOOD at Big Data?
The people making the money!
Chemical transform to improve metabolism
Chemists who wanted to fix metabolism also made these…
R =
SaltTraX© - contact@medchemica.com  support@elixirsoftware.co.uk
What about clinical safety?
SAFE DRUGS
•  ‘Potency’ – do not sacrifice; the better it is, the lower the dose
•  Improved testing in-vivo with fewer animals
•  Clinical linkage to protein target
•  Can test in-vivo
•  Anti SAR, e.g. hERG, Nav1.5, 5-HT2a…
•  Analysis of in-vivo data: Pfizer – rat data
•  Dose <0.2mg/Kg
•  Metabolism & Pharmacokinetics – better design so dose is lower
Grand Rule Database
Hughes et al, Bioorg Med Chem Lett. 2008, 18(17), 4872
Collaborators and Users
The ‘Internet of Things (IoT)’
A greater diversity of devices connected to the internet, with data flowing
to and from them
For example, smart watches
Lifestyle device – marketed on fitness / wellness
Like UPS vans and RR jet engines, can we detect illness pre-symptomatically?
Big Data – ‘What is that all about?’
•  Introduction to Big Data
–  Big enough to need a computer / advanced stats
•  Examples from History
–  Bletchley Park, UPS, Beer and Nappies….
•  Big Data and science
–  Large Hadron Collider….
•  MedChemica – Advancing drug design
through actionable knowledge
–  Allows sharing of knowledge to accelerate and
reduce costs of finding new, safe medicines

 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 

Último (20)

Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 

MedChemica BigData 'What is that ALL about?'

  • 1. MedChemica BigData ‘What is that ALL about?’ Al Dossetter al.dossetter@medchemica.com MedChemica Limited Macclesfield Sci Bar 25th April 2016
  • 2. Big Data – ‘What is that all about?’ •  Introduction to Big Data •  Examples from History •  Big Data and science •  MedChemica – advancing drug design through actionable knowledge
  • 3. About Us Passionate about generating better decisions from data Dr Andrew G. Leach Technical Director Liverpool John Moores 12 years experience Applied computational and medicinal chemistry Dr Ed Griffen Technical Director 21 years experience Medicinal chemistry and large scale statistical analysis methods Dr Al Dossetter Managing Director 17 years Medicinal chemistry and extensive cloud computing experience Dr Ali Griffen Business Analyst PhD Fungal Vascular wilt disease 21 years experience Team leader bioscientist and biological data curation Dr Shane Montague Lead Data Scientist PhD Computer Science 13 years experience Data science and information security Dr Jia Wu Consultant Data Scientist PhD Machine Learning 12 years experience in data mining and machine learning. Projects in finance, energy and criminology.
  • 4. Best Definition of Big Data •  Any analysis of a data set that is too large to do by hand –  Requires computational techniques –  Requires statistical techniques •  Yields –  Knowledge –  Knowledge that can be counter-intuitive •  It got ‘Big’ because: –  the internet made a lot of data available very quickly (often for free) •  It got interesting because: –  Knowledge yields real benefits to the bottom line –  Reduced costs or increased sales •  You the consumer benefit: –  Cheaper goods, available online –  Flights on time, trains on time, deliveries on time
  • 5. Big Data “The Revolution that will change the world we live in” •  Principles of Big Data –  Use ALL of the Data •  however noisy –  Analyse in an unbiased way –  “DO WHAT” it tells you •  Do Not Worry About “WHY” –  KEEP everything •  ‘you never know what question you want to ask’
  • 6. The 4 Vs •  Picture from Google or someone •  What does it mean? •  Mostly it is about using lots of computers. Most issues are sorted out by more CPUs, more drive space, and better stats
  • 7. It's actually been around quite a while… •  It was genius to break the codes •  Further genius in collating the data and reducing it so that analysts could use it in a timely manner (volume / velocity / veracity) •  …saved many, many lives on both sides
  • 9. What do Nappies and Beer have in common? •  Analysis of shopping habits found these two things were bought together •  Put them close together in the store and sell more
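The nappies-and-beer finding comes from counting which items turn up in the same basket. A minimal sketch of that idea follows; the baskets are invented purely for illustration, and `pair_counts` and `lift` are hypothetical helper names, not anything taken from the talk. Lift compares how often a pair is bought together with how often that would happen if the two purchases were independent (a value above 1 suggests an association).

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets, invented for illustration only.
baskets = [
    {"nappies", "beer", "crisps"},
    {"nappies", "beer"},
    {"bread", "milk"},
    {"nappies", "beer", "milk"},
    {"bread", "beer"},
]


def pair_counts(baskets):
    """Count how many baskets contain each unordered pair of items."""
    counts = Counter()
    for basket in baskets:
        # Sorting gives every pair a single canonical ordering.
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return counts


def lift(pair, baskets):
    """Observed co-purchase rate divided by the rate expected under independence."""
    a, b = pair
    n = len(baskets)
    p_a = sum(a in basket for basket in baskets) / n
    p_b = sum(b in basket for basket in baskets) / n
    p_ab = sum(a in basket and b in basket for basket in baskets) / n
    return p_ab / (p_a * p_b)


counts = pair_counts(baskets)
top_pair, top_count = counts.most_common(1)[0]
print(top_pair, top_count, round(lift(top_pair, baskets), 2))
```

On a real till dataset the same counting would run over millions of baskets, which is where the "too large to do by hand" part comes in.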
  • 10. UPS delivery service •  Fitted sensors to all delivery trucks and gathered data •  Analysed data to detect early engine issues BEFORE breakdown •  Therefore FIX early and keep the van on the road •  The customer benefits because: •  Deliveries on-time •  Even larger dataset – high degree of prediction of delivery times
  • 11. Jet Engines – reliable service •  Sensors on jet engines – monitored in flight •  Similar to UPS •  Therefore FIX early and keep the planes in the air •  The customer benefits because: •  Flights on time and reliable
  • 12. Google translate The Unreasonable Effectiveness of Data “Because of a huge shared cognitive and cultural context, linguistic expression can be highly ambiguous and still often be understood correctly.” •  https://en.wikipedia.org/wiki/File:Google_Translate_Icon.png •  https://en.wikipedia.org/wiki/Google_Translate •  https://www.youtube.com/watch?v=yvDCzhbjYWs •  University of British Columbia Distinguished Lecture Series - Sept 23rd 2011 Groups or pairs of words associated together on websites around the internet. Statistical analysis of frequency of pairing. Therefore this word (or group) probably translates into this word
  • 13. What about science? We need to be accurate (don’t we?) •  Large Hadron Collider shows how we can gather a lot of data very accurately •  A large amount of data is needed to reduce the errors – very very big data
  • 14. The Life Science industry has woken up to Big Data •  Human Genome •  Biological systems •  Kinome •  Metabolomics •  Proteomics •  3D structural information (CDC / Protein Data Bank) •  Literature and Patents (GVK Bio, ChEMBL, Pubmed, PubChem) •  Reaction informatics – what works, what doesn’t •  Document management •  Regulatory submissions Huge Opportunity in this area
  • 15. What about life sciences? •  Harder and harder to discover drugs. •  They have to work •  They have to be safe •  People want them cheaply •  A description of the drug research and development process
  • 16. The Truly Staggering Cost Of Inventing New Drugs (Matthew Herper - Forbes)
      Company | Ticker | Number of drugs approved | R&D Spending Per Drug ($Mil) | Total R&D Spending 1997-2011 ($Mil)
      AstraZeneca | AZN | 5 | 11,790.93 | 58,955
      GlaxoSmithKline | GSK | 10 | 8,170.81 | 81,708
      Sanofi | SNY | 8 | 7,909.26 | 63,274
      Pfizer Inc. | PFE | 14 | 7,727.03 | 108,178
      Roche Holding AG | RHHBY | 11 | 7,803.77 | 85,841
      Johnson & Johnson | JNJ | 15 | 5,885.65 | 88,285
      Eli Lilly & Co. | LLY | 11 | 4,577.04 | 50,347
      Abbott Laboratories | ABT | 8 | 4,496.21 | 35,970
      Merck & Co Inc | MRK | 16 | 4,209.99 | 67,360
      Bristol-Myers Squibb Co. | BMY | 11 | 4,152.26 | 45,675
      Novartis AG | NVS | 21 | 3,983.13 | 83,646
      Amgen Inc. | AMGN | 9 | 3,692.14 | 33,229
      Sources: InnoThink Center For Research In Biomedical Innovation; Thomson Reuters Fundamentals via FactSet Research Systems
      Drug failures later in development are mainly due to EFFICACY and SAFETY
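The per-drug column in the slide 16 table appears to be simply total R&D spend divided by the number of approvals. The short check below is a sketch under that assumption, using three rows copied from the slide; `spend_per_drug` is a hypothetical helper name. The results agree with the slide's figures to within rounding of the totals.

```python
# (total R&D spend 1997-2011 in $ million, number of drugs approved),
# copied from three rows of the slide 16 table.
rd_data = {
    "AstraZeneca": (58_955, 5),
    "Pfizer Inc.": (108_178, 14),
    "Novartis AG": (83_646, 21),
}


def spend_per_drug(total_musd, approvals):
    """Average R&D spend per approved drug, in $ million."""
    return total_musd / approvals


per_drug = {
    company: round(spend_per_drug(total, approvals), 2)
    for company, (total, approvals) in rd_data.items()
}

for company, value in per_drug.items():
    print(f"{company}: ${value:,.2f} million per approved drug")
```

Because failed projects are included in the total, this average is a cost per success, which is why attrition matters so much to the figure.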
  • 17.
  • 18. Actual spending – all LO projects are biggest spend. Paul, S. M. et al, How to improve R&D productivity: the pharmaceutical industry’s grand challenge, Nat. Rev. Drug Discovery 2010, 9, 203. Snapshot of a medium-sized company's R&D spend in one year - $1.7 billion. For a period large pharma set targets at each stage of the process – an attrition model - unsuccessful and very wasteful. Better chemistry. Reduce the number of projects. Chemistry influences success and speed. Methods that really work, new formulations
  • 19. What Causes Attrition in Development? PK 7% Lack of efficacy in man 46% Adverse effects in man 17% Animal toxicity 16% Commercial reasons 7% Miscellaneous 7% Many compounds fail in development through inadequate pharmacokinetics / bioavailability and unacceptable toxicological profiles in addition to lack of efficacy in man
  • 20. Oral Dosing of Drugs [Diagram labels: liver; kidneys; bladder; BBB (Blood Brain Barrier); Target (maybe in the brain); Dissolve; Survive pH range 1.5-8; Cross Membranes; Metabolism; Avoid Excretion] Absorption Distribution Metabolism Excretion Toxicity
  • 21. Better medicinal chemistry by sharing knowledge not data & structures [Diagram labels: AZ, Roche, Genentech, Pharma 4 and Pharma 5, each with its own Database, Data rule finder, Grand Rule Database and Exploitation; MedChemica Grand Rule Database at the centre] >500 million pairs from companies + 12 million from public data
  • 22. …so what are you going to make next…?
  • 23. Who is GOOD at Big Data? The people making the money! Chemical transform to improve metabolism. Chemists who wanted to fix metabolism also made these… SaltTraX© - contact@medchemica.com support@elixirsoftware.co.uk
  • 24. What about clinical safety? SAFE DRUGS. ‘Potency’: Do not sacrifice. The better it is the lower the dose. Improved testing in-vivo with fewer animals. Clinical linkage to protein target. Can test In-Vivo. Anti SAR e.g. hERG, Nav1.5, 5-HT2a… Analysis of In-Vivo data: Pfizer – rat data, <0.2mg/Kg Dose. Metabolism & Pharmacokinetics: Better design so dose is lower. Grand Rule Database. Hughes et al, Bioorg Med Chem Lett. 2008, 18(17), 4872
  • 26. The ‘Internet of Things (IoT)’ A higher diversity of devices connected to the internet, with data flowing to and from them. For example, Smart Watches: lifestyle device, marketed on selling fitness / wellness. Like UPS vans and RR jet engines, can we detect the illness pre-symptomatically?
  • 27. Big Data – ‘What is that all about?’ •  Introduction to Big Data –  Big enough to need a computer / advanced stats •  Examples from History –  Bletchley Park, UPS, Beer and Nappies… •  Big Data and science –  Hadron Collider… •  MedChemica – Advancing drug design through actionable knowledge –  Allows sharing of knowledge to accelerate and reduce costs of finding new, safe medicines