SlideShare a Scribd company logo
1 of 11
Download to read offline
TITLE OF PRESENTATION |
Presented By
Date
Timothy Hoctor, VP Professional Services
October 13, 2015
Introduction to Elsevier Professional Services &
Strategic Vision
TITLE OF PRESENTATION |
• Increase R&D productivity ‐ Support
research by linking R&D data across
development spectrum (discovery,
preclinical, clinical and patient
outcome)
• Increase return on information –
Enhanced search and visualization,
from “query‐to‐action”
• Define potential data standards
• Reduce cost of IT support by
implementing cloud technology and
extensive APIs that allows
customization with internal and
external sources
2
Our Professional Services team leverages greater Elsevier capabilities to
provide customized and optimized data management & analysis solutions
• Customer CBI’s
(customer-provided)
• Lack of standards (for data and
metadata) and discipline (e.g. data
curation)
• Internal infrastructure is not designed
for the management, curation and
analysis of modern experimental data
• Difficulty to integrate CUSTOMER
data with public domain data in a
systematic way
• An acute shortage of people with
informatics and domain area (biology,
data mining)
• Integration is difficult even if we find
the data
TITLE OF PRESENTATION |
Strategic objective to become leading collaborator in R&D data
management
3
Data &
Process
Mapping
• Mapping of
External
Information
Chain against
business process
Gap Analysis
Data
Management
&
Stewardship
Setup
Data
Governance
and
Continuous
Data Quality
Improvement
Data
Normalization
and Long Term
Data Strategy
Where
we are
focusing
• Consulting
service on key
decision info gap
• Data linking
service and API
pilot to improve
decision making
• Data Management
Strategy
• Data warehouse
structure
• Taxonomy
integration and
implementation
• Data harmonization
• Collaborate with
business, technical
& project stewards
to design, develop,
standardize data
structure
• Instituting Master
Data management
• Be your external
data steward for
life science
community
• Data lifecycle
analytics and
management
service
End to end service with demonstrated R&D data harmonization, taxonomy
development, and life science information management expertise
Elsevier Life
Science
Professional
Services
TITLE OF PRESENTATION | 4
Professional Services takes siloed R&D data processes…
Data
Capture
Data
Storage
Data
Analysis
Data
Intelligence
…and transforms them into an integrated data management solution that
increases productivity and accelerates discovery.
Data
Capture
Data Organization
& Normalization
Integrated Data
Management
• Collective silo
• Aligned format
• Unlimited use potential
Data Analysis &
Intelligence
ELNs
Medicinal
Chemistry
Clinical
Trials
TITLE OF PRESENTATION |
Our Key Capabilities
• Access to life science data and
content with proven data curation,
taxonomy expertise, and semantic
backbone across R&D and post
market launch
• Demonstrated R&D data integration
expertise
• Existing life science portfolio of
solutions across development
spectrum
• Global footprint in life science sector
5
Data Science and Translational R&D
• Integrated R&D data management with a
clear data stewardship strategy
• Broader utilization of available data
• Harmonization of internal, public and 3rd
party data to generate new scientific
insights and better business decisions
• Infrastructure to support handling of big data
and collaboration platform
TITLE OF PRESENTATION |
Case Study: Chemistry Data Management
Integration into Elsevier Chemistry platform: Roche,
Novartis
• Key Value Drivers:
• Increase Discoverability
• Efficiency: Decrease cost and maintenance
• High value in seeing failed reactions
• Don’t repeat them → proven savings!
• High value in cross-fertilization of Process Chem/Med
Chem
• Take advantage of designed Process Chem
experiments
• Improved patent filings time/content
• Make better use of resources → better chemistry
• Roche analyzed the time saved per scientist,
multiplied by wages and number of scientists
• ROI: Payback for the project cost was less than 1 year
6
From Presentation at ACS 245th National Meeting by
Michael Kapler, Roche Pharma Research & Early Development
Integration with Customer Data Ecosystem: Merck
• Key Value Drivers:
• Efficiency: Decrease time and minimize
interface support
• Provide tailored workflows and broader use
cases
• Reduce discovery time and eliminate manual
curation
• Derive answers “from days to seconds”
• Eliminate lag in data currency
• Provide integrated content ‘dashboards’ to
suit multiple use cases alongside vertical
applications
• Support agile data framework
• Leverage experience to build on-demand
analysis tools (pre-defined query set against
normalized data)
From Presentation at BIOIT 2015 by
Huijun Wang, Merck R&D Chemistry
TITLE OF PRESENTATION |
A bit more detail: Merck
7
TITLE OF PRESENTATION |
Merck: Creating a data dashboard
8
TITLE OF PRESENTATION |
MD Anderson
What did we do:
Provided a structured hosted biological data environment and
supporting services to normalize and analyze and compare
experimental data with data from across corpus of published
full text.
How we worked together:
• Beginning with extensive scoping and dedicated project
co-resourcing, through full documented use cases and
workflows and comparative analysis of semantic engine.
Environment(s) were tailored to specific needs and
populated with specifically defined with custom data
cartridges and taxonomies.
What was the result:
• MD Anderson system allowed for comparative analysis
against corpus of data from all major pathway analysis
tools deriving data from published literature, and then
further analysis against lab-generated internal data. Key
Benefit: Novel potential therapy and multiple unexplored
targets identified:
9
Case Studies: Integrated Biology Data Management
DARPA Big Data Mechanism for Cancer Research
What did we do:
Provided a structured hosted biological data environment to
establish gold standard taxonomy and pattern recognition in
biological target validation for oncology
How we worked together:
• Worked in collaboration with academic partner (Carnegie
Mellon) to scope build and deploy enterprise platform for
comparative analytics and custom extensible oncology
data cartridge. Using iterative review for all project
participants to identify and incorporate best-in-class entity
and pattern recognition and gold standard data taxonomy
What was the result:
• DARPA system has been extended to more than 30 US
research groups for comparative analysis.
• Where explicit patterns have been identified outside the
system, these have been written into the system to
improve recall and causal reasoning.
• Taxonomy has been expanded to reflect new learnings
and improve relationship identification.
• DARPA has determined Elsevier’s system to be gold
standard for cancer pathway analytics.
From Presentation at BIOIT 2014 by
Phil Lorenzi, UT MD Anderson Cancer Center
TITLE OF PRESENTATION |
Executive Summary:
Elsevier performed research using Elsevier tools and data, public data source and
open source tools, to provide CUSTOMER with answers to specific research
questions, and presented back the research methodology. The following slides
specify questions posed by CUSTOMER to Elsevier for our collaboration
• Elsevier used our body of relationships mined from our large database of full text
using linguistic analysis to find subject-verb-object relationships in the text of the
articles
• The entire corpus of documents was mined to create a large database of
relationships that can be searched using simple or advanced search languages
• For all searches our extensive taxonomies and synonyms were used to normalize
terms and verbs for identifying relationships regardless of the exact words the
author used.
• Our team created custom taxonomies for protein purification methods as part of the
project
Project: Using Elsevier tools, data, and capabilities to address
research problems:
TITLE OF PRESENTATION |
• Method
• Searched for stated relationships between any biological element and Sjögren’s
syndrome.
• Used taxonomy of classifications to group the relationships into categories
• Findings
• Created ‘mind map’ of relationships of Sjögren’s syndrome to diseases, small-molecule
treatments, receptors, transcription factors, complexes, proteins
• Found leading researchers and institutions studying this disease
• Created collaboration maps showing collaborative studies, including pharma-academic,
an pharma-biotech collaborations.
11
What is known about Sjögren’s syndrome? (e.g. Cell types,
pathways, highest confidence associated genes, etc…)
Question:

More Related Content

What's hot

BIOLINK 2008: Linking database submissions to primary citations with PubMe...
BIOLINK 2008:    Linking database submissions to primary citations with PubMe...BIOLINK 2008:    Linking database submissions to primary citations with PubMe...
BIOLINK 2008: Linking database submissions to primary citations with PubMe...
Heather Piwowar
 
Best practices data collection
Best practices data collectionBest practices data collection
Best practices data collection
Sherry Lake
 
II-SDV 2012 Text Mining, Term Mining and Visualization - Improving the Impac...
II-SDV 2012 Text Mining, Term Mining and Visualization  - Improving the Impac...II-SDV 2012 Text Mining, Term Mining and Visualization  - Improving the Impac...
II-SDV 2012 Text Mining, Term Mining and Visualization - Improving the Impac...
Dr. Haxel Consult
 
A FAIR Data Sharing Framework for Large-Scale Human Cancer Proteogenomics
A FAIR Data Sharing Framework for Large-Scale Human Cancer ProteogenomicsA FAIR Data Sharing Framework for Large-Scale Human Cancer Proteogenomics
A FAIR Data Sharing Framework for Large-Scale Human Cancer Proteogenomics
Brett Tully
 
Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...
Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...
Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...
ASIS&T
 

What's hot (20)

Digital transformation of translational medicine
Digital transformation of translational medicineDigital transformation of translational medicine
Digital transformation of translational medicine
 
BIOLINK 2008: Linking database submissions to primary citations with PubMe...
BIOLINK 2008:    Linking database submissions to primary citations with PubMe...BIOLINK 2008:    Linking database submissions to primary citations with PubMe...
BIOLINK 2008: Linking database submissions to primary citations with PubMe...
 
Best practices data collection
Best practices data collectionBest practices data collection
Best practices data collection
 
II-SDV 2012 Text Mining, Term Mining and Visualization - Improving the Impac...
II-SDV 2012 Text Mining, Term Mining and Visualization  - Improving the Impac...II-SDV 2012 Text Mining, Term Mining and Visualization  - Improving the Impac...
II-SDV 2012 Text Mining, Term Mining and Visualization - Improving the Impac...
 
Fair by design
Fair by designFair by design
Fair by design
 
Digital webinar master deck final
Digital webinar master deck finalDigital webinar master deck final
Digital webinar master deck final
 
Open Access as a Means to Produce High Quality Data
Open Access as a Means to Produce High Quality DataOpen Access as a Means to Produce High Quality Data
Open Access as a Means to Produce High Quality Data
 
Expert Panel on Data Challenges in Translational Research
Expert Panel on Data Challenges in Translational ResearchExpert Panel on Data Challenges in Translational Research
Expert Panel on Data Challenges in Translational Research
 
Why should researchers care about data curation?
Why should researchers care about data curation?Why should researchers care about data curation?
Why should researchers care about data curation?
 
Sharing and standards christopher hart - clinical innovation and partnering...
Sharing and standards   christopher hart - clinical innovation and partnering...Sharing and standards   christopher hart - clinical innovation and partnering...
Sharing and standards christopher hart - clinical innovation and partnering...
 
effective data sharing for a learning healthcare system
effective data sharing for a learning healthcare systemeffective data sharing for a learning healthcare system
effective data sharing for a learning healthcare system
 
SciBite
SciBiteSciBite
SciBite
 
A FAIR Data Sharing Framework for Large-Scale Human Cancer Proteogenomics
A FAIR Data Sharing Framework for Large-Scale Human Cancer ProteogenomicsA FAIR Data Sharing Framework for Large-Scale Human Cancer Proteogenomics
A FAIR Data Sharing Framework for Large-Scale Human Cancer Proteogenomics
 
Data sharing as part of the research workflow
Data sharing as part of the research workflowData sharing as part of the research workflow
Data sharing as part of the research workflow
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...
Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...
Lightning Talk, Coates: Clinical Data Management strategies: How can they imp...
 
Teaching data management in a lab environment (IASSIST 2014)
Teaching data management in a lab environment (IASSIST 2014)Teaching data management in a lab environment (IASSIST 2014)
Teaching data management in a lab environment (IASSIST 2014)
 
Thakur Interim Research Products
Thakur Interim Research ProductsThakur Interim Research Products
Thakur Interim Research Products
 
Inglis Preprints in Biology and Medicine
Inglis Preprints in Biology and MedicineInglis Preprints in Biology and Medicine
Inglis Preprints in Biology and Medicine
 
Enhancing Our Capacity for Large Health Dataset Analysis
Enhancing Our Capacity for Large Health Dataset AnalysisEnhancing Our Capacity for Large Health Dataset Analysis
Enhancing Our Capacity for Large Health Dataset Analysis
 

Similar to ELSS use cases and strategy

Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
William Gunn
 

Similar to ELSS use cases and strategy (20)

Toward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data EcosystemToward a FAIR Biomedical Data Ecosystem
Toward a FAIR Biomedical Data Ecosystem
 
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
 
Effective research data management
Effective research data managementEffective research data management
Effective research data management
 
Designing and launching the Clinical Reference Library
Designing and launching the Clinical Reference LibraryDesigning and launching the Clinical Reference Library
Designing and launching the Clinical Reference Library
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?
 
Assessing and Reporting Research Impact – A Role for the Library - Kristi L....
Assessing and Reporting Research Impact – A Role for the Library  - Kristi L....Assessing and Reporting Research Impact – A Role for the Library  - Kristi L....
Assessing and Reporting Research Impact – A Role for the Library - Kristi L....
 
Assessing and Reporting Research Impact – A Role for the Library - Kristi L....
Assessing and Reporting Research Impact – A Role for the Library  - Kristi L....Assessing and Reporting Research Impact – A Role for the Library  - Kristi L....
Assessing and Reporting Research Impact – A Role for the Library - Kristi L....
 
Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing
 
ROER4D Open Data Initiative
ROER4D Open Data InitiativeROER4D Open Data Initiative
ROER4D Open Data Initiative
 
David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access
 
NIH Data Science Special Interest Group
NIH Data Science Special Interest GroupNIH Data Science Special Interest Group
NIH Data Science Special Interest Group
 
Fair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevortFair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevort
 
2018 Bio-IT World Agile in Wet Labs Speeds Big Data
2018 Bio-IT World Agile in Wet Labs Speeds Big Data2018 Bio-IT World Agile in Wet Labs Speeds Big Data
2018 Bio-IT World Agile in Wet Labs Speeds Big Data
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data Management
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
 
The Simulacrum, a Synthetic Cancer Dataset
The Simulacrum, a Synthetic Cancer DatasetThe Simulacrum, a Synthetic Cancer Dataset
The Simulacrum, a Synthetic Cancer Dataset
 
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge ManagementCurlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
 
Data Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingData Virtualization Modernizes Biobanking
Data Virtualization Modernizes Biobanking
 

More from Anton Yuryev

Pathway analysis for personalized oncology
Pathway analysis for personalized oncologyPathway analysis for personalized oncology
Pathway analysis for personalized oncology
Anton Yuryev
 

More from Anton Yuryev (20)

Knowledge graph applications for cosmetics industry
Knowledge graph applications for cosmetics industryKnowledge graph applications for cosmetics industry
Knowledge graph applications for cosmetics industry
 
Five drug development strategies to combat SARS-CoV2
Five drug development strategies to combat SARS-CoV2Five drug development strategies to combat SARS-CoV2
Five drug development strategies to combat SARS-CoV2
 
Drugs predicted to bind #COVID19 proteins by computational docking
Drugs predicted to bind #COVID19 proteins by computational dockingDrugs predicted to bind #COVID19 proteins by computational docking
Drugs predicted to bind #COVID19 proteins by computational docking
 
Genetic variations linked to Acute Respiratory Distress syndrome
Genetic variations linked to Acute Respiratory Distress syndromeGenetic variations linked to Acute Respiratory Distress syndrome
Genetic variations linked to Acute Respiratory Distress syndrome
 
AAK1 GAK inhibitors for anti-COVID19 therapy
AAK1 GAK inhibitors for anti-COVID19 therapyAAK1 GAK inhibitors for anti-COVID19 therapy
AAK1 GAK inhibitors for anti-COVID19 therapy
 
Drug re-positioning for tuberculosis infection in HIV/ADS patients
Drug re-positioning for tuberculosis infection in HIV/ADS patients Drug re-positioning for tuberculosis infection in HIV/ADS patients
Drug re-positioning for tuberculosis infection in HIV/ADS patients
 
Patient microbiome analysis using Elsevier text mining
Patient microbiome analysis using Elsevier text miningPatient microbiome analysis using Elsevier text mining
Patient microbiome analysis using Elsevier text mining
 
Gallbladder cancer patient analysis
Gallbladder cancer patient analysisGallbladder cancer patient analysis
Gallbladder cancer patient analysis
 
OMICs data analysis using Pathway Studio
OMICs data analysis using Pathway StudioOMICs data analysis using Pathway Studio
OMICs data analysis using Pathway Studio
 
Analysis of gene expression microarray data of patients with Spinal Muscular ...
Analysis of gene expression microarray data of patients with Spinal Muscular ...Analysis of gene expression microarray data of patients with Spinal Muscular ...
Analysis of gene expression microarray data of patients with Spinal Muscular ...
 
Humira repositioning report
Humira repositioning reportHumira repositioning report
Humira repositioning report
 
Pathway Studio v.12 Release Notes
Pathway Studio v.12 Release NotesPathway Studio v.12 Release Notes
Pathway Studio v.12 Release Notes
 
Analysis of Functional Magnetic Resonance Imaging (fMRI) data from human brai...
Analysis of Functional Magnetic Resonance Imaging (fMRI) data from human brai...Analysis of Functional Magnetic Resonance Imaging (fMRI) data from human brai...
Analysis of Functional Magnetic Resonance Imaging (fMRI) data from human brai...
 
Drug repositioning for hepatocellular carcinoma
Drug repositioning for hepatocellular carcinomaDrug repositioning for hepatocellular carcinoma
Drug repositioning for hepatocellular carcinoma
 
Elsevier helps malaria research with comprehensive Plasmodium biology database
Elsevier helps malaria research with comprehensive Plasmodium biology databaseElsevier helps malaria research with comprehensive Plasmodium biology database
Elsevier helps malaria research with comprehensive Plasmodium biology database
 
Personalized Medicine: Matching cancer drugs with mechanism (AAPS webinar)
Personalized Medicine: Matching cancer drugs with mechanism (AAPS webinar)Personalized Medicine: Matching cancer drugs with mechanism (AAPS webinar)
Personalized Medicine: Matching cancer drugs with mechanism (AAPS webinar)
 
Presentation at Rare Disease conference in San-Antonio
Presentation at Rare Disease conference in San-AntonioPresentation at Rare Disease conference in San-Antonio
Presentation at Rare Disease conference in San-Antonio
 
Profiling how Immune inhibitors Secreted by Melanoma affect NK & other immune...
Profiling how Immune inhibitors Secreted by Melanoma affect NK & other immune...Profiling how Immune inhibitors Secreted by Melanoma affect NK & other immune...
Profiling how Immune inhibitors Secreted by Melanoma affect NK & other immune...
 
Profiling how Immune inhibitors Secreted by Melanoma affect NK & other immune...
Profiling how Immune inhibitors Secreted by Melanoma affect NK & other immune...Profiling how Immune inhibitors Secreted by Melanoma affect NK & other immune...
Profiling how Immune inhibitors Secreted by Melanoma affect NK & other immune...
 
Pathway analysis for personalized oncology
Pathway analysis for personalized oncologyPathway analysis for personalized oncology
Pathway analysis for personalized oncology
 

ELSS use cases and strategy

  • 1. TITLE OF PRESENTATION | Presented By Date Timothy Hoctor, VP Professional Services October 13, 2015 Introduction to Elsevier Professional Services & Strategic Vision
  • 2. TITLE OF PRESENTATION | • Increase R&D productivity ‐ Support research by linking R&D data across development spectrum (discovery, preclinical, clinical and patient outcome) • Increase return on information – Enhanced search and visualization, from “query‐to‐action” • Define potential data standards • Reduce cost of IT support by implementing cloud technology and extensive APIs that allows customization with internal and external sources 2 Our Professional Services team leverages greater Elsevier capabilities to provide customized and optimized data management & analysis solutions • Customer CBI’s (customer-provided) • Lack of standards (for data and metadata) and discipline (e.g. data curation) • Internal infrastructure is not designed for the management, curation and analysis of modern experimental data • Difficulty to integrate CUSTOMER data with public domain data in a systematic way • An acute shortage of people with informatics and domain area (biology, data mining) • Integration is difficult even if we find the data
  • 3. TITLE OF PRESENTATION | Strategic objective to become leading collaborator in R&D data management 3 Data & Process Mapping • Mapping of External Information Chain against business process Gap Analysis Data Management & Stewardship Setup Data Governance and Continuous Data Quality Improvement Data Normalization and Long Term Data Strategy Where we are focusing • Consulting service on key decision info gap • Data linking service and API pilot to improve decision making • Data Management Strategy • Data warehouse structure • Taxonomy integration and implementation • Data harmonization • Collaborate with business, technical & project stewards to design, develop, standardize data structure • Instituting Master Data management • Be your external data steward for life science community • Data lifecycle analytics and management service End to end service with demonstrated R&D data harmonization, taxonomy development, and life science information management expertise Elsevier Life Science Professional Services
  • 4. TITLE OF PRESENTATION | 4 Professional Services takes siloed R&D data processes… Data Capture Data Storage Data Analysis Data Intelligence …and transforms them into an integrated data management solution that increases productivity and accelerates discovery. Data Capture Data Organization & Normalization Integrated Data Management • Collective silo • Aligned format • Unlimited use potential Data Analysis & Intelligence ELNs Medicinal Chemistry Clinical Trials
  • 5. TITLE OF PRESENTATION | Our Key Capabilities • Access to life science data and content with proven data curation, taxonomy expertise, and semantic backbone across R&D and post market launch • Demonstrated R&D data integration expertise • Existing life science portfolio of solutions across development spectrum • Global footprint in life science sector 5 Data Science and Translational R&D • Integrated R&D data management with a clear data stewardship strategy • Broader utilization of available data • Harmonization of internal, public and 3rd party data to generate new scientific insights and better business decisions • Infrastructure to support handling of big data and collaboration platform
  • 6. TITLE OF PRESENTATION | Case Study: Chemistry Data Management Integration into Elsevier Chemistry platform: Roche, Novartis • Key Value Drivers: • Increase Discoverability • Efficiency: Decrease cost and maintenance • High value in seeing failed reactions • Don’t repeat them → proven savings! • High value in cross-fertilization of Process Chem/Med Chem • Take advantage of designed Process Chem experiments • Improved patent filings time/content • Make better use of resources → better chemistry • Roche analyzed the time saved per scientist, multiplied by wages and number of scientists • ROI: Payback for the project cost was less than 1 year 6 From Presentation at ACS 245th National Meeting by Michael Kapler, Roche Pharma Research & Early Development Integration with Customer Data Ecosystem: Merck • Key Value Drivers: • Efficiency: Decrease time and minimize interface support • Provide tailored workflows and broader use cases • Reduce discovery time and eliminate manual curation • Derive answers “from days to seconds” • Eliminate lag in data currency • Provide integrated content ‘dashboards’ to suit multiple use cases alongside vertical applications • Support agile data framework • Leverage experience to build on-demand analysis tools (pre-defined query set against normalized data) From Presentation at BIOIT 2015 by Huijun Wang, Merck R&D Chemistry
  • 7. TITLE OF PRESENTATION | A bit more detail: Merck 7
  • 8. TITLE OF PRESENTATION | Merck: Creating a data dashboard 8
  • 9. TITLE OF PRESENTATION | MD Anderson What did we do: Provided a structured hosted biological data environment and supporting services to normalize and analyze and compare experimental data with data from across corpus of published full text. How we worked together: • Beginning with extensive scoping and dedicated project co-resourcing, through full documented use cases and workflows and comparative analysis of semantic engine. Environment(s) were tailored to specific needs and populated with specifically defined with custom data cartridges and taxonomies. What was the result: • MD Anderson system allowed for comparative analysis against corpus of data from all major pathway analysis tools deriving data from published literature, and then further analysis against lab-generated internal data. Key Benefit: Novel potential therapy and multiple unexplored targets identified: 9 Case Studies: Integrated Biology Data Management DARPA Big Data Mechanism for Cancer Research What did we do: Provided a structured hosted biological data environment to establish gold standard taxonomy and pattern recognition in biological target validation for oncology How we worked together: • Worked in collaboration with academic partner (Carnegie Mellon) to scope build and deploy enterprise platform for comparative analytics and custom extensible oncology data cartridge. Using iterative review for all project participants to identify and incorporate best-in-class entity and pattern recognition and gold standard data taxonomy What was the result: • DARPA system has been extended to more than 30 US research groups for comparative analysis. • Where explicit patterns have been identified outside the system, these have been written into the system to improve recall and causal reasoning. • Taxonomy has been expanded to reflect new learnings and improve relationship identification. • DARPA has determined Elsevier’s system to be gold standard for cancer pathway analytics. From Presentation at BIOIT 2014 by Phil Lorenzi, UT MD Anderson Cancer Center
  • 10. TITLE OF PRESENTATION | Executive Summary: Elsevier performed research using Elsevier tools and data, public data source and open source tools, to provide CUSTOMER with answers to specific research questions, and presented back the research methodology. The following slides specify questions posed by CUSTOMER to Elsevier for our collaboration • Elsevier used our body of relationships mined from our large database of full text using linguistic analysis to find subject-verb-object relationships in the text of the articles • The entire corpus of documents was mined to create a large database of relationships that can be searched using simple or advanced search languages • For all searches our extensive taxonomies and synonyms were used to normalize terms and verbs for identifying relationships regardless of the exact words the author used. • Our team created custom taxonomies for protein purification methods as part of the project Project: Using Elsevier tools, data, and capabilities to address research problems:
  • 11. TITLE OF PRESENTATION | • Method • Searched for stated relationships between any biological element and Sjögren’s syndrome. • Used taxonomy of classifications to group the relationships into categories • Findings • Created ‘mind map’ of relationships of Sjögren’s syndrome to diseases, small-molecule treatments, receptors, transcription factors, complexes, proteins • Found leading researchers and institutions studying this disease • Created collaboration maps showing collaborative studies, including pharma-academic, an pharma-biotech collaborations. 11 What is known about Sjögren’s syndrome? (e.g. Cell types, pathways, highest confidence associated genes, etc…) Question: