Presentation on the relevance of big data analytics to scholarly publishers, given at the annual AAP/PSP conference. It focuses on the "product side" of big data, examines how advances in new models for evaluating medical evidence will affect medical publishers, and offers recommendations on how to prepare for new developments in data-driven, evidence-based medicine.
1. BIG DATA:
HELPING SCHOLARLY PUBLISHERS
CUT THROUGH THE HYPE
Janice McCallum
Health Content Advisors
Association of American Publishers
2013 PSP Annual Conference, Washington, DC
February 6-8, 2013
2. Focus for this talk
Is Big Data Overhyped?
Let’s start with some definitions
Relevance to scholarly publishers
What should you be doing to profit from--and avoid being
disrupted by--Big Data upstarts?
5. Other Headline Grabbers from Retail
and Financial Services
Analyzing customer behavior to predict purchasing or payment
patterns:
• Target recognizing likely pregnancy of shopper
• Credit card companies knowing when you’re likely to be late with
payments
• Gathering real-time behavior data (EyeSee mannequins)
6. There is More to Big Data than Watson… Is It All Hype?
Comprehensive List of Big Data Statistics
(“Comprehensive” should be in quotes; sources and magnitudes are constantly changing.)
7. What’s the Big Deal with Big Data?
Growth in computing power/reduction in cost: Moore’s Law.
• Why Big Data is So Big
New Sources of Data
• Mobile devices and sensor data hugely expand the amount of data generated.
• Social media: Facebook and Twitter
But keep in mind:
What Executives Don’t Understand About Big Data
• Schrage: The question shouldn’t be “how do we get more data?”; it should be “what marriage of data and algorithms” achieves our desired outcome.
The value of Big Data lies in unlocking patterns and insights that would not have been possible without the combination of computing power, tools, and data.
Alternative definition for Big Data:
Advanced Analytics for Complex Problem Solving
9. The 4 Vs of Big Data
• Volume: Large datasets, too big for standard enterprise database applications.
• Variety: Combining structured and “unstructured” data sources. Think “union” of different sources, not big vs. small or structured vs. unstructured.
• Velocity: Big data systems can integrate and process near real-time data from mobile devices and sensors.
• Veracity: Data management best practices still matter. It’s not about trading off size vs. quality; it’s about combining the best of both worlds.
Big Data is an umbrella term that can encompass infinite use cases. The ability to incorporate large, diverse data sources into an analytic model is paramount.
“Controlling data volume, velocity, and variety… to improve internal and external collaboration.”
--Doug Laney, 2001
10. RELEVANCE OF BIG DATA TO
SCHOLARLY PUBLISHERS
With Examples from Medical Publishing
11. Expectations of Researchers
Have Changed
Scientific and Medical Researchers Need:
Easier, faster access to data sets;
Ability to trace data provenance;
Central repositories or better discovery
options for data sets;
Business models for accessing, sharing,
and adding value to the base of
knowledge.
“Raw Data, Now!”
--Tim Berners-Lee, 2009
“[Biomedicine is] going to have to become more dynamic, more computational.”
--Stephen Wolfram, 2006
12. Expectations of Clinicians Have Changed
“…the problem is no longer getting access to
data, whether it's a genome sequence or
whether it's a glucose sensor, but how do you
process that data in an efficient way…”
--Eric Topol, 2012
13. Partial Set of Data Sources in Medical Research
• Clinical research
• Patient registries/outcomes data
• Rx data
• Sensor data/exercise tracking
• OTC & food purchases
• Disease registries
• Genomic data
Almost all medical research currently occurs on data types displayed above the fold.
14. Big Data Uses in Healthcare
Are you prepared to play in this fast-growing, fast-changing segment?
15. Getting Started in Big Data
First: recognize that you are all data publishers. If
the content is digital, it’s data.
Create standard formats for data sets that are
submitted with articles.
Plan for collaborating with Big Data analytics
companies.
Develop expertise in new, more complex models of
medical evidence.
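The second recommendation above, standard formats for data sets submitted with articles, can be made concrete with a small sketch. The record below is a hypothetical, minimal metadata format; every field name here is an illustrative assumption, not an existing standard.

```python
# A hypothetical, minimal metadata record for a data set submitted with an
# article. Field names are illustrative assumptions, not any real standard.
record = {
    "dataset_id": "ds-0001",
    "title": "Glucose sensor readings, trial cohort A",
    "article_doi": "10.1000/example.doi",   # placeholder DOI
    "format": "CSV",
    "license": "CC-BY-4.0",
    "provenance": ["collected 2012", "cleaned per protocol v2"],
}

# Fields a submission system might require before accepting a data set.
REQUIRED = {"dataset_id", "title", "article_doi", "format", "license"}

def is_valid(rec):
    """Check that a record carries every required field."""
    return REQUIRED.issubset(rec)

print(is_valid(record))            # True
print(is_valid({"title": "x"}))    # False: missing required fields
```

A standard like this is what makes data sets discoverable and traceable (provenance) across publishers, which is exactly what the researcher expectations on the previous slide call for.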
16. Accelerated Pace of Data Flows
Layers of the evidence base (top to bottom):
• Evidence base
• Software, models
• Analysis, insights
• Data sets, registries, directories
• Curated news, textual content, summaries
Big Data isn’t about structured vs. unstructured data. It’s about building upon the existing base of knowledge, with the ability to constantly update the evidence base with new data that either reinforce or replace currently accepted knowledge.
A strong foundation remains essential and requires multi-directional data flows.
17. What Role Will Your Organization Play in the Big Data Era? Some possibilities:
• Evidence base ← Disseminate latest evidence-based guidelines
• Software, models ← Provide software platform that incorporates latest algorithms and integrates data
• Analysis, insights ← Employ analysts, data scientists, researchers to conduct studies and report results
• Data sets, registries, directories ← Provide master data management services; become clearinghouse for data exchange
• Curated news, textual content, summaries ← Publish curated scientific research results
Need I say more? For this audience in particular, the NLP orientation of IBM Watson and its experiments in the healthcare field with WellPoint and Mt. Sinai Hospital in NY speak volumes.
Watson mines scholarly articles as its primary base of evidence and assigns probabilities to query results.
Note my use of the word “likely”. Predictions are based on probabilities.
Big Data systems are “learning systems”. See Hilary Mason’s (CTO of Bitly) talk on machine learning:
http://www.hilarymason.com/presentations-2/devs-love-bacon-everything-you-need-to-know-about-machine-learning-in-30-minutes-or-less/
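The “learning system” idea can be shown in miniature: an estimate that is refined incrementally as each new observation arrives, rather than recomputed from scratch. This is a deliberately minimal sketch of my own, not anything from Mason’s talk; real machine-learning systems are far richer.

```python
# A minimal "learning system": fold each new observation into a running
# estimate as it arrives. (Illustrative sketch only.)

def update(estimate, n, observation):
    """Incrementally fold one new observation into a running mean."""
    n += 1
    estimate += (observation - estimate) / n
    return estimate, n

estimate, n = 0.0, 0
# e.g. a stream of treatment outcomes: success (1) or failure (0)
for outcome in [1, 0, 1, 1]:
    estimate, n = update(estimate, n, outcome)

print(estimate)  # 0.75 -- the system's current probability of success
```

The point for publishers: the estimate is never “final”; each new data point reinforces or revises it, just as new evidence reinforces or replaces currently accepted medical knowledge.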
Let’s move on to some less dramatic material.
1) Growth in information – and availability of new sources of information from devices/sensors. Everything’s being digitized; everything digital can be tracked; mobile devices and sensors are connecting physical & digital worlds (and creating huge amounts of new data).
Terabytes, petabytes, exabytes, zettabytes, yottabytes. You’ve heard the terms and get the gist.
Couple of questions to the audience:
How many of you consider yourselves data publishers?
How many have any idea what Hadoop and MapReduce are?
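For those who haven’t met Hadoop and MapReduce: the core idea is that a “map” step emits key/value pairs, the framework groups the pairs by key, and a “reduce” step combines each group. The canonical word-count example can be sketched on a single machine; the function names below are illustrative, and real Hadoop distributes these same steps across a cluster.

```python
from collections import defaultdict

def map_phase(documents):
    """Map: emit (word, 1) for every word in every document."""
    for doc in documents:
        for word in doc.split():
            yield (word.lower(), 1)

def shuffle(pairs):
    """Shuffle: group values by key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(groups):
    """Reduce: combine each key's values into a final count."""
    return {word: sum(counts) for word, counts in groups.items()}

docs = ["big data is big", "data about data"]
counts = reduce_phase(shuffle(map_phase(docs)))
print(counts["big"])   # 2
print(counts["data"])  # 3
```

The same three-step shape scales from two toy strings to petabytes of text, which is why it became the workhorse pattern for large-scale analytics.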
Picture of the Sequoia supercomputer at Lawrence Livermore National Laboratory (DOE).
“We have gone through about 32 doublings in computer power” since the start of the computer age, Brynjolfsson points out. That is the first half of the chessboard. The second half, “the next decade of exponential growth, is going to be far more impactful. … The ability to take advantage of this explosion in computer power and storage capacity is going to be key to business success.”
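The chessboard metaphor is easy to verify with back-of-the-envelope arithmetic: 32 doublings is a growth factor of 2^32, and each further doubling adds as much capacity as all the previous doublings combined.

```python
# Back-of-the-envelope arithmetic for "32 doublings in computer power".
doublings = 32
growth_factor = 2 ** doublings
print(growth_factor)  # 4294967296: roughly a 4.3 billion-fold increase

# The "second half of the chessboard" effect: the 33rd doubling alone
# adds as much capacity as the first 32 doublings combined.
next_doubling_gain = 2 ** 33 - 2 ** 32
print(next_doubling_gain == growth_factor)  # True
```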
2) It’s not really about big datasets; it’s about the tools that have been developed to manage & mine very large datasets and integrate different types of data to extract information that wouldn’t have been possible when data were kept in silos.
3) Think “Advanced Analytics for Complex Problem Solving” as a better term for “Big Data”. The really big computational tools may be the latest and greatest, but it’s the union of all of the data management and analytics tools that we have that makes Big Data a Big Deal.
Before proceeding, let’s step back and define the term “big data”. The most common definition, which includes the 3 (or 4) Vs, has its origins in a report written by Gartner analyst Doug Laney (then at Meta Group).
Doug Laney’s original article describing the 3 “Vs” from 2001: http://blogs.gartner.com/doug-laney/deja-vvvue-others-claiming-gartners-volume-velocity-variety-construct-for-big-data/
It’s important to emphasize that size, or “volume”, isn’t the only defining characteristic of big data. In my opinion, the combination of volume and variety is a necessary condition for earning the “big data” moniker. The ability to handle high-velocity data can be important, depending on the use case.
Veracity, the term I’ve adopted from others to represent good data hygiene and data management practices, is of fundamental importance. Without quality data—and a good understanding of the limitations of your data—you won’t get good predictive capabilities.
Tim Berners-Lee talks frequently about the importance of raw data and linked data. Data in silos isn’t going to produce the progress we need.
Stephen Wolfram, the founder of Mathematica and well known for founding Wolfram|Alpha, the computational search engine, is now focusing on biomedicine and plans to introduce a medical version of Wolfram|Alpha.
Eric Topol, one of the leading voices for rapid change in medical research to facilitate much faster “bench to bedside” dissemination of new discoveries. Also, “personalized medicine” requires a big data model.
And, I haven’t even included payment data, which is used heavily now, because other sources, especially outcomes data, are not widely available. Nor did I include social media.
If you’re not going to become an analytics company, you are going to have to learn how to license your content (data) to analytics companies. What terms will be acceptable?
How many in the room are working with IBM Watson on a pilot basis? How many have a clear plan for commercial licensing terms for IBM Watson?
Wrapping up, this diagram illustrates the data and analysis that underlie the “evidence base” in medicine—in a simplified manner.
In current era, the bottom layer continues to be the dominant means for disseminating new discoveries.
In the future, more and more discoveries will enter the evidence base directly via machine learning. But, we have a ways to go before we have figured out the right data sources and algorithms so that we can safely apply Big Data to medicine.
Just some possibilities listed here. Let’s start at the bottom. I think you all understand the bottom role.
In my view, there will always be a role for specialty publishers to disseminate information about recent scientific discoveries to interested audiences.
However, I predict that the scientists involved in research will be sharing information in different forms in the future. For instance, new algorithms will be shared. More direct means of sharing new discoveries will be employed. New discoveries will go directly into the models and be reflected in the updated evidence base.
Okay, enough prognosticating from me. I’ve covered a lot of material and hope that I’ve left you with some greater insight into “Big Data” and what it means for scholarly publishers.
I’m happy to take a few questions now. You can find my contact information on the next page.