Big Data at Geisinger Health System: Big Wins in a Short Time

DataWorks Summit
23 de Jun de 2017
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
1 de 37

Mais conteúdo relacionado

Mais procurados

The Knowledge Graph ExplosionThe Knowledge Graph Explosion
The Knowledge Graph ExplosionNeo4j
Easily Identify Sources of Supply Chain GridlockEasily Identify Sources of Supply Chain Gridlock
Easily Identify Sources of Supply Chain GridlockNeo4j
Creating an Effective MDM Strategy for SalesforceCreating an Effective MDM Strategy for Salesforce
Creating an Effective MDM Strategy for SalesforcePerficient, Inc.
Microsoft: A Waking Giant In Healthcare Analytics and Big DataMicrosoft: A Waking Giant In Healthcare Analytics and Big Data
Microsoft: A Waking Giant In Healthcare Analytics and Big DataHealth Catalyst
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
The hospital of the futureThe hospital of the future
The hospital of the futureDeloitte United States

Mais procurados(20)

Similar a Big Data at Geisinger Health System: Big Wins in a Short Time

How to Architect Smarter Systems for HealthcareHow to Architect Smarter Systems for Healthcare
How to Architect Smarter Systems for HealthcareReal-Time Innovations (RTI)
Hadoop Enabled HealthcareHadoop Enabled Healthcare
Hadoop Enabled HealthcareDataWorks Summit
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemWarren Kibbe
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemWarren Kibbe
Starting the Hadoop Journey at a Global Leader in Cancer ResearchStarting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchDataWorks Summit/Hadoop Summit
Starting the Hadoop Journey at a Global Leader in Cancer ResearchStarting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchDataWorks Summit/Hadoop Summit

Similar a Big Data at Geisinger Health System: Big Wins in a Short Time(20)

Mais de DataWorks Summit

Data Science Crash CourseData Science Crash Course
Data Science Crash CourseDataWorks Summit
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisDataWorks Summit
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiDataWorks Summit
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...DataWorks Summit
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...DataWorks Summit
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal SystemDataWorks Summit

Mais de DataWorks Summit(20)

Último

Unleashing the Power of Modern Carpooling Apps, Inspired by BlaBlaCarUnleashing the Power of Modern Carpooling Apps, Inspired by BlaBlaCar
Unleashing the Power of Modern Carpooling Apps, Inspired by BlaBlaCarArchie Cadell
Metadata & Discovery Group Conference 2023 - Day 2Metadata & Discovery Group Conference 2023 - Day 2
Metadata & Discovery Group Conference 2023 - Day 2CILIP MDG
Product Listing Presentation_Cathy.pptxProduct Listing Presentation_Cathy.pptx
Product Listing Presentation_Cathy.pptxCatarinaTorrenuevaMa
Swiss Re Reinsurance Solutions - Automated Claims Experience – Insurer Innova...Swiss Re Reinsurance Solutions - Automated Claims Experience – Insurer Innova...
Swiss Re Reinsurance Solutions - Automated Claims Experience – Insurer Innova...The Digital Insurer
Announcing InfluxDB ClusteredAnnouncing InfluxDB Clustered
Announcing InfluxDB ClusteredInfluxData
Prompt Engineering - an Art, a Science, or your next Job Title?Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?Maxim Salnikov

Último(20)

Big Data at Geisinger Health System: Big Wins in a Short Time

Notas do Editor

  1. Brief introduction about Geisinger
  2. EHR in mid-90s. By 2006, leadership wanted EDW. CDIS (clin dec intel syst) live in 2008. Big win early. Few Healthcare orgs had this integration platform at this time. Internally, depts. (research) no longer had to request extracts from Epic for analytics. One platform of data (clin, fin, claims) for analytics, to transform the delivery of care. It has gone through a number of iterations, and currently supports much of the analytics running our day-to-day operations. Over 2100 users. 2012, switched to TD (higher performance). 2016, UDA. Integrate all key analytics platforms (Hadoop, Cerner, Epic EDW)
  3. Next phase of our analytics platform: Hadoop (Big Data)
  4. Late binding of Hadoop allows for the data to simply be loaded without detailed analysis and preparation up-front.
  5. Our multi-zoned Hadoop system allows for many views of the data, including temporal, modeled, etc. Hadoop is not confined to structured data in discreet fields, as is the case with traditional analytic platforms.
  6. LDAP and AD Integration using Ranger/Knox Encryption at rest SSL endpoint encryption active for all network connections Kerberos Authentication: To thwart impersonation threats Appropriate access and roles as required. These roles will continue to be defined by the Data Manger or his designate All PHI data will be masked in the Development environment
  7. Less costly hardware for storing increasing data (structured and unstructured) 5 million to purchase new Terradata hardware Prevent “one-off” data systems (e.g. IoT data capture, ICU real-time data capture, Cybersecurity)
  8. Lung nodules are commonly identified in free text within radiology reports and can easily be lost to follow up with potential for delayed cancer diagnosis. A treasure trove of useful, relevant, and unstructured clinical information in the form of text blobs and semi-templated data is locked inside EHRs. We used Solr, a module part of the Apache Hadoop ecosystem, to expose the data and let users perform rapid search. The ability to sort through over 184M clinical notes across 20-years worth of in/outpatient records Serves a framework to run CTAKES and other Natural Language Processing programs to find signal in the text noise, and make the data actionable.
  9. UMLS: Unified Medical Language System Negations Nearly 30 % of identified lung nodule notes are negative results. NLP engine constructs grammar tree and associates negation words with the identified lung nodule text Calculate Lung RADS scores based on nodule size and description Future tasks Measure accuracy of predicted Lund RADS scores and improve performace