HFE & BCR-ABL
In Search of Links
© 2014, TopicQuests Foundation
Licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
Jack Park
BigData Science Meetup
Fremont, CA: 17 May, 2014
Shyam Sarkar, Organizer
Target Benefits
• SolrSherlock will support:
– Hypothesis formation
– Research/Experiment planning
– Deep Question Answering
• Personal medical issues
• …
“Therefore psychologically we must keep all the theories in our heads, and
every theoretical physicist who is any good knows six or seven different
theoretical representations for exactly the same physics.”
―Richard Feynman
“Why, sometimes I've believed as many as six impossible things before
breakfast.”
―The Queen: Through The Looking Glass
What We Have Read: HFE
• Human hemochromatosis protein, also known
as the HFE protein, is a protein which in
humans is encoded by the HFE gene. The HFE
gene is located on the short arm of chromosome 6
at location 6p22.2*
– Some mutations which are associated with
Hereditary Hemochromatosis (a genetic
disease)**:
• C282Y
• H63D
*http://en.wikipedia.org/wiki/HFE_%28gene%29
**http://www.genome.gov/10001214
What We Have Read: BCR-ABL aka:
Philadelphia Chromosome
• Philadelphia chromosome or Philadelphia
translocation is a specific chromosomal
abnormality that is associated with chronic
myelogenous leukemia (CML). It is the result
of a reciprocal translocation between
chromosomes 9 and 22, and is specifically
designated t(9;22)(q34;q11)*
*http://en.wikipedia.org/wiki/Philadelphia_chromosome
Are HFE and BCR-ABL Linked?
• One document instance which suggests they
are linked:
– “We found that HFE C282Y might be associated
with a protective role against CMPD. Because
chronic iron deficiency or latent anemia may
trigger disease susceptibility for CMPD, HFE C282Y
positivity may be a genetic factor influencing this
effect.”*
• Note: this response is simply evidence of a link, a
signal; it leaves open many questions
CMPD: Chronic Myeloproliferative Disease
* http://www.ncbi.nlm.nih.gov/pubmed/19258483
Where do we go from here?
• We have read about some actors
• We seek evidence for relationships between
those actors
• We have one small piece of evidence
• We turn to Literature-based Discovery (LBD)
– Read and process many papers
– Assemble an evidence field
– Determine answers and confidence levels
Sensemaking In Biological Research
http://www.biomedcentral.com/content/pdf/1471-2105-15-117.pdf Figure 1
© 2014 Mirel and Görg; licensee BioMed Central Ltd (cc by)
Literature-based Discovery
• Swanson’s ABC Model
• Two Varieties of LBD
– Closed Discovery
– Open Discovery
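The two varieties of LBD can be sketched in a few lines. This is a minimal illustration of the ABC idea only, not SolrSherlock code: the term pairs in `LINKS` are made-up placeholders, and the function names `closed_discovery` and `open_discovery` are assumptions chosen for this example.

```python
from collections import defaultdict

# Hypothetical toy "literature": each pair means the two terms were
# linked in at least one paper. The labels are placeholders.
LINKS = [("A", "B1"), ("B1", "C"), ("A", "B2"), ("B2", "C"), ("A", "B3")]

def neighbors(links):
    """Build an undirected adjacency map from term pairs."""
    adj = defaultdict(set)
    for x, y in links:
        adj[x].add(y)
        adj[y].add(x)
    return adj

def closed_discovery(links, a, c):
    """Closed discovery: A and C are given; return the B terms that link them."""
    adj = neighbors(links)
    return sorted(adj[a] & adj[c])

def open_discovery(links, a):
    """Open discovery: only A is given; return candidate C terms reachable
    through some B but not directly linked to A, with their B terms."""
    adj = neighbors(links)
    candidates = defaultdict(set)
    for b in adj[a]:
        for c in adj[b]:
            if c != a and c not in adj[a]:
                candidates[c].add(b)
    return {c: sorted(bs) for c, bs in candidates.items()}
```

Asking whether HFE and BCR-ABL are linked is a closed-discovery question: both end points are known, and the search is for the intermediate terms that connect them.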
SolrSherlock Block Level
• Models
– Process Models
– Conceptual Graphs
– OpenBEL
• Identity
– Topic Map
• Topics
• Relations
• Associations
– Bayes
– DeepLearning
– HyperMembrane
• Interface
[Diagram: stacked blocks labeled Interface, Associations, Identity, Models, Data]
SolrSherlock’s HyperMembrane
• SolrSherlock Big Picture
– Documents to harvest
– Sentences to parse
• WordGrams from the sentences
– Lenses to interpret the sentences
» NTuples from the WordGrams
– Lenses to interpret whole documents
• HyperMembrane as a fabric woven from the
NTuples
– Organizes statements read from literature into a kind
of associative fabric, linked into a topic map
HyperMembrane Inspiration
http://xanadu.com/zigzag/ZZdnld/zzRefDef/
https://www.flickr.com/photos/portier/2927798222/sizes/s/
HyperMembrane Internal Structure
[Diagram: Graph Agent, Structure Agent, Sentence Agent, Document Agent, and Query Agent, with the Information Fabric]
Sentence Parse
• Salient WordGrams in that sentence:
– C282Y
– might be associated with a
– protective role against
• Transforms to: protect against
– CMPD
We found that HFE C282Y might be associated with a protective role against CMPD
+-----------------MVp-----------------------------------+
| +---------Js------------+ |
+---Cet------+ | | +-------Ds---------+ |
+-Sp-+--TH--+ +--G-+--Ss--+--Ix---+---Pv-----+---MVp--+ | +----A---+ +--Js--+
| | | | | | | | | | | | | |
we found.p that.c HFE C282Y might.v be.v associated.v with a protective.a role.n against CMPD
Parse produced by a Java implementation of Link Grammar Parser
WordGram instances created while processing the sentence
WordGram Example
• Sentence:
– CO2 causes climate change
• WordGrams
– Terminals
• CO2
• causes
• climate
• change
– Pairs
• CO2 causes
• causes climate
• climate change
– Triples
• CO2 causes climate
• causes climate change
– Quads
• CO2 causes climate change
• Parsed Result—representation of the sentence:
– CO2 (terminal, noun)
– cause (terminal, verb, transformed causes → cause)
– climate change (pair, noun phrase)
• Resulting NTuple
– {CO2, cause, climate change}
• Where the names are replaced with topic locators from the topic map
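The WordGram enumeration on this slide can be reproduced with a short sketch. It assumes that a WordGram is simply a contiguous run of one to four words; the function name `wordgrams` and the `max_len` parameter are illustrative and not taken from SolrSherlock.

```python
def wordgrams(sentence, max_len=4):
    """Group contiguous word runs by length: 1 = terminals, 2 = pairs,
    3 = triples, 4 = quads. max_len=4 is an assumption based on the slide."""
    words = sentence.split()
    grams = {}
    for n in range(1, max_len + 1):
        grams[n] = [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]
    return grams

grams = wordgrams("CO2 causes climate change")
for n, items in grams.items():
    print(n, items)
```

Parsing then selects from these candidates (here the terminals CO2 and causes, and the pair climate change) to build the NTuple.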
These WordGram instances represent the sentence; they are wired into the fabric.
This NTuple participates in high-level structure formation and in question answering
Lenses
• Simple Interpreters
– Based on Canonical Predicates
– Build structures from parsed sentences and
WordGrams
– Examples from biology
• Cause
• Bind
• Augment
• Prevent
• Increase
• Decrease
• Believe
Multiple Lenses
• Consider this sentence:
– We believe that A causes B
– Two Lenses in play
• Believe
• Cause
– Result is a nested NTuple
• {We, believe, {A, cause, B}}
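A rough sketch of how two lenses could cooperate to produce the nested NTuple. It assumes the sentence has already been tokenized and its predicates reduced to canonical form; the registry `LENSES`, the `lens` decorator, and `interpret` are hypothetical names for this illustration, not the SolrSherlock API.

```python
LENSES = {}  # canonical predicate -> lens function (hypothetical registry)

def lens(predicate):
    """Register a lens under its canonical predicate."""
    def register(fn):
        LENSES[predicate] = fn
        return fn
    return register

@lens("believe")
def believe_lens(subject, rest):
    # "believe that <clause>": the object is itself an interpreted clause.
    clause = rest[1:] if rest and rest[0] == "that" else rest
    return (subject, "believe", interpret(clause))

@lens("cause")
def cause_lens(subject, rest):
    return (subject, "cause", " ".join(rest))

def interpret(tokens):
    """Pick the lens named by the second token and let it build the NTuple."""
    subject, predicate, rest = tokens[0], tokens[1], tokens[2:]
    return LENSES[predicate](subject, rest)

nested = interpret(["We", "believe", "that", "A", "cause", "B"])
print(nested)
```

The Believe lens does not need to know about Cause; it hands the embedded clause back to `interpret`, which is what makes the nesting fall out naturally.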
Canonical Predicate
• Results from transformations on predicates
– E.g.
• A causes B, A can cause B, A will cause B → A cause B
• A is caused by B → B cause A
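The transformations above can be sketched with two patterns, one for active forms and one for the passive. This is only an illustration under strong assumptions: single-word actors, a hand-written lemma table `LEMMAS`, and regular expressions standing in for a real parser.

```python
import re

# Hypothetical lemma table; a real system would use morphological analysis.
LEMMAS = {"causes": "cause", "caused": "cause"}

ACTIVE = re.compile(r"^(\w+) (?:can |will )?(\w+) (\w+)$")
PASSIVE = re.compile(r"^(\w+) is (\w+) by (\w+)$")

def canonicalize(statement):
    """Return (actor, canonical predicate, actor) for the simple forms shown."""
    m = PASSIVE.match(statement)
    if m:
        patient, verb, agent = m.groups()
        # Passive voice: swap the actors so the agent comes first.
        return (agent, LEMMAS.get(verb, verb), patient)
    m = ACTIVE.match(statement)
    if m:
        subject, verb, obj = m.groups()
        return (subject, LEMMAS.get(verb, verb), obj)
    raise ValueError("unrecognized form: " + statement)

for s in ["A causes B", "A can cause B", "A will cause B", "A is caused by B"]:
    print(s, "->", canonicalize(s))
```

Because every surface form collapses to one canonical predicate, the fabric needs only one WordGram for "cause".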
Actors: Named Entities
• For any given named entity, there will be one and
only one WordGram
– Issue of Ambiguity
• Same name string can serve different topics in the topic map
– Topic map maintains identity for disambiguation
• Thus, a single WordGram might be associated with more
than one individual actor
• This means:
– Fibers (threads) flowing through the fabric must be
maintained in bundles according to their context
(topic)
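One way to picture "one and only one WordGram" per name string, with the topic map carrying identity, is the sketch below. The class `WordGramRegistry` and its methods are assumptions for illustration; the topic locators JP101 and JP102 are borrowed from the Fabric Example slide.

```python
class WordGramRegistry:
    """Keeps a single WordGram record per name string (hypothetical sketch)."""

    def __init__(self):
        self._by_text = {}

    def intern(self, text):
        # Return the existing record for this string, or create it once.
        return self._by_text.setdefault(text, {"text": text, "topics": set()})

    def bind(self, text, topic_locator):
        # Associate the shared WordGram with one more topic in the topic map.
        wordgram = self.intern(text)
        wordgram["topics"].add(topic_locator)
        return wordgram

registry = WordGramRegistry()
first = registry.bind("Jack Park", "JP101")
second = registry.bind("Jack Park", "JP102")
print(first is second, sorted(first["topics"]))
```

The same record comes back both times, so any thread passing through "Jack Park" has to be bundled by topic locator to tell the two actors apart.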
Lens Selection and Action
• The Lens:
– ProtectAgainst
• Selected by the WordGram for “protect against”
– Is a transformation of the WordGram for “protective role
against”
• Lens Action:
– Create an NTuple
• {C282Y, protect against, CMPD}
• We will call that NTuple an Assertion
We found that HFE C282Y might be associated with a protective role against CMPD
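Lens selection for this sentence might look roughly like the following. The dictionaries `TRANSFORMS` and `LENS_BY_WORDGRAM` and the function names are hypothetical; they only mirror the steps listed on the slide (transform the WordGram, select the lens, create the NTuple).

```python
# Hypothetical transformation table: surface WordGram -> canonical WordGram.
TRANSFORMS = {"protective role against": "protect against"}

def protect_against(actor, target):
    """The ProtectAgainst lens: build the NTuple (the Assertion)."""
    return (actor, "protect against", target)

# Hypothetical lookup: the canonical WordGram selects the lens.
LENS_BY_WORDGRAM = {"protect against": protect_against}

def apply_lens(actor, phrase, target):
    canonical = TRANSFORMS.get(phrase, phrase)
    selected = LENS_BY_WORDGRAM[canonical]
    return selected(actor, target)

assertion = apply_lens("C282Y", "protective role against", "CMPD")
print(assertion)
```

Note that the hedge "might be associated with" is dropped in this sketch; how confidence is carried along with an Assertion is not specified on the slide.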
Weaving an Information Fabric
• Background:
– One and only one
WordGram for each
Actor (named entity)
– One and only one
WordGram for each
canonical Predicate
– One and only one
NTuple for each
Assertion
• WordGrams which form
an NTuple are strung
together as beads on a
string in the fabric.
– Thus, it is the detection
of NTuple structures
(Assertions) which forms
the HyperMembrane’s
fabric.
Note: it is next to impossible to diagram the fabric, but it
will likely look like a very tangled knotted structure.
https://www.flickr.com/photos/fermicat/273539481/in/set-72157601620157588/
Fabric Example
• Two NTuples
– {Jack Park, AuthoredBook, The Wind Power Book}
– {Jack Park, AuthoredBook, Ohio State University
Football Vault}
[Diagram: fabric nodes labeled Jack Park, JP101, JP102, AuthoredBook, Wind Power Book, Book101, OSU Football…, Book102]
Topic Map organizes fiber bundles
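The "beads on a string" idea can be approximated with interned WordGrams and NTuples. This is a sketch under stated assumptions, not the HyperMembrane implementation: the class `Fabric`, its integer ids, and the `threads` index are invented for illustration.

```python
class Fabric:
    """Toy fabric: one id per WordGram, one id per NTuple (hypothetical)."""

    def __init__(self):
        self.wordgrams = {}  # text -> WordGram id
        self.ntuples = {}    # tuple of WordGram ids -> NTuple id
        self.threads = {}    # WordGram id -> ids of NTuples passing through it

    def _wordgram(self, text):
        # Reuse the existing WordGram for this text, or allocate the next id.
        return self.wordgrams.setdefault(text, len(self.wordgrams))

    def weave(self, subject, predicate, obj):
        key = tuple(self._wordgram(t) for t in (subject, predicate, obj))
        if key not in self.ntuples:
            self.ntuples[key] = len(self.ntuples)
            for wordgram_id in set(key):
                self.threads.setdefault(wordgram_id, []).append(self.ntuples[key])
        return self.ntuples[key]

fabric = Fabric()
t1 = fabric.weave("Jack Park", "AuthoredBook", "The Wind Power Book")
t2 = fabric.weave("Jack Park", "AuthoredBook", "Ohio State University Football Vault")
jack = fabric.wordgrams["Jack Park"]
print(fabric.threads[jack])
```

Both NTuples pass through the same "Jack Park" and "AuthoredBook" WordGrams, which is where the fabric becomes tangled and where the topic map is needed to sort threads into bundles.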
Looking Forward
• Lenses, today, are hardwired
– Opportunity for adaptive learning of new lenses
• Fabric, today, is simple
– Opportunity to use cardinalities, frequency counts
in the fabric for:
• Probabilistic modeling
• Topological studies
• Opportunity for a Domain-Specific Language
(DSL) to emerge
Completed Representation
[Diagram: nodes labeled Bacterial Infection, Antioxidants, and Compromised Host; links labeled Contraindicates, Because, and Appropriate For; statements "antioxidants kill free radicals" and "macrophages use free radicals to kill bacteria"]
Let us co-create Cognitive Agents for Discovery
jackpark@topicquests.org
Thanks to Mei Lin Fung, David Alexander Price, and Patrick Durusau for
valuable comments
SolrSherlock at:
http://debategraph.org/SolrSherlock and https://github.com/SolrSherlock

 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 

SolrSherlock: Linkfinding among Biomolecules with Literature-based Discovery

  • 1. HFE & BCR-ABL In Search of Links © 2014, TopicQuests Foundation Licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. Jack Park BigData Science Meetup Fremont, CA: 17 May, 2014 Shyam Sarkar, Organizer
  • 2. Target Benefits • SolrSherlock will support: – Hypothesis formation – Research/Experiment planning – Deep Question Answering • Personal medical issues • … “Therefore psychologically we must keep all the theories in our heads, and every theoretical physicist who is any good knows six or seven different theoretical representations for exactly the same physics.” ―Richard Feynman “Why, sometimes I've believed as many as six impossible things before breakfast.” ―The Queen: Through The Looking Glass
  • 3. What We Have Read: HFE • Human hemochromatosis protein also known as the HFE protein is a protein which in humans is encoded by the HFE gene. The HFE gene is located on short arm of chromosome 6 at location 6p22.2* – Some mutations which are associated with Hereditary Hemochromatosis (a genetic disease)**: • C282Y • H63D *http://en.wikipedia.org/wiki/HFE_%28gene%29 **http://www.genome.gov/10001214
  • 4. What We Have Read: BCR-ABL aka: Philadelphia Chromosome • Philadelphia chromosome or Philadelphia translocation is a specific chromosomal abnormality that is associated with chronic myelogenous leukemia (CML). It is the result of a reciprocal translocation between chromosome 9 and 22, and is specifically designated t(9;22)(q34;q11)* *http://en.wikipedia.org/wiki/Philadelphia_chromosome
  • 5. Are HFE and BCR-ABL Linked? • One document instance which suggests they are linked: – “We found that HFE C282Y might be associated with a protective role against CMPD. Because chronic iron deficiency or latent anemia may trigger disease susceptibility for CMPD, HFE C282Y positivity may be a genetic factor influencing this effect.”* • Note: this response is simply evidence of a link, a signal; it leaves open many questions CMPD: Chronic Myeloproliferative Disease * http://www.ncbi.nlm.nih.gov/pubmed/19258483
  • 6. Where do we go from here? • We have read about some actors • We seek evidence for relationships between those actors • We have one small piece of evidence • We turn to Literature-based Discovery (LBD) – Read and process many papers – Assemble an evidence field – Determine answers and confidence levels
  • 7. Sensemaking In Biological Research http://www.biomedcentral.com/content/pdf/1471-2105-15-117.pdf Figure 1 © 2014 Mirel and Görg; licensee BioMed Central Ltd (cc by)
  • 8. Literature-based Discovery • Swanson’s ABC Model • Two Varieties of LBD – Closed Discovery – Open Discovery
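The two varieties of LBD named on this slide can be sketched in a few lines. This is a minimal illustration of the general ABC idea (A is linked to B in some papers, B to C in others, so A and C may be related), not SolrSherlock's implementation. The `links` dictionary and the term names `A`, `B1`, `B2`, `C` are hypothetical placeholders for co-occurrence links harvested from literature.

```python
def closed_discovery(a, c, links):
    """Closed discovery: given both end points, return the shared B terms."""
    return sorted(links.get(a, set()) & links.get(c, set()))


def open_discovery(a, links):
    """Open discovery: start from A, follow A-B then B-C, and return
    the C terms that are not already directly linked to A."""
    direct = links.get(a, set())
    candidates = set()
    for b in direct:
        candidates |= links.get(b, set())
    return sorted(candidates - direct - {a})


# Hypothetical toy link table: term -> set of terms it co-occurs with.
links = {
    "A": {"B1", "B2"},
    "B1": {"A", "C"},
    "B2": {"A", "C"},
    "C": {"B1", "B2"},
}

shared_b = closed_discovery("A", "C", links)
new_c = open_discovery("A", links)
print(shared_b, new_c)
```

In the HFE and BCR-ABL question, closed discovery would correspond to starting with both actors and searching for the intermediate terms that connect them.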
  • 9. SolrSherlock Block Level • Models – Process Models – Conceptual Graphs – OpenBEL • Identity – Topic Map • Topics • Relations • Associations – Bayes – DeepLearning – HyperMembrane • Interface [Diagram labels: Interface, Associations, Identity, Models, Data]
  • 10. SolrSherlock’s HyperMembrane • SolrSherlock Big Picture – Documents to harvest – Sentences to parse • WordGrams from the sentences – Lenses to interpret the sentences » NTuples from the WordGrams – Lenses to interpret whole documents • HyperMembrane as a fabric woven from the NTuples – Organizes statements read from literature into a kind of associative fabric, linked into a topic map
  • 13. Sentence Parse • Sentence: We found that HFE C282Y might be associated with a protective role against CMPD • Salient WordGrams in that sentence: – C282Y – might be associated with a – protective role against • Transforms to: protect against – CMPD • Tagged tokens from the parse: we found.p that.c HFE C282Y might.v be.v associated.v with a protective.a role.n against CMPD [Linkage diagram omitted; link labels shown: MVp, Js, Cet, Ds, Sp, TH, G, Ss, Ix, Pv, A] • Parse produced by a Java implementation of Link Grammar Parser
  • 14. WordGram Example • Sentence: – CO2 causes climate change • WordGrams – Terminals • CO2 • causes • climate • change – Pairs • CO2 causes • causes climate • climate change – Triples • CO2 causes climate • causes climate change – Quads • CO2 causes climate change • Parsed Result (representation of the sentence): – CO2 (terminal, noun) – cause (terminal, verb, transformed causes → cause) – climate change (pair, noun phrase) • Resulting NTuple – {CO2, cause, climate change} • Where the names are replaced with topic locators from the topic map • Callouts: WordGram instances are created while processing the sentence. These WordGram instances represent the sentence; they are wired into the fabric. This NTuple participates in high-level structure formation and in question answering
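The terminals, pairs, triples, and quads on this slide are the contiguous word sequences of length 1 to 4. A minimal sketch of that enumeration follows; the function name `wordgrams` and the whitespace tokenization are assumptions made for illustration, and the real system works from a Link Grammar parse rather than a plain split.

```python
def wordgrams(sentence, max_n=4):
    """Return {n: [contiguous n-word sequences]} for n = 1..max_n."""
    words = sentence.split()  # assumption: whitespace tokenization
    grams = {}
    for n in range(1, max_n + 1):
        grams[n] = [
            " ".join(words[i:i + n])
            for i in range(len(words) - n + 1)
        ]
    return grams


grams = wordgrams("CO2 causes climate change")
for n, items in grams.items():
    print(n, items)
```

Selecting which of these WordGrams are salient (here `CO2`, `cause`, and `climate change`) is the job of the parser and the lenses, and is not shown.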
  • 15. Lenses • Simple Interpreters – Based on Canonical Predicates – Build structures from parsed sentences and WordGrams – Examples from biology • Cause • Bind • Augment • Prevent • Increase • Decrease • Believe
  • 16. Multiple Lenses • Consider this sentence: – We believe that A causes B – Two Lenses in play • Believe • Cause – Result is a nested NTuple • {We, believe, {A, cause, B}}
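The nesting produced by two lenses can be sketched as a small recursive function. This is an illustration only: the `LENSES` set, the `apply_lenses` name, and the assumption that predicates have already been reduced to canonical form (`believe`, `cause`) are all hypothetical, not SolrSherlock's API.

```python
# Hypothetical set of canonical predicates that have a lens.
LENSES = {"believe", "cause"}


def apply_lenses(tokens):
    """Split at the first lens predicate and recurse into the object side,
    so an inner assertion becomes a nested tuple."""
    for i, tok in enumerate(tokens):
        if tok in LENSES and 0 < i < len(tokens) - 1:
            subject = " ".join(tokens[:i])
            rest = tokens[i + 1:]
            if rest[0] == "that":  # drop the complementizer
                rest = rest[1:]
            return (subject, tok, apply_lenses(rest))
    return " ".join(tokens)  # no lens applies: a plain actor


nested = apply_lenses(["We", "believe", "that", "A", "cause", "B"])
print(nested)
```

Here the outer tuple comes from the Believe lens and the inner tuple from the Cause lens.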
  • 17. Canonical Predicate • Results from transformations on predicates – E.g. • A causes B, A can cause B, A will cause B → A cause B • A is caused by B → B cause A
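The transformations on this slide can be sketched with two patterns: one that strips tense and modal variation, and one that swaps subject and object for the passive form. The regular expressions and the `canonicalize` function are assumptions for illustration and handle only the `cause` predicate; the actual system derives these forms from the parse.

```python
import re

# Passive form: "<object> is caused by <subject>"
PASSIVE = re.compile(r"^(?P<obj>.+?) is caused by (?P<subj>.+)$")
# Active forms: "causes", "cause", "can cause", "will cause"
ACTIVE = re.compile(r"^(?P<subj>.+?) (?:(?:can|will) )?causes? (?P<obj>.+)$")


def canonicalize(sentence):
    """Return (subject, 'cause', object), or None if no pattern matches."""
    m = PASSIVE.match(sentence)
    if m:
        return (m.group("subj"), "cause", m.group("obj"))
    m = ACTIVE.match(sentence)
    if m:
        return (m.group("subj"), "cause", m.group("obj"))
    return None


for s in ["A causes B", "A can cause B", "A will cause B", "A is caused by B"]:
    print(s, "->", canonicalize(s))
```

The passive pattern is tried first so that "A is caused by B" yields B as the subject.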
  • 18. Actors: Named Entities • For any given named entity, there will be one and only one WordGram – Issue of Ambiguity • Same name string can serve different topics in the topic map – Topic map maintains identity for disambiguation • Thus, a single WordGram might be associated with more than one individual actor • This means: – Fibers (threads) flowing through the fabric must be maintained in bundles according to their context (topic)
  • 19. Lens Selection and Action • The Lens: – ProtectAgainst • Selected by the WordGram for “protect against” – Is a transformation of the WordGram for “protective role against” • Lens Action: – Create an NTuple • {C282Y, protect against, CMPD} • We will call that NTuple an Assertion We found that HFE C282Y might be associated with a protective role against CMPD
  • 20. Weaving an Information Fabric • Background: – One and only one WordGram for each Actor (named entity) – One and only one WordGram for each canonical Predicate – One and only one NTuple for each Assertion • WordGrams which form an NTuple are strung together as beads on a string in the fabric. – Thus, it is the detection of NTuple structures (Assertions) that forms the HyperMembrane’s fabric. Note: it is next to impossible to diagram the fabric, but it will likely look like a very tangled knotted structure. https://www.flickr.com/photos/fermicat/273539481/in/set-72157601620157588/
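The "one and only one" rules amount to interning: each actor or predicate label maps to a single WordGram, and each assertion to a single NTuple keyed by its WordGrams. A minimal sketch follows, assuming integer identifiers and a simple occurrence count per NTuple. The `Fabric` class and its method names are hypothetical, and topic-map disambiguation of identical name strings is not modeled.

```python
class Fabric:
    """Toy fabric: one WordGram id per label, one NTuple per assertion."""

    def __init__(self):
        self.wordgrams = {}  # label -> integer id (actor or predicate)
        self.ntuples = {}    # (subj_id, pred_id, obj_id) -> times asserted

    def _intern(self, label):
        # Reuse the existing id for a known label, else assign the next one.
        return self.wordgrams.setdefault(label, len(self.wordgrams))

    def assert_ntuple(self, subject, predicate, obj):
        key = (self._intern(subject), self._intern(predicate), self._intern(obj))
        self.ntuples[key] = self.ntuples.get(key, 0) + 1
        return key


fabric = Fabric()
k1 = fabric.assert_ntuple("C282Y", "protect against", "CMPD")
k2 = fabric.assert_ntuple("C282Y", "protect against", "CMPD")  # same assertion again
k3 = fabric.assert_ntuple("CO2", "cause", "climate change")
print(k1, k2, k3, fabric.ntuples)
```

Reading the same assertion in a second paper returns the same key and only raises its count, which is one way the fabric could carry frequency information.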
  • 21. Fabric Example • Two NTuples – {Jack Park, AuthoredBook, The Wind Power Book} – {Jack Park, AuthoredBook, Ohio State University Football Vault} [Diagram labels: JP101, JP102, Book101, Book102, AuthoredBook, Wind Power Book, OSU Football…, Jack Park] • Topic Map organizes fiber bundles
  • 22. Looking Forward • Lenses, today, are hardwired – Opportunity for adaptive learning of new lenses • Fabric, today, is simple – Opportunity to use cardinalities, frequency counts in the fabric for: • Probabilistic modeling • Topological studies • Opportunity for a Domain-Specific Language (DSL) to emerge
  • 23. Completed Representation [Diagram text: antioxidants kill free radicals Contraindicates macrophages use free radicals to kill bacteria Bacterial Infection Antioxidants Because Appropriate For Compromised Host] • Let us co-create Cognitive Agents for Discovery • jackpark@topicquests.org • Thanks to Mei Lin Fung, David Alexander Price, and Patrick Durusau for valuable comments • SolrSherlock at: http://debategraph.org/SolrSherlock and https://github.com/SolrSherlock