SlideShare uma empresa Scribd logo
1 de 39
Baixar para ler offline
HyperMembrane Structures for
Open Source Cognitive Computing
Japanese Agency for Science and Technology
Tokyo, Japan
3 March, 2015
Jack Park
© 2015 TopicQuests Foundation CC by SA 4.0
The Present Situation
Upon this gifted age, in its dark hour,
Rains from the sky a meteoric shower
Of facts . . . they lie unquestioned,
uncombined.
Wisdom enough to leech us of our ill
Is daily spun; but there exists no loom
To weave it into fabric
Edna St. Vincent Millay, 1939
2
Topics To Cover
• Discovery, learning, problem solving
• Topic Maps
• OpenSherlock
• HyperMembranes
• Open Source
• Key reasons for building open source cognitive
systems
3
Cognitive Computing: My View
• Cognitive Computing is:
– Far less about what a computer knows
– Far more about how computers can
augment human cognitive capabilities
– Based on the J.C.R Licklider and
Douglas Engelbart augmentation work
J.C.R. Licklider
Douglas Engelbart
4Imgs: Wikipedia
A Domain-specific Problem Statement
• An Example:
– Do these two sentences say the same thing?
• CO2 is a causal factor in climate change.
• Climate change is caused by carbon dioxide.
• Problem Statement
– Software agents need elegant methods for
reading, representing, organizing, and modeling
information resources to support discovery and
answering questions.
5
A Framing Thought
• From [1]
– The understanding of global brain organization
and its large-scale integration remains a challenge
for modern neurosciences.
• To
– The understanding of global conversations about
topics that matter and their large-scale federation
remain a challenge for modern information
technology.
[1] Petri G, Expert P, Turkheimer F, Carhart-Harris R, Nutt D, Hellyer PJ, Vaccarino F. (2014)
Homological scaffolds of brain functional networks. J. R. Soc. Interface 11: 20140873.
6
Our Goals
• Improve Human-Tool Capabilities
• Augment existing analytic methods
– Increase opportunities for discovery
– Improve already sophisticated methods
• Build Looms
– Read documents
– Map and model topics read
– Weave information fabrics
Douglas Engelbart
7
Discovery
• Is it really possible for people to see
everything?
– Part of discovery is connecting dots not
yet connected.
– “Cognitive Agents” can help increase
chances of serendipity.
“Discovery consists of
seeing what everybody
has seen and thinking
what nobody has
thought.”
–Albert Szent-Györgyi
8
Related Work
• Commercial
– IBM Watson
– Wolfram Alpha
– Viv
– Saffron 10
– Clueda
– Siri
– Google Now
– Cortana
– …
• Open Source
– OAQA
– DeepDive
– OpenCog
– OpenNARS
– Watsonsim
– YodaQA
– AKSW OpenQA
– AKSW QA
– AquaLog
– OpenSherlock
– OpenIRIS (CALO)
– …
• Research
– Project Aristo
– Project Halo
– FREyA
– CASIA
– NLP-Reduce
– EIS Sina
– WDAqua ITM
– Intui2
– …
9
Biologically Inspired Design
• Humans are blessed with:
– Memory to keep concepts organized and
connected
– Internal mechanisms which map sensor data into
memory for processing and storage
– The abilities of complex, adaptive, anticipatory
systems
10
Memory: Introducing Topic Maps
• A Topic Map is like a library without all the books*
– A Topic Map is indexical
• Like a card catalog
– Each topic has its own representation
• Improving on a card catalog, a topic can be identified many
different ways
• Captures metadata and optionally content
– A Topic Map is relational
• Like a good road map
– Topics are connected by associations (relations)
– Topics point to their occurrences in the territory
– A Topic Map is organized
• Multiple records on the same topic are co-located (stored as one
topic) in the map
*a map is not its territory
11
TopicMap Structure
•Topics as Actors
•Topics as Relations
•Topics as Types
•Topics as Biographies
12
Processing Mechanisms
• Typically, software processes take the form of
variants of NLP (natural language processing)
– Parsers
– Cluster analysis
– Entity recognition
– Relation detection
– Role recognition
– Probabilistic methods
13
A Key Question in My Research
• Can a Topic Map learn (construct itself) by “reading” literature?
– Relevant issues:
• Bootstrapping
• Machine reading
– NLP
– Linguistics
– Statistics
– Analogy & Metaphor
– …
• Knowledge representation
• Model building
– Anticipation
• Weaving information fabrics
• Literature-based discovery
• Deep Question Answering
14
A Simple Example
• Read this sentence:
– Gene expression is caused by insoluble hormones
binding to a plasma membrane hormone receptor
• Topic Map recognizes:
– Gene expression  GeneExpression
– insoluble hormones  InsolubleHormone
– plasma membrane hormone receptor 
PlasmaMembraneReceptor
• Software agents transform:
– is caused by  Cause
– binding to  Binds
• Final semantic structure:
• { {InsolubleHormone, Binds, PlasmaMembraneReceptor},
Cause, GeneExpression }
15
Introducing OpenSherlock
• OpenSherlock is:
– A Topic Map for information resource identity and organization
– A HyperMembrane information fabric structure
– A society of agents system which can
• Read documents
• Process information resources
– Maintain the topic map
– Maintain the HyperMembrane
– Build and maintain models
– Perform discovery tasks
– Answer questions
– Agents are coordinated by:
• A blackboard system
• A dynamic task-based agenda
• Event propagation and handling
16
Observations 1
• A Topic Map is central to the key question, and
therefore to a thesis entailed by this research
– It serves as a kind of memory for social processes
– It provides a robust platform for subject identity
– It can also serve as a repository for domain-
specific vocabularies (ontologies, taxonomies,
naming conventions,…)
17
Observations 2
• A Topic Map is necessary but not sufficient to support
discovery, learning, or problem solving
– It really only provides a powerful indexical structure related to
the key artifacts in any universe of discourse:
• Actors
• Their relations
• Their states
• Rules, laws, theories,…
• To model those key artifacts, other representation
strategies are required
– Conceptual Graphs
– Qualitative Process Theory
– Belief Networks
– …
18
A Research Question
• What processes are available which, if
performed while harvesting (reading)
documents, can reduce the amount of
processing required later during question
answering?
– The question entails
• Synthesis of ontology
• Co-reference resolution
• Re-representation during question lifting
• …
19
A Working Hypothesis
• Process
– Build and maintain a content-addressable memory
of questions, claims, arguments, and evidence
fields.
• We call that a HyperMembrane
– Note:
• Every text object passed into the system is processed by
the same algorithms
– Sentences harvested from text
– Questions and responses posed by humans
20
Key Concept: HyperMembrane
• HyperMembrane is a key concept in the
working hypothesis that OpenSherlock seeks
to explore and demonstrate
– A growing graph as a collection of woven and
intersecting fabrics
• constructed from normalized tuples (n-tuples) which
are designed to reduce the amount of NLP required to
read documents
• such that intersections of fabrics occur where named
entities in the graph of n-tuples are the same
– Inspired by Ted Nelson’s ZigZag Architecture
21
Machine Reading in OpenSherlock
• Goals:
– Grow the topic map
• Topic Map then serves to support fabrication of higher-order
knowledge structures
– Conceptual Graphs
– Belief Networks
– QP Theory Models
– HyperMembrane
– …
• Process Loop:
– For a given document
• For every paragraph in that document
– For every sentence in each paragraph
» Read the sentence
22
Sentence Reading
• First Step:
– Process sentence into word grams*
• Second Step:
– Where possible
• Transform word grams into n-tuples**
• n-tuples form the HyperMembrane
* A container of words, from 1 to 8 words per container
** A container of symbols based on words in word grams
23
Process Sentence into WordGrams
• Approach
– Break sentence into word grams*
• WordGram objects are shared across sentences
– Count of sentence identifiers associated with each object
serves as basis for probabilistic models
– Either
• TopicMap recognizes terms
– Or
• Sentence is parsed by Link-Grammar Parser**
• TopicMap learns from parse results
*http://en.wikipedia.org/wiki/W-shingling **http://www.link.cs.cmu.edu/link/
24
Transform WordGrams to N-Tuples
• Normalized tuple (N-Tuple)
– A structure where the subject, predicate, and object are normalized
• Nouns and verbs transformed
– CO2, Carbon Dioxide, …  CO2
– causes, is caused by, …  cause
• Two sentence example
– CO2 is a cause of climate change.
– Climate change is caused by carbon dioxide.
– Result:
» { CO2, cause, climate change }
– Normalization processes include general and domain specific lenses
• Rule-based interpreters which detect structures
– Taxonomy
– Causality
– Biomedical
– Geophysical
– …
• Process models
– Built and maintained while reading
– Predict while reading – Anticipatory Reading
25
About N-Tuples
• An N-Tuple is a structured record of
– Topics in the topic map
– Those topics are harvested from text
• An N-Tuple takes the form:
– { Subject, Predicate, Object }
– Where
• Subject and/or Object can be one of:
– A topic from the topic map
– Another N-Tuple
• An N-Tuple is identified by the identities of the terms it contains
– When thinking in terms of terms (words) read from documents, the identities
(numeric representations) of those terms form the identity of the N-Tuple
object.
• N-Tuples are content addressable
• Disambiguation of subjects is a topic mapping process
– Learning means continuous refinement of subject identity
– Ambiguities can also be solved through human intervention
26
N-Tuples as HyperMembrane
Tuples
{A, Bind, B}
{{A, Bind, B}, Cause, X}
{X, Bind, D}
{{X, Bind, D}, Cause, Y}
27
A B
Bind
X
Y
D
Cause
Bind
Cause
Current State of OpenSherlock
ElasticSearch
Titan
or
Blazegraph
Ontology
Importer
Ontologies
PubMed
Reader
PubMed
Abstracts
HyperMembrane
Engine
TellAsk
UMLS
Importer
UMLS
28
Observations 3
• HyperMembrane is a reminding system
– HyperMembrane is a record of federated human
conversation
• Harvested from books, papers, and recorded
conversation
• Includes statistical properties of recorded utterances
– HyperMembrane records:
• That which is common
• That which is novel
– Possibly wrong
– Possibly game changing
29
TellAsk Interface
Conversation Tree
User can click a
node to select as
parent for any
user response
Response Type Selectors.
Selection required before
response.
User types here
Linear
conversation flow
Entry Forms Selector List
Map starts a new conversation
with entered topic
30
The Open Source Stack
• Persistence
– ElasticSearch
– Considering Titan
– Considering Blazegraph (Bigdata™ RDF Store)
• Libraries
– Many from Apache Foundation and others
– LinkGrammarParser (Java version)
– XML PullParser
– Simple JSON Parser
• Tools
– Eclipse
31
Summary
32
Current State of Development
• Aim to answer simple questions about
casuality
– Current focus on biomedical domain
– Current focus on two lenses
• Taxonomy
• Casuality
– No Conceptual Graphs
– No Process Models
– No Probabilistic Models
33
Future Work
• Aim to complete an anticipatory system
– Process models for anticipation
– Conceptual graphs
– Probabilistic models
– More lenses
• Pluggable lenses
• Adaptive lenses
– More domains
34
Why Do This?
• Augment human capabilities in problem
solving
• Participate in Open Science
35
Augmenting Social Sensemaking
1
2
3
Creating Ideas
Refining Connections
Connecting Ideas
Cancer patient
36
Participate in Open Science
37
Key Context for Open Science
• A planet-wide, collaborative quest for Global
Thrivability*.
– Issues include
• Sociological events
– Health, epidemics, wars,…
• Geophysical events
– Climate change, earthquakes, volcanoes, …
• Astrophysical events
– Asteroids, our Sun. …
* Let’s call the quest: EarthMoonshot
38
Completed Representation
antioxidants
kill
free radicals
Contraindicates
macrophages use
free radicals to
kill bacteria
Bacterial Infection Antioxidants
Because
Appropriate For
Compromised Host
Let us co-create Cognitive Agents for Discovery
jackpark@topicquests.org
OpenSherlock documents at: http://debategraph.org/OpenSherlock
Code emerging at: https://github.com/opensherlock/
Slides online at http://slideshare.net/jackpark/
Acknowledgments:
Bob Gleichauf
David Alexander Price
Arun Majumdar
Robert S. Stephenson
Mark Szpakowski
Martin Radley
Sherry Jones
Alexander Wenzowski
Ted Kahn
Patrick Durusau
39

Mais conteúdo relacionado

Mais procurados

Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)James Hendler
 
Facilitating Web Science Collaboration through Semantic Markup
Facilitating Web Science Collaboration through Semantic MarkupFacilitating Web Science Collaboration through Semantic Markup
Facilitating Web Science Collaboration through Semantic MarkupJames Hendler
 
On Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the WebOn Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the WebJames Hendler
 
The Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataThe Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataJames Hendler
 
The Future(s) of the World Wide Web
The Future(s) of the World Wide WebThe Future(s) of the World Wide Web
The Future(s) of the World Wide WebJames Hendler
 
Broad Data (India 2015)
Broad Data (India 2015)Broad Data (India 2015)
Broad Data (India 2015)James Hendler
 
Watson: An Academic's Perspective
Watson: An Academic's PerspectiveWatson: An Academic's Perspective
Watson: An Academic's PerspectiveJames Hendler
 
The Semantic Web: It's for Real
The Semantic Web: It's for RealThe Semantic Web: It's for Real
The Semantic Web: It's for RealJames Hendler
 
Data Big and Broad (Oxford, 2012)
Data Big and Broad (Oxford, 2012)Data Big and Broad (Oxford, 2012)
Data Big and Broad (Oxford, 2012)James Hendler
 
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web
The Future of AI: Going BeyondDeep Learning, Watson, and the Semantic WebThe Future of AI: Going BeyondDeep Learning, Watson, and the Semantic Web
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic WebJames Hendler
 
KR in the age of Deep Learning
KR in the age of Deep LearningKR in the age of Deep Learning
KR in the age of Deep LearningJames Hendler
 
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...James Hendler
 
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...James Hendler
 
Digital Archiving, The Semantic Web, and Modern AI
Digital Archiving, The Semantic Web, and Modern AIDigital Archiving, The Semantic Web, and Modern AI
Digital Archiving, The Semantic Web, and Modern AIJames Hendler
 
Prateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State UniversityPrateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State UniversityPrateek Jain
 
The Semantic Web: 2010 Update
The Semantic Web: 2010 Update The Semantic Web: 2010 Update
The Semantic Web: 2010 Update James Hendler
 
Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)James Hendler
 
5 Thoughts on Staying Sharp and Relevant (Chicago)
5 Thoughts on Staying Sharp and Relevant (Chicago)5 Thoughts on Staying Sharp and Relevant (Chicago)
5 Thoughts on Staying Sharp and Relevant (Chicago)Scott Lowe
 

Mais procurados (20)

Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)
 
Facilitating Web Science Collaboration through Semantic Markup
Facilitating Web Science Collaboration through Semantic MarkupFacilitating Web Science Collaboration through Semantic Markup
Facilitating Web Science Collaboration through Semantic Markup
 
On Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the WebOn Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the Web
 
The Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataThe Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of Metadata
 
The Future(s) of the World Wide Web
The Future(s) of the World Wide WebThe Future(s) of the World Wide Web
The Future(s) of the World Wide Web
 
Broad Data (India 2015)
Broad Data (India 2015)Broad Data (India 2015)
Broad Data (India 2015)
 
Watson: An Academic's Perspective
Watson: An Academic's PerspectiveWatson: An Academic's Perspective
Watson: An Academic's Perspective
 
The Semantic Web: It's for Real
The Semantic Web: It's for RealThe Semantic Web: It's for Real
The Semantic Web: It's for Real
 
Data Big and Broad (Oxford, 2012)
Data Big and Broad (Oxford, 2012)Data Big and Broad (Oxford, 2012)
Data Big and Broad (Oxford, 2012)
 
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web
The Future of AI: Going BeyondDeep Learning, Watson, and the Semantic WebThe Future of AI: Going BeyondDeep Learning, Watson, and the Semantic Web
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web
 
KR in the age of Deep Learning
KR in the age of Deep LearningKR in the age of Deep Learning
KR in the age of Deep Learning
 
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
 
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
 
Towards An Improvement Community Platform for Service Innovation
Towards An Improvement Community Platform for Service InnovationTowards An Improvement Community Platform for Service Innovation
Towards An Improvement Community Platform for Service Innovation
 
Digital Archiving, The Semantic Web, and Modern AI
Digital Archiving, The Semantic Web, and Modern AIDigital Archiving, The Semantic Web, and Modern AI
Digital Archiving, The Semantic Web, and Modern AI
 
Prateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State UniversityPrateek Jain dissertation defense, Kno.e.sis, Wright State University
Prateek Jain dissertation defense, Kno.e.sis, Wright State University
 
The Semantic Web: 2010 Update
The Semantic Web: 2010 Update The Semantic Web: 2010 Update
The Semantic Web: 2010 Update
 
Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)
 
2014 aus-agta
2014 aus-agta2014 aus-agta
2014 aus-agta
 
5 Thoughts on Staying Sharp and Relevant (Chicago)
5 Thoughts on Staying Sharp and Relevant (Chicago)5 Thoughts on Staying Sharp and Relevant (Chicago)
5 Thoughts on Staying Sharp and Relevant (Chicago)
 

Semelhante a HyperMembrane Structures for Open Source Cognitive Computing

Topic Pages. From articles to answers.
Topic Pages. From articles to answers.Topic Pages. From articles to answers.
Topic Pages. From articles to answers.Deep Kayal
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerFrancesco Osborne
 
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...Angelo Salatino
 
Recommending Semantic Nearest Neighbors Using Storm and Dato
Recommending Semantic Nearest Neighbors Using Storm and DatoRecommending Semantic Nearest Neighbors Using Storm and Dato
Recommending Semantic Nearest Neighbors Using Storm and DatoAshok Venkatesan
 
PS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptxPS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptxahmadbhattim005
 
PS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptxPS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptxUneezaRajpoot
 
Introduction to Object Oriented Programming
Introduction to Object Oriented ProgrammingIntroduction to Object Oriented Programming
Introduction to Object Oriented ProgrammingMoutaz Haddara
 
The Hitchhiker's Guide to Machine Learning with Python & Apache Spark
The Hitchhiker's Guide to Machine Learning with Python & Apache SparkThe Hitchhiker's Guide to Machine Learning with Python & Apache Spark
The Hitchhiker's Guide to Machine Learning with Python & Apache SparkKrishna Sankar
 
Searching of Web and Electronic Resources
Searching of Web and Electronic Resources Searching of Web and Electronic Resources
Searching of Web and Electronic Resources Bramesha B
 
Artificial Intelligent introduction or history
Artificial Intelligent introduction or historyArtificial Intelligent introduction or history
Artificial Intelligent introduction or historyArslan Sattar
 
Knowledge base system appl. p 1,2-ver1
Knowledge base system appl.  p 1,2-ver1Knowledge base system appl.  p 1,2-ver1
Knowledge base system appl. p 1,2-ver1Taymoor Nazmy
 
The Materials Project: Experiences from running a million computational scien...
The Materials Project: Experiences from running a million computational scien...The Materials Project: Experiences from running a million computational scien...
The Materials Project: Experiences from running a million computational scien...Anubhav Jain
 
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎Libcorpio
 
Semantic engagement
Semantic engagementSemantic engagement
Semantic engagementSTIinnsbruck
 

Semelhante a HyperMembrane Structures for Open Source Cognitive Computing (20)

Topic Pages. From articles to answers.
Topic Pages. From articles to answers.Topic Pages. From articles to answers.
Topic Pages. From articles to answers.
 
Ngsp
NgspNgsp
Ngsp
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
 
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...
The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly...
 
Recommending Semantic Nearest Neighbors Using Storm and Dato
Recommending Semantic Nearest Neighbors Using Storm and DatoRecommending Semantic Nearest Neighbors Using Storm and Dato
Recommending Semantic Nearest Neighbors Using Storm and Dato
 
DatoConference2015
DatoConference2015DatoConference2015
DatoConference2015
 
PS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptxPS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptx
 
PS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptxPS3103 Cognitive Psy Lecture 1.pptx
PS3103 Cognitive Psy Lecture 1.pptx
 
lec01.ppt
lec01.pptlec01.ppt
lec01.ppt
 
Introduction to Text Mining
Introduction to Text MiningIntroduction to Text Mining
Introduction to Text Mining
 
Introduction to Object Oriented Programming
Introduction to Object Oriented ProgrammingIntroduction to Object Oriented Programming
Introduction to Object Oriented Programming
 
The Hitchhiker's Guide to Machine Learning with Python & Apache Spark
The Hitchhiker's Guide to Machine Learning with Python & Apache SparkThe Hitchhiker's Guide to Machine Learning with Python & Apache Spark
The Hitchhiker's Guide to Machine Learning with Python & Apache Spark
 
Searching of Web and Electronic Resources
Searching of Web and Electronic Resources Searching of Web and Electronic Resources
Searching of Web and Electronic Resources
 
Lec-1.pdf
Lec-1.pdfLec-1.pdf
Lec-1.pdf
 
Artificial Intelligent introduction or history
Artificial Intelligent introduction or historyArtificial Intelligent introduction or history
Artificial Intelligent introduction or history
 
Knowledge base system appl. p 1,2-ver1
Knowledge base system appl.  p 1,2-ver1Knowledge base system appl.  p 1,2-ver1
Knowledge base system appl. p 1,2-ver1
 
The Materials Project: Experiences from running a million computational scien...
The Materials Project: Experiences from running a million computational scien...The Materials Project: Experiences from running a million computational scien...
The Materials Project: Experiences from running a million computational scien...
 
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
 
Semantic engagement
Semantic engagementSemantic engagement
Semantic engagement
 
Data Mining Dissertations and Adventures and Experiences in the World of Chem...
Data Mining Dissertations and Adventures and Experiences in the World of Chem...Data Mining Dissertations and Adventures and Experiences in the World of Chem...
Data Mining Dissertations and Adventures and Experiences in the World of Chem...
 

Mais de Jack Park

Topic Maps: Romancing Conversation Topics
Topic Maps: Romancing Conversation TopicsTopic Maps: Romancing Conversation Topics
Topic Maps: Romancing Conversation TopicsJack Park
 
Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview
Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview
Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview Jack Park
 
Game-based Learning Landscape and Opportunities
Game-based Learning Landscape and OpportunitiesGame-based Learning Landscape and Opportunities
Game-based Learning Landscape and OpportunitiesJack Park
 
OpenSherlock - Hybrid Computing - Intelligence Augmentation
OpenSherlock - Hybrid Computing - Intelligence AugmentationOpenSherlock - Hybrid Computing - Intelligence Augmentation
OpenSherlock - Hybrid Computing - Intelligence AugmentationJack Park
 
Finding Meaning Across Boundaries
Finding Meaning Across BoundariesFinding Meaning Across Boundaries
Finding Meaning Across BoundariesJack Park
 
On Augmenting Patients
On Augmenting PatientsOn Augmenting Patients
On Augmenting PatientsJack Park
 
Open Source Cognitive Computing: Progress and Prospects
Open Source Cognitive Computing: Progress and ProspectsOpen Source Cognitive Computing: Progress and Prospects
Open Source Cognitive Computing: Progress and ProspectsJack Park
 
Knowledge Garden Overview
Knowledge Garden OverviewKnowledge Garden Overview
Knowledge Garden OverviewJack Park
 
Literature-based discovery: it's all about connecting dots in widely disparat...
Literature-based discovery: it's all about connecting dots in widely disparat...Literature-based discovery: it's all about connecting dots in widely disparat...
Literature-based discovery: it's all about connecting dots in widely disparat...Jack Park
 
Questing in the age of Complex Systems
Questing in the age of Complex SystemsQuesting in the age of Complex Systems
Questing in the age of Complex SystemsJack Park
 
FutureOfText2015
FutureOfText2015FutureOfText2015
FutureOfText2015Jack Park
 
Towards an EarthMoonshot with Cognitive Computing
Towards an EarthMoonshot with Cognitive ComputingTowards an EarthMoonshot with Cognitive Computing
Towards an EarthMoonshot with Cognitive ComputingJack Park
 
Towards Cognitive Agents for BigData Discovery
Towards Cognitive Agents for BigData DiscoveryTowards Cognitive Agents for BigData Discovery
Towards Cognitive Agents for BigData DiscoveryJack Park
 
Feedback Loops and Knowledge Gardens
Feedback Loops and Knowledge GardensFeedback Loops and Knowledge Gardens
Feedback Loops and Knowledge GardensJack Park
 
A Future for Education: Some Core Thoughts
A Future for Education: Some Core ThoughtsA Future for Education: Some Core Thoughts
A Future for Education: Some Core ThoughtsJack Park
 
Game-based Learning
Game-based LearningGame-based Learning
Game-based LearningJack Park
 
Learning Gardens: Storytelling Opportunities
Learning Gardens: Storytelling OpportunitiesLearning Gardens: Storytelling Opportunities
Learning Gardens: Storytelling OpportunitiesJack Park
 
Condos in a Politics Knowledge Garden
Condos in a Politics Knowledge GardenCondos in a Politics Knowledge Garden
Condos in a Politics Knowledge GardenJack Park
 

Mais de Jack Park (20)

Tm keynote
Tm keynoteTm keynote
Tm keynote
 
Topic Maps: Romancing Conversation Topics
Topic Maps: Romancing Conversation TopicsTopic Maps: Romancing Conversation Topics
Topic Maps: Romancing Conversation Topics
 
Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview
Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview
Augmented Claim Craft Ecosystem: HyperKnowledge- OpenSherlock Overview
 
Game-based Learning Landscape and Opportunities
Game-based Learning Landscape and OpportunitiesGame-based Learning Landscape and Opportunities
Game-based Learning Landscape and Opportunities
 
OpenSherlock - Hybrid Computing - Intelligence Augmentation
OpenSherlock - Hybrid Computing - Intelligence AugmentationOpenSherlock - Hybrid Computing - Intelligence Augmentation
OpenSherlock - Hybrid Computing - Intelligence Augmentation
 
Finding Meaning Across Boundaries
Finding Meaning Across BoundariesFinding Meaning Across Boundaries
Finding Meaning Across Boundaries
 
On Augmenting Patients
On Augmenting PatientsOn Augmenting Patients
On Augmenting Patients
 
Open Source Cognitive Computing: Progress and Prospects
Open Source Cognitive Computing: Progress and ProspectsOpen Source Cognitive Computing: Progress and Prospects
Open Source Cognitive Computing: Progress and Prospects
 
Knowledge Garden Overview
Knowledge Garden OverviewKnowledge Garden Overview
Knowledge Garden Overview
 
Literature-based discovery: it's all about connecting dots in widely disparat...
Literature-based discovery: it's all about connecting dots in widely disparat...Literature-based discovery: it's all about connecting dots in widely disparat...
Literature-based discovery: it's all about connecting dots in widely disparat...
 
Questing in the age of Complex Systems
Questing in the age of Complex SystemsQuesting in the age of Complex Systems
Questing in the age of Complex Systems
 
Why?
Why?Why?
Why?
 
FutureOfText2015
FutureOfText2015FutureOfText2015
FutureOfText2015
 
Towards an EarthMoonshot with Cognitive Computing
Towards an EarthMoonshot with Cognitive ComputingTowards an EarthMoonshot with Cognitive Computing
Towards an EarthMoonshot with Cognitive Computing
 
Towards Cognitive Agents for BigData Discovery
Towards Cognitive Agents for BigData DiscoveryTowards Cognitive Agents for BigData Discovery
Towards Cognitive Agents for BigData Discovery
 
Feedback Loops and Knowledge Gardens
Feedback Loops and Knowledge GardensFeedback Loops and Knowledge Gardens
Feedback Loops and Knowledge Gardens
 
A Future for Education: Some Core Thoughts
A Future for Education: Some Core ThoughtsA Future for Education: Some Core Thoughts
A Future for Education: Some Core Thoughts
 
Game-based Learning
Game-based LearningGame-based Learning
Game-based Learning
 
Learning Gardens: Storytelling Opportunities
Learning Gardens: Storytelling OpportunitiesLearning Gardens: Storytelling Opportunities
Learning Gardens: Storytelling Opportunities
 
Condos in a Politics Knowledge Garden
Condos in a Politics Knowledge GardenCondos in a Politics Knowledge Garden
Condos in a Politics Knowledge Garden
 

Último

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 

Último (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

HyperMembrane Structures for Open Source Cognitive Computing

  • 1. HyperMembrane Structures for Open Source Cognitive Computing Japanese Agency for Science and Technology Tokyo, Japan 3 March, 2015 Jack Park © 2015 TopicQuests Foundation CC by SA 4.0
  • 2. The Present Situation Upon this gifted age, in its dark hour, Rains from the sky a meteoric shower Of facts . . . they lie unquestioned, uncombined. Wisdom enough to leech us of our ill Is daily spun; but there exists no loom To weave it into fabric Edna St. Vincent Millay, 1939 2
  • 3. Topics To Cover • Discovery, learning, problem solving • Topic Maps • OpenSherlock • HyperMembranes • Open Source • Key reasons for building open source cognitive systems 3
  • 4. Cognitive Computing: My View • Cognitive Computing is: – Far less about what a computer knows – Far more about how computers can augment human cognitive capabilities – Based on the J.C.R Licklider and Douglas Engelbart augmentation work J.C.R. Licklider Douglas Engelbart 4Imgs: Wikipedia
  • 5. A Domain-specific Problem Statement • An Example: – Do these two sentences say the same thing? • CO2 is a causal factor in climate change. • Climate change is caused by carbon dioxide. • Problem Statement – Software agents need elegant methods for reading, representing, organizing, and modeling information resources to support discovery and answering questions. 5
  • 6. A Framing Thought • From [1] – The understanding of global brain organization and its large-scale integration remains a challenge for modern neurosciences. • To – The understanding of global conversations about topics that matter and their large-scale federation remain a challenge for modern information technology. [1] Petri G, Expert P, Turkheimer F, Carhart-Harris R, Nutt D, Hellyer PJ, Vaccarino F. (2014) Homological scaffolds of brain functional networks. J. R. Soc. Interface 11: 20140873. 6
  • 7. Our Goals • Improve Human-Tool Capabilities • Augment existing analytic methods – Increase opportunities for discovery – Improve already sophisticated methods • Build Looms – Read documents – Map and model topics read – Weave information fabrics Douglas Engelbart 7
  • 8. Discovery • Is it really possible for people to see everything? – Part of discovery is connecting dots not yet connected. – “Cognitive Agents” can help increase chances of serendipity. “Discovery consists of seeing what everybody has seen and thinking what nobody has thought.” –Albert Szent-Györgyi 8
  • 9. Related Work • Commercial – IBM Watson – Wolfram Alpha – Viv – Saffron 10 – Clueda – Siri – Google Now – Cortana – … • Open Source – OAQA – DeepDive – OpenCog – OpenNARS – Watsonsim – YodaQA – AKSW OpenQA – AKSW QA – AquaLog – OpenSherlock – OpenIRIS (CALO) – … • Research – Project Aristo – Project Halo – FREyA – CASIA – NLP-Reduce – EIS Sina – WDAqua ITM – Intui2 – … 9
  • 10. Biologically Inspired Design • Humans are blessed with: – Memory to keep concepts organized and connected – Internal mechanisms which map sensor data into memory for processing and storage – The abilities of complex, adaptive, anticipatory systems 10
  • 11. Memory: Introducing Topic Maps • A Topic Map is like a library without all the books* – A Topic Map is indexical • Like a card catalog – Each topic has its own representation • Improving on a card catalog, a topic can be identified many different ways • Captures metadata and optionally content – A Topic Map is relational • Like a good road map – Topics are connected by associations (relations) – Topics point to their occurrences in the territory – A Topic Map is organized • Multiple records on the same topic are co-located (stored as one topic) in the map *a map is not its territory 11
  • 12. TopicMap Structure •Topics as Actors •Topics as Relations •Topics as Types •Topics as Biographies 12
  • 13. Processing Mechanisms • Typically, software processes take the form of variants of NLP (natural language processing) – Parsers – Cluster analysis – Entity recognition – Relation detection – Role recognition – Probabilistic methods 13
  • 14. A Key Question in My Research • Can a Topic Map learn (construct itself) by “reading” literature? – Relevant issues: • Bootstrapping • Machine reading – NLP – Linguistics – Statistics – Analogy & Metaphor – … • Knowledge representation • Model building – Anticipation • Weaving information fabrics • Literature-based discovery • Deep Question Answering 14
  • 15. A Simple Example • Read this sentence: – Gene expression is caused by insoluble hormones binding to a plasma membrane hormone receptor • Topic Map recognizes: – Gene expression  GeneExpression – insoluble hormones  InsolubleHormone – plasma membrane hormone receptor  PlasmaMembraneReceptor • Software agents transform: – is caused by  Cause – binding to  Binds • Final semantic structure: • { {InsolubleHormone, Binds, PlasmaMembraneReceptor}, Cause, GeneExpression } 15
  • 16. Introducing OpenSherlock • OpenSherlock is: – A Topic Map for information resource identity and organization – A HyperMembrane information fabric structure – A society of agents system which can • Read documents • Process information resources – Maintain the topic map – Maintain the HyperMembrane – Build and maintain models – Perform discovery tasks – Answer questions – Agents are coordinated by: • A blackboard system • A dynamic task-based agenda • Event propagation and handling 16
  • 17. Observations 1 • A Topic Map is central to the key question, and therefore to a thesis entailed by this research – It serves as a kind of memory for social processes – It provides a robust platform for subject identity – It can also serve as a repository for domain- specific vocabularies (ontologies, taxonomies, naming conventions,…) 17
  • 18. Observations 2 • A Topic Map is necessary but not sufficient to support discovery, learning, or problem solving – It really only provides a powerful indexical structure related to the key artifacts in any universe of discourse: • Actors • Their relations • Their states • Rules, laws, theories,… • To model those key artifacts, other representation strategies are required – Conceptual Graphs – Qualitative Process Theory – Belief Networks – … 18
  • 19. A Research Question • What processes are available which, if performed while harvesting (reading) documents, can reduce the amount of processing required later during question answering? – The question entails • Synthesis of ontology • Co-reference resolution • Re-representation during question lifting • … 19
  • 20. A Working Hypothesis • Process – Build and maintain a content-addressable memory of questions, claims, arguments, and evidence fields. • We call that a HyperMembrane – Note: • Every text object passed into the system is processed by the same algorithms – Sentences harvested from text – Questions and responses posed by humans 20
  • 21. Key Concept: HyperMembrane • HyperMembrane is a key concept in the working hypothesis that OpenSherlock seeks to explore and demonstrate – A growing graph as a collection of woven and intersecting fabrics • constructed from normalized tuples (n-tuples) which are designed to reduce the amount of NLP required to read documents • such that intersections of fabrics occur where named entities in the graph of n-tuples are the same – Inspired by Ted Nelson’s ZigZag Architecture 21
  • 22. Machine Reading in OpenSherlock • Goals: – Grow the topic map • Topic Map then serves to support fabrication of higher-order knowledge structures – Conceptual Graphs – Belief Networks – QP Theory Models – HyperMembrane – … • Process Loop: – For a given document • For every paragraph in that document – For every sentence in each paragraph » Read the sentence 22
  • 23. Sentence Reading • First Step: – Process sentence into word grams* • Second Step: – Where possible • Transform word grams into n-tuples** • n-tuples form the HyperMembrane * A container of words, from 1 to 8 words per container ** A container of symbols based on words in word grams 23
  • 24. Process Sentence into WordGrams • Approach – Break sentence into word grams* • WordGram objects are shared across sentences – Count of sentence identifiers associated with each object serves as basis for probabilistic models – Either • TopicMap recognizes terms – Or • Sentence is parsed by Link-Grammar Parser** • TopicMap learns from parse results *http://en.wikipedia.org/wiki/W-shingling **http://www.link.cs.cmu.edu/link/ 24
  • 25. Transform WordGrams to N-Tuples • Normalized tuple (N-Tuple) – A structure where the subject, predicate, and object are normalized • Nouns and verbs transformed – CO2, Carbon Dioxide, …  CO2 – causes, is caused by, …  cause • Two sentence example – CO2 is a cause of climate change. – Climate change is caused by carbon dioxide. – Result: » { CO2, cause, climate change } – Normalization processes include general and domain specific lenses • Rule-based interpreters which detect structures – Taxonomy – Causality – Biomedical – Geophysical – … • Process models – Built and maintained while reading – Predict while reading – Anticipatory Reading 25
  • 26. About N-Tuples • An N-Tuple is a structured record of – Topics in the topic map – Those topics are harvested from text • An N-Tuple takes the form: – { Subject, Predicate, Object } – Where • Subject and/or Object can be one of: – A topic from the topic map – Another N-Tuple • An N-Tuple is identified by the identities of the terms it contains – When thinking in terms of terms (words) read from documents, the identities (numeric representations) of those terms form the identity of the N-Tuple object. • N-Tuples are content addressable • Disambiguation of subjects is a topic mapping process – Learning means continuous refinement of subject identity – Ambiguities can also be solved through human intervention 26
  • 27. N-Tuples as HyperMembrane Tuples {A, Bind, B} {{A, Bind, B}, Cause, X} {X, Bind, D} {{X, Bind, D}, Cause, Y} 27 A B Bind X Y D Cause Bind Cause
  • 28. Current State of OpenSherlock ElasticSearch Titan or Blazegraph Ontology Importer Ontologies PubMed Reader PubMed Abstracts HyperMembrane Engine TellAsk UMLS Importer UMLS 28
  • 29. Observations 3 • HyperMembrane is a reminding system – HyperMembrane is a record of federated human conversation • Harvested from books, papers, and recorded conversation • Includes statistical properties of recorded utterances – HyperMembrane records: • That which is common • That which is novel – Possibly wrong – Possibly game changing 29
  • 30. TellAsk Interface Conversation Tree User can click a node to select as parent for any user response Response Type Selectors. Selection required before response. User types here Linear conversation flow Entry Forms Selector List Map starts a new conversation with entered topic 30
  • 31. The Open Source Stack • Persistence – ElasticSearch – Considering Titan – Considering Blazegraph (Bigdata™ RDF Store) • Libraries – Many from Apache Foundation and others – LinkGrammarParser (Java version) – XML PullParser – Simple JSON Parser • Tools – Eclipse 31
  • 33. Current State of Development • Aim to answer simple questions about casuality – Current focus on biomedical domain – Current focus on two lenses • Taxonomy • Casuality – No Conceptual Graphs – No Process Models – No Probabilistic Models 33
  • 34. Future Work • Aim to complete an anticipatory system – Process models for anticipation – Conceptual graphs – Probabilistic models – More lenses • Pluggable lenses • Adaptive lenses – More domains 34
  • 35. Why Do This? • Augment human capabilities in problem solving • Participate in Open Science 35
  • 36. Augmenting Social Sensemaking 1 2 3 Creating Ideas Refining Connections Connecting Ideas Cancer patient 36
  • 37. Participate in Open Science 37
  • 38. Key Context for Open Science • A planet-wide, collaborative quest for Global Thrivability*. – Issues include • Sociological events – Health, epidemics, wars,… • Geophysical events – Climate change, earthquakes, volcanoes, … • Astrophysical events – Asteroids, our Sun. … * Let’s call the quest: EarthMoonshot 38
  • 39. Completed Representation antioxidants kill free radicals Contraindicates macrophages use free radicals to kill bacteria Bacterial Infection Antioxidants Because Appropriate For Compromised Host Let us co-create Cognitive Agents for Discovery jackpark@topicquests.org OpenSherlock documents at: http://debategraph.org/OpenSherlock Code emerging at: https://github.com/opensherlock/ Slides online at http://slideshare.net/jackpark/ Acknowledgments: Bob Gleichauf David Alexander Price Arun Majumdar Robert S. Stephenson Mark Szpakowski Martin Radley Sherry Jones Alexander Wenzowski Ted Kahn Patrick Durusau 39