SlideShare uma empresa Scribd logo
1 de 32
Baixar para ler offline
Efficiency is the New Precision
Semantic Supercomputing in the Zettabyte age
Francisco De Sousa Webber

Co-Founder & CEO

f.webber@cortical.io
2
Big Bang: Data Explosion
Transactional Data
Human Files
Social Interactions
M
a
c
h
i
n
e
G
e
n
e
r
a
t
e
d
D
a
t
a
(
I
o
T
)
Terabyte
Petabyte
Exabyte
Zettabyte
Mainframe/Mini Era PC/Client Era Internet Era Virtualisation Era
2025
Data
Volume
3
Current Status
ML & AI is the Answer … Is it?
Productivity Decreases
(Text) Content Increases
Current Von Neumann Computer Platform
Performance Stagnates
4
Technical Perspective
•Von Neumann Computing

•Statistical ML

•Semantic Folding

•Semantic Computing
5
Von Neumann Computing
Computing Unit
Processor
Memory
Adress Logic
Arithmetic Unit
Control Unit
I / O
Data
Address
World
6
Von Neumann Computing Limitations
Computing Unit
Processor
Memory
Adress Logic
Arithmetic Unit
Control Unit
I / O World
Address
Bottleneck
1000+
Instructions
Sequencial
Access
Every instruction
passes over Bus
7
Statistical Machine Learning
Machine
Learning

Use Case Data
Annotated
data
Training Data
Training
Use Case
Model
Training
Engine
Inference
Engine
Data In-Flow
Data

Out-Flow
Test Data
8
Statistical Machine Learning Limitations
Machine
Learning

Use Case Data
Annotated
data
Training Data
Training
Use Case
Model
Training
Engine
Inference
Engine
Data In-Flow
Data

Out-Flow
Test Data
Insufficient
Data
Slow
Training
Model
Imprecision
Inference
Latency
Manual
Effort
9
Statistic AI & ML Problem: Efficiency
Need to improve the Principle not just increase the Computational Power
Findability Issue

Von Neumann Gap

Exponential Power Consumption 

Million Model Multiverse
Both are cars, for people, using “water gas” - the difference between them is efficiency ….
Initial Principle - Steam Latest Principle - Hydrogen Fuel Cell
10
Statistical Modelling: Findability Issue
The Google field of view
The actual Internet
The Google “virtual view” of the internet
The “blind spot”
user
internet is growing
visible internet is
growing slowly
invisible internet
is growing fast
The number of new pages
grows faster than the
number of keywords
pointing at them.
11
Statistical Modelling: Von Neumann Gap
1980 1990 2000 2010 2020
Processing Speed
Data Amount
GAP
Current computing paradigm is
insufficient for the growing data load:
• Increased Error Rates 

• Increased Power Consumption

• Increased Processing Delays
12
Statistical Modelling: Exponential Energy Need
0%
4%
8%
2018
2030
Current Global Energy Consumption
of Computing Devices

equals that of Global Air Transport
In 2030 Global Energy
Consumption of Computing
Devices will reach that of Global
Automobile Transportation
13
Statistical Modelling: Million Model Multiverse
Individually labeled data for
supervised learning
Local

Use-Case
Local
Statistical
Model
Local
Training
Data
Local
Gold
Standard
Individually trained model
Individually collected and
prepared training data
No network effects
14
Statistical Modelling: Technology Impact
• Findability Issue: Fake News

• Von Neumann Gap: Climate Change

• Stuck in the Average: Innovation Gap

• Phased ML-User Profiles: Populism
When its hard to find
information its also hard to find
the truth
Statistics Averages: Innovation
is not made by Majorities
Green Computing is beyond
the Von Neumann Gap
Statistical ML Models
facilitate Opinion Meddling
15
The Solution: Semantic Folding
Based on recent findings in Neuroscience

Implemented as Unsupervised Machine Learning approach

Replaces complex statistical modelling with Analogical Computation
16
Semantic Folding: Analogical Computation
“signed contract” Overlap 36% ”done deal”
“star trek” Overlap 1%
Similar

Meanings
Different

Meanings
Context:

Bank,
Account,
Holder,
Payment, 

Tax, 

In-house,
Manager
”done deal”
17
Semantic Fingerprinting
Training of the Semantic Space
Reference Material
Semantic Word Fingerprint Dictionary
Converting Text into Semantic Fingerprints
Use Case Data Semantic Text Fingerprint
Comparing Semantic Fingerprints
18
Level 1: Word Fingerprints
organ
Fingerprint Generation
“organ”
Context 3:

church,

altar,
baroque,
architecture,
renaissance
Context 1:

liver,

heart,

muscle, 

endothelia,
body,
anatomy
Context 2b:

piano,

guitar,
trombone,
flute,
trumpet,
quartet,
music
Contexts 2a:

composer,
baroque,
music,

score,
Johann
Sebastian
Bach
19
Level 2: Text Fingerprints
organs and pianos are musical instruments
organs and pianos are musical
instruments
Aggregation + Sparsification
1 2
3
4
20
Many Languages - One Semantic Fingerprint
Concepts & their Representations are Stable Across Languages
philosophy
EN
philosophie
FR
filosofía
ES
философия
RU
‫فلسفة‬
AR
哲學
ZH
21
Example document Most similar documents
Ordered along the users
information need
query result set ranking
Similarity Engine
document index
NLU Primitive 1: Semantic Search
22
Email 12
Semantic filter FP
Positive class
Negative class
Semantic 

space 

trained for
Compliance
SH1_email

AND
SH2_email

AND

SH3_email
Email 189 Email 2443
Email 12 Email 189 Email 2443
NLU Primitive 2: Semantic Classification
23
Socrates 470/469 – 399 BC was a classical Greek (Athenian)
philosopher credited as one of the founders of Western
philosophy. He is an enigmatic figure known chiefly through the
accounts of classical writers, especially the writings of his students
Plato and Xenophon and the plays of his contemporary
Aristophanes. Plato's dialogues are among the most
comprehensive accounts of Socrates to survive from antiquity,
though it is unclear the degree to which Socrates himself is hidden
behind his best disciple, Plato.
Aggregation
[

“plato",

“socrates",

“philosopher",

“aristophanes",

“antiquity",

“writings",

“xenophon",

“dialogues",

“disciple",

“philosophy"

]
Text Fingerprint
Maximize for Similarity
NLU Primitive 3: Keyword Extraction Extract Keywords
Word Fingerprints
24
There are a number of remedies for
snoring, but few are proven clinically
effective. Popular treatments include:
Mechanical devices. Many splints,
braces, and other devices are
available which reposition the nose,
jaw, and/or mouth in order to clear the
airways.
Nasal strips that attach like an
adhesive bandage to the bridge of the
nose are available at most drugstores,
and can help stop snoring in some
individualss. Continuous positive
airway pressure. Several surgical
procedures are available for treat? ing
chronic snoring.
Snoring usually worsens when an
individual sleeps on his or her back, so
sleeping on ones side may alleviate
the problem. Those who have difficulty
staying in a side sleeping position may
find sleeping with pillows behind them
helps them maintain the position
longer.
Retina Engine SVM
Random Forrest
DL Network
Algorithm 1
Algorithm n
• Classification

• Clustering

• Prediction

• Generating

• Computing

• Analyzing
Semantic Folding based Machine Learning
25
Semantic Engine Semantic Search Semantic Annotation
Document

Classification/Clustering
Keyword Extraction
Context Term Generation
Information Discovery
Expert Finding
Text Analytics
Risk Analysis
Business Intelligence
Lease/Credit

Agreements
Cortical.io Engines
26
Hardware Acceleration for Semantic Folding
Match one Query Fingerprint against
an unlimited number of Document
Fingerprints
Match one Filter Fingerprint against a
stream of incoming Fingerprints
Enterprise Search

Discovery Search

Web Search

Social Media Profile Search
Desktop: Each Board searches up to 1 Billion Fingerprints per second
Enterprise: Each Server searches up to 10 Billion Fingerprints per second
Web Scale: Each Rack searches up to 100 Billion Fingerprints per second
Real-time Document Classification

Email Filtering - Routing

DeepPacket Inspection

Social Media Topic Detection
27
Semantic Super Computing Platform
Retina Engine
Converter
Module
Similar Term 

Module
Context
Module
Compare
Module
Retina Database
Retina Search
Document Index
Fingerprint
Matcher
Search 

Re-ranker
Retina Filter
Filter Bank
Fingerprint
Matcher
Filter

Re-ranker
[
Xilinx
Alveo
Host System
Storage
CPU-cores Memory
X86
Server
Application Server
Retina System
Administration App
Email-Filter App Semantic-Search App Next App
Integration Layer
Identity Access
Management
SMTP
Connector
RDB
Connector
CMS
Connector
DMS
Connector
File Service

Connector
Email-Filter API Semantic-Search API Next Application API
Web Service

Connector
BPM
Connector Management API

&

Monitoring API
28
Comparing the Leading NLU Approaches
"The Enron Email
Corpus Archived
2011-03-08 at the
Wayback Machine"
Retrieved March 5, 2011.
Retina Engine (CPU) Retina Engine (FPGA)
Pure Keyword Baseline (CPU)
FastText (CPU)
Doc2Vec (CPU)
Word2Vec (CPU)
BERT (GPU)
BERT (CPU)
70%
75%
80%
85%
90%
0% 0% 1% 10% 100% 1000%
Precision
Speed ---> faster (logarithmic)
Classification Enron email dataset: Farmer Set
PyTorch

(bert-base-uncased)
PyTorch

(bert-base-uncased)

AWS g3.8xlarge EC2
Scikit-learn 

TfidfVectorizer
official 

pre-trained model

Facebook
Gensim 

implementation
Pre-trained 

Google Model
1 x Xilinx

Alveo 250
+
29
• Banking:

• E-mail & Chat Compliance Monitoring

• Credit Risk Analysis

• CRM:

• Customer Intent Analysis

• Legal:

• Contract Intelligence

• Regulatory Process Optimization

• Financial Services:

• Investment Signal Extraction from News
Streams

• Life Sciences:

• Information Discovery

• Media:

• Viewer Stream Analytics
• Automotive:

• Handbook Search

• Car Supplier Management

• Consolidation of Car Terminology

• Technical Support:

• Support Intelligence

• Social Media:

• Organic Topic Mining

• Commerce:

• Catalogue Management & Automation

• Human Resources:

• Job Description - Resume Matching
Demonstrated Semantic Folding Use Cases
30
Simplicity
One Algorithm, One Operator, One Data Format
Compositionality
Words, Sentences, Paragraphs, Documents
Analogy
Normalized Representation, Bitwise Similarity
Modelability
Unsupervised Semantic Model Generation
Efficiency
Small Amounts of Reference Data
Scaleability
One Semantic Model Many Use-Cases
Replicability
Same Use-Case in New Domain
Inspectability
Refinement, Debugging, Verification
Robustness
“Graceful Failing”
NLU by Semantic Folding
info@cortical.io
Global Data Sphere (Zettabyte)
Transactional Data
Machine Generated Data
Human Generated Data
Social Media Data
ML-Data
Text-ML-Data
Semantic Folding Potential Market
L
O
G
D
a
t
a
S
e
n
s
o
r
D
a
t
a
T
e
x
t
D
a
t
a
T
e
x
t
D
a
t
a
Market Potential - Semantic Folding
info@cortical.io

Mais conteúdo relacionado

Mais procurados

IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...
IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...
IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...Dr. Haxel Consult
 
II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...
II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...
II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...Dr. Haxel Consult
 
AI is Not Magic: It’s Time to Demystify and Apply Srinivasan Parthiban (VINGY...
AI is Not Magic: It’s Time to Demystify and Apply Srinivasan Parthiban (VINGY...AI is Not Magic: It’s Time to Demystify and Apply Srinivasan Parthiban (VINGY...
AI is Not Magic: It’s Time to Demystify and Apply Srinivasan Parthiban (VINGY...Dr. Haxel Consult
 
AI-SDV 2020: AI-augmented Question Answering and Semantic Search for Life Sci...
AI-SDV 2020: AI-augmented Question Answering and Semantic Search for Life Sci...AI-SDV 2020: AI-augmented Question Answering and Semantic Search for Life Sci...
AI-SDV 2020: AI-augmented Question Answering and Semantic Search for Life Sci...Dr. Haxel Consult
 
II-SDV 2012 Towards Unified Access Systems for Data Exploration
II-SDV 2012 Towards Unified Access Systems for Data ExplorationII-SDV 2012 Towards Unified Access Systems for Data Exploration
II-SDV 2012 Towards Unified Access Systems for Data ExplorationDr. Haxel Consult
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceDr. Haxel Consult
 
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...Edureka!
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceANOOP V S
 
Data Science Application in Business Portfolio & Risk Management
Data Science Application in Business Portfolio & Risk ManagementData Science Application in Business Portfolio & Risk Management
Data Science Application in Business Portfolio & Risk ManagementData Science Thailand
 
Self Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docxSelf Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docxShanmugasundaram M
 
Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data ScienceSpotle.ai
 
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...Dr. Haxel Consult
 
How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace Mohamadreza Mohtat
 
Python for Data Science - TDC 2015
Python for Data Science - TDC 2015Python for Data Science - TDC 2015
Python for Data Science - TDC 2015Gabriel Moreira
 
A Pragmatic AI Maturity Model
A Pragmatic AI Maturity ModelA Pragmatic AI Maturity Model
A Pragmatic AI Maturity ModelDATAVERSITY
 
II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...
II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...
II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...Dr. Haxel Consult
 
PROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked DataPROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked DataSemantic Web Company
 
II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...
II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...
II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...Dr. Haxel Consult
 

Mais procurados (20)

IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...
IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...
IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...
 
II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...
II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...
II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...
 
AI is Not Magic: It’s Time to Demystify and Apply Srinivasan Parthiban (VINGY...
AI is Not Magic: It’s Time to Demystify and Apply Srinivasan Parthiban (VINGY...AI is Not Magic: It’s Time to Demystify and Apply Srinivasan Parthiban (VINGY...
AI is Not Magic: It’s Time to Demystify and Apply Srinivasan Parthiban (VINGY...
 
AI-SDV 2020: AI-augmented Question Answering and Semantic Search for Life Sci...
AI-SDV 2020: AI-augmented Question Answering and Semantic Search for Life Sci...AI-SDV 2020: AI-augmented Question Answering and Semantic Search for Life Sci...
AI-SDV 2020: AI-augmented Question Answering and Semantic Search for Life Sci...
 
II-SDV 2012 Towards Unified Access Systems for Data Exploration
II-SDV 2012 Towards Unified Access Systems for Data ExplorationII-SDV 2012 Towards Unified Access Systems for Data Exploration
II-SDV 2012 Towards Unified Access Systems for Data Exploration
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in Nice
 
II-DV 2017: Averbis
II-DV 2017: AverbisII-DV 2017: Averbis
II-DV 2017: Averbis
 
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data Science Application in Business Portfolio & Risk Management
Data Science Application in Business Portfolio & Risk ManagementData Science Application in Business Portfolio & Risk Management
Data Science Application in Business Portfolio & Risk Management
 
Data Science in Action
Data Science in ActionData Science in Action
Data Science in Action
 
Self Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docxSelf Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docx
 
Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data Science
 
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
 
How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace
 
Python for Data Science - TDC 2015
Python for Data Science - TDC 2015Python for Data Science - TDC 2015
Python for Data Science - TDC 2015
 
A Pragmatic AI Maturity Model
A Pragmatic AI Maturity ModelA Pragmatic AI Maturity Model
A Pragmatic AI Maturity Model
 
II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...
II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...
II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...
 
PROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked DataPROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked Data
 
II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...
II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...
II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...
 

Semelhante a AI-SDV 2021: Francisco Webber - Efficiency is the New Precision

Data Science versus Artificial Intelligence: a useful distinction
Data Science versus Artificial Intelligence: a useful distinctionData Science versus Artificial Intelligence: a useful distinction
Data Science versus Artificial Intelligence: a useful distinctionChristoforos Anagnostopoulos
 
A sentient network - How High-velocity Data and Machine Learning will Shape t...
A sentient network - How High-velocity Data and Machine Learning will Shape t...A sentient network - How High-velocity Data and Machine Learning will Shape t...
A sentient network - How High-velocity Data and Machine Learning will Shape t...Wenjing Chu
 
Financial Network Analytics @ NetONets
Financial Network Analytics @ NetONetsFinancial Network Analytics @ NetONets
Financial Network Analytics @ NetONetsKimmo Soramaki
 
dell_ml_rm.ppt
dell_ml_rm.pptdell_ml_rm.ppt
dell_ml_rm.pptbutest
 
dell_ml_rm.ppt
dell_ml_rm.pptdell_ml_rm.ppt
dell_ml_rm.pptbutest
 
Big Data to SMART Data : Process Scenario
Big Data to SMART Data : Process ScenarioBig Data to SMART Data : Process Scenario
Big Data to SMART Data : Process ScenarioCHAKER ALLAOUI
 
Ingredients for Semantic Sensor Networks
Ingredients for Semantic Sensor NetworksIngredients for Semantic Sensor Networks
Ingredients for Semantic Sensor NetworksOscar Corcho
 
Future platform for internet of things
Future platform for internet of thingsFuture platform for internet of things
Future platform for internet of thingsColdbeans Software
 
ICPSR - Complex Systems Models in the Social Sciences - Lecture 6 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 6 - Professor...ICPSR - Complex Systems Models in the Social Sciences - Lecture 6 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 6 - Professor...Daniel Katz
 
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)Nicolas Kourtellis
 
Big Data : Risks and Opportunities
Big Data : Risks and OpportunitiesBig Data : Risks and Opportunities
Big Data : Risks and OpportunitiesKenny Huang Ph.D.
 
Gartner: Top 10 Technology Trends 2015
Gartner: Top 10 Technology Trends 2015Gartner: Top 10 Technology Trends 2015
Gartner: Top 10 Technology Trends 2015Den Reymer
 
"Why Fake News Is Relevant" - Introduction to the Userfeeds Protocol
"Why Fake News Is Relevant" - Introduction to the Userfeeds Protocol"Why Fake News Is Relevant" - Introduction to the Userfeeds Protocol
"Why Fake News Is Relevant" - Introduction to the Userfeeds ProtocolUserfeeds.io
 
Information Retrieval
Information Retrieval Information Retrieval
Information Retrieval ShujaatZaheer3
 
The Nature of Information
The Nature of InformationThe Nature of Information
The Nature of InformationAdrian Paschke
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273Abutest
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273Abutest
 
Synergy of Human and Artificial Intelligence in Software Engineering
Synergy of Human and Artificial Intelligence in Software EngineeringSynergy of Human and Artificial Intelligence in Software Engineering
Synergy of Human and Artificial Intelligence in Software EngineeringTao Xie
 

Semelhante a AI-SDV 2021: Francisco Webber - Efficiency is the New Precision (20)

Data Science versus Artificial Intelligence: a useful distinction
Data Science versus Artificial Intelligence: a useful distinctionData Science versus Artificial Intelligence: a useful distinction
Data Science versus Artificial Intelligence: a useful distinction
 
Reproducible Science and Deep Software Variability
Reproducible Science and Deep Software VariabilityReproducible Science and Deep Software Variability
Reproducible Science and Deep Software Variability
 
A sentient network - How High-velocity Data and Machine Learning will Shape t...
A sentient network - How High-velocity Data and Machine Learning will Shape t...A sentient network - How High-velocity Data and Machine Learning will Shape t...
A sentient network - How High-velocity Data and Machine Learning will Shape t...
 
Financial Network Analytics @ NetONets
Financial Network Analytics @ NetONetsFinancial Network Analytics @ NetONets
Financial Network Analytics @ NetONets
 
dell_ml_rm.ppt
dell_ml_rm.pptdell_ml_rm.ppt
dell_ml_rm.ppt
 
dell_ml_rm.ppt
dell_ml_rm.pptdell_ml_rm.ppt
dell_ml_rm.ppt
 
Big Data to SMART Data : Process Scenario
Big Data to SMART Data : Process ScenarioBig Data to SMART Data : Process Scenario
Big Data to SMART Data : Process Scenario
 
Ingredients for Semantic Sensor Networks
Ingredients for Semantic Sensor NetworksIngredients for Semantic Sensor Networks
Ingredients for Semantic Sensor Networks
 
Future platform for internet of things
Future platform for internet of thingsFuture platform for internet of things
Future platform for internet of things
 
ICPSR - Complex Systems Models in the Social Sciences - Lecture 6 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 6 - Professor...ICPSR - Complex Systems Models in the Social Sciences - Lecture 6 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 6 - Professor...
 
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)
SAMOA: A Platform for Mining Big Data Streams (Apache BigData Europe 2015)
 
Big Data : Risks and Opportunities
Big Data : Risks and OpportunitiesBig Data : Risks and Opportunities
Big Data : Risks and Opportunities
 
Gartner: Top 10 Technology Trends 2015
Gartner: Top 10 Technology Trends 2015Gartner: Top 10 Technology Trends 2015
Gartner: Top 10 Technology Trends 2015
 
"Why Fake News Is Relevant" - Introduction to the Userfeeds Protocol
"Why Fake News Is Relevant" - Introduction to the Userfeeds Protocol"Why Fake News Is Relevant" - Introduction to the Userfeeds Protocol
"Why Fake News Is Relevant" - Introduction to the Userfeeds Protocol
 
Information Retrieval
Information Retrieval Information Retrieval
Information Retrieval
 
A Empresa na Era da Informação Extrema
A Empresa na Era da Informação ExtremaA Empresa na Era da Informação Extrema
A Empresa na Era da Informação Extrema
 
The Nature of Information
The Nature of InformationThe Nature of Information
The Nature of Information
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
 
Synergy of Human and Artificial Intelligence in Software Engineering
Synergy of Human and Artificial Intelligence in Software EngineeringSynergy of Human and Artificial Intelligence in Software Engineering
Synergy of Human and Artificial Intelligence in Software Engineering
 

Mais de Dr. Haxel Consult

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementDr. Haxel Consult
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...Dr. Haxel Consult
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...Dr. Haxel Consult
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...Dr. Haxel Consult
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...Dr. Haxel Consult
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...Dr. Haxel Consult
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...Dr. Haxel Consult
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...Dr. Haxel Consult
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...Dr. Haxel Consult
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...Dr. Haxel Consult
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...Dr. Haxel Consult
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...Dr. Haxel Consult
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...Dr. Haxel Consult
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterDr. Haxel Consult
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCDr. Haxel Consult
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...Dr. Haxel Consult
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...Dr. Haxel Consult
 

Mais de Dr. Haxel Consult (20)

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance Center
 
AI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IPAI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IP
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOC
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
 

AI-SDV 2021: Francisco Webber - Efficiency is the New Precision

  • 1. Efficiency is the New Precision Semantic Supercomputing in the Zettabyte age Francisco De Sousa Webber Co-Founder & CEO f.webber@cortical.io
  • 2. 2 Big Bang: Data Explosion Transactional Data Human Files Social Interactions M a c h i n e G e n e r a t e d D a t a ( I o T ) Terabyte Petabyte Exabyte Zettabyte Mainframe/Mini Era PC/Client Era Internet Era Virtualisation Era 2025 Data Volume
  • 3. 3 Current Status ML & AI is the Answer … Is it? Productivity Decreases (Text) Content Increases Current Von Neumann Computer Platform Performance Stagnates
  • 4. 4 Technical Perspective •Von Neumann Computing •Statistical ML •Semantic Folding •Semantic Computing
  • 5. 5 Von Neumann Computing Computing Unit Processor Memory Adress Logic Arithmetic Unit Control Unit I / O Data Address World
  • 6. 6 Von Neumann Computing Limitations Computing Unit Processor Memory Adress Logic Arithmetic Unit Control Unit I / O World Address Bottleneck 1000+ Instructions Sequencial Access Every instruction passes over Bus
  • 7. 7 Statistical Machine Learning Machine Learning
 Use Case Data Annotated data Training Data Training Use Case Model Training Engine Inference Engine Data In-Flow Data
 Out-Flow Test Data
  • 8. 8 Statistical Machine Learning Limitations Machine Learning
 Use Case Data Annotated data Training Data Training Use Case Model Training Engine Inference Engine Data In-Flow Data
 Out-Flow Test Data Insufficient Data Slow Training Model Imprecision Inference Latency Manual Effort
  • 9. 9 Statistic AI & ML Problem: Efficiency Need to improve the Principle not just increase the Computational Power Findability Issue Von Neumann Gap Exponential Power Consumption Million Model Multiverse Both are cars, for people, using “water gas” - the difference between them is efficiency …. Initial Principle - Steam Latest Principle - Hydrogen Fuel Cell
  • 10. 10 Statistical Modelling: Findability Issue The Google field of view The actual Internet The Google “virtual view” of the internet The “blind spot” user internet is growing visible internet is growing slowly invisible internet is growing fast The number of new pages grows faster than the number of keywords pointing at them.
  • 11. 11 Statistical Modelling: Von Neumann Gap 1980 1990 2000 2010 2020 Processing Speed Data Amount GAP Current computing paradigm is insufficient for the growing data load: • Increased Error Rates • Increased Power Consumption • Increased Processing Delays
  • 12. 12 Statistical Modelling: Exponential Energy Need 0% 4% 8% 2018 2030 Current Global Energy Consumption of Computing Devices
 equals that of Global Air Transport In 2030 Global Energy Consumption of Computing Devices will reach that of Global Automobile Transportation
  • 13. 13 Statistical Modelling: Million Model Multiverse Individually labeled data for supervised learning Local Use-Case Local Statistical Model Local Training Data Local Gold Standard Individually trained model Individually collected and prepared training data No network effects
  • 14. 14 Statistical Modelling: Technology Impact • Findability Issue: Fake News • Von Neumann Gap: Climate Change • Stuck in the Average: Innovation Gap • Phased ML-User Profiles: Populism When its hard to find information its also hard to find the truth Statistics Averages: Innovation is not made by Majorities Green Computing is beyond the Von Neumann Gap Statistical ML Models facilitate Opinion Meddling
  • 15. 15 The Solution: Semantic Folding Based on recent findings in Neuroscience
 Implemented as Unsupervised Machine Learning approach Replaces complex statistical modelling with Analogical Computation
  • 16. 16 Semantic Folding: Analogical Computation “signed contract” Overlap 36% ”done deal” “star trek” Overlap 1% Similar Meanings Different Meanings Context: Bank, Account, Holder, Payment, 
 Tax, 
 In-house, Manager ”done deal”
  • 17. 17 Semantic Fingerprinting Training of the Semantic Space Reference Material Semantic Word Fingerprint Dictionary Converting Text into Semantic Fingerprints Use Case Data Semantic Text Fingerprint Comparing Semantic Fingerprints
  • 18. 18 Level 1: Word Fingerprints organ Fingerprint Generation “organ” Context 3: church,
 altar, baroque, architecture, renaissance Context 1: liver,
 heart,
 muscle, 
 endothelia, body, anatomy Context 2b: piano,
 guitar, trombone, flute, trumpet, quartet, music Contexts 2a:
 composer, baroque, music,
 score, Johann Sebastian Bach
  • 19. 19 Level 2: Text Fingerprints organs and pianos are musical instruments organs and pianos are musical instruments Aggregation + Sparsification 1 2 3 4
  • 20. 20 Many Languages - One Semantic Fingerprint Concepts & their Representations are Stable Across Languages philosophy EN philosophie FR filosofía ES философия RU ‫فلسفة‬ AR 哲學 ZH
  • 21. 21 Example document Most similar documents Ordered along the users information need query result set ranking Similarity Engine document index NLU Primitive 1: Semantic Search
  • 22. 22 Email 12 Semantic filter FP Positive class Negative class Semantic space trained for Compliance SH1_email AND SH2_email AND SH3_email Email 189 Email 2443 Email 12 Email 189 Email 2443 NLU Primitive 2: Semantic Classification
  • 23. 23 Socrates 470/469 – 399 BC was a classical Greek (Athenian) philosopher credited as one of the founders of Western philosophy. He is an enigmatic figure known chiefly through the accounts of classical writers, especially the writings of his students Plato and Xenophon and the plays of his contemporary Aristophanes. Plato's dialogues are among the most comprehensive accounts of Socrates to survive from antiquity, though it is unclear the degree to which Socrates himself is hidden behind his best disciple, Plato. Aggregation [ “plato", “socrates", “philosopher", “aristophanes", “antiquity", “writings", “xenophon", “dialogues", “disciple", “philosophy" ] Text Fingerprint Maximize for Similarity NLU Primitive 3: Keyword Extraction Extract Keywords Word Fingerprints
  • 24. 24 There are a number of remedies for snoring, but few are proven clinically effective. Popular treatments include: Mechanical devices. Many splints, braces, and other devices are available which reposition the nose, jaw, and/or mouth in order to clear the airways. Nasal strips that attach like an adhesive bandage to the bridge of the nose are available at most drugstores, and can help stop snoring in some individualss. Continuous positive airway pressure. Several surgical procedures are available for treat? ing chronic snoring. Snoring usually worsens when an individual sleeps on his or her back, so sleeping on ones side may alleviate the problem. Those who have difficulty staying in a side sleeping position may find sleeping with pillows behind them helps them maintain the position longer. Retina Engine SVM Random Forrest DL Network Algorithm 1 Algorithm n • Classification • Clustering • Prediction • Generating • Computing • Analyzing Semantic Folding based Machine Learning
  • 25. 25 Semantic Engine Semantic Search Semantic Annotation Document
 Classification/Clustering Keyword Extraction Context Term Generation Information Discovery Expert Finding Text Analytics Risk Analysis Business Intelligence Lease/Credit
 Agreements Cortical.io Engines
  • 26. 26 Hardware Acceleration for Semantic Folding Match one Query Fingerprint against an unlimited number of Document Fingerprints Match one Filter Fingerprint against a stream of incoming Fingerprints Enterprise Search Discovery Search Web Search Social Media Profile Search Desktop: Each Board searches up to 1 Billion Fingerprints per second Enterprise: Each Server searches up to 10 Billion Fingerprints per second Web Scale: Each Rack searches up to 100 Billion Fingerprints per second Real-time Document Classification Email Filtering - Routing DeepPacket Inspection Social Media Topic Detection
  • 27. 27 Semantic Super Computing Platform Retina Engine Converter Module Similar Term Module Context Module Compare Module Retina Database Retina Search Document Index Fingerprint Matcher Search Re-ranker Retina Filter Filter Bank Fingerprint Matcher Filter Re-ranker [ Xilinx Alveo Host System Storage CPU-cores Memory X86 Server Application Server Retina System Administration App Email-Filter App Semantic-Search App Next App Integration Layer Identity Access Management SMTP Connector RDB Connector CMS Connector DMS Connector File Service Connector Email-Filter API Semantic-Search API Next Application API Web Service Connector BPM Connector Management API & Monitoring API
  • 28. 28 Comparing the Leading NLU Approaches "The Enron Email Corpus Archived 2011-03-08 at the Wayback Machine" Retrieved March 5, 2011. Retina Engine (CPU) Retina Engine (FPGA) Pure Keyword Baseline (CPU) FastText (CPU) Doc2Vec (CPU) Word2Vec (CPU) BERT (GPU) BERT (CPU) 70% 75% 80% 85% 90% 0% 0% 1% 10% 100% 1000% Precision Speed ---> faster (logarithmic) Classification Enron email dataset: Farmer Set PyTorch
 (bert-base-uncased) PyTorch
 (bert-base-uncased)
 AWS g3.8xlarge EC2 Scikit-learn 
 TfidfVectorizer official 
 pre-trained model
 Facebook Gensim 
 implementation Pre-trained 
 Google Model 1 x Xilinx
 Alveo 250 +
  • 29. 29 • Banking: • E-mail & Chat Compliance Monitoring • Credit Risk Analysis • CRM: • Customer Intent Analysis • Legal: • Contract Intelligence • Regulatory Process Optimization • Financial Services: • Investment Signal Extraction from News Streams • Life Sciences: • Information Discovery • Media: • Viewer Stream Analytics • Automotive: • Handbook Search • Car Supplier Management • Consolidation of Car Terminology • Technical Support: • Support Intelligence • Social Media: • Organic Topic Mining • Commerce: • Catalogue Management & Automation • Human Resources: • Job Description - Resume Matching Demonstrated Semantic Folding Use Cases
  • 30. 30 Simplicity One Algorithm, One Operator, One Data Format Compositionality Words, Sentences, Paragraphs, Documents Analogy Normalized Representation, Bitwise Similarity Modelability Unsupervised Semantic Model Generation Efficiency Small Amounts of Reference Data Scaleability One Semantic Model Many Use-Cases Replicability Same Use-Case in New Domain Inspectability Refinement, Debugging, Verification Robustness “Graceful Failing” NLU by Semantic Folding
  • 31. info@cortical.io Global Data Sphere (Zettabyte) Transactional Data Machine Generated Data Human Generated Data Social Media Data ML-Data Text-ML-Data Semantic Folding Potential Market L O G D a t a S e n s o r D a t a T e x t D a t a T e x t D a t a Market Potential - Semantic Folding