SlideShare uma empresa Scribd logo
1 de 21
Baixar para ler offline
Dr. Charles Li
Analytics Solution Center
Charles_Li@us.ibm.com

Biometrics, Identity and Big Data Analytics

© 2013 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Topics
Biometrics, Identity & ID Management
Views on Biometrics Technology and System
Big Data Analytics and Challenges
Identity Establishment from All Sources
Identity and Biometrics in the Cloud
Identity and Biometrics Analytics in Motion
Summary

2

© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Biometrics, Identity and ID Management
Entitlement(s)
Actions

Identity

Reputation
(History)

Trust
(Rules)

Identity
Establishment

Status
(Environment)

Identity Management

Players

© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Views on biometrics
technology and system

What is missing?

4

© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Big Data Concept
Extract insight from a high volume, variety and velocity of data in a
timely and cost-effective manner

Data in many forms –
Variety: structured, unstructured, text
and multimedia
Data in Motion – Analysis of
Velocity: streaming data to enable
decisions within fractions of a
second
Volume: Data at Scale - from
terabytes to zettabytes
5

© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Analytics Concept
What is
happening

How many,
how often,
where?

What
exactly is
the
problem?

Structured
Data &
Unstructured
Content

6

Made
consumable
and
accessible to
everyone

What
actions are
needed?

Biometrics
Quality
Monitoring

What could
happen?
Simulation

What if
these
trends
continue?
Forecasting

How can we
achieve the best
outcome?
Optimisation

What will
happen
next if?
Predictive
Modelling

How can we
achieve the best
outcome and
address variability?
Stochastic
Optimisation

Descriptive Predictive
Analytics
Analytics

Prescriptive
Analytics

Biometrics
Reports

Extracting
insight,
concepts and
relationships

Content
Analytics

Deep insights
to improve
visualization
and
marketing
interactions

Visual
Analytics

© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Biometrics Data at Scale – Static & Single Instance
ID Cards/Border Crossings/Benefits/Multiple
Instances
7,000,000,000x(10 Print 0.5-1MB + Face 200KB +
IRIS KB)

DHS IDENT over 150 million
identities;
125,000 transactions daily

7 Exabytes

~100-300 Terabytes
1 GigaBytes = 1000MB
1 TeraBytes = 1000GB

FBI NGI ~ over100 Million
Fingerprints & More PetaBytes
1 coming plus
Faces/Iris

= 1000TB

1 ExaByes
~100-200 Terabytes = 1000PB
1 ZettaBytes = 1000EB
1 YottaBytes = 1000ZB
US DoS has in the range of
100 million faces & Others

~ at least 10-50 Terabytes
EU VIS Biometrics Matching System (BMS) at
70 million individuals and 100K daily enrollment

Prolific Usage of Mobile Phones
6 Billion Mobile Phones
6 Exabytes of behavior data
1 Billion Arrivals 2012 world wide
United States – 100-200 million
international arrivals 2012

1 Exabytes traveling data
Unique Identification Authority of India (UIDAI)
plans to enroll 1.2 billion citizens.(UID
Program) ( enroll million /day; half billion by

3-4 Exabytes Biometrics &

2014)
Biographic Data

~100 Terabyte
many instances, history, transaction, logs… data in reality

© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Big Data Sources

System Transaction, Log and Transition Data – Several Times More!

8

© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Other Big data examples
By 2016, annual Internet traffic
will reach 1.3 Zettabytes

Google processes

> 24 Petabytes
of data in a single day

Facebook processes

Twitter processes

500+ Terabytes of data daily

12 Terabytes of data daily

150 Exabytes global size of

AT&T transfers about
30 Petabytes of data through
its network daily

“Big Data” in Healthcare, growing
between 1.2 and 2.4 EX / year

We don’t have the most challenging problem!
Hadron Collider at CERN
generates 40 Terabytes
of usable data / day

For every session, NY Stock
Exchange captures 1 Terabyte
of trade information

© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Biometric Performance at Giga Scale*
“Brutal Force” De-Duplication
• Cumulative de-duplication / Total number of checks= N(N-1)/2 –
“Combination Problem”
• De-duplicate 100 million population enrollment results
4,999,999,950,000,000 checking!!!
• 15 years to complete with 10 million matches per second

Biometric Accuracy Challenge
• FMR at 1 Identification false match per million;
• 500 False Matches with 1 million enrollment population (de-duplicate)
• 5 million false matches with 100 million enrollment population

Prohibitive!
We have some unique challenges!

* Courtesy to Bojan Cukic
© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Face the Challenges
Identity Establishment with All Data Sources
- Leverage Entity Resolution Technologies
- Leverage ‘Context Accumulation’

Biometrics Services in the Cloud
- Leverage Big Data Infrastructure, Platforms
- Leverage Software Services

Biometrics and Identity Analytics in Motion
- Monitor quality
- Monitor performance

11

© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Establishment Identity with All Sources
Biometrics(physical and behavioral)

• Reduce search space and
computing resources
• Compliment to low quality images
• Cost and benefits tradeoff
• Systematic research necessary
• Successful programs

Biographic information
Behavior data (Social media usage)
Travel data (API, PNR)
Credit Card/Banking Information

Entity /Identity
Resolution
With all
Sources

Web or Mobile App usage behavior
• Emails
• Multimedia

Spatial and temporal information

12

Entity / Identity Resolution - a
complex process involving the
application of sophisticated
algorithms across multiple
heterogeneous data sources to
resolve multiple records into a
single fused view of an individual

© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Biometrics Services in the Cloud - Leverage Big Data
Infrastructure, Platform and Software Services

Cloud Solutions
Software and Business Process as a Service

Business Process
BPaaS

Business Analytics
and Optimization

Social Business

Smarter Commerce

Smarter Cities

Enrolment Service
Process
Data

Process
Data

1:1 Identification Service

Process
Data

….

Software
SaaS

Standard Interface

Cloud Services
Infrastructure and Platform as a Service

Application Services

Platform
PaaS
Application
Lifecycle

Application
Resources

Application
Environments

Enterprise
Fingerprint

Face

Iris

Biometric Data

Infrastructure
aaS

Infrastructure
Management
Availability and
Platform
and Administration Performance

Application
Management

Integration

Enterprise+

Security and
Compliance

Usage and
Accounting

Deployment
Note: Cloud & Big Data not the same

Private, Public and Hybrid Models
© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Exemplary Progress
A Prototype - Leveraging the cloud for Big Data Biometrics
• E. Kohlwey et al. “Leveraging the Cloud for Big Data Biometrics,
2011
• A prototype system for generalized searching of cloud-scale
biometric data as well as an application of this system to the task of
matching collection of synthetic human iris images
• Implemented with Hadoop (Map/Reduce framework)
Successful deployment of Identification algorithms for India
UID program
• Non-traditional matching vendor technologies
Biometrics as a Service
• Business process as a service
• Software as a service

14

© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Challenges
Focus on Parallelism and Scalability
• Excellent research and testing areas
• Bring algorithms into operational environment
Explore defining biometrics as a service program –
new way of thinking about acquisition
• Business process as a service
• Software as a service
Encourage partnership among Big Data & Analytics
developers, traditional biometrics solution
providers
• Big Data and Analytics players
15

© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Big Data Appliance Examples
IBM Nettezza
Oracle EXADATA
Terradata
EMC2 Greenplum
SAP HANA
Schooner Appliance MySQL

Example - (CBP) 40TB data (per appliance, a few hundreds
cores) hosted by a little more than a dozen appliances support
30 – 40 % of DHS’s operations
16

© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Biometrics and Identity Analytics in Motion
ROC curve calibration along the security vs convenience
• Allow systems to dynamically change operation criteria based on live situation
• This is a real challenge due to the needed ground truth…

Quality Feedback to the Collection
• Avoid collecting ‘bad’ data to degrade the system

Operating Metrics Monitoring
• Rates on enrollment, rejection and etc.
• Geo-location and temporal information
Fuse all data sources based on real time feedback
• Dynamically allocating fusion algorithms and configurations

Provide controlled parallelism
• System and algorithms levels

17

© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

One Approach - Streams Technology in Working
Continuous ingestion
Continuous analysis
Filter / Sample

Infrastructure provides services for
Scheduling analytics across hardware hosts,
Establishing streaming connectivity

Annotate

Transform

Correlate
Classify

Near Real Time on Big Data Platform
Achieve scale:
By partitioning applications into software components
By distributing across stream-connected hardware hosts
© 2013 IBM
1
Corporation

Where appropriate:
Elements can be fused together
for lower communication latency
© 2009 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

Summary
Re-focus on Identity
• Biometrics as an enabling technology

Re-thinking on
• Open architecture
• Vendor agnostic solution via biometrics middleware

Big Impact by Big Data and Cloud Technologies
• Biometrics as a Service to Leverage Cloud Computing
Big Data Real Time Platform
• Near real time analytics requirements

19

© 2009 IBM Corporation
20

© 2013 IBM Corporation
Leveraging Information for Smarter Organizational Outcomes

A New Look - Identity and Biometrics Analytics
Real-time feeds

Business
Workflow Resolution

Real
Time
Biometrics
Capture Data
Biographic
Data

Stream in
Parallel

Including
many
Models
Entity /Identity
Resolution

Pipeline
Identification
Services
Predictive Models

Unstructured data

Social Media
Info on Web
Behavioral data

High
Volume

Content
Analytics
Big Data
Platform

Travel Data
Banking Data
Spatial Data
Temporal Data

21

Big Data
Solution

Massively
Parallel
Processing

Visualization Analytics

Report – Descriptive
Analytics

© 2009 IBM Corporation

Mais conteúdo relacionado

Mais procurados

Big data a possible game changer for e-governance
Big data   a possible game changer for e-governanceBig data   a possible game changer for e-governance
Big data a possible game changer for e-governanceSomenath Nag
 
Big Data Techcon 2014
Big Data Techcon 2014Big Data Techcon 2014
Big Data Techcon 2014Samir Lad
 
Ls subramanian internet of things
Ls subramanian internet of thingsLs subramanian internet of things
Ls subramanian internet of thingspromediakw
 
Smart Grid Analytics: All That Remains to be Ready is You
Smart Grid Analytics: All That Remains to be Ready is YouSmart Grid Analytics: All That Remains to be Ready is You
Smart Grid Analytics: All That Remains to be Ready is YouLauren Watters
 
IBM Watson IoT - New Possibilities in a Connected World
IBM Watson IoT - New Possibilities in a Connected WorldIBM Watson IoT - New Possibilities in a Connected World
IBM Watson IoT - New Possibilities in a Connected WorldCasey Lucas
 
Internet of Things (IoT)
Internet of Things (IoT)Internet of Things (IoT)
Internet of Things (IoT)Prakhyath Rai
 
Zinnov Zones for IoT Services 2017
Zinnov Zones for IoT Services 2017Zinnov Zones for IoT Services 2017
Zinnov Zones for IoT Services 2017Zinnov
 
Technology Trends and Big Data in 2013-2014
Technology Trends and Big Data in 2013-2014Technology Trends and Big Data in 2013-2014
Technology Trends and Big Data in 2013-2014KMS Technology
 
Ctrls-Company Presentation
Ctrls-Company PresentationCtrls-Company Presentation
Ctrls-Company PresentationCTRLS
 
Disaster Recovery Trends In India - Future Outlook
Disaster Recovery Trends In India - Future OutlookDisaster Recovery Trends In India - Future Outlook
Disaster Recovery Trends In India - Future OutlookCTRLS
 
Data Management The Next Level
 Data Management The Next Level Data Management The Next Level
Data Management The Next LevelCTRLS
 
A Framework for Cloud Computing Adoption in South African Government
A Framework for Cloud Computing Adoption in South African GovernmentA Framework for Cloud Computing Adoption in South African Government
A Framework for Cloud Computing Adoption in South African GovernmentGovCloud Network
 
Lijun-Ravi
Lijun-RaviLijun-Ravi
Lijun-RaviEnergyIP
 
Business value Drivers for IoT Solutions
Business value Drivers for IoT SolutionsBusiness value Drivers for IoT Solutions
Business value Drivers for IoT SolutionsIBM_Info_Management
 
Cloud Computing : Situation in Thailand
Cloud Computing : Situation in ThailandCloud Computing : Situation in Thailand
Cloud Computing : Situation in ThailandSoftware Park Thailand
 

Mais procurados (20)

Big data a possible game changer for e-governance
Big data   a possible game changer for e-governanceBig data   a possible game changer for e-governance
Big data a possible game changer for e-governance
 
Big Data Techcon 2014
Big Data Techcon 2014Big Data Techcon 2014
Big Data Techcon 2014
 
Ls subramanian internet of things
Ls subramanian internet of thingsLs subramanian internet of things
Ls subramanian internet of things
 
Smart Grid Analytics: All That Remains to be Ready is You
Smart Grid Analytics: All That Remains to be Ready is YouSmart Grid Analytics: All That Remains to be Ready is You
Smart Grid Analytics: All That Remains to be Ready is You
 
IBM Watson IoT - New Possibilities in a Connected World
IBM Watson IoT - New Possibilities in a Connected WorldIBM Watson IoT - New Possibilities in a Connected World
IBM Watson IoT - New Possibilities in a Connected World
 
The Full Spectrum of IoT Electronics
The Full Spectrum of IoT ElectronicsThe Full Spectrum of IoT Electronics
The Full Spectrum of IoT Electronics
 
Internet of Things (IoT)
Internet of Things (IoT)Internet of Things (IoT)
Internet of Things (IoT)
 
Zinnov Zones for IoT Services 2017
Zinnov Zones for IoT Services 2017Zinnov Zones for IoT Services 2017
Zinnov Zones for IoT Services 2017
 
Technology Trends and Big Data in 2013-2014
Technology Trends and Big Data in 2013-2014Technology Trends and Big Data in 2013-2014
Technology Trends and Big Data in 2013-2014
 
Ctrls-Company Presentation
Ctrls-Company PresentationCtrls-Company Presentation
Ctrls-Company Presentation
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Data dynamics in IoT Era
Data dynamics in IoT EraData dynamics in IoT Era
Data dynamics in IoT Era
 
Disaster Recovery Trends In India - Future Outlook
Disaster Recovery Trends In India - Future OutlookDisaster Recovery Trends In India - Future Outlook
Disaster Recovery Trends In India - Future Outlook
 
Data Management The Next Level
 Data Management The Next Level Data Management The Next Level
Data Management The Next Level
 
220401IMI2.pptx
220401IMI2.pptx220401IMI2.pptx
220401IMI2.pptx
 
A Framework for Cloud Computing Adoption in South African Government
A Framework for Cloud Computing Adoption in South African GovernmentA Framework for Cloud Computing Adoption in South African Government
A Framework for Cloud Computing Adoption in South African Government
 
Ibm iot overview
Ibm   iot overviewIbm   iot overview
Ibm iot overview
 
Lijun-Ravi
Lijun-RaviLijun-Ravi
Lijun-Ravi
 
Business value Drivers for IoT Solutions
Business value Drivers for IoT SolutionsBusiness value Drivers for IoT Solutions
Business value Drivers for IoT Solutions
 
Cloud Computing : Situation in Thailand
Cloud Computing : Situation in ThailandCloud Computing : Situation in Thailand
Cloud Computing : Situation in Thailand
 

Destaque

Assignment week7
Assignment week7Assignment week7
Assignment week7s1200022
 
Manley Solutions- In-Building Systems
Manley Solutions- In-Building SystemsManley Solutions- In-Building Systems
Manley Solutions- In-Building Systemsstumanley
 
Introduction to cambridge semantics trade surveillance 2015
Introduction to cambridge semantics trade surveillance 2015Introduction to cambridge semantics trade surveillance 2015
Introduction to cambridge semantics trade surveillance 2015John Rueter
 
Modul 1 overview ft tx
Modul 1 overview ft txModul 1 overview ft tx
Modul 1 overview ft txSherly Toresia
 
OSEHRA Summit 2012 Lunch Keynote: Current health IT systems integrate poorly ...
OSEHRA Summit 2012 Lunch Keynote: Current health IT systems integrate poorly ...OSEHRA Summit 2012 Lunch Keynote: Current health IT systems integrate poorly ...
OSEHRA Summit 2012 Lunch Keynote: Current health IT systems integrate poorly ...Shahid Shah
 

Destaque (6)

Assignment week7
Assignment week7Assignment week7
Assignment week7
 
Manley Solutions- In-Building Systems
Manley Solutions- In-Building SystemsManley Solutions- In-Building Systems
Manley Solutions- In-Building Systems
 
BioIT 2015 Data Lake Talk
BioIT 2015 Data Lake TalkBioIT 2015 Data Lake Talk
BioIT 2015 Data Lake Talk
 
Introduction to cambridge semantics trade surveillance 2015
Introduction to cambridge semantics trade surveillance 2015Introduction to cambridge semantics trade surveillance 2015
Introduction to cambridge semantics trade surveillance 2015
 
Modul 1 overview ft tx
Modul 1 overview ft txModul 1 overview ft tx
Modul 1 overview ft tx
 
OSEHRA Summit 2012 Lunch Keynote: Current health IT systems integrate poorly ...
OSEHRA Summit 2012 Lunch Keynote: Current health IT systems integrate poorly ...OSEHRA Summit 2012 Lunch Keynote: Current health IT systems integrate poorly ...
OSEHRA Summit 2012 Lunch Keynote: Current health IT systems integrate poorly ...
 

Semelhante a Li charles biometrics analytics & big data 122013a for release

Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data PlatformVikas Manoria
 
IBM Technology Day 2013 BigData Salle Rome
IBM Technology Day 2013 BigData Salle RomeIBM Technology Day 2013 BigData Salle Rome
IBM Technology Day 2013 BigData Salle RomeIBM Switzerland
 
Future of Power: Big Data - Søren Ravn
Future of Power: Big Data - Søren RavnFuture of Power: Big Data - Søren Ravn
Future of Power: Big Data - Søren RavnIBM Danmark
 
Ibm big data-platform
Ibm big data-platformIbm big data-platform
Ibm big data-platformIBM Sverige
 
IBM an Era of new computing
IBM an Era of new computingIBM an Era of new computing
IBM an Era of new computingShane McCaul
 
Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!Jeffrey T. Pollock
 
Big Data & Analytics, Peter Jönsson
Big Data & Analytics, Peter JönssonBig Data & Analytics, Peter Jönsson
Big Data & Analytics, Peter JönssonIBM Danmark
 
OpTier McKinsey Big Data Overview
OpTier McKinsey Big Data OverviewOpTier McKinsey Big Data Overview
OpTier McKinsey Big Data Overviewnickychu
 
McKinsey Big Data Overview
McKinsey Big Data OverviewMcKinsey Big Data Overview
McKinsey Big Data Overviewoptier
 
McKinsey Big Data Overview
McKinsey Big Data OverviewMcKinsey Big Data Overview
McKinsey Big Data Overviewoptier
 
Qo Introduction V2
Qo Introduction V2Qo Introduction V2
Qo Introduction V2Joe_F
 
Key note big data analytics ecosystem strategy
Key note   big data analytics ecosystem strategyKey note   big data analytics ecosystem strategy
Key note big data analytics ecosystem strategyIBM Sverige
 
Is your data paying you dividends?
Is your data paying you dividends? Is your data paying you dividends?
Is your data paying you dividends? Karan Sachdeva
 
Why You Need to Govern Big Data
Why You Need to Govern Big DataWhy You Need to Govern Big Data
Why You Need to Govern Big DataIBM Analytics
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?Aerospike, Inc.
 

Semelhante a Li charles biometrics analytics & big data 122013a for release (20)

Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data Platform
 
Machine Data Analytics
Machine Data AnalyticsMachine Data Analytics
Machine Data Analytics
 
IBM Technology Day 2013 BigData Salle Rome
IBM Technology Day 2013 BigData Salle RomeIBM Technology Day 2013 BigData Salle Rome
IBM Technology Day 2013 BigData Salle Rome
 
Future of Power: Big Data - Søren Ravn
Future of Power: Big Data - Søren RavnFuture of Power: Big Data - Søren Ravn
Future of Power: Big Data - Søren Ravn
 
Ibm big data-platform
Ibm big data-platformIbm big data-platform
Ibm big data-platform
 
IBM an Era of new computing
IBM an Era of new computingIBM an Era of new computing
IBM an Era of new computing
 
09 research
09 research09 research
09 research
 
Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!
 
Big Data & Analytics, Peter Jönsson
Big Data & Analytics, Peter JönssonBig Data & Analytics, Peter Jönsson
Big Data & Analytics, Peter Jönsson
 
OpTier McKinsey Big Data Overview
OpTier McKinsey Big Data OverviewOpTier McKinsey Big Data Overview
OpTier McKinsey Big Data Overview
 
McKinsey Big Data Overview
McKinsey Big Data OverviewMcKinsey Big Data Overview
McKinsey Big Data Overview
 
McKinsey Big Data Overview
McKinsey Big Data OverviewMcKinsey Big Data Overview
McKinsey Big Data Overview
 
Qo Introduction V2
Qo Introduction V2Qo Introduction V2
Qo Introduction V2
 
Key note big data analytics ecosystem strategy
Key note   big data analytics ecosystem strategyKey note   big data analytics ecosystem strategy
Key note big data analytics ecosystem strategy
 
Is your data paying you dividends?
Is your data paying you dividends? Is your data paying you dividends?
Is your data paying you dividends?
 
Why You Need to Govern Big Data
Why You Need to Govern Big DataWhy You Need to Govern Big Data
Why You Need to Govern Big Data
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?
 
Ijdbms
IjdbmsIjdbms
Ijdbms
 
Ijdbms
IjdbmsIjdbms
Ijdbms
 

Último

Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfOverkill Security
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede
 

Último (20)

Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 

Li charles biometrics analytics & big data 122013a for release

  • 1. Dr. Charles Li Analytics Solution Center Charles_Li@us.ibm.com Biometrics, Identity and Big Data Analytics © 2013 IBM Corporation
  • 2. Leveraging Information for Smarter Organizational Outcomes Topics Biometrics, Identity & ID Management Views on Biometrics Technology and System Big Data Analytics and Challenges Identity Establishment from All Sources Identity and Biometrics in the Cloud Identity and Biometrics Analytics in Motion Summary 2 © 2009 IBM Corporation
  • 3. Leveraging Information for Smarter Organizational Outcomes Biometrics, Identity and ID Management Entitlement(s) Actions Identity Reputation (History) Trust (Rules) Identity Establishment Status (Environment) Identity Management Players © 2009 IBM Corporation
  • 4. Leveraging Information for Smarter Organizational Outcomes Views on biometrics technology and system What is missing? 4 © 2009 IBM Corporation
  • 5. Leveraging Information for Smarter Organizational Outcomes Big Data Concept Extract insight from a high volume, variety and velocity of data in a timely and cost-effective manner Data in many forms – Variety: structured, unstructured, text and multimedia Data in Motion – Analysis of Velocity: streaming data to enable decisions within fractions of a second Volume: Data at Scale - from terabytes to zettabytes 5 © 2009 IBM Corporation
  • 6. Leveraging Information for Smarter Organizational Outcomes Analytics Concept What is happening How many, how often, where? What exactly is the problem? Structured Data & Unstructured Content 6 Made consumable and accessible to everyone What actions are needed? Biometrics Quality Monitoring What could happen? Simulation What if these trends continue? Forecasting How can we achieve the best outcome? Optimisation What will happen next if? Predictive Modelling How can we achieve the best outcome and address variability? Stochastic Optimisation Descriptive Predictive Analytics Analytics Prescriptive Analytics Biometrics Reports Extracting insight, concepts and relationships Content Analytics Deep insights to improve visualization and marketing interactions Visual Analytics © 2009 IBM Corporation
  • 7. Leveraging Information for Smarter Organizational Outcomes Biometrics Data at Scale – Static & Single Instance ID Cards/Border Crossings/Benefits/Multiple Instances 7,000,000,000x(10 Print 0.5-1MB + Face 200KB + IRIS KB) DHS IDENT over 150 million identities; 125,000 transactions daily 7 Exabytes ~100-300 Terabytes 1 GigaBytes = 1000MB 1 TeraBytes = 1000GB FBI NGI ~ over100 Million Fingerprints & More PetaBytes 1 coming plus Faces/Iris = 1000TB 1 ExaByes ~100-200 Terabytes = 1000PB 1 ZettaBytes = 1000EB 1 YottaBytes = 1000ZB US DoS has in the range of 100 million faces & Others ~ at least 10-50 Terabytes EU VIS Biometrics Matching System (BMS) at 70 million individuals and 100K daily enrollment Prolific Usage of Mobile Phones 6 Billion Mobile Phones 6 Exabytes of behavior data 1 Billion Arrivals 2012 world wide United States – 100-200 million international arrivals 2012 1 Exabytes traveling data Unique Identification Authority of India (UIDAI) plans to enroll 1.2 billion citizens.(UID Program) ( enroll million /day; half billion by 3-4 Exabytes Biometrics & 2014) Biographic Data ~100 Terabyte many instances, history, transaction, logs… data in reality © 2009 IBM Corporation
  • 8. Leveraging Information for Smarter Organizational Outcomes Big Data Sources System Transaction, Log and Transition Data – Several Times More! 8 © 2009 IBM Corporation
  • 9. Leveraging Information for Smarter Organizational Outcomes Other Big data examples By 2016, annual Internet traffic will reach 1.3 Zettabytes Google processes > 24 Petabytes of data in a single day Facebook processes Twitter processes 500+ Terabytes of data daily 12 Terabytes of data daily 150 Exabytes global size of AT&T transfers about 30 Petabytes of data through its network daily “Big Data” in Healthcare, growing between 1.2 and 2.4 EX / year We don’t have the most challenging problem! Hadron Collider at CERN generates 40 Terabytes of usable data / day For every session, NY Stock Exchange captures 1 Terabyte of trade information © 2009 IBM Corporation
  • 10. Leveraging Information for Smarter Organizational Outcomes Biometric Performance at Giga Scale* “Brutal Force” De-Duplication • Cumulative de-duplication / Total number of checks= N(N-1)/2 – “Combination Problem” • De-duplicate 100 million population enrollment results 4,999,999,950,000,000 checking!!! • 15 years to complete with 10 million matches per second Biometric Accuracy Challenge • FMR at 1 Identification false match per million; • 500 False Matches with 1 million enrollment population (de-duplicate) • 5 million false matches with 100 million enrollment population Prohibitive! We have some unique challenges! * Courtesy to Bojan Cukic © 2009 IBM Corporation
  • 11. Leveraging Information for Smarter Organizational Outcomes Face the Challenges Identity Establishment with All Data Sources - Leverage Entity Resolution Technologies - Leverage ‘Context Accumulation’ Biometrics Services in the Cloud - Leverage Big Data Infrastructure, Platforms - Leverage Software Services Biometrics and Identity Analytics in Motion - Monitor quality - Monitor performance 11 © 2009 IBM Corporation
  • 12. Leveraging Information for Smarter Organizational Outcomes Establishment Identity with All Sources Biometrics(physical and behavioral) • Reduce search space and computing resources • Compliment to low quality images • Cost and benefits tradeoff • Systematic research necessary • Successful programs Biographic information Behavior data (Social media usage) Travel data (API, PNR) Credit Card/Banking Information Entity /Identity Resolution With all Sources Web or Mobile App usage behavior • Emails • Multimedia Spatial and temporal information 12 Entity / Identity Resolution - a complex process involving the application of sophisticated algorithms across multiple heterogeneous data sources to resolve multiple records into a single fused view of an individual © 2009 IBM Corporation
  • 13. Leveraging Information for Smarter Organizational Outcomes Biometrics Services in the Cloud - Leverage Big Data Infrastructure, Platform and Software Services Cloud Solutions Software and Business Process as a Service Business Process BPaaS Business Analytics and Optimization Social Business Smarter Commerce Smarter Cities Enrolment Service Process Data Process Data 1:1 Identification Service Process Data …. Software SaaS Standard Interface Cloud Services Infrastructure and Platform as a Service Application Services Platform PaaS Application Lifecycle Application Resources Application Environments Enterprise Fingerprint Face Iris Biometric Data Infrastructure aaS Infrastructure Management Availability and Platform and Administration Performance Application Management Integration Enterprise+ Security and Compliance Usage and Accounting Deployment Note: Cloud & Big Data not the same Private, Public and Hybrid Models © 2009 IBM Corporation
  • 14. Leveraging Information for Smarter Organizational Outcomes Exemplary Progress A Prototype - Leveraging the cloud for Big Data Biometrics • E. Kohlwey et al. “Leveraging the Cloud for Big Data Biometrics, 2011 • A prototype system for generalized searching of cloud-scale biometric data as well as an application of this system to the task of matching collection of synthetic human iris images • Implemented with Hadoop (Map/Reduce framework) Successful deployment of Identification algorithms for India UID program • Non-traditional matching vendor technologies Biometrics as a Service • Business process as a service • Software as a service 14 © 2009 IBM Corporation
  • 15. Leveraging Information for Smarter Organizational Outcomes Challenges Focus on Parallelism and Scalability • Excellent research and testing areas • Bring algorithms into operational environment Explore defining biometrics as a service program – new way of thinking about acquisition • Business process as a service • Software as a service Encourage partnership among Big Data & Analytics developers, traditional biometrics solution providers • Big Data and Analytics players 15 © 2009 IBM Corporation
  • 16. Leveraging Information for Smarter Organizational Outcomes Big Data Appliance Examples IBM Nettezza Oracle EXADATA Terradata EMC2 Greenplum SAP HANA Schooner Appliance MySQL Example - (CBP) 40TB data (per appliance, a few hundreds cores) hosted by a little more than a dozen appliances support 30 – 40 % of DHS’s operations 16 © 2009 IBM Corporation
  • 17. Leveraging Information for Smarter Organizational Outcomes Biometrics and Identity Analytics in Motion ROC curve calibration along the security vs convenience • Allow systems to dynamically change operation criteria based on live situation • This is a real challenge due to the needed ground truth… Quality Feedback to the Collection • Avoid collecting ‘bad’ data to degrade the system Operating Metrics Monitoring • Rates on enrollment, rejection and etc. • Geo-location and temporal information Fuse all data sources based on real time feedback • Dynamically allocating fusion algorithms and configurations Provide controlled parallelism • System and algorithms levels 17 © 2009 IBM Corporation
  • 18. Leveraging Information for Smarter Organizational Outcomes One Approach - Streams Technology in Working Continuous ingestion Continuous analysis Filter / Sample Infrastructure provides services for Scheduling analytics across hardware hosts, Establishing streaming connectivity Annotate Transform Correlate Classify Near Real Time on Big Data Platform Achieve scale: By partitioning applications into software components By distributing across stream-connected hardware hosts © 2013 IBM 1 Corporation Where appropriate: Elements can be fused together for lower communication latency © 2009 IBM Corporation
  • 19. Leveraging Information for Smarter Organizational Outcomes Summary Re-focus on Identity • Biometrics as an enabling technology Re-thinking on • Open architecture • Vendor agnostic solution via biometrics middleware Big Impact by Big Data and Cloud Technologies • Biometrics as a Service to Leverage Cloud Computing Big Data Real Time Platform • Near real time analytics requirements 19 © 2009 IBM Corporation
  • 20. 20 © 2013 IBM Corporation
  • 21. Leveraging Information for Smarter Organizational Outcomes A New Look - Identity and Biometrics Analytics Real-time feeds Business Workflow Resolution Real Time Biometrics Capture Data Biographic Data Stream in Parallel Including many Models Entity /Identity Resolution Pipeline Identification Services Predictive Models Unstructured data Social Media Info on Web Behavioral data High Volume Content Analytics Big Data Platform Travel Data Banking Data Spatial Data Temporal Data 21 Big Data Solution Massively Parallel Processing Visualization Analytics Report – Descriptive Analytics © 2009 IBM Corporation