Document Recognition
       a technology overview

Presented by:
Chris Riley of Artsyl Technologies, Inc.
But First
 Your new AIIM Board!

 Exciting new events
    Golf
    Networking
    More Education Sessions
What we will cover:
 Why Chris?

 What Are the Document Recognition Technologies

 Who Makes Them

 Buyer Beware

 The future

 Q&A

 Free Stuff!
Why Chris?
 Who is Artsyl?

 What qualifies Chris to talk to me?

   When a developer turns to sales
Who knows what OCR is?
The Technologies
 OCR – Optical Character Recognition
 ICR – Intelligent Character Recognition
 OMR – Optical Mark Recognition
 Barcode
 Handwriting
 All the other ones made up for marketing purposes



 CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition
 Assisted Capture
 Fixed Form Processing
 Semi-Structured Forms Processing
 Unstructured Document Processing
The Technologies: OCR
 Example: Ship To:
The Technologies: ICR
 Example: Ilya
The Technologies: OMR
 Example: Card Account
The Technologies: Barcode
 Example: 1889094476620
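The Barcode slide does not say which symbology its 13-digit sample (1889094476620) uses. Assuming it is EAN-13, which is an assumption and not something the slide states, a decoded value can be sanity-checked against its check digit. A minimal sketch in Python; the function names are illustrative only:

```python
def ean13_check_digit(first12: str) -> int:
    """Compute the EAN-13 check digit for the first 12 digits."""
    if len(first12) != 12 or not first12.isdigit():
        raise ValueError("expected exactly 12 digits")
    # Weights alternate 1, 3, 1, 3, ... starting from the leftmost digit.
    total = sum(int(d) * (3 if i % 2 else 1) for i, d in enumerate(first12))
    return (10 - total % 10) % 10


def is_valid_ean13(code: str) -> bool:
    """Return True if code has 13 digits and the last one matches the checksum."""
    if len(code) != 13 or not code.isdigit():
        return False
    return ean13_check_digit(code[:12]) == int(code[-1])


sample = "1889094476620"  # value shown on the Barcode slide; EAN-13 is assumed
print(is_valid_ean13(sample))
```

A checksum like this is one reason barcode reads tend to be easier to verify than free-text OCR results: a misread digit usually fails the check.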
The Technologies: Handwriting
 Example: * Critical *
The Technologies: Acronym Heaven
The Technologies: CAR/LAR
 Example: 2 hundred dollars & no cents
The Technologies: Assisted Capture
The Technologies: Fixed Form Processing

 Name: Ilya
 Date: 12/21/2982
80% of business end-user documents are semi-structured
The Technologies: Semi-Structured Forms

 Invoice No: 99044
 Date: 06/09/04

 Invoice No: 24567
 Date: 06/09/04 (06/09/2004)
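The invoice samples above carry the same two fields, and the last one shows a two-digit year expanded to four digits. One common way to handle semi-structured pages is to search for the field label wherever it appears instead of reading a fixed zone. The sketch below assumes the OCR step has already produced plain text; the page strings, the patterns, and the choice of the 2000s as the century are all hypothetical, not taken from any product:

```python
import re

# Hypothetical OCR output from two invoices with different layouts.
PAGE_A = "ACME Corp\nInvoice No: 99044   Date: 06/09/04\nTotal: 120.00"
PAGE_B = "Date: 06/09/04\nBill To: Someone\n\nInvoice No: 24567"

# Label-anchored patterns: the value is whatever follows the label.
FIELD_PATTERNS = {
    "invoice_no": re.compile(r"Invoice\s+No[.:]?\s*(\d+)", re.IGNORECASE),
    "date": re.compile(r"Date[.:]?\s*(\d{2}/\d{2}/\d{2,4})", re.IGNORECASE),
}


def extract_fields(text):
    """Return the first match for each known label, or None if absent."""
    fields = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        fields[name] = match.group(1) if match else None
    return fields


def expand_year(date_text, century=2000):
    """Expand NN/NN/YY to NN/NN/YYYY; the century is an assumption."""
    first, second, year = date_text.split("/")
    if len(year) == 2:
        year = str(century + int(year))
    return f"{first}/{second}/{year}"


for page in (PAGE_A, PAGE_B):
    found = extract_fields(page)
    print(found["invoice_no"], expand_year(found["date"]))
```

The same patterns work on both pages even though the fields appear in a different order, which is the property that separates semi-structured from fixed form processing.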
The Technologies: Semi-Structured Forms

 Consignee
 Consignor
 Date
 Term
The Technologies: Common Processes

 Full page conversion
 Classification
 Index level extraction

 Redaction
 Routing
 Auto Filing
 Re-Purposing
 Image Rotation
The Technologies: Full page conversion

 Image file to electronic data file
 ALL text on the page
 Includes:
   Image Pre-processing
   Document Analysis/Zoning
   Extraction
   Export ( Commonly PDF, DOC )
The Technologies: Classification

 Software tells you the document type
 Scan batches of mixed documents


 Example document types: Bill of Lading, Invoice, Check, PO
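As a rough illustration of classification, the sketch below sorts recognized page text into the four document types pictured on the slide (bill of lading, invoice, check, PO) by counting keywords. This is a deliberately naive stand-in, not how any vendor named in this talk does it, and the keyword lists are invented for the example:

```python
import re
from collections import Counter

# Hypothetical keyword lists; a real system would learn or tune these.
KEYWORDS = {
    "bill_of_lading": {"lading", "consignee", "consignor", "carrier"},
    "invoice": {"invoice", "due", "remit", "terms"},
    "check": {"pay", "order", "dollars", "memo"},
    "purchase_order": {"purchase", "po", "ship", "quantity"},
}


def classify(text):
    """Return the document type whose keywords occur most often in text."""
    words = Counter(re.findall(r"[a-z]+", text.lower()))
    scores = {
        doc_type: sum(words[w] for w in keywords)
        for doc_type, keywords in KEYWORDS.items()
    }
    best = max(scores, key=scores.get)  # ties go to the first type listed
    return best if scores[best] > 0 else "unknown"


batch = [
    "Invoice No: 99044 Total Amt Due 120.00 Terms Net 30",
    "Bill of Lading Consignee: X Consignor: Y Carrier: Z",
    "Pay to the order of John two hundred dollars",
]
for page in batch:
    print(classify(page))
```

Returning "unknown" when nothing matches gives a mixed batch a place to send pages for manual review instead of forcing a guess.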
The Technologies: Index Level Extraction

 Just certain required fields extracted
 Normalization of data
 Export usually to a database

                   Invoice Number
                     Invoice Date
                    Total Amt Due
                         Term
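The three bullets on this slide (extract a few fields, normalize them, export to a database) can be sketched end to end. The example below assumes the four fields have already been located and read as raw strings; the raw values, the US month/day order, and the table layout are assumptions made for illustration:

```python
import sqlite3
from datetime import datetime
from decimal import Decimal


def normalize_date(raw):
    """Parse MM/DD/YY or MM/DD/YYYY (US order assumed) into ISO 8601."""
    for fmt in ("%m/%d/%y", "%m/%d/%Y"):
        try:
            return datetime.strptime(raw.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    raise ValueError(f"unrecognized date: {raw!r}")


def normalize_amount(raw):
    """Strip currency symbols and thousands separators."""
    return Decimal(raw.replace("$", "").replace(",", "").strip())


def normalize_record(raw):
    """Turn raw extracted strings into one clean database row."""
    return (
        raw["Invoice Number"].strip(),
        normalize_date(raw["Invoice Date"]),
        str(normalize_amount(raw["Total Amt Due"])),
        raw["Term"].strip(),
    )


def export(rows):
    """Write rows to an in-memory SQLite table (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE invoices "
        "(invoice_number TEXT, invoice_date TEXT, total_amt_due TEXT, term TEXT)"
    )
    conn.executemany("INSERT INTO invoices VALUES (?, ?, ?, ?)", rows)
    conn.commit()
    return conn


# Hypothetical raw output of the extraction step.
raw_fields = {
    "Invoice Number": " 99044 ",
    "Invoice Date": "06/09/04",
    "Total Amt Due": "$1,234.50",
    "Term": "Net 30",
}
conn = export([normalize_record(raw_fields)])
print(conn.execute("SELECT * FROM invoices").fetchall())
```

Normalizing before export means every row uses one date format and one number format, whatever the source documents looked like.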
The Technologies: How Accurate?

 A better question: how do you determine accuracy?

 Document Type Accuracy
 Field/Zone Location Accuracy
 Data Type Accuracy
 Character Accuracy
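To see why the level of measurement matters, here is a small sketch that scores the same recognition result at two of the levels listed above. It assumes character accuracy is defined as one minus edit distance over ground-truth length, and field accuracy as the share of fields that match exactly; both definitions are common but are assumptions here, not something the slide specifies:

```python
def edit_distance(a, b):
    """Levenshtein distance between two strings (row-by-row DP)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def character_accuracy(truth, recognized):
    """Share of ground-truth characters that were recognized correctly."""
    if not truth:
        return 1.0 if not recognized else 0.0
    return max(0.0, 1 - edit_distance(truth, recognized) / len(truth))


def field_accuracy(truth_fields, recognized_fields):
    """Share of fields whose recognized value matches the truth exactly."""
    correct = sum(
        1 for key, value in truth_fields.items()
        if recognized_fields.get(key) == value
    )
    return correct / len(truth_fields)


# Hypothetical result: the engine read a zero as the letter O.
truth = {"invoice_no": "99044", "date": "06/09/04"}
recognized = {"invoice_no": "99O44", "date": "06/09/04"}

print(character_accuracy("Invoice No: 99044", "Invoice No: 99O44"))
print(field_accuracy(truth, recognized))
```

One wrong character out of 17 gives roughly 94% character accuracy on that line, yet only one of the two fields is usable, so field accuracy is 50%. A vendor quoting the first number and a buyer expecting the second are not talking about the same thing.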
The Technologies: Common usage scenarios

 Document Conversion

 Document Archival / Retrieval

 Invoice Processing

 Insurance Processing (medical, mortgage)

 Waybill processing

 Survey processing
There really are only 3 core technology providers

It takes 50 man-years to develop OCR using current computing abilities
Who Makes Them: Core Engines
 ABBYY
 Nuance (formerly ScanSoft)
 ReadI.R.I.S

 Océ
 CharacTell
 ParaScript
 A2iA

 Handful of Open Source
 Handful of Other Vendors
 Two handfuls of OLD engines
Who Makes Them: Who Licenses Them
EVERYONE ELSE!
AnaComp
Anydoc
BancTec
BrainWare
Captaris
Captivation
Cardiff
CVision
DataCap
DigiTech
eCopy
EMC Documentum
Kofax
LaserFiche
LeadTools
Microsoft
NSi AutoStore
OnBase
Perceptive Imaging
ReadSoft
SER
Top Image Systems
Tower
Westbrook
Xerox


Hundreds More
30% of organizations that purchase, purchase the wrong thing

Over 50% of organizations that purchase never use it properly
Buyer Beware
 If OCR is the reason for buying a solution, know which engine it is!

 Talk about the WHOLE solution not the pieces

 Get past marketing gimmicks

 Trust, Love, Be Certain of your reseller / vendor
Buyer Beware: Know your engine

 What version?
 Will they upgrade?
Buyer Beware: Talk about Whole Solution

 Scanner / Input
 Capture
 Storage

 Have a requirements list beforehand
Buyer Beware: Get past Gimmicks

 NOTHING is 100%!

 All canned demos work perfectly

 Always see a test on your own documents

 Version numbers are really arbitrary
Buyer Beware: Trust your vendor / reseller

 Support after sale ( test them )

 Where to get professional services

 Do they understand the solution and not
 just the pieces?
The Future
 Full-page OCR will be a commodity

 Advanced document processing will become mainstream
 but less required


 Think about what to do now that you will be gathering
 data rapidly

 There will be a new approach to OCR
Questions and Answers
 Before you ask
Free Stuff
 Copy of ABBYY FineReader Pro 9.0
 Copy of Nuance OmniPage 16
 Copy of ReadI.R.I.S Pro 11

 4 Hour Consulting Session with ME!

December 2007 Document Recognition Technology Overview Presentation

  • 1. Document Recognition a technology overview Presented by: Chris Riley of Artsyl Technologies, Inc.
  • 2. But First Your new AIIM Board! Exciting new events Golf Networking More Education Sessions
  • 3. What we will cover: Why Chris? What Are the Document Recognition Technologies Who Makes Them Buyer Beware The future Q&A Free Stuff!
  • 4. Why Chris? Who is Artsyl? What qualifies Chris to talk to me? When a developer turns to sales
  • 5. What we will cover: Why Chris? What Are the Document Recognition Technologies Who Makes Them Buyer Beware The future Q&A Free Stuff!
  • 6. Who knows what OCR is?
  • 7. The Technologies OCR – Optical Character Recognition ICR – Intelligent Character Recognition OMR – Optical Mark Recognition Barcode Handwriting All the other ones made up for marketing purposes CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition Assisted Capture Fixed Form Process Semi-Structured Forms Processing Unstructured Document Processing
  • 8. The Technologies: OCR OCR – Optical Character Recognition Ship To: ICR – Intelligent Character Recognition OMR – Optical Mark Recognition Barcode Handwriting All the other ones made up for marketing purposes CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition Assisted Capture Fixed Form Process Semi-Structured Forms Processing Unstructured Document Processing
  • 9. The Technologies: ICR OCR – Optical Character Recognition Ilya ICR – Intelligent Character Recognition OMR – Optical Mark Recognition Barcode Handwriting All the other ones made up for marketing purposes CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition Assisted Capture Fixed Form Process Semi-Structured Forms Processing Unstructured Document Processing
  • 10. The Technologies: OMR OCR – Optical Character Recognition ICR – Intelligent Character Recognition Card Account OMR – Optical Mark Recognition Barcode Handwriting All the other ones made up for marketing purposes CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition Assisted Capture Fixed Form Process Semi-Structured Forms Processing Unstructured Document Processing
  • 11. The Technologies: Barcode OCR – Optical Character Recognition ICR – Intelligent Character Recognition 1889094476620 OMR – Optical Mark Recognition Barcode Handwriting All the other ones made up for marketing purposes CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition Assisted Capture Fixed Form Process Semi-Structured Forms Processing Unstructured Document Processing
  • 12. The Technologies: Handwriting OCR – Optical Character Recognition * Critical * ICR – Intelligent Character Recognition OMR – Optical Mark Recognition Barcode Handwriting All the other ones made up for marketing purposes CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition Assisted Capture Fixed Form Process Semi-Structured Forms Processing Unstructured Document Processing
  • 13. The Technologies: Acronym Heaven OCR – Optical Character Recognition ICR – Intelligent Character Recognition OMR – Optical Mark Recognition Barcode Handwriting All the other ones made up for marketing purposes CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition Assisted Capture Fixed Form Process Semi-Structured Forms Processing Unstructured Document Processing
  • 14. The Technologies: CAR/LAR OCR – Optical Character Recognition ICR – Intelligent Character Recognition OMR – Optical Mark Recognition 2 hundred dollars & no cents Barcode Handwriting All the other ones made up for marketing purposes CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition Assisted Capture Fixed Form Process Semi-Structured Forms Processing Unstructured Document Processing
  • 15. The Technologies: Assisted Capture OCR – Optical Character Recognition ICR – Intelligent Character Recognition OMR – Optical Mark Recognition Barcode Handwriting All the other ones made up for marketing purposes CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition Assisted Capture Fixed Form Process Semi-Structured Forms Processing Unstructured Document Processing
  • 16. The Technologies: Fixed Form Processing OCR – Optical Character Recognition Name: Ilya ICR – Intelligent Character Recognition Date: 12/21/2982 OMR – Optical Mark Recognition Barcode Handwriting All the other ones made up for marketing purposes CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition Assisted Capture Fixed Form Process Semi-Structured Forms Processing Unstructured Document Processing
  • 17. The Technologies: Fixed Form Processing Name: Ilya Date: 12/21/2982
  • 18. 80% of business end-user documents are semi-structured
  • 19. The Technologies: Semi-Structured Forms (sample fields: Invoice No: 99044, Date: 06/09/04; Invoice No: 24567, Date: 06/09/04) OCR – Optical Character Recognition ICR – Intelligent Character Recognition OMR – Optical Mark Recognition Barcode Handwriting All the other ones made up for marketing purposes CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition Assisted Capture Fixed Form Process Semi-Structured Forms Processing Unstructured Document Processing
  • 20. The Technologies: Semi-Structured Forms Invoice No: 99044 Date: 06/09/04 Invoice No: 24567 Date: 06/09/04 (06/09/2004)
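The two sample invoices carry the same fields in different places, and the date is shown normalized from 06/09/04 to 06/09/2004. A minimal sketch of that idea, assuming the page text is already recognized, is to find fields by their labels rather than by fixed position and then normalize the value. The patterns, the function names `normalize_date` and `extract_fields`, and the month/day/year ordering are all assumptions made for illustration.

```python
import re
from datetime import datetime

# Label-anchored patterns (hypothetical; real layouts need many more variants).
FIELD_PATTERNS = {
    "invoice_no": re.compile(r"Invoice\s*No\.?\s*:?\s*(\d+)", re.IGNORECASE),
    "date": re.compile(r"Date\s*:?\s*(\d{1,2}/\d{1,2}/\d{2,4})", re.IGNORECASE),
}


def normalize_date(raw):
    """Return MM/DD/YYYY for a date given with a 2- or 4-digit year (US order assumed)."""
    for fmt in ("%m/%d/%Y", "%m/%d/%y"):
        try:
            return datetime.strptime(raw, fmt).strftime("%m/%d/%Y")
        except ValueError:
            continue
    raise ValueError(f"unrecognized date: {raw!r}")


def extract_fields(page_text):
    """Find each labelled field wherever it sits on the page; missing fields map to None."""
    found = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(page_text)
        found[name] = match.group(1) if match else None
    if found["date"] is not None:
        found["date"] = normalize_date(found["date"])
    return found


# Two layouts with the same fields in different positions.
layout_a = "ACME Corp\nInvoice No: 99044\nDate: 06/09/04\nTotal: 10.00"
layout_b = "Date: 06/09/04        Widgets Inc\nTotal: 55.10\nInvoice No: 24567"
print(extract_fields(layout_a))
print(extract_fields(layout_b))
```

Label-anchored search is only one possible strategy; it is used here because it is the simplest way to show why a fixed-zone template is not enough for semi-structured documents.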
  • 21. The Technologies: Semi-Structured Forms (sample field labels: Consignee, Consignor, Date, Term) OCR – Optical Character Recognition ICR – Intelligent Character Recognition OMR – Optical Mark Recognition Barcode Handwriting All the other ones made up for marketing purposes CAR/LAR ( Check21 ) – Courtesy and Legal Amount Recognition Assisted Capture Fixed Form Process Semi-Structured Forms Processing Unstructured Document Processing
  • 22. The Technologies: Common Processes: Full page conversion, Classification, Index level extraction, Redaction, Routing, Auto Filing, Re-Purposing, Image Rotation
  • 23. The Technologies: Full page conversion: Image file to electronic data file; ALL text on the page. Includes: Image Pre-processing, Document Analysis/Zoning, Extraction, Export (commonly PDF, DOC)
  • 24. The Technologies: Classification: Software tells you the document type; scan batches of mixed documents (example types: Invoice, Bill of Lading, Check, PO)
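Classification can be sketched, very roughly, as scoring recognized page text against cues for each document type and picking the best match. The four types and the keyword lists below are assumptions for illustration, and `classify` is a hypothetical name; commercial products typically combine layout, image and text features rather than keywords alone.

```python
# Hypothetical keyword cues per document type (illustrative, not exhaustive).
KEYWORDS = {
    "Invoice": ("invoice", "amount due", "remit to"),
    "Bill of Lading": ("bill of lading", "consignee", "carrier"),
    "Check": ("pay to the order of", "memo"),
    "PO": ("purchase order", "ship to", "po number"),
}


def classify(page_text):
    """Return the document type whose keywords occur most often, or 'Unknown'."""
    lowered = page_text.lower()
    scores = {
        doc_type: sum(lowered.count(keyword) for keyword in keywords)
        for doc_type, keywords in KEYWORDS.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else "Unknown"


# A mixed batch is then just a loop over pages.
batch = [
    "INVOICE\nInvoice No: 99044\nAmount Due: 10.00",
    "Pay to the order of John Doe\nMemo: rent",
    "Bill of Lading\nConsignee: X\nCarrier: Y",
]
print([classify(page) for page in batch])
```

Returning "Unknown" when nothing matches keeps unrecognized pages out of the automated flow so that a person can look at them.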
  • 25. The Technologies: Index Level Extraction: Just certain required fields extracted; normalization of data; export usually to a database. Example fields: Invoice Number, Invoice Date, Total Amt Due, Term
  • 26. The Technologies: How Accurate? A better question is how you determine accuracy: Document Type Accuracy, Field/Zone Location Accuracy, Data Type Accuracy, Character Accuracy
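To make the difference between these accuracy levels concrete, here is a small sketch that measures character accuracy with edit distance and field accuracy as the share of fields that match exactly. The definitions and the function names are one reasonable choice assumed for illustration; vendors may define their figures differently, which is part of why the question matters.

```python
def edit_distance(truth, recognized):
    """Levenshtein distance: minimum insertions, deletions and substitutions."""
    prev = list(range(len(recognized) + 1))
    for i, t_char in enumerate(truth, 1):
        curr = [i]
        for j, r_char in enumerate(recognized, 1):
            cost = 0 if t_char == r_char else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def character_accuracy(truth, recognized):
    """1 minus the error rate per ground-truth character, floored at 0."""
    if not truth:
        return 1.0 if not recognized else 0.0
    return max(0.0, 1 - edit_distance(truth, recognized) / len(truth))


def field_accuracy(truth_fields, recognized_fields):
    """Fraction of ground-truth fields whose recognized value matches exactly."""
    if not truth_fields:
        return 1.0
    correct = sum(
        1 for name, value in truth_fields.items()
        if recognized_fields.get(name) == value
    )
    return correct / len(truth_fields)


truth = {"invoice_no": "99044", "date": "06/09/2004"}
recognized = {"invoice_no": "99O44", "date": "06/09/2004"}  # letter O read for zero
print(character_accuracy("9904406/09/2004", "99O4406/09/2004"))
print(field_accuracy(truth, recognized))
```

In this example a single misread character leaves character accuracy high while half of the fields are wrong, which is why a quoted accuracy figure means little without knowing which level it refers to.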
  • 27. The Technologies: Common usage scenarios: Document Conversion, Document Archival / Retrieval, Invoice Processing, Insurance Processing (medical, mortgage), Waybill Processing, Survey Processing
  • 28. What we will cover: Why Chris? What Are the Document Recognition Technologies Who Makes Them Buyer Beware The future Q&A Free Stuff!
  • 29. There really are only 3 core technology providers. It takes 50 man-years to develop OCR using current computing abilities.
  • 30. Who Makes Them: Core Engines: ABBYY, Nuance (formerly ScanSoft), ReadI.R.I.S, Océ, CharacTell, ParaScript, A2iA; a handful of open source; a handful of other vendors; two handfuls of OLD engines
  • 31. Who Makes Them: Who Licenses Them EVERYONE ELSE! AnaComp Anydoc BancTec BrainWare Captaris Captivation Cardiff CVision DataCap DigiTech eCopy EMC Documentum Kofax LaserFiche LeadTools Microsoft NSi AutoStore OnBase Perceptive Imaging ReadSoft SER Top Image Systems Tower Westbrook Xerox Hundreds More
  • 32. What we will cover: Why Chris? What Are the Document Recognition Technologies Who Makes Them Buyer Beware The future Q&A Free Stuff!
  • 33. 30% of organizations that purchase, purchase the wrong thing. Over 50% of organizations that purchase never use it properly.
  • 34. Buyer Beware: If OCR is the reason for buying a solution, know what engine it is! Talk about the WHOLE solution, not the pieces. Get past marketing gimmicks. Trust, love, be certain of your reseller / vendor.
  • 35. Buyer Beware: Know your engine What version? Will they upgrade?
  • 36. Buyer Beware: Talk about Whole Solution Scanner / Input Capture Storage Have Requirements List Before
  • 37. Buyer Beware: Get past Gimmicks: NOTHING is 100%! All canned demos work perfectly. Always see a test on your own documents. Version numbers are really arbitrary.
  • 38. Buyer Beware: Trust your vendor / reseller: Support after sale (test them). Where to get professional services. Do they understand the solution and not just the pieces?
  • 39. What we will cover: Why Chris? What Are the Document Recognition Technologies Who Makes Them Buyer Beware The future Q&A Free Stuff!
  • 40. The Future: Full-page OCR will be a commodity. Advanced document processing will become mainstream but less required. Think about what to do now that you will be gathering data rapidly. There will be a new approach to OCR.
  • 41. What we will cover: Why Chris? What Are the Document Recognition Technologies Who Makes Them Buyer Beware The future Q&A Free Stuff!
  • 42. Questions and Answers Before you ask
  • 43. What we will cover: Why Chris? What Are the Document Recognition Technologies Who Makes Them Buyer Beware The future Q&A Free Stuff!
  • 44. Free Stuff: Copy of ABBYY FineReader Pro 9.0; copy of Nuance OmniPage 16; copy of ReadI.R.I.S Pro 11; 4-hour consulting session with ME!