SlideShare uma empresa Scribd logo
1 de 46
Washington DC, November 2011
George Roth, Adonis Damian
www.recognos.com
 A document management system (DMS) is a computer system (or
  set of computer programs) used to track and store electronic
  documents and/or images of paper documents. It is usually also
  capable of keeping track of the different versions created by different
  users (history tracking). The term has some overlap with the concepts
  of content management systems. It is often viewed as a component
  of enterprise content management (ECM) systems and related to
  digital asset management, document imaging, workflow systems and
  records management systems.
 Make the formatted equivalent with non-formatted !




November 2011
CLASSICAL           NEW
   Metadata           Compliance
   Integration        Accessibility
   Capture            Interactivity
   Indexing           Augmentation
   Storage            Translation
   Retrieval          Linking – Relationships
   Distribution       Sentiment Analysis
   Security           New Search (Semantic Tagging, Deep
   Workflow            Search, NL Questions)
   Collaboration
   Versioning
   Search
   Publishing
   …




November 2011
   Volume
   Labor extensive
   The “research project” – 40% – 60% data
    gathering
   Metadata independent of content
   Shallow Search
   Hard to understand by non-experts


November 2011
   NLP Natural Language Processing –
    understand the meaning of documents
    (statistic, machine learning, hybrid, graph
    based)
   Semantic Search – tagging
   Data Integration
   Sentiment Analysis
   Linked Open Data – Linked Data
   Inference - Reasoning

November 2011
   Inside – Controlled Environment - TRUST
   Inside – Security issues
   Same techniques as outside the enterprise
   Integrates non-formatted with formatted
    data
   Easy to measure the effects - ROI
   Add on to the existing KM models
   Emerging area – Semantic technologies
    started on the www
November 2011
New features will become commodity in 2-3 years

   Compliance
   Data Extraction, Comparison, Change
    Analysis
   Interactivity
   Augmentation
   Translation
   Linking – Relationships
   Sentiment Analysis
   New Search (Semantic Tagging, Deep Search,
    NL Questions)
November 2011
   Microsoft: Powerset (Bing), Fast Search, Jinni
   Google: Freebase, Needlebase
   Apple: SIRI
   Etc…




November 2011
 Embedded Compliance Rules




November 2011
 Example there is a rule: – email –
Rule 0134C: “Not allowed to mention a percentage as a
  profit promise investing with the firm”
 In an email:
“ Dear John, Our company has an amazing method to
  invest, so that you will make at least 10% profit in 3
  months !!!! “
 The email was stopped – sent to Compliance with the
  message: “Violation of the Rule 0134C”



November 2011
   MFIP data extraction
   Link to the original document




November 2011
 Data Extraction, Comparison,
    Change Analysis



November 2011
November 2011
November 2011
   Create Alarm when Trading Policy Changes
   Create Alarm when Commissions Change
    (fields)
   Create Alarms when member of the Board
    Changes




November 2011
 Interactivity




November 2011
November 2011
 Augmentation




November 2011
November 2011
 Automated Translation




November 2011
   Google Translate
     Great for simple translation – emails, non
        technical documents

   Language Weaver
     Specialized translation through machine learning
     Train the system per domains



November 2011
 Sentiment Analysis




November 2011
   Media Sentry
   Open Amplify, Expert Systems, Lymbix
   NLP and machine learning




November 2011
November 2011
 Search




November 2011
November 2011
November 2011
November 2011
November 2011
November 2011
November 2011
 Complex App Samples




November 2011
November 2011
WWW

                 Google        Meltwaters                                            Forums /
                                               Twitter           Facebook                                Websites
                 Alerts          Alerts                                               Blogs




                           Exchange
                              Server



                                                         External Data Pull


                          Exchange                 Twitter          Facebook              80legs                 Diffbot
                            Adapter               Adapter             Adapter            Adapter                Adapter




                Internal Message Storage

                                        File
                                      Server


                                                                      Natural Language Processing


                                                                                                     Uploaded
                                                                                ESSEX               Taxonomy




                Web User Interface
                                                                                Data Storage


                                                                                   MS SQL Server




November 2011
   Amdocs AIDA (AMDOCS Intelligent Decision Automation)




November 2011
November 2011
Display Linked Data   Ask a question –   Entity Lookup
                       semantic search

November 2011
November 2011
November 2011
November 2011
November 2011
November 2011
November 2011
   Interactive - Exists
   Search – Semantic Search, Q&A
   Semantic Tagging – Summarization
   LOD with domains
   Linked : People, Companies, Locations,
    Specific Terms
   Example a travel book


November 2011
The following technologies were used:
- iQser – GIN
- Clark & Parsia – Spanner, StarDog
- Expert System – NLP
- GATE
- Smart Logic – Enterprise Query Platform – Fast Search – Microsoft
  Sharepoint 11
- Revelytix
- Cognition
- Franz Systems
- DiffBot
- Ontotext




November 2011
George Roth
President and CEO Recognos Inc.
San Francisco
www.recognos.com
groth@recognos.com
Drew Warren
CEO Recognos Financial
New York
dwarren@recognosfinancial.com
www.recognosfinancial.com



November 2011

Mais conteúdo relacionado

Semelhante a Semantic Technology in Document Management

Stug-paf kiet 28 january live and on location-Enterprise Content Management
Stug-paf kiet 28 january live and on location-Enterprise Content Management Stug-paf kiet 28 january live and on location-Enterprise Content Management
Stug-paf kiet 28 january live and on location-Enterprise Content Management Shakir Majeed Khan
 
SharePoint 2010 and Changing Business Needs-MAJU 2011
SharePoint 2010 and Changing Business Needs-MAJU 2011SharePoint 2010 and Changing Business Needs-MAJU 2011
SharePoint 2010 and Changing Business Needs-MAJU 2011Shakir Majeed Khan
 
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing TagSPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing TagKnowledge Management Associates, LLC
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic WebNuxeo
 
Content is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
Content is King - ECM in SharePoint 2010 - SharePoint Saturday DenverContent is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
Content is King - ECM in SharePoint 2010 - SharePoint Saturday DenverChris McNulty
 
SharePoint Saturday DC by ImageTech Systems - David Strock
SharePoint Saturday DC by ImageTech Systems - David StrockSharePoint Saturday DC by ImageTech Systems - David Strock
SharePoint Saturday DC by ImageTech Systems - David StrockJeff Shuey
 
KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101Chris McNulty
 
SharePoint & ERM
SharePoint & ERMSharePoint & ERM
SharePoint & ERMNick Inglis
 
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...Artificial Intelligence Institute at UofSC
 
SharePoint Server 2007 Overview - TechMentor 2007 with Joel Oleson
SharePoint Server 2007 Overview - TechMentor 2007 with Joel OlesonSharePoint Server 2007 Overview - TechMentor 2007 with Joel Oleson
SharePoint Server 2007 Overview - TechMentor 2007 with Joel OlesonJoel Oleson
 
Content Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebContent Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebAmit Sheth
 
Content Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebContent Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebAmit Sheth
 
SharePoint 2010- Changing business needs
SharePoint 2010- Changing business needsSharePoint 2010- Changing business needs
SharePoint 2010- Changing business needsShakir Majeed Khan
 
Fishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar PresentationFishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar PresentationKim Negaard
 
Productie Sharepoint Presentatie
Productie Sharepoint PresentatieProductie Sharepoint Presentatie
Productie Sharepoint PresentatieJan van der Kolk
 
Driving End User Adoption in SharePoint 2013 & 2010 - EPC Group
Driving End User Adoption in SharePoint 2013 & 2010 - EPC GroupDriving End User Adoption in SharePoint 2013 & 2010 - EPC Group
Driving End User Adoption in SharePoint 2013 & 2010 - EPC GroupEPC Group
 

Semelhante a Semantic Technology in Document Management (20)

Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
 
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
 
Stug-paf kiet 28 january live and on location-Enterprise Content Management
Stug-paf kiet 28 january live and on location-Enterprise Content Management Stug-paf kiet 28 january live and on location-Enterprise Content Management
Stug-paf kiet 28 january live and on location-Enterprise Content Management
 
SharePoint 2010 and Changing Business Needs-MAJU 2011
SharePoint 2010 and Changing Business Needs-MAJU 2011SharePoint 2010 and Changing Business Needs-MAJU 2011
SharePoint 2010 and Changing Business Needs-MAJU 2011
 
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing TagSPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
Content is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
Content is King - ECM in SharePoint 2010 - SharePoint Saturday DenverContent is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
Content is King - ECM in SharePoint 2010 - SharePoint Saturday Denver
 
SharePoint Saturday DC by ImageTech Systems - David Strock
SharePoint Saturday DC by ImageTech Systems - David StrockSharePoint Saturday DC by ImageTech Systems - David Strock
SharePoint Saturday DC by ImageTech Systems - David Strock
 
KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101KMWorld SharePoint 2010-Admin 101
KMWorld SharePoint 2010-Admin 101
 
SharePoint & ERM
SharePoint & ERMSharePoint & ERM
SharePoint & ERM
 
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
 
SharePoint Server 2007 Overview - TechMentor 2007 with Joel Oleson
SharePoint Server 2007 Overview - TechMentor 2007 with Joel OlesonSharePoint Server 2007 Overview - TechMentor 2007 with Joel Oleson
SharePoint Server 2007 Overview - TechMentor 2007 with Joel Oleson
 
Content Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebContent Management, Metadata and Semantic Web
Content Management, Metadata and Semantic Web
 
Content Management, Metadata and Semantic Web
Content Management, Metadata and Semantic WebContent Management, Metadata and Semantic Web
Content Management, Metadata and Semantic Web
 
Sp tech con-admin101
Sp tech con-admin101Sp tech con-admin101
Sp tech con-admin101
 
SharePoint 2010- Changing business needs
SharePoint 2010- Changing business needsSharePoint 2010- Changing business needs
SharePoint 2010- Changing business needs
 
Fishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar PresentationFishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar Presentation
 
Asap session 1
Asap session 1Asap session 1
Asap session 1
 
Productie Sharepoint Presentatie
Productie Sharepoint PresentatieProductie Sharepoint Presentatie
Productie Sharepoint Presentatie
 
Driving End User Adoption in SharePoint 2013 & 2010 - EPC Group
Driving End User Adoption in SharePoint 2013 & 2010 - EPC GroupDriving End User Adoption in SharePoint 2013 & 2010 - EPC Group
Driving End User Adoption in SharePoint 2013 & 2010 - EPC Group
 

Último

Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 

Último (20)

Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 

Semantic Technology in Document Management

  • 1. Washington DC, November 2011 George Roth, Adonis Damian www.recognos.com
  • 2.  A document management system (DMS) is a computer system (or set of computer programs) used to track and store electronic documents and/or images of paper documents. It is usually also capable of keeping track of the different versions created by different users (history tracking). The term has some overlap with the concepts of content management systems. It is often viewed as a component of enterprise content management (ECM) systems and related to digital asset management, document imaging, workflow systems and records management systems.  Make the formatted equivalent with non-formatted ! November 2011
  • 3. CLASSICAL NEW  Metadata  Compliance  Integration  Accessibility  Capture  Interactivity  Indexing  Augmentation  Storage  Translation  Retrieval  Linking – Relationships  Distribution  Sentiment Analysis  Security  New Search (Semantic Tagging, Deep  Workflow Search, NL Questions)  Collaboration  Versioning  Search  Publishing  … November 2011
  • 4. Volume  Labor extensive  The “research project” – 40% – 60% data gathering  Metadata independent of content  Shallow Search  Hard to understand by non-experts November 2011
  • 5. NLP Natural Language Processing – understand the meaning of documents (statistic, machine learning, hybrid, graph based)  Semantic Search – tagging  Data Integration  Sentiment Analysis  Linked Open Data – Linked Data  Inference - Reasoning November 2011
  • 6. Inside – Controlled Environment - TRUST  Inside – Security issues  Same techniques as outside the enterprise  Integrates non-formatted with formatted data  Easy to measure the effects - ROI  Add on to the existing KM models  Emerging area – Semantic technologies started on the www November 2011
  • 7. New features will become commodity in 2-3 years  Compliance  Data Extraction, Comparison, Change Analysis  Interactivity  Augmentation  Translation  Linking – Relationships  Sentiment Analysis  New Search (Semantic Tagging, Deep Search, NL Questions) November 2011
  • 8. Microsoft: Powerset (Bing), Fast Search, Jinni  Google: Freebase, Needlebase  Apple: SIRI  Etc… November 2011
  • 9.  Embedded Compliance Rules November 2011
  • 10.  Example there is a rule: – email – Rule 0134C: “Not allowed to mention a percentage as a profit promise investing with the firm”  In an email: “ Dear John, Our company has an amazing method to invest, so that you will make at least 10% profit in 3 months !!!! “ The email was stopped – sent to Compliance with the message: “Violation of the Rule 0134C” November 2011
  • 11. MFIP data extraction  Link to the original document November 2011
  • 12.  Data Extraction, Comparison, Change Analysis November 2011
  • 15. Create Alarm when Trading Policy Changes  Create Alarm when Commissions Change (fields)  Create Alarms when member of the Board Changes November 2011
  • 21. Google Translate  Great for simple translation – emails, non technical documents  Language Weaver  Specialized translation through machine learning  Train the system per domains November 2011
  • 23. Media Sentry  Open Amplify, Expert Systems, Lymbix  NLP and machine learning November 2011
  • 32.  Complex App Samples November 2011
  • 34. WWW Google Meltwaters Forums / Twitter Facebook Websites Alerts Alerts Blogs Exchange Server External Data Pull Exchange Twitter Facebook 80legs Diffbot Adapter Adapter Adapter Adapter Adapter Internal Message Storage File Server Natural Language Processing Uploaded ESSEX Taxonomy Web User Interface Data Storage MS SQL Server November 2011
  • 35. Amdocs AIDA (AMDOCS Intelligent Decision Automation) November 2011
  • 37. Display Linked Data Ask a question – Entity Lookup semantic search November 2011
  • 44. Interactive - Exists  Search – Semantic Search, Q&A  Semantic Tagging – Summarization  LOD with domains  Linked : People, Companies, Locations, Specific Terms  Example a travel book November 2011
  • 45. The following technologies were used: - iQser – GIN - Clark & Parsia – Spanner, StarDog - Expert System – NLP - GATE - Smart Logic – Enterprise Query Platform – Fast Search – Microsoft Sharepoint 11 - Revelytix - Cognition - Franz Systems - DiffBot - Ontotext November 2011
  • 46. George Roth President and CEO Recognos Inc. San Francisco www.recognos.com groth@recognos.com Drew Warren CEO Recognos Financial New York dwarren@recognosfinancial.com www.recognosfinancial.com November 2011