SlideShare uma empresa Scribd logo
1 de 11
Employ the Cloud for
Efficient Content
Analytics
Enabling Optimal Decision Making
with the Cloud and Content Analytics
Samir A. Batla
Principal Product Manager, EMC IIG
samir.batla@emc.com

© Copyright 2011 EMC Corporation. All rights reserved.

1
Information Growth – Three Dimensional
VOLUME

DISPERSION

CONTENT
ANALYTICS

RICHNESS

INFORMATION

© Copyright 2011 EMC Corporation. All rights reserved.

2
Big Data Size: The Volume Of Content
Continues To Explode
The Digital Universe 2010 - 2020
90%
91% Video
Unstructured1
It’s everywhere
(2014)2

Data Volume
Growing 44x
2010:
1.2
Zettabytes

Source: IDC Digital2011 EMCStudy, sponsoredrights reserved.2010
© Copyright Universe Corporation. All by EMC, May

2020: 35.2
Zettabytes

3
Types of Content Analysis
Driven by Need

“I need to find
“I need to discover
documents about
new knowledge to
some concept”
manage my
taxonomies”
Business Analyst: “I’m composing a
workflow to protect documents
containing sensitive information. I need
to find content that contains employee id
patterns in order to apply rights
management policies”

© Copyright 2011 EMC Corporation. All rights reserved.

4
Types of Content Analysis
To list a few…
• Categorization – Taxonomy Driven
– Indicates what the content is about in a certain context

• Named Entity Extraction – NLP-based
– Finds what’s mentioned in the content

• Pattern Detection – Rules-based e.g. Regex
– Finds patterns in the content

• Sentiment Analysis, Topic/Theme analysis, etc.

© Copyright 2011 EMC Corporation. All rights reserved.

5
The Journey To Your Cloud
Private Cloud is a logical first step
Enterprise IT

Private Cloud

Complex
Trusted
Expensive
Controlled
Inflexible
Reliable
Siloed
Secure

Public Cloud
Simple
Low Cost
Flexible
Dynamic

Infrastructure

“70% Will Spend More On Private Cloud through
2012” GARTNER DATA CENTER CONFERENCE 2009

© Copyright 2011 EMC Corporation. All rights reserved.

6
The Hybrid Cloud
Best of Private and Public Clouds
Hybrid Cloud

Information

Private Cloud

Public Cloud

Hybrid Cloud use will triple within the next three
years. Sand Hill Group 2010

© Copyright 2011 EMC Corporation. All rights reserved.

7
Latency: The Cloud’s Achilles Heel
Shipping Costs

© Copyright 2011 EMC Corporation. All rights reserved.

8
Applications In The Cloud
ECM

CUSTOMER
COMMUNICATION

CONTENT
ANALYTICS
GOVERNANCE

DATA
ANALYTICS

© Copyright 2011 EMC Corporation. All rights reserved.

CAPTURE/INGEST

CONTENT
DELIVERY

9
Private Cloud for Efficient Content
Analytics
Ideal First Step

Trusted
Controlled
Reliable
Secure

© Copyright 2011 EMC Corporation. All rights reserved.

Simple
Low Cost
Flexible
Dynamic

10
THANK YOU

© Copyright 2011 EMC Corporation. All rights reserved.

11

Mais conteúdo relacionado

Mais procurados

Creating and Managing a Private or Hybrid Cloud: A Strategy Session
Creating and Managing a Private or Hybrid Cloud: A Strategy SessionCreating and Managing a Private or Hybrid Cloud: A Strategy Session
Creating and Managing a Private or Hybrid Cloud: A Strategy SessionRightScale
 
Cloud Computing and Distance Education
Cloud Computing and Distance EducationCloud Computing and Distance Education
Cloud Computing and Distance EducationRoryMcGreal
 
Cloud Computing in your library’s future? How to ensure a sunny outcome. (Car...
Cloud Computing in your library’s future? How to ensure a sunny outcome. (Car...Cloud Computing in your library’s future? How to ensure a sunny outcome. (Car...
Cloud Computing in your library’s future? How to ensure a sunny outcome. (Car...Národní technická knihovna (NTK)
 
Why edge computing is critical to hybrid IT and cloud success
Why edge computing is critical to hybrid IT and cloud successWhy edge computing is critical to hybrid IT and cloud success
Why edge computing is critical to hybrid IT and cloud successClearSky Data
 
Impact of cloud computing in education, e governance
Impact of cloud computing in education, e governanceImpact of cloud computing in education, e governance
Impact of cloud computing in education, e governanceAsim Kumar Pathak
 
Iot and cloud computing
Iot and cloud computingIot and cloud computing
Iot and cloud computingeteshagarwal1
 
International Journal on Cloud Computing: Services and Architecture (IJCCSA)
 International Journal on Cloud Computing: Services and Architecture (IJCCSA) International Journal on Cloud Computing: Services and Architecture (IJCCSA)
International Journal on Cloud Computing: Services and Architecture (IJCCSA)ijccsa
 
Fog Computing and Cloud Computing
Fog Computing and Cloud ComputingFog Computing and Cloud Computing
Fog Computing and Cloud ComputingAhmed Banafa
 
Why is hybrid cloud still so hard? 4 keys to unlock the future of IT
Why is hybrid cloud still so hard? 4 keys to unlock the future of ITWhy is hybrid cloud still so hard? 4 keys to unlock the future of IT
Why is hybrid cloud still so hard? 4 keys to unlock the future of ITClearSky Data
 
Toward a global data infrastructure
Toward a global data infrastructureToward a global data infrastructure
Toward a global data infrastructureieeechennai
 
A Survey on Security and Privacy Issues in Edge Computing-Assisted Internet o...
A Survey on Security and Privacy Issues in Edge Computing-Assisted Internet o...A Survey on Security and Privacy Issues in Edge Computing-Assisted Internet o...
A Survey on Security and Privacy Issues in Edge Computing-Assisted Internet o...DESMOND YUEN
 
Introduction to roof computing by Nishant Krishna
Introduction to roof computing by Nishant KrishnaIntroduction to roof computing by Nishant Krishna
Introduction to roof computing by Nishant KrishnaCodeOps Technologies LLP
 
cloud storage
cloud storagecloud storage
cloud storagedriley9
 
Presentation on Cloud Storage
Presentation on Cloud StoragePresentation on Cloud Storage
Presentation on Cloud StorageRachitSinghal17
 
Importance of cloud computing in education sector!
Importance of cloud computing in education sector!Importance of cloud computing in education sector!
Importance of cloud computing in education sector!Sushil Deshmukh
 
Challenges of Cloud Computing
Challenges of Cloud ComputingChallenges of Cloud Computing
Challenges of Cloud Computinglavanyamohan45
 

Mais procurados (20)

Creating and Managing a Private or Hybrid Cloud: A Strategy Session
Creating and Managing a Private or Hybrid Cloud: A Strategy SessionCreating and Managing a Private or Hybrid Cloud: A Strategy Session
Creating and Managing a Private or Hybrid Cloud: A Strategy Session
 
Cloud Computing and Distance Education
Cloud Computing and Distance EducationCloud Computing and Distance Education
Cloud Computing and Distance Education
 
Cloud Computing in your library’s future? How to ensure a sunny outcome. (Car...
Cloud Computing in your library’s future? How to ensure a sunny outcome. (Car...Cloud Computing in your library’s future? How to ensure a sunny outcome. (Car...
Cloud Computing in your library’s future? How to ensure a sunny outcome. (Car...
 
Why edge computing is critical to hybrid IT and cloud success
Why edge computing is critical to hybrid IT and cloud successWhy edge computing is critical to hybrid IT and cloud success
Why edge computing is critical to hybrid IT and cloud success
 
Impact of cloud computing in education, e governance
Impact of cloud computing in education, e governanceImpact of cloud computing in education, e governance
Impact of cloud computing in education, e governance
 
Iot and cloud computing
Iot and cloud computingIot and cloud computing
Iot and cloud computing
 
International Journal on Cloud Computing: Services and Architecture (IJCCSA)
 International Journal on Cloud Computing: Services and Architecture (IJCCSA) International Journal on Cloud Computing: Services and Architecture (IJCCSA)
International Journal on Cloud Computing: Services and Architecture (IJCCSA)
 
Fog computing
Fog computingFog computing
Fog computing
 
Cloud Computing Introduction
Cloud Computing IntroductionCloud Computing Introduction
Cloud Computing Introduction
 
Fog Computing and Cloud Computing
Fog Computing and Cloud ComputingFog Computing and Cloud Computing
Fog Computing and Cloud Computing
 
Why is hybrid cloud still so hard? 4 keys to unlock the future of IT
Why is hybrid cloud still so hard? 4 keys to unlock the future of ITWhy is hybrid cloud still so hard? 4 keys to unlock the future of IT
Why is hybrid cloud still so hard? 4 keys to unlock the future of IT
 
Toward a global data infrastructure
Toward a global data infrastructureToward a global data infrastructure
Toward a global data infrastructure
 
A Survey on Security and Privacy Issues in Edge Computing-Assisted Internet o...
A Survey on Security and Privacy Issues in Edge Computing-Assisted Internet o...A Survey on Security and Privacy Issues in Edge Computing-Assisted Internet o...
A Survey on Security and Privacy Issues in Edge Computing-Assisted Internet o...
 
Introduction to roof computing by Nishant Krishna
Introduction to roof computing by Nishant KrishnaIntroduction to roof computing by Nishant Krishna
Introduction to roof computing by Nishant Krishna
 
Cloud computing ppts
Cloud computing pptsCloud computing ppts
Cloud computing ppts
 
Cloud computing ppts
Cloud computing pptsCloud computing ppts
Cloud computing ppts
 
cloud storage
cloud storagecloud storage
cloud storage
 
Presentation on Cloud Storage
Presentation on Cloud StoragePresentation on Cloud Storage
Presentation on Cloud Storage
 
Importance of cloud computing in education sector!
Importance of cloud computing in education sector!Importance of cloud computing in education sector!
Importance of cloud computing in education sector!
 
Challenges of Cloud Computing
Challenges of Cloud ComputingChallenges of Cloud Computing
Challenges of Cloud Computing
 

Destaque

Gerard Valenduc: "Preventing the risk of digital exclusion among youth"
Gerard Valenduc: "Preventing the risk of digital exclusion among youth"Gerard Valenduc: "Preventing the risk of digital exclusion among youth"
Gerard Valenduc: "Preventing the risk of digital exclusion among youth"TELECENTRE EUROPE
 
Youth Depression and Critical Thinking About Youth Depression
Youth Depression and Critical Thinking About Youth DepressionYouth Depression and Critical Thinking About Youth Depression
Youth Depression and Critical Thinking About Youth Depressionsignoroni
 
Història en àrab
Història en àrabHistòria en àrab
Història en àrabMAICA CIMA
 
John Pullicino Certificates
John Pullicino CertificatesJohn Pullicino Certificates
John Pullicino Certificatesjohnpullicino
 
Gianluca Misuraca: "Assessing the socio-economic impact of Telecentres"
Gianluca Misuraca: "Assessing the socio-economic impact of Telecentres"Gianluca Misuraca: "Assessing the socio-economic impact of Telecentres"
Gianluca Misuraca: "Assessing the socio-economic impact of Telecentres"TELECENTRE EUROPE
 
Vocabulari complet
Vocabulari completVocabulari complet
Vocabulari completMAICA CIMA
 
Aparell Locomotor[3]. Exercicis
Aparell Locomotor[3]. ExercicisAparell Locomotor[3]. Exercicis
Aparell Locomotor[3]. ExercicisMAICA CIMA
 

Destaque (14)

Gerard Valenduc: "Preventing the risk of digital exclusion among youth"
Gerard Valenduc: "Preventing the risk of digital exclusion among youth"Gerard Valenduc: "Preventing the risk of digital exclusion among youth"
Gerard Valenduc: "Preventing the risk of digital exclusion among youth"
 
Youth Depression and Critical Thinking About Youth Depression
Youth Depression and Critical Thinking About Youth DepressionYouth Depression and Critical Thinking About Youth Depression
Youth Depression and Critical Thinking About Youth Depression
 
Història en àrab
Història en àrabHistòria en àrab
Història en àrab
 
John Pullicino Certificates
John Pullicino CertificatesJohn Pullicino Certificates
John Pullicino Certificates
 
Gianluca Misuraca: "Assessing the socio-economic impact of Telecentres"
Gianluca Misuraca: "Assessing the socio-economic impact of Telecentres"Gianluca Misuraca: "Assessing the socio-economic impact of Telecentres"
Gianluca Misuraca: "Assessing the socio-economic impact of Telecentres"
 
Fibras textiles
Fibras textilesFibras textiles
Fibras textiles
 
ABC
ABCABC
ABC
 
La Cèl·lula
La Cèl·lulaLa Cèl·lula
La Cèl·lula
 
Vocabulari complet
Vocabulari completVocabulari complet
Vocabulari complet
 
Roma
RomaRoma
Roma
 
Aparell Locomotor[3]. Exercicis
Aparell Locomotor[3]. ExercicisAparell Locomotor[3]. Exercicis
Aparell Locomotor[3]. Exercicis
 
Meteorológia 1
Meteorológia 1Meteorológia 1
Meteorológia 1
 
презентация2
презентация2презентация2
презентация2
 
Meteorológia 2
Meteorológia 2Meteorológia 2
Meteorológia 2
 

Semelhante a Employ the Cloud for Efficient Content Analytics - 10 november 2011

The Three Stages of Cloud Adoption - RightScale Compute 2013
The Three Stages of Cloud Adoption - RightScale Compute 2013The Three Stages of Cloud Adoption - RightScale Compute 2013
The Three Stages of Cloud Adoption - RightScale Compute 2013RightScale
 
Big data, data science & fast data
Big data, data science & fast dataBig data, data science & fast data
Big data, data science & fast dataKunal Joshi
 
Wicsa2011 cloud tutorial
Wicsa2011 cloud tutorialWicsa2011 cloud tutorial
Wicsa2011 cloud tutorialAnna Liu
 
Hortonworks Hybrid Cloud - Putting you back in control of your data
Hortonworks Hybrid Cloud - Putting you back in control of your dataHortonworks Hybrid Cloud - Putting you back in control of your data
Hortonworks Hybrid Cloud - Putting you back in control of your dataScott Clinton
 
KMWorld - The Future of Enterprise Content Management (ECM)
KMWorld - The Future of Enterprise Content Management (ECM)KMWorld - The Future of Enterprise Content Management (ECM)
KMWorld - The Future of Enterprise Content Management (ECM)Nuxeo
 
Greg Brown - Intel Big Data & Cloud Summit 2013
Greg Brown - Intel Big Data & Cloud Summit 2013Greg Brown - Intel Big Data & Cloud Summit 2013
Greg Brown - Intel Big Data & Cloud Summit 2013IntelAPAC
 
Cloud Technology to Facilitate Growth
Cloud Technology to Facilitate GrowthCloud Technology to Facilitate Growth
Cloud Technology to Facilitate GrowthIconnyx
 
CCSK, cloud security framework, Indonesia
CCSK, cloud security framework, IndonesiaCCSK, cloud security framework, Indonesia
CCSK, cloud security framework, IndonesiaWise Pacific Venture
 
Tempo - Mobile access with Governance
Tempo - Mobile access with GovernanceTempo - Mobile access with Governance
Tempo - Mobile access with GovernanceGabe Faraone
 
glenn_amblercloud_security_ncc_event_22-may-2012_v1 (9)
glenn_amblercloud_security_ncc_event_22-may-2012_v1 (9)glenn_amblercloud_security_ncc_event_22-may-2012_v1 (9)
glenn_amblercloud_security_ncc_event_22-may-2012_v1 (9)Glenn Ambler
 
Moving enterprise IT to the cloud
Moving enterprise IT to the cloudMoving enterprise IT to the cloud
Moving enterprise IT to the cloudJan Wiersma
 
Presentation big data
Presentation   big dataPresentation   big data
Presentation big dataxKinAnx
 
De wondere wereld van cloud en sddc 26 nov 2013 ht v1.1
De wondere wereld van cloud en sddc 26 nov 2013 ht v1.1De wondere wereld van cloud en sddc 26 nov 2013 ht v1.1
De wondere wereld van cloud en sddc 26 nov 2013 ht v1.1EMC Nederland
 
Cloudcamp- The World Wide Cloud
Cloudcamp- The World Wide CloudCloudcamp- The World Wide Cloud
Cloudcamp- The World Wide CloudReuven Cohen
 
Fast & Big Data - the journey from innovative ideas to the creation of real v...
Fast & Big Data - the journey from innovative ideas to the creation of real v...Fast & Big Data - the journey from innovative ideas to the creation of real v...
Fast & Big Data - the journey from innovative ideas to the creation of real v...Daniel Zini
 
OpenShift Commons Dublin 2022 - 3 Pitfalls Everyone Should Avoid with Cloud Data
OpenShift Commons Dublin 2022 - 3 Pitfalls Everyone Should Avoid with Cloud DataOpenShift Commons Dublin 2022 - 3 Pitfalls Everyone Should Avoid with Cloud Data
OpenShift Commons Dublin 2022 - 3 Pitfalls Everyone Should Avoid with Cloud DataEric D. Schabell
 
Key Success Factors for MOSS 2007 as ECM at Telecom - V07 - Rayner, Miles & B...
Key Success Factors for MOSS 2007 as ECM at Telecom - V07 - Rayner, Miles & B...Key Success Factors for MOSS 2007 as ECM at Telecom - V07 - Rayner, Miles & B...
Key Success Factors for MOSS 2007 as ECM at Telecom - V07 - Rayner, Miles & B...Nadine Burnett
 
Securing Your Data for Your Journey to the Cloud
Securing Your Data for Your Journey to the CloudSecuring Your Data for Your Journey to the Cloud
Securing Your Data for Your Journey to the CloudLiwei Ren任力偉
 
The Fast Path to Building a Private Cloud (With Guest Speaker from Forrester ...
The Fast Path to Building a Private Cloud (With Guest Speaker from Forrester ...The Fast Path to Building a Private Cloud (With Guest Speaker from Forrester ...
The Fast Path to Building a Private Cloud (With Guest Speaker from Forrester ...RightScale
 

Semelhante a Employ the Cloud for Efficient Content Analytics - 10 november 2011 (20)

The Three Stages of Cloud Adoption - RightScale Compute 2013
The Three Stages of Cloud Adoption - RightScale Compute 2013The Three Stages of Cloud Adoption - RightScale Compute 2013
The Three Stages of Cloud Adoption - RightScale Compute 2013
 
Big data, data science & fast data
Big data, data science & fast dataBig data, data science & fast data
Big data, data science & fast data
 
Wicsa2011 cloud tutorial
Wicsa2011 cloud tutorialWicsa2011 cloud tutorial
Wicsa2011 cloud tutorial
 
Hortonworks Hybrid Cloud - Putting you back in control of your data
Hortonworks Hybrid Cloud - Putting you back in control of your dataHortonworks Hybrid Cloud - Putting you back in control of your data
Hortonworks Hybrid Cloud - Putting you back in control of your data
 
KMWorld - The Future of Enterprise Content Management (ECM)
KMWorld - The Future of Enterprise Content Management (ECM)KMWorld - The Future of Enterprise Content Management (ECM)
KMWorld - The Future of Enterprise Content Management (ECM)
 
Greg Brown - Intel Big Data & Cloud Summit 2013
Greg Brown - Intel Big Data & Cloud Summit 2013Greg Brown - Intel Big Data & Cloud Summit 2013
Greg Brown - Intel Big Data & Cloud Summit 2013
 
Cloud Technology to Facilitate Growth
Cloud Technology to Facilitate GrowthCloud Technology to Facilitate Growth
Cloud Technology to Facilitate Growth
 
CCSK, cloud security framework, Indonesia
CCSK, cloud security framework, IndonesiaCCSK, cloud security framework, Indonesia
CCSK, cloud security framework, Indonesia
 
Tempo - Mobile access with Governance
Tempo - Mobile access with GovernanceTempo - Mobile access with Governance
Tempo - Mobile access with Governance
 
glenn_amblercloud_security_ncc_event_22-may-2012_v1 (9)
glenn_amblercloud_security_ncc_event_22-may-2012_v1 (9)glenn_amblercloud_security_ncc_event_22-may-2012_v1 (9)
glenn_amblercloud_security_ncc_event_22-may-2012_v1 (9)
 
Moving enterprise IT to the cloud
Moving enterprise IT to the cloudMoving enterprise IT to the cloud
Moving enterprise IT to the cloud
 
Brand niemann03292011
Brand niemann03292011Brand niemann03292011
Brand niemann03292011
 
Presentation big data
Presentation   big dataPresentation   big data
Presentation big data
 
De wondere wereld van cloud en sddc 26 nov 2013 ht v1.1
De wondere wereld van cloud en sddc 26 nov 2013 ht v1.1De wondere wereld van cloud en sddc 26 nov 2013 ht v1.1
De wondere wereld van cloud en sddc 26 nov 2013 ht v1.1
 
Cloudcamp- The World Wide Cloud
Cloudcamp- The World Wide CloudCloudcamp- The World Wide Cloud
Cloudcamp- The World Wide Cloud
 
Fast & Big Data - the journey from innovative ideas to the creation of real v...
Fast & Big Data - the journey from innovative ideas to the creation of real v...Fast & Big Data - the journey from innovative ideas to the creation of real v...
Fast & Big Data - the journey from innovative ideas to the creation of real v...
 
OpenShift Commons Dublin 2022 - 3 Pitfalls Everyone Should Avoid with Cloud Data
OpenShift Commons Dublin 2022 - 3 Pitfalls Everyone Should Avoid with Cloud DataOpenShift Commons Dublin 2022 - 3 Pitfalls Everyone Should Avoid with Cloud Data
OpenShift Commons Dublin 2022 - 3 Pitfalls Everyone Should Avoid with Cloud Data
 
Key Success Factors for MOSS 2007 as ECM at Telecom - V07 - Rayner, Miles & B...
Key Success Factors for MOSS 2007 as ECM at Telecom - V07 - Rayner, Miles & B...Key Success Factors for MOSS 2007 as ECM at Telecom - V07 - Rayner, Miles & B...
Key Success Factors for MOSS 2007 as ECM at Telecom - V07 - Rayner, Miles & B...
 
Securing Your Data for Your Journey to the Cloud
Securing Your Data for Your Journey to the CloudSecuring Your Data for Your Journey to the Cloud
Securing Your Data for Your Journey to the Cloud
 
The Fast Path to Building a Private Cloud (With Guest Speaker from Forrester ...
The Fast Path to Building a Private Cloud (With Guest Speaker from Forrester ...The Fast Path to Building a Private Cloud (With Guest Speaker from Forrester ...
The Fast Path to Building a Private Cloud (With Guest Speaker from Forrester ...
 

Último

Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 

Último (20)

DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 

Employ the Cloud for Efficient Content Analytics - 10 november 2011

  • 1. Employ the Cloud for Efficient Content Analytics Enabling Optimal Decision Making with the Cloud and Content Analytics Samir A. Batla Principal Product Manager, EMC IIG samir.batla@emc.com © Copyright 2011 EMC Corporation. All rights reserved. 1
  • 2. Information Growth – Three Dimensional VOLUME DISPERSION CONTENT ANALYTICS RICHNESS INFORMATION © Copyright 2011 EMC Corporation. All rights reserved. 2
  • 3. Big Data Size: The Volume Of Content Continues To Explode The Digital Universe 2010 - 2020 90% 91% Video Unstructured1 It’s everywhere (2014)2 Data Volume Growing 44x 2010: 1.2 Zettabytes Source: IDC Digital2011 EMCStudy, sponsoredrights reserved.2010 © Copyright Universe Corporation. All by EMC, May 2020: 35.2 Zettabytes 3
  • 4. Types of Content Analysis Driven by Need “I need to find “I need to discover documents about new knowledge to some concept” manage my taxonomies” Business Analyst: “I’m composing a workflow to protect documents containing sensitive information. I need to find content that contains employee id patterns in order to apply rights management policies” © Copyright 2011 EMC Corporation. All rights reserved. 4
  • 5. Types of Content Analysis To list a few… • Categorization – Taxonomy Driven – Indicates what the content is about in a certain context • Named Entity Extraction – NLP-based – Finds what’s mentioned in the content • Pattern Detection – Rules-based e.g. Regex – Finds patterns in the content • Sentiment Analysis, Topic/Theme analysis, etc. © Copyright 2011 EMC Corporation. All rights reserved. 5
  • 6. The Journey To Your Cloud Private Cloud is a logical first step Enterprise IT Private Cloud Complex Trusted Expensive Controlled Inflexible Reliable Siloed Secure Public Cloud Simple Low Cost Flexible Dynamic Infrastructure “70% Will Spend More On Private Cloud through 2012” GARTNER DATA CENTER CONFERENCE 2009 © Copyright 2011 EMC Corporation. All rights reserved. 6
  • 7. The Hybrid Cloud Best of Private and Public Clouds Hybrid Cloud Information Private Cloud Public Cloud Hybrid Cloud use will triple within the next three years. Sand Hill Group 2010 © Copyright 2011 EMC Corporation. All rights reserved. 7
  • 8. Latency: The Cloud’s Achilles Heel Shipping Costs © Copyright 2011 EMC Corporation. All rights reserved. 8
  • 9. Applications In The Cloud ECM CUSTOMER COMMUNICATION CONTENT ANALYTICS GOVERNANCE DATA ANALYTICS © Copyright 2011 EMC Corporation. All rights reserved. CAPTURE/INGEST CONTENT DELIVERY 9
  • 10. Private Cloud for Efficient Content Analytics Ideal First Step Trusted Controlled Reliable Secure © Copyright 2011 EMC Corporation. All rights reserved. Simple Low Cost Flexible Dynamic 10
  • 11. THANK YOU © Copyright 2011 EMC Corporation. All rights reserved. 11

Notas do Editor

  1. P:In order to develop a strategy for employing the Cloud for efficient content analytics, we need to understand the characteristics of the content we want to analyze and we need to understand the benefits of the Cloud.Both, Cloud and Content Analytics solutions are growing rapidly. With the amount of content being generated and the need to find that content, its become essential we develop strategies for analyzing and tagging it so we can find it later. We spend a lot of time and money finding what we need – and by some accounts it costs even more to reproduce it if we can’t find it. Because information is exploding and a greater percentage of it is now richer in content, we also need a strategy that allows the most efficient use of resources to process the content.A: During this presentation, I’d like you to consider the following:Think about your content – and its characteristicsThink about your needs, your customer needsThink about how you can meet your customer needs more efficiently – enabling them to find information more efficiently and make better decisions.Gaining a better understanding about your content, needs and the different cloud models will help drive your cloud strategy and bring you the best value when employing a cloud solution for efficient content analyticsB: I believe the cloud holds immense power, flexibility and economies of scale to meet our content analytics needs that solve problems, allow us to gain better insight and make better decisions that were previously out of reach when faced with massive amounts of data.
  2. I think of information growth in three dimensions – volume, richness and dispersionHow much content do I need to analyze today and what is the rate of its growth – how much content will I potentially be analyzing several years from nowWhat type of content is it? Is it unstructured? Is it just text? Or is it video, images, or audio?Where is it? Is it already in the cloud? If not, can I put it on the Cloud?Finally, let’s not forget, what type of analytics do I need? Categorization, Named Entity Extraction, Pattern Detection, Sentiment Analysis, Facial Recognition (in video)? Something else?.
  3. Consumers of content have different needs. In order to determine the type of Content Analytics you’ll leverage to understand and tag your content, you have to understand these needs.Who is your end-user consuming the content and output of content analysis? Are they librarians or taxonomists? Perhaps the primary use case in this instance is to provide a solution for your subject matter experts to discover new information that helps them build and maintain taxonomies and knowledge bases – that eventually better serve your customers.Is the end-user your customer? How will your end-users want to find the content – how do you want them to look for it? Do you want them to search for content by what it is “about” within a certain context? Is your user an analyst who’s required to comply to new policies on how to handle content with sensitive information? Always ask:Who will consume the content?For what purpose?Do they know what they are looking for?The answers will help choose how to analyze your content and therefore help drive your strategy for employing the cloud.
  4. Let’s refresh our memories with some common types of content analysis. I’m going to mostly focus on analyzing text on this slide.Each of these has its benefits and disadvantages:Categorization’s intent is to determine if the content is about a concept you have described in a taxonomyNamed Entity Extraction uses Natural Language Processing Algorithms to discover new information like People, Places and OrganizationsPattern Detection uses rules expressed in some language to find patterns in the content – most effective when the domain is known.It’s important to note, real-life tests have shown Categorization to be three orders of magnitude faster than Named Entity Extraction. Pattern Detection falls in the large middle somewhere. This is important because when we talk about economies of scale with the cloud, Named Entity Extraction – which is compute intensive seems like a very nice fit for a Cloud-based solution. There are many other types of analysis, such as Sentiment Analysis and even more complex analysis like facial recognition in Images and Videos that I didn’t get into; but I think the point is made.... I think its safe to say that these types of analysis are compute intensive and would also be good candidates for leveraging the Cloud.------Now, you might look at the last couple of slides and think, well…no kidding. Sure, I need to establish the personas I’m serving, their needs, then design a solution...
  5. Performance in the cloud has been touted; but one thing to keep in mind with regard to performance is the overall cost of content analytics. Don’t just evaluate cloud performance; but also consider network performance between the cloud edge and your enterprise or customer facing application.No cloud solutions are immune to latency – the extent varies; however there are architectures and solutions (such as WAN optimization) that can reduce the latency to an acceptable level. How you package your content to the cloud may also increase or decrease latency. For instance, if you are sending entire documents to the cloud (where text extraction occurs, then analysis), this may cost more to transfer if you were just transferring text (having done the extraction on-premise).In some cases, you may not have a choice. If you need to send video and images for complex analysis in the cloud, you will have to make do with the latency knowing you are benefitting from analyzing those assets in the cloud.Content Analytics in general is a very compute intensive process – some analysis are much faster than others; but we do know this – content must be extracted and analyzed in some form or fashion. You must ask, does executing content analytics in the cloud give me the performance benefit to justify the cost associated with latency. I believe it does in most cases.Here’s an interesting analogy of sorts:I was speaking to a colleague of mine in France about the benefits and disadvantages of content analytics in the cloud and he gave an interesting analogy. He said, in Europe, shrimp are fished in the North, then shipped to Morocco to be prepared, then shipped back to the North to be consumed. He finished by saying “Content is like Shrimp!”  (Hardly possessing any culinary skills, I have no clue what is involved in the preparation; but for some reason its worth it)-----The point here is that if the cost of shipping content to the cloud for analysis (and getting the results back) is smaller than the cost of analyzing it on premise, then its worth having the content analyzed on the Cloud. But keep in mind, this only applies if your content is on-premise. If the content is already in the Cloud, then the shipping cost kept to a minimum (such as results, or taxonomies). The size of the data also counts as well as network latency from the Cloud edge to your enterprise.
  6. There are many applications and services available in the cloud today. Organizations are moving their IT operations, data and applications to the cloud and are reporting immediate benefits in terms of cost, performance and customer satisfaction.I believe the Cloud will continue to grow and as security concerns are mitigated, we’ll see greater adoption rates.For content analytics, I believe we all agree unstructured content is growing explosively, we also can agree that in order to find it, we need to efficiently analyze it and intelligently tag it. Knowing the potential of the cloud today, it makes sense to consider a cloud model for efficient content analytics.Facebook is producing summaries over large amounts of data to drive business decisions. With around a half billions users and billions of page views every day, you could say Facebook accumulates massive amounts of data. In order to drive innovation, developers needed tools to mine and manipulate data – roughly 15 terabytes per day. Before the cloud, this analysis was nearly impossible to solve. See full description here: http://www.boozallen.com/media/file/MassiveData.pdfBig Data trends, statistics are helping companies determine their next moves – via Hadoop & MapReduce, why not Content AnalyticsExamples:Log ProcessingEvent DetectionFraud AnalysisTrend Analysis
  7. Today, the private cloud offers the best balance of cloud benefits. The private cloud takes the benefits of economies of scale, low cost and flexibility the public cloud offers - and keeps the infrastructure in an internal closed network – where knowledge can remain secure and under better control of your organization. Its also easier for your organization to consider migrating your content to data management services in the private cloud knowing that its secure – and if you choose to leave the content in your existing managed repositories, you’ll be confident that the latency issue will be a smaller factor moving within the private network.Any computational intensive process is an ideal candidate for leveraging the cloud – content analytics falls into this category. If security, control and latency issues are mitigated, there is little argument against using a cloud-based solution for content analytics.