Submitted by:
SUVEEKSHA JAIN
Mtech I Sem
SJEC
• Data leakage is the unauthorized transmission of
sensitive data or information from within an
organization to an external destination or recipient.
• Sensitive data of companies and organizations
includes:
  • intellectual property,
  • financial information,
  • patient information,
  • personal credit card data,
and other information, depending on the business and the
industry.
• In the course of doing business, data must sometimes
be handed over to trusted third parties for
enhancement or other operations.
• Sometimes these trusted third parties act as
points of data leakage.
• Examples:
a) A hospital may give patient records to researchers who
will devise new treatments.
b) A company may have partnerships with other
companies that require sharing customer data.
c) An enterprise may outsource its data processing, so
data must be given to various other companies.
[Diagram labels: Development chains, Supply chains, Outsourcing, Business hubs, Demand chains]
Data leakage incidents
Date | Organization | Incident
Sept. 2011 | Science Applications International Corp | Backup tapes stolen from a car, containing 5,117,799 patients’ names, phone numbers, Social Security numbers, and medical information.
July 2008 | Google | Data were stolen, not from Google offices, but from the headquarters of an HR outsourcing company, Colt Express.
July 2009 | American Express | A DBA stole a laptop containing thousands of American Express card numbers. The DBA reported it stolen.
Aug. 2007 | Nuclear Laboratory in Los Alamos | An employee of the U.S. nuclear laboratory in Los Alamos transmitted confidential information by email.
• The owner of the data is termed the distributor,
and the third parties are called the
agents.
• In case of data leakage, the distributor must
assess the likelihood that the leaked data
came from one or more agents, as opposed
to having been independently gathered by
other means.
SYSTEM DIAGRAM
Watermarking
Overview:
A unique code is embedded in each distributed
copy. If that copy is later discovered in the hands of an
unauthorized party, the leaker can be identified.
Mechanism:
The main idea is to generate a watermark [W(x; y)]
using a secret key chosen by the sender such that W(x;
y) is indistinguishable from random noise for any
entity that does not know the key (i.e., the recipients).
• The sender adds the watermark W(x; y) to the
information object I(x; y) and thus forms a transformed
object TI(x; y) before sharing it with the recipient(s).
• It is then hard for any recipient to guess the
watermark W(x; y) (and subtract it from the
transformed object TI(x; y)).
• The sender, on the other hand, can easily extract and
verify the watermark (because it knows the key).
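The mechanism above can be sketched in a few lines. This is only an illustration under stated assumptions: the information object is treated as a flat list of numbers, the watermark is additive, each recipient gets its own key, and the sender still holds the original object when checking a found copy. The function names, the `strength` parameter, and the `threshold` value are hypothetical and do not come from the slides. Python's `random` module is used for brevity; it is not a cryptographically secure generator.

```python
import random

def make_watermark(key, n):
    """Regenerate the +/-1 pseudo-random watermark W from a secret key."""
    rng = random.Random(key)  # illustrative only, not cryptographically secure
    return [rng.choice((-1.0, 1.0)) for _ in range(n)]

def embed(info, key, strength=2.0):
    """Form the transformed object TI = I + strength * W."""
    w = make_watermark(key, len(info))
    return [x + strength * wx for x, wx in zip(info, w)]

def correlation(transformed, original, key):
    """Correlate the residual TI - I with the watermark for this key.

    The score is close to `strength` when that watermark is present
    and close to zero when it is not.
    """
    w = make_watermark(key, len(original))
    residual = [t - o for t, o in zip(transformed, original)]
    return sum(r * wx for r, wx in zip(residual, w)) / len(w)

def identify_leaker(found_copy, original, recipient_keys, threshold=1.0):
    """Return the recipient whose watermark best matches, or None."""
    scores = {name: correlation(found_copy, original, key)
              for name, key in recipient_keys.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] >= threshold else None

# Usage: give agent2 a watermarked copy, then trace a found copy back.
original = [float(i % 7) for i in range(1000)]
keys = {"agent1": "secret-1", "agent2": "secret-2"}
copy_for_agent2 = embed(original, keys["agent2"])
leaker = identify_leaker(copy_for_agent2, original, keys)
```

A recipient who does not know the key sees only values shifted by what looks like noise, while the sender can regenerate W and test for it directly.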
• Watermarking involves some modification of the data, that is,
making the data less sensitive by altering its attributes.
• The second problem is that watermarks can
sometimes be destroyed if the recipient is malicious.
• Using data allocation strategies, the distributor gives
data to agents in a way that improves the chances of
detecting guilty agents.
• Fake objects are added to help identify the guilty party.
• The distributor can be more confident about which agent
leaked the data and may stop doing business with that agent.
ARCHITECTURAL VIEW OF
THE SYSTEM
Thus we need a data leakage detection technique that fulfils
the following objective and abides by the given constraint.
CONSTRAINT
To satisfy agent requests by providing them with the number
of objects they request or with all available objects that satisfy their
conditions.
To avoid perturbation of the original data before handing it to agents.
OBJECTIVE
To be able to detect an agent who leaks any portion of his
data.
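As a rough illustration of the constraint, the sketch below serves the two kinds of request it describes: a request for all objects that satisfy a condition, and a request for a given number of objects. The objects are handed over exactly as stored, with no perturbation. The function name `serve_request` and its parameters are hypothetical and are not taken from the slides.

```python
import random

def serve_request(objects, condition=None, sample_size=None, rng=None):
    """Return unmodified objects that satisfy an agent's request.

    condition   -- optional predicate; only matching objects are eligible
    sample_size -- optional number of objects the agent asked for
    """
    rng = rng or random.Random()
    matching = [o for o in objects if condition is None or condition(o)]
    if sample_size is None:
        # Request by condition: hand over every matching object.
        return matching
    # Request by count: hand over that many objects, or all if fewer exist.
    return rng.sample(matching, min(sample_size, len(matching)))

# Usage with a small, made-up set of records.
records = [
    {"id": 1, "state": "CA"},
    {"id": 2, "state": "NY"},
    {"id": 3, "state": "CA"},
]
california = serve_request(records, condition=lambda r: r["state"] == "CA")
any_two = serve_request(records, sample_size=2)
```

Which objects are picked for a count-based request is where an allocation strategy has room to work; the random choice here is only a placeholder.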
Different watermarking systems:
• Embedding and extraction
• Secure spread spectrum watermarking
• DCT-based watermarking
• Spread spectrum
• Wavelet-based watermarking
• Robust watermarking technique
• Invisible watermarking
• Watermarking of digital audio and image using Matlab
• Watermarking while preserving the critical path
• Buyer-seller watermarking protocols
• Watermarking using cellular automata transform
• Fragile watermarking
• Data Allocation Module
• Fake Object Module
• Data Distributor Module
• Agent Guilt Module
• Data Allocation: The main focus of our project is the
data allocation problem: how can the distributor
“intelligently” give data to agents in order to improve
the chances of detecting a guilty agent?
• Fake Object: Fake objects are objects generated by the
distributor in order to increase the chances of detecting
agents that leak data. The distributor may be able to add
fake objects to the distributed data in order to improve
his effectiveness in detecting guilty agents. Our use of
fake objects is inspired by the use of “trace” records in
mailing lists.
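The idea of fake objects can be sketched as follows, assuming each agent receives its real objects plus fake objects that no other agent holds. The names `add_fake_objects` and `agents_implicated`, and the `fake-N` identifiers, are hypothetical; in practice a fake object would have to look like a real record.

```python
import itertools

def add_fake_objects(allocations, fakes_per_agent=1):
    """Give each agent its real objects plus fake objects unique to it.

    Returns the distributed sets and a map from fake object to agent.
    """
    counter = itertools.count(1)
    distributed, fake_owner = {}, {}
    for agent, real_objects in allocations.items():
        fakes = [f"fake-{next(counter)}" for _ in range(fakes_per_agent)]
        for fake in fakes:
            fake_owner[fake] = agent
        distributed[agent] = set(real_objects) | set(fakes)
    return distributed, fake_owner

def agents_implicated(leaked, fake_owner):
    """Agents whose unique fake objects appear in the leaked set."""
    return {fake_owner[obj] for obj in leaked if obj in fake_owner}

# Usage: A2 leaks everything it was given.
allocations = {"A1": {"t1", "t2"}, "A2": {"t1", "t3"}}
distributed, fake_owner = add_fake_objects(allocations)
suspects = agents_implicated(distributed["A2"], fake_owner)
```

Because a fake object is given to only one agent and cannot be gathered from elsewhere, finding it in a leak points to that agent much more strongly than a real object shared among several agents.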
• Data Distributor: A data distributor has given sensitive data to
a set of supposedly trusted agents (third parties). Some of the
data is leaked and found in an unauthorized place (e.g., on the
web or somebody’s laptop). The distributor must assess the
likelihood that the leaked data came from one or more agents,
as opposed to having been independently gathered by other
means.
• Agent Guilt: To compute Pr{G_i | S}, the probability that agent i
is guilty given the leaked set S, we need an estimate for
the probability that values in S can be “guessed” by the target.
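The slides do not state how the guilt probability is computed. The sketch below assumes the guilt model of Papadimitriou and Garcia-Molina, whose terminology (distributor, agents, leaked set S) the deck appears to follow. Under that model, with p the probability that the target guessed an object on its own, R_i the set given to agent i, and V_t the set of agents holding object t, Pr{G_i | S} = 1 - product over t in S ∩ R_i of (1 - (1 - p) / |V_t|). The function name and the example data are hypothetical.

```python
def guilt_probability(agent, leaked, allocations, p):
    """Pr{G_i | S} under the assumed model.

    agent       -- the agent being assessed
    leaked      -- the leaked set S
    allocations -- map from each agent to the set of objects it received
    p           -- probability that the target guessed an object itself
    """
    prob_innocent = 1.0
    for obj in leaked & allocations[agent]:
        # Number of agents that were given this object (|V_t|).
        holders = sum(1 for objs in allocations.values() if obj in objs)
        # Chance that this agent did not leak this particular object.
        prob_innocent *= 1.0 - (1.0 - p) / holders
    return 1.0 - prob_innocent

# Usage: t1 is shared by both agents, t2 was given only to A1.
allocations = {"A1": {"t1", "t2"}, "A2": {"t1", "t3"}}
leaked = {"t1", "t2"}
guilt_a1 = guilt_probability("A1", leaked, allocations, p=0.2)
guilt_a2 = guilt_probability("A2", leaked, allocations, p=0.2)
```

With these inputs A1 comes out at about 0.88 and A2 at about 0.40: the object held only by A1 weighs heavily against it, while the shared object splits the blame between the two agents.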
• A cloud is a large group of interconnected computers.
Any authorized user can access its applications from any
computer over the internet.
Key properties of cloud computing:
• User centric
• Task centric
• Powerful
• Accessible
• Intelligent
• Programmable
• Rights protection for relational data
• Watermarking techniques for multimedia data
• Achieving k-anonymity privacy protection
• Watermarking relational databases
• Lineage tracing for general data warehouse
transformations
• The presented strategies assume that there is a fixed
set of agents with requests known in advance.
• The distributor may have a limit on the number of fake
objects.
• Cloud computing technology enables data to be stored
in the cloud and enables users both inside and outside
the company to access the same data, which increases
the usefulness of the data.
• It helps detect whether the distributor’s sensitive
data has been leaked by trusted or authorized
agents.
• It helps identify the agents who leaked the data.
• It can help reduce cybercrime.
• Although leakers are traditionally identified using
watermarking, certain data cannot admit
watermarks.
• In spite of these difficulties, it is possible to assess the
likelihood that an agent is responsible for a leak.
• We observed that distributing data judiciously, using the
different data allocation strategies, can make a
significant difference in identifying guilty agents.
Data leakage detection
