• INTRODUCTION
• DATA MINING
• WHY DATA MINING
• APPLICATION OF DATA MINING
• STEPS OF DATA MINING
• DATA MINING TECHNIQUES
• THREAT OF DATA MINING
• SOLUTION OF THREAT
• ROLE OF DATA MINING
• DATA WAREHOUSE
• OLTP & OLAP
• DATA MINING TOOLS
• LATEST RESEARCH
INTRODUCTION
Data mining, the extraction of hidden predictive information
from large databases, is a powerful new technology with great
potential to help companies focus on the most important
information in their data warehouses.
DATA MINING
Data mining is the extraction of previously unknown, valid, and understandable
information or patterns from data held in repositories or sources such as:
• Databases
• Text files
• Social networks
• Computer simulations
The information obtained should be usable by any organization or enterprise
for business decision making.
Why Data Mining?
Data, data everywhere, yet:
• I can’t find the data I need
• I can’t get the data I need
• I can’t understand the data I found
• I can’t use the data I found
• Data explosion problem
Advanced data collection tools and database technology lead to
tremendous amounts of data stored in databases.
• We are drowning in data, but starving for
knowledge!
• Solution: Data warehousing and data mining
- Data warehousing and on-line analytical processing.
- Extraction of interesting knowledge using data mining.
APPLICATION OF DATA MINING
Data Mining is primarily used today by companies with a strong
consumer focus — retail, financial, communication, and marketing
organizations.
1. FINANCE INDUSTRY
Credit Card Analysis
2. INSURANCE INDUSTRY
Claims and Fraud Analysis
3. TELECOMMUNICATION
Call Record Analysis
4. TRANSPORT
Logistics Management
5. CONSUMER GOODS
Promotion Analysis
6. SCIENTIFIC RESEARCH
Image, Video, Speech
7. UTILITIES
Power Usage Analysis
STEPS OF DATA MINING
• Data integration
• Data selection
• Data transformation
• Data mining
• Pattern evaluation
• Knowledge presentation
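The six steps above can be sketched as a small pipeline. This is only an illustration under stated assumptions: the two record sources, the field names, and the helper functions (integrate, select, transform, mine, evaluate, present) are hypothetical, and the "mining" step is a plain frequency count standing in for a real algorithm.

```python
from collections import Counter

# Hypothetical raw records from two sources (assumed data, not from the slides).
store_sales = [
    {"customer": "a1", "item": "Milk ", "amount": "2.50"},
    {"customer": "a2", "item": "bread", "amount": "1.20"},
]
web_sales = [
    {"customer": "a1", "item": "milk", "amount": "2.40"},
    {"customer": "a3", "item": "Bread", "amount": None},  # missing amount
    {"customer": "a3", "item": "milk", "amount": "2.60"},
]

def integrate(*sources):
    """Data integration: combine records from several sources."""
    return [row for source in sources for row in source]

def select(rows):
    """Data selection: keep only records usable for the analysis."""
    return [row for row in rows if row["amount"] is not None]

def transform(rows):
    """Data transformation: normalize text and convert types."""
    return [
        {
            "customer": r["customer"],
            "item": r["item"].strip().lower(),
            "amount": float(r["amount"]),
        }
        for r in rows
    ]

def mine(rows):
    """Data mining: here, simply count how often each item is bought."""
    return Counter(r["item"] for r in rows)

def evaluate(patterns, min_count=2):
    """Pattern evaluation: keep patterns that meet a minimum count."""
    return {item: n for item, n in patterns.items() if n >= min_count}

def present(patterns):
    """Knowledge presentation: format the surviving patterns for a reader."""
    return [f"{item}: bought {n} times" for item, n in sorted(patterns.items())]

rows = transform(select(integrate(store_sales, web_sales)))
patterns = evaluate(mine(rows))
report = present(patterns)
print(report)
```

Each function maps to one step, so a real project could swap in a stronger implementation of any single step without touching the others.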
DATA MINING TECHNIQUES
Classification and Prediction
example – Focused Hiring
Cluster Analysis
example – Market Segmentation
Outlier Analysis
example – Fraud Detection
Association Analysis
example – Market Basket Analysis
Evolution Analysis
example – Forecasting stock market index using Time series Analysis
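As one concrete case of the techniques above, association analysis for a market basket can be sketched with two common measures, support and confidence. The baskets and the function names pair_support and confidence are assumptions made for illustration; production systems typically use an algorithm such as Apriori rather than counting every pair directly.

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets (assumed data for illustration).
baskets = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "eggs"},
]

def pair_support(baskets):
    """Return the fraction of baskets that contain each item pair."""
    counts = Counter()
    for basket in baskets:
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return {pair: n / len(baskets) for pair, n in counts.items()}

def confidence(baskets, antecedent, consequent):
    """Estimate how often the consequent appears in baskets holding the antecedent."""
    with_antecedent = [b for b in baskets if antecedent in b]
    if not with_antecedent:
        return 0.0
    both = sum(1 for b in with_antecedent if consequent in b)
    return both / len(with_antecedent)

support = pair_support(baskets)
print(support[("bread", "milk")])
print(confidence(baskets, "butter", "bread"))
```

A rule such as "butter implies bread" is usually kept only when both its support and its confidence exceed thresholds chosen by the analyst.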
Threat To Privacy From Data Mining
Companies mine data about your buying habits and the sites you visit so that they
can personalize your search results when you use their search engine. This is
frightening, yet in theory it is also a way for companies to
tailor your online experience. The problem, of course, is that although
the data generally isn't scoured by humans, it is still used by machines.
SOLUTION OF DATA MINING THREAT
SOLUTIONS:
• Purpose Specification & Use Limitation
• Openness
• Security Measures like Encryption
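One security measure related to the list above can be sketched with the Python standard library. Note that this is pseudonymization with a keyed hash (HMAC), not encryption: it is a different technique, shown here because it needs no third-party package. The key, the record fields, and the pseudonymize function are all hypothetical.

```python
import hashlib
import hmac

# Hypothetical secret key; in practice it would come from a key store,
# not from source code.
SECRET_KEY = b"example-key-for-illustration-only"

def pseudonymize(customer_id: str, key: bytes = SECRET_KEY) -> str:
    """Replace an identifier with a keyed SHA-256 digest (HMAC)."""
    return hmac.new(key, customer_id.encode("utf-8"), hashlib.sha256).hexdigest()

# Assumed example records, not taken from the slides.
records = [
    {"customer_id": "alice@example.com", "purchase": "laptop"},
    {"customer_id": "bob@example.com", "purchase": "phone"},
    {"customer_id": "alice@example.com", "purchase": "mouse"},
]

# The mining step sees only pseudonyms, yet can still group purchases
# that belong to the same customer.
safe_records = [
    {"customer": pseudonymize(r["customer_id"]), "purchase": r["purchase"]}
    for r in records
]
```

The same customer always maps to the same pseudonym, so patterns survive, while anyone without the key cannot easily recompute the mapping from a list of known identifiers.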
ROLE OF DATA MINING IN IT
[Diagram: Business Intelligence; Model, Tool, Method; Behavioral Basics; Information Technology; Data; Problem; Decision]
DATA WAREHOUSE
Data warehousing is a technology that aggregates
structured data from one or more sources so that it can
be compared and analyzed for greater business
intelligence.
DATA WAREHOUSE
• A data warehouse provides the enterprise with a
memory.
• Data mining provides the enterprise with intelligence.
OLTP & OLAP
On-Line Transaction Processing (OLTP)
Short, simple, frequent queries and modifications,
each involving a small number of tuples.
Example – answering queries from a web interface, recording sales at cash registers,
selling airline tickets.
On-Line Analytical Processing (OLAP)
Few but complex queries, which may run for hours.
Queries do not depend on having an absolutely up-to-date
database.
Example – analysts at Wal-Mart look for items with increasing sales in some
region.
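The contrast between the two workloads can be sketched with Python's built-in sqlite3 module. The sales table, its columns, the sample figures, and the record_sale helper are assumptions for illustration; a real warehouse would keep analytical data in a separate system from the transactional one.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales ("
    "id INTEGER PRIMARY KEY, region TEXT, item TEXT, year INTEGER, qty INTEGER)"
)

def record_sale(region, item, year, qty):
    """OLTP-style work: one short transaction touching a single tuple."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT INTO sales (region, item, year, qty) VALUES (?, ?, ?, ?)",
            (region, item, year, qty),
        )

# Hypothetical sales figures (assumed data).
for sale in [
    ("north", "umbrella", 2022, 10),
    ("north", "umbrella", 2023, 25),
    ("north", "sunscreen", 2022, 30),
    ("north", "sunscreen", 2023, 20),
    ("south", "umbrella", 2022, 15),
    ("south", "umbrella", 2023, 12),
]:
    record_sale(*sale)

# OLAP-style work: one analytical query that scans and aggregates many tuples
# to find items whose sales increased in a region.
growing = conn.execute(
    """
    SELECT region, item
    FROM sales
    GROUP BY region, item
    HAVING SUM(CASE WHEN year = 2023 THEN qty ELSE 0 END)
         > SUM(CASE WHEN year = 2022 THEN qty ELSE 0 END)
    ORDER BY region, item
    """
).fetchall()
print(growing)
```

The inserts are many and tiny, while the final query is single and broad, which mirrors the OLTP and OLAP descriptions above.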
DATA MINING TOOLS
• Microsoft SQL Server 2005
• Microsoft SQL Server 2008
• Oracle Data Mining
• DB Miner
Latest Research and Reviews on Data Mining
1. Systematic discovery of mutation-specific synthetic lethals by mining
pan-cancer human primary tumor data.
2. Multi-label Learning for Predicting the Activities of Antimicrobial
Peptides.
3. Semantic correction system - A little complex but interesting. Retrieved
text often contains semantic errors, which lead to wrong results. Applying
semantic correction as a preprocessing step leads to better outcomes.
4. Syntactic correction system - Much needed nowadays. Non-native English
speakers make many syntactic errors. This can also be used as a
preprocessing step in many projects. The algorithm should
automatically detect such errors and suggest correct grammar.
5. Search engine for Wikipedia - Wikipedia data is available as a dump file.
Check DBpedia for reference. Apply indexing techniques and build a
small search engine for wiki pages. Wikipedia already provides this
functionality, but you can work on a better user experience and result
optimization.
6. Twitter tweet classifier - Fairly easy and interesting too. Create a
learning system for categories such as sports, entertainment,
business, politics, and Hollywood. Train the classifier (naive Bayes,
SVM) and predict the category of incoming tweets.
7. Sentiment analysis for Twitter, reviews, and conversations - A few
packages available in R can help with this job. One needs to add
a few additional features on top of them to make the results more intuitive.
NLTK and the Stanford tools are good open source options for the same task.
8. Spam mail detection - Again a learning-based classification system. Train
the classifier on mail that users have already marked as spam so that it can
classify newly arriving mail. If users mark new mail as spam,
retrain (there may be a better option).
9. Sarcasm detection - This can be a very interesting one. In sentiment
analysis we identify users' sentiment about something; here we identify
sarcasm expressed by users. Check out Page on psu.edu - Sarcasm detection
on Twitter
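The classifier projects in the list above (tweet categories, spam detection) share one pattern: train on labeled text, then predict the label of new text. A minimal sketch of a multinomial naive Bayes classifier in plain Python follows. The training messages, the category names, and the functions tokenize, train, and predict are assumptions for illustration; a real project would more likely use a library such as scikit-learn or NLTK and far more data.

```python
import math
from collections import Counter, defaultdict

# Hypothetical labeled messages (assumed training data for illustration).
training = [
    ("great goal in the final match", "sports"),
    ("the team won the match", "sports"),
    ("stocks fell as markets closed", "business"),
    ("the company reported strong profit", "business"),
]

def tokenize(text):
    """Split text into lowercase words."""
    return text.lower().split()

def train(examples):
    """Count words per category and documents per category."""
    word_counts = defaultdict(Counter)
    doc_counts = Counter()
    for text, label in examples:
        doc_counts[label] += 1
        word_counts[label].update(tokenize(text))
    vocabulary = {w for counts in word_counts.values() for w in counts}
    return word_counts, doc_counts, vocabulary

def predict(text, model):
    """Return the category with the highest log posterior score."""
    word_counts, doc_counts, vocabulary = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label in doc_counts:
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for word in tokenize(text):
            if word not in vocabulary:
                continue  # ignore words never seen in training
            # Laplace (add-one) smoothing avoids zero probabilities.
            score += math.log(
                (word_counts[label][word] + 1) / (total_words + len(vocabulary))
            )
        if score > best_score:
            best_label, best_score = label, score
    return best_label

model = train(training)
print(predict("the match was great", model))
print(predict("profit and markets", model))
```

Retraining, as mentioned for spam detection, amounts to calling train again on the enlarged list of labeled examples.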
Data Mining and Data Warehouse
