Real-Time Market Basket Analysis
for Retail with Hadoop
Simone Ferruzzi and Marco Mantovani
Iconsulting Spa
@IconsultingBI
ICONSULTING
ICONSULTING IS AN INDEPENDENT CONSULTING
COMPANY SPECIALIZED IN DWH,BI & PM
Strong expertise on all the market leading technologies
INNOVATIVE
SPECIALIZED
DEVELOPING SKILLS
VENDOR INDEPENDENT
WHO WE ARE
More than 300 projects; more than 100 customers
Professorship in main Italian Universities and Business Schools
In-house Academy providing education services to professionals who
need to develop their skills
Spin-off of a major Research University Consortium
25% of our time invested in R&D
Certified Partner of the main Business Intelligence software vendors
# Data Warehouse
# Business Intelligence
# Performance Management
PROCEDURES & OPERATING INSTRUCTIONS ACCORDING TO ISO 9001:2008
STEP BY STEP APPROACH
PROJECT
REQUIREMENT & RESTRAINTS
SERVICE QUALITY
TIME & COSTS
EXECUTION MEETING DEADLINES
PROBLEMS & RISKS MANAGEMENT
COMMUNICATION AMONG STAKEHOLDERS
AGILE
DESIGN THINKING
METHODOLOGY
ICONSULTING Methodology
Our
CUSTOMERS
MANUFACTURING
ALFA WASSERMANN
AMPLIFON
ARISTON THERMO
CAMAR SMA
CANTIERI SANLORENZO
CASE NEW HOLLAND
FEDRIGONI
G.D
CISA (Ingersoll-Rand)
DUCATI MOTOR HOLDING
ESSECO
FIAMM
FONTANOT
GRUPPO COESIA
GRUPPO FABBRI
ICF - LA FAENZA
IGUZZINI
I.M.A. INDUSTRIA MACCHINE AUTOMATICHE
INTERTABA - PHILIP MORRIS
KME
KOMATSU
LOWARA
MAGNETI MARELLI
MALAVOLTA CORPORATE
MAPEI
MARAZZI
MARPOSS
NEGRI BOSSI
OVA BARGELLINI
OTIS
PHILIP MORRIS ITALIA
PIRELLI
POZZI GINORI
ROSETTI MARINO
SACMI
SECI
SONY EUROPA
TEUCO GUZZINI
UNO A ERRE
VINAVIL
MEDIA & PUBLISHING
PANINI GROUP
SKY ITALIA
VODAFONE
ZANICHELLI EDITORE
GOVERNMENT & PUBLIC SECTOR
MINISTERO DELL’INTERNO
MINISTERO DEL LAVORO E DELLE POLITICHE SOCIALI
REGIONE EMILIA ROMAGNA
REGIONE CALABRIA
REGIONE VENETO
AGREA
ARPA
ARPAT
CESIA
COMUNE DI BOLOGNA
COMUNE DI REGGIO EMILIA
ERVET
INVITALIA
I.S.P.R.A. AMBIENTE
ISTITUTO NAZIONALE FISICA NUCLEARE
LEPIDA
PROV. AUTONOMA DI BOLZANO
PROV. AUTONOMA DI TRENTO
PROVINCIA DI RIMINI
UNIVERSITA’ DI BOLOGNA
SERVICES
DAY RISTOSERVICE
GRUPPO SOCIETA’ GAS RIMINI
MOBY
RINA
SIENAMBIENTE
SOFIS
FASHION
CALZEDONIA
DIESEL
GEOX
GUCCI
IMAX
LOTTO
MILAR
FINANCIAL SERVICES
CREDIT SUISSE
DEXIA CREDIOP
FGA CAPITAL (GRUPPO FIAT)
UNIPOL BANCA
FOOD
BIRRA PERONI
ERIDANIA SADAM
GRANDI SALUMIFICI ITALIANI
MASSIMO ZANETTI BEVERAGE GROUP
MONTENEGRO
SALUMIFICIO FRATELLI BERETTA
SEGAFREDO
LARGE SCALE RETAIL
CONAD ADRIATICO
LA RINASCENTE
SMA (SIMPLY MARKET)
VIP CATERING
Business Intelligence
Turning data into Information
Historicize and Organize Information
Facilitating access to information
Evolution Trends (Big Data)
+ end users + information + performance
Connect analysis to Action
Analyze data in Real Time
Self-service BI
Advanced visualization (mapping, etc.)
New data type (unstructured data / text)
Information Discovery on Big Data
New channels of access (Mobile)
Collaboration & Social
Market Basket Analysis for Retail
Client: Major Italian fashion company
(3000+ points of sale worldwide)
Need: Market Basket Analysis on sold items.
• Input: single invoice lines.
• Output: association rules to verify marketing campaigns, seasonal shopping
habits, shop layouts, etc.
Solution:
• Based on Hadoop ecosystem
• Fully integrated with Business Intelligence platform
(Oracle Business Intelligence Enterprise Edition)
Market Basket Analysis key concepts
• Market Basket Analysis (MBA) is an application of data mining algorithms aimed
at identifying frequent patterns and co-occurrence relationships.
• Given a set of input data, MBA returns a set of association rules of the form
A → B
meaning «If A occurs, then B is likely to occur» (in this case, «If you buy
product A, you are likely to buy B as well»).
• Each rule is associated with two values that measure its degree of interest:
– Support: the share of all considered cases in which A and B occur together
(e.g., the number of receipts in which A and B appear together divided by the total
number of receipts);
– Confidence: the share of cases containing A in which B also occurs
(e.g., the number of receipts that contain both products A and B divided by the
total number of receipts where A appears).
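The two measures can be sketched in a few lines of plain Python. This is only an illustration of the definitions above, not the code used in the project; the function names and the toy receipts are invented for the example.

```python
from typing import FrozenSet, Iterable, List


def support(receipts: List[FrozenSet[str]], itemset: Iterable[str]) -> float:
    """Fraction of all receipts that contain every item in `itemset`."""
    items = frozenset(itemset)
    if not receipts:
        return 0.0
    return sum(1 for r in receipts if items <= r) / len(receipts)


def confidence(receipts: List[FrozenSet[str]],
               antecedent: Iterable[str],
               consequent: Iterable[str]) -> float:
    """Receipts containing both A and B, divided by receipts containing A."""
    a = frozenset(antecedent)
    both = a | frozenset(consequent)
    n_a = sum(1 for r in receipts if a <= r)
    if n_a == 0:
        return 0.0
    n_both = sum(1 for r in receipts if both <= r)
    return n_both / n_a


# Toy receipts, invented for illustration only.
receipts = [
    frozenset({"A", "B"}),
    frozenset({"A"}),
    frozenset({"B", "C"}),
    frozenset({"A", "B", "C"}),
]

print(support(receipts, {"A", "B"}))        # 2 of 4 receipts contain A and B
print(confidence(receipts, {"A"}, {"B"}))   # 2 of the 3 receipts with A also contain B
```

Representing each receipt as a set makes the "contains all items" check a simple subset test.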
Example of associative rule
• Easywear → Underwear
• Support: 9%
• Confidence: 50%
• In 9% of cases Easywear and Underwear products are sold together.
• In 50% of cases when someone purchases an Easywear item,
an Underwear item is also purchased.
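Because confidence is the rule's support divided by the support of the left-hand side, the two figures above also imply how often Easywear appears at all. A short worked step, using only the numbers on this slide and writing support(A ∪ B) for the share of receipts that contain both A and B:

```latex
\[
\operatorname{confidence}(A \rightarrow B)
  = \frac{\operatorname{support}(A \cup B)}{\operatorname{support}(A)}
\quad\Longrightarrow\quad
\operatorname{support}(\text{Easywear})
  = \frac{0.09}{0.50} = 0.18
\]
```

In other words, under these definitions an Easywear item appears in about 18% of the considered receipts.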
Case study: MBA for Retail
• Leading Italian company in the fashion industry
• Sales data from the last three years
• More than 100 million receipts
• The results obtained can be used as an indicator for:
– Defining new promotional initiatives
– Identifying optimal schemes for the layout of goods in stores
– etc.
Architecture
(Diagram labels: Receipts; MBA job; Associative Rules; Job Management Console; Email; Interactive Dashboards; Number of sold items & Associative Rules)
MBA Algorithm Steps
Three chained jobs, each with a Map and a Reduce phase:
• Job 1
– Input: list of single sold items (receipt lines)
– Map: receipt key, item value
– Reduce: combination of items inside the same receipt
• Job 2
– Input: items list aggregated for receipts
– Map / Reduce: calculation of all possible Association Rules that meet minimum Support criteria
• Job 3
– Input: support of the itemsets
– Map / Reduce: Association Rules that meet minimum Confidence criteria
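The three steps can be sketched in plain Python as a single-machine stand-in for the Hadoop jobs. This is a minimal illustration, not the project's implementation: the function names, the `max_size` cap on itemset size, and the toy receipt lines are all assumptions made for the example.

```python
from collections import Counter, defaultdict
from itertools import combinations


def job1_group_by_receipt(lines):
    """Job 1: key each line by receipt, then collect the items of each receipt."""
    baskets = defaultdict(set)
    for receipt_id, item in lines:       # map: emit (receipt key, item value)
        baskets[receipt_id].add(item)    # reduce: items grouped per receipt
    return baskets


def job2_itemset_support(baskets, min_support, max_size=2):
    """Job 2: count item combinations per receipt, keep those meeting min support."""
    counts = Counter()
    for items in baskets.values():       # map: emit each combination in a receipt
        for size in range(1, max_size + 1):
            for combo in combinations(sorted(items), size):
                counts[frozenset(combo)] += 1
    total = len(baskets)                 # reduce: turn counts into support values
    return {s: c / total for s, c in counts.items() if c / total >= min_support}


def job3_rules(supports, min_confidence):
    """Job 3: derive rules A -> B from frequent itemsets, keep those meeting min confidence."""
    rules = []
    for itemset, sup in supports.items():
        for size in range(1, len(itemset)):
            for antecedent in combinations(sorted(itemset), size):
                a = frozenset(antecedent)
                # Every subset of a frequent itemset is itself frequent,
                # so supports[a] is always present.
                conf = sup / supports[a]
                if conf >= min_confidence:
                    rules.append((tuple(sorted(a)), tuple(sorted(itemset - a)), sup, conf))
    return sorted(rules)


# Toy receipt lines (receipt id, item), invented for illustration only.
lines = [(1, "bra"), (1, "slip"), (2, "bra"), (2, "slip"),
         (2, "socks"), (3, "bra"), (4, "socks")]

baskets = job1_group_by_receipt(lines)
supports = job2_itemset_support(baskets, min_support=0.5)
rules = job3_rules(supports, min_confidence=0.6)
for antecedent, consequent, sup, conf in rules:
    print(antecedent, "->", consequent, f"support={sup:.2f}", f"confidence={conf:.2f}")
```

Enumerating every combination inside a receipt grows quickly with basket size, which is one reason the sketch caps itemsets at `max_size`; how the original jobs bound this is not stated on the slide.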
Job Management Interface
• Interface integrated with standard BI tool
• MBA Algorithm can run on different data sets
• Each user can perform custom analysis
• Algorithm parameters (minimum support and
confidence) can be set by end users
• Examples of different analyses:
– what types of products are sold together with a discounted item?
– are there different association rules between products sold in city-center stores and
those in outlets?
Job Management Interface
(Screenshot labels: Analysis Description; Time filters; Point of Sales filters; Product filters; Attributes used for association rules; Support & Confidence parameters; Run MBA)
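One way to picture what such a form submits is a small parameter object that selects the receipt lines to analyze. The sketch below is hypothetical: the class names, field names, and store and item attributes are invented, and the sample values are borrowed from the Analysis Examples slide (dates read as day/month/year).

```python
from dataclasses import dataclass
from datetime import date
from typing import FrozenSet, Optional


@dataclass(frozen=True)
class ReceiptLine:
    receipt_id: int
    sold_on: date
    store_type: str   # hypothetical point-of-sale attribute, e.g. "city-center"
    item_type: str    # hypothetical attribute used for association rules


@dataclass(frozen=True)
class AnalysisRequest:
    description: str
    date_from: date
    date_to: date
    min_support: float
    min_confidence: float
    store_types: Optional[FrozenSet[str]] = None        # None = no point-of-sale filter
    excluded_item_types: FrozenSet[str] = frozenset()   # product filter

    def selects(self, line: ReceiptLine) -> bool:
        """Return True if the line passes the time, point-of-sale and product filters."""
        if not (self.date_from <= line.sold_on <= self.date_to):
            return False
        if self.store_types is not None and line.store_type not in self.store_types:
            return False
        return line.item_type not in self.excluded_item_types


request = AnalysisRequest(
    description="New product campaign, city-center stores",
    date_from=date(2013, 9, 1),
    date_to=date(2013, 12, 31),
    min_support=0.35,
    min_confidence=0.50,
    store_types=frozenset({"city-center"}),
    excluded_item_types=frozenset({"knitwear"}),
)

# Toy receipt lines, invented for illustration only.
lines = [
    ReceiptLine(1, date(2013, 10, 5), "city-center", "bra"),
    ReceiptLine(1, date(2013, 10, 5), "city-center", "knitwear"),
    ReceiptLine(2, date(2013, 10, 6), "outlet", "slip"),
    ReceiptLine(3, date(2014, 1, 2), "city-center", "slip"),
]
selected = [line for line in lines if request.selects(line)]
print(len(selected))
```

Keeping the filters and the two thresholds in one immutable object means each custom analysis can be stored and re-run with exactly the same settings.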
Results Dashboard
(Screenshot labels: Support; Confidence)
Analysis Examples
• From 01/09/2013 to 31/12/2013: marketing campaign for a new type of bra
• All Italian points of sale located in city centers
• Analysis across all item types except knitwear
• Min. support 35%, min. confidence 50%
Rules found:
new bra → slip, babydoll support: 36% confidence: 52%
Meaning: 36% of considered receipts contain all those items; when the new bra
is purchased, 52 times out of 100 a slip and a babydoll are also purchased.
Same configuration as before, but considering only PoS in shopping centers
Rules found:
Easywear → new bra support: 50% confidence: 60%
Meaning: in shopping centers, the sales of Easywear drive the sales of the new
bra.
Conclusions and future work
Conclusions
• Business users can now investigate in depth the effectiveness of marketing and
advertising campaigns and determine whether shop windows and in-store layouts
achieve their goals.
• The Market Basket Analysis algorithm can be customized to users’ needs.
• Transparent interaction between the Hadoop cluster and the Business Intelligence
platform.
Future work: from project to solution:
• Complete framework to run complex Data Mining algorithms on Big Data.
• Hadoop to exploit parallel execution and Distributed File System.
• Seamless integration with standard Business Intelligence tools.
• More user independence in data integration.
@IconsultingBI
Real-Time Market Basket
Analysis for Retail with
Hadoop
Real-time Market Basket Analysis for Retail with Hadoop

Mais conteúdo relacionado

Mais procurados

Data Mining Concepts
Data Mining ConceptsData Mining Concepts
Data Mining ConceptsDung Nguyen
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Seerat Malik
 
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain RatioLecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain RatioMarina Santini
 
Introduction to Data Mining
Introduction to Data Mining Introduction to Data Mining
Introduction to Data Mining Sushil Kulkarni
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining Phi Jack
 
Machine Learning Introduction
Machine Learning IntroductionMachine Learning Introduction
Machine Learning IntroductionYounesCharfaoui
 
Data mining presentation.ppt
Data mining presentation.pptData mining presentation.ppt
Data mining presentation.pptneelamoberoi1030
 
Information Retrieval Models
Information Retrieval ModelsInformation Retrieval Models
Information Retrieval ModelsNisha Arankandath
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language ProcessingPranav Gupta
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning Gopal Sakarkar
 
Lec 3 knowledge acquisition representation and inference
Lec 3  knowledge acquisition representation and inferenceLec 3  knowledge acquisition representation and inference
Lec 3 knowledge acquisition representation and inferenceEyob Sisay
 
Data Mining : Concepts
Data Mining : ConceptsData Mining : Concepts
Data Mining : ConceptsPragya Pandey
 
Anomaly detection with machine learning at scale
Anomaly detection with machine learning at scaleAnomaly detection with machine learning at scale
Anomaly detection with machine learning at scaleImpetus Technologies
 
Social media mining PPT
Social media mining PPTSocial media mining PPT
Social media mining PPTChhavi Mathur
 
Classification and prediction in data mining
Classification and prediction in data miningClassification and prediction in data mining
Classification and prediction in data miningEr. Nawaraj Bhandari
 

Mais procurados (20)

Data mining
Data miningData mining
Data mining
 
Data Mining Concepts
Data Mining ConceptsData Mining Concepts
Data Mining Concepts
 
Apriori algorithm
Apriori algorithmApriori algorithm
Apriori algorithm
 
Automatic indexing
Automatic indexingAutomatic indexing
Automatic indexing
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain RatioLecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
 
Introduction to Data Mining
Introduction to Data Mining Introduction to Data Mining
Introduction to Data Mining
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining
 
Data mining in e commerce
Data mining in e commerceData mining in e commerce
Data mining in e commerce
 
Machine Learning Introduction
Machine Learning IntroductionMachine Learning Introduction
Machine Learning Introduction
 
Data mining presentation.ppt
Data mining presentation.pptData mining presentation.ppt
Data mining presentation.ppt
 
Information Retrieval Models
Information Retrieval ModelsInformation Retrieval Models
Information Retrieval Models
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning
 
Lec 3 knowledge acquisition representation and inference
Lec 3  knowledge acquisition representation and inferenceLec 3  knowledge acquisition representation and inference
Lec 3 knowledge acquisition representation and inference
 
Data Mining : Concepts
Data Mining : ConceptsData Mining : Concepts
Data Mining : Concepts
 
Anomaly detection with machine learning at scale
Anomaly detection with machine learning at scaleAnomaly detection with machine learning at scale
Anomaly detection with machine learning at scale
 
Social media mining PPT
Social media mining PPTSocial media mining PPT
Social media mining PPT
 
Classification and prediction in data mining
Classification and prediction in data miningClassification and prediction in data mining
Classification and prediction in data mining
 

Destaque

Masket Basket Analysis
Masket Basket AnalysisMasket Basket Analysis
Masket Basket AnalysisMarc Berman
 
Market basket analysis
Market basket analysisMarket basket analysis
Market basket analysisVermaAkash32
 
Data mining- Association Analysis -market basket
Data mining- Association Analysis -market basketData mining- Association Analysis -market basket
Data mining- Association Analysis -market basketSwapnil Soni
 
Market Basket Analysis in SAS
Market Basket Analysis in SASMarket Basket Analysis in SAS
Market Basket Analysis in SASAndrew Kramer
 
BIG Data & Hadoop Applications in Retail
BIG Data & Hadoop Applications in RetailBIG Data & Hadoop Applications in Retail
BIG Data & Hadoop Applications in RetailSkillspeed
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with HadoopPhilippe Julio
 
Data mining slides
Data mining slidesData mining slides
Data mining slidessmj
 
Increasing Order Size With Basket Analysis
Increasing Order Size With Basket AnalysisIncreasing Order Size With Basket Analysis
Increasing Order Size With Basket AnalysisEmcien Corporation
 
Market Basket Analysis Algorithm with no-SQL DB HBase and Hadoop
Market Basket Analysis Algorithm with no-SQL DB HBase and Hadoop Market Basket Analysis Algorithm with no-SQL DB HBase and Hadoop
Market Basket Analysis Algorithm with no-SQL DB HBase and Hadoop Jongwook Woo
 
Hadoop project design and a usecase
Hadoop project design and  a usecaseHadoop project design and  a usecase
Hadoop project design and a usecasesudhakara st
 
Hadoop introduction , Why and What is Hadoop ?
Hadoop introduction , Why and What is  Hadoop ?Hadoop introduction , Why and What is  Hadoop ?
Hadoop introduction , Why and What is Hadoop ?sudhakara st
 
Aplicaciones Difusas: Aprendizaje Supervisado-Problemas Clasificación
Aplicaciones Difusas: Aprendizaje Supervisado-Problemas ClasificaciónAplicaciones Difusas: Aprendizaje Supervisado-Problemas Clasificación
Aplicaciones Difusas: Aprendizaje Supervisado-Problemas ClasificaciónLuis Fernando Aguas Bucheli
 
Microsoft Azure Overview Infographic
Microsoft Azure Overview InfographicMicrosoft Azure Overview Infographic
Microsoft Azure Overview InfographicMicrosoft Azure
 
Supply Chain Spends: Advancing Sourcing Beyond Procurement Suites
Supply Chain Spends: Advancing Sourcing Beyond Procurement Suites Supply Chain Spends: Advancing Sourcing Beyond Procurement Suites
Supply Chain Spends: Advancing Sourcing Beyond Procurement Suites CombineNet, Inc.
 
IEEE 2014 JAVA DATA MINING PROJECTS A similarity measure for text classificat...
IEEE 2014 JAVA DATA MINING PROJECTS A similarity measure for text classificat...IEEE 2014 JAVA DATA MINING PROJECTS A similarity measure for text classificat...
IEEE 2014 JAVA DATA MINING PROJECTS A similarity measure for text classificat...IEEEFINALYEARSTUDENTPROJECTS
 

Destaque (20)

Masket Basket Analysis
Masket Basket AnalysisMasket Basket Analysis
Masket Basket Analysis
 
Market basket analysis
Market basket analysisMarket basket analysis
Market basket analysis
 
Market basket analysis
Market basket analysisMarket basket analysis
Market basket analysis
 
Data mining- Association Analysis -market basket
Data mining- Association Analysis -market basketData mining- Association Analysis -market basket
Data mining- Association Analysis -market basket
 
Market Basket Analysis in SAS
Market Basket Analysis in SASMarket Basket Analysis in SAS
Market Basket Analysis in SAS
 
BIG Data & Hadoop Applications in Retail
BIG Data & Hadoop Applications in RetailBIG Data & Hadoop Applications in Retail
BIG Data & Hadoop Applications in Retail
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with Hadoop
 
Data mining slides
Data mining slidesData mining slides
Data mining slides
 
Dashboard
DashboardDashboard
Dashboard
 
Increasing Order Size With Basket Analysis
Increasing Order Size With Basket AnalysisIncreasing Order Size With Basket Analysis
Increasing Order Size With Basket Analysis
 
Hadoop on retail
Hadoop on retailHadoop on retail
Hadoop on retail
 
Market Basket Analysis Algorithm with no-SQL DB HBase and Hadoop
Market Basket Analysis Algorithm with no-SQL DB HBase and Hadoop Market Basket Analysis Algorithm with no-SQL DB HBase and Hadoop
Market Basket Analysis Algorithm with no-SQL DB HBase and Hadoop
 
Market baasket analysis
Market baasket analysisMarket baasket analysis
Market baasket analysis
 
Hadoop project design and a usecase
Hadoop project design and  a usecaseHadoop project design and  a usecase
Hadoop project design and a usecase
 
Hadoop introduction , Why and What is Hadoop ?
Hadoop introduction , Why and What is  Hadoop ?Hadoop introduction , Why and What is  Hadoop ?
Hadoop introduction , Why and What is Hadoop ?
 
Big data and Hadoop
Big data and HadoopBig data and Hadoop
Big data and Hadoop
 
Aplicaciones Difusas: Aprendizaje Supervisado-Problemas Clasificación
Aplicaciones Difusas: Aprendizaje Supervisado-Problemas ClasificaciónAplicaciones Difusas: Aprendizaje Supervisado-Problemas Clasificación
Aplicaciones Difusas: Aprendizaje Supervisado-Problemas Clasificación
 
Microsoft Azure Overview Infographic
Microsoft Azure Overview InfographicMicrosoft Azure Overview Infographic
Microsoft Azure Overview Infographic
 
Supply Chain Spends: Advancing Sourcing Beyond Procurement Suites
Supply Chain Spends: Advancing Sourcing Beyond Procurement Suites Supply Chain Spends: Advancing Sourcing Beyond Procurement Suites
Supply Chain Spends: Advancing Sourcing Beyond Procurement Suites
 
IEEE 2014 JAVA DATA MINING PROJECTS A similarity measure for text classificat...
IEEE 2014 JAVA DATA MINING PROJECTS A similarity measure for text classificat...IEEE 2014 JAVA DATA MINING PROJECTS A similarity measure for text classificat...
IEEE 2014 JAVA DATA MINING PROJECTS A similarity measure for text classificat...
 

Semelhante a Real-time Market Basket Analysis for Retail with Hadoop

HacktoberFestPune - DSC MESCOE x DSC PVGCOET
HacktoberFestPune - DSC MESCOE x DSC PVGCOETHacktoberFestPune - DSC MESCOE x DSC PVGCOET
HacktoberFestPune - DSC MESCOE x DSC PVGCOETTanyaRaina3
 
Modern Services Marketing Session at TSIA/TSW 2017 San Diego
Modern Services Marketing Session at TSIA/TSW 2017 San DiegoModern Services Marketing Session at TSIA/TSW 2017 San Diego
Modern Services Marketing Session at TSIA/TSW 2017 San DiegoFred Isbell
 
Penser Analytics - Company Profile
Penser Analytics - Company ProfilePenser Analytics - Company Profile
Penser Analytics - Company ProfileSwarochish Chekuri
 
e-Marketing Tips
e-Marketing Tips e-Marketing Tips
e-Marketing Tips yk png
 
Digital marketing from a startup founder point of view - Brussels Meetup
Digital marketing from a startup founder point of view - Brussels MeetupDigital marketing from a startup founder point of view - Brussels Meetup
Digital marketing from a startup founder point of view - Brussels MeetupMohsin El Khamlichi
 
Digital analytics lecture1
Digital analytics lecture1Digital analytics lecture1
Digital analytics lecture1Joni Salminen
 
130522 ibm heyerdal fremtidens handleopplevelse
130522 ibm heyerdal   fremtidens handleopplevelse130522 ibm heyerdal   fremtidens handleopplevelse
130522 ibm heyerdal fremtidens handleopplevelseNils Kristensen
 
What is Content marketing - Moses Gomes
What is Content marketing - Moses GomesWhat is Content marketing - Moses Gomes
What is Content marketing - Moses GomesMoses Gomes
 
B2B Expo Practical B2B digital marketing trends and techniques
B2B Expo Practical B2B digital marketing trends and techniques B2B Expo Practical B2B digital marketing trends and techniques
B2B Expo Practical B2B digital marketing trends and techniques Smart Insights
 
Babiloo: The innovative dashboard for the brand's digital ecosystem.
Babiloo: The innovative dashboard for the brand's digital ecosystem. Babiloo: The innovative dashboard for the brand's digital ecosystem.
Babiloo: The innovative dashboard for the brand's digital ecosystem. Hallelujah s.r.l.
 
Marketing Analytics at IBM - export
Marketing Analytics at IBM - exportMarketing Analytics at IBM - export
Marketing Analytics at IBM - exportSpyros Kontogiorgis
 
Nuevos Modelos De Negocio en Las Industrias Culturales. 2º Taller Ciclo de Ge...
Nuevos Modelos De Negocio en Las Industrias Culturales. 2º Taller Ciclo de Ge...Nuevos Modelos De Negocio en Las Industrias Culturales. 2º Taller Ciclo de Ge...
Nuevos Modelos De Negocio en Las Industrias Culturales. 2º Taller Ciclo de Ge...Aje Región de Murcia
 
[Webinar] The ABCs of ABM (Account-Based Marketing)
[Webinar] The ABCs of ABM (Account-Based Marketing)[Webinar] The ABCs of ABM (Account-Based Marketing)
[Webinar] The ABCs of ABM (Account-Based Marketing)Mintigo1
 
Psfk futureofretail 2017slideshare 161108174615
Psfk futureofretail 2017slideshare 161108174615Psfk futureofretail 2017slideshare 161108174615
Psfk futureofretail 2017slideshare 161108174615Jérôme MONANGE
 
PSFK Future Of Retail 2017
PSFK Future Of Retail 2017PSFK Future Of Retail 2017
PSFK Future Of Retail 2017Filipp Paster
 
Psfkfutureofretail2017slideshare 161108174615
Psfkfutureofretail2017slideshare 161108174615Psfkfutureofretail2017slideshare 161108174615
Psfkfutureofretail2017slideshare 161108174615Dieste Inc.
 

Semelhante a Real-time Market Basket Analysis for Retail with Hadoop (20)

HacktoberFestPune - DSC MESCOE x DSC PVGCOET
HacktoberFestPune - DSC MESCOE x DSC PVGCOETHacktoberFestPune - DSC MESCOE x DSC PVGCOET
HacktoberFestPune - DSC MESCOE x DSC PVGCOET
 
Big Data en Retail
Big Data en RetailBig Data en Retail
Big Data en Retail
 
Big Data and E-Commerce
Big Data and E-CommerceBig Data and E-Commerce
Big Data and E-Commerce
 
Modern Services Marketing Session at TSIA/TSW 2017 San Diego
Modern Services Marketing Session at TSIA/TSW 2017 San DiegoModern Services Marketing Session at TSIA/TSW 2017 San Diego
Modern Services Marketing Session at TSIA/TSW 2017 San Diego
 
Penser Analytics - Company Profile
Penser Analytics - Company ProfilePenser Analytics - Company Profile
Penser Analytics - Company Profile
 
e-Marketing Tips
e-Marketing Tips e-Marketing Tips
e-Marketing Tips
 
Digital marketing from a startup founder point of view - Brussels Meetup
Digital marketing from a startup founder point of view - Brussels MeetupDigital marketing from a startup founder point of view - Brussels Meetup
Digital marketing from a startup founder point of view - Brussels Meetup
 
Digital analytics lecture1
Digital analytics lecture1Digital analytics lecture1
Digital analytics lecture1
 
Analytics in E-commerce
Analytics in E-commerceAnalytics in E-commerce
Analytics in E-commerce
 
130522 ibm heyerdal fremtidens handleopplevelse
130522 ibm heyerdal   fremtidens handleopplevelse130522 ibm heyerdal   fremtidens handleopplevelse
130522 ibm heyerdal fremtidens handleopplevelse
 
What is Content marketing - Moses Gomes
What is Content marketing - Moses GomesWhat is Content marketing - Moses Gomes
What is Content marketing - Moses Gomes
 
B2B Expo Practical B2B digital marketing trends and techniques
B2B Expo Practical B2B digital marketing trends and techniques B2B Expo Practical B2B digital marketing trends and techniques
B2B Expo Practical B2B digital marketing trends and techniques
 
Babiloo: The innovative dashboard for the brand's digital ecosystem.
Babiloo: The innovative dashboard for the brand's digital ecosystem. Babiloo: The innovative dashboard for the brand's digital ecosystem.
Babiloo: The innovative dashboard for the brand's digital ecosystem.
 
Marketing Analytics at IBM - export
Marketing Analytics at IBM - exportMarketing Analytics at IBM - export
Marketing Analytics at IBM - export
 
Nuevos Modelos De Negocio en Las Industrias Culturales. 2º Taller Ciclo de Ge...
Nuevos Modelos De Negocio en Las Industrias Culturales. 2º Taller Ciclo de Ge...Nuevos Modelos De Negocio en Las Industrias Culturales. 2º Taller Ciclo de Ge...
Nuevos Modelos De Negocio en Las Industrias Culturales. 2º Taller Ciclo de Ge...
 
[Webinar] The ABCs of ABM (Account-Based Marketing)
[Webinar] The ABCs of ABM (Account-Based Marketing)[Webinar] The ABCs of ABM (Account-Based Marketing)
[Webinar] The ABCs of ABM (Account-Based Marketing)
 
Taking Data-Driven Strategy Beyond Messaging
Taking Data-Driven Strategy Beyond MessagingTaking Data-Driven Strategy Beyond Messaging
Taking Data-Driven Strategy Beyond Messaging
 
Psfk futureofretail 2017slideshare 161108174615
Psfk futureofretail 2017slideshare 161108174615Psfk futureofretail 2017slideshare 161108174615
Psfk futureofretail 2017slideshare 161108174615
 
PSFK Future Of Retail 2017
PSFK Future Of Retail 2017PSFK Future Of Retail 2017
PSFK Future Of Retail 2017
 
Psfkfutureofretail2017slideshare 161108174615
Psfkfutureofretail2017slideshare 161108174615Psfkfutureofretail2017slideshare 161108174615
Psfkfutureofretail2017slideshare 161108174615
 

Mais de DataWorks Summit

Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisDataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiDataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal SystemDataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExampleDataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberDataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixDataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiDataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsDataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureDataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EngineDataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiDataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerDataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouDataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkDataWorks Summit
 

Real-time Market Basket Analysis for Retail with Hadoop

  • 1. Real-Time Market Basket Analysis for Retail with Hadoop. Simone Ferruzzi and Marco Mantovani, Iconsulting Spa
  • 3. @IconsultingBI ICONSULTING: Who we are
      – Iconsulting is an independent consulting company specialized in DWH, BI & PM (Data Warehouse, Business Intelligence, Performance Management)
      – Strong expertise on all the market-leading technologies
      – Innovative; specialized; developing skills; vendor independent
      – More than 300 projects; more than 100 customers
      – Professorship in main Italian universities and business schools
      – In-house Academy providing education services to professionals who need to develop their skills
      – Spin-off of a major research university consortium
      – 25% of our time invested in R&D
      – Certified partner of the main Business Intelligence software vendors
  • 4. ICONSULTING Methodology: procedures & operating instructions according to ISO 9001:2008; step-by-step approach; project requirements & restraints; service quality; time & costs; execution meeting deadlines; problems & risks management; communication among stakeholders; Agile; Design Thinking methodology
  • 5. Our customers
      – Manufacturing: ALFA WASSERMANN, AMPLIFON, ARISTON THERMO, CAMAR SMA, CANTIERI SANLORENZO, CASE NEW HOLLAND, FEDRIGONI, G.D, CISA (Ingersoll-Rand), DUCATI MOTOR HOLDING, ESSECO, FIAMM, FONTANOT, GRUPPO COESIA, GRUPPO FABBRI, ICF - LA FAENZA, IGUZZINI, I.M.A. INDUSTRIA MACCHINE AUTOMATICHE, INTERTABA - PHILIP MORRIS, KME, KOMATSU, LOWARA, MAGNETI MARELLI, MALAVOLTA CORPORATE, MAPEI, MARAZZI, MARPOSS, NEGRI BOSSI, OVA BARGELLINI, OTIS, PHILIP MORRIS ITALIA, PIRELLI, POZZI GINORI, ROSETTI MARINO, SACMI, SECI, SONY EUROPA, TEUCO GUZZINI, UNO A ERRE, VINAVIL
      – Media & publishing: PANINI GROUP, SKY ITALIA, VODAFONE, ZANICHELLI EDITORE
      – Government & public sector: MINISTERO DELL’INTERNO, MINISTERO DEL LAVORO E DELLE POLITICHE SOCIALI, REGIONE EMILIA ROMAGNA, REGIONE CALABRIA, REGIONE VENETO, AGREA, ARPA, ARPAT, CESIA, COMUNE DI BOLOGNA, COMUNE DI REGGIO EMILIA, ERVET, INVITALIA, I.S.P.R.A. AMBIENTE, ISTITUTO NAZIONALE FISICA NUCLEARE, LEPIDA, PROV. AUTONOMA DI BOLZANO, PROV. AUTONOMA DI TRENTO, PROVINCIA DI RIMINI, UNIVERSITA’ DI BOLOGNA
      – Services: DAY RISTOSERVICE, GRUPPO SOCIETA’ GAS RIMINI, MOBY, RINA, SIENAMBIENTE, SOFIS
      – Fashion: CALZEDONIA, DIESEL, GEOX, GUCCI, IMAX, LOTTO, MILAR
      – Financial services: CREDIT SUISSE, DEXIA CREDIOP, FGA CAPITAL (GRUPPO FIAT), UNIPOL BANCA
      – Food: BIRRA PERONI, ERIDANIA SADAM, GRANDI SALUMIFICI ITALIANI, MASSIMO ZANETTI BEVERAGE GROUP, MONTENEGRO, SALUMIFICIO FRATELLI BERETTA, SEGAFREDO
      – Large scale retail: CONAD ADRIATICO, LA RINASCENTE, SMA (SIMPLY MARKET), VIP CATERING
  • 6. Business Intelligence
      – Turning data into information; historicizing and organizing information; facilitating access to information
      – Evolution trends (Big Data): more end users, more information, more performance
      – Connect analysis to action
      – Analyze data in real time
      – Self-service BI
      – Advanced visualization (mapping, etc.)
      – New data types (unstructured data / text)
      – Information discovery on Big Data
      – New channels of access (mobile)
      – Collaboration & social
  • 7. Market Basket Analysis for Retail
      – Client: major Italian fashion company (3000+ points of sale worldwide)
      – Need: Market Basket Analysis on sold items
          Input: single invoice lines
          Output: association rules to verify marketing campaigns, seasonal shopping habits, shop layouts, etc.
      – Solution: based on the Hadoop ecosystem; fully integrated with the Business Intelligence platform (Oracle Business Intelligence Enterprise Edition)
  • 8. Market Basket Analysis key concepts
      – Market Basket Analysis (MBA) is an application of data mining algorithms aimed at identifying frequent patterns and co-occurrence relationships.
      – Given a set of input data, MBA returns a set of association rules of the form A → B, meaning «If A occurs, then B is likely to occur» (in this case, «If you buy product A, you are likely to buy B as well»).
      – Each rule is associated with two values that measure its degree of interest:
          Support: the percentage of cases in which the two events A and B occur together, out of all the cases considered (e.g., the number of receipts in which A and B appear together divided by the total number of receipts).
          Confidence: the percentage of cases in which the two events A and B occur together, out of the cases where A occurs (e.g., the number of receipts that contain both products A and B divided by the number of receipts in which A appears).
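The two measures defined in slide 8 can be illustrated with a small Python sketch. This is not code from the presentation: the function names and the toy receipts are assumptions made for the example, and each receipt is modelled simply as a set of product categories.

```python
from typing import FrozenSet, Iterable, List

Receipt = FrozenSet[str]


def support(receipts: List[Receipt], items: Iterable[str]) -> float:
    """Fraction of all receipts that contain every item in `items`."""
    wanted = frozenset(items)
    if not receipts:
        return 0.0
    hits = sum(1 for r in receipts if wanted <= r)
    return hits / len(receipts)


def confidence(receipts: List[Receipt],
               antecedent: Iterable[str],
               consequent: Iterable[str]) -> float:
    """Among receipts containing the antecedent, fraction that also contain the consequent."""
    a = frozenset(antecedent)
    both = a | frozenset(consequent)
    with_a = sum(1 for r in receipts if a <= r)
    if with_a == 0:
        return 0.0
    with_both = sum(1 for r in receipts if both <= r)
    return with_both / with_a


# Hypothetical toy data: four receipts.
receipts = [
    frozenset({"easywear", "underwear"}),
    frozenset({"easywear"}),
    frozenset({"underwear", "socks"}),
    frozenset({"socks"}),
]

rule_support = support(receipts, {"easywear", "underwear"})          # 1 of 4 receipts
rule_confidence = confidence(receipts, {"easywear"}, {"underwear"})  # 1 of 2 easywear receipts
print(rule_support, rule_confidence)
```

On this toy data the rule easywear → underwear has a support of 0.25 and a confidence of 0.5.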
  • 9. Example of association rule: Easywear → Underwear
      – Support: 9%. In 9% of cases Easywear and Underwear products are sold together.
      – Confidence: 50%. In 50% of cases when someone purchases an Easywear item, an Underwear item is also purchased.
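As a worked check of the example above, the definitions of support and confidence imply how often Easywear items appear on their own. The notation below (supp, conf) is introduced here for illustration and does not appear in the slides.

```latex
\mathrm{conf}(A \rightarrow B) = \frac{\mathrm{supp}(A \cup B)}{\mathrm{supp}(A)}
\quad\Longrightarrow\quad
\mathrm{supp}(\text{Easywear}) = \frac{\mathrm{supp}(\text{Easywear} \cup \text{Underwear})}{\mathrm{conf}(\text{Easywear} \rightarrow \text{Underwear})}
= \frac{0.09}{0.50} = 0.18
```

In other words, if the two figures on the slide hold, Easywear items appear in 18% of the receipts considered.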
  • 10. Case study: MBA for Retail
      – Leading Italian company in the fashion industry
      – Sales data from the last three years
      – More than 100 million receipts
      – The results obtained can be used as an indicator for defining new promotional initiatives, identifying optimal schemes for the layout of goods in stores, etc.
  • 11. Architecture (diagram; labels: Receipts, Association Rules, Interactive Dashboards, MBA job, Job Management Console, Email, Number of sold items & Association Rules)
  • 12. MBA Algorithm Steps (three chained MapReduce jobs)
      – Job 1. Input: list of single sold items (receipt lines). Map: receipt as key, item as value.
      – Job 2. Input: items list aggregated for receipts. Map: combinations of items inside the same receipt.
      – Job 3. Input: support of the itemsets. Calculation of all possible association rules that meet the minimum support criteria; output: association rules that meet the minimum confidence criteria.
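The three-job flow in slide 12 can be sketched in plain Python, with each function standing in for one MapReduce job. This is an in-memory illustration under stated assumptions, not the authors' Hadoop implementation: the function names, the restriction to itemsets of at most two items, and the sample receipt lines are all hypothetical, and the assignment of work to each job is one reading of the flattened diagram.

```python
from collections import defaultdict
from itertools import combinations


def job1_group_by_receipt(lines):
    """Job 1: emit (receipt_id, item) pairs, then collect the item set of each receipt."""
    baskets = defaultdict(set)
    for receipt_id, item in lines:
        baskets[receipt_id].add(item)
    return baskets


def job2_itemset_support(baskets, max_size=2):
    """Job 2: emit item combinations per receipt, then reduce counts to supports."""
    counts = defaultdict(int)
    for items in baskets.values():
        for size in range(1, max_size + 1):
            for combo in combinations(sorted(items), size):
                counts[frozenset(combo)] += 1
    total = len(baskets)
    return {itemset: n / total for itemset, n in counts.items()}


def job3_rules(supports, min_support, min_confidence):
    """Job 3: build rules from itemsets meeting min support, keep those meeting min confidence."""
    rules = []
    for itemset, sup in supports.items():
        if len(itemset) < 2 or sup < min_support:
            continue
        for k in range(1, len(itemset)):
            for antecedent in combinations(sorted(itemset), k):
                a = frozenset(antecedent)
                conf = sup / supports[a]  # confidence = supp(A and B) / supp(A)
                if conf >= min_confidence:
                    consequent = tuple(sorted(itemset - a))
                    rules.append((tuple(sorted(a)), consequent, sup, conf))
    return sorted(rules)


# Hypothetical receipt lines: (receipt_id, item category).
lines = [
    (1, "easywear"), (1, "underwear"),
    (2, "easywear"),
    (3, "underwear"), (3, "socks"),
    (4, "easywear"), (4, "underwear"),
]

baskets = job1_group_by_receipt(lines)
supports = job2_itemset_support(baskets)
rules = job3_rules(supports, min_support=0.4, min_confidence=0.6)
for antecedent, consequent, sup, conf in rules:
    print(antecedent, "->", consequent, f"support={sup:.2f}", f"confidence={conf:.2f}")
```

On a real cluster each function would be split into a map and a reduce phase and the intermediate results written to the distributed file system; the sketch only shows the data that flows from one job to the next.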
  • 13. Job Management Interface
      – Interface integrated with the standard BI tool
      – The MBA algorithm can run on different data sets
      – Each user can perform custom analyses
      – Algorithm parameters (minimum support and confidence) can be set by end users
      – Examples of different analyses:
          What types of products are sold together with a discounted item?
          Are there different association rules between products sold in city-center stores and those sold in outlets?
  • 14. Job Management Interface (screenshot; fields: Analysis description, Time filters, Point of sale filters, Product filters, Attributes used for association rules, Support & Confidence parameters, Run MBA)
  • 16. Analysis Examples
      – Analysis 1
          Period: from 01/09/2013 to 31/12/2013, marketing campaign for a new type of bra
          Scope: all Italian points of sale located in city centers
          Items: analysis across all item types except knitwear
          Parameters: min. support 35%, min. confidence 50%
          Rule found: new bra → slip, babydoll (support: 36%, confidence: 52%)
          Meaning: 36% of the receipts considered contain all those items; when the new bra is purchased, 52 times out of 100 a slip and a babydoll are also purchased.
      – Analysis 2
          Same configuration as before, but considering only points of sale in shopping centers
          Rule found: Easywear → new bra (support: 50%, confidence: 60%)
          Meaning: in shopping centers, the sales of easywear drive the sales of the new bra.
  • 17. Conclusions and future work
      – Conclusions
          Business users can now investigate in depth the effectiveness of marketing and advertising campaigns and work out whether shop windows and in-store layouts reach the desired goals.
          The Market Basket Analysis algorithm can be customized to users’ needs.
          Transparent interaction between the Hadoop cluster and the Business Intelligence platform.
      – Future work: from project to solution
          Complete framework to run complex data mining algorithms on Big Data.
          Hadoop to exploit parallel execution and the distributed file system.
          Seamless integration with standard Business Intelligence tools.
          More user independence in data integration.