SlideShare uma empresa Scribd logo
1 de 22
Web Log & Clickstream

Michel Bruley
WA - Marketing Director

March 2012
Extract from various presentations: Levene, Nagadevara, Waterson, Teradata Aster, …

www.decideo.fr/bruley
Definitions
Web Analytics as defined by Web Analytics Association :
“ Web Analytics is the measurement, collection, analysis and reporting of
Internet data for the purposes of understanding and optimizing Web
usage.”

Clickstream as defined by Internet Advertising Bureau (IAB) :
“The electronic path a user takes while navigating from site to site, and from
page to page within a site. It is a comprehensive body of data describing the
sequence of activity between a user’s browser and any other Internet
resource, such as a Web site or third party ad server”

www.decideo.fr/bruley
What is a web log?
A record of a visit to a web page
–
–
–
–
–
–

Visitor (IP address)
URL
Time of visit
Time spent on a page
Browser used
Referring URL

– Type of request
– Reply code
– Number of bytes in the
reply
– etc…

Web log analysis is a simple kind of Web analytics processes that parses
a log file from a web server, and based on the values contained in the log
file, derives indicators about who, when, and how a web server is visited.

Usually reports are generated from the log files immediately, but the log files
can alternatively be parsed to a database and reports generated on demand.

www.decideo.fr/bruley
What is a clickstream?
A record of a path through web pages
–
–
–
–
–
–

Visitor (IP address)
URL
Time of visit
Time spent on a page
Browser used
Referring URL

–
–
–
–

Type of request
Reply code
Number of bytes in the reply
Next URL

–

etc…

A clickstream is the recording of the parts of the screen a computer user clicks
on, while web browsing or using another software application.
As the user clicks anywhere in the webpage or application, the action is logged
on a client or inside the web server, as well as possibly the web browser, router,
proxy server or ad server. Clickstream analysis is useful for web activity
analysis, software testing, market research, and for analyzing employee
productivity.
A small observation on the evolution of clickstream tracking: Initial clickstream
or click path data had to be gleaned from server log files.
www.decideo.fr/bruley
Web Log: where do they come from?
Servers
– Done on most
web servers
– Standard formats
Proxy Log

Clients
 Browsers, loggers on client machine
 Must send data back
Proxies
 Similar to servers
 Hang out in between client and server

www.decideo.fr/bruley
Why are web logs relevant?


Lots of data available
– Quantitative analysis is much more fun!



User behavior, patterns
– Real users, tasks
– Or at least more realistic users and tasks



Leaving the usability lab
– Testing effect



Fast, easy, cheap
– Automatic or almost-automatic

www.decideo.fr/bruley
Some questions on Web usages


How has information been accessed?



How frequently?



What’s popular? What’s not?



How do people enter the site? Exit?



Where do people spend time?



How long do they spend there?



How do people travel within the site?



Who are the people visiting?

www.decideo.fr/bruley
Some others questions
 Structural

– What information has been added, deleted, modified,
moved?
 Usage

–
–
–
–

+ Structural
What happens when the site changes?
Does navigation change?
Does popularity change?
What about missing data?

www.decideo.fr/bruley
How do you analyze web logs?


Data Mining: task or intent unknown
– “Automated extraction of hidden predictive information
from (large) databases”
– Server log analysis
What are people doing?



Remote Usability Testing: task or intent known
– Similar to traditional lab usability testing
– Clickstream analysis
How well does the site support
what people are doing?

www.decideo.fr/bruley
Web Log File Basic Analysis


Gives statistics such as
–
–
–
–
–
–



number of hits
average hits per time period
what are the popular pages in your site
who is visiting your site
what keywords are users searching for to get to you
what is being downloaded

Log data does not disclose the visitor’s identity

www.decideo.fr/bruley
Identification of User


By IP address
– Not so reliable as IP can be dynamic
– Different users may use same IP



Through cookies
– Reliable but user may remove cookies
– Security and privacy issues



Through login
– Users have to register

www.decideo.fr/bruley
Sessionising




Time oriented (robust)
– By total duration of session
• not more than 30 minutes
– By page stay times (good for short sessions)
• not more than 10 minutes per page

Navigation oriented (good for short sessions and when
timestamps unreliable)
– Referrer is previous page in session, or
– Referrer is undefined but request within 10 secs, or
– Link from previous to current page in web site

www.decideo.fr/bruley
Mining Navigation Patterns








Each session induces a user trail through the site
A trail is a sequence of web pages followed by a user during
a session, ordered by time of access

A pattern in this context is a frequent trail
Co-occurrence of web pages is important, e.g. shoppingbasket and checkout
Use a Markov chain model

www.decideo.fr/bruley
Starting with Raw Click Stream Data

www.decideo.fr/bruley
Finding Referring Channels

www.decideo.fr/bruley
Identifies the Customer
Identifies the customer by correlating Cookies
captured during customer interactions …
referrer=http://www.facebook.com/
sc=ar78to0wdxq143lr9p9weh6yxy8318sa
pc=pgys63w0xgn102in8ms37wka8quxe74e
ac=bd76aad17448wgng0679a044cfda00e005b130000

Persistent and Network Cookies remain unchanged

www.decideo.fr/bruley
Creates a Session Definition

Tracking of Customer Interaction Behavior During
the Session
www.decideo.fr/bruley
Some Classic Measures












Clicks: The interaction between the user and the web server is measured by the
click of a mouse
Visits: The number of times a user visits a specific web site. Every new session is
counted as a new visit
Hits: Total number of server requests serviced by the server
Exits: Site exits, counted by site inactivity for more than 30 minutes
Unique Visitors: A Unique User who accesses the site in a specified period of
time.
Repeated Visitor: The average number of times a user returns to a site over a
specific time period
Page views: The view of any page by the user. A page may contain text, images,
and other online elements and may be statically or dynamically generated and
could contain single or multiple frames or screens
Sessions: IAB defines it to be an “A sequence of Internet activity made by one
user at one site. If a user makes no request from a site during a 30 minute
period of time, the next content or ad request would then constitute the
beginning of a new visit “
Unique authenticated visitors: A unique visitor who logs on to a site via a
registration method using his/her user id and password

www.decideo.fr/bruley
Some Classic Metrics


Page views per visit: Average number of page views per visit



Page views per session: Average number of page views per session



Page views per hour/day: Average number of page views per
hour/day



Clicks per session: Average number or clicks per session



Clicks per hour: Average number of clicks per hour



Time between clicks: The average duration of time spent between
two clicks



Hits per hour: Average number of hits to the web server per hour



Busy hour of the day: The highest number of hits to the web
server in a particular hour of a day

www.decideo.fr/bruley
Information from Web Analytics
Web Analytics

Personilization











System
Improvement

Site
Modification

Business
Intelligence

Usage
characteristics

How many visitors visit the page daily?
Who are the regular visitors?
What percentage of the visitors to the page are registered users?
What are the top pages that are visited on the web page?
What is the average visit time on the website?
How often does the visitor return to the site?
What is the average page depth of a visitor?
What is the geographic distribution of users of the website?

www.decideo.fr/bruley
Teradata Aster Interpret Web Log Data
Aster Data MPP Analytic Platform
SQL-MapReduce Output
Timestamp

Persistent_Coo
kie_ID

Session

Raw Log Input
Clickstream_Log

SQL-MapReduce Output
Referral

Timestamp

SQL-MapReduce
program runs inplatform

Raw Click-stream Log Data

www.decideo.fr/bruley

Persistent_Coo
kie_ID

Raw Log Input
Clickstream_Log

Session

Referral
Teradata Aster Competitive Advantage
Internet
Use Cases

Financial Services &
Insurance
Use Cases

Media &
Information
Services
Use Cases

Retail
Use Cases

• Social networking
graph analysis

•

Real-time fraud
& link analysis

• Digital marketing
attribution

• Predictive and granular
forecasting

• Crowd-sourcing

•

Tick data analysis

•

• Virality analysis

•

Trading surveillance

• Online consumer
behavior/patterns

• Content targeting

•

•

• Advanced click-stream
analysis

Multi-variate pricing
analysis for insurance

• Advanced click-stream
analysis

Advanced click-stream
analysis

•

Behavior pattern
matching

Digital media
consumer microtargeting

•

Ad optimization

• Online targeting for
personalization/
recommendations

“Analytics themselves don't constitute a strategy, but using them to optimize a distinctive business
capability certainly constitutes a strategy.”
- T. Davenport & J. Harris, Competing on Analytics: The New Science of Winning

www.decideo.fr/bruley

Mais conteúdo relacionado

Mais procurados

Netflix talk at ML Platform meetup Sep 2019
Netflix talk at ML Platform meetup Sep 2019Netflix talk at ML Platform meetup Sep 2019
Netflix talk at ML Platform meetup Sep 2019Faisal Siddiqi
 
Metrics, Engagement & Personalization
Metrics, Engagement & Personalization Metrics, Engagement & Personalization
Metrics, Engagement & Personalization Mounia Lalmas-Roelleke
 
Tutorial on metrics of user engagement -- Applications to Search & E- commerce
Tutorial on metrics of user engagement -- Applications to Search & E- commerceTutorial on metrics of user engagement -- Applications to Search & E- commerce
Tutorial on metrics of user engagement -- Applications to Search & E- commerceMounia Lalmas-Roelleke
 
Best practises for log management
Best practises for log managementBest practises for log management
Best practises for log managementBrian Honan
 
Web Mining Presentation Final
Web Mining Presentation FinalWeb Mining Presentation Final
Web Mining Presentation FinalEr. Jagrat Gupta
 
Introduction to RapidMiner Studio V7
Introduction to RapidMiner Studio V7Introduction to RapidMiner Studio V7
Introduction to RapidMiner Studio V7geraldinegray
 
Multi-Armed Bandit and Applications
Multi-Armed Bandit and ApplicationsMulti-Armed Bandit and Applications
Multi-Armed Bandit and ApplicationsSangwoo Mo
 
The Universal Recommender
The Universal RecommenderThe Universal Recommender
The Universal RecommenderPat Ferrel
 
Causality without headaches
Causality without headachesCausality without headaches
Causality without headachesBenoît Rostykus
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender systemStanley Wang
 
Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011idoguy
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation SystemsTrieu Nguyen
 
Naïve Bayes Classifier Algorithm.pptx
Naïve Bayes Classifier Algorithm.pptxNaïve Bayes Classifier Algorithm.pptx
Naïve Bayes Classifier Algorithm.pptxShubham Jaybhaye
 
Dwdm naive bayes_ankit_gadgil_027
Dwdm naive bayes_ankit_gadgil_027Dwdm naive bayes_ankit_gadgil_027
Dwdm naive bayes_ankit_gadgil_027ankitgadgil
 
An Introduction to Prometheus
An Introduction to PrometheusAn Introduction to Prometheus
An Introduction to PrometheusEvgeny Shmarnev
 
Tutorial on People Recommendations in Social Networks - ACM RecSys 2013,Hong...
Tutorial on People Recommendations in Social Networks -  ACM RecSys 2013,Hong...Tutorial on People Recommendations in Social Networks -  ACM RecSys 2013,Hong...
Tutorial on People Recommendations in Social Networks - ACM RecSys 2013,Hong...Anmol Bhasin
 

Mais procurados (20)

Netflix talk at ML Platform meetup Sep 2019
Netflix talk at ML Platform meetup Sep 2019Netflix talk at ML Platform meetup Sep 2019
Netflix talk at ML Platform meetup Sep 2019
 
Metrics, Engagement & Personalization
Metrics, Engagement & Personalization Metrics, Engagement & Personalization
Metrics, Engagement & Personalization
 
Tutorial on metrics of user engagement -- Applications to Search & E- commerce
Tutorial on metrics of user engagement -- Applications to Search & E- commerceTutorial on metrics of user engagement -- Applications to Search & E- commerce
Tutorial on metrics of user engagement -- Applications to Search & E- commerce
 
Best practises for log management
Best practises for log managementBest practises for log management
Best practises for log management
 
Web mining
Web miningWeb mining
Web mining
 
18 Data Streams
18 Data Streams18 Data Streams
18 Data Streams
 
Web Mining Presentation Final
Web Mining Presentation FinalWeb Mining Presentation Final
Web Mining Presentation Final
 
Introduction to RapidMiner Studio V7
Introduction to RapidMiner Studio V7Introduction to RapidMiner Studio V7
Introduction to RapidMiner Studio V7
 
Multi-Armed Bandit and Applications
Multi-Armed Bandit and ApplicationsMulti-Armed Bandit and Applications
Multi-Armed Bandit and Applications
 
The Universal Recommender
The Universal RecommenderThe Universal Recommender
The Universal Recommender
 
Causality without headaches
Causality without headachesCausality without headaches
Causality without headaches
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender system
 
Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation Systems
 
Naïve Bayes Classifier Algorithm.pptx
Naïve Bayes Classifier Algorithm.pptxNaïve Bayes Classifier Algorithm.pptx
Naïve Bayes Classifier Algorithm.pptx
 
Link Analysis
Link AnalysisLink Analysis
Link Analysis
 
Dwdm naive bayes_ankit_gadgil_027
Dwdm naive bayes_ankit_gadgil_027Dwdm naive bayes_ankit_gadgil_027
Dwdm naive bayes_ankit_gadgil_027
 
Chapter 7 - Deadlocks
Chapter 7 - DeadlocksChapter 7 - Deadlocks
Chapter 7 - Deadlocks
 
An Introduction to Prometheus
An Introduction to PrometheusAn Introduction to Prometheus
An Introduction to Prometheus
 
Tutorial on People Recommendations in Social Networks - ACM RecSys 2013,Hong...
Tutorial on People Recommendations in Social Networks -  ACM RecSys 2013,Hong...Tutorial on People Recommendations in Social Networks -  ACM RecSys 2013,Hong...
Tutorial on People Recommendations in Social Networks - ACM RecSys 2013,Hong...
 

Destaque

Clickstream Analysis with Spark—Understanding Visitors in Realtime by Josef A...
Clickstream Analysis with Spark—Understanding Visitors in Realtime by Josef A...Clickstream Analysis with Spark—Understanding Visitors in Realtime by Josef A...
Clickstream Analysis with Spark—Understanding Visitors in Realtime by Josef A...Spark Summit
 
Clickstream Analysis
Clickstream AnalysisClickstream Analysis
Clickstream Analysisintuitiv.de
 
Clickstream Analysis with Spark
Clickstream Analysis with Spark Clickstream Analysis with Spark
Clickstream Analysis with Spark Josef Adersberger
 

Destaque (6)

Clickstream Analysis with Spark—Understanding Visitors in Realtime by Josef A...
Clickstream Analysis with Spark—Understanding Visitors in Realtime by Josef A...Clickstream Analysis with Spark—Understanding Visitors in Realtime by Josef A...
Clickstream Analysis with Spark—Understanding Visitors in Realtime by Josef A...
 
Clickstream ppt copy
Clickstream ppt   copyClickstream ppt   copy
Clickstream ppt copy
 
Clickstream Analysis
Clickstream AnalysisClickstream Analysis
Clickstream Analysis
 
Clickstream Analysis with Spark
Clickstream Analysis with Spark Clickstream Analysis with Spark
Clickstream Analysis with Spark
 
Click Stream Analysis
Click Stream AnalysisClick Stream Analysis
Click Stream Analysis
 
clickstream analysis
 clickstream analysis clickstream analysis
clickstream analysis
 

Semelhante a Web log & clickstream

Module 2 onsite webanalytics
Module 2   onsite webanalyticsModule 2   onsite webanalytics
Module 2 onsite webanalyticsGayathri Choda
 
Understanding web site analytics
Understanding web site analyticsUnderstanding web site analytics
Understanding web site analyticsDan Morrill
 
SEMNE Google Analytics Master Class - 15 Oct 2014
SEMNE Google Analytics Master Class - 15 Oct 2014SEMNE Google Analytics Master Class - 15 Oct 2014
SEMNE Google Analytics Master Class - 15 Oct 2014Jay Murphy
 
Praxis Business School - Web Analytics - 2023.pptx
Praxis Business School - Web Analytics - 2023.pptxPraxis Business School - Web Analytics - 2023.pptx
Praxis Business School - Web Analytics - 2023.pptxManaliSandeepParab
 
Digital Analytics Customer Segmentation
 Digital Analytics  Customer Segmentation Digital Analytics  Customer Segmentation
Digital Analytics Customer SegmentationBenjaminFung4
 
Complete Google analytics document
Complete Google analytics documentComplete Google analytics document
Complete Google analytics documentParshuram Yadav
 
Google Analytics for Dummies
Google Analytics for DummiesGoogle Analytics for Dummies
Google Analytics for DummiesTim Lelek
 
Web Analytics Primer
Web Analytics PrimerWeb Analytics Primer
Web Analytics PrimerChad Richeson
 
Google Analytics
Google AnalyticsGoogle Analytics
Google AnalyticsKieran Daly
 
PPT 3 Web Analytics (1).pptx
PPT 3 Web Analytics (1).pptxPPT 3 Web Analytics (1).pptx
PPT 3 Web Analytics (1).pptxDevChaudhari15
 
Analytics Slides for IDEA Programme @Cattolica
Analytics Slides for IDEA Programme @CattolicaAnalytics Slides for IDEA Programme @Cattolica
Analytics Slides for IDEA Programme @CattolicaGiorgio Suighi
 
How To Promote Your Online Visibility Using Web Analytics. How To Promote You...
How To Promote Your Online Visibility Using Web Analytics. How To Promote You...How To Promote Your Online Visibility Using Web Analytics. How To Promote You...
How To Promote Your Online Visibility Using Web Analytics. How To Promote You...Overalia
 
Baseline Website Snapshot Audit v3
Baseline Website Snapshot Audit v3Baseline Website Snapshot Audit v3
Baseline Website Snapshot Audit v3Daniel McKean
 

Semelhante a Web log & clickstream (20)

Module 2 onsite webanalytics
Module 2   onsite webanalyticsModule 2   onsite webanalytics
Module 2 onsite webanalytics
 
Understanding web site analytics
Understanding web site analyticsUnderstanding web site analytics
Understanding web site analytics
 
SEMNE Google Analytics Master Class - 15 Oct 2014
SEMNE Google Analytics Master Class - 15 Oct 2014SEMNE Google Analytics Master Class - 15 Oct 2014
SEMNE Google Analytics Master Class - 15 Oct 2014
 
Google Analytics tutorial by Jay Murphy
Google Analytics tutorial by Jay Murphy Google Analytics tutorial by Jay Murphy
Google Analytics tutorial by Jay Murphy
 
Web Log Files
Web Log FilesWeb Log Files
Web Log Files
 
Praxis Business School - Web Analytics - 2023.pptx
Praxis Business School - Web Analytics - 2023.pptxPraxis Business School - Web Analytics - 2023.pptx
Praxis Business School - Web Analytics - 2023.pptx
 
Digital Analytics Customer Segmentation
 Digital Analytics  Customer Segmentation Digital Analytics  Customer Segmentation
Digital Analytics Customer Segmentation
 
Web analytics
Web analyticsWeb analytics
Web analytics
 
Google Analytics
Google Analytics Google Analytics
Google Analytics
 
Complete Google analytics document
Complete Google analytics documentComplete Google analytics document
Complete Google analytics document
 
Google Analytics for Dummies
Google Analytics for DummiesGoogle Analytics for Dummies
Google Analytics for Dummies
 
Web Analytics Primer
Web Analytics PrimerWeb Analytics Primer
Web Analytics Primer
 
Google Analytics
Google AnalyticsGoogle Analytics
Google Analytics
 
Web usage mining
Web usage miningWeb usage mining
Web usage mining
 
PPT 3 Web Analytics (1).pptx
PPT 3 Web Analytics (1).pptxPPT 3 Web Analytics (1).pptx
PPT 3 Web Analytics (1).pptx
 
analytics_def
analytics_defanalytics_def
analytics_def
 
Analytics Slides for IDEA Programme @Cattolica
Analytics Slides for IDEA Programme @CattolicaAnalytics Slides for IDEA Programme @Cattolica
Analytics Slides for IDEA Programme @Cattolica
 
How To Promote Your Online Visibility Using Web Analytics. How To Promote You...
How To Promote Your Online Visibility Using Web Analytics. How To Promote You...How To Promote Your Online Visibility Using Web Analytics. How To Promote You...
How To Promote Your Online Visibility Using Web Analytics. How To Promote You...
 
Google Analytics ppt
Google Analytics  pptGoogle Analytics  ppt
Google Analytics ppt
 
Baseline Website Snapshot Audit v3
Baseline Website Snapshot Audit v3Baseline Website Snapshot Audit v3
Baseline Website Snapshot Audit v3
 

Mais de Michel Bruley

Propos sur l'âme, extraits de recherches numériques
Propos sur l'âme, extraits de recherches numériquesPropos sur l'âme, extraits de recherches numériques
Propos sur l'âme, extraits de recherches numériquesMichel Bruley
 
Religion : Dieu y es-tu ? (les articles)
Religion : Dieu y es-tu ? (les articles)Religion : Dieu y es-tu ? (les articles)
Religion : Dieu y es-tu ? (les articles)Michel Bruley
 
Réflexion sur les religions : Dieu y es-tu ?
Réflexion sur les religions : Dieu y es-tu ?Réflexion sur les religions : Dieu y es-tu ?
Réflexion sur les religions : Dieu y es-tu ?Michel Bruley
 
La chute de l'Empire romain comme modèle.pdf
La chute de l'Empire romain comme modèle.pdfLa chute de l'Empire romain comme modèle.pdf
La chute de l'Empire romain comme modèle.pdfMichel Bruley
 
Synthèse sur Neuville.pdf
Synthèse sur Neuville.pdfSynthèse sur Neuville.pdf
Synthèse sur Neuville.pdfMichel Bruley
 
Propos sur des sujets qui m'ont titillé.pdf
Propos sur des sujets qui m'ont titillé.pdfPropos sur des sujets qui m'ont titillé.pdf
Propos sur des sujets qui m'ont titillé.pdfMichel Bruley
 
Propos sur les Big Data.pdf
Propos sur les Big Data.pdfPropos sur les Big Data.pdf
Propos sur les Big Data.pdfMichel Bruley
 
Georges Anselmi - 1914 - 1918 Campagnes de France et d'Orient
Georges Anselmi - 1914 - 1918 Campagnes de France et d'OrientGeorges Anselmi - 1914 - 1918 Campagnes de France et d'Orient
Georges Anselmi - 1914 - 1918 Campagnes de France et d'OrientMichel Bruley
 
Poc banking industry - Churn
Poc banking industry - ChurnPoc banking industry - Churn
Poc banking industry - ChurnMichel Bruley
 
Big Data POC in communication industry
Big Data POC in communication industryBig Data POC in communication industry
Big Data POC in communication industryMichel Bruley
 
Photos de famille 1895 1966
Photos de famille 1895   1966Photos de famille 1895   1966
Photos de famille 1895 1966Michel Bruley
 
Compilation d'autres textes de famille
Compilation d'autres textes de familleCompilation d'autres textes de famille
Compilation d'autres textes de familleMichel Bruley
 
Textes de famille concernant les guerres (1814 - 1944)
Textes de famille concernant les guerres (1814 - 1944)Textes de famille concernant les guerres (1814 - 1944)
Textes de famille concernant les guerres (1814 - 1944)Michel Bruley
 
Recette de la dinde au whisky
Recette de la dinde au whiskyRecette de la dinde au whisky
Recette de la dinde au whiskyMichel Bruley
 
Les 2 guerres de René Puig
Les 2 guerres de René PuigLes 2 guerres de René Puig
Les 2 guerres de René PuigMichel Bruley
 
Une societe se_presente
Une societe se_presenteUne societe se_presente
Une societe se_presenteMichel Bruley
 
Dossiers noirs va 4191
Dossiers noirs va 4191Dossiers noirs va 4191
Dossiers noirs va 4191Michel Bruley
 
Irfm mini guide de mauvaise conduite
Irfm mini guide de mauvaise  conduiteIrfm mini guide de mauvaise  conduite
Irfm mini guide de mauvaise conduiteMichel Bruley
 

Mais de Michel Bruley (20)

Propos sur l'âme, extraits de recherches numériques
Propos sur l'âme, extraits de recherches numériquesPropos sur l'âme, extraits de recherches numériques
Propos sur l'âme, extraits de recherches numériques
 
Religion : Dieu y es-tu ? (les articles)
Religion : Dieu y es-tu ? (les articles)Religion : Dieu y es-tu ? (les articles)
Religion : Dieu y es-tu ? (les articles)
 
Réflexion sur les religions : Dieu y es-tu ?
Réflexion sur les religions : Dieu y es-tu ?Réflexion sur les religions : Dieu y es-tu ?
Réflexion sur les religions : Dieu y es-tu ?
 
La chute de l'Empire romain comme modèle.pdf
La chute de l'Empire romain comme modèle.pdfLa chute de l'Empire romain comme modèle.pdf
La chute de l'Empire romain comme modèle.pdf
 
Synthèse sur Neuville.pdf
Synthèse sur Neuville.pdfSynthèse sur Neuville.pdf
Synthèse sur Neuville.pdf
 
Propos sur des sujets qui m'ont titillé.pdf
Propos sur des sujets qui m'ont titillé.pdfPropos sur des sujets qui m'ont titillé.pdf
Propos sur des sujets qui m'ont titillé.pdf
 
Propos sur les Big Data.pdf
Propos sur les Big Data.pdfPropos sur les Big Data.pdf
Propos sur les Big Data.pdf
 
Sun tzu
Sun tzuSun tzu
Sun tzu
 
Georges Anselmi - 1914 - 1918 Campagnes de France et d'Orient
Georges Anselmi - 1914 - 1918 Campagnes de France et d'OrientGeorges Anselmi - 1914 - 1918 Campagnes de France et d'Orient
Georges Anselmi - 1914 - 1918 Campagnes de France et d'Orient
 
Poc banking industry - Churn
Poc banking industry - ChurnPoc banking industry - Churn
Poc banking industry - Churn
 
Big Data POC in communication industry
Big Data POC in communication industryBig Data POC in communication industry
Big Data POC in communication industry
 
Photos de famille 1895 1966
Photos de famille 1895   1966Photos de famille 1895   1966
Photos de famille 1895 1966
 
Compilation d'autres textes de famille
Compilation d'autres textes de familleCompilation d'autres textes de famille
Compilation d'autres textes de famille
 
J'aime BRULEY
J'aime BRULEYJ'aime BRULEY
J'aime BRULEY
 
Textes de famille concernant les guerres (1814 - 1944)
Textes de famille concernant les guerres (1814 - 1944)Textes de famille concernant les guerres (1814 - 1944)
Textes de famille concernant les guerres (1814 - 1944)
 
Recette de la dinde au whisky
Recette de la dinde au whiskyRecette de la dinde au whisky
Recette de la dinde au whisky
 
Les 2 guerres de René Puig
Les 2 guerres de René PuigLes 2 guerres de René Puig
Les 2 guerres de René Puig
 
Une societe se_presente
Une societe se_presenteUne societe se_presente
Une societe se_presente
 
Dossiers noirs va 4191
Dossiers noirs va 4191Dossiers noirs va 4191
Dossiers noirs va 4191
 
Irfm mini guide de mauvaise conduite
Irfm mini guide de mauvaise  conduiteIrfm mini guide de mauvaise  conduite
Irfm mini guide de mauvaise conduite
 

Último

BT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptx
BT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptxBT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptx
BT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptxNeo4j
 
Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024Hiroshi SHIBATA
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101vincent683379
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGDSC PJATK
 
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfThe Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfFIDO Alliance
 
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfSimplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfFIDO Alliance
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfFIDO Alliance
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe中 央社
 
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...FIDO Alliance
 
Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireExakis Nelite
 
Intro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджераIntro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджераMark Opanasiuk
 
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...FIDO Alliance
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...CzechDreamin
 
ECS 2024 Teams Premium - Pretty Secure
ECS 2024   Teams Premium - Pretty SecureECS 2024   Teams Premium - Pretty Secure
ECS 2024 Teams Premium - Pretty SecureFemke de Vroome
 
TopCryptoSupers 12thReport OrionX May2024
TopCryptoSupers 12thReport OrionX May2024TopCryptoSupers 12thReport OrionX May2024
TopCryptoSupers 12thReport OrionX May2024Stephen Perrenod
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024Lorenzo Miniero
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaCzechDreamin
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...marcuskenyatta275
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentationyogeshlabana357357
 
A Business-Centric Approach to Design System Strategy
A Business-Centric Approach to Design System StrategyA Business-Centric Approach to Design System Strategy
A Business-Centric Approach to Design System StrategyUXDXConf
 

Último (20)

BT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptx
BT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptxBT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptx
BT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptx
 
Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfThe Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
 
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfSimplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe
 
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
 
Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - Questionnaire
 
Intro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджераIntro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджера
 
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
ECS 2024 Teams Premium - Pretty Secure
ECS 2024   Teams Premium - Pretty SecureECS 2024   Teams Premium - Pretty Secure
ECS 2024 Teams Premium - Pretty Secure
 
TopCryptoSupers 12thReport OrionX May2024
TopCryptoSupers 12thReport OrionX May2024TopCryptoSupers 12thReport OrionX May2024
TopCryptoSupers 12thReport OrionX May2024
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
A Business-Centric Approach to Design System Strategy
A Business-Centric Approach to Design System StrategyA Business-Centric Approach to Design System Strategy
A Business-Centric Approach to Design System Strategy
 

Web log & clickstream

  • 1. Web Log & Clickstream Michel Bruley WA - Marketing Director March 2012 Extract from various presentations: Levene, Nagadevara, Waterson, Teradata Aster, … www.decideo.fr/bruley
  • 2. Definitions Web Analytics as defined by Web Analytics Association : “ Web Analytics is the measurement, collection, analysis and reporting of Internet data for the purposes of understanding and optimizing Web usage.” Clickstream as defined by Internet Advertising Bureau (IAB) : “The electronic path a user takes while navigating from site to site, and from page to page within a site. It is a comprehensive body of data describing the sequence of activity between a user’s browser and any other Internet resource, such as a Web site or third party ad server” www.decideo.fr/bruley
  • 3. What is a web log? A record of a visit to a web page – – – – – – Visitor (IP address) URL Time of visit Time spent on a page Browser used Referring URL – Type of request – Reply code – Number of bytes in the reply – etc… Web log analysis is a simple kind of Web analytics processes that parses a log file from a web server, and based on the values contained in the log file, derives indicators about who, when, and how a web server is visited. Usually reports are generated from the log files immediately, but the log files can alternatively be parsed to a database and reports generated on demand. www.decideo.fr/bruley
  • 4. What is a clickstream? A record of a path through web pages – – – – – – Visitor (IP address) URL Time of visit Time spent on a page Browser used Referring URL – – – – Type of request Reply code Number of bytes in the reply Next URL – etc… A clickstream is the recording of the parts of the screen a computer user clicks on, while web browsing or using another software application. As the user clicks anywhere in the webpage or application, the action is logged on a client or inside the web server, as well as possibly the web browser, router, proxy server or ad server. Clickstream analysis is useful for web activity analysis, software testing, market research, and for analyzing employee productivity. A small observation on the evolution of clickstream tracking: Initial clickstream or click path data had to be gleaned from server log files. www.decideo.fr/bruley
  • 5. Web Log: where do they come from? Servers – Done on most web servers – Standard formats Proxy Log Clients  Browsers, loggers on client machine  Must send data back Proxies  Similar to servers  Hang out in between client and server www.decideo.fr/bruley
  • 6. Why are web logs relevant?  Lots of data available – Quantitative analysis is much more fun!  User behavior, patterns – Real users, tasks – Or at least more realistic users and tasks  Leaving the usability lab – Testing effect  Fast, easy, cheap – Automatic or almost-automatic www.decideo.fr/bruley
  • 7. Some questions on Web usages  How has information been accessed?  How frequently?  What’s popular? What’s not?  How do people enter the site? Exit?  Where do people spend time?  How long do they spend there?  How do people travel within the site?  Who are the people visiting? www.decideo.fr/bruley
  • 8. Some others questions  Structural – What information has been added, deleted, modified, moved?  Usage – – – – + Structural What happens when the site changes? Does navigation change? Does popularity change? What about missing data? www.decideo.fr/bruley
  • 9. How do you analyze web logs?  Data Mining: task or intent unknown – “Automated extraction of hidden predictive information from (large) databases” – Server log analysis What are people doing?  Remote Usability Testing: task or intent known – Similar to traditional lab usability testing – Clickstream analysis How well does the site support what people are doing? www.decideo.fr/bruley
  • 10. Web Log File Basic Analysis  Gives statistics such as – – – – – –  number of hits average hits per time period what are the popular pages in your site who is visiting your site what keywords are users searching for to get to you what is being downloaded Log data does not disclose the visitor’s identity www.decideo.fr/bruley
  • 11. Identification of User  By IP address – Not so reliable as IP can be dynamic – Different users may use same IP  Through cookies – Reliable but user may remove cookies – Security and privacy issues  Through login – Users have to register www.decideo.fr/bruley
  • 12. Sessionising   Time oriented (robust) – By total duration of session • not more than 30 minutes – By page stay times (good for short sessions) • not more than 10 minutes per page Navigation oriented (good for short sessions and when timestamps unreliable) – Referrer is previous page in session, or – Referrer is undefined but request within 10 secs, or – Link from previous to current page in web site www.decideo.fr/bruley
  • 13. Mining Navigation Patterns      Each session induces a user trail through the site A trail is a sequence of web pages followed by a user during a session, ordered by time of access A pattern in this context is a frequent trail Co-occurrence of web pages is important, e.g. shoppingbasket and checkout Use a Markov chain model www.decideo.fr/bruley
  • 14. Starting with Raw Click Stream Data www.decideo.fr/bruley
  • 16. Identifies the Customer Identifies the customer by correlating Cookies captured during customer interactions … referrer=http://www.facebook.com/ sc=ar78to0wdxq143lr9p9weh6yxy8318sa pc=pgys63w0xgn102in8ms37wka8quxe74e ac=bd76aad17448wgng0679a044cfda00e005b130000 Persistent and Network Cookies remain unchanged www.decideo.fr/bruley
  • 17. Creates a Session Definition Tracking of Customer Interaction Behavior During the Session www.decideo.fr/bruley
  • 18. Some Classic Measures          Clicks: The interaction between the user and the web server is measured by the click of a mouse Visits: The number of times a user visits a specific web site. Every new session is counted as a new visit Hits: Total number of server requests serviced by the server Exits: Site exits, counted by site inactivity for more than 30 minutes Unique Visitors: A Unique User who accesses the site in a specified period of time. Repeated Visitor: The average number of times a user returns to a site over a specific time period Page views: The view of any page by the user. A page may contain text, images, and other online elements and may be statically or dynamically generated and could contain single or multiple frames or screens Sessions: IAB defines it to be an “A sequence of Internet activity made by one user at one site. If a user makes no request from a site during a 30 minute period of time, the next content or ad request would then constitute the beginning of a new visit “ Unique authenticated visitors: A unique visitor who logs on to a site via a registration method using his/her user id and password www.decideo.fr/bruley
  • 19. Some Classic Metrics  Page views per visit: Average number of page views per visit  Page views per session: Average number of page views per session  Page views per hour/day: Average number of page views per hour/day  Clicks per session: Average number or clicks per session  Clicks per hour: Average number of clicks per hour  Time between clicks: The average duration of time spent between two clicks  Hits per hour: Average number of hits to the web server per hour  Busy hour of the day: The highest number of hits to the web server in a particular hour of a day www.decideo.fr/bruley
  • 20. Information from Web Analytics Web Analytics Personilization         System Improvement Site Modification Business Intelligence Usage characteristics How many visitors visit the page daily? Who are the regular visitors? What percentage of the visitors to the page are registered users? What are the top pages that are visited on the web page? What is the average visit time on the website? How often does the visitor return to the site? What is the average page depth of a visitor? What is the geographic distribution of users of the website? www.decideo.fr/bruley
  • 21. Teradata Aster Interpret Web Log Data Aster Data MPP Analytic Platform SQL-MapReduce Output Timestamp Persistent_Coo kie_ID Session Raw Log Input Clickstream_Log SQL-MapReduce Output Referral Timestamp SQL-MapReduce program runs inplatform Raw Click-stream Log Data www.decideo.fr/bruley Persistent_Coo kie_ID Raw Log Input Clickstream_Log Session Referral
  • 22. Teradata Aster Competitive Advantage Internet Use Cases Financial Services & Insurance Use Cases Media & Information Services Use Cases Retail Use Cases • Social networking graph analysis • Real-time fraud & link analysis • Digital marketing attribution • Predictive and granular forecasting • Crowd-sourcing • Tick data analysis • • Virality analysis • Trading surveillance • Online consumer behavior/patterns • Content targeting • • • Advanced click-stream analysis Multi-variate pricing analysis for insurance • Advanced click-stream analysis Advanced click-stream analysis • Behavior pattern matching Digital media consumer microtargeting • Ad optimization • Online targeting for personalization/ recommendations “Analytics themselves don't constitute a strategy, but using them to optimize a distinctive business capability certainly constitutes a strategy.” - T. Davenport & J. Harris, Competing on Analytics: The New Science of Winning www.decideo.fr/bruley