A common chorus from museum professionals is how challenging it is to make data-driven decisions with which to improve their programs. Popular tools such as Google Analytics are intuitive and seemingly easy-to-use, yet when the time comes to use data to measure a program's stated goals, too often the main question surrounding the data is "So what?" This workshop will focus on bringing clarity to this challenge. Presented at MCN2012, on 11/7/12.
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Â
Cut Through the Web Analytics Fog: Using GA Data Grabber to Act on Google Analytics Data
1. Cut Through the Fog: How to Act on Your
Museum's Website Data
MCN 2012
11/7/2012
Brian Alpert
Web Analytics and SEM Analyst
Smithsonian Institution
Elena Villaespesa
Web Analyst and Producer
Tate
2. 2
Table of Contents, Part 1
âą The Web Analytics Process
âą GA Data Grabber
âą GADG Custom Dashboard
âą Case Studies
âą Hands-on Practice
âą GA Best Practices / Tips and Tricks
5. 5
What do you mean, âSo whatâ?
âą The typical proxy for website success is quantity of stuff.
â Aggregated âbig numbersâ
â Pageviews / visits / visitors
âą Aggregated data doesnât indicate success.
â It doesn't reflect a websiteâs efficiency or quality.
â It doesn't reflect a website userâs experience.
â It doesn't help us understand how to improve the website.
âą We canât act on this data.
âą âAll data in aggregate is crap.â
â Google âAnalytics Evangelistâ Avinash Kaushik
6. 6
What web analytics is really about:
Furthering Program Goals
Reuters: Toru Hanai
7. 7
Articulating your goals is the hard part
ï§ Sometimes your museum's goals:
ï§ Arenât precisely articulated.
ï§ Arenât articulated at all (!)
ï§ Are too broad to meaningfully measure.
âAn institution for the increase and
diffusion of knowledge."
-- James Smithson
Source: Smithsonian Institution Archives
8. 8
Your goal: storyteller
ï§ Use data to tell a story.
ï§ Management loves stories.
ï§ The âSo what?â factor melts
away because it makes sense:
ï§ What was happening.
ï§ What it meant.
ï§ What you did.
ï§ Whatâs happening now.
Source: http://www.squidoo.com
9. A systematic, step-by-step process
ï§ Articulate your programâs goals.
ï§ Decide strategies to achieve those goals.
ï§ Decide tactics to pursue the strategies.
ï§ Decide what and how to measure.
ï§ Benchmark to get a sense of whatâs normal.
ï§ The process isnât âone size fits allâ!
ï§ Interpretation and consensus-building are important .
9
10. Start by articulating specific goals
ï§ Not too many!
ï§ Express what your museum is trying to accomplish.
ï§ Distill high-level goals into more specific sub-goals:
ï§ âIncrease influenceâ >> âBecome the definitive source on
Smithsonian history.â
ï§ Making the broad goal specific makes it easier to identify
strategies and tactics.
ï§ By being specific, strategies can emerge.
ï§ Articulate goals & next steps on your own.
ï§ What do you think they are?
ï§ Work with management to redefine and finalize.
10
11. 11
Determine strategies & tactics
ï§ Strategies â the plans you make to achieve the goals.
ï§ Marketing, social media are strategic pursuits.
ï§ Tactics â the things you do to advance the strategy.
ï§ Advertising, search engine optimization (SEO) are tactics.
ï§ Per the example:
ï§ Goal: âBecome the definitive source on Smithsonian history.â
ï§ Strategy: search engine performance.
ï§ Rationale: search engines have sophisticated algorithms that
determine which websites are highly relevant, or, "authoritative.â
ï§ Tactic: SEO.
ï§ Search metrics become proxies for authority.
12. 12
Decide how to measure your tactics
ï§ Choose a few measurements.
ï§ Trend them over time.
ï§ Per the example:
ï§ Measure: segment history-specific content in GA
ï§ Directories (site.edu/history)
ï§ Dedicated content (site.edu/historyblog)
ï§ Google Analytics custom variables.
ï§ Apply SEO metrics to that content:
ï§ Number of keywords referred per month.
ï§ Number of history pages drawing visits from search engines.
13. 13
You canât set targets w/o benchmarks
ï§ Set targets and timeframes based on benchmarks.
ï§ You need at least six months of data.
ï§ Data fluctuates; is often seasonal.
ï§ Six months is just an opinion.
ï§ It also depends on how much traffic your site gets.
ï§ Peer data is valuable, but hard to come by.
ï§ Balance your targets with factors beyond your control:
ï§ Are the improvements youâre seeking known to be difficult to achieve?
ï§ What is the current status of your program (i.e., brand new, mature)?
ï§ How much resources will you have to devote to implementing tactics?
14. 14
Keep it simple
ï§ Donât do too much.
ï§ Once youâve selected your strategies
and tactics, minimize the number of
measurements.
ï§ If they turn-out to be inconclusive,
refine or change them!
ï§ Itâs an ongoing process.
15. 15
Letâs connect all that to Google Analytics
ï§ Youâve made progress:
ï§ Your goals/strategies/tactics are set.
ï§ Your measurements are chosen.
ï§ You want to use GA data to understand:
ï§ Whatâs happening.
ï§ How it impacts your program.
ï§ What you can do.
ï§ Google Analytics Custom Dashboard
ï§ Enables segmentation and trending.
ï§ Datapoints mostly relate to âengagement.â
ï§ GA Data Grabber
ï§ Flexible, Excel-based GA automation tool.
ï§ Enables you to see trends better than in the GA U-I.
16. 16
GADG Custom Dashboard
ï§ âEngagementâ oriented metrics
ï§ Visit Frequency
ï§ Visit Length
ï§ Visit Depth
ï§ New vs. Returning Visits
ï§ Bounce Rate
ï§ Conversion Rate
ï§ Search Engines
ï§ A foundation to make data actionable
ï§ âKey Trends and Insightsâ
ï§ âImpact on Site/Museumâ
ï§ âSteps Being Takenâ
The easily updated, trended data is
what makes the dashboard powerful.
17. Dashboard pages are designed:
1) To help orient you toward action
2) To communicate with management
17
Summary defines
and puts the
metric in context
Chart shows
segmented data
tracked and
trended over time.
Suggestions for
Possible
Additional
Segments.
Red/Yellow/Green
status marker
shows at-a-glance
each metricâs
status.
âActionâ section answers
the question âSo what?â
âą Key Trends and Insights
âą Impact on Website / Unit
âą Steps Being Taken
Profile data pulls
automatically from
GADG; shows
metrics at-a-glance.
GADG
Instructions; show
how to create the
reports from
scratch.
18. 18
GA Data Grabber (GADG)
ï§ Populates the dashboard with data.
ï§ Extracts data from the Google
Analytics API.
ï§ Easy-to-use and customize.
ï§ Exceptional charting capabilities.
ï§ 14 days free.
ï§ $300 per year.
ï§ Limited documentation and
support.
ï§ Excel for Windows
2003/2007/2010/2011.
ï§ Excel 2011 for Mac (slow!)
http://gadatagrabbertool.com
19. 19
Customized GA Data Grabber
ï§ Ten custom reports that
work with the Dashboard
ï§ Do not rename
GADataGrabber.xlsm !
ï§ âQuerystorageâ is unhidden
ï§ Change date ranges
ï§ Change profile #'s
ï§ Change advanced segments
ï§ Make changes by hand
ï§ Do not change cell formatting.
The âquerystorageâ tab is the key to
editing the dashboardâs GADG reports.
21. All Visits data tells a nice story...
21
Minimal loyalty
group (purple)
downward trend
indicates
improving content
engagement
High loyalty
group (blue)
upward trend
indicates same
This Impact of this Data on the Site or Program
âą This good-looking chart may indicate high content engagement and/or perceived value
âą This data may correlate to increasing conversion behaviors
Acting on this Data
âą Identify moderate and high loyalty pages as a means of duplicating, or improving others
âą Examining conversion behaviors of these segments may yield add'l insights
âą Correlating high bounce rate pages to one-time visits may yield add'l insights
âą Test different content types in an attempt to move 'minimal' visitors into 'moderate' group
Key Trends
and Insights
22. 22
This Impact of this Data on the Site or Program
âą Organic search listings are driving poorly-targeted traffic
âą Will result in decreased organic search performance over time
Acting on this Data
âą Refocus title tags, meta-description tags and page content for important pages
âą Perform link analysis to see where other SEO improvements can be made
Minimal
frequency group
upward trend
indicates organic
listings are not
appropriately
targeted
Moderate
frequency group
downward trend
indicates same
High frequency
group trending
slightly downward,
in contrast to
previous chartâs
upward slope
Key Trends
and Insights
âŠBut applying segmentation tells a different story
25. 25
Wikipedia Case Study
ï§ One Smithsonian unit worked closely with Wikipedia, incorporating a
range of their content within the online encyclopedia.
ï§ The purpose was to make their content more accessible for younger
students, those less sophisticated than the academics and professional
researchers who comprise one of the siteâs core audience segments.
ï§ The hypothesis was that by doing so, this group would have their needs
met more quickly and easily, without having to navigate the Smithsonian
websiteâs more advanced, research-oriented structure.
ï§ The data shows that the needs of the group referred from Wikipedia â a
likely starting point for younger students â were largely being met by the
content posted on Wikipedia.
ï§ They were increasingly less likely to need to visit the Smithsonian site
many times.
ï§ This is in contrast to the relatively stable trend of the overall population of
visitors shown on the âall visitsâ slide (26).
28. Is the trend statistically significant?
âą Control Limits Definition
âą Avinashâs blog post
âą âInstant Cognitionâ (Clint
Ivy) blog post
Four of thirteen datapoints
are outside of the upper
and lower control limit
ranges, 30% of the data.
Is that enough to say yes,
thatâs a statistically
significant trend? The
answer is subjective, but
arguably so.
29. 29
Wikipedia Case Study (contâd)
âą The next slide shows an additional datapoint which supports the
hypothesis.
âą The group referred by Wikipedia was increasingly less likely to need to
ask the Smithsonian staff for help via the siteâs contact form.
âą In addition to indicating that the Wikipedia-referred audience was finding
the content it needed on Wikipedia, the project resulted in a reduced
burden on the Smithsonian staffers who attend to these requests.
âą The same datapoint for two other referral segments are shown, returning
visitors, and visitors from search engines, showing marked contrast with
the Wikipedia segment.
32. ï§ The two files that work
together are:
ï§ GaDataGrabber.xlsx (donât
rename this one)
ï§ GADG_Custom_Dashboard_
template.xlsx
ï§ Save the files â donât open
them from the email!
ï§ Store both spreadsheets
in the same directory.
ï§ Find and select your
profile.
ï§ Note the Profile ID number
on the right.
32
Getting Started Click here to synch
with GA.
New GADG Reports
are programmed
here.
The âREFRESH ALL
REPORTSâ button
runs the custom
dashboard reports.
Clicking âRUN THE
REPORTâ does not
refresh the
dashboard â it adds
new reports to
GADG.
Profile ID Numbers.
Your GA Profiles.
33. 33
Letâs run your dashboards!
ï§ Login to GA.
ï§ Open and login to GaDataGrabber.xlsx
ï§ Make sure macros are enabled.
ï§ Customize âquerystorageâ with your profile number â row 67.
ï§ Refresh all reports.
ï§ Open GADG_Custom_Dashboard_template.xlsx
ï§ Data should be updated in the dashboard.
ï§ Letâs look at some examples. Select the first profile ID cell
(C67), then click at the top of
the spreadsheet. Edit-in your
profile ID by hand.
Donât risk altering the cell
formatting by selecting the cell
and doing copy/paste.
Filling to the right is OK.
34. 2) Click at the top of the
spreadsheet to hand edit your
profile ID number.
34
Detail: customizing profile numbers
Altering the cell formatting in
âquerystorageâ breaks the macros.
1) Select cell C67.
3) Filling the rest of the row to
the right is OK.
Back
36. 36
Working with GADG
ï§ Clicking the big,
green RUN THE
REPORT button
adds new
worksheet-reports to
your copy of GADG.
ï§ They are named
âreport1â, âreport2â,
etc.
ï§ They are easily
removed by clicking
the red âRemove
sheetâ button on the
worksheet.
37. 37
Working with GADG
ï§ On the customized
GADG, the all-important
âquerystorageâ tab is
already showing.
ï§ If youâre working from a
clean copy of GADG,
unhide this tab by right-
clicking on the tabs at the
bottom
ï§ Select âUnhideâ and then
âquerystorage.â
38. 38
Working with GADG
ï§ Editing reports in querystorage:
ï§ Advanced segments (rows 19,20)
ï§ Custom segment #âs are
obtained by creating a one-off
report using that segment, and
finding it in querystorage
ï§ Dates (rows 26,27,28)
ï§ Profile ID numbers (row 67)
ï§ ############ is normal
ï§ You can run reports from the
âAnalyticsâ page OR querystorage
ï§ Keep track of important
querystorage elements
ï§ Profile ID numbers
ï§ Segment names and numbers
39. 39
Working with GADG
ï§ To âsaveâ a snapshot of
your work and continue
experimenting, rename
your GADG files and
Dashboard files.
ï§ To ensure the renamed
Dashboard doesnât
automatically update,
follow these steps:
ï§ Save as
ï§ Data
ï§ Edit Links
ï§ Select the dashboard
you want to save
ï§ Execute âBreak Linksâ
40. 40
Troubleshooting
ï§ Never double-click the files from an email, always âSave-as.â Opening from an email
breaks the spreadsheet relationships.
ï§ Be sure youâre logged into GA as yourself.
ï§ In querystorage, always edit by clicking at the top of the spreadsheet, and either
editing by hand, or selecting and copying, then selecting and pasting in another cell.
Never select the cell itself and copy/paste. Filling to the right is OK.
ï§ Do not change the cell formatting in querystorage; that will break the macros.
ï§ Peculiarities can sometimes be attributed to Googleâs API, and not GADG.
ï§ Data labeled â(other | other)â sometimes appears â occasionally data has to be hand-
manipulated to get it properly into the Dashboard charts.
ï§ Occasionally a blank worksheet remains after refreshing (âreport1â) â it can be
deleted.
ï§ If you change profiles and re-run a report, GADG occasionally leaves the previous
profile name in the worksheet chart. I removed profile names from the charts, but they
sometimes reappear.
ï§ Iâm happy to answer questions, but the real expert is GADG creator Mikael Thuneberg.
Post questions to his Google Group â automateanalytics. Heâs pretty responsive.
42. 42
No-filters (raw) profile
ï§ Create a profile that has no filtering of any kind, a so-
called ârawâ profile
ï§ Leave this profile alone â it serves as a backup
ï§ Protection against unintended consequence
ï§ Possible names:
ï§ Unit/profile name (backup)
ï§ Unit/profile name (unfiltered data)
43. 43
Filter-out internal-traffic
ï§ If you want to exclude visitors surfing from within the SI network
ï§ Admin >> Profiles >> Filters >> +New Filter >> External Traffic Only
44. 44
Measure only traffic taking place on your site
ï§ Scraping and re-publishing website content is a common practice.
ï§ Those sites exist to serve Google Adsense ads and make money.
ï§ Unfortunately they also
scrape your GA âUAâ
account number.
ï§ Their traffic goes into GA
as your traffic!
ï§ Include all domains, if
you use others than
si.edu.
ï§ Filter pattern:
ï§ si.edu
ï§ si.edu|example.com
45. 45
Use annotations
ï§ Super easy â a great way to know at-a-glance what
happened on your site, launches, promos, etc.
ï§ You think youâre gonna remember â youâre not!
46. 46
Custom segment: social media visitors
ï§ Regular expression:
bit.ly|bitly|blogfaves.com|blogger|bloglines|blogspot|delicious|digg|facebook|feedburner|flickr|f
oursquare|goo.gl|groups.google|groups.yahoo.com|hootsuite|instagram|linkedin|m.facebook.
com|newsgator|ow.ly|pinterest|plus.google|plus.url.google.com|reddit|stumbleupon|t.co|techn
orati|tweetdeck|twitter|typepad|tumblr|wordpress|youtube
The Regex can
also be edited to
include smaller
groups, or types
of social sites,
i.e., facebook
and twitter.
Keeping it up to
date is up to
you!
47. 47
Custom segment: engaged visits
These visits:
ï§ Were
deeper
than three
pages.
ï§ Were
longer than
three
minutes.
48. 48
Custom segment: highly-engaged visits
These visits:
ï§ Were deeper than
four pages.
ï§ Were in frequency
more than two
times in the
measured period.
ï§ Were longer than
two minutes.
These values can be
tweaked for your site,
of course!
A nice blog post on
this topic is here.
49. 49
GAâs (relatively new) âSocialâ reports
ï§ Make data-driven decisions for social media
programs:
ï§ Identify the value of traffic coming from social
sites.
ï§ Measure how they lead to direct or âassistedâ
conversions.
ï§ Understand social activities happening on and off
site.
ï§ Some of the reports require programming goals
and assigning values
ï§ Understanding âlikesâ and âsharesâ involves
tagging with the _trackSocial tag
ï§ Googleâs âsocial analyticsâ guide
ï§ Googleâs âsocial reportsâ launch blog post
50. 50
Social conversions
ï§ âSocial performance at a glance and its impact on conversions.â
ï§ âWhich goals are being impacted by social media.â
ï§ Requires adding chunks of code to all your pages.
51. 51
Social sources
ï§ âFind out how visitors from different sources behave.â
ï§ This is similar to the custom advanced segment.
Other reports:
âą Social Plugins data
âą "Activity Stream" (lacks
facebook & twitter)
52. 52
Resources
ï§ GA Data Grabber
ï§ http://www.gadatagrabbertool.com/
ï§ Automate Analytics Google Group
ï§ http://groups.google.com/group/automateanalytics/topics
ï§ Avinash Kaushikâs âOccamâs Razorâ
ï§ http://kaushik.net/avinash
ï§ Lunametrics blog
ï§ http://www.lunametrics.com/blog
ï§ Google Analytics Blog
ï§ http://analytics.blogspot.com/
ï§ Slides and future dashboards will be made available.
ï§ Send me email (alpertb@si.edu)
ï§ Questions welcome!
55. Frequency of Visits (âLoyaltyâ)
ï§ Useful engagement metric for
content sites.
ï§ Provides insight into how
compelling and/or valuable
content is perceived to be.
ï§ Frequent visitors are:
ï§ More likely to be loyal visitors
ï§ Exhibit higher levels of engagement
than infrequent, and especially one-
time visitors
55
56. Length of Visits
ï§ Useful engagement metric for
content sites.
ï§ Measures quality based on the
amount of time spent
consuming content.
ï§ Segmentation is critical.
ï§ Segments of time
ï§ Types of content consumed, or
activities pursued.
ï§ For example, spending lots of
time searching may indicate a
poor website search
experience.
56
57. Depth of Visits
ï§ Useful engagement metric
for content sites.
ï§ Number of pages per visit.
ï§ Helps understand content
consumption patterns,
which can help paint the
picture of the longer term
relationships visitors have
with the website.
57
58. Segmented Bounce Rate
ï§ Number of times a person
visits one site page and leaves
without clicking, divided by the
total number of visits.
ï§ Easily misinterpreted as
always negative.
ï§ Sometimes a high bounce rate
is desirable or expected.
ï§ Visits to single-use
informational pages
(location/hours)
ï§ Blog visits
58
59. Goal Conversions â Primary and Secondary
ï§ Any high-value behavior that
supports the site's goals.
ï§ PDF downloads
ï§ Videos watched
ï§ Donations
ï§ Completed orders
ï§ Conversions indicate higher
engagement, deeper commitment
than viewing pages.
ï§ "Conversion Rate" is the number
of conversions divided by visitors.
59
Notas do Editor
One Smithsonian unit worked closely with Wikipedia, incorporating a range of their content within the online encyclopedia. The purpose was to make their content more accessible for younger students, those less sophisticated than the academics and professional researchers who comprise one of the siteâs core audience segments. The hypothesis was that by doing so, this group would have their needs met more quickly and easily, without having to navigate the Smithsonian websiteâs more advanced, research-oriented structure. The data shows that the needs of the group referred from Wikipedia â a likely starting point for younger students â were largely being met by the content posted on Wikipedia. They were increasingly less likely to need to visit the Smithsonian site many times. This is in contrast to the relatively stable trend of the overall population of visitors shown on the previous slide.
Four of thirteen datapoints are outside of the upper and lower control limit ranges, 30% of the data. Is that enough to say yes, thatâs a statistically significant trend? The answer is subjective, but arguably so.
An additional datapoint to support the previously stated hypothesis. The group referred by Wikipedia was increasingly less likely to need to ask the Smithsonian staff for help via the siteâs contact form. In addition to indicating that the Wikipedia-referred audience was finding the content it needed on Wikipedia, the project resulted in a reduced burden on the Smithsonian staffers who attend to these requests. The same datapoint for two other referral segments are shown, returning visitors, and visitors from search engines, showing marked contrast with the Wikipedia segment.