SlideShare uma empresa Scribd logo
1 de 148
Baixar para ler offline
Doing SEO for large
websites.
Working on large websites, or large number of websites. Let’s
talk about SEO at scale, with the enterprise.
31 m
1.8m & 220kg
17x larger
4,913x heavier
1,084 T
x2
x2 x2
x2 x3
x2
SLOWER DIFFICULT TO WORK WITH
Working in a large organisation
Working with
data
Technical Foundation
Minimising Risk
Scaling Content
Reporting
Working in a large organisation
Scaling Content
Reporting
Working with
data
Technical Foundation
Minimising Risk
Working in a large organisation
Scaling Content
Working with
data
Technical Foundation
Reporting
Minimising Risk
Templates
Getting (& processing) data
Finding technical issues
Preventing technical issues
Templates
I would like a 1000 problems please.
“Please fix all 18,304 pages”
LIES
LIES
5
6
Category
Home page
Product
Contact Us
Obviously different
Small product number Main category page
Out of stock product
Extremes
Facet category page Reviews Page 2
Same page different URL
Country
County
City
Area/District
Street
Getting (& processing) data
Impressions week by week for new content
Pre change Post change
Clicks pre and post change for site sections
Competing pages for a set of terms
SLOWER DIFFICULT TO WORK WITH
SAMPLING
SLOWER DIFFICULT TO WORK WITH
SAMPLING
LIMITS
1,000 rows at a time
SLOWER DIFFICULT TO WORK WITH
SAMPLING
LIMITS LAG
SLOWER DIFFICULT TO WORK WITH
SAMPLING
LIMITS LAG
SEGMENTATION
Search console properties for a
large brand.
Register all the things.
5 sub-folders provided
260% more keywords
Part 1: Data Studio
Part 2: Day by day data
Part 3: Python
Part 4: Data warehousing
Get
Get, Analyse
Get, Store, Analyse, Report
Part 1: Data Studio
Part 2: Day by day data
Part 3: Python
Part 4: Data warehousing
Data studio for extracting
data
● Add a data source
● Create a table for it.
● Download the table.
With both GA & GSC, you’ll get
everything in the table, no
paginating.
Part 1: Data Studio
Part 2: Day by day data
Part 3: Python
Part 4: Data warehousing
Day by day data
To get even more data we have
to get it day by day.
● bit.ly/search-console-dat
a-downloader
This bit is Search Console only.
Part 1: Data Studio
Part 2: Day by day data
Part 3: Python
Part 4: Data warehousing
Getting data from APIs
Pull down your analytics data.
● Daily_google_analytics_v3
● Getting search console
data from the API
Getting data from APIs
Pull down your analytics data.
● Daily_google_analytics_v3
● Getting search console
data from the API
Getting started with pandas:
● Pandas tutorial with
ranking data
Getting data from APIs
Pull down your analytics data.
● Daily_google_analytics_v3
● Getting search console
data from the API
Getting started with pandas:
● Pandas tutorial with
ranking data
As a workflow I’d highly
recommend Jupyter notebooks
for getting started.
● Why use jupyter
notebooks?
● SearchLove Video (paid)
SEO Pythonistas
A memorial and soon to be
collection of Hamlet’s excellent
work.
SEO Pythonistas - In loving
memory of Hamlet Batista
@DataChaz
Part 1: Data Studio
Part 2: Day by day data
Part 3: Python
Part 4: Data warehousing
Analyse
Store data
Get data
Report
Analyse
Store data
Get data
Report
Takes time & space.
Analyse
Store data
Get data
Report
Takes time & space.
A developer could do it.
Rolling your own
JC Chouinard has built a series
of excellent granular tutorials
which walk you through setting
up one on your own machine.
Link.
Off the shelf
Get in touch with me!
I run Piped Out which is
software for building SEO data
warehouses.
Finding technical issues
Part 1: Templates
Part 2: Logs
Part 3: Crawling Big
Part 1: Templates
Part 2: Logs
Part 3: Crawling Big
Not the same fields as a crawl.
No page title for example.
● Crawling & indexing problems
● Crawling & indexing problems
● Measuring freshness
Time until article crawled
● Crawling & indexing problems
● Measuring freshness
● Prioritisation
● Crawling & indexing problems
● Measuring freshness
● Prioritisation
● Monitoring website changes (e.g. migrations)
Jun ‘19
Apr ‘19 Aug‘19 Oct ‘19
200 301 302
Status codes in product pages
Jun ‘19
Apr ‘19 Aug‘19 Oct ‘19
200 301 302
Status codes in product pages
ELK
● Crawling & indexing problems
● Measuring freshness
● Prioritisation
● Monitoring website changes (e.g. migrations)
● Debugging
Hi x
I’m {x} from {y} and we’ve been asked to do some log analysis to understand better how Google is behaving on the website and I was hoping you could help with some questions about the log set-up (as well as with getting the logs!).
What time period do we want?
What we’d ideally like is 3-6 months of historical logs for the website. Our goal is to look at all the different pages search engines are crawling on our website, discover where they’re spending their time, the status code errors they’re
finding etc.
We can absolutely do analysis with a month or so (we've even done it with just a week or two), but it means we lose historical context and obviously we're more likely to lose things on a larger side.
There are also some things that are really helpful for us to know when getting logs.
Do the logs have any personal information in?
We’re just concerned about the various search crawler bots like Google and Bing, we don’t need any logs from users, so any logs with emails, or telephone numbers etc. can be removed.
Can we get logs from as close to the edge as possible?
It's pretty likely you've got a couple different layers of your network that might log. Ideally we want those from as close to the edge as possible. This prevents a couple issues:
● If you've got caching going on, like a CDN or Varnish then if we get logs from after them, we won't see any of the requests they answer.
● If you've got a load balancer distributing to several servers sometimes the external IP gets lost (perhaps X-Forwarded-For isn't working), which we need to verify Googlebot or we accidentally only get logs from a couple
servers.
Are there any sub parts of your site which log to a different place?
Have you got anything like an embedded Wordpress blog which logs to a different location? If so then we’ll need those logs as well. (Although of course if you're sending us CDN logs this won't matter.)
How do you log hostname and protocol?
It's very helpful for us to be able to see hostname & protocol. How do you distinguish those in the log files?
Do you log HTTP & HTTPS to separate files? Do you log hostname at all?
This is one of the problems that's often solved by getting logs closer to the edge, as while many servers won't give you those by default, load balancers and CDN's often will.
Where would we like the logs?
In an ideal world, they would be files in an S3 bucket and we can draw them down from there. If possible, we'd also ask that multiple files aren't zipped together for upload, because that makes processing harder. (No problem with
compressed logs just, just zipping multiple log files into a single archive).
Is there anything else we should know?
Best,
{x}
Part 1: Templates
Part 2: Logs
Part 3: Crawling Big
Sampling your crawl
● Limit your crawl
percentage per template.
i.e.
● 20% to product pages
● 30% to category pages
Low memory crawler
Runs locally on your machine
and allows you to crawl with a
very low memory footprint.
Doesn’t render JS or process
data however.
Run SF in the cloud
You can purchase a super high
memory computer in the cloud,
install SF on it and run it at
maximum speed.
Preventing technical issues
Search console properties for a
large brand.
Part 1: Manually crawling
Part 2: Automating assertions
Part 3: Unit testing
Change detection with SF
Change detection with SF
Part 1: Manually crawling change detection
Part 2: Automating assertions
Part 3: Unit testing
<meta name="robots" content="noindex">
<meta name="robots" content="noindex,nofollow">
<meta name="robots" content="noindex">
Is it different?
Is it the value I want?
Is it different?
<meta name="robots" content="noindex,nofollow">
<meta name="robots" content="noindex">
Element Equals
Title Big Brown Shoe - £12.99 - Example.com
Status Code 200
H1 Big Brown Shoe
Canonical <link rel="canonical" href="https:/
/example.com/product/big-brown-shoe" />
CSS Selector: #review-counter Any number
CSS Selector: #product-data {
"@context": "https:/
/schema.org/",
"@type": "Product",
"name": "Big Brown Shoe",
"description": "The biggest brownest show you can find.",
"sku": "0446310786",
"mpn": "925872",
}
Asserting with Google sheets
Asserting with Google sheets
Part 1: Manually crawling
Part 2: Automating assertions
Part 3: Unit testing
Unit tests
Create code Test code Deployment
Create code Test code Deployment
All our hard work.
Create code Test code Deployment
All our hard work.
Create code Test code Deployment
endtest.io
Conclusions
@dom_woodman
bit.ly/seo-for-large-websites
www.pipedout.com
@dom_woodman

Mais conteúdo relacionado

Mais procurados

How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021
How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021
How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021Chris Green
 
How to Succeed in B2B SEO
How to Succeed in B2B SEOHow to Succeed in B2B SEO
How to Succeed in B2B SEODominic Woodman
 
Headless SEO: Optimising Next Gen Sites | brightonSEO 2021
Headless SEO: Optimising Next Gen Sites | brightonSEO 2021Headless SEO: Optimising Next Gen Sites | brightonSEO 2021
Headless SEO: Optimising Next Gen Sites | brightonSEO 2021Alex Wright
 
Hey Googlebot, did you cache that ?
Hey Googlebot, did you cache that ?Hey Googlebot, did you cache that ?
Hey Googlebot, did you cache that ?Petra Kis-Herczegh
 
How to scale SEO work NOBODY wants to do (including your competitors) to rapi...
How to scale SEO work NOBODY wants to do (including your competitors) to rapi...How to scale SEO work NOBODY wants to do (including your competitors) to rapi...
How to scale SEO work NOBODY wants to do (including your competitors) to rapi...Hamlet Batista
 
Brighton SEO July 2021 How JavaScript is preventing you from passing Core W...
Brighton SEO July 2021   How JavaScript is preventing you from passing Core W...Brighton SEO July 2021   How JavaScript is preventing you from passing Core W...
Brighton SEO July 2021 How JavaScript is preventing you from passing Core W...Izabela Wisniewska
 
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...Catalyst
 
Hreflang - why and how and why not for International SEO
Hreflang - why and how and why not for International SEOHreflang - why and how and why not for International SEO
Hreflang - why and how and why not for International SEOGerry White
 
rel canonical audit BrightonSEO September 2018
rel canonical audit BrightonSEO September 2018rel canonical audit BrightonSEO September 2018
rel canonical audit BrightonSEO September 2018Mark Thomas
 
The New Renaissance of JavaScript
The New Renaissance of JavaScriptThe New Renaissance of JavaScript
The New Renaissance of JavaScriptHamlet Batista
 
SearchLeeds 2018 - Craig Campbell - How to fix the most common technical SEO ...
SearchLeeds 2018 - Craig Campbell - How to fix the most common technical SEO ...SearchLeeds 2018 - Craig Campbell - How to fix the most common technical SEO ...
SearchLeeds 2018 - Craig Campbell - How to fix the most common technical SEO ...Branded3
 
Solving Complex JavaScript Issues and Leveraging Semantic HTML5
Solving Complex JavaScript Issues and Leveraging Semantic HTML5Solving Complex JavaScript Issues and Leveraging Semantic HTML5
Solving Complex JavaScript Issues and Leveraging Semantic HTML5Hamlet Batista
 
SMX East - SEO Tools Panel
SMX East - SEO Tools PanelSMX East - SEO Tools Panel
SMX East - SEO Tools PanelAbby Hamilton
 
Three site speed optimisation tips to make your website REALLY fast - Brighto...
Three site speed optimisation tips to make your website REALLY fast - Brighto...Three site speed optimisation tips to make your website REALLY fast - Brighto...
Three site speed optimisation tips to make your website REALLY fast - Brighto...Bastian Grimm
 
An SEO's Guide to Website Migrations | Faye Watt | BrightonSEO's Advanced Tec...
An SEO's Guide to Website Migrations | Faye Watt | BrightonSEO's Advanced Tec...An SEO's Guide to Website Migrations | Faye Watt | BrightonSEO's Advanced Tec...
An SEO's Guide to Website Migrations | Faye Watt | BrightonSEO's Advanced Tec...Faye Watt
 
Use Google Docs to monitor SEO by pulling in Google Analytics #BrightonSEO
Use Google Docs to monitor SEO by pulling in Google Analytics #BrightonSEOUse Google Docs to monitor SEO by pulling in Google Analytics #BrightonSEO
Use Google Docs to monitor SEO by pulling in Google Analytics #BrightonSEOGerry White
 
TFM - Using Google Tag Manager for ecom
TFM - Using Google Tag Manager for ecom TFM - Using Google Tag Manager for ecom
TFM - Using Google Tag Manager for ecom Gerry White
 
How Testing Stops Product Changes Harming Your Organic Performance - Brighton...
How Testing Stops Product Changes Harming Your Organic Performance - Brighton...How Testing Stops Product Changes Harming Your Organic Performance - Brighton...
How Testing Stops Product Changes Harming Your Organic Performance - Brighton...Emily Potter
 
How Marketers Can Work With Code
How Marketers Can Work With CodeHow Marketers Can Work With Code
How Marketers Can Work With CodeKaizen
 

Mais procurados (20)

How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021
How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021
How to construct your own SEO a b split tests (for free) - BrightonSEO July 2021
 
How to Succeed in B2B SEO
How to Succeed in B2B SEOHow to Succeed in B2B SEO
How to Succeed in B2B SEO
 
Headless SEO: Optimising Next Gen Sites | brightonSEO 2021
Headless SEO: Optimising Next Gen Sites | brightonSEO 2021Headless SEO: Optimising Next Gen Sites | brightonSEO 2021
Headless SEO: Optimising Next Gen Sites | brightonSEO 2021
 
Hey Googlebot, did you cache that ?
Hey Googlebot, did you cache that ?Hey Googlebot, did you cache that ?
Hey Googlebot, did you cache that ?
 
How to scale SEO work NOBODY wants to do (including your competitors) to rapi...
How to scale SEO work NOBODY wants to do (including your competitors) to rapi...How to scale SEO work NOBODY wants to do (including your competitors) to rapi...
How to scale SEO work NOBODY wants to do (including your competitors) to rapi...
 
Brighton SEO July 2021 How JavaScript is preventing you from passing Core W...
Brighton SEO July 2021   How JavaScript is preventing you from passing Core W...Brighton SEO July 2021   How JavaScript is preventing you from passing Core W...
Brighton SEO July 2021 How JavaScript is preventing you from passing Core W...
 
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...
 
Hreflang - why and how and why not for International SEO
Hreflang - why and how and why not for International SEOHreflang - why and how and why not for International SEO
Hreflang - why and how and why not for International SEO
 
rel canonical audit BrightonSEO September 2018
rel canonical audit BrightonSEO September 2018rel canonical audit BrightonSEO September 2018
rel canonical audit BrightonSEO September 2018
 
The New Renaissance of JavaScript
The New Renaissance of JavaScriptThe New Renaissance of JavaScript
The New Renaissance of JavaScript
 
SearchLeeds 2018 - Craig Campbell - How to fix the most common technical SEO ...
SearchLeeds 2018 - Craig Campbell - How to fix the most common technical SEO ...SearchLeeds 2018 - Craig Campbell - How to fix the most common technical SEO ...
SearchLeeds 2018 - Craig Campbell - How to fix the most common technical SEO ...
 
Solving Complex JavaScript Issues and Leveraging Semantic HTML5
Solving Complex JavaScript Issues and Leveraging Semantic HTML5Solving Complex JavaScript Issues and Leveraging Semantic HTML5
Solving Complex JavaScript Issues and Leveraging Semantic HTML5
 
SMX East - SEO Tools Panel
SMX East - SEO Tools PanelSMX East - SEO Tools Panel
SMX East - SEO Tools Panel
 
Three site speed optimisation tips to make your website REALLY fast - Brighto...
Three site speed optimisation tips to make your website REALLY fast - Brighto...Three site speed optimisation tips to make your website REALLY fast - Brighto...
Three site speed optimisation tips to make your website REALLY fast - Brighto...
 
Screaming Frog PPT
Screaming Frog PPTScreaming Frog PPT
Screaming Frog PPT
 
An SEO's Guide to Website Migrations | Faye Watt | BrightonSEO's Advanced Tec...
An SEO's Guide to Website Migrations | Faye Watt | BrightonSEO's Advanced Tec...An SEO's Guide to Website Migrations | Faye Watt | BrightonSEO's Advanced Tec...
An SEO's Guide to Website Migrations | Faye Watt | BrightonSEO's Advanced Tec...
 
Use Google Docs to monitor SEO by pulling in Google Analytics #BrightonSEO
Use Google Docs to monitor SEO by pulling in Google Analytics #BrightonSEOUse Google Docs to monitor SEO by pulling in Google Analytics #BrightonSEO
Use Google Docs to monitor SEO by pulling in Google Analytics #BrightonSEO
 
TFM - Using Google Tag Manager for ecom
TFM - Using Google Tag Manager for ecom TFM - Using Google Tag Manager for ecom
TFM - Using Google Tag Manager for ecom
 
How Testing Stops Product Changes Harming Your Organic Performance - Brighton...
How Testing Stops Product Changes Harming Your Organic Performance - Brighton...How Testing Stops Product Changes Harming Your Organic Performance - Brighton...
How Testing Stops Product Changes Harming Your Organic Performance - Brighton...
 
How Marketers Can Work With Code
How Marketers Can Work With CodeHow Marketers Can Work With Code
How Marketers Can Work With Code
 

Semelhante a SEO for Large/Enterprise Websites - Data & Tech Side

Log analysis and pro use cases for search marketers online version (1)
Log analysis and pro use cases for search marketers online version (1)Log analysis and pro use cases for search marketers online version (1)
Log analysis and pro use cases for search marketers online version (1)David Sottimano
 
SearchLove Boston 2017 | Dom Woodman | How to Get Insight From Your Logs
SearchLove Boston 2017 | Dom Woodman | How to Get Insight From Your LogsSearchLove Boston 2017 | Dom Woodman | How to Get Insight From Your Logs
SearchLove Boston 2017 | Dom Woodman | How to Get Insight From Your LogsDistilled
 
LatJUG. Google App Engine
LatJUG. Google App EngineLatJUG. Google App Engine
LatJUG. Google App Enginedenis Udod
 
SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
SearchLove London 2016 | Dom Woodman | How to Get Insight From Your LogsSearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
SearchLove London 2016 | Dom Woodman | How to Get Insight From Your LogsDistilled
 
A Guide to Log Analysis with Big Query
A Guide to Log Analysis with Big QueryA Guide to Log Analysis with Big Query
A Guide to Log Analysis with Big QueryDominic Woodman
 
Ruby on Rails Performance Tuning. Make it faster, make it better (WindyCityRa...
Ruby on Rails Performance Tuning. Make it faster, make it better (WindyCityRa...Ruby on Rails Performance Tuning. Make it faster, make it better (WindyCityRa...
Ruby on Rails Performance Tuning. Make it faster, make it better (WindyCityRa...John McCaffrey
 
Windy cityrails performance_tuning
Windy cityrails performance_tuningWindy cityrails performance_tuning
Windy cityrails performance_tuningJohn McCaffrey
 
Why Analytics important for any business - EBriks Infotech
 Why Analytics important for any business - EBriks Infotech Why Analytics important for any business - EBriks Infotech
Why Analytics important for any business - EBriks InfotechEBriks Infotech Pvt. Ltd.
 
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017Demi Ben-Ari
 
Website & Internet + Performance testing
Website & Internet + Performance testingWebsite & Internet + Performance testing
Website & Internet + Performance testingRoman Ananev
 
Data Engineer's Lunch 90: Migrating SQL Data with Arcion
Data Engineer's Lunch 90: Migrating SQL Data with ArcionData Engineer's Lunch 90: Migrating SQL Data with Arcion
Data Engineer's Lunch 90: Migrating SQL Data with ArcionAnant Corporation
 
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...Demi Ben-Ari
 
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...Codemotion
 
Migration Best Practices - SEOkomm 2018
Migration Best Practices - SEOkomm 2018Migration Best Practices - SEOkomm 2018
Migration Best Practices - SEOkomm 2018Bastian Grimm
 
Content migration for sitecore
Content migration for sitecoreContent migration for sitecore
Content migration for sitecoreSurendra Sharma
 
Why Analytics is Important for Any Business - EBriks Infotech
Why Analytics is Important for Any Business - EBriks InfotechWhy Analytics is Important for Any Business - EBriks Infotech
Why Analytics is Important for Any Business - EBriks InfotechEBriks Infotech Pvt. Ltd.
 
Datasets, APIs, and Web Scraping
Datasets, APIs, and Web ScrapingDatasets, APIs, and Web Scraping
Datasets, APIs, and Web ScrapingDamian T. Gordon
 
Optimization 2020 | Using Edge SEO For Technical Issues ft. Dan Taylor
Optimization 2020 | Using Edge SEO For Technical Issues ft. Dan TaylorOptimization 2020 | Using Edge SEO For Technical Issues ft. Dan Taylor
Optimization 2020 | Using Edge SEO For Technical Issues ft. Dan TaylorDan Taylor
 
Google BigQuery is the future of Analytics! (Google Developer Conference)
Google BigQuery is the future of Analytics! (Google Developer Conference)Google BigQuery is the future of Analytics! (Google Developer Conference)
Google BigQuery is the future of Analytics! (Google Developer Conference)Rasel Rana
 

Semelhante a SEO for Large/Enterprise Websites - Data & Tech Side (20)

Log analysis and pro use cases for search marketers online version (1)
Log analysis and pro use cases for search marketers online version (1)Log analysis and pro use cases for search marketers online version (1)
Log analysis and pro use cases for search marketers online version (1)
 
SearchLove Boston 2017 | Dom Woodman | How to Get Insight From Your Logs
SearchLove Boston 2017 | Dom Woodman | How to Get Insight From Your LogsSearchLove Boston 2017 | Dom Woodman | How to Get Insight From Your Logs
SearchLove Boston 2017 | Dom Woodman | How to Get Insight From Your Logs
 
LatJUG. Google App Engine
LatJUG. Google App EngineLatJUG. Google App Engine
LatJUG. Google App Engine
 
SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
SearchLove London 2016 | Dom Woodman | How to Get Insight From Your LogsSearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
 
A Guide to Log Analysis with Big Query
A Guide to Log Analysis with Big QueryA Guide to Log Analysis with Big Query
A Guide to Log Analysis with Big Query
 
Ruby on Rails Performance Tuning. Make it faster, make it better (WindyCityRa...
Ruby on Rails Performance Tuning. Make it faster, make it better (WindyCityRa...Ruby on Rails Performance Tuning. Make it faster, make it better (WindyCityRa...
Ruby on Rails Performance Tuning. Make it faster, make it better (WindyCityRa...
 
Windy cityrails performance_tuning
Windy cityrails performance_tuningWindy cityrails performance_tuning
Windy cityrails performance_tuning
 
Why Analytics important for any business - EBriks Infotech
 Why Analytics important for any business - EBriks Infotech Why Analytics important for any business - EBriks Infotech
Why Analytics important for any business - EBriks Infotech
 
SEARCH Y - Bastian Grimm - Migrations Best Practices
SEARCH Y - Bastian Grimm -  Migrations Best PracticesSEARCH Y - Bastian Grimm -  Migrations Best Practices
SEARCH Y - Bastian Grimm - Migrations Best Practices
 
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017
 
Website & Internet + Performance testing
Website & Internet + Performance testingWebsite & Internet + Performance testing
Website & Internet + Performance testing
 
Data Engineer's Lunch 90: Migrating SQL Data with Arcion
Data Engineer's Lunch 90: Migrating SQL Data with ArcionData Engineer's Lunch 90: Migrating SQL Data with Arcion
Data Engineer's Lunch 90: Migrating SQL Data with Arcion
 
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
 
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
 
Migration Best Practices - SEOkomm 2018
Migration Best Practices - SEOkomm 2018Migration Best Practices - SEOkomm 2018
Migration Best Practices - SEOkomm 2018
 
Content migration for sitecore
Content migration for sitecoreContent migration for sitecore
Content migration for sitecore
 
Why Analytics is Important for Any Business - EBriks Infotech
Why Analytics is Important for Any Business - EBriks InfotechWhy Analytics is Important for Any Business - EBriks Infotech
Why Analytics is Important for Any Business - EBriks Infotech
 
Datasets, APIs, and Web Scraping
Datasets, APIs, and Web ScrapingDatasets, APIs, and Web Scraping
Datasets, APIs, and Web Scraping
 
Optimization 2020 | Using Edge SEO For Technical Issues ft. Dan Taylor
Optimization 2020 | Using Edge SEO For Technical Issues ft. Dan TaylorOptimization 2020 | Using Edge SEO For Technical Issues ft. Dan Taylor
Optimization 2020 | Using Edge SEO For Technical Issues ft. Dan Taylor
 
Google BigQuery is the future of Analytics! (Google Developer Conference)
Google BigQuery is the future of Analytics! (Google Developer Conference)Google BigQuery is the future of Analytics! (Google Developer Conference)
Google BigQuery is the future of Analytics! (Google Developer Conference)
 

Mais de Dominic Woodman

19 Lessons I learned from a year of SEO split testing
19 Lessons I learned from a year of SEO split testing19 Lessons I learned from a year of SEO split testing
19 Lessons I learned from a year of SEO split testingDominic Woodman
 
Information Architecture for SEOs - Matching intent to pages & internal linki...
Information Architecture for SEOs - Matching intent to pages & internal linki...Information Architecture for SEOs - Matching intent to pages & internal linki...
Information Architecture for SEOs - Matching intent to pages & internal linki...Dominic Woodman
 
Debugging SEO - Language & Breaking Down
Debugging SEO - Language & Breaking DownDebugging SEO - Language & Breaking Down
Debugging SEO - Language & Breaking DownDominic Woodman
 
How a year of SEO split testing changed how I thought SEO worked
How a year of SEO split testing changed how I thought SEO workedHow a year of SEO split testing changed how I thought SEO worked
How a year of SEO split testing changed how I thought SEO workedDominic Woodman
 
Matching Keywords to Pages - Information Architecture
Matching Keywords to Pages - Information ArchitectureMatching Keywords to Pages - Information Architecture
Matching Keywords to Pages - Information ArchitectureDominic Woodman
 
Split Testing for SEO - 9 Months of Learning
Split Testing for SEO - 9 Months of LearningSplit Testing for SEO - 9 Months of Learning
Split Testing for SEO - 9 Months of LearningDominic Woodman
 
What is AMP and do I care?
What is AMP and do I care?What is AMP and do I care?
What is AMP and do I care?Dominic Woodman
 

Mais de Dominic Woodman (7)

19 Lessons I learned from a year of SEO split testing
19 Lessons I learned from a year of SEO split testing19 Lessons I learned from a year of SEO split testing
19 Lessons I learned from a year of SEO split testing
 
Information Architecture for SEOs - Matching intent to pages & internal linki...
Information Architecture for SEOs - Matching intent to pages & internal linki...Information Architecture for SEOs - Matching intent to pages & internal linki...
Information Architecture for SEOs - Matching intent to pages & internal linki...
 
Debugging SEO - Language & Breaking Down
Debugging SEO - Language & Breaking DownDebugging SEO - Language & Breaking Down
Debugging SEO - Language & Breaking Down
 
How a year of SEO split testing changed how I thought SEO worked
How a year of SEO split testing changed how I thought SEO workedHow a year of SEO split testing changed how I thought SEO worked
How a year of SEO split testing changed how I thought SEO worked
 
Matching Keywords to Pages - Information Architecture
Matching Keywords to Pages - Information ArchitectureMatching Keywords to Pages - Information Architecture
Matching Keywords to Pages - Information Architecture
 
Split Testing for SEO - 9 Months of Learning
Split Testing for SEO - 9 Months of LearningSplit Testing for SEO - 9 Months of Learning
Split Testing for SEO - 9 Months of Learning
 
What is AMP and do I care?
What is AMP and do I care?What is AMP and do I care?
What is AMP and do I care?
 

Último

BDSM⚡Call Girls in Sector 144 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 144 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 144 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 144 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
Kraft Mac and Cheese campaign presentation
Kraft Mac and Cheese campaign presentationKraft Mac and Cheese campaign presentation
Kraft Mac and Cheese campaign presentationtbatkhuu1
 
Unraveling the Mystery of the Hinterkaifeck Murders.pptx
Unraveling the Mystery of the Hinterkaifeck Murders.pptxUnraveling the Mystery of the Hinterkaifeck Murders.pptx
Unraveling the Mystery of the Hinterkaifeck Murders.pptxelizabethella096
 
Google 3rd-Party Cookie Deprecation [Update] + 5 Best Strategies
Google 3rd-Party Cookie Deprecation [Update] + 5 Best StrategiesGoogle 3rd-Party Cookie Deprecation [Update] + 5 Best Strategies
Google 3rd-Party Cookie Deprecation [Update] + 5 Best StrategiesSearch Engine Journal
 
Digital-Marketing-Into-by-Zoraiz-Ahmad.pptx
Digital-Marketing-Into-by-Zoraiz-Ahmad.pptxDigital-Marketing-Into-by-Zoraiz-Ahmad.pptx
Digital-Marketing-Into-by-Zoraiz-Ahmad.pptxZACGaming
 
Aryabhata I, II of mathematics of both.pptx
Aryabhata I, II of mathematics of both.pptxAryabhata I, II of mathematics of both.pptx
Aryabhata I, II of mathematics of both.pptxtegevi9289
 
Five Essential Tools for International SEO - Natalia Witczyk - SearchNorwich 15
Five Essential Tools for International SEO - Natalia Witczyk - SearchNorwich 15Five Essential Tools for International SEO - Natalia Witczyk - SearchNorwich 15
Five Essential Tools for International SEO - Natalia Witczyk - SearchNorwich 15SearchNorwich
 
Situation Analysis | Management Company.
Situation Analysis | Management Company.Situation Analysis | Management Company.
Situation Analysis | Management Company.DanielaQuiroz63
 
Call Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCRCall Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCRSapana Sha
 
Branding strategies of new company .pptx
Branding strategies of new company .pptxBranding strategies of new company .pptx
Branding strategies of new company .pptxVikasTiwari846641
 
Instant Digital Issuance: An Overview With Critical First Touch Best Practices
Instant Digital Issuance: An Overview With Critical First Touch Best PracticesInstant Digital Issuance: An Overview With Critical First Touch Best Practices
Instant Digital Issuance: An Overview With Critical First Touch Best PracticesMedia Logic
 
VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...
VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...
VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...aditipandeya
 
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Onlineanilsa9823
 
Social Samosa Guidebook for SAMMIES 2024.pdf
Social Samosa Guidebook for SAMMIES 2024.pdfSocial Samosa Guidebook for SAMMIES 2024.pdf
Social Samosa Guidebook for SAMMIES 2024.pdfSocial Samosa
 
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...ChesterYang6
 
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO Success
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO SuccessBrighton SEO April 2024 - The Good, the Bad & the Ugly of SEO Success
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO SuccessVarn
 

Último (20)

Digital Strategy Master Class - Andrew Rupert
Digital Strategy Master Class - Andrew RupertDigital Strategy Master Class - Andrew Rupert
Digital Strategy Master Class - Andrew Rupert
 
BDSM⚡Call Girls in Sector 144 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 144 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 144 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 144 Noida Escorts >༒8448380779 Escort Service
 
Kraft Mac and Cheese campaign presentation
Kraft Mac and Cheese campaign presentationKraft Mac and Cheese campaign presentation
Kraft Mac and Cheese campaign presentation
 
Unraveling the Mystery of the Hinterkaifeck Murders.pptx
Unraveling the Mystery of the Hinterkaifeck Murders.pptxUnraveling the Mystery of the Hinterkaifeck Murders.pptx
Unraveling the Mystery of the Hinterkaifeck Murders.pptx
 
Google 3rd-Party Cookie Deprecation [Update] + 5 Best Strategies
Google 3rd-Party Cookie Deprecation [Update] + 5 Best StrategiesGoogle 3rd-Party Cookie Deprecation [Update] + 5 Best Strategies
Google 3rd-Party Cookie Deprecation [Update] + 5 Best Strategies
 
Digital-Marketing-Into-by-Zoraiz-Ahmad.pptx
Digital-Marketing-Into-by-Zoraiz-Ahmad.pptxDigital-Marketing-Into-by-Zoraiz-Ahmad.pptx
Digital-Marketing-Into-by-Zoraiz-Ahmad.pptx
 
Aryabhata I, II of mathematics of both.pptx
Aryabhata I, II of mathematics of both.pptxAryabhata I, II of mathematics of both.pptx
Aryabhata I, II of mathematics of both.pptx
 
Five Essential Tools for International SEO - Natalia Witczyk - SearchNorwich 15
Five Essential Tools for International SEO - Natalia Witczyk - SearchNorwich 15Five Essential Tools for International SEO - Natalia Witczyk - SearchNorwich 15
Five Essential Tools for International SEO - Natalia Witczyk - SearchNorwich 15
 
How to Create a Social Media Plan Like a Pro - Jordan Scheltgen
How to Create a Social Media Plan Like a Pro - Jordan ScheltgenHow to Create a Social Media Plan Like a Pro - Jordan Scheltgen
How to Create a Social Media Plan Like a Pro - Jordan Scheltgen
 
Situation Analysis | Management Company.
Situation Analysis | Management Company.Situation Analysis | Management Company.
Situation Analysis | Management Company.
 
Call Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCRCall Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCR
 
Creator Influencer Strategy Master Class - Corinne Rose Guirgis
Creator Influencer Strategy Master Class - Corinne Rose GuirgisCreator Influencer Strategy Master Class - Corinne Rose Guirgis
Creator Influencer Strategy Master Class - Corinne Rose Guirgis
 
Branding strategies of new company .pptx
Branding strategies of new company .pptxBranding strategies of new company .pptx
Branding strategies of new company .pptx
 
Instant Digital Issuance: An Overview With Critical First Touch Best Practices
Instant Digital Issuance: An Overview With Critical First Touch Best PracticesInstant Digital Issuance: An Overview With Critical First Touch Best Practices
Instant Digital Issuance: An Overview With Critical First Touch Best Practices
 
VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...
VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...
VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...
 
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Online
 
Social Samosa Guidebook for SAMMIES 2024.pdf
Social Samosa Guidebook for SAMMIES 2024.pdfSocial Samosa Guidebook for SAMMIES 2024.pdf
Social Samosa Guidebook for SAMMIES 2024.pdf
 
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...
 
No Cookies No Problem - Steve Krull, Be Found Online
No Cookies No Problem - Steve Krull, Be Found OnlineNo Cookies No Problem - Steve Krull, Be Found Online
No Cookies No Problem - Steve Krull, Be Found Online
 
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO Success
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO SuccessBrighton SEO April 2024 - The Good, the Bad & the Ugly of SEO Success
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO Success
 

SEO for Large/Enterprise Websites - Data & Tech Side

  • 1. Doing SEO for large websites. Working on large websites, or large number of websites. Let’s talk about SEO at scale, with the enterprise.
  • 2.
  • 3.
  • 4.
  • 5.
  • 6.
  • 12.
  • 13. x2
  • 14. x2 x2
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23. SLOWER DIFFICULT TO WORK WITH
  • 24.
  • 25. Working in a large organisation Working with data Technical Foundation Minimising Risk Scaling Content Reporting
  • 26. Working in a large organisation Scaling Content Reporting Working with data Technical Foundation Minimising Risk
  • 27. Working in a large organisation Scaling Content Working with data Technical Foundation Reporting Minimising Risk
  • 32.
  • 34.
  • 35.
  • 36.
  • 37. I would like a 1000 problems please.
  • 38. “Please fix all 18,304 pages”
  • 39.
  • 40.
  • 41.
  • 42.
  • 43.
  • 44. LIES
  • 46.
  • 48. Small product number Main category page Out of stock product Extremes
  • 49. Facet category page Reviews Page 2 Same page different URL
  • 51.
  • 53. Impressions week by week for new content
  • 54. Pre change Post change Clicks pre and post change for site sections
  • 55. Competing pages for a set of terms
  • 56. SLOWER DIFFICULT TO WORK WITH SAMPLING
  • 57.
  • 58.
  • 59.
  • 60. SLOWER DIFFICULT TO WORK WITH SAMPLING LIMITS
  • 61. 1,000 rows at a time
  • 62.
  • 63. SLOWER DIFFICULT TO WORK WITH SAMPLING LIMITS LAG
  • 64.
  • 65. SLOWER DIFFICULT TO WORK WITH SAMPLING LIMITS LAG SEGMENTATION
  • 66. Search console properties for a large brand.
  • 67. Register all the things.
  • 69. Part 1: Data Studio Part 2: Day by day data Part 3: Python Part 4: Data warehousing Get Get, Analyse Get, Store, Analyse, Report
  • 70. Part 1: Data Studio Part 2: Day by day data Part 3: Python Part 4: Data warehousing
  • 71. Data studio for extracting data ● Add a data source ● Create a table for it. ● Download the table. With both GA & GSC, you’ll get everything in the table, no paginating.
  • 72. Part 1: Data Studio Part 2: Day by day data Part 3: Python Part 4: Data warehousing
  • 73. Day by day data To get even more data we have to get it day by day. ● bit.ly/search-console-dat a-downloader This bit is Search Console only.
  • 74. Part 1: Data Studio Part 2: Day by day data Part 3: Python Part 4: Data warehousing
  • 75. Getting data from APIs Pull down your analytics data. ● Daily_google_analytics_v3 ● Getting search console data from the API
  • 76. Getting data from APIs Pull down your analytics data. ● Daily_google_analytics_v3 ● Getting search console data from the API Getting started with pandas: ● Pandas tutorial with ranking data
  • 77. Getting data from APIs Pull down your analytics data. ● Daily_google_analytics_v3 ● Getting search console data from the API Getting started with pandas: ● Pandas tutorial with ranking data As a workflow I’d highly recommend Jupyter notebooks for getting started. ● Why use jupyter notebooks? ● SearchLove Video (paid)
  • 78. SEO Pythonistas A memorial and soon to be collection of Hamlet’s excellent work. SEO Pythonistas - In loving memory of Hamlet Batista @DataChaz
  • 79. Part 1: Data Studio Part 2: Day by day data Part 3: Python Part 4: Data warehousing
  • 83.
  • 84.
  • 86. Rolling your own JC Chouinard has built a series of excellent granular tutorials which walk you through setting up one on your own machine. Link.
  • 87. Off the shelf Get in touch with me! I run Piped Out which is software for building SEO data warehouses.
  • 89. Part 1: Templates Part 2: Logs Part 3: Crawling Big
  • 90.
  • 91. Part 1: Templates Part 2: Logs Part 3: Crawling Big
  • 92.
  • 93. Not the same fields as a crawl. No page title for example.
  • 94.
  • 95. ● Crawling & indexing problems
  • 96. ● Crawling & indexing problems ● Measuring freshness
  • 98. ● Crawling & indexing problems ● Measuring freshness ● Prioritisation
  • 99.
  • 100. ● Crawling & indexing problems ● Measuring freshness ● Prioritisation ● Monitoring website changes (e.g. migrations)
  • 101. Jun ‘19 Apr ‘19 Aug‘19 Oct ‘19 200 301 302 Status codes in product pages
  • 102. Jun ‘19 Apr ‘19 Aug‘19 Oct ‘19 200 301 302 Status codes in product pages ELK
  • 103. ● Crawling & indexing problems ● Measuring freshness ● Prioritisation ● Monitoring website changes (e.g. migrations) ● Debugging
  • 104. Hi x I’m {x} from {y} and we’ve been asked to do some log analysis to understand better how Google is behaving on the website and I was hoping you could help with some questions about the log set-up (as well as with getting the logs!). What time period do we want? What we’d ideally like is 3-6 months of historical logs for the website. Our goal is to look at all the different pages search engines are crawling on our website, discover where they’re spending their time, the status code errors they’re finding etc. We can absolutely do analysis with a month or so (we've even done it with just a week or two), but it means we lose historical context and obviously we're more likely to lose things on a larger side. There are also some things that are really helpful for us to know when getting logs. Do the logs have any personal information in? We’re just concerned about the various search crawler bots like Google and Bing, we don’t need any logs from users, so any logs with emails, or telephone numbers etc. can be removed. Can we get logs from as close to the edge as possible? It's pretty likely you've got a couple different layers of your network that might log. Ideally we want those from as close to the edge as possible. This prevents a couple issues: ● If you've got caching going on, like a CDN or Varnish then if we get logs from after them, we won't see any of the requests they answer. ● If you've got a load balancer distributing to several servers sometimes the external IP gets lost (perhaps X-Forwarded-For isn't working), which we need to verify Googlebot or we accidentally only get logs from a couple servers. Are there any sub parts of your site which log to a different place? Have you got anything like an embedded Wordpress blog which logs to a different location? If so then we’ll need those logs as well. (Although of course if you're sending us CDN logs this won't matter.) How do you log hostname and protocol? It's very helpful for us to be able to see hostname & protocol. How do you distinguish those in the log files? Do you log HTTP & HTTPS to separate files? Do you log hostname at all? This is one of the problems that's often solved by getting logs closer to the edge, as while many servers won't give you those by default, load balancers and CDN's often will. Where would we like the logs? In an ideal world, they would be files in an S3 bucket and we can draw them down from there. If possible, we'd also ask that multiple files aren't zipped together for upload, because that makes processing harder. (No problem with compressed logs just, just zipping multiple log files into a single archive). Is there anything else we should know? Best, {x}
  • 105.
  • 106. Part 1: Templates Part 2: Logs Part 3: Crawling Big
  • 107. Sampling your crawl ● Limit your crawl percentage per template. i.e. ● 20% to product pages ● 30% to category pages
  • 108. Low memory crawler Runs locally on your machine and allows you to crawl with a very low memory footprint. Doesn’t render JS or process data however.
  • 109. Run SF in the cloud You can purchase a super high memory computer in the cloud, install SF on it and run it at maximum speed.
  • 111. Search console properties for a large brand.
  • 112.
  • 113.
  • 114.
  • 115.
  • 116. Part 1: Manually crawling Part 2: Automating assertions Part 3: Unit testing
  • 119.
  • 120. Part 1: Manually crawling change detection Part 2: Automating assertions Part 3: Unit testing
  • 122. <meta name="robots" content="noindex,nofollow"> <meta name="robots" content="noindex">
  • 124. Is it the value I want? Is it different?
  • 125. <meta name="robots" content="noindex,nofollow"> <meta name="robots" content="noindex">
  • 126. Element Equals Title Big Brown Shoe - £12.99 - Example.com Status Code 200 H1 Big Brown Shoe Canonical <link rel="canonical" href="https:/ /example.com/product/big-brown-shoe" /> CSS Selector: #review-counter Any number CSS Selector: #product-data { "@context": "https:/ /schema.org/", "@type": "Product", "name": "Big Brown Shoe", "description": "The biggest brownest show you can find.", "sku": "0446310786", "mpn": "925872", }
  • 129. Part 1: Manually crawling Part 2: Automating assertions Part 3: Unit testing
  • 130.
  • 131.
  • 132.
  • 134. Create code Test code Deployment
  • 135. Create code Test code Deployment All our hard work.
  • 136. Create code Test code Deployment All our hard work.
  • 137. Create code Test code Deployment
  • 138.
  • 140.
  • 142.
  • 143.
  • 144.
  • 145.
  • 146.
  • 147.