Leveraging Insights from
Server Logs
#5hoursoftechnicalSEO
All SEOs share a common goal:
To be crawled, indexed,
and ranked.
How can we answer all these questions?
● Which pages is Googlebot crawling?
● What user-agent is it using?
● Is Googlebot’s crawl mirroring our understanding of site
structure and assets?
● How’s the site’s tech health?
Logs are a record
of every request
a server receives.
Actions > Words.
[Diagram: Aggregate, Validate Googlebot, Translate (parse logs for meaningful search and analysis); input: Log Source 1]
Logs can come from multiple places in
your stack.
Web Server 1 Web Server 2 Web Server 3
CDN
DDOS Mitigation/Bot Manager
Load Balancer
You want
enough log data
to get an
accurate
picture.
Check your CDN for data on edge node
(cached) vs. server (uncached) hits
Internal Log Requests
Ask: Is there already a log management platform in place?
Be Clear: We do not want Personally Identifiable
Information (PII); request that it be removed
Be specific: Exported as .csv, please!
DIY Log Access
Apache (Linux Server)
NGINX (Linux Server)
IIS log files (Windows Server)
AWS Load Balancer (Load Balancer)
Google Cloud Load Balancer (Load Balancer)
AWS CloudFront (CDN)
Accessing Cloudflare log files (CDN, Enterprise account required)
Incapsula (CDN/DDoS Mitigation)
Akamai logs (CDN/DDoS Mitigation)
Standard WordPress site?
Log into your hosting provider and look for Raw Access
Many tools, many languages
Paid: DeepCrawl, Botify, Logz.io, Sumo Logic, Splunk
Free(mium): Semrush, Screaming Frog Log Analyzer, BigQuery
Code savvy: Python, JP
Masochistic: Excel, Command Line
Leverage the tools and
functionalities already in place.
Manually validate Googlebot IPs
Run a reverse DNS lookup on the accessing IP
address from your logs, using the host command.
jammer@Hypatia ~ % host 66.249.66.1
1.66.249.66.in-addr.arpa domain name pointer crawl-66-249-66-1.googlebot.com
Bulk validate Googlebot IPs with Scripts
Source: Shell Script to Detect if the IP Address Is Googlebot, Dzone
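As a rough Python counterpart to the host command above, the sketch below reverse-resolves an IP, checks the hostname suffix, and then forward-resolves the hostname to confirm it maps back to the same IP. The forward check is an extra step not shown on the slide. The function name, the injectable lookup parameters, and the suffix list are assumptions made for illustration; check the suffixes against Google's current guidance.

```python
import socket

# Assumed hostname suffixes for Googlebot; confirm against Google's current docs.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")


def is_verified_googlebot(ip, reverse_lookup=None, forward_lookup=None):
    """Return True if `ip` reverse-resolves to a Google hostname
    that forward-resolves back to the same `ip`.

    The two lookup parameters are hypothetical hooks so the logic can be
    exercised without network access; by default they use the socket module.
    """
    reverse_lookup = reverse_lookup or (lambda addr: socket.gethostbyaddr(addr)[0])
    forward_lookup = forward_lookup or (lambda host: socket.gethostbyname_ex(host)[2])

    try:
        hostname = reverse_lookup(ip).rstrip(".")
    except (OSError, KeyError):
        return False  # no PTR record, or lookup failed

    if not hostname.endswith(GOOGLE_SUFFIXES):
        return False  # user-agent may say Googlebot, but the host does not

    try:
        return ip in forward_lookup(hostname)
    except (OSError, KeyError):
        return False
```

To bulk validate, call `is_verified_googlebot(ip)` once per distinct IP in the logs and cache the result, since DNS lookups are slow compared with reading log lines.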
Validate Googlebot IPs with Tool
216.150.168.131 [07/Mar/2018:16:11:58 -0800]
66.249.66.1 GET
/twiki/bin/view/TWiki/WikiSyntax HTTP/1.1
www.arrow.com 200 7352 616 -
Mozilla/5.0+(Linux;+Android+6.0.1;+Nexus+5X+Build/MMB29P)+AppleWebKit/537.36+(KHTML,+like+Gecko)+Chrome/41.0.2272.96+Mobile+Safari/537.36+(compatible;+Googlebot/2.1;++http://www.google.com/bot.html)
https://www.arrow.com/en/
indiegogo
The values captured in logs are unique to
each site.
Make a new engineering friend to learn
exactly what they mean.
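As an illustration of translating a raw line into named fields, here is a minimal sketch built around the sample line above. Because log layouts differ per site, every field name here (`ip_1`, `ip_2`, `field_a`, and so on) is a hypothetical label, not the real meaning of the column; confirm those with whoever owns the logging configuration.

```python
import re

# Hypothetical field labels for the layout of the sample line only.
LOG_PATTERN = re.compile(
    r"(?P<ip_1>\S+) \[(?P<timestamp>[^\]]+)\] (?P<ip_2>\S+) "
    r"(?P<method>[A-Z]+) (?P<path>\S+) (?P<protocol>\S+) "
    r"(?P<host>\S+) (?P<status>\d{3}) (?P<field_a>\S+) (?P<field_b>\S+) "
    r"(?P<field_c>\S+) (?P<user_agent>\S+) (?P<referrer>\S+)"
)


def parse_line(line):
    """Return a dict of named fields, or None if the line does not fit."""
    match = LOG_PATTERN.match(line.strip())
    return match.groupdict() if match else None


SAMPLE = (
    "216.150.168.131 [07/Mar/2018:16:11:58 -0800] 66.249.66.1 GET "
    "/twiki/bin/view/TWiki/WikiSyntax HTTP/1.1 www.arrow.com 200 7352 616 - "
    "Mozilla/5.0+(Linux;+Android+6.0.1;+Nexus+5X+Build/MMB29P)"
    "+AppleWebKit/537.36+(KHTML,+like+Gecko)+Chrome/41.0.2272.96"
    "+Mobile+Safari/537.36+(compatible;+Googlebot/2.1;"
    "++http://www.google.com/bot.html) https://www.arrow.com/en/"
)

record = parse_line(SAMPLE)
print(record["host"], record["status"], record["path"])
```

Lines that do not fit the pattern return `None`, which makes it easy to count how much of the log the pattern actually covers before trusting any totals.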
Unlock logs ≤ 6 lines
1. Data Source
2. Condition
3. Segments
4. Grouping
5. Sort
6. Limit*
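The six lines above map closely onto a few lines of Python. This is a minimal sketch, assuming the logs have already been parsed into dictionaries; the `top_hits` helper and the `verified_googlebot` and `user_agent` keys are hypothetical names chosen for the example.

```python
from collections import Counter


def top_hits(records, condition, group_key, limit=10):
    """Count hits per group for the records that satisfy `condition`."""
    matching = (r for r in records if condition(r))      # 2. Condition
    counts = Counter(group_key(r) for r in matching)     # 3-4. Segments + grouping
    return counts.most_common(limit)                     # 5-6. Sort (desc) + limit


# 1. Data source: a tiny stand-in for aggregated, parsed logs.
records = [
    {"verified_googlebot": True, "user_agent": "Googlebot Smartphone"},
    {"verified_googlebot": True, "user_agent": "Googlebot Smartphone"},
    {"verified_googlebot": True, "user_agent": "Googlebot Desktop"},
    {"verified_googlebot": False, "user_agent": "Googlebot Smartphone"},
]

result = top_hits(
    records,
    condition=lambda r: r["verified_googlebot"],
    group_key=lambda r: r["user_agent"],
)
print(result)
```

Swapping the `group_key` function (user-agent, hostname, status code) is enough to cover most of the basic queries that follow.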
Use Cases + Queries
Use Case (Basic Query)
Legacy code being brought kicking
and screaming into mobile-only index
Query: Are we migrating to mobile-only index?
1. Data Source: Your aggregated logs
2. Condition: where the requester
is (verified) Googlebot
3. Group by: User-agent
4. Count: Number of hits (desc)
5. Limit: Start with ~10 results.
(Query with grouping)
Use case: Google chose
a different canonical
Query: Are non-canonical hostnames being
crawled?
1. Data Source: Aggregated logs
2. Condition: where Googlebot
3. Group by: Hostname
4. Count: Number of hits (desc)
5. Limit: 10
(Query with creative segments)
Use case: Launching content in a
new language.
Segmentation = pattern matching/creative
thinking
Happy path: Consistent URL structure
Plan b: HTTP Entity header Content-Language
Query: Which languages are being crawled?
1. Data Source: Your aggregated logs
2. Condition: where Googlebot
3. Group by: Language
4. Count: Number of hits (desc)
5. Limit: Start with ~10 results.
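The language segment can be sketched as a pattern match on the URL (the happy path), with a fallback to a logged Content-Language value (plan B). This assumes the language code is the first path segment and that records carry hypothetical `path`, `content_language`, and `verified_googlebot` keys; many servers do not log response headers by default.

```python
import re
from collections import Counter

# Assumes URLs start with a language code such as /en/ or /fr-ca/.
LANG_PREFIX = re.compile(r"^/([a-z]{2}(?:-[a-z]{2})?)(?:/|$)", re.IGNORECASE)


def language_from_path(path):
    """Happy path: read the language from a consistent URL structure."""
    match = LANG_PREFIX.match(path)
    return match.group(1).lower() if match else None


def language_segment(record):
    """URL pattern first, then the Content-Language value, else 'unknown'."""
    return (
        language_from_path(record["path"])
        or (record.get("content_language") or "").lower()
        or "unknown"
    )


def languages_crawled(records, limit=10):
    """Hits per language for verified Googlebot requests, descending."""
    counts = Counter(
        language_segment(r) for r in records if r["verified_googlebot"]
    )
    return counts.most_common(limit)
```

A two-letter first folder that is not a language (for example a country or product code) would be miscounted by this pattern, so it is worth checking the top results against the site's real URL scheme.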
(Query with parsed segments)
Use case: Low index coverage
Build on the fly segments by parsing URL structure
/en/products/blam-o/log-12345
/en = Language
/products = App
/blam-o = Manufacturer
/log-12345 = SKU
Query: Which subfolders are being crawled?
1. Data Source: Your aggregated logs
2. Condition: where Googlebot
3. Parse: subfolder
4. Aggregate: by Subfolder
5. Count: Number of hits (desc)
6. Limit: Start with ~10 results.
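Parsing segments on the fly can be as simple as splitting the path and naming each position. The sketch below follows the example URL /en/products/blam-o/log-12345; the segment names and the `path` and `verified_googlebot` record keys are assumptions for illustration.

```python
from collections import Counter
from urllib.parse import urlsplit

# Position names taken from the example URL; adjust to the site's own scheme.
SEGMENT_NAMES = ("language", "app", "manufacturer", "sku")


def parse_segments(url_or_path):
    """Map each folder position in the path to a named segment."""
    parts = [p for p in urlsplit(url_or_path).path.split("/") if p]
    return dict(zip(SEGMENT_NAMES, parts))


def hits_by_segment(records, segment, limit=10):
    """Count verified Googlebot hits grouped by one parsed segment."""
    counts = Counter(
        parse_segments(r["path"]).get(segment, "(none)")
        for r in records
        if r["verified_googlebot"]
    )
    return counts.most_common(limit)


print(parse_segments("/en/products/blam-o/log-12345"))
```

URLs with fewer folders than names simply produce a shorter dictionary, and those hits are grouped under "(none)" for the missing segment.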
(Parsed Segments AND Conditions)
Use case: Sudden crawl flux
Even search engines need to CYA
Googlebot is designed to be a good citizen of the web...
For Googlebot a speedy site is a sign of healthy servers...
If the site slows down or responds with server errors, the
[crawl rate] limit goes down and Googlebot crawls less.
Official Google Webmaster Central Blog: What Crawl Budget Means for Googlebot
Starting query: What HTTP status codes are we returning?
1. Data Source: Your aggregated logs
2. Condition: where Googlebot
3. Aggregate: by HTTP Status
4. Count: Number of hits (desc)
5. Limit: Start with ~10 results.
Iterative query: What resources are returning 5XX?
1. Data Source: Your aggregated logs
2. Condition: where Googlebot
AND
3. Condition: where 5XX
4. Parse: subfolder
5. Count: Number of hits (desc)
6. Limit: Start with ~10 results.
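The iterative query combines two conditions (verified Googlebot AND a 5XX status) before grouping by subfolder. A minimal sketch, assuming parsed records with hypothetical `path`, `status`, and `verified_googlebot` keys:

```python
from collections import Counter


def first_subfolder(path):
    """Return the first folder of a path, e.g. '/en/products/x' -> '/en'."""
    parts = [p for p in path.split("?")[0].split("/") if p]
    return "/" + parts[0] if parts else "/"


def server_errors_by_subfolder(records, limit=10):
    """Verified Googlebot hits that returned 5XX, grouped by subfolder."""
    counts = Counter(
        first_subfolder(r["path"])
        for r in records
        if r["verified_googlebot"] and 500 <= int(r["status"]) <= 599
    )
    return counts.most_common(limit)
```

Running the status-code overview first and this query second mirrors the slides: the overview shows whether 5XX responses are a problem at all, and the breakdown shows where to look.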
Advanced Use Cases +
Blended Data
Query: Non-indexable pages with bot hits
Query: Indexable pages without bot hits
Query: Bot hits by indexability
Query: In sitemaps with no bot hits
Query: Empty dynamically generated pages
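Several of these blended queries reduce to set operations once log data is joined with a crawl export and the sitemap. The sketch below assumes three hypothetical inputs: the set of URLs with verified bot hits, a crawl export mapping URL to indexability, and the set of sitemap URLs. It does not cover the empty dynamically generated pages query, which needs page-content data.

```python
from collections import Counter


def blend(bot_hit_urls, crawl_indexability, sitemap_urls):
    """Combine log hits with crawl and sitemap data.

    bot_hit_urls:       set of URLs requested by verified Googlebot
    crawl_indexability: dict of URL -> True (indexable) / False (non-indexable)
    sitemap_urls:       set of URLs listed in XML sitemaps
    """
    return {
        "non_indexable_with_hits": {
            u for u in bot_hit_urls if crawl_indexability.get(u) is False
        },
        "indexable_without_hits": {
            u for u, ok in crawl_indexability.items()
            if ok and u not in bot_hit_urls
        },
        "in_sitemap_no_hits": sitemap_urls - bot_hit_urls,
        "hits_by_indexability": Counter(
            crawl_indexability.get(u, "not in crawl") for u in bot_hit_urls
        ),
    }
```

URLs must be normalized the same way in all three sources (protocol, hostname, trailing slash) or the set comparisons will report false gaps.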
| ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄|
IT'S CHAOS.
BE KIND.
|________|
(__/) ||
(•ㅅ•) ||
/   づ
I'm a mentor @ United Search
Want to take the stage as an SEO speaker?
Want to stay in the audience but see more diversity in SEO events?
United Search is an SEO speaker accelerator designed to specifically aid
underrepresented groups, at no cost to students.
● Application - unitedsearch.org/apply
● Mentors - unitedsearch.org/mentors
● Mission - unitedsearch.org/about-us
For more info check out unitedsearch.org or @search_united on Twitter.

Presentation: How to Leverage Insights from Your Site’s Server Logs (5 Hours of Technical SEO), by Jamie Alberico

Bamboo Charcoal Toothpaste By Phyto Atomy  For More Details Message On WhatsA...Bamboo Charcoal Toothpaste By Phyto Atomy  For More Details Message On WhatsA...
Bamboo Charcoal Toothpaste By Phyto Atomy For More Details Message On WhatsA...shrutimishraqt
 
top marketing posters - Fresh Spar Technologies - Manojkumar C
top marketing posters - Fresh Spar Technologies - Manojkumar Ctop marketing posters - Fresh Spar Technologies - Manojkumar C
top marketing posters - Fresh Spar Technologies - Manojkumar CManojkumar C
 
Navigating Global Markets and Strategies for Success
Navigating Global Markets and Strategies for SuccessNavigating Global Markets and Strategies for Success
Navigating Global Markets and Strategies for SuccessElizabeth Moore
 
Agencia Marketing Branding Examen Fundamentals Digital Marketing Google Abril...
Agencia Marketing Branding Examen Fundamentals Digital Marketing Google Abril...Agencia Marketing Branding Examen Fundamentals Digital Marketing Google Abril...
Agencia Marketing Branding Examen Fundamentals Digital Marketing Google Abril...Marketing BRANDING
 
SEO and Digital PR - How to Connect Your Teams to Maximise Success
SEO and Digital PR - How to Connect Your Teams to Maximise SuccessSEO and Digital PR - How to Connect Your Teams to Maximise Success
SEO and Digital PR - How to Connect Your Teams to Maximise SuccessLiv Day
 
Content Marketing: How To Find The True Value Of Your Marketing Funnel
Content Marketing: How To Find The True Value Of Your Marketing FunnelContent Marketing: How To Find The True Value Of Your Marketing Funnel
Content Marketing: How To Find The True Value Of Your Marketing FunnelSearch Engine Journal
 
Digital Marketing Courses In Pune- school Of Internet Marketing
Digital Marketing Courses In Pune- school Of Internet MarketingDigital Marketing Courses In Pune- school Of Internet Marketing
Digital Marketing Courses In Pune- school Of Internet MarketingShauryaBadaya
 
Digital Marketing in 5G Era - Digital Transformation in 5G Age
Digital Marketing in 5G Era - Digital Transformation in 5G AgeDigital Marketing in 5G Era - Digital Transformation in 5G Age
Digital Marketing in 5G Era - Digital Transformation in 5G AgeDigiKarishma
 
The Evolution of Internet : How consumers use technology and its impact on th...
The Evolution of Internet : How consumers use technology and its impact on th...The Evolution of Internet : How consumers use technology and its impact on th...
The Evolution of Internet : How consumers use technology and its impact on th...sowmyrao14
 
Best digital marketing e-book form bignners
Best digital marketing e-book form bignnersBest digital marketing e-book form bignners
Best digital marketing e-book form bignnersmuntasibkhan58
 
Agencia Marketing Branding Measurement Certification Google Ads Abril 2024
Agencia Marketing Branding Measurement Certification Google Ads Abril 2024Agencia Marketing Branding Measurement Certification Google Ads Abril 2024
Agencia Marketing Branding Measurement Certification Google Ads Abril 2024Marketing BRANDING
 
AMAZON Copywriting Portfolio by Cielo Evangelista
AMAZON Copywriting Portfolio by Cielo EvangelistaAMAZON Copywriting Portfolio by Cielo Evangelista
AMAZON Copywriting Portfolio by Cielo Evangelistacrevangelista
 
How To Become a Master In Search Engine Optimization (SEO)
How To Become a Master In Search Engine Optimization (SEO)How To Become a Master In Search Engine Optimization (SEO)
How To Become a Master In Search Engine Optimization (SEO)Blessings Ngalande
 
Francesco d’Angela, Service Designer di @HintoGroup- “Oltre la Frontiera Crea...
Francesco d’Angela, Service Designer di @HintoGroup- “Oltre la Frontiera Crea...Francesco d’Angela, Service Designer di @HintoGroup- “Oltre la Frontiera Crea...
Francesco d’Angela, Service Designer di @HintoGroup- “Oltre la Frontiera Crea...Associazione Digital Days
 
TAM Sports IPL 17 Advertising Report- M01 - M23
TAM Sports IPL 17 Advertising Report- M01 - M23TAM Sports IPL 17 Advertising Report- M01 - M23
TAM Sports IPL 17 Advertising Report- M01 - M23Social Samosa
 
Creating a Successful Digital Marketing Campaign.pdf
Creating a Successful Digital Marketing Campaign.pdfCreating a Successful Digital Marketing Campaign.pdf
Creating a Successful Digital Marketing Campaign.pdfgopzzzin
 
Gen Z and Millennial Debit Card Use Survey.pdf
Gen Z and Millennial Debit Card Use Survey.pdfGen Z and Millennial Debit Card Use Survey.pdf
Gen Z and Millennial Debit Card Use Survey.pdfMedia Logic
 
social media optimization complete indroduction
social media optimization complete indroductionsocial media optimization complete indroduction
social media optimization complete indroductioninfoshraddha747
 

Último (20)

Richard van der Velde, Technical Support Lead for Cookiebot @CMP – “Artificia...
Richard van der Velde, Technical Support Lead for Cookiebot @CMP – “Artificia...Richard van der Velde, Technical Support Lead for Cookiebot @CMP – “Artificia...
Richard van der Velde, Technical Support Lead for Cookiebot @CMP – “Artificia...
 
History of JWT by The Knowledge Center.pdf
History of JWT by The Knowledge Center.pdfHistory of JWT by The Knowledge Center.pdf
History of JWT by The Knowledge Center.pdf
 
Bamboo Charcoal Toothpaste By Phyto Atomy For More Details Message On WhatsA...
Bamboo Charcoal Toothpaste By Phyto Atomy  For More Details Message On WhatsA...Bamboo Charcoal Toothpaste By Phyto Atomy  For More Details Message On WhatsA...
Bamboo Charcoal Toothpaste By Phyto Atomy For More Details Message On WhatsA...
 
top marketing posters - Fresh Spar Technologies - Manojkumar C
top marketing posters - Fresh Spar Technologies - Manojkumar Ctop marketing posters - Fresh Spar Technologies - Manojkumar C
top marketing posters - Fresh Spar Technologies - Manojkumar C
 
Navigating Global Markets and Strategies for Success
Navigating Global Markets and Strategies for SuccessNavigating Global Markets and Strategies for Success
Navigating Global Markets and Strategies for Success
 
Agencia Marketing Branding Examen Fundamentals Digital Marketing Google Abril...
Agencia Marketing Branding Examen Fundamentals Digital Marketing Google Abril...Agencia Marketing Branding Examen Fundamentals Digital Marketing Google Abril...
Agencia Marketing Branding Examen Fundamentals Digital Marketing Google Abril...
 
SEO and Digital PR - How to Connect Your Teams to Maximise Success
SEO and Digital PR - How to Connect Your Teams to Maximise SuccessSEO and Digital PR - How to Connect Your Teams to Maximise Success
SEO and Digital PR - How to Connect Your Teams to Maximise Success
 
Content Marketing: How To Find The True Value Of Your Marketing Funnel
Content Marketing: How To Find The True Value Of Your Marketing FunnelContent Marketing: How To Find The True Value Of Your Marketing Funnel
Content Marketing: How To Find The True Value Of Your Marketing Funnel
 
Digital Marketing Courses In Pune- school Of Internet Marketing
Digital Marketing Courses In Pune- school Of Internet MarketingDigital Marketing Courses In Pune- school Of Internet Marketing
Digital Marketing Courses In Pune- school Of Internet Marketing
 
Digital Marketing in 5G Era - Digital Transformation in 5G Age
Digital Marketing in 5G Era - Digital Transformation in 5G AgeDigital Marketing in 5G Era - Digital Transformation in 5G Age
Digital Marketing in 5G Era - Digital Transformation in 5G Age
 
The Evolution of Internet : How consumers use technology and its impact on th...
The Evolution of Internet : How consumers use technology and its impact on th...The Evolution of Internet : How consumers use technology and its impact on th...
The Evolution of Internet : How consumers use technology and its impact on th...
 
Best digital marketing e-book form bignners
Best digital marketing e-book form bignnersBest digital marketing e-book form bignners
Best digital marketing e-book form bignners
 
Agencia Marketing Branding Measurement Certification Google Ads Abril 2024
Agencia Marketing Branding Measurement Certification Google Ads Abril 2024Agencia Marketing Branding Measurement Certification Google Ads Abril 2024
Agencia Marketing Branding Measurement Certification Google Ads Abril 2024
 
AMAZON Copywriting Portfolio by Cielo Evangelista
AMAZON Copywriting Portfolio by Cielo EvangelistaAMAZON Copywriting Portfolio by Cielo Evangelista
AMAZON Copywriting Portfolio by Cielo Evangelista
 
How To Become a Master In Search Engine Optimization (SEO)
How To Become a Master In Search Engine Optimization (SEO)How To Become a Master In Search Engine Optimization (SEO)
How To Become a Master In Search Engine Optimization (SEO)
 
Francesco d’Angela, Service Designer di @HintoGroup- “Oltre la Frontiera Crea...
Francesco d’Angela, Service Designer di @HintoGroup- “Oltre la Frontiera Crea...Francesco d’Angela, Service Designer di @HintoGroup- “Oltre la Frontiera Crea...
Francesco d’Angela, Service Designer di @HintoGroup- “Oltre la Frontiera Crea...
 
TAM Sports IPL 17 Advertising Report- M01 - M23
TAM Sports IPL 17 Advertising Report- M01 - M23TAM Sports IPL 17 Advertising Report- M01 - M23
TAM Sports IPL 17 Advertising Report- M01 - M23
 
Creating a Successful Digital Marketing Campaign.pdf
Creating a Successful Digital Marketing Campaign.pdfCreating a Successful Digital Marketing Campaign.pdf
Creating a Successful Digital Marketing Campaign.pdf
 
Gen Z and Millennial Debit Card Use Survey.pdf
Gen Z and Millennial Debit Card Use Survey.pdfGen Z and Millennial Debit Card Use Survey.pdf
Gen Z and Millennial Debit Card Use Survey.pdf
 
social media optimization complete indroduction
social media optimization complete indroductionsocial media optimization complete indroduction
social media optimization complete indroduction
 

Jamie Alberico — How to Leverage Insights from Your Site’s Server Logs | 5 Hours of Technical SEO

  • 1. Leveraging Insights from Server Logs #5hoursoftechnicalSEO
  • 2. All SEOs share a common goal: to be crawled, indexed, and ranked.
  • 3. How can we answer all these questions? ● Which pages is Googlebot crawling? ● What user-agent is it using? ● Is Googlebot’s crawl mirroring our understanding of site structure and assets? ● How’s the site’s tech health?
  • 4. Logs are a record of every request a server receives.
  • 6. Aggregate Validate Googlebot Translate Parse logs for meaningful search and analysis Translate Log Source 1
  • 7. Logs can come from multiple places in your stack. Web Server 1 Web Server 2 Web Server 3 CDN DDOS Mitigation/Bot Manager Load Balancer
  • 8. You want enough log data to get an accurate picture.
  • 9. Check your CDN for data on edge node (cached) vs. server (uncached) hits
  • 10. Internal Log Requests Ask: Is there already a log management platform in place? Be clear: We do not want Personally Identifiable Information (PII); request that it be removed. Be specific: Exported as .csv, please!
  • 11. DIY Log Access: Apache (Linux Server), NGINX (Linux Server), IIS log files (Windows Server), AWS Load Balancer (Load Balancer), Google Cloud Load Balancer (Load Balancer), AWS CloudFront (CDN), Cloudflare log files (CDN, Enterprise account required), Incapsula (CDN/DDoS Mitigation), Akamai logs (CDN/DDoS Mitigation)
  • 12. Standard WordPress site? Log into your hosting provider and look for Raw Access
  • 14. Aggregate Validate Googlebot Translate Parse logs for meaningful search and analysis Translate Log Source 1
  • 15. Many tools, many languages. Paid: DeepCrawl, Botify, Logz.io, Sumo Logic, Splunk. Free(mium): SEMRush, Screaming Frog Log Analyzer, BigQuery. Code savvy: Python, JP. Masochistic: Excel, Command Line.
  • 16. Leverage the tools and functionalities already in place.
  • 17. Aggregate Validate Googlebot Translate Parse logs for meaningful search and analysis Translate Log Source 1
  • 18. Manually validate Googlebot IPs Run a reverse DNS lookup on the accessing IP address from your logs, using the host command. jammer@Hypatia ~ % host 66.249.66.1 1.66.249.66.in-addr.arpa domain name pointer crawl-66-249-66-1.googlebot.com
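The manual check on slide 18 can be scripted. The following is a minimal sketch, not the speaker's script: the function name `is_verified_googlebot` and the injectable lookup parameters are hypothetical, and the forward-confirmation step (resolving the hostname back to the IP) is an addition that goes beyond the reverse lookup shown on the slide. It handles IPv4 only.

```python
import socket

# Hostname suffixes accepted as Google crawlers (assumption for this sketch).
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")


def is_verified_googlebot(ip, reverse_lookup=None, forward_lookup=None):
    """Return True if `ip` reverse-resolves to a Google hostname that
    forward-resolves back to the same IP.

    The two lookup parameters are hypothetical hooks so the logic can be
    exercised without network access; by default they use the socket module.
    """
    if reverse_lookup is None:
        reverse_lookup = lambda addr: socket.gethostbyaddr(addr)[0]
    if forward_lookup is None:
        forward_lookup = lambda host: socket.gethostbyname_ex(host)[2]

    try:
        hostname = reverse_lookup(ip)
    except OSError:  # covers socket.herror and socket.gaierror
        return False

    # Step 1: the PTR record must end in a Google-owned domain.
    if not hostname.rstrip(".").lower().endswith(GOOGLE_SUFFIXES):
        return False

    # Step 2: the hostname must resolve back to the original IP.
    try:
        return ip in forward_lookup(hostname)
    except OSError:
        return False
```

Calling `is_verified_googlebot("66.249.66.1")` with the default lookups performs live DNS queries, so results depend on the network it runs on.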
  • 19. Bulk validate Googlebot IPs with Scripts Source: Shell Script to Detect if the IP Address Is Googlebot, Dzone
  • 22. 216.150.168.131 [07/Mar/2018:16:11:58 -0800] 66.249.66.1 GET /twiki/bin/view/TWiki/WikiSyntax HTTP/1.1 www.arrow.com 200 7352 616 - Mozilla/5.0+(Linux;+Android+6.0.1;+Nexus+5X+Build/MMB29P)+AppleWebKit/537.36+(KHTML,+like+Gecko)+Chrome/41.0.2272.96+Mobile+Safari/537.36+(compatible;+Googlebot/2.1;++http://www.google.com/bot.html) https://www.arrow.com/en/ indiegogo The values captured in logs are unique to each site. Make a new engineering friend to learn exactly what they mean.
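As a rough illustration of the "translate" step, here is a sketch that parses the sample line from slide 22 (with the line-wrap spaces removed) into named fields. The slide does not label the columns, so every field name below is a guess made for illustration. In particular, treating the second IP as the requester is an assumption based only on it being a Googlebot address.

```python
import re
from datetime import datetime

# Field names are guesses for illustration; confirm the real meanings
# with whoever owns the log configuration.
LOG_PATTERN = re.compile(
    r"(?P<first_ip>\S+) "
    r"\[(?P<timestamp>[^\]]+)\] "
    r"(?P<client_ip>\S+) "
    r"(?P<method>[A-Z]+) "
    r"(?P<path>\S+) "
    r"(?P<protocol>\S+) "
    r"(?P<host>\S+) "
    r"(?P<status>\d{3}) "
    r"(?P<number_1>\d+) "
    r"(?P<number_2>\d+) "
    r"(?P<unlabeled>\S+) "
    r"(?P<user_agent>\S+) "
    r"(?P<referer>\S+) "
    r"(?P<trailing>\S+)"
)

SAMPLE = (
    "216.150.168.131 [07/Mar/2018:16:11:58 -0800] 66.249.66.1 "
    "GET /twiki/bin/view/TWiki/WikiSyntax HTTP/1.1 www.arrow.com 200 7352 616 - "
    "Mozilla/5.0+(Linux;+Android+6.0.1;+Nexus+5X+Build/MMB29P)"
    "+AppleWebKit/537.36+(KHTML,+like+Gecko)+Chrome/41.0.2272.96"
    "+Mobile+Safari/537.36+(compatible;+Googlebot/2.1;"
    "++http://www.google.com/bot.html) "
    "https://www.arrow.com/en/ indiegogo"
)


def parse_line(line):
    """Return a dict of fields, or None if the line does not fit the pattern."""
    match = LOG_PATTERN.fullmatch(line.strip())
    if match is None:
        return None
    record = match.groupdict()
    record["status"] = int(record["status"])
    record["timestamp"] = datetime.strptime(
        record["timestamp"], "%d/%b/%Y:%H:%M:%S %z"
    )
    # This format writes spaces in the user-agent as "+"; note that a
    # literal "+" in the original string is also turned into a space here.
    record["user_agent"] = record["user_agent"].replace("+", " ")
    return record


record = parse_line(SAMPLE)
```

A different server or CDN will emit different columns in a different order, so the pattern has to be rebuilt per log source.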
  • 23. Unlock logs ≤ 6 lines 1. Data Source 2. Condition 3. Segments 4. Grouping 5. Sort 6. Limit*
  • 24. Use Cases + Queries
  • 25. Use Case (Basic Query) Legacy code being brought kicking and screaming into mobile-only index
  • 26. Query: Are we migrating to mobile-only index? 1. Data Source: Your aggregated logs 2. Condition: where the requester is (verified) Googlebot 3. Group by: User-agent 4. Count: Number of hits (desc) 5. Limit: Start with ~10 results.
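The query on slide 26 maps onto a few lines of Python. This is a sketch under stated assumptions: the logs are already parsed into dictionaries, and the keys `user_agent` and `verified_googlebot` are hypothetical names, as is the helper `top_hits`.

```python
from collections import Counter


def top_hits(records, group_by, condition=lambda r: True, limit=10):
    """Data source -> condition -> group -> count (desc) -> limit."""
    counts = Counter(r[group_by] for r in records if condition(r))
    return counts.most_common(limit)


# Hypothetical, hand-made records standing in for aggregated logs.
records = [
    {"user_agent": "Googlebot smartphone", "verified_googlebot": True},
    {"user_agent": "Googlebot smartphone", "verified_googlebot": True},
    {"user_agent": "Googlebot desktop", "verified_googlebot": True},
    {"user_agent": "Googlebot smartphone", "verified_googlebot": False},
]

by_user_agent = top_hits(
    records,
    group_by="user_agent",
    condition=lambda r: r["verified_googlebot"],
)
```

The same helper answers the hostname question by passing a different `group_by` key, provided the parsed records carry that field.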
  • 27. (Query with grouping) Use case: Google chose a different canonical
  • 28. Query: Are non-canonical hostnames being crawled? 1. Data Source: Aggregated logs 2. Condition: where Googlebot 3. Group by: Hostname 4. Count: Number of hits (desc) 5. Limit: 10
  • 29. (Query with creative segments) Use case: Launching content in a new language.
  • 30. Segmentation = pattern matching/creative thinking Happy path: Consistent URL structure Plan b: HTTP Entity header Content-Language
  • 31. Query: Which languages are being crawled? 1. Data Source: Your aggregated logs 2. Condition: where Googlebot 3. Group by: Language 4. Count: Number of hits (desc) 5. Limit: Start with ~10 results.
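One way to build the language segment is sketched below. It assumes the "happy path" means a language code as the first URL folder (for example `/en/...`), and that the Content-Language response header, if logged at all, is stored under the hypothetical key `content_language`. The pattern would also match any other two-letter first folder, so it needs adjusting per site.

```python
import re
from collections import Counter

# Matches a leading folder such as /en/ or /fr-ca/ (assumed URL convention).
LANG_FOLDER = re.compile(r"^/([a-z]{2}(?:-[a-z]{2})?)(?:/|$)")


def language_of(record):
    """Happy path: language folder in the URL. Plan B: logged header."""
    match = LANG_FOLDER.match(record["path"])
    if match:
        return match.group(1)
    return record.get("content_language") or "unknown"


def hits_by_language(records, limit=10):
    counts = Counter(language_of(r) for r in records if r["googlebot"])
    return counts.most_common(limit)
```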
  • 32. (Query with parsed segments) Use case: Low index coverage
  • 33. Build on-the-fly segments by parsing URL structure: in /en/products/blam-o/log-12345, "en" is the Language, "products" is the App, "blam-o" is the Manufacturer, and "log-12345" is the SKU.
  • 34. Query: Which subfolders are being crawled? 1. Data Source: Your aggregated logs 2. Condition: where Googlebot 3. Parse: subfolder 4. Aggregate: by Subfolder 5. Count: Number of hits (desc) 6. Limit: Start with ~10 results.
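The parse step in this query could look like the following sketch. The helper name `subfolder`, the `depth` parameter, and the record keys `path` and `googlebot` are all hypothetical.

```python
from collections import Counter
from urllib.parse import urlsplit


def subfolder(path, depth=2):
    """Return the first `depth` folders of a URL path, ignoring the query string."""
    parts = [p for p in urlsplit(path).path.split("/") if p]
    if not parts:
        return "/"
    return "/" + "/".join(parts[:depth]) + "/"


def hits_by_subfolder(records, depth=2, limit=10):
    counts = Counter(
        subfolder(r["path"], depth) for r in records if r["googlebot"]
    )
    return counts.most_common(limit)
```

Raising or lowering `depth` changes how fine-grained the segments are; a depth of 1 groups by language folder on a site structured like the slide 33 example.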
  • 35. (Parsed Segments AND Conditions) Use case: Sudden crawl flux
  • 36. Even search engines need to CYA Googlebot is designed to be a good citizen of the web... For Googlebot a speedy site is a sign of healthy servers... If the site slows down or responds with server errors, the [crawl rate] limit goes down and Googlebot crawls less. Official Google Webmaster Central Blog: What Crawl Budget Means for Googlebot
  • 37. Starting query: What HTTP status codes are we returning? 1. Data Source: Your aggregated logs 2. Condition: where Googlebot 3. Aggregate: by HTTP Status 4. Count: Number of hits (desc) 5. Limit: Start with ~10 results.
  • 38. Iterative query: What resources are returning 5XX? 1. Data Source: Your aggregated logs 2. Condition: where Googlebot AND 3. Condition: where 5XX 4. Parse: subfolder 5. Count: Number of hits (desc) 6. Limit: Start with ~10 results.
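The starting query and its 5XX follow-up can be sketched as two small functions. The record keys `status`, `path`, and `googlebot` are assumed names for already-parsed log fields, and `first_folder` is a hypothetical helper.

```python
from collections import Counter


def first_folder(path):
    """Return the top-level folder of a path, e.g. /en/ for /en/products/x."""
    parts = [p for p in path.split("?")[0].split("/") if p]
    return "/" + parts[0] + "/" if parts else "/"


def status_counts(records, limit=10):
    """Starting query: which HTTP status codes does Googlebot receive?"""
    counts = Counter(r["status"] for r in records if r["googlebot"])
    return counts.most_common(limit)


def server_error_hotspots(records, limit=10):
    """Iterative query: which subfolders return 5XX to Googlebot?"""
    counts = Counter(
        first_folder(r["path"])
        for r in records
        if r["googlebot"] and 500 <= r["status"] <= 599
    )
    return counts.most_common(limit)
```

Running the first function shows whether 5XX responses are a meaningful share of bot hits; the second narrows down where they come from.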
  • 39. Advanced Use Cases + Blended Data
  • 40. Query: Non-indexable pages with bot hits
  • 41. Query: Indexable pages without bot hits
  • 42. Query: Bot hits by indexability
  • 43. Query: In sitemaps with no bot hits
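The slide gives only the query title, so the following is a guess at the simplest form of that blend: a set difference between URLs listed in sitemaps and URLs with at least one bot hit. Both inputs are assumed to be already normalized to the same URL form, and the function name is hypothetical.

```python
def sitemap_urls_without_bot_hits(sitemap_urls, bot_hit_urls):
    """Return sitemap URLs that never appear among bot-requested URLs."""
    return sorted(set(sitemap_urls) - set(bot_hit_urls))
```

Swapping the arguments gives the reverse view: URLs that bots request but that no sitemap lists.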
  • 44. Query: Empty dynamically generated pages
  • 46. I'm a mentor @ United Search Want to take stage as an SEO speaker? Want to stay in the audience but see more diversity in SEO events? United Search is an SEO speaker accelerator designed to specifically aid underrepresented groups, at no cost to students. ● Application - unitedsearch.org/apply ● Mentors - unitedsearch.org/mentors ● Mission - unitedsearch.org/about-us For more info check out unitedsearch.org or @search_united on Twitter.