SlideShare uma empresa Scribd logo
1 de 61
@dawnieando #BigDigitalADL
Generational	
  Cruft	
  &	
  Technical	
  Debt	
  in	
  
SEOGONE	
  IS	
  
NEVER	
  GONE Dawn	
  Anderson	
  @	
  dawnieando
@dawnieando #BigDigitalADL
A New Beginning
§ “A	
  new	
  website	
  will	
  solve	
  ALL	
  our	
  
problems”
§ “Let’s	
  start	
  again”
§ “We’ll	
  just	
  migrate…	
  and	
  redirect	
  
everything”
@dawnieando #BigDigitalADL
404 Page Not Found
§ “Of	
  course,	
  we	
  won’t	
  
redirect	
  everything…”
§ “Not	
  everything	
  will	
  
be	
  worth	
  redirecting”
@dawnieando #BigDigitalADL
410 Gone
§ “Some,	
  we’ll	
  just	
  kill	
  
off	
  with	
  a	
  410…”
§ “Then	
  the	
  URLs	
  will	
  
be	
  gone”
@dawnieando #BigDigitalADL
But in Reality…it Still Exists
@dawnieando #BigDigitalADL
Because…Web Crawler System’s History Logs
SEARCH	
  ENGINES	
  HAVE	
  A	
  	
  BIG	
  MEMORY	
  AND	
  A	
  LOT	
  OF	
  
STORAGE
@dawnieando #BigDigitalADL
Web Crawler System History Logs
@dawnieando #BigDigitalADL
Web Crawler System
GOOGLE	
  NEVER	
  FORGETS
The	
  history	
  logs	
  play	
  a	
  
role	
  in	
  deciding	
  when	
  
every	
  URL	
  that	
  was	
  EVER	
  
discovered	
  gets	
  visited	
  
again
@dawnieando #BigDigitalADL
History Log Records Include:
• URL	
  fingerprint
• Timestamp	
  (last	
  crawl	
  or	
  download	
  
attempt)
• Crawl	
  status	
  (success	
  or	
  error)	
  (Response	
  
code)
• Content	
  checksum	
  (binary	
  code)
• Source	
  ID	
  (accessed	
  from	
  cache	
  or	
  
downloaded)
• Segment	
  identifier	
  (Crawl	
  segment	
  assigned	
  
to??)
• Page	
  importance	
  (a	
  measure	
  of	
  importance	
  
assigned	
  to	
  the	
  URL)
May	
  be	
  
calculated	
  by	
  
identifying	
  
historical	
  
importance	
  
scores	
  based	
  on	
  
past	
  X	
  number	
  of	
  
crawls
@dawnieando #BigDigitalADL
Gone Is Never Gone
“We	
  knew	
  there	
  was	
  content	
  
there	
  at	
  some	
  point	
  so	
  we	
  
just	
  swing	
  by	
  every	
  now	
  and	
  
then	
  to	
  see	
  if	
  anything	
  came	
  
back”	
  (John	
  Mueller,	
  2016)
@dawnieando #BigDigitalADL
Generational Cruft … NOT ‘Crufts’
@dawnieando #BigDigitalADL
The Generational ’Snail Trail’
• Old	
  XML	
  sitemaps
• Redirects	
  drop	
  away	
  on	
  old	
  site	
  
.htaccess
• DNS	
  issues
• People	
  link	
  to	
  old	
  site	
  but	
  wrong	
  
protocol
• Old	
  sites	
  not	
  verified	
  in	
  GSC
• Not	
  all	
  protocols	
  redirecting
Leaving	
  it’s	
  
slithery	
  	
  
footprint
@dawnieando #BigDigitalADL
The Generational ’Snail Trail’
• All	
  eating	
  away	
  at	
  
Googlebot’s attention	
  on	
  
your	
  server’s	
  IP
WATCH	
  OUT	
  FOR	
  THE	
  SNAIL	
  TRAIL	
  &	
  
GENERATIONAL	
  CRUFT
@dawnieando #BigDigitalADL
The Slow Page Evolution of Near Duplicates
In	
  a	
  study	
  over	
  11	
  weeks	
  Denis	
  Fetterly and	
  
Mark	
  Najork found	
  that	
  near-­‐duplicate	
  pages	
  
rarely	
  change	
  and	
  that	
  they	
  are	
  still	
  near-­‐
duplicates	
  of	
  each	
  other	
  10	
  weeks	
  later.	
  
Therefore	
  once	
  identified	
  their	
  download	
  
priority	
  may	
  be	
  reduced	
  so	
  that	
  resources	
  may	
  
be	
  used	
  more	
  efficiently	
  /	
  productively	
  
elsewhere
(Fetterly &	
  Najork,	
  2003)
Fetterly &	
  
Najork,	
  
2003
@dawnieando #BigDigitalADL
‘Transitive’??
Transitive	
  -­‐ A	
  ==	
  B	
  +	
  B	
  ==	
  C	
  then	
  A	
  ==	
  C
THEORY:	
  Maybe	
  for	
  some	
  types	
  of	
  
content	
  more	
  than	
  others	
  – e.g.	
  
ecommerce/directories	
  but	
  not	
  news
THEORY	
  ALERT	
  !!!!!!!
@dawnieando #BigDigitalADL
DUSTBUSTER & DUST CRAWLING RULES
DO	
  NOT	
  
CRAWL	
  IN	
  
THE	
  DUST
BUILDS	
  
‘HINTS’	
  ON	
  
WHAT	
  NOT	
  
TO	
  CRAWL
@dawnieando #BigDigitalADL
‘Sampling’in Crawling for Efficiency
‘SMALL	
  TEST	
  VISITS	
  TO	
  A	
  SITE	
  TO	
  
UNDERSTAND	
  WHETHER	
  IT	
  IS	
  WORTH	
  
CRAWLING’
@dawnieando #BigDigitalADL
Every Site Will Have Its Own Crawling Rules
UNSURE	
  AS	
  TO	
  WHETHER	
  
THIS	
  IS	
  BEING	
  USED
DUSTBUSTER	
  
CRAWLING
RULES
@dawnieando #BigDigitalADL
Popular CMS ’Rule Patterns’ (URL Parameters)
ALL	
  WILL	
  HAVE	
  COMMON	
  
CANONICALIZATION	
  PATTERNS	
  WHICH	
  
CAN	
  BE	
  LEARNED
@dawnieando #BigDigitalADL
CRAWLING RULES BUILT OVER TIME
Crawl	
  Frequency	
  Patterns
No	
  two	
  sites	
  will	
  have	
  the	
  same	
  crawl	
  
schedules	
  or	
  rules	
  built
Moving	
  from	
  one	
  CMS	
  to	
  another	
  may	
  
mean	
  that	
  different	
  parameters	
  are	
  
created.	
  	
  New	
  parameters	
  =	
  new	
  rules	
  
@dawnieando #BigDigitalADL
Every Version of Your Past Ecommerce Sites
“Exponentially	
  
multiplicative	
  
URLs”
Had	
  potential	
  to	
  spew…	
  at	
  some	
  point…
@dawnieando #BigDigitalADL
URLs Take Their Place in Crawling Queues
The	
  Queue	
  Gets	
  Long	
  &	
  
Congested
@dawnieando #BigDigitalADL
YOU INHERITED SEO TECHNICAL DEBT
• Previous	
  content	
  /	
  link	
  manual	
  actions
• Previous	
  algorithmic	
  suppressions
• Past	
  infinite	
  loops
• “We’ll	
  SEO	
  it	
  after	
  launch”
• “SEO	
  is	
  dead…	
  so	
  we	
  won’t	
  optimise”
• Dodgy	
  URL	
  parameters
• Misconfigured	
  URL	
  parameters
• Old	
  URL	
  crawling	
  ‘rules	
  /	
  hints’
@dawnieando #BigDigitalADL
TECHNICAL DEBT
@dawnieando #BigDigitalADL
IT WASN’T ME – PASSING THE BUCK
@dawnieando #BigDigitalADL
… with Interest
AT	
  SOME	
  POINT	
  IT	
  
MUST	
  BE	
  REPAID
@dawnieando #BigDigitalADL
GENERATIONAL CRUFT
EVERY	
  SINGLE	
  TIME	
  YOU	
  MIGRATE,	
  CHANGE	
  DESIGN,	
  REDIRECT,	
  REINVENT	
  A	
  SITE	
  /	
  URL
A	
  CLEAN	
  START
REDIRECTIONS
ANOTHER	
  STRUCTURE
FIRST	
  SITE	
  
STRUCTURE
NEW	
  CRAWLING	
  ‘RULES’	
  
BUILT
CRAWLING	
  
‘RULES’	
  BUILT
EVERYTHING	
  
IS	
  ‘200	
  OK’
MORE	
  URLs
MIXED	
  RESPONSE	
  CODES
REDIRECTIONS
‘FUZZINESS’	
  IS	
  EMERGING
NEW	
  CRAWLING	
  ‘RULES’	
  BUILT
MORE	
  URLs
REDIRECT	
  CHAINS	
  &	
  MIXED	
  
RESPONSE	
  CODES
NEW	
  SEO’s	
  DON’T	
  
KNOW	
  THE	
  ‘HISTORY’
TARGET	
  URLs	
  NOW	
  ‘VERY	
  FUZZY’
@dawnieando #BigDigitalADL
Aged ‘Patchwork Quilt’Sites
A	
  LITTLE	
  BIT	
  OF	
  THIS	
  CMS	
  AND	
  A	
  
LITTLE	
  BIT	
  OF	
  THAT	
  CMS
MANY	
  HISTORICAL	
  PARAMETERS	
  
CREATED
@dawnieando #BigDigitalADL
’Fuzzy’ URL Targets with Each Site Generation
EVERYTHING	
  GETS	
  
A	
  BIT	
  BLURRED
‘Which	
  is	
  the	
  target	
  URL	
  
again?
@dawnieando #BigDigitalADL
Time Seems To Fly… The Older You Get
Your	
  new	
  site	
  URL	
  is	
  just	
  
one	
  of	
  very	
  many	
  historical	
  
URLs	
  on	
  your	
  IP	
  to	
  be	
  
visited	
  periodically
A	
  tiny	
  fish	
  in	
  a	
  very	
  
big	
  URL	
  pond	
  queue
@dawnieando #BigDigitalADL
The Great 302s Pass PageRank Debate
@dawnieando #BigDigitalADL
SOLUTION - THE BELOVED CANONICAL
§ 30X	
  redirects
§ Canonical	
  tag
§ Href lang
§ HTTPS	
  protocol
§ Global	
  canonicalization	
  
rules
In	
  ’ALL’	
  its	
  forms
@dawnieando #BigDigitalADL
Chocolate Boxes Research
@dawnieando #BigDigitalADL
Advanced Technical SEO?
50%  of  SEOs  surveyed  
considered;;
“CANONICALIZATION  
IS  ADVANCED  
TECHNICAL  SEO”
@dawnieando #BigDigitalADL
Oh Yeah – Canonicalization is Easy
76%  of  SEOs  surveyed  
considered;;
“CANONICALIZATION  
IS  AN  EASY  CONCEPT  
TO  UNDERSTAND”
@dawnieando #BigDigitalADL
REL NEXT REL PREV is NOT Canonicalization
47%  of  SEOs  
categorizing  themselves  
as  ‘TECHNICAL  SEO’s  
considered;;
“REL=NEXT  /  REL  =  
PREV”  IS  A  FORM  OF  
CANONICALIZATION
@dawnieando #BigDigitalADL
On Redirections as Canonicalization Forms
Lots  were  unsure  about;;
“301s  and  302s  are  
BOTH  forms  of  
canonicalization”
@dawnieando #BigDigitalADL
On Href Lang as Canonicalization
Only  64%  of  ’Technical
SEOs’  thought  HRef
Lang  was  a  form  of
Canonicalization
IT  IS
@dawnieando #BigDigitalADL
URL Parameter Handling is Your Friend
Help	
  Google	
  Build	
  ‘Crawling	
  
Rules’	
  for	
  your	
  site	
  rather	
  
than	
  wasting	
  time	
  on	
  
‘sampling’	
  and	
  giving	
  a	
  bad	
  
impression
GIVE	
  HELP	
  AND	
  
GUIDANCE	
  WITH	
  THE	
  
CRAWL	
  RULE	
  AND	
  
HINT	
  BUILDING
@dawnieando #BigDigitalADL
SOLUTION - Understand URL Parameters
ACTIVE	
  PARAMETERS	
  ==	
  CHANGE	
  THE	
  CONTENT	
  ON	
  
YOUR	
  PAGE
(e.g.	
  sort,	
  filter,	
  translate,	
  paginate,	
  specify)
PASSIVE	
  PARAMETERS	
  ==	
  DO	
  NOT	
  CHANGE	
  THE	
  
CONTENT	
  ON	
  YOUR	
  PAGE
(Often	
  used	
  for	
  tracking)	
  	
  (ALIAS:	
  REPRESENTATIVE)
@dawnieando #BigDigitalADL
ACTIVE Parameters (CHANGE CONTENT)
SORT	
  ==	
  Sorts	
  dynamic	
  items	
  and	
  reorders	
  in	
  descending	
  /	
  ascending	
  price	
  /	
  
popularity	
  /	
  added
NARROWING	
  ==	
  Filters	
  dynamically	
  added	
  items	
  down	
  to	
  include	
  only	
  
features	
  &	
  attributes	
  in	
  a	
  chosen	
  consideration	
  set
SPECIFYING	
  ==	
  Identifies	
  a	
  particular	
  dynamically	
  variable	
  populated	
  
content	
  set	
  within	
  a	
  site	
  section	
  (e.g.	
  store=women)
TRANSLATING	
  ==	
  Indicates	
  a	
  language	
  driven	
  translation	
  URL	
  (e.g.	
  lang=fr)
PAGINATING	
  ==	
  Indicates	
  a	
  paginated	
  display	
  of	
  long	
  content	
  (e.g.	
  page=2)
@dawnieando #BigDigitalADL
Understand How URLs with
Multiple Parameters Are Handled
The	
  most	
  restrictive	
  parameter	
  blocked	
  overrules	
  
lesser	
  restrictions
@dawnieando #BigDigitalADL
Examples of Multiple Parameter Handling
KNOW	
  THE	
  
RULES
http://www.example.com?shopping-­‐category=DVD-­‐movies&sort-­‐
by=production-­‐year&sort-­‐order=asc WILL	
  BE	
  CRAWLED
http://www.example.com?shopping-­‐category=shoes&sort-­‐by=size&sort-­‐
order=asc WILL	
  NOT	
  BE	
  CRAWLED	
  (production-­‐year	
  blocks)
@dawnieando #BigDigitalADL
Help Googlebot Get Round its Shopping List
OPEN	
  MORE	
  CHECKOUTS
WIDEN	
  THE	
  AISLES
MAKE	
  THINGS	
  EASY	
  TO	
  FIND
DON’T	
  CONFUSE	
  
GOOGLEBOT
HELP	
  FILL	
  THE	
  TROLLEY	
  
QUICKLY
SPEED,	
  SPEED,	
  SPEED
@dawnieando #BigDigitalADL
SOLUTION - XML SitemapsAre Your
Friend… (Strong Foundations)
They	
  help	
  to	
  
pass	
  
‘importance’	
  
signals	
  within	
  
a	
  site
But…	
  never	
  
leave	
  them	
  to	
  
just	
  
autogenerate
without	
  
periodically	
  
checking
‘The	
  
foundations’	
  
underneath	
  a	
  
site
@dawnieando #BigDigitalADL
Validate & Retain in GSC ALL Past Domains &
Past Site Versions (Protocols (HTTPS / HTTP)
THERE	
  MAY	
  STILL	
  BE	
  UNDETECTED	
  ACTIVITY	
  GOING	
  ON	
  THERE
@dawnieando #BigDigitalADL
Server Log FileAnalysis is Your Friend…
You’ll	
  be	
  surprised	
  by	
  what	
  you	
  find
Find	
  out	
  what	
  Googlebot is	
  
visiting	
  and	
  when	
  (how	
  
often)	
  and	
  whether	
  it	
  
should	
  be	
  visiting	
  it	
  at	
  all
@dawnieando #BigDigitalADL
SOLUTION - Save & Grow The URLs
Not	
  EVERYTHING	
  is	
  
worthy	
  of	
  its	
  own	
  URL
VARIANTS
STEMMINGS
PLURALS
RANDOM	
  TAGS
LONG,	
  LONG,	
  LONG	
  
TAIL	
  PARAMETERS
@dawnieando #BigDigitalADL
SOLUTION - Save & Grow The URLs
A	
  URL	
  is	
  like	
  a	
  
fine	
  wine
Maturing	
  over	
  
time
@dawnieando #BigDigitalADL
Pass Strong Clues - Highly Relevant New Structure
STRONG
SEMANTICS
@dawnieando #BigDigitalADL
Wiki Page Redirects
https://dbpedia.org/sparql
Wikipedia	
  
Redirects
thesaurus.com
OR	
  A	
  GOOD	
  OLD	
  FASHIONED	
  THESAURUS
@dawnieando #BigDigitalADL
Increase ‘Importance’ quickly of target URLs
• Internal	
  link	
  optimization
• Canonicalise to	
  (if	
  relevant)
• Strengthen	
  up	
  importance	
  signals
• Inclusion	
  in	
  front	
  facing	
  and	
  XML	
  
sitemaps
• Improve	
  the	
  content	
  &	
  keep	
  it	
  
updated
• 301	
  redirect	
  to	
  (if	
  relevant	
  
redundant	
  content)
@dawnieando #BigDigitalADL
Reduce ‘Importance’ quickly of old URLs
• Internal	
  link	
  unoptimization
• 410
• Dig	
  out	
  URLs	
  with	
  links	
  to	
  them
• Orphan	
  URLs
• Canonicals	
  to	
  HTTPs
• Exclusion	
  from	
  XML	
  sitemaps	
  (even	
  
old	
  ones	
  on	
  the	
  server)
• Strip	
  out	
  content
@dawnieando #BigDigitalADL
304 IF MODIFIED HEADERS
ONLY	
  DOWNLOAD	
  IF	
  THE	
  
CONTENT	
  CHECKSUM	
  HAS	
  
CHANGED
@dawnieando #BigDigitalADL
CHOP BACK CHAINS
@dawnieando #BigDigitalADL
SOLUTION – Be Careful About Creating
New Dynamic Parameters
QUEUEING…	
  AGAIN
Waiting	
  for	
  good	
  URLs	
  to	
  be	
  
visited…	
  AGAIN
@dawnieando #BigDigitalADL
REVISIT PAST .HTACCESS FILES
Can	
  you	
  rewrite	
  the	
  rules	
  to	
  be	
  more	
  
efficient	
  or	
  cut	
  out	
  some	
  old	
  rules	
  still	
  
firing	
  unnecessarily?
@dawnieando #BigDigitalADL
SOLUTION - NEVER TRY TO ‘OUTRUN’
GOOGLEBOT
@dawnieando #BigDigitalADL
You Have a Shiny New Site… So What?
You	
  may	
  still	
  have…
GENERATIONAL	
  CRUFT	
  &	
  TECHNICAL	
  
DEBT	
  TO	
  PAY	
  OFF
GONE	
  IS	
  NEVER	
  GONE
@dawnieando #BigDigitalADL
REMEMBER
”Gone	
  is	
  
Never	
  Gone”
“Google	
  Has	
  A	
  
Big	
  Memory”
Dawn	
  Anderson	
  @	
  dawnieando
THANK	
  YOU
@dawnieando #BigDigitalADL
Sources & References
• https://patentimages.storage.googleapis.com/US8042112B1/US08042112-­‐
20111018-­‐D00000.png
• Randall,	
  K.H.,	
  Google	
  Inc.,	
  2010. Scheduler	
  for	
  search	
  engine	
  crawler.	
  U.S.	
  Patent	
  
7,725,452.

Mais conteúdo relacionado

Mais procurados

Duplicate Content Myths Types and Ways To Make It Work For You
Duplicate Content Myths Types and Ways To Make It Work For YouDuplicate Content Myths Types and Ways To Make It Work For You
Duplicate Content Myths Types and Ways To Make It Work For YouDawn Anderson MSc DigM
 
SEO Crawl Rank And Crawl Tank - Brighton SEO April 2016
SEO Crawl Rank And Crawl Tank - Brighton SEO April 2016SEO Crawl Rank And Crawl Tank - Brighton SEO April 2016
SEO Crawl Rank And Crawl Tank - Brighton SEO April 2016Dawn Anderson MSc DigM
 
Pubcon florida 2018 logs dont lie dawn anderson
Pubcon florida 2018 logs dont lie dawn andersonPubcon florida 2018 logs dont lie dawn anderson
Pubcon florida 2018 logs dont lie dawn andersonDawn Anderson MSc DigM
 
Dawn Anderson SEO Consumer Choice Crawl Budget Optimization Conflicts
Dawn Anderson SEO Consumer Choice Crawl Budget Optimization ConflictsDawn Anderson SEO Consumer Choice Crawl Budget Optimization Conflicts
Dawn Anderson SEO Consumer Choice Crawl Budget Optimization ConflictsDawn Anderson MSc DigM
 
Technical SEO Myths Facts And Theories On Crawl Budget And The Importance Of ...
Technical SEO Myths Facts And Theories On Crawl Budget And The Importance Of ...Technical SEO Myths Facts And Theories On Crawl Budget And The Importance Of ...
Technical SEO Myths Facts And Theories On Crawl Budget And The Importance Of ...Dawn Anderson MSc DigM
 
SEO and The Mobile-First Paradigm Shift
SEO and The Mobile-First Paradigm ShiftSEO and The Mobile-First Paradigm Shift
SEO and The Mobile-First Paradigm ShiftDawn Anderson MSc DigM
 
SEO - The Rise of Persona Modelled Intent Driven Contextual Search
SEO - The Rise of Persona Modelled Intent Driven Contextual SearchSEO - The Rise of Persona Modelled Intent Driven Contextual Search
SEO - The Rise of Persona Modelled Intent Driven Contextual SearchDawn Anderson MSc DigM
 
Technical SEO - Generational cruft in SEO - there is never a new site when th...
Technical SEO - Generational cruft in SEO - there is never a new site when th...Technical SEO - Generational cruft in SEO - there is never a new site when th...
Technical SEO - Generational cruft in SEO - there is never a new site when th...Dawn Anderson MSc DigM
 
AMP Accelerated Mobile Pages - To AMPFinity And Beyond
AMP Accelerated Mobile Pages - To AMPFinity And BeyondAMP Accelerated Mobile Pages - To AMPFinity And Beyond
AMP Accelerated Mobile Pages - To AMPFinity And BeyondDawn Anderson MSc DigM
 
Cruft busting technical debt code smell and refactoring for seo - state of ...
Cruft busting   technical debt code smell and refactoring for seo - state of ...Cruft busting   technical debt code smell and refactoring for seo - state of ...
Cruft busting technical debt code smell and refactoring for seo - state of ...Dawn Anderson MSc DigM
 
Owl and The Hummingbird - Ontology and SEO
Owl and The Hummingbird - Ontology and SEOOwl and The Hummingbird - Ontology and SEO
Owl and The Hummingbird - Ontology and SEODawn Anderson MSc DigM
 
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AU
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AUKeeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AU
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AUJason Mun
 
Crawl Budget - Some Insights & Ideas @ seokomm 2015
Crawl Budget - Some Insights & Ideas @ seokomm 2015Crawl Budget - Some Insights & Ideas @ seokomm 2015
Crawl Budget - Some Insights & Ideas @ seokomm 2015Jan Hendrik Merlin Jacob
 
Lots of ways to speed up your site
Lots of ways to speed up your siteLots of ways to speed up your site
Lots of ways to speed up your siteIan Lurie
 
Google Search Engine Ranking Position - 200 Top Ranking Factors for SEO Marke...
Google Search Engine Ranking Position - 200 Top Ranking Factors for SEO Marke...Google Search Engine Ranking Position - 200 Top Ranking Factors for SEO Marke...
Google Search Engine Ranking Position - 200 Top Ranking Factors for SEO Marke...Ronald Soh
 
SearchLove San Diego 2018 | Will Critchlow | From the Horse’s Mouth: What We ...
SearchLove San Diego 2018 | Will Critchlow | From the Horse’s Mouth: What We ...SearchLove San Diego 2018 | Will Critchlow | From the Horse’s Mouth: What We ...
SearchLove San Diego 2018 | Will Critchlow | From the Horse’s Mouth: What We ...Distilled
 
Google's Search Signals For Page Experience - SMX Advanced 2021 Patrick Stox
Google's Search Signals For Page Experience - SMX Advanced 2021 Patrick StoxGoogle's Search Signals For Page Experience - SMX Advanced 2021 Patrick Stox
Google's Search Signals For Page Experience - SMX Advanced 2021 Patrick StoxAhrefs
 
Moving URLs: Structural Web changes 
without losing rankings #SearchLove
Moving URLs: Structural Web changes 
without losing rankings #SearchLoveMoving URLs: Structural Web changes 
without losing rankings #SearchLove
Moving URLs: Structural Web changes 
without losing rankings #SearchLoveAleyda Solís
 
How We Make Websites
How We Make WebsitesHow We Make Websites
How We Make Websitesfantasticlife
 

Mais procurados (20)

Duplicate Content Myths Types and Ways To Make It Work For You
Duplicate Content Myths Types and Ways To Make It Work For YouDuplicate Content Myths Types and Ways To Make It Work For You
Duplicate Content Myths Types and Ways To Make It Work For You
 
SEO Crawl Rank And Crawl Tank - Brighton SEO April 2016
SEO Crawl Rank And Crawl Tank - Brighton SEO April 2016SEO Crawl Rank And Crawl Tank - Brighton SEO April 2016
SEO Crawl Rank And Crawl Tank - Brighton SEO April 2016
 
Pubcon florida 2018 logs dont lie dawn anderson
Pubcon florida 2018 logs dont lie dawn andersonPubcon florida 2018 logs dont lie dawn anderson
Pubcon florida 2018 logs dont lie dawn anderson
 
Dawn Anderson SEO Consumer Choice Crawl Budget Optimization Conflicts
Dawn Anderson SEO Consumer Choice Crawl Budget Optimization ConflictsDawn Anderson SEO Consumer Choice Crawl Budget Optimization Conflicts
Dawn Anderson SEO Consumer Choice Crawl Budget Optimization Conflicts
 
Technical SEO Myths Facts And Theories On Crawl Budget And The Importance Of ...
Technical SEO Myths Facts And Theories On Crawl Budget And The Importance Of ...Technical SEO Myths Facts And Theories On Crawl Budget And The Importance Of ...
Technical SEO Myths Facts And Theories On Crawl Budget And The Importance Of ...
 
SEO and The Mobile-First Paradigm Shift
SEO and The Mobile-First Paradigm ShiftSEO and The Mobile-First Paradigm Shift
SEO and The Mobile-First Paradigm Shift
 
SEO - The Rise of Persona Modelled Intent Driven Contextual Search
SEO - The Rise of Persona Modelled Intent Driven Contextual SearchSEO - The Rise of Persona Modelled Intent Driven Contextual Search
SEO - The Rise of Persona Modelled Intent Driven Contextual Search
 
Technical SEO - Generational cruft in SEO - there is never a new site when th...
Technical SEO - Generational cruft in SEO - there is never a new site when th...Technical SEO - Generational cruft in SEO - there is never a new site when th...
Technical SEO - Generational cruft in SEO - there is never a new site when th...
 
AMP Accelerated Mobile Pages - To AMPFinity And Beyond
AMP Accelerated Mobile Pages - To AMPFinity And BeyondAMP Accelerated Mobile Pages - To AMPFinity And Beyond
AMP Accelerated Mobile Pages - To AMPFinity And Beyond
 
Cruft busting technical debt code smell and refactoring for seo - state of ...
Cruft busting   technical debt code smell and refactoring for seo - state of ...Cruft busting   technical debt code smell and refactoring for seo - state of ...
Cruft busting technical debt code smell and refactoring for seo - state of ...
 
Owl and The Hummingbird - Ontology and SEO
Owl and The Hummingbird - Ontology and SEOOwl and The Hummingbird - Ontology and SEO
Owl and The Hummingbird - Ontology and SEO
 
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AU
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AUKeeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AU
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AU
 
Crawl Budget - Some Insights & Ideas @ seokomm 2015
Crawl Budget - Some Insights & Ideas @ seokomm 2015Crawl Budget - Some Insights & Ideas @ seokomm 2015
Crawl Budget - Some Insights & Ideas @ seokomm 2015
 
SEO Benchmark
SEO BenchmarkSEO Benchmark
SEO Benchmark
 
Lots of ways to speed up your site
Lots of ways to speed up your siteLots of ways to speed up your site
Lots of ways to speed up your site
 
Google Search Engine Ranking Position - 200 Top Ranking Factors for SEO Marke...
Google Search Engine Ranking Position - 200 Top Ranking Factors for SEO Marke...Google Search Engine Ranking Position - 200 Top Ranking Factors for SEO Marke...
Google Search Engine Ranking Position - 200 Top Ranking Factors for SEO Marke...
 
SearchLove San Diego 2018 | Will Critchlow | From the Horse’s Mouth: What We ...
SearchLove San Diego 2018 | Will Critchlow | From the Horse’s Mouth: What We ...SearchLove San Diego 2018 | Will Critchlow | From the Horse’s Mouth: What We ...
SearchLove San Diego 2018 | Will Critchlow | From the Horse’s Mouth: What We ...
 
Google's Search Signals For Page Experience - SMX Advanced 2021 Patrick Stox
Google's Search Signals For Page Experience - SMX Advanced 2021 Patrick StoxGoogle's Search Signals For Page Experience - SMX Advanced 2021 Patrick Stox
Google's Search Signals For Page Experience - SMX Advanced 2021 Patrick Stox
 
Moving URLs: Structural Web changes 
without losing rankings #SearchLove
Moving URLs: Structural Web changes 
without losing rankings #SearchLoveMoving URLs: Structural Web changes 
without losing rankings #SearchLove
Moving URLs: Structural Web changes 
without losing rankings #SearchLove
 
How We Make Websites
How We Make WebsitesHow We Make Websites
How We Make Websites
 

Semelhante a Technical SEO - Gone is Never Gone - Fixing Generational Cruft and Technical Debt in SEO - Big Digital Adelaide 2017

Codemotion Berlin 2018 - AI with a devops mindset: experimentation, sharing a...
Codemotion Berlin 2018 - AI with a devops mindset: experimentation, sharing a...Codemotion Berlin 2018 - AI with a devops mindset: experimentation, sharing a...
Codemotion Berlin 2018 - AI with a devops mindset: experimentation, sharing a...Thiago de Faria
 
Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...
Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...
Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...Codemotion
 
Codemotion Milan 2018 - AI with a devops mindset: experimentation, sharing an...
Codemotion Milan 2018 - AI with a devops mindset: experimentation, sharing an...Codemotion Milan 2018 - AI with a devops mindset: experimentation, sharing an...
Codemotion Milan 2018 - AI with a devops mindset: experimentation, sharing an...Thiago de Faria
 
Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...
Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...
Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...Codemotion
 
David Brown - Crawl Efficiency & Fixing Common Crawl Issues
David Brown - Crawl Efficiency & Fixing Common Crawl Issues David Brown - Crawl Efficiency & Fixing Common Crawl Issues
David Brown - Crawl Efficiency & Fixing Common Crawl Issues tmwi
 
SEO Table Stakes - Clarity 14
SEO Table Stakes - Clarity 14SEO Table Stakes - Clarity 14
SEO Table Stakes - Clarity 14Keith Goode
 
SearchLove Boston 2016 | Mike King | Developer Thinking for SEOs
SearchLove Boston 2016 | Mike King | Developer Thinking for SEOsSearchLove Boston 2016 | Mike King | Developer Thinking for SEOs
SearchLove Boston 2016 | Mike King | Developer Thinking for SEOsDistilled
 
Tips and tactics that generate clicks and impressions - Daniel Brooks
Tips and tactics that generate clicks and impressions - Daniel BrooksTips and tactics that generate clicks and impressions - Daniel Brooks
Tips and tactics that generate clicks and impressions - Daniel BrooksSearchNorwich
 
Search Norwich #7 - Keyword Research Tips & Tactics To Increase Clicks & Impr...
Search Norwich #7 - Keyword Research Tips & Tactics To Increase Clicks & Impr...Search Norwich #7 - Keyword Research Tips & Tactics To Increase Clicks & Impr...
Search Norwich #7 - Keyword Research Tips & Tactics To Increase Clicks & Impr...Daniel Brooks
 
Winning SEO when doing Web Migrations #SEO4Life
Winning SEO when doing Web Migrations #SEO4LifeWinning SEO when doing Web Migrations #SEO4Life
Winning SEO when doing Web Migrations #SEO4LifeAleyda Solís
 
What To Do When You Can't Do Anything - SEO Soft Skills - Clarity '14
What To Do When You Can't Do Anything - SEO Soft Skills - Clarity '14What To Do When You Can't Do Anything - SEO Soft Skills - Clarity '14
What To Do When You Can't Do Anything - SEO Soft Skills - Clarity '14Keith Goode
 
The Best Employee Communications Tool You Already Own But Aren't Using
The Best Employee Communications Tool You Already Own But Aren't UsingThe Best Employee Communications Tool You Already Own But Aren't Using
The Best Employee Communications Tool You Already Own But Aren't UsingJWTINSIDE
 
Do we know our data, as good as we know our tools
Do we know our data, as good as we know our tools Do we know our data, as good as we know our tools
Do we know our data, as good as we know our tools Jeremie Charlet
 
How Search Works
How Search WorksHow Search Works
How Search WorksAhrefs
 
devopsdays Kiel 2018 - Can the AI hype & ML algorithms harm your devops initi...
devopsdays Kiel 2018 - Can the AI hype & ML algorithms harm your devops initi...devopsdays Kiel 2018 - Can the AI hype & ML algorithms harm your devops initi...
devopsdays Kiel 2018 - Can the AI hype & ML algorithms harm your devops initi...Thiago de Faria
 
SMX West: Future-Proof Your Site for Google's Core Algorithm Updates by Lily Ray
SMX West: Future-Proof Your Site for Google's Core Algorithm Updates by Lily RaySMX West: Future-Proof Your Site for Google's Core Algorithm Updates by Lily Ray
SMX West: Future-Proof Your Site for Google's Core Algorithm Updates by Lily RayLily Ray
 
SEO In 2022: Google Discover and Microsite SERPs - (SEMrush Webinar)
SEO In 2022: Google Discover and Microsite SERPs - (SEMrush Webinar)SEO In 2022: Google Discover and Microsite SERPs - (SEMrush Webinar)
SEO In 2022: Google Discover and Microsite SERPs - (SEMrush Webinar)Evolving SEO
 
International SEO #SEBC Stockholm
International SEO #SEBC StockholmInternational SEO #SEBC Stockholm
International SEO #SEBC StockholmLisa Myers
 
Website Migrations at SMX Munich 2019 - Patrick Stox
Website Migrations at SMX Munich 2019 - Patrick StoxWebsite Migrations at SMX Munich 2019 - Patrick Stox
Website Migrations at SMX Munich 2019 - Patrick Stoxpatrickstox
 

Semelhante a Technical SEO - Gone is Never Gone - Fixing Generational Cruft and Technical Debt in SEO - Big Digital Adelaide 2017 (20)

Codemotion Berlin 2018 - AI with a devops mindset: experimentation, sharing a...
Codemotion Berlin 2018 - AI with a devops mindset: experimentation, sharing a...Codemotion Berlin 2018 - AI with a devops mindset: experimentation, sharing a...
Codemotion Berlin 2018 - AI with a devops mindset: experimentation, sharing a...
 
Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...
Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...
Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...
 
Codemotion Milan 2018 - AI with a devops mindset: experimentation, sharing an...
Codemotion Milan 2018 - AI with a devops mindset: experimentation, sharing an...Codemotion Milan 2018 - AI with a devops mindset: experimentation, sharing an...
Codemotion Milan 2018 - AI with a devops mindset: experimentation, sharing an...
 
Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...
Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...
Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...
 
David Brown - Crawl Efficiency & Fixing Common Crawl Issues
David Brown - Crawl Efficiency & Fixing Common Crawl Issues David Brown - Crawl Efficiency & Fixing Common Crawl Issues
David Brown - Crawl Efficiency & Fixing Common Crawl Issues
 
SEO Table Stakes - Clarity 14
SEO Table Stakes - Clarity 14SEO Table Stakes - Clarity 14
SEO Table Stakes - Clarity 14
 
SearchLove Boston 2016 | Mike King | Developer Thinking for SEOs
SearchLove Boston 2016 | Mike King | Developer Thinking for SEOsSearchLove Boston 2016 | Mike King | Developer Thinking for SEOs
SearchLove Boston 2016 | Mike King | Developer Thinking for SEOs
 
Tips and tactics that generate clicks and impressions - Daniel Brooks
Tips and tactics that generate clicks and impressions - Daniel BrooksTips and tactics that generate clicks and impressions - Daniel Brooks
Tips and tactics that generate clicks and impressions - Daniel Brooks
 
Search Norwich #7 - Keyword Research Tips & Tactics To Increase Clicks & Impr...
Search Norwich #7 - Keyword Research Tips & Tactics To Increase Clicks & Impr...Search Norwich #7 - Keyword Research Tips & Tactics To Increase Clicks & Impr...
Search Norwich #7 - Keyword Research Tips & Tactics To Increase Clicks & Impr...
 
Winning SEO when doing Web Migrations #SEO4Life
Winning SEO when doing Web Migrations #SEO4LifeWinning SEO when doing Web Migrations #SEO4Life
Winning SEO when doing Web Migrations #SEO4Life
 
What To Do When You Can't Do Anything - SEO Soft Skills - Clarity '14
What To Do When You Can't Do Anything - SEO Soft Skills - Clarity '14What To Do When You Can't Do Anything - SEO Soft Skills - Clarity '14
What To Do When You Can't Do Anything - SEO Soft Skills - Clarity '14
 
The Best Employee Communications Tool You Already Own But Aren't Using
The Best Employee Communications Tool You Already Own But Aren't UsingThe Best Employee Communications Tool You Already Own But Aren't Using
The Best Employee Communications Tool You Already Own But Aren't Using
 
Do we know our data, as good as we know our tools
Do we know our data, as good as we know our tools Do we know our data, as good as we know our tools
Do we know our data, as good as we know our tools
 
How Search Works
How Search WorksHow Search Works
How Search Works
 
devopsdays Kiel 2018 - Can the AI hype & ML algorithms harm your devops initi...
devopsdays Kiel 2018 - Can the AI hype & ML algorithms harm your devops initi...devopsdays Kiel 2018 - Can the AI hype & ML algorithms harm your devops initi...
devopsdays Kiel 2018 - Can the AI hype & ML algorithms harm your devops initi...
 
Technical SEO Audit
Technical SEO AuditTechnical SEO Audit
Technical SEO Audit
 
SMX West: Future-Proof Your Site for Google's Core Algorithm Updates by Lily Ray
SMX West: Future-Proof Your Site for Google's Core Algorithm Updates by Lily RaySMX West: Future-Proof Your Site for Google's Core Algorithm Updates by Lily Ray
SMX West: Future-Proof Your Site for Google's Core Algorithm Updates by Lily Ray
 
SEO In 2022: Google Discover and Microsite SERPs - (SEMrush Webinar)
SEO In 2022: Google Discover and Microsite SERPs - (SEMrush Webinar)SEO In 2022: Google Discover and Microsite SERPs - (SEMrush Webinar)
SEO In 2022: Google Discover and Microsite SERPs - (SEMrush Webinar)
 
International SEO #SEBC Stockholm
International SEO #SEBC StockholmInternational SEO #SEBC Stockholm
International SEO #SEBC Stockholm
 
Website Migrations at SMX Munich 2019 - Patrick Stox
Website Migrations at SMX Munich 2019 - Patrick StoxWebsite Migrations at SMX Munich 2019 - Patrick Stox
Website Migrations at SMX Munich 2019 - Patrick Stox
 

Mais de Dawn Anderson MSc DigM

Human vs AI Quality Raters for Search Engines.pdf
Human vs AI Quality Raters for Search Engines.pdfHuman vs AI Quality Raters for Search Engines.pdf
Human vs AI Quality Raters for Search Engines.pdfDawn Anderson MSc DigM
 
Life of An SEO - Surfing The Waves of Googles Many Algorithmic Updates
Life of An SEO - Surfing The Waves of Googles Many Algorithmic UpdatesLife of An SEO - Surfing The Waves of Googles Many Algorithmic Updates
Life of An SEO - Surfing The Waves of Googles Many Algorithmic UpdatesDawn Anderson MSc DigM
 
Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...
Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...
Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...Dawn Anderson MSc DigM
 
Passage indexing is likely more important than you think
Passage indexing is likely more important than you thinkPassage indexing is likely more important than you think
Passage indexing is likely more important than you thinkDawn Anderson MSc DigM
 
Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...
Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...
Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...Dawn Anderson MSc DigM
 
Google BERT - SMX London 2020 Virtual Conference
Google BERT - SMX London 2020 Virtual ConferenceGoogle BERT - SMX London 2020 Virtual Conference
Google BERT - SMX London 2020 Virtual ConferenceDawn Anderson MSc DigM
 
Google BERT - What SEOs and Marketers Need to Know
Google BERT - What SEOs and Marketers Need to KnowGoogle BERT - What SEOs and Marketers Need to Know
Google BERT - What SEOs and Marketers Need to KnowDawn Anderson MSc DigM
 
Disambiguating Equiprobability in SEO Dawn Anderson Friends of Search 2020
Disambiguating Equiprobability in SEO Dawn Anderson Friends of Search 2020Disambiguating Equiprobability in SEO Dawn Anderson Friends of Search 2020
Disambiguating Equiprobability in SEO Dawn Anderson Friends of Search 2020Dawn Anderson MSc DigM
 
2019 Tech SEO Boost Dawn Anderson Contextual Recommender Search
2019 Tech SEO Boost Dawn Anderson Contextual Recommender Search2019 Tech SEO Boost Dawn Anderson Contextual Recommender Search
2019 Tech SEO Boost Dawn Anderson Contextual Recommender SearchDawn Anderson MSc DigM
 
Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...
Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...
Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...Dawn Anderson MSc DigM
 
Planning an SEO Strategy for a New Website - SMXL Milan 2019
Planning an SEO Strategy for a New Website - SMXL Milan 2019Planning an SEO Strategy for a New Website - SMXL Milan 2019
Planning an SEO Strategy for a New Website - SMXL Milan 2019Dawn Anderson MSc DigM
 
Google BERT and Family and the Natural Language Understanding Leaderboard Race
Google BERT and Family and the Natural Language Understanding Leaderboard RaceGoogle BERT and Family and the Natural Language Understanding Leaderboard Race
Google BERT and Family and the Natural Language Understanding Leaderboard RaceDawn Anderson MSc DigM
 
The User is the Query - The Rise of Predictive Proactive Search
The User is the Query - The Rise of Predictive Proactive SearchThe User is the Query - The Rise of Predictive Proactive Search
The User is the Query - The Rise of Predictive Proactive SearchDawn Anderson MSc DigM
 
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...Dawn Anderson MSc DigM
 
Using topic modelling frameworks for NLP and semantic search
Using topic modelling frameworks for NLP and semantic searchUsing topic modelling frameworks for NLP and semantic search
Using topic modelling frameworks for NLP and semantic searchDawn Anderson MSc DigM
 
Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...
Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...
Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...Dawn Anderson MSc DigM
 
The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...
The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...
The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...Dawn Anderson MSc DigM
 
Voice Search Challenges For Search and Information Retrieval and SEO
Voice Search Challenges For Search and Information Retrieval and SEOVoice Search Challenges For Search and Information Retrieval and SEO
Voice Search Challenges For Search and Information Retrieval and SEODawn Anderson MSc DigM
 

Mais de Dawn Anderson MSc DigM (20)

Human vs AI Quality Raters for Search Engines.pdf
Human vs AI Quality Raters for Search Engines.pdfHuman vs AI Quality Raters for Search Engines.pdf
Human vs AI Quality Raters for Search Engines.pdf
 
Life of An SEO - Surfing The Waves of Googles Many Algorithmic Updates
Life of An SEO - Surfing The Waves of Googles Many Algorithmic UpdatesLife of An SEO - Surfing The Waves of Googles Many Algorithmic Updates
Life of An SEO - Surfing The Waves of Googles Many Algorithmic Updates
 
Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...
Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...
Natural Semantic SEO - Surfacing Walnuts in Densely Represented, Every Increa...
 
Passage indexing is likely more important than you think
Passage indexing is likely more important than you thinkPassage indexing is likely more important than you think
Passage indexing is likely more important than you think
 
Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...
Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...
Zipfs Law & Zipfian Distribution in SEO - Pubcon Virtual Fall 2020 - Dawn And...
 
Google BERT - SMX London 2020 Virtual Conference
Google BERT - SMX London 2020 Virtual ConferenceGoogle BERT - SMX London 2020 Virtual Conference
Google BERT - SMX London 2020 Virtual Conference
 
Google BERT - What SEOs and Marketers Need to Know
Google BERT - What SEOs and Marketers Need to KnowGoogle BERT - What SEOs and Marketers Need to Know
Google BERT - What SEOs and Marketers Need to Know
 
Disambiguating Equiprobability in SEO Dawn Anderson Friends of Search 2020
Disambiguating Equiprobability in SEO Dawn Anderson Friends of Search 2020Disambiguating Equiprobability in SEO Dawn Anderson Friends of Search 2020
Disambiguating Equiprobability in SEO Dawn Anderson Friends of Search 2020
 
2019 Tech SEO Boost Dawn Anderson Contextual Recommender Search
2019 Tech SEO Boost Dawn Anderson Contextual Recommender Search2019 Tech SEO Boost Dawn Anderson Contextual Recommender Search
2019 Tech SEO Boost Dawn Anderson Contextual Recommender Search
 
Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...
Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...
Connecting The Worlds of Information Retrieval & SEO - Search solutions 2019 ...
 
Planning an SEO Strategy for a New Website - SMXL Milan 2019
Planning an SEO Strategy for a New Website - SMXL Milan 2019Planning an SEO Strategy for a New Website - SMXL Milan 2019
Planning an SEO Strategy for a New Website - SMXL Milan 2019
 
Google BERT and Family and the Natural Language Understanding Leaderboard Race
Google BERT and Family and the Natural Language Understanding Leaderboard RaceGoogle BERT and Family and the Natural Language Understanding Leaderboard Race
Google BERT and Family and the Natural Language Understanding Leaderboard Race
 
The User is the Query - The Rise of Predictive Proactive Search
The User is the Query - The Rise of Predictive Proactive SearchThe User is the Query - The Rise of Predictive Proactive Search
The User is the Query - The Rise of Predictive Proactive Search
 
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
 
Using topic modelling frameworks for NLP and semantic search
Using topic modelling frameworks for NLP and semantic searchUsing topic modelling frameworks for NLP and semantic search
Using topic modelling frameworks for NLP and semantic search
 
SEO in a Mobile First World
SEO in a Mobile First WorldSEO in a Mobile First World
SEO in a Mobile First World
 
Modern Ecommerce SEO
Modern Ecommerce SEOModern Ecommerce SEO
Modern Ecommerce SEO
 
Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...
Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...
Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...
 
The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...
The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...
The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...
 
Voice Search Challenges For Search and Information Retrieval and SEO
Voice Search Challenges For Search and Information Retrieval and SEOVoice Search Challenges For Search and Information Retrieval and SEO
Voice Search Challenges For Search and Information Retrieval and SEO
 

Último

GreenSEO April 2024: Join the Green Web Revolution
GreenSEO April 2024: Join the Green Web RevolutionGreenSEO April 2024: Join the Green Web Revolution
GreenSEO April 2024: Join the Green Web RevolutionWilliam Barnes
 
Uncover Insightful User Journey Secrets Using GA4 Reports
Uncover Insightful User Journey Secrets Using GA4 ReportsUncover Insightful User Journey Secrets Using GA4 Reports
Uncover Insightful User Journey Secrets Using GA4 ReportsVWO
 
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent Kubie
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent KubieBeyond Resumes_ How Volunteering Shapes Career Trajectories by Kent Kubie
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent KubieKent Kubie
 
Google 3rd-Party Cookie Deprecation [Update] + 5 Best Strategies
Google 3rd-Party Cookie Deprecation [Update] + 5 Best StrategiesGoogle 3rd-Party Cookie Deprecation [Update] + 5 Best Strategies
Google 3rd-Party Cookie Deprecation [Update] + 5 Best StrategiesSearch Engine Journal
 
Local SEO Domination: Put your business at the forefront of local searches!
Local SEO Domination:  Put your business at the forefront of local searches!Local SEO Domination:  Put your business at the forefront of local searches!
Local SEO Domination: Put your business at the forefront of local searches!dstvtechnician
 
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort ServiceDelhi Call girls
 
How to Leverage Behavioral Science Insights for Direct Mail Success
How to Leverage Behavioral Science Insights for Direct Mail SuccessHow to Leverage Behavioral Science Insights for Direct Mail Success
How to Leverage Behavioral Science Insights for Direct Mail SuccessAggregage
 
Call Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCRCall Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCRSapana Sha
 
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO Success
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO SuccessBrighton SEO April 2024 - The Good, the Bad & the Ugly of SEO Success
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO SuccessVarn
 
Brand experience Dream Center Peoria Presentation.pdf
Brand experience Dream Center Peoria Presentation.pdfBrand experience Dream Center Peoria Presentation.pdf
Brand experience Dream Center Peoria Presentation.pdftbatkhuu1
 
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024Richard Ingilby
 
9654467111 Call Girls In Mahipalpur Women Seeking Men
9654467111 Call Girls In Mahipalpur Women Seeking Men9654467111 Call Girls In Mahipalpur Women Seeking Men
9654467111 Call Girls In Mahipalpur Women Seeking MenSapana Sha
 
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdf
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdfTOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdf
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdfasiyahanif9977
 
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...How videos can elevate your Google rankings and improve your EEAT - Benjamin ...
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...Benjamin Szturmaj
 
Branding strategies of new company .pptx
Branding strategies of new company .pptxBranding strategies of new company .pptx
Branding strategies of new company .pptxVikasTiwari846641
 
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 

Último (20)

Creator Influencer Strategy Master Class - Corinne Rose Guirgis
Creator Influencer Strategy Master Class - Corinne Rose GuirgisCreator Influencer Strategy Master Class - Corinne Rose Guirgis
Creator Influencer Strategy Master Class - Corinne Rose Guirgis
 
GreenSEO April 2024: Join the Green Web Revolution
GreenSEO April 2024: Join the Green Web RevolutionGreenSEO April 2024: Join the Green Web Revolution
GreenSEO April 2024: Join the Green Web Revolution
 
Uncover Insightful User Journey Secrets Using GA4 Reports
Uncover Insightful User Journey Secrets Using GA4 ReportsUncover Insightful User Journey Secrets Using GA4 Reports
Uncover Insightful User Journey Secrets Using GA4 Reports
 
Turn Digital Reputation Threats into Offense Tactics - Daniel Lemin
Turn Digital Reputation Threats into Offense Tactics - Daniel LeminTurn Digital Reputation Threats into Offense Tactics - Daniel Lemin
Turn Digital Reputation Threats into Offense Tactics - Daniel Lemin
 
BUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAIL
BUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAILBUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAIL
BUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAIL
 
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent Kubie
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent KubieBeyond Resumes_ How Volunteering Shapes Career Trajectories by Kent Kubie
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent Kubie
 
Google 3rd-Party Cookie Deprecation [Update] + 5 Best Strategies
Google 3rd-Party Cookie Deprecation [Update] + 5 Best StrategiesGoogle 3rd-Party Cookie Deprecation [Update] + 5 Best Strategies
Google 3rd-Party Cookie Deprecation [Update] + 5 Best Strategies
 
Brand Strategy Master Class - Juntae DeLane
Brand Strategy Master Class - Juntae DeLaneBrand Strategy Master Class - Juntae DeLane
Brand Strategy Master Class - Juntae DeLane
 
Local SEO Domination: Put your business at the forefront of local searches!
Local SEO Domination:  Put your business at the forefront of local searches!Local SEO Domination:  Put your business at the forefront of local searches!
Local SEO Domination: Put your business at the forefront of local searches!
 
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort Service
 
How to Leverage Behavioral Science Insights for Direct Mail Success
How to Leverage Behavioral Science Insights for Direct Mail SuccessHow to Leverage Behavioral Science Insights for Direct Mail Success
How to Leverage Behavioral Science Insights for Direct Mail Success
 
Call Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCRCall Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCR
 
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO Success
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO SuccessBrighton SEO April 2024 - The Good, the Bad & the Ugly of SEO Success
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO Success
 
Brand experience Dream Center Peoria Presentation.pdf
Brand experience Dream Center Peoria Presentation.pdfBrand experience Dream Center Peoria Presentation.pdf
Brand experience Dream Center Peoria Presentation.pdf
 
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
 
9654467111 Call Girls In Mahipalpur Women Seeking Men
9654467111 Call Girls In Mahipalpur Women Seeking Men9654467111 Call Girls In Mahipalpur Women Seeking Men
9654467111 Call Girls In Mahipalpur Women Seeking Men
 
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdf
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdfTOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdf
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdf
 
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...How videos can elevate your Google rankings and improve your EEAT - Benjamin ...
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...
 
Branding strategies of new company .pptx
Branding strategies of new company .pptxBranding strategies of new company .pptx
Branding strategies of new company .pptx
 
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
 

Technical SEO - Gone is Never Gone - Fixing Generational Cruft and Technical Debt in SEO - Big Digital Adelaide 2017

  • 1. @dawnieando #BigDigitalADL Generational  Cruft  &  Technical  Debt  in   SEOGONE  IS   NEVER  GONE Dawn  Anderson  @  dawnieando
  • 2. @dawnieando #BigDigitalADL A New Beginning § “A  new  website  will  solve  ALL  our   problems” § “Let’s  start  again” § “We’ll  just  migrate…  and  redirect   everything”
  • 3. @dawnieando #BigDigitalADL 404 Page Not Found § “Of  course,  we  won’t   redirect  everything…” § “Not  everything  will   be  worth  redirecting”
  • 4. @dawnieando #BigDigitalADL 410 Gone § “Some,  we’ll  just  kill   off  with  a  410…” § “Then  the  URLs  will   be  gone”
  • 5. @dawnieando #BigDigitalADL But in Reality…it Still Exists
  • 6. @dawnieando #BigDigitalADL Because…Web Crawler System’s History Logs SEARCH  ENGINES  HAVE  A    BIG  MEMORY  AND  A  LOT  OF   STORAGE
  • 8. @dawnieando #BigDigitalADL Web Crawler System GOOGLE  NEVER  FORGETS The  history  logs  play  a   role  in  deciding  when   every  URL  that  was  EVER   discovered  gets  visited   again
  • 9. @dawnieando #BigDigitalADL History Log Records Include: • URL  fingerprint • Timestamp  (last  crawl  or  download   attempt) • Crawl  status  (success  or  error)  (Response   code) • Content  checksum  (binary  code) • Source  ID  (accessed  from  cache  or   downloaded) • Segment  identifier  (Crawl  segment  assigned   to??) • Page  importance  (a  measure  of  importance   assigned  to  the  URL) May  be   calculated  by   identifying   historical   importance   scores  based  on   past  X  number  of   crawls
  • 10. @dawnieando #BigDigitalADL Gone Is Never Gone “We  knew  there  was  content   there  at  some  point  so  we   just  swing  by  every  now  and   then  to  see  if  anything  came   back”  (John  Mueller,  2016)
  • 12. @dawnieando #BigDigitalADL The Generational ’Snail Trail’ • Old  XML  sitemaps • Redirects  drop  away  on  old  site   .htaccess • DNS  issues • People  link  to  old  site  but  wrong   protocol • Old  sites  not  verified  in  GSC • Not  all  protocols  redirecting Leaving  it’s   slithery     footprint
  • 13. @dawnieando #BigDigitalADL The Generational ’Snail Trail’ • All  eating  away  at   Googlebot’s attention  on   your  server’s  IP WATCH  OUT  FOR  THE  SNAIL  TRAIL  &   GENERATIONAL  CRUFT
  • 14. @dawnieando #BigDigitalADL The Slow Page Evolution of Near Duplicates In  a  study  over  11  weeks  Denis  Fetterly and   Mark  Najork found  that  near-­‐duplicate  pages   rarely  change  and  that  they  are  still  near-­‐ duplicates  of  each  other  10  weeks  later.   Therefore  once  identified  their  download   priority  may  be  reduced  so  that  resources  may   be  used  more  efficiently  /  productively   elsewhere (Fetterly &  Najork,  2003) Fetterly &   Najork,   2003
  • 15. @dawnieando #BigDigitalADL ‘Transitive’?? Transitive  -­‐ A  ==  B  +  B  ==  C  then  A  ==  C THEORY:  Maybe  for  some  types  of   content  more  than  others  – e.g.   ecommerce/directories  but  not  news THEORY  ALERT  !!!!!!!
  • 16. @dawnieando #BigDigitalADL DUSTBUSTER & DUST CRAWLING RULES DO  NOT   CRAWL  IN   THE  DUST BUILDS   ‘HINTS’  ON   WHAT  NOT   TO  CRAWL
  • 17. @dawnieando #BigDigitalADL ‘Sampling’in Crawling for Efficiency ‘SMALL  TEST  VISITS  TO  A  SITE  TO   UNDERSTAND  WHETHER  IT  IS  WORTH   CRAWLING’
  • 18. @dawnieando #BigDigitalADL Every Site Will Have Its Own Crawling Rules UNSURE  AS  TO  WHETHER   THIS  IS  BEING  USED DUSTBUSTER   CRAWLING RULES
  • 19. @dawnieando #BigDigitalADL Popular CMS ’Rule Patterns’ (URL Parameters) ALL  WILL  HAVE  COMMON   CANONICALIZATION  PATTERNS  WHICH   CAN  BE  LEARNED
  • 20. @dawnieando #BigDigitalADL CRAWLING RULES BUILT OVER TIME Crawl  Frequency  Patterns No  two  sites  will  have  the  same  crawl   schedules  or  rules  built Moving  from  one  CMS  to  another  may   mean  that  different  parameters  are   created.    New  parameters  =  new  rules  
  • 21. @dawnieando #BigDigitalADL Every Version of Your Past Ecommerce Sites “Exponentially   multiplicative   URLs” Had  potential  to  spew…  at  some  point…
  • 22. @dawnieando #BigDigitalADL URLs Take Their Place in Crawling Queues The  Queue  Gets  Long  &   Congested
  • 23. @dawnieando #BigDigitalADL YOU INHERITED SEO TECHNICAL DEBT • Previous  content  /  link  manual  actions • Previous  algorithmic  suppressions • Past  infinite  loops • “We’ll  SEO  it  after  launch” • “SEO  is  dead…  so  we  won’t  optimise” • Dodgy  URL  parameters • Misconfigured  URL  parameters • Old  URL  crawling  ‘rules  /  hints’
  • 25. @dawnieando #BigDigitalADL IT WASN’T ME – PASSING THE BUCK
  • 26. @dawnieando #BigDigitalADL … with Interest AT  SOME  POINT  IT   MUST  BE  REPAID
  • 27. @dawnieando #BigDigitalADL GENERATIONAL CRUFT EVERY  SINGLE  TIME  YOU  MIGRATE,  CHANGE  DESIGN,  REDIRECT,  REINVENT  A  SITE  /  URL A  CLEAN  START REDIRECTIONS ANOTHER  STRUCTURE FIRST  SITE   STRUCTURE NEW  CRAWLING  ‘RULES’   BUILT CRAWLING   ‘RULES’  BUILT EVERYTHING   IS  ‘200  OK’ MORE  URLs MIXED  RESPONSE  CODES REDIRECTIONS ‘FUZZINESS’  IS  EMERGING NEW  CRAWLING  ‘RULES’  BUILT MORE  URLs REDIRECT  CHAINS  &  MIXED   RESPONSE  CODES NEW  SEO’s  DON’T   KNOW  THE  ‘HISTORY’ TARGET  URLs  NOW  ‘VERY  FUZZY’
  • 28. @dawnieando #BigDigitalADL Aged ‘Patchwork Quilt’Sites A  LITTLE  BIT  OF  THIS  CMS  AND  A   LITTLE  BIT  OF  THAT  CMS MANY  HISTORICAL  PARAMETERS   CREATED
  • 29. @dawnieando #BigDigitalADL ’Fuzzy’ URL Targets with Each Site Generation EVERYTHING  GETS   A  BIT  BLURRED ‘Which  is  the  target  URL   again?
  • 30. @dawnieando #BigDigitalADL Time Seems To Fly… The Older You Get Your  new  site  URL  is  just   one  of  very  many  historical   URLs  on  your  IP  to  be   visited  periodically A  tiny  fish  in  a  very   big  URL  pond  queue
  • 31. @dawnieando #BigDigitalADL The Great 302s Pass PageRank Debate
  • 32. @dawnieando #BigDigitalADL SOLUTION - THE BELOVED CANONICAL § 30X  redirects § Canonical  tag § Href lang § HTTPS  protocol § Global  canonicalization   rules In  ’ALL’  its  forms
  • 34. @dawnieando #BigDigitalADL Advanced Technical SEO? 50%  of  SEOs  surveyed   considered;; “CANONICALIZATION   IS  ADVANCED   TECHNICAL  SEO”
  • 35. @dawnieando #BigDigitalADL Oh Yeah – Canonicalization is Easy 76%  of  SEOs  surveyed   considered;; “CANONICALIZATION   IS  AN  EASY  CONCEPT   TO  UNDERSTAND”
  • 36. @dawnieando #BigDigitalADL REL NEXT REL PREV is NOT Canonicalization 47%  of  SEOs   categorizing  themselves   as  ‘TECHNICAL  SEO’s   considered;; “REL=NEXT  /  REL  =   PREV”  IS  A  FORM  OF   CANONICALIZATION
  • 37. @dawnieando #BigDigitalADL On Redirections as Canonicalization Forms Lots  were  unsure  about;; “301s  and  302s  are   BOTH  forms  of   canonicalization”
  • 38. @dawnieando #BigDigitalADL On Href Lang as Canonicalization Only  64%  of  ’Technical SEOs’  thought  HRef Lang  was  a  form  of Canonicalization IT  IS
  • 39. @dawnieando #BigDigitalADL URL Parameter Handling is Your Friend Help  Google  Build  ‘Crawling   Rules’  for  your  site  rather   than  wasting  time  on   ‘sampling’  and  giving  a  bad   impression GIVE  HELP  AND   GUIDANCE  WITH  THE   CRAWL  RULE  AND   HINT  BUILDING
  • 40. @dawnieando #BigDigitalADL SOLUTION - Understand URL Parameters ACTIVE  PARAMETERS  ==  CHANGE  THE  CONTENT  ON   YOUR  PAGE (e.g.  sort,  filter,  translate,  paginate,  specify) PASSIVE  PARAMETERS  ==  DO  NOT  CHANGE  THE   CONTENT  ON  YOUR  PAGE (Often  used  for  tracking)    (ALIAS:  REPRESENTATIVE)
  • 41. @dawnieando #BigDigitalADL ACTIVE Parameters (CHANGE CONTENT) SORT  ==  Sorts  dynamic  items  and  reorders  in  descending  /  ascending  price  /   popularity  /  added NARROWING  ==  Filters  dynamically  added  items  down  to  include  only   features  &  attributes  in  a  chosen  consideration  set SPECIFYING  ==  Identifies  a  particular  dynamically  variable  populated   content  set  within  a  site  section  (e.g.  store=women) TRANSLATING  ==  Indicates  a  language  driven  translation  URL  (e.g.  lang=fr) PAGINATING  ==  Indicates  a  paginated  display  of  long  content  (e.g.  page=2)
  • 42. @dawnieando #BigDigitalADL Understand How URLs with Multiple Parameters Are Handled The  most  restrictive  parameter  blocked  overrules   lesser  restrictions
  • 43. @dawnieando #BigDigitalADL Examples of Multiple Parameter Handling KNOW  THE   RULES http://www.example.com?shopping-­‐category=DVD-­‐movies&sort-­‐ by=production-­‐year&sort-­‐order=asc WILL  BE  CRAWLED http://www.example.com?shopping-­‐category=shoes&sort-­‐by=size&sort-­‐ order=asc WILL  NOT  BE  CRAWLED  (production-­‐year  blocks)
  • 44. @dawnieando #BigDigitalADL Help Googlebot Get Round its Shopping List OPEN  MORE  CHECKOUTS WIDEN  THE  AISLES MAKE  THINGS  EASY  TO  FIND DON’T  CONFUSE   GOOGLEBOT HELP  FILL  THE  TROLLEY   QUICKLY SPEED,  SPEED,  SPEED
  • 45. @dawnieando #BigDigitalADL SOLUTION - XML SitemapsAre Your Friend… (Strong Foundations) They  help  to   pass   ‘importance’   signals  within   a  site But…  never   leave  them  to   just   autogenerate without   periodically   checking ‘The   foundations’   underneath  a   site
  • 46. @dawnieando #BigDigitalADL Validate & Retain in GSC ALL Past Domains & Past Site Versions (Protocols (HTTPS / HTTP) THERE  MAY  STILL  BE  UNDETECTED  ACTIVITY  GOING  ON  THERE
  • 47. @dawnieando #BigDigitalADL Server Log FileAnalysis is Your Friend… You’ll  be  surprised  by  what  you  find Find  out  what  Googlebot is   visiting  and  when  (how   often)  and  whether  it   should  be  visiting  it  at  all
  • 48. @dawnieando #BigDigitalADL SOLUTION - Save & Grow The URLs Not  EVERYTHING  is   worthy  of  its  own  URL VARIANTS STEMMINGS PLURALS RANDOM  TAGS LONG,  LONG,  LONG   TAIL  PARAMETERS
  • 49. @dawnieando #BigDigitalADL SOLUTION - Save & Grow The URLs A  URL  is  like  a   fine  wine Maturing  over   time
  • 50. @dawnieando #BigDigitalADL Pass Strong Clues - Highly Relevant New Structure STRONG SEMANTICS
  • 51. @dawnieando #BigDigitalADL Wiki Page Redirects https://dbpedia.org/sparql Wikipedia   Redirects thesaurus.com OR  A  GOOD  OLD  FASHIONED  THESAURUS
  • 52. @dawnieando #BigDigitalADL Increase ‘Importance’ quickly of target URLs • Internal  link  optimization • Canonicalise to  (if  relevant) • Strengthen  up  importance  signals • Inclusion  in  front  facing  and  XML   sitemaps • Improve  the  content  &  keep  it   updated • 301  redirect  to  (if  relevant   redundant  content)
  • 53. @dawnieando #BigDigitalADL Reduce ‘Importance’ quickly of old URLs • Internal  link  unoptimization • 410 • Dig  out  URLs  with  links  to  them • Orphan  URLs • Canonicals  to  HTTPs • Exclusion  from  XML  sitemaps  (even   old  ones  on  the  server) • Strip  out  content
  • 54. @dawnieando #BigDigitalADL 304 IF MODIFIED HEADERS ONLY  DOWNLOAD  IF  THE   CONTENT  CHECKSUM  HAS   CHANGED
  • 56. @dawnieando #BigDigitalADL SOLUTION – Be Careful About Creating New Dynamic Parameters QUEUEING…  AGAIN Waiting  for  good  URLs  to  be   visited…  AGAIN
  • 57. @dawnieando #BigDigitalADL REVISIT PAST .HTACCESS FILES Can  you  rewrite  the  rules  to  be  more   efficient  or  cut  out  some  old  rules  still   firing  unnecessarily?
  • 58. @dawnieando #BigDigitalADL SOLUTION - NEVER TRY TO ‘OUTRUN’ GOOGLEBOT
  • 59. @dawnieando #BigDigitalADL You Have a Shiny New Site… So What? You  may  still  have… GENERATIONAL  CRUFT  &  TECHNICAL   DEBT  TO  PAY  OFF GONE  IS  NEVER  GONE
  • 60. @dawnieando #BigDigitalADL REMEMBER ”Gone  is   Never  Gone” “Google  Has  A   Big  Memory” Dawn  Anderson  @  dawnieando THANK  YOU
  • 61. @dawnieando #BigDigitalADL Sources & References • https://patentimages.storage.googleapis.com/US8042112B1/US08042112-­‐ 20111018-­‐D00000.png • Randall,  K.H.,  Google  Inc.,  2010. Scheduler  for  search  engine  crawler.  U.S.  Patent   7,725,452.