Brighton SEO presentation given by Martin Fennon, Head of SEO at Ayima. This talk covers crawling, indexation, and the tangible impact that reviewing these elements can have on performance.
4. Crawl → Index → Rank → Display
Search engines crawl websites to read your content. The content is stored in a giant database of the web. Based on algorithms, Google decides what keywords indexed pages can rank for. Indexed pages are then displayed in search results.
5. So, what is the relationship between
crawling & indexing?
6. A page is indexed by Google if it has been visited by the Google crawler, analyzed for content and meaning, and stored in the Google index.
11. Do:
● Split XML sitemaps on complex websites, limiting each to 10-20k URLs*
● Include pages deeper in the site structure
Don't:
● Include non-indexable pages
● Forget to submit your sitemaps via GSC
*https://ohgm.co.uk/an-alternative-approach-to-xml-sitemaps/
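The split-sitemap setup the slide describes might look like the following sitemap index file. The domain and file names are illustrative assumptions, not from the talk:

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- Sitemap index pointing at split sitemaps, each kept to 10-20k URLs -->
<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <sitemap>
    <loc>https://www.example.com/sitemap-categories.xml</loc>
  </sitemap>
  <sitemap>
    <loc>https://www.example.com/sitemap-products-1.xml</loc>
  </sitemap>
  <sitemap>
    <loc>https://www.example.com/sitemap-products-2.xml</loc>
  </sitemap>
</sitemapindex>
```

Only the index file then needs to be referenced in robots.txt and submitted via GSC; Google discovers the child sitemaps from it.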
12. Internal & external links
Google will crawl and discover URLs through internal & external links.
13. So, if a page is crawled by Google,
is it indexed?
14. ...not exactly.
Indexation can be dependent on a number of factors... although sometimes there's just no winning with Google.
16. What can impact indexation?
Linking: Can Google discover our new pages?
Rendering: Can Googlebot access and render the content?
Index directives: Is there a canonical tag to a different page, or a meta robots tag?
18. Problem: A large-scale e-commerce website was experiencing a continuous drop in performance and market share. They came to Ayima.
19. Performance overview
● Huge drops in performance over a prolonged period of time
● They could not find any direct cause
20. Don't be the agency/team that tries to deal with this problem with a "let's chuck everything at the wall and see what sticks" solution.
22. Pattern of keyword ranking drops
● Longer-tail keywords were most prone to large drops in 2020
● '!' highlights ranking URL changes
23. ...but what does this have to do
with crawling & indexing you ask?
24. Ranking URLs were moving from facet URLs to category pages. This suggests one of two things:
1. Google really hates our website
2. Google cannot access them
34. Robots.txt
Offers crawl directives: tells crawlers what to crawl and what not to crawl.
● Can manage categories and facet combinations en masse by limiting Google's crawling
● Does not remove URLs from Google's index
● Does not stop Google indexing the URLs
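In robots.txt terms, limiting the crawling of facet combinations might look like the following. The parameter and path patterns are made up for illustration; they are not from the case study:

```
# Block crawling of faceted URLs (illustrative patterns only)
User-agent: *
Disallow: /*?colour=
Disallow: /*?size=
Disallow: /*/facet/

# Note: a disallowed URL can still appear in Google's index
# if it is linked to from elsewhere - robots.txt is a crawl
# directive, not an index directive.
```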
35. Noindex
Manages indexation; helps exclude pages from Google's index.
● Removes pages from Google's index
● Google has to crawl the page to see the noindex tag
● Trying to re-index a page, or groups of pages, can take time
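On the page itself, the noindex directive is a standard meta robots tag in the `<head>`:

```html
<!-- Tells Google not to include this page in its index.
     Googlebot must still be allowed to crawl the page to see it. -->
<meta name="robots" content="noindex">
```

The same directive can also be sent as an `X-Robots-Tag: noindex` HTTP response header, which is useful for non-HTML resources such as PDFs.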
37. Utilisation of meta robots tags at scale will usually require CMS upgrades, but they will remove duplication and index bloat.
38. Our solution:
A CMS restructure to enable noindex, nofollow en masse, controlling indexation through a mixture of automated rules and manual overrides.
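The "automated rules plus manual overrides" approach could be sketched roughly as below. The thresholds, paths, and rule shape are illustrative assumptions, not Ayima's actual implementation:

```python
# Sketch of automated meta-robots rules with manual overrides.
# Hypothetical example: facet pages combining two or more facets are
# noindexed en masse, while named URLs can be overridden by hand.

# Manual overrides always win over the automated rule
# (paths and values here are invented for illustration).
MANUAL_OVERRIDES = {
    "/shoes/red/": "index, follow",
}

def meta_robots_for(path: str, facet_count: int) -> str:
    """Return the meta robots value to emit for a URL."""
    if path in MANUAL_OVERRIDES:
        return MANUAL_OVERRIDES[path]
    # Automated rule: deep facet combinations get noindexed
    if facet_count >= 2:
        return "noindex, nofollow"
    return "index, follow"

print(meta_robots_for("/shoes/", 0))             # index, follow
print(meta_robots_for("/shoes/red/size-9/", 2))  # noindex, nofollow
print(meta_robots_for("/shoes/red/", 1))         # index, follow (manual override)
```

The point of the split is operational: the automated rule controls index bloat at scale, while the override table lets the team keep individual high-traffic facet pages indexable without touching the rule.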
39. Our rationale:
We wanted to remove poor/low-quality URLs from Google's index, control indexing of new pages, and direct Google to pages with traffic opportunities.