How to Do an SEO Log File Analysis [Template Included]

Log files have been receiving increasing recognition from technical SEOs over the past five years, and for a good reason.

They’re the most trustworthy source of information to understand the URLs that search engines have crawled, which can be critical information to help diagnose problems with technical SEO.

Google itself recognizes their importance, releasing new features in Google Search Console and making it easy to see samples of data that would previously only be available by analyzing logs.

In addition, Google Search Advocate John Mueller has publicly stated how much good information log files hold.

@glenngabe Log files are so underrated, so much good information in them.

— 🦙 johnmu.xml (personal) 🦙 (@JohnMu) April 5, 2016

With all this hype around the data in log files, you may want to understand logs better, how to analyze them, and whether the sites you’re working on will benefit from them.

This article will answer all of that and more. Here’s what we’ll be discussing:

A server log file is a file created and updated by a server that records the activities it has performed. A popular server log file is an access log file, which holds a history of HTTP requests to the server (by both users and bots).

When a non-developer mentions a log file, access logs are the ones they’ll usually be referring to.

Developers, however, find themselves spending more time looking at error logs, which report issues encountered by the server.

The above is important: If you request logs from a developer, the first thing they’ll ask is, “Which ones?”

Therefore, always be specific with log file requests. If you want logs to analyze crawling, ask for access logs.

Access log files contain lots of information about each request made to the server, such as the following:

IP addresses
User agents
URL path
Timestamps (when the bot/browser made the request)
Request type (GET or POST)
HTTP status codes

What servers include in access logs varies by the server type and sometimes what developers have configured the server to store in log files. Common formats for log files include the following:

Apache format – This is used by Nginx and Apache servers.
W3C format – This is used by Microsoft IIS servers.
ELB format – This is used by Amazon Elastic Load Balancing.
Custom formats – Many servers support outputting a custom log format.

Other forms exist, but these are the main ones you’ll encounter.

Now that we’ve got a basic understanding of log files, let’s see how they benefit SEO.

Here are some key ways:

Crawl monitoring – You can see the URLs search engines crawl and use this to spot crawler traps, look out for crawl budget wastage, or better understand how quickly content changes are picked up.
Status code reporting – This is particularly useful for prioritizing fixing errors. Rather than knowing you’ve got a 404, you can see precisely how many times a user/search engine is visiting the 404 URL.
Trends analysis – By monitoring crawling over time to a URL, page type/site section, or your entire site, you can spot changes and investigate potential causes.
Orphan page discovery – You can cross-analyze data from log files and a site crawl you run yourself to discover orphan pages.

All sites will benefit from log file analysis to some degree, but the amount of benefit varies massively depending on site size.

This is as log files primarily benefit sites by helping you better manage crawling. Google itself states managing the crawl budget is something larger-scale or frequently changing sites will benefit from.

2-google-recommendations-1 How to Do an SEO Log File Analysis [Template Included]

The same is true for log file analysis.

For example, smaller sites can likely use the “Crawl stats” data provided in Google Search Console and receive all of the benefits mentioned above—without ever needing to touch a log file.

3-crawl-stats-1 How to Do an SEO Log File Analysis [Template Included]

Yes, Google won’t provide you with all URLs crawled (like with log files), and the trends analysis is limited to three months of data.

However, smaller sites that change infrequently also need less ongoing technical SEO. It’ll likely suffice to have a site auditor discover and diagnose issues.

For example, a cross-analysis from a site crawler, XML sitemaps, Google Analytics, and Google Search Console will likely discover all orphan pages.

You can also use a site auditor to discover error status codes from internal links.

There are a few key reasons I’m pointing this out:

Access log files aren’t easy to get a hold of (more on this next).
For small sites that change infrequently, the benefit of log files isn’t as much, meaning SEO focuses will likely go elsewhere.

In most cases, to analyze log files, you’ll first have to request access to log files from a developer.

The developer is then likely going to have a few issues, which they’ll bring to your attention. These include:

Partial data – Log files can include partial data scattered across multiple servers. This usually happens when developers use various servers, such as an origin server, load balancers, and a CDN. Getting an accurate picture of all logs will likely mean compiling the access logs from all servers.
File size – Access log files for high-traffic sites can end up in terabytes, if not petabytes, making them hard to transfer.
Privacy/compliance – Log files include user IP addresses that are personally identifiable information (PII). User information may need removing before it can be shared with you.
Storage history – Due to file size, developers may have configured access logs to be stored for a few days only, making them not useful for spotting trends and issues.

These issues will bring to question whether storing, merging, filtering, and transferring log files are worth the dev effort, especially if developers already have a long list of priorities (which is often the case).

Developers will likely put the onus on the SEO to explain/build a case for why developers should invest time in this, which you will need to prioritize among other SEO focuses.

These issues are precisely why log file analysis doesn’t happen frequently.

Log files you receive from developers are also often formatted in unsupported ways by popular log file analysis tools, making analysis more difficult.

Thankfully, there are software solutions that simplify this process. My favorite is Logflare, a Cloudflare app that can store log files in a BigQuery database that you own.

Now it’s time to start analyzing your logs.

I’m going to show you how to do this in the context of Logflare specifically; however, the tips on how to use log data will work with any logs.

The template I’ll share shortly also works with any logs. You’ll just need to make sure the columns in the data sheets match up.

1. Start by setting up Logflare (optional)

Logflare is simple to set up. And with the BigQuery integration, it stores data long term. You’ll own the data, making it easily accessible for everyone.

There’s one difficulty. You need to swap out your domain name servers to use Cloudflare ones and manage your DNS there.

For most, this is fine. However, if you’re working with a more enterprise-level site, it’s unlikely you can convince the server infrastructure team to change the name servers to simplify log analysis.

I won’t go through every step on how to get Logflare working. But to get started, all you need to do is head to the Cloudflare Apps part of your dashboard.

5-logflare-1 How to Do an SEO Log File Analysis [Template Included]

6-validating-googlebot-manually How to Do an SEO Log File Analysis [Template Included]

RECOMMENDED POSTS

Starting Now: The Digital Marketing Success Plan

July 25, 2024 No Comments

analytics-backlinko-organic-search-june-2024

How to Do SEO for Your New Website [10-Step Guide]

July 23, 2024 No Comments

How to Combine SEO and Content Marketing (The Ahrefs’ Way)

July 22, 2024 No Comments

Marketing Tips You Need

Client Reviews Tell The Tale.

Nicole NoblesApril 18, 2024

Dan was a delight to work with. I needed a few headshots taken for my LinkedIn profile and Dan provided the easiest and most comfortable experience using state-of-the art equipment in a very professional setting. Also, the turn-around time on results was quick and I felt completely engaged and satisfied during the entire process. I highly recommend his services.Donny RitcharoenDecember 19, 2023

I got headshots taken and they turned out so well! The lighting was amazing.Tessa ChanMay 30, 2023

We used Appture to build a lodging website, and they were awesome! Dan went above and beyond to show us the functions and make all of our changes. Appture is our go to for web design from now on!Abigail HaleOctober 26, 2022

Appture knows their business and will go the extra mile for their customers. They do high quality work and provide great ongoing support.Chris McCorkindaleMay 24, 2022

Anita CauthornMay 24, 2022

It’s so rare in these times to find one man with so much wow factor and more rare to find men with similar interest and passion in their life journey as myself . Dan Elliott has been introduced to many in what is now considered as the Terror Dome , a place where many dreams are not deferred they are detoured to routes that lead to dead ends , he comes in full of optimism so infectious that he, maybe with out knowing is energizing those who have ventured where others would fear going with just the right jolt to forge on in the way of helping fallen humanity … His various fields of expertise has helped many in my region and I can only imagine the number he has effected beyond those I know … from day one I knew “ this was a man of kindred spirit “ Dan Elliott is a Gem and adds glimmer to things he touches … I’m a Witness ….and eternally grateful….L.Rashaan RichMay 21, 2022

Dan and his group are highly capable and knowledgeable. They work fast and get the job done. I highly recommend Appture.Justin FrankMarch 26, 2022

They are highly specialized in their work and constantly seek innovation.Ismail YenigulMarch 14, 2022

Dan is a marketing wizard. Honest, Experienced and a read deal. I am blessed to have him in my journey online :) Highly recommended.Sabbir HasanMarch 7, 2022

So much to say. Creative, Intelligent, Talented, Limitless, Affordable. It's amazing what these guys can do.Hack mackMay 17, 2019

We'd used some other agencies before, but man, they simply knocked us all over. After being in business for 30 years, I wonder how much more business we'd be doing if we'd hired them earlier.Rebecca HoneaMay 17, 2019

How to Do an SEO Log File Analysis [Template Included]

1. Start by setting up Logflare (optional)

2. Verify Googlebot

I’m not using Logflare

I’m using Cloudflare/Logflare

3. Extract data from log files

4. Add to Google Sheets

5. Add Ahrefs data

6. Check for status codes

7. Detect crawl budget wastage

8. Monitor important URLs

9. Find orphan URLs

10. Monitor crawling by directory

11. View Cloudflare cache ratios

12. Check which bots crawl your site the most

Final thoughts

RECOMMENDED POSTS

Starting Now: The Digital Marketing Success Plan

How to Do SEO for Your New Website [10-Step Guide]

Find Out More

Marketing Tips You Need

Keep In Touch

Client Reviews Tell The Tale.

ABOUT US

NEED DIRECTIONS?

Address: 6275 Plano Parkway, #500 Plano, TX 75093

Tel: (469) 808-0536

FEATURED SERVICES

REQUEST A QUOTE

RESULTS THAT AMAZE

How to Do an SEO Log File Analysis [Template Included]

First, what is a server log file?

How log files benefit SEO

How to access your log files

How to analyze your log files

1. Start by setting up Logflare (optional)

2. Verify Googlebot

I’m not using Logflare

I’m using Cloudflare/Logflare

3. Extract data from log files

4. Add to Google Sheets

5. Add Ahrefs data

6. Check for status codes

7. Detect crawl budget wastage

8. Monitor important URLs

9. Find orphan URLs

10. Monitor crawling by directory

11. View Cloudflare cache ratios

12. Check which bots crawl your site the most

Final thoughts

RECOMMENDED POSTS

Starting Now: The Digital Marketing Success Plan

How to Do SEO for Your New Website [10-Step Guide]

How to Combine SEO and Content Marketing (The Ahrefs’ Way)

Find Out More

Marketing Tips You Need

Keep In Touch

Client Reviews Tell The Tale.

ABOUT US

NEED DIRECTIONS?

Address: 6275 Plano Parkway, #500 Plano, TX 75093

Tel: (469) 808-0536

FEATURED SERVICES

REQUEST A QUOTE

RESULTS THAT AMAZE