
SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs


2009

Dominic Woodman
double back on this is so important - and you can do this
Dominic Woodman
focus here on those two things. being able to do it and why you should do it
Dominic Woodman
https://www.deepcrawl.com/knowledge/news/google-webmaster-hangout-notes-september-9th-2016/
Dominic Woodman
graph
Dominic Woodman
show disproportionate
Dominic Woodman
there's more detail here.
Dominic Woodman
too long
Dominic Woodman
mention mess
Dominic Woodman
raise hand and keep raised
Dominic Woodman
As a messy person, I can see that this is really just efficiency. The ironing board is next to the suitcase.
Dominic Woodman
I want you to raise your hand if you couldn't stand having the room you're seeing in your house. If you'd have to go over and do something about it. Wouldn't it be wonderful if any rooms in our houses looked that good?
Dominic Woodman
This is more likely. We're trying.
Dominic Woodman
one word
Dominic Woodman
move powerful to top possibly remove
Dominic Woodman
emphasise why you do want it.
Dominic Woodman
possibly cut all this content
Dominic Woodman
time poor
Dominic Woodman
angle of structured business question - or it's easy to fish
Dominic Woodman
little work shows up interesting insights
Dominic Woodman
try putting on other data
Dominic Woodman
show all 5
Dominic Woodman
lost it
Dominic Woodman
double check deep crawl can't do this
Dominic Woodman
gifs
Dominic Woodman
shows table
Dominic Woodman
don't talk about the concept of a query language
Dominic Woodman
remove
Dominic Woodman
move this to after BQ
Dominic Woodman
kill sentence explanation
Dominic Woodman
too many bullets
Dominic Woodman
windows has stopped working
Dominic Woodman
busy not lazy
Dominic Woodman
possibly rotate background
Dominic Woodman
remove this slide
Dominic Woodman
drop
Dominic Woodman
you might be missing logs
Dominic Woodman
site health
Dominic Woodman
should just be resources
Dominic Woodman
bible example
Dominic Woodman
15 words per line
Dominic Woodman
8700 bibles
Dominic Woodman
redo this section to mention possible use cases where you might use other things
Dominic Woodman
Move to after this section on tools
Dominic Woodman
Change of plan: move to the beginning, as to why query languages are great.
Dominic Woodman
Turn these into tables ticking off the other points
Page 2: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
Dominic Woodman
more pace change
Dominic Woodman
Let's travel back in time to 2009.
Page 3: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
Page 4: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

God it’s bad.

Dominic Woodman
And I had just got this truly terrible haircut. I say haircut, but it really was the lack of it that was so shocking.
Page 16: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

-$1.5 Billion

Dominic Woodman
larger pause
Dominic Woodman
there is a disconnect between what people say and how they behave - particularly if you're asking
Page 18: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Why hasn’t Google seen the changes on my page?

Page 19: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

How should I prioritise errors in Search Console?

Page 20: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Are my canonicals being respected?

Page 21: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Does Google think this page is important?

Page 22: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
Dominic Woodman
acknowledge we'll explain this
Page 28: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

PART 1: THE WHY

What can you do with logs?

PART 2: THE HOW

Getting logs

Analysing Logs

Processing Logs

Dominic Woodman
sell this as hard as possible - this is the biggest possible opportunity
Page 29: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
Page 30: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

What is a log?

Dominic Woodman
watch out for mentions of log files
Dominic Woodman
says log files too much
Page 31: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

What does a log look like?

123.65.150.10 - - [23/Aug/2010:03:50:59 +0000] "GET /my_homepage HTTP/1.1" 200 2262 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"

IP Address: 123.65.150.10
Timestamp: 23/Aug/2010:03:50:59 +0000
Request type: GET
Homepage (the page requested): /my_homepage
Protocol: HTTP/1.1
Status Code: 200
Size of the page (in bytes): 2262
User Agent: Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)

Dominic Woodman
remove this explanation
Dominic Woodman
possibility of things
Dominic Woodman
you don't need to know it all now. hammer in how easy
Dominic Woodman
factually incorrect
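
A minimal Python sketch of splitting a log line like the one on these slides into its named parts. The regular expression assumes the combined log format shown here; real server configurations can add, drop or reorder fields. The field names (ip, path, user_agent and so on) are illustrative and not from the talk.

```python
import re
from datetime import datetime

# Pattern for the log format shown on the slides (an assumption: your
# server may log different fields or a different order).
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<timestamp>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) (?P<protocol>[^"]+)" '
    r'(?P<status>\d{3}) (?P<size>\d+|-) '
    r'"(?P<referrer>[^"]*)" "(?P<user_agent>[^"]*)"'
)


def parse_log_line(line):
    """Return a dict of fields for one log line, or None if it does not match."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    fields = match.groupdict()
    # Convert the text fields that are really dates or numbers.
    fields["timestamp"] = datetime.strptime(
        fields["timestamp"], "%d/%b/%Y:%H:%M:%S %z"
    )
    fields["status"] = int(fields["status"])
    fields["size"] = 0 if fields["size"] == "-" else int(fields["size"])
    return fields


example = (
    '123.65.150.10 - - [23/Aug/2010:03:50:59 +0000] '
    '"GET /my_homepage HTTP/1.1" 200 2262 "-" '
    '"Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"'
)
record = parse_log_line(example)
print(record["ip"], record["path"], record["status"], record["size"])
```

Lines that do not fit the pattern come back as None, so they can be counted and inspected instead of silently dropped.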

Page 40: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

5 things

Page 41: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

1 Diagnose crawling & indexation issues


Page 42: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
Dominic Woodman
throwaway comments
Page 43: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
Dominic Woodman
how many of you
Page 44: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Five folders Googlebot crawled the most

[Chart: number of requests per folder]

Page 46: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

% of Organic sessions VS % of crawl budget

Sessions

Crawl budget
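
A rough sketch of how the comparison in this chart could be computed. The folder names and every number below are made up for illustration; in practice the sessions would come from analytics and the requests from the logs.

```python
# Hypothetical per-folder totals. None of these figures come from the talk.
organic_sessions = {"products": 6000, "blog": 3000, "search": 1000}
googlebot_requests = {"products": 2000, "blog": 1000, "search": 7000}


def shares(counts):
    """Convert raw counts into percentages of the total."""
    total = sum(counts.values())
    return {key: 100 * value / total for key, value in counts.items()}


session_share = shares(organic_sessions)
crawl_share = shares(googlebot_requests)

for folder in sorted(organic_sessions):
    print(
        f"{folder}: {session_share[folder]:.0f}% of organic sessions, "
        f"{crawl_share.get(folder, 0):.0f}% of crawl budget"
    )
```

A folder whose share of crawl is far above its share of sessions (the made-up "search" folder here) is a candidate for a closer look.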

Page 47: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

2 Prioritisation


Page 49: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

example.com/article

Dominic Woodman
dont mention it's an example - obvious
Page 50: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Prioritizing

1

Full

Print

Page 51: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

example.com/article/full

Page 52: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

example.com/article/print

Page 53: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Prioritizing

2

Page 54: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

example.com/article/pdf

Page 55: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Prioritizing

3

Dominic Woodman
fuzzy
Page 56: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Prioritizing

1

Full

Print

Page 57: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

3 Spot bugs & view site health


Page 58: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Delayed errors with a limit of 1000

Page 60: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

4 How important does Google see parts of your site?


Page 61: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

My SEO was as bad as my design

Dominic Woodman
but my hair was better
Dominic Woodman
zoom in on hair, cheap laugh
Page 62: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

But at least my hair was better

Page 63: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

teflsearch.com

Page 64: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

teflsearch.com/job-results

Page 65: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

teflsearch.com/job-results/country/china

Page 66: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

teflsearch.com/jobadvert3455

Page 67: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Average number of times Googlebot crawled a template

Page 68: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

1. teflsearch.com

2. teflsearch.com/job-results

3. teflsearch.com/job-results/country/china

4. teflsearch.com/job-advert3455

Page 70: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

teflsearch.com/job-results

Page 71: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Average number of times Googlebot crawled a template

35%

Dominic Woodman
label over 40% - perhaps extra slide
Dominic Woodman
new graph
Dominic Woodman
reverse graph order
Page 72: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

5 How fresh does it think your content is?


Dominic Woodman
show more screen shots
Page 73: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

bit.ly/moz-fresh

Page 74: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Average number of times a page template is crawled by Googlebot

Dominic Woodman
more detail on this
Dominic Woodman
more emphasis on this
Dominic Woodman
dotted red line
Dominic Woodman
make point of results
Dominic Woodman
clarity problem
Page 75: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

● Improve our internal linking
● Build trust with last modified date in sitemap

Page 81: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Talk to a developer and ask for information

Page 82: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Are all the logs in one place?

Page 83: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Hi x,

I’m {x} from {y} and we’ve been asked to do some log analysis to understand better how Google is behaving on the website, and I was hoping you could help with some questions about the log set-up (as well as with getting the logs!).

What we’d ideally like is 3-6 months of historical logs for the website. Our goal is to look at all the different pages search engines are crawling on our website, discover where they’re spending their time, the status code errors they’re finding etc.

There are also some things that are really helpful for us to know when getting logs.

Do the logs have any personal information in?
We’re just concerned about the various search crawler bots like Google and Bing, we don’t need any logs from users, so any logs with emails, or telephone numbers etc. can be removed.

Do you have any sort of caching which would create separate sets of logs?
Is there anything like Varnish running on the server, or a CDN, which might create logs in a different location to the rest of your server? If so then we will need those logs as well as just those from the server. (Although we’re only concerned about a CDN if it’s caching pages, or serving from the same hostname; if you’re just using Cloudflare for example to cache external images then we don’t need it).

Are there any sub parts of your site which log to a different place?
Have you got anything like an embedded Wordpress blog which logs to a different location? If so then we’ll need those logs as well.

Do you log hostname?
It’s really useful for us to be able to see hostname in the logs. By default a lot of common server logging set-ups don’t log hostname, so if it’s not turned on, then it would be very useful to have that turned on now for any future analysis.

Is there anything else we should know?

Best,
{x}

Email for a developer

Page 84: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

So we might have something that looks like this

Page 87: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

How should we analyse our logs?

Page 88: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
Dominic Woodman
possibly hammered too much
Page 89: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

BigQuery

Page 90: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
Dominic Woodman
also say why
Page 91: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

BigQuery

Page 92: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Google’s online database for data analysis.

Page 93: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

What do we want from analysing our logs?

1. Ask powerful questions
2. Repeatable
3. Scaleable
4. Combine with crawl data
5. Easy to set-up
6. Easy to learn

Dominic Woodman
quote how much it is
Page 95: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
Dominic Woodman
moar pause
Dominic Woodman
emphasise that once you've written it you can copy and paste
Dominic Woodman
change gif
Page 96: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
Dominic Woodman
use same book
Dominic Woodman
practice transition into this
Page 99: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

9,000,000 rows of data for 2 months.

400 - 800 queries

Page 102: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Format the logs so we can import them into BigQuery

Separate the Googlebot logs from all the other logs
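
One way these two steps could look in Python, as a sketch only and not the talk's own code. It keeps lines whose user agent mentions Googlebot and writes them out as CSV, a format BigQuery can import. The column names are assumptions, and because a user agent string can be faked, matching on it alone is a first pass and not proof that a request came from Google.

```python
import csv
import io
import re

# Assumes the log format shown earlier in the deck; adjust for your server.
LOG_PATTERN = re.compile(
    r'(\S+) \S+ \S+ \[([^\]]+)\] "(\S+) (\S+) ([^"]+)" '
    r'(\d{3}) (\d+|-) "([^"]*)" "([^"]*)"'
)
# Hypothetical column names for the table we will import.
COLUMNS = ["ip", "timestamp", "method", "path", "protocol",
           "status", "size", "referrer", "user_agent"]


def googlebot_rows(lines):
    """Yield column values for each line whose user agent mentions Googlebot."""
    for line in lines:
        match = LOG_PATTERN.match(line)
        if match and "Googlebot" in match.group(9):
            yield list(match.groups())


def write_csv(lines, out):
    """Write a header plus the Googlebot rows to `out`; return the row count."""
    writer = csv.writer(out)
    writer.writerow(COLUMNS)
    count = 0
    for row in googlebot_rows(lines):
        writer.writerow(row)
        count += 1
    return count


sample_lines = [
    '123.65.150.10 - - [23/Aug/2010:03:50:59 +0000] "GET /my_homepage HTTP/1.1" '
    '200 2262 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '98.76.54.32 - - [23/Aug/2010:03:51:02 +0000] "GET /about HTTP/1.1" '
    '200 512 "-" "Mozilla/5.0 (Windows NT 6.1)"',
]
buffer = io.StringIO()
written = write_csv(sample_lines, buffer)
print(written)
print(buffer.getvalue())
```

For real logs, open the log file and an output file in place of the sample list and the in-memory buffer.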

Page 103: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Screaming Frog Log Analyser

Code something

Page 104: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Screaming Frog Log Analyser

Dominic Woodman
remove the other one
Page 105: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs
Dominic Woodman
change to video slide
Page 106: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Code something

Page 107: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

bit.ly/logs-code

Page 109: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Our data in BQ

Page 110: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

We make sure we got what we wanted

Page 111: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

THE QUESTION: What is the total number of requests Googlebot makes each day to our site?

Page 112: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Our first SQL query

SELECT timestamp
FROM [mydata.log_analysis]

SELECT DATE(timestamp)
FROM [mydata.log_analysis]

SELECT DATE(timestamp) as date
FROM [mydata.log_analysis]

SELECT DATE(timestamp) as date, count(*)
FROM [mydata.log_analysis]

SELECT DATE(timestamp) as date, count(*)
FROM [mydata.log_analysis]
GROUP BY date

SELECT DATE(timestamp) as date, count(*) as number_of_requests
FROM [mydata.log_analysis]
GROUP BY date
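
The finished query can be tried locally without a BigQuery account. This sketch uses Python's built-in sqlite3 module as a stand-in, which is an assumption: SQLite syntax differs from BigQuery's, so the table name loses its square brackets and the alias is renamed to request_date. The three rows are made up.

```python
import sqlite3

# In-memory database standing in for the BigQuery table (an assumption).
connection = sqlite3.connect(":memory:")
connection.execute("CREATE TABLE log_analysis (timestamp TEXT, path TEXT)")
connection.executemany(
    "INSERT INTO log_analysis VALUES (?, ?)",
    [
        ("2010-08-23 03:50:59", "/my_homepage"),
        ("2010-08-23 09:12:10", "/article"),
        ("2010-08-24 01:02:03", "/my_homepage"),
    ],
)

# Same shape as the slide query: one row per day with a count of requests.
query = """
    SELECT DATE(timestamp) AS request_date, COUNT(*) AS number_of_requests
    FROM log_analysis
    GROUP BY request_date
    ORDER BY request_date
"""
daily_requests = connection.execute(query).fetchall()
for request_date, number_of_requests in daily_requests:
    print(request_date, number_of_requests)
```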

Page 122: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Comparing logs to GSC crawl volume

Number of requests

Dominic Woodman
put in similar slides
Page 123: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Run queries

Find something weird

Go look at crawl & website

Dominic Woodman
add visual interest
Dominic Woodman
icons
Page 124: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Our data in BQ

Page 125: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

1 Diagnose crawling & indexation issues

Page 126: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

2 Prioritisation

Page 127: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

3 Spot bugs & view site health

Page 128: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

4 How important does Google see parts of your site?

Page 129: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

5 How fresh does it think your content is?

Page 130: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

1 Diagnose crawling & indexation issues

4 How important does Google see parts of your site?

Dominic Woodman
loop back to beginning verbally
Page 131: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

What are the top 20 URLs crawled by Google over our logs?

Dominic Woodman
make tiny stories
Page 132: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Login is my top crawled page and then search?

Page 133: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

What are the top 20 page_path_1 folders crawled by Google over our logs?

Page 134: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Location folders are taking more than 70% of my budget
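
A small Python sketch of the two questions above: the most requested URLs and the most requested first-level folders. It assumes page_path_1 means the first segment of the URL path, and the paths are hypothetical.

```python
from collections import Counter

# Hypothetical paths requested by Googlebot, one entry per log line.
paths = [
    "/login", "/login", "/login",
    "/search?q=a", "/search?q=b",
    "/location/london", "/location/paris", "/location/rome", "/location/london",
    "/",
]


def first_folder(path):
    """Return the first path segment (page_path_1), ignoring any query string."""
    without_query = path.split("?", 1)[0]
    segments = [segment for segment in without_query.split("/") if segment]
    return segments[0] if segments else "(root)"


top_urls = Counter(paths).most_common(20)
folder_counts = Counter(first_folder(path) for path in paths)
top_folders = folder_counts.most_common(20)

total = sum(folder_counts.values())
for folder, count in top_folders:
    print(f"{folder}: {count} requests ({100 * count / total:.0f}% of crawl)")
```

Counting by folder can tell a different story from counting by URL: here the single most requested URL is /login, but the location folder takes the largest share overall.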

Page 135: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Getting data by the day

Page     Number of Googlebot Requests
page1    200,000
page2    120,000

Page 136: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Number of Googlebot requests day by day

Dominic Woodman
add more lines
Page 137: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

3 Spot bugs & view site health

Page 138: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

How many of each status code does Google find per day over our logs?
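
A sketch of this count in Python, assuming each Googlebot log line has already been reduced to a (date, status code) pair. The dates and codes are made up.

```python
from collections import Counter, defaultdict

# Hypothetical (date, status code) pairs taken from Googlebot log lines.
requests = [
    ("2016-10-01", 200), ("2016-10-01", 200), ("2016-10-01", 404),
    ("2016-10-02", 200), ("2016-10-02", 301), ("2016-10-02", 404),
    ("2016-10-02", 404),
]

# One Counter of status codes per day.
status_by_day = defaultdict(Counter)
for day, status in requests:
    status_by_day[day][status] += 1

for day in sorted(status_by_day):
    print(day, dict(status_by_day[day]))
```

Plotted day by day, a sudden jump in one status code is the kind of "something weird" worth checking against the site.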

Page 139: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Number of Googlebot requests day by day

Dominic Woodman
stories
Page 140: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

What are the most requested 404 URLs by Googlebot over the past 30 days?

Page 141: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Boy does it want that ad-tech snippet
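
A sketch of the 404 question in Python. The records, the paths (including the ad-tech snippet URL) and the reference date are all hypothetical.

```python
from collections import Counter
from datetime import date, timedelta

# Hypothetical (date, path, status code) records from Googlebot log lines.
records = [
    (date(2016, 10, 15), "/ad-tech/snippet.js", 404),
    (date(2016, 10, 16), "/ad-tech/snippet.js", 404),
    (date(2016, 10, 16), "/old-page", 404),
    (date(2016, 10, 16), "/my_homepage", 200),
    (date(2016, 8, 1), "/ancient-page", 404),
]


def top_404s(records, today, days=30, limit=20):
    """Most requested 404 paths within the last `days` days before `today`."""
    cutoff = today - timedelta(days=days)
    counts = Counter(
        path for day, path, status in records
        if status == 404 and day >= cutoff
    )
    return counts.most_common(limit)


result = top_404s(records, today=date(2016, 10, 17))
for path, count in result:
    print(path, count)
```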

Page 142: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

5 How fresh does it think your content is?

Page 143: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

How many times on average is each page in a page template crawled a day?
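
One possible reading of this question, sketched in Python: for each template, divide Googlebot requests by the number of distinct pages seen in that template and by the number of days in the logs. The template rules are hypothetical, loosely modelled on the teflsearch.com URLs earlier in the deck, and the requests are made up.

```python
import re
from collections import defaultdict

# Hypothetical template rules; the first matching rule wins.
TEMPLATE_RULES = [
    ("homepage", re.compile(r"^/$")),
    ("job results", re.compile(r"^/job-results(/|$)")),
    ("job advert", re.compile(r"^/job-?advert\d+$")),
]


def template_for(path):
    """Return the name of the first template whose pattern matches the path."""
    for name, pattern in TEMPLATE_RULES:
        if pattern.search(path):
            return name
    return "other"


def average_daily_crawls(requests):
    """Map template -> average crawls per page per day.

    `requests` is an iterable of (day, path) pairs. Only pages that appear
    in the logs are counted, so never-crawled pages are not in the average.
    """
    days = set()
    hits = defaultdict(int)
    pages = defaultdict(set)
    for day, path in requests:
        days.add(day)
        template = template_for(path)
        hits[template] += 1
        pages[template].add(path)
    return {t: hits[t] / (len(pages[t]) * len(days)) for t in hits}


requests = [
    ("2016-10-01", "/"), ("2016-10-01", "/"),
    ("2016-10-02", "/"), ("2016-10-02", "/"),
    ("2016-10-01", "/job-results"),
    ("2016-10-02", "/job-results/country/china"),
    ("2016-10-01", "/jobadvert3455"),
]
averages = average_daily_crawls(requests)
for template, value in sorted(averages.items()):
    print(f"{template}: {value:.2f} crawls per page per day")
```

To include pages Googlebot never requested, take the page counts from a site crawl instead of from the logs.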

Page 144: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

Average number of times a page template is crawled by Googlebot

Page 145: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

How long does it take for a page to be discovered after being published?
What are the top 20 combinations of page_path_1 & page_path_2 folders crawled by Google over the time period of our logs?
Which pages have requests from Googlebot, which don’t appear in our crawl?
What are the top non-canonical pages being crawled?
Which are the most crawled parameters on the website?
How often are the most visited parameters crawled each day?
Which directories have the most 301 & 404 error codes?
Which pages are crawled with parameters and without parameters?
Which pages are only partly downloaded?
How many hits does each section get, when the sections are classified in an external dataset?
What percentage of a directory was crawled over the past 30 days?
What are the total number of requests across two different time periods?

Dominic Woodman
put to multiple slides
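
Two of the questions above sketched in Python: which pages Googlebot requested that do not appear in our crawl, and how long a page takes to be discovered after being published. All paths, publish times and request times are hypothetical, and the function names are illustrative.

```python
from datetime import datetime


def pages_missing_from_crawl(logged_paths, crawled_paths):
    """Paths Googlebot requested that the site crawl never found."""
    return sorted(set(logged_paths) - set(crawled_paths))


def discovery_delays(published_at, googlebot_requests):
    """Map path -> time between publication and the first Googlebot request.

    `published_at` maps path -> datetime; `googlebot_requests` is an iterable
    of (datetime, path) pairs. Pages never requested are left out.
    """
    first_seen = {}
    for seen_at, path in googlebot_requests:
        if path not in first_seen or seen_at < first_seen[path]:
            first_seen[path] = seen_at
    return {
        path: first_seen[path] - published
        for path, published in published_at.items()
        if path in first_seen
    }


# Hypothetical data for illustration.
published_at = {
    "/blog/new-post": datetime(2016, 10, 1, 9, 0),
    "/blog/unseen-post": datetime(2016, 10, 2, 9, 0),
}
googlebot_requests = [
    (datetime(2016, 10, 1, 15, 0), "/blog/new-post"),
    (datetime(2016, 10, 3, 8, 0), "/blog/new-post"),
    (datetime(2016, 10, 1, 16, 0), "/old-campaign"),
]
crawled_paths = ["/blog/new-post", "/blog/unseen-post"]

missing = pages_missing_from_crawl(
    [path for _, path in googlebot_requests], crawled_paths
)
delays = discovery_delays(published_at, googlebot_requests)
print(missing)
print(delays)
```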

Page 156: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

That’s a lot of questions

Page 157: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

bit.ly/logs-resource

Page 161: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

In Summary

Page 162: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

This is the thing you’re probably not doing

Page 166: SearchLove London 2016 | Dom Woodman | How to Get Insight From Your Logs

bit.ly/logs-resource
@dom_woodman

Dominic Woodman
drive back down