Web crawler lookup

What is Googlebot?

Google-Extended is listed as a data collection crawler operated by Google. Use this crawler profile to identify user-agent tokens, operator signals, platform hints, and recommended handling.

Googlebot

Google · Data collection crawler

Version
Unknown
First seen
Sep 17, 2025
Confidence
Known user-agent token

Crawler tags

Data collection crawlerAI trainingAutonomousObeys robots.txt

Directory facts

AI model training
Listed as training
Acts on behalf of user
No, autonomous
Obeys directives
Yes, listed as obeying robots.txt

What is Googlebot?

Google-Extended is listed as a data collection crawler operated by Google. Googlebot is likely used by Google to collect or evaluate public web content for AI, language, or training systems.

Googlebot matched a known crawler token in the user-agent string, but this page alone does not prove IP ownership.

How to identify Googlebot in logs

Search server logs for Googlebot, Google-Extended. Matching those tokens is useful for discovery, but IP verification is still recommended before trusting the identity.

GooglebotGoogle-Extended

Uses standard Googlebot user agent strings

Use this as a web-crawler lookup reference for identifying how this user-agent presents itself in server logs. After you identify crawler traffic, run the WebMCP Compatibility Checker to confirm whether AI crawlers and agents can understand your site.

User-agent signals

Product tokens
Uses, standard, Googlebot, user, agent, strings
Documentation
None found in user-agent
Contact
None found
Platform
Unknown · Unknown
Browser profile
Unknown · Unknown
Browser-like UA
No
HTTP library
Unknown
Spoof risk
Medium

Questions answered by this crawler profile

What is Google-Extended?

Google-Extended is listed as a data collection crawler operated by Google. Googlebot is likely used by Google to collect or evaluate public web content for AI, language, or training systems.

Does Google-Extended train AI models?

Google-Extended is listed as being used to train AI or LLM systems.

Is Google-Extended user-triggered?

Google-Extended is listed as operating independently of a direct user action.

How do I identify Google-Extended in logs?

Search server logs for Googlebot, Google-Extended. Matching those tokens is useful for discovery, but IP verification is still recommended before trusting the identity.

Does Google-Extended respect robots.txt?

This directory entry lists the crawler as obeying robots.txt directives. Confirm behavior in your logs before relying on user-agent strings alone.