What is Googlebot?
Google-Extended is listed as a data collection crawler operated by Google. Use this crawler profile to identify user-agent tokens, operator signals, platform hints, and recommended handling.
Googlebot
Google · Data collection crawler
- Version
- Unknown
- First seen
- Sep 17, 2025
- Confidence
- Known user-agent token
Crawler tags
Directory facts
- AI model training
- Listed as training
- Acts on behalf of user
- No, autonomous
- Obeys directives
- Yes, listed as obeying robots.txt
What is Googlebot?
Google-Extended is listed as a data collection crawler operated by Google. Googlebot is likely used by Google to collect or evaluate public web content for AI, language, or training systems.
Googlebot matched a known crawler token in the user-agent string, but this page alone does not prove IP ownership.
How to identify Googlebot in logs
Search server logs for Googlebot, Google-Extended. Matching those tokens is useful for discovery, but IP verification is still recommended before trusting the identity.
Uses standard Googlebot user agent strings
Use this as a web-crawler lookup reference for identifying how this user-agent presents itself in server logs. After you identify crawler traffic, run the WebMCP Compatibility Checker to confirm whether AI crawlers and agents can understand your site.
User-agent signals
- Product tokens
- Uses, standard, Googlebot, user, agent, strings
- Documentation
- None found in user-agent
- Contact
- None found
- Platform
- Unknown · Unknown
- Browser profile
- Unknown · Unknown
- Browser-like UA
- No
- HTTP library
- Unknown
- Spoof risk
- Medium
Questions answered by this crawler profile
What is Google-Extended?
Google-Extended is listed as a data collection crawler operated by Google. Googlebot is likely used by Google to collect or evaluate public web content for AI, language, or training systems.
Does Google-Extended train AI models?
Google-Extended is listed as being used to train AI or LLM systems.
Is Google-Extended user-triggered?
Google-Extended is listed as operating independently of a direct user action.
How do I identify Google-Extended in logs?
Search server logs for Googlebot, Google-Extended. Matching those tokens is useful for discovery, but IP verification is still recommended before trusting the identity.
Does Google-Extended respect robots.txt?
This directory entry lists the crawler as obeying robots.txt directives. Confirm behavior in your logs before relying on user-agent strings alone.
