CodeGPT
CodeGPT is an autonomous browser agent operated by Codegpt. Instead of scraping content, it actually uses your site: clicking buttons, filling forms, and completing checkouts, doing what a human user would do.
Traffic looks closer to a real user session than to a crawler. Pages are fully rendered, JavaScript runs, and interactions are sequential. The agent often runs on behalf of a paying customer who wants a specific outcome.
This is the leading edge of agentic commerce. The agents that buy your product, book your hotel room, or fill out your lead form on behalf of a human will live in this category.
See CodeGPT on your own site
Match the User-Agent header on incoming requests against the pattern below.
regex
UA strings can be spoofed; IP ownership is harder to fake. Codegpt has not published IP ranges yet, so for now the User-Agent match is the only signal, but if ranges appear, verifying the source IP against them gives higher confidence.
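If you want to wire that check into your own code, a minimal sketch looks like the following. The /codegpt/i pattern is an assumption, not a published string; swap in whatever you actually see in your access logs.

```ts
// Minimal sketch: does this request claim to be CodeGPT?
// The pattern is an assumption; replace it with the User-Agent string you
// actually observe in your access logs once the agent shows up.
const CODEGPT_UA = /codegpt/i;

export function claimsToBeCodeGPT(req: Request): boolean {
  const ua = req.headers.get("user-agent") ?? "";
  return CODEGPT_UA.test(ua);
}
```

Matching case-insensitively against a substring keeps the rule tolerant of minor version changes in the UA string; a match means "claims to be CodeGPT", nothing stronger.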
Renders JavaScript: Yes
IP verification: User-Agent only
Crawl frequency: Burst, user-driven
Honors robots.txt: Yes
Honors Crawl-delay: Varies
Should I let CodeGPT through?
In most cases, yes. Agentic commerce is growing fast. Blocking these agents means losing the customers they act on behalf of. If volume gets noisy, rate-limit it before you block it outright.
Does blocking CodeGPT affect my Google rankings?
No. CodeGPT acts on behalf of one specific user at a time, not on behalf of a search index, so classical SEO is unaffected. The trade-off is whether you want users who delegate tasks to Codegpt to be able to reach your site.
How do I confirm a request is really from CodeGPT?
Look at the User-Agent header in your access logs and match it against the strings listed above. Bear in mind that the User-Agent is easy to fake, so this check tells you "the traffic claims to be CodeGPT", not "the traffic is genuinely CodeGPT". If you need stronger guarantees, add a reverse-DNS check or wait for Codegpt to publish IP ranges.
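A forward-confirmed reverse-DNS check is only a few lines in Node. The sketch below assumes the operator's egress IPs will eventually resolve to hostnames under a domain like codegpt.co; that suffix is a guess, since Codegpt has not published verification hostnames.

```ts
import { promises as dns } from "node:dns";

// Sketch of forward-confirmed reverse DNS. The expected suffix is an
// assumption: Codegpt has not published verification hostnames or IP ranges.
const EXPECTED_SUFFIX = ".codegpt.co";

export async function looksLikeGenuineCodegpt(ip: string): Promise<boolean> {
  try {
    // 1. Reverse lookup: IP -> hostnames.
    const hostnames = await dns.reverse(ip);
    for (const host of hostnames) {
      if (!host.endsWith(EXPECTED_SUFFIX)) continue;
      // 2. Forward lookup: hostname -> IPs, and confirm the original IP is listed.
      const addrs = await dns.resolve(host);
      if (addrs.includes(ip)) return true;
    }
  } catch {
    // Lookup failures mean "cannot verify", not "definitely fake".
  }
  return false;
}
```

Run it offline against sampled log entries rather than inline on every request; DNS lookups are too slow to sit in the hot path.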
Can CodeGPT take actions on my site, like buying or signing up?
In principle yes. Agentic browsers can fill forms, click buttons, and complete transactions. That is the entire point. If your site leans on automation detection to prevent bots from acting, CodeGPT is a different threat model from a passive crawler and you should think about it separately.
What's the cleanest way to control CodeGPT?
Two layers. Robots.txt for the polite crawlers that read it, and rules at your CDN or edge for the ones that don't. Rankly's Agent Experience handles both from a single config, so you can allow, block, rate-limit, or serve a stripped-down version per bot. Agent Analytics handles the observation half so you know which bots are actually worth a rule.
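For the edge layer, a per-bot rule can be as small as the sketch below. It is a generic Worker-style handler with placeholder patterns and thresholds, not Rankly's actual config format.

```ts
// Sketch of a per-bot edge rule: allow, rate-limit, or block by User-Agent.
// The bot table and limits are placeholders, not a real config.
type BotAction = "allow" | "ratelimit" | "block";

const BOT_RULES: Array<{ pattern: RegExp; action: BotAction }> = [
  { pattern: /codegpt/i, action: "ratelimit" },
  { pattern: /somescraper/i, action: "block" },
];

const hits = new Map<string, { count: number; windowStart: number }>();
const LIMIT = 60;         // requests per window per bot pattern
const WINDOW_MS = 60_000; // 1 minute

export default {
  async fetch(req: Request): Promise<Response> {
    const ua = req.headers.get("user-agent") ?? "";
    const rule = BOT_RULES.find((r) => r.pattern.test(ua));
    if (!rule || rule.action === "allow") return fetch(req);
    if (rule.action === "block") return new Response("Forbidden", { status: 403 });

    // Naive in-memory rate limit per bot pattern; fine for a sketch,
    // not for production (state is per-instance and resets on restart).
    const key = rule.pattern.source;
    const now = Date.now();
    const entry = hits.get(key);
    if (!entry || now - entry.windowStart > WINDOW_MS) {
      hits.set(key, { count: 1, windowStart: now });
      return fetch(req);
    }
    entry.count++;
    if (entry.count > LIMIT) {
      return new Response("Too Many Requests", { status: 429 });
    }
    return fetch(req);
  },
};
```

Returning 429 rather than 403 for the rate-limited case matters: it tells a well-behaved agent to back off and retry instead of treating your site as unavailable.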