TaobaoBot
TaobaoBot is a catalog crawler operated by Alibaba. It indexes product feeds, category pages, and product detail pages so they can be surfaced inside shopping experiences and AI-powered commerce assistants.
Allowing it is one of the main ways your products become discoverable inside agentic commerce flows. As AI shopping agents grow, this category of crawler will matter more, not less.
Blocking is fine for sites that are not retail, but think twice if you sell products online.
See TaobaoBot on your own site
Match the User-Agent header on incoming requests against the pattern below.
regex
For higher confidence, also verify the source IP against the operator's published ranges. UA strings can be spoofed; IP ownership is harder to fake.
Renders JavaScript
No
IP verification
User-Agent only
Crawl frequency
Variable
Honors robots.txt
Yes
Honors Crawl-delay
Varies
Alibaba runs 8 bots in total. Each one is a separate user-agent so you can allow or block them independently.
Shopping Bot
3- AliexpressBot
- TaobaoBotYou are here
- TmallBot
Security Scanner
1Training Crawler
1AI Search Index
1Agentic Browser
1Live-Fetch AI
1Should I let TaobaoBot through?
In most cases, yes. Catalog crawlers feed AI shopping experiences. If you sell online, you want to be in their index. If volume gets noisy, rate-limit it before you block it outright.
Does blocking TaobaoBot affect my Google rankings?
No. TaobaoBot is not a search-engine crawler. Your ranking on Google or Bing is unaffected by what you do here.
How do I confirm a request is really from TaobaoBot?
Look at the User-Agent header in your access logs and match it against the strings listed above. Worth knowing that the User-Agent is easy to fake, so this check tells you "the traffic claims to be TaobaoBot", not "the traffic is genuinely TaobaoBot". If you need stronger guarantees, look for a reverse-DNS check or wait for Alibaba to publish IP ranges.
Will TaobaoBot change how my products surface in AI search?
If Alibaba runs an AI shopping surface, then yes. The crawler builds the catalog those experiences pull from. Blocking it means your products won't appear when users ask the AI to shop on their behalf.
How is TaobaoBot different from Alibaba's other bots?
Alibaba splits work across multiple user-agents so site owners can decide on each one independently. Training crawlers, live-fetch agents, search indexers, and agentic browsers each get their own name. Worth scanning the rest of the Alibaba family above to see which ones actually matter for your site.
What's the cleanest way to control TaobaoBot?
Two layers. Robots.txt for the polite crawlers that read it, and rules at your CDN or edge for the ones that don't. Rankly's Agent Experience handles both from a single config, so you can allow, block, rate-limit, or serve a stripped-down version per bot. Agent Analytics handles the observation half so you know which bots are actually worth a rule.
Verify everything above against the operator's own documentation.