# Indian Flavors – Golden — robots.txt # Site: https://indianflavorsgolden.ca # ------------------------------------------------------------------ # Default rules for all crawlers # ------------------------------------------------------------------ User-agent: * Allow: / Disallow: /config/ Disallow: /search/ Disallow: /account/ Disallow: /api/ Allow: /api/ui-extensions/ Disallow: /static/ Disallow: /*?*author=* Disallow: /*?*tag=* Disallow: /*?*month=* Disallow: /*?*view=* Disallow: /*?*format=* # ------------------------------------------------------------------ # AI / LLM crawlers — allow content, hide legal pages # ------------------------------------------------------------------ User-agent: GPTBot User-agent: ChatGPT-User User-agent: CCBot User-agent: anthropic-ai User-agent: Claude-Web User-agent: ClaudeBot User-agent: Google-Extended User-agent: FacebookBot User-agent: cohere-ai User-agent: PerplexityBot Allow: / Disallow: /privacy-policy.html Disallow: /terms-of-use.html # ------------------------------------------------------------------ # Google Ads bots # ------------------------------------------------------------------ User-agent: AdsBot-Google User-agent: AdsBot-Google-Mobile User-agent: AdsBot-Google-Mobile-Apps Allow: / # ------------------------------------------------------------------ # Heavy crawlers — throttle # ------------------------------------------------------------------ User-agent: Baiduspider Crawl-delay: 10 # ------------------------------------------------------------------ # llms.txt — should be readable by AI agents but NOT indexed by search # (Noindex directive is informational; primary enforcement should be # an HTTP `X-Robots-Tag: noindex` header set at the web-server level.) # ------------------------------------------------------------------ Noindex: /llms.txt # ------------------------------------------------------------------ # Sitemap # ------------------------------------------------------------------ Sitemap: https://indianflavorsgolden.ca/sitemap.xml