Skip to content

Instantly share code, notes, and snippets.

@colorincode
Created September 11, 2024 16:36
Show Gist options
  • Save colorincode/82afeb4ca187d36de14390ddf53065b9 to your computer and use it in GitHub Desktop.
Save colorincode/82afeb4ca187d36de14390ddf53065b9 to your computer and use it in GitHub Desktop.
Concepts for blocking AI bot scraping
# not for you mr robot
## Google Fam
# -----------------
User-agent: Googlebot
User-agent: Mediapartners-Google
Disallow: /*/feed
Disallow: /?s=
## Other SE
# -----------------
User-agent: Applebot
User-agent: Bingbot
User-agent: msnbot
User-agent: Slurp
Disallow: /*/feed
Disallow: /?s=
## AI
# -----------------
User-agent: ChatGPT-User
User-agent: OAI-SearchBot
Disallow: /*/feed
Disallow: /?s=
## Internet Archive - archive.org
# -----------------
User-agent: ia_archiver
Disallow:
## Social
# -----------------
User-agent: facebookexternalhit
User-agent: Facebot
User-agent: LinkedInBot
User-agent: Twitterbot
Disallow:
## All other bots
# -----------------
User-agent: *
Disallow: /
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment