How to block AI Crawler Bots using robots.txt filewww.cyberciti.bizexternal-link Cynicus Rex ( @CynicusRex@lemmy.ml ) Privacy@lemmy.mlEnglish • 1 month ago message-square18fedilinkarrow-up166
arrow-up166external-linkHow to block AI Crawler Bots using robots.txt filewww.cyberciti.biz Cynicus Rex ( @CynicusRex@lemmy.ml ) Privacy@lemmy.mlEnglish • 1 month ago message-square18fedilink
minus-square Cynicus Rex ( @CynicusRex@lemmy.ml ) OPlinkfedilink8•1 month ago#TL;DR: User-agent: GPTBot Disallow: / User-agent: ChatGPT-User Disallow: / User-agent: Google-Extended Disallow: / User-agent: PerplexityBot Disallow: / User-agent: Amazonbot Disallow: / User-agent: ClaudeBot Disallow: / User-agent: Omgilibot Disallow: / User-Agent: FacebookBot Disallow: / User-Agent: Applebot Disallow: / User-agent: anthropic-ai Disallow: / User-agent: Bytespider Disallow: / User-agent: Claude-Web Disallow: / User-agent: Diffbot Disallow: / User-agent: ImagesiftBot Disallow: / User-agent: Omgilibot Disallow: / User-agent: Omgili Disallow: / User-agent: YouBot Disallow: /
minus-square mox ( @mox@lemmy.sdf.org ) linkfedilink4•1 month agoOf course, nothing stops a bot from picking a user agent field that exactly matches a web browser.
minus-square JackbyDev ( @JackbyDev@programming.dev ) linkfedilinkEnglish3•1 month agoNothing stops a bot from choosing to not read robots.txt
minus-square mox ( @mox@lemmy.sdf.org ) linkfedilink2•edit-21 month agoIndeed, as has already been said repeatedly in other comments.
#TL;DR:
Of course, nothing stops a bot from picking a user agent field that exactly matches a web browser.
Nothing stops a bot from choosing to not read robots.txt
Indeed, as has already been said repeatedly in other comments.