Cynicus Rex@lemmy.ml to Privacy@lemmy.mlEnglish · 4 months agoHow to block AI Crawler Bots using robots.txt filewww.cyberciti.bizexternal-linkmessage-square57fedilinkarrow-up1106
arrow-up1106external-linkHow to block AI Crawler Bots using robots.txt filewww.cyberciti.bizCynicus Rex@lemmy.ml to Privacy@lemmy.mlEnglish · 4 months agomessage-square57fedilink
minus-squareasudox@lemmy.worldlinkfedilinkarrow-up6·4 months agoNot sure if that is effective at all. Why would a crawler check the robots.txt if it’s programmed to ignore it anyways?
minus-squareɐɥO@lemmy.ohaa.xyzlinkfedilinkarrow-up16·4 months agocause many crawlers seem to explicitly crawl “forbidden” sites
minus-squareCrashumbc@lemmy.worldlinkfedilinkEnglisharrow-up3·4 months agoGoogle and script kiddies copying code…
minus-squareMangoPenguinlinkfedilinkEnglisharrow-up1·4 months agoYou could also place the same page as a hidden link on your home page.
Not sure if that is effective at all. Why would a crawler check the robots.txt if it’s programmed to ignore it anyways?
cause many crawlers seem to explicitly crawl “forbidden” sites
Google and script kiddies copying code…
You could also place the same page as a hidden link on your home page.