How to block AI Crawler Bots using robots.txt file (www.cyberciti.biz)
Posted by Cynicus Rex@lemmy.ml to Privacy@lemmy.ml · English · 3 months ago · 37 comments
asudox@lemmy.world · 3 months ago: Not sure that's effective at all. Why would a crawler check robots.txt if it's programmed to ignore it anyway?
ɐɥO@lemmy.ohaa.xyz · 3 months ago: Because many crawlers seem to explicitly crawl "forbidden" sites.
MangoPenguin@lemmy.blahaj.zone · 3 months ago: You could also place the same page as a hidden link on your home page.
Crashumbc@lemmy.world · 3 months ago: Google and script kiddies copying code…
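On the question of why a crawler would check robots.txt at all: the file is only advisory, so it works when the crawler chooses to consult it. Here is a rough sketch of what that check looks like from the side of a well-behaved crawler, using Python's standard `urllib.robotparser`. The robots.txt content, the `is_allowed` helper, and the example.com URLs are made up for illustration and are not taken from the linked article; `GPTBot` is used as one example of an AI crawler user-agent token. A crawler that skips this step never sees the rules.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for illustration: block one AI crawler, allow everyone else.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if a compliant crawler with this user agent may fetch the URL.

    This is the voluntary check a polite crawler runs before requesting a page.
    Nothing on the server side enforces the answer.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.com/some/page"  # placeholder URL
    for agent in ("GPTBot", "Mozilla/5.0"):
        print(agent, "allowed" if is_allowed(agent, page) else "blocked")
```

So the rules only help against bots that bother to run a check like this. Actually stopping the ones that ignore it would have to happen at the server or firewall level.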