Cynicus Rex@lemmy.ml to Privacy@lemmy.mlEnglish · 1 month agoHow to block AI Crawler Bots using robots.txt filewww.cyberciti.bizexternal-linkmessage-square62fedilinkarrow-up1104arrow-down132
arrow-up172arrow-down1external-linkHow to block AI Crawler Bots using robots.txt filewww.cyberciti.bizCynicus Rex@lemmy.ml to Privacy@lemmy.mlEnglish · 1 month agomessage-square62fedilink
minus-squareAsudox@lemmy.worldlinkfedilinkarrow-up6arrow-down1·1 month agoNot sure if that is effective at all. Why would a crawler check the robots.txt if it’s programmed to ignore it anyways?
minus-squareɐɥO@lemmy.ohaa.xyzlinkfedilinkarrow-up16·1 month agocause many crawlers seem to explicitly crawl “forbidden” sites
minus-squareCrashumbc@lemmy.worldlinkfedilinkEnglisharrow-up3·1 month agoGoogle and script kiddies copying code…
Not sure if that is effective at all. Why would a crawler check the robots.txt if it’s programmed to ignore it anyways?
cause many crawlers seem to explicitly crawl “forbidden” sites
Google and script kiddies copying code…