IP address blocked on certain site
cmeclax-sazri
cmeclax-sazri at ixazon.dynip.com
Fri Feb 4 20:42:17 UTC 2011
On Friday 04 February 2011 13:38:14 Joe Btfsplk wrote:
> No ideas yet on what "automated software that doesn't follow /robots.txt
> is forbidden," means?
robots.txt is a file that some websites publish to tell robots (automated
crawlers) which parts of the site they should not fetch. If you run a wiki and
want only the current versions indexed, not the hundreds of previous versions
of every page, you could put a directive in robots.txt, or label the pages
themselves as "noindex, nofollow". Automated software that ignores such
directives is likely to eat up huge amounts of bandwidth and create copies
that are many times bigger than the original.
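
As a rough illustration of what "following robots.txt" looks like from the
robot's side, here is a minimal sketch using Python's standard
urllib.robotparser module. The host wiki.example.org, the /wiki/history/ path
and the ExampleBot user-agent name are made up for the example and do not
refer to any real site's rules.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for a wiki: every robot may fetch current pages,
# but nothing under the (made-up) page-history path.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wiki/history/
"""

parser = RobotFileParser()
# parse() takes the file's lines; a real crawler would normally call
# set_url() and read() to download the site's actual robots.txt instead.
parser.parse(ROBOTS_TXT.splitlines())


def allowed(url, agent="ExampleBot"):
    """Return True if the parsed rules let `agent` fetch `url`."""
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for url in (
        "http://wiki.example.org/wiki/Main_Page",
        "http://wiki.example.org/wiki/history/Main_Page",
    ):
        print(url, "->", "fetch" if allowed(url) else "skip")
```

A well-behaved robot runs a check like this before every request and skips
whatever is disallowed; software that never looks at the file ends up
downloading every old revision as well.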
cmeclax