blocked bots list


WebSavvy
19-06-2006, 18:43/06:43PM
Below is a list of bots I've blocked from my site using .htaccess. Not all of the bots listed below are "bad" bots per se; many are just "annoying" bots.

Many of the ones listed below are complete site rippers, downloaders, scrapers, malware bots, and so on.

It's not necessary for you to block all of the bots listed below; these are just from my personal list of site intruders. Yours may vary.

To block bad bots from your site, you can add the following into your .htaccess file in the root of your domain.


SetEnvIfNoCase User-agent "8484_Boston_Project" blocked=yes
SetEnvIfNoCase User-agent "Abacho" blocked=yes
SetEnvIfNoCase User-agent "acontbot" blocked=yes
SetEnvIfNoCase User-agent "AdoSpeaker" blocked=yes
SetEnvIfNoCase User-agent "ah-ha" blocked=yes
SetEnvIfNoCase User-agent "AIBOT" blocked=yes
SetEnvIfNoCase User-agent "aipbot" blocked=yes
SetEnvIfNoCase User-agent "Amfibibot" blocked=yes
SetEnvIfNoCase User-agent "AnswerBus" blocked=yes
SetEnvIfNoCase User-agent "appie" blocked=yes
SetEnvIfNoCase User-agent "Arachmo" blocked=yes
SetEnvIfNoCase User-agent "Arameda" blocked=yes
SetEnvIfNoCase User-agent "Arellis" blocked=yes
SetEnvIfNoCase User-agent "Argus" blocked=yes
SetEnvIfNoCase User-agent "ASPSeek" blocked=yes
SetEnvIfNoCase User-agent "asterias" blocked=yes
SetEnvIfNoCase User-agent "baiduspider" blocked=yes
SetEnvIfNoCase User-agent "BecomeBot" blocked=yes
SetEnvIfNoCase User-agent "BigCliqueBOT" blocked=yes
SetEnvIfNoCase User-agent "Bimbot" blocked=yes
SetEnvIfNoCase User-agent "BLA" blocked=yes
SetEnvIfNoCase User-agent "boitho.com-dc" blocked=yes
SetEnvIfNoCase User-agent "BruinBot" blocked=yes
SetEnvIfNoCase User-agent "btbot" blocked=yes
SetEnvIfNoCase User-agent "bumblebee" blocked=yes
SetEnvIfNoCase User-agent "CCGCrawl" blocked=yes
SetEnvIfNoCase User-agent "ccubee" blocked=yes
SetEnvIfNoCase User-agent "CipinetBot" blocked=yes
SetEnvIfNoCase User-agent "ColdFusion" blocked=yes
SetEnvIfNoCase User-agent "Combine" blocked=yes
SetEnvIfNoCase User-agent "contextadbot" blocked=yes
SetEnvIfNoCase User-agent "ConveraCrawler" blocked=yes
SetEnvIfNoCase User-agent "ConveraMultiMediaCrawler" blocked=yes
SetEnvIfNoCase User-agent "cosmos" blocked=yes
SetEnvIfNoCase User-agent "CostaCider" blocked=yes
SetEnvIfNoCase User-agent "Cowbot" blocked=yes
SetEnvIfNoCase User-agent "CrawlConvera" blocked=yes
SetEnvIfNoCase User-agent "CrawlWave" blocked=yes
SetEnvIfNoCase User-agent "CXL-FatAssANT" blocked=yes
SetEnvIfNoCase User-agent "DataCha0s" blocked=yes
SetEnvIfNoCase User-agent "DataFountains" blocked=yes
SetEnvIfNoCase User-agent "Deepindex" blocked=yes
SetEnvIfNoCase User-agent "DiamondBot" blocked=yes
SetEnvIfNoCase User-agent "Digger" blocked=yes
SetEnvIfNoCase User-agent "DM-Search" blocked=yes
SetEnvIfNoCase User-agent "Drecombot" blocked=yes
SetEnvIfNoCase User-agent "DTAagent" blocked=yes
SetEnvIfNoCase User-agent "EasyDL" blocked=yes
SetEnvIfNoCase User-agent "EnfinBot" blocked=yes
SetEnvIfNoCase User-agent "Eule-Robot" blocked=yes
SetEnvIfNoCase User-agent "EuripBot" blocked=yes
SetEnvIfNoCase User-agent "eventax" blocked=yes
SetEnvIfNoCase User-agent "Exabot" blocked=yes
SetEnvIfNoCase User-agent "Exabot-Images" blocked=yes
SetEnvIfNoCase User-agent "fantomas" blocked=yes
SetEnvIfNoCase User-agent "Favcollector" blocked=yes
SetEnvIfNoCase User-agent "Faxobot" blocked=yes
SetEnvIfNoCase User-agent "FDM_2.x" blocked=yes
SetEnvIfNoCase User-agent "Firefox_1.0.6_kasparek" blocked=yes
SetEnvIfNoCase User-agent "fluffy" blocked=yes
SetEnvIfNoCase User-agent "Franklin_Locator" blocked=yes
SetEnvIfNoCase User-agent "FyberSpider" blocked=yes
SetEnvIfNoCase User-agent "Gaisbot" blocked=yes
SetEnvIfNoCase User-agent "Galaxy" blocked=yes
SetEnvIfNoCase User-agent "GalaxyBot" blocked=yes
SetEnvIfNoCase User-agent "gazz" blocked=yes
SetEnvIfNoCase User-agent "genevabot" blocked=yes
SetEnvIfNoCase User-agent "GeoBot" blocked=yes
SetEnvIfNoCase User-agent "Girafabot" blocked=yes
SetEnvIfNoCase User-agent "GOFORITBOT" blocked=yes
SetEnvIfNoCase User-agent "GroschoBot" blocked=yes
SetEnvIfNoCase User-agent "Grub" blocked=yes
SetEnvIfNoCase User-agent "gsa-crawler" blocked=yes
SetEnvIfNoCase User-agent "GT::WWW" blocked=yes
SetEnvIfNoCase User-agent "HappyFunBot" blocked=yes
SetEnvIfNoCase User-agent "Healthbot" blocked=yes
SetEnvIfNoCase User-agent "holmes" blocked=yes
SetEnvIfNoCase User-agent "HooWWWer" blocked=yes
SetEnvIfNoCase User-agent "Hotzonu" blocked=yes
SetEnvIfNoCase User-agent "htdig" blocked=yes
SetEnvIfNoCase User-agent "Html_Link_Validator_" blocked=yes
SetEnvIfNoCase User-agent "HttpProxy" blocked=yes
SetEnvIfNoCase User-agent "http_sample" blocked=yes
SetEnvIfNoCase User-agent "httpunit" blocked=yes
SetEnvIfNoCase User-agent "ia_archiver" blocked=yes
SetEnvIfNoCase User-agent "ichiro" blocked=yes
SetEnvIfNoCase User-agent "IconSurf" blocked=yes
SetEnvIfNoCase User-agent "Iltrovatore-Setaccio" blocked=yes
SetEnvIfNoCase User-agent "Indy Library" blocked=yes
SetEnvIfNoCase User-agent "InetURL" blocked=yes
SetEnvIfNoCase User-agent "InfociousBot" blocked=yes
SetEnvIfNoCase User-agent "INGRID" blocked=yes
SetEnvIfNoCase User-agent "InnerpriseBot" blocked=yes
SetEnvIfNoCase User-agent "intraVnews" blocked=yes
SetEnvIfNoCase User-agent "IOneSearch.bot" blocked=yes
SetEnvIfNoCase User-agent "ISC_Systems_iRc_Search" blocked=yes
SetEnvIfNoCase User-agent "Jakarta_Commons-HttpClient" blocked=yes
SetEnvIfNoCase User-agent "Java" blocked=yes
SetEnvIfNoCase User-agent "Jayde Crawler" blocked=yes
SetEnvIfNoCase User-agent "JetBot" blocked=yes
SetEnvIfNoCase User-agent "KakleBot" blocked=yes
SetEnvIfNoCase User-agent "Kyluka" blocked=yes
SetEnvIfNoCase User-agent "lanshanbot" blocked=yes
SetEnvIfNoCase User-agent "LapozzBot" blocked=yes
SetEnvIfNoCase User-agent "larbin" blocked=yes
SetEnvIfNoCase User-agent "LinkAlarm" blocked=yes
SetEnvIfNoCase User-agent "Link_Valet_Online" blocked=yes
SetEnvIfNoCase User-agent "LinkWalker" blocked=yes
SetEnvIfNoCase User-agent "LocalcomBot" blocked=yes
SetEnvIfNoCase User-agent "LWP::Simple" blocked=yes
SetEnvIfNoCase User-agent "lwp-trivial" blocked=yes
SetEnvIfNoCase User-agent "Mac_Finder" blocked=yes
SetEnvIfNoCase User-agent "Mackster" blocked=yes
SetEnvIfNoCase User-agent "Matrix" blocked=yes
SetEnvIfNoCase User-agent "Metaspinner" blocked=yes
SetEnvIfNoCase User-agent "Microsoft_URL_Control" blocked=yes
SetEnvIfNoCase User-agent "Mirago" blocked=yes
SetEnvIfNoCase User-agent "Missigua_Locator" blocked=yes
SetEnvIfNoCase User-agent "MJ12bot" blocked=yes
SetEnvIfNoCase User-agent "Mnogosearch" blocked=yes
SetEnvIfNoCase User-agent "MonkeyCrawl" blocked=yes
SetEnvIfNoCase User-agent "Mozdex" blocked=yes
SetEnvIfNoCase User-agent "MSNPTC" blocked=yes
SetEnvIfNoCase User-agent "MVAClient" blocked=yes
SetEnvIfNoCase User-agent "My_WinHTTP_Connection" blocked=yes
SetEnvIfNoCase User-agent "NaverBot" blocked=yes
SetEnvIfNoCase User-agent "NavissoBot" blocked=yes
SetEnvIfNoCase User-agent "NetMind-Minder" blocked=yes
SetEnvIfNoCase User-agent "NetMonitor" blocked=yes
SetEnvIfNoCase User-agent "Networking4all" blocked=yes
SetEnvIfNoCase User-agent "Newsgroupreporter_LinkCheck" blocked=yes
SetEnvIfNoCase User-agent "NextGenSearchBot" blocked=yes
SetEnvIfNoCase User-agent "NG" blocked=yes
SetEnvIfNoCase User-agent "nicebot" blocked=yes
SetEnvIfNoCase User-agent "NimbleCrawler" blocked=yes
SetEnvIfNoCase User-agent "NLCrawler" blocked=yes
SetEnvIfNoCase User-agent "NPBot" blocked=yes
SetEnvIfNoCase User-agent "NuSearch Spider" blocked=yes
SetEnvIfNoCase User-agent "Nutch" blocked=yes
SetEnvIfNoCase User-agent "NutchCVS" blocked=yes
SetEnvIfNoCase User-agent "ObjectsSearch" blocked=yes
SetEnvIfNoCase User-agent "Ocelli" blocked=yes
SetEnvIfNoCase User-agent "Octora_Beta" blocked=yes
SetEnvIfNoCase User-agent "Octopus" blocked=yes
SetEnvIfNoCase User-agent "OmniExplorer_Bot" blocked=yes
SetEnvIfNoCase User-agent "Omnipelagos" blocked=yes
SetEnvIfNoCase User-agent "online link validator" blocked=yes
SetEnvIfNoCase User-agent "Openbot" blocked=yes
SetEnvIfNoCase User-agent "Openfind" blocked=yes
SetEnvIfNoCase User-agent "Orbiter" blocked=yes
SetEnvIfNoCase User-agent "OutfoxBot" blocked=yes
SetEnvIfNoCase User-agent "PageBitesHyperBot" blocked=yes
SetEnvIfNoCase User-agent "page_verifier" blocked=yes
SetEnvIfNoCase User-agent "Pajaczek" blocked=yes
SetEnvIfNoCase User-agent "Patwebbot" blocked=yes
SetEnvIfNoCase User-agent "PEAR_HTTP_Request_class" blocked=yes
SetEnvIfNoCase User-agent "PEERbot" blocked=yes
SetEnvIfNoCase User-agent "PhpDig" blocked=yes
SetEnvIfNoCase User-agent "PHP_version_tracker" blocked=yes
SetEnvIfNoCase User-agent "pipeLiner" blocked=yes
SetEnvIfNoCase User-agent "POE-Component-Client-HTTP" blocked=yes
SetEnvIfNoCase User-agent "Poirot" blocked=yes
SetEnvIfNoCase User-agent "polybot" blocked=yes
SetEnvIfNoCase User-agent "Pompos" blocked=yes
SetEnvIfNoCase User-agent "Pooodle_predictor" blocked=yes
SetEnvIfNoCase User-agent "Poodle_predictor" blocked=yes
SetEnvIfNoCase User-agent "Popdexter" blocked=yes
SetEnvIfNoCase User-agent "Port_Huron_Labs" blocked=yes
SetEnvIfNoCase User-agent "psbot" blocked=yes
SetEnvIfNoCase User-agent "psycheclone" blocked=yes
SetEnvIfNoCase User-agent "PyQuery" blocked=yes
SetEnvIfNoCase User-agent "Python-urllib" blocked=yes
SetEnvIfNoCase User-agent "QweeryBot" blocked=yes
SetEnvIfNoCase User-agent "Reaper" blocked=yes
SetEnvIfNoCase User-agent "RAMPyBot" blocked=yes
SetEnvIfNoCase User-agent "Random" blocked=yes
SetEnvIfNoCase User-agent "Ranking-Manager" blocked=yes
SetEnvIfNoCase User-agent "REL_Link_Checker_Lite" blocked=yes
SetEnvIfNoCase User-agent "robschecker" blocked=yes
SetEnvIfNoCase User-agent "RRG" blocked=yes
SetEnvIfNoCase User-agent "RufusBot" blocked=yes
SetEnvIfNoCase User-agent "SandCrawler" blocked=yes
SetEnvIfNoCase User-agent "SANSARN" blocked=yes
SetEnvIfNoCase User-agent "SBIder" blocked=yes
SetEnvIfNoCase User-agent "schibstedsokbot" blocked=yes
SetEnvIfNoCase User-agent "scooter" blocked=yes
SetEnvIfNoCase User-agent "Screw-Ball" blocked=yes
SetEnvIfNoCase User-agent "Scrubby" blocked=yes
SetEnvIfNoCase User-agent "Search-10" blocked=yes
SetEnvIfNoCase User-agent "search.ch" blocked=yes
SetEnvIfNoCase User-agent "Searchmee!" blocked=yes
SetEnvIfNoCase User-agent "SearchSpider" blocked=yes
SetEnvIfNoCase User-agent "Seekbot" blocked=yes
SetEnvIfNoCase User-agent "Sensis Web Crawler" blocked=yes
SetEnvIfNoCase User-agent "Sensis.com.au Web Crawler" blocked=yes
SetEnvIfNoCase User-agent "Shim+Bot" blocked=yes
SetEnvIfNoCase User-agent "ShunixBot" blocked=yes
SetEnvIfNoCase User-agent "shybunnie-engine" blocked=yes
SetEnvIfNoCase User-agent "SideWinder" blocked=yes
SetEnvIfNoCase User-agent "silk" blocked=yes
SetEnvIfNoCase User-agent "SiteSpider" blocked=yes
SetEnvIfNoCase User-agent "SlySearch" blocked=yes
SetEnvIfNoCase User-agent "sna-" blocked=yes
SetEnvIfNoCase User-agent "Snap" blocked=yes
SetEnvIfNoCase User-agent "Snappy" blocked=yes
SetEnvIfNoCase User-agent "Snoopy" blocked=yes
SetEnvIfNoCase User-agent "sohu-search" blocked=yes
SetEnvIfNoCase User-agent "Speed-Meter" blocked=yes
SetEnvIfNoCase User-agent "SpeedySpider" blocked=yes
SetEnvIfNoCase User-agent "Spinne" blocked=yes
SetEnvIfNoCase User-agent "SquidClamAV_Redirector" blocked=yes
SetEnvIfNoCase User-agent "Squid-Prefetch" blocked=yes
SetEnvIfNoCase User-agent "SquigglebotBot" blocked=yes
SetEnvIfNoCase User-agent "StackRambler" blocked=yes
SetEnvIfNoCase User-agent "sureseeker" blocked=yes
SetEnvIfNoCase User-agent "SurveyBot" blocked=yes
SetEnvIfNoCase User-agent "SuperBot" blocked=yes
SetEnvIfNoCase User-agent "SygolBot" blocked=yes
SetEnvIfNoCase User-agent "SynoBot" blocked=yes
SetEnvIfNoCase User-agent "Szukacz" blocked=yes
SetEnvIfNoCase User-agent "TerrawizBot" blocked=yes
SetEnvIfNoCase User-agent "ThisIsOurYear_Linkchecker" blocked=yes
SetEnvIfNoCase User-agent "thumbshots-de-Bot" blocked=yes
SetEnvIfNoCase User-agent "Tkensaku" blocked=yes
SetEnvIfNoCase User-agent "topicblogs" blocked=yes
SetEnvIfNoCase User-agent "TridentSpider" blocked=yes
SetEnvIfNoCase User-agent "troovziBot" blocked=yes
SetEnvIfNoCase User-agent "TurnitinBot" blocked=yes
SetEnvIfNoCase User-agent "TutorGigBot" blocked=yes
SetEnvIfNoCase User-agent "Ultraseek" blocked=yes
SetEnvIfNoCase User-agent "unchaos_crawler" blocked=yes
SetEnvIfNoCase User-agent "Updated" blocked=yes
SetEnvIfNoCase User-agent "URL Spider Pro" blocked=yes
SetEnvIfNoCase User-agent "URL Spider SQL" blocked=yes
SetEnvIfNoCase User-agent "Vagabondo" blocked=yes
SetEnvIfNoCase User-agent "vBSEO_" blocked=yes
SetEnvIfNoCase User-agent "W3CRobot" blocked=yes
SetEnvIfNoCase User-agent "webbot" blocked=yes
SetEnvIfNoCase User-agent "WebCorp" blocked=yes
SetEnvIfNoCase User-agent "webcrawl.net" blocked=yes
SetEnvIfNoCase User-agent "Web_Downloader" blocked=yes
SetEnvIfNoCase User-agent "WebFindBot" blocked=yes
SetEnvIfNoCase User-agent "WebIndexer" blocked=yes
SetEnvIfNoCase User-agent "Webnavigator" blocked=yes
SetEnvIfNoCase User-agent "WebStripper" blocked=yes
SetEnvIfNoCase User-agent "WEP_Search" blocked=yes
SetEnvIfNoCase User-agent "WhizBang" blocked=yes
SetEnvIfNoCase User-agent "WISEbot" blocked=yes
SetEnvIfNoCase User-agent "Wotbox" blocked=yes
SetEnvIfNoCase User-agent "WWW-Mechanize" blocked=yes
SetEnvIfNoCase User-agent "WWWeasel" blocked=yes
SetEnvIfNoCase User-agent "wwwster" blocked=yes
SetEnvIfNoCase User-agent "Xenu_Link_Sleuth" blocked=yes
SetEnvIfNoCase User-agent "xirq" blocked=yes
SetEnvIfNoCase User-agent "XunBot" blocked=yes
SetEnvIfNoCase User-agent "yacybot" blocked=yes
SetEnvIfNoCase User-agent "YadowsCrawler" blocked=yes
SetEnvIfNoCase User-agent "Yeti" blocked=yes
SetEnvIfNoCase User-agent "YottaShopping_Bot" blocked=yes
SetEnvIfNoCase User-agent "Zao" blocked=yes
SetEnvIfNoCase User-agent "Zatka" blocked=yes
SetEnvIfNoCase User-agent "Zealbot" blocked=yes
SetEnvIfNoCase User-agent "Zeus_" blocked=yes
SetEnvIfNoCase User-agent "ZipppBot" blocked=yes
SetEnvIfNoCase User-agent "ZyBorg" blocked=yes
<Limit GET POST PUT>
Order allow,deny
deny from env=blocked
allow from all
</Limit>
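
The Order/Deny/Allow lines above are the Apache 2.2 access-control syntax. On Apache 2.4 they only work when mod_access_compat is loaded. A rough 2.4 equivalent, assuming mod_setenvif and mod_authz_core are available and your host permits these directives in .htaccess, might look like the sketch below. The single SetEnvIfNoCase line stands in for the full list.

```apache
# Flag the request when the User-agent header matches (case-insensitive regex).
SetEnvIfNoCase User-agent "larbin" blocked=yes

# Apache 2.4 syntax: allow everyone except requests carrying the "blocked" flag.
<RequireAll>
    Require all granted
    Require not env blocked
</RequireAll>
```

Unlike the <Limit GET POST PUT> wrapper, this version applies to every request method. Flagged requests should receive a 403 Forbidden response.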

Connie
20-06-2006, 20:35/08:35PM
Let me add: please do not post comments in this thread. That would defeat the purpose of the thread, which is to give a list of bad bots in one place that is easy for anyone to find without sorting through a dozen posts for new additions to the list.

Yes, Deb, thanks for the initial code and list.

The original thread was here (http://www.ihelpyou.com/forums/showthread.php?s=&threadid=22455)

or start a new thread here (http://www.ihelpyou.com/forums/forumdisplay.php?s=&forumid=157)

If you have bad bots to post, please feel free to do so in this thread. Otherwise, use the original thread or the new forum to post your comments.

I have some to add once I compare with your list. No need for duplicates.

WebSavvy
25-06-2006, 00:28/12:28AM
Seekbot is in the list above. It seems they also use another user-agent string (now added to my blocked bots list on my site).


SetEnvIfNoCase User-agent "First_Browse_of_COnn" blocked=yes

WebSavvy
25-06-2006, 00:44/12:44AM
2 more to add to the list:


SetEnvIfNoCase User-agent "BDFetch" blocked=yes
SetEnvIfNoCase User-agent "Wells_Search_II" blocked=yes

Connie
04-07-2006, 12:40/12:40PM
Here are a few more.

SetEnvIfNoCase User-agent "BlackWidow" blocked=yes
SetEnvIfNoCase User-agent "Bot\ mailto:craftbot@yahoo.com" blocked=yes
SetEnvIfNoCase User-agent "ChinaClaw" blocked=yes
SetEnvIfNoCase User-agent "Custo" blocked=yes
SetEnvIfNoCase User-agent "DISCo" blocked=yes
SetEnvIfNoCase User-agent "Download\ Demon" blocked=yes
SetEnvIfNoCase User-agent "eCatch" blocked=yes
SetEnvIfNoCase User-agent "EirGrabber" blocked=yes
SetEnvIfNoCase User-agent "EmailSiphon" blocked=yes
SetEnvIfNoCase User-agent "EmailWolf" blocked=yes
SetEnvIfNoCase User-agent "Express\ WebPictures" blocked=yes
SetEnvIfNoCase User-agent "ExtractorPro" blocked=yes
SetEnvIfNoCase User-agent "EyeNetIE" blocked=yes
SetEnvIfNoCase User-agent "FlashGet" blocked=yes
SetEnvIfNoCase User-agent "GetRight" blocked=yes
SetEnvIfNoCase User-agent "GetWeb!" blocked=yes
SetEnvIfNoCase User-agent "Go!Zilla" blocked=yes
SetEnvIfNoCase User-agent "Go-Ahead-Got-It" blocked=yes
SetEnvIfNoCase User-agent "GrabNet" blocked=yes
SetEnvIfNoCase User-agent "Grafula" blocked=yes
SetEnvIfNoCase User-agent "HMView" blocked=yes
SetEnvIfNoCase User-agent "HTTrack" blocked=yes
SetEnvIfNoCase User-agent "Image\ Stripper" blocked=yes
SetEnvIfNoCase User-agent "Image\ Sucker" blocked=yes
SetEnvIfNoCase User-agent "InterGET" blocked=yes
SetEnvIfNoCase User-agent "Internet\ Ninja" blocked=yes
SetEnvIfNoCase User-agent "JetCar" blocked=yes
SetEnvIfNoCase User-agent "JOC\ Web\ Spider" blocked=yes
SetEnvIfNoCase User-agent "LeechFTP" blocked=yes
SetEnvIfNoCase User-agent "Mass\ Downloader" blocked=yes
SetEnvIfNoCase User-agent "MIDown\ too" blocked=yes
SetEnvIfNoCase User-agent "Mister\ PiX" blocked=yes
SetEnvIfNoCase User-agent "Navroad" blocked=yes
SetEnvIfNoCase User-agent "NearSite" blocked=yes
SetEnvIfNoCase User-agent "Net\ Vampire" blocked=yes
SetEnvIfNoCase User-agent "NetAnts" blocked=yes
SetEnvIfNoCase User-agent "NetSpider" blocked=yes
SetEnvIfNoCase User-agent "NetZIP" blocked=yes
SetEnvIfNoCase User-agent "Offline\ Explorer" blocked=yes
SetEnvIfNoCase User-agent "Offline\ Navigator" blocked=yes
SetEnvIfNoCase User-agent "PageGrabber" blocked=yes
SetEnvIfNoCase User-agent "Papa\ Foto" blocked=yes
SetEnvIfNoCase User-agent "pavuk" blocked=yes
SetEnvIfNoCase User-agent "pcBrowser" blocked=yes
SetEnvIfNoCase User-agent "RealDownload" blocked=yes
SetEnvIfNoCase User-agent "ReGet" blocked=yes
SetEnvIfNoCase User-agent "SiteSnagger" blocked=yes
SetEnvIfNoCase User-agent "SmartDownload" blocked=yes
SetEnvIfNoCase User-agent "SuperHTTP" blocked=yes
SetEnvIfNoCase User-agent "Surfbot" blocked=yes
SetEnvIfNoCase User-agent "tAkeOut" blocked=yes
SetEnvIfNoCase User-agent "Teleport\ Pro" blocked=yes
SetEnvIfNoCase User-agent "VoidEYE" blocked=yes
SetEnvIfNoCase User-agent "Web\ Image\ Collector" blocked=yes
SetEnvIfNoCase User-agent "Web\ Sucker" blocked=yes
SetEnvIfNoCase User-agent "WebAuto" blocked=yes
SetEnvIfNoCase User-agent "WebCopier" blocked=yes
SetEnvIfNoCase User-agent "WebFetch" blocked=yes
# from wmw
SetEnvIfNoCase User-agent "WebGather" blocked=yes
SetEnvIfNoCase User-agent "WebGo\ IS" blocked=yes
SetEnvIfNoCase User-agent "WebLeacher" blocked=yes
SetEnvIfNoCase User-agent "WebReaper" blocked=yes
SetEnvIfNoCase User-agent "WebSauger" blocked=yes
SetEnvIfNoCase User-agent "Website\ eXtractor" blocked=yes
SetEnvIfNoCase User-agent "Website\ Quester" blocked=yes
SetEnvIfNoCase User-agent "WebStripper" blocked=yes
SetEnvIfNoCase User-agent "WebWhacker" blocked=yes
SetEnvIfNoCase User-agent "WebZIP" blocked=yes
SetEnvIfNoCase User-agent "Wget" blocked=yes
SetEnvIfNoCase User-agent "Widow" blocked=yes
SetEnvIfNoCase User-agent "WWWOFFLE" blocked=yes
SetEnvIfNoCase User-agent "Xaldon\ WebSpider" blocked=yes
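
If the list gets unwieldy, several entries can be folded into one directive with regex alternation. A minimal sketch using a handful of names from the list above; the grouping is only illustrative, not a recommendation of which bots belong together:

```apache
# Five downloaders from the list above folded into a single pattern.
# The space in "Teleport Pro" is escaped with a backslash, as in the entries above.
SetEnvIfNoCase User-agent "(BlackWidow|HTTrack|Teleport\ Pro|WebZIP|Wget)" blocked=yes
```

One line per bot is easier to comment and to remove later, so this is mostly a matter of taste.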

Please post comments or questions about bad bots here (http://www.ihelpyou.com/forums/showthread.php?s=&threadid=22608)

Connie
07-07-2006, 19:30/07:30PM
I'm locking this thread. There has been a lot of confusion based on Deb's original list. If you have a question about a particular bot, please start a new thread about that bot.

Deb originally said: "Below is a list of bots I've blocked from my site using .htaccess. Not all of the bots listed below are 'bad' bots per se; many are just 'annoying' bots."

If the bot you post in a new thread needs to be on this list, then a Super Mod will include it.

That way some of us will have done some investigation. Then we can say either that a bot is a bad bot or that we don't know.

WebSavvy
14-07-2006, 14:32/02:32PM
SetEnvIfNoCase User-agent "cfetch" blocked=yes
SetEnvIfNoCase User-agent "lwp-request" blocked=yes
SetEnvIfNoCase User-agent "^oBot" blocked=yes

WebSavvy
20-08-2006, 13:20/01:20PM
When blocking bots using this method, it's better to set the block rule this way:

SetEnvIfNoCase User-agent "^BlockedBotName" blocked=yes

Vs this way:

SetEnvIfNoCase User-agent "BlockedBotName" blocked=yes

The difference is the ^ (caret), which anchors the pattern to the start of the User-agent string.

Using "in" as an example:

If we block "in" ... it will also block anything that contains "in" anywhere, such as:
instant
begin
fine

If we block "^in" ... it will only block user agents starting with "in", like:
injure
incase

Tony (grungee) couldn't get to my site because he was somehow matching something in the blocked bots rule.

I changed it to "^
and now he can get to my site just fine.
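
To make the difference concrete, here is a small sketch using names that appear earlier in this thread. The anchored patterns assume the client sends its default User-agent string (Wget sends "Wget/<version>" and Java's built-in HTTP client sends "Java/<version>"). That is an assumption about those clients, not something checked against the logs discussed here.

```apache
# Unanchored (shown commented out): matches "java" anywhere in the string,
# in any letter case, so a User-agent that merely mentions "JavaScript"
# would be blocked too.
# SetEnvIfNoCase User-agent "Java" blocked=yes

# Anchored: matches only when the string starts with "Java/" or "Wget/".
SetEnvIfNoCase User-agent "^Java/" blocked=yes
SetEnvIfNoCase User-agent "^Wget/" blocked=yes
```

One caveat: many crawlers send a string that begins with "Mozilla/5.0 (compatible; " and only then give their name, so an anchored pattern such as "^ExampleBot" (a made-up name) would likely never match them. For those, the name probably has to stay unanchored, and it should be long and specific enough not to collide with real browsers.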