100+ Common Web Crawlers

107 robots
Yahoo Slurp
Unknown robot (identified by ‘crawl’)
Googlebot
Yahoo! Slurp China
GouGou
OutfoxBot
GigaBot
Lilina
MSNBot
Java (Often spam bot)
NewsGator Online
BaiDuSpider
Sina Iask Spider
Bloglines
MagpieRSS
Alexa (IA Archiver)
Feedfetcher-Google
MT::Telegraph::Agent
Feedburner
RoJo aggregator
Python-urllib
Jakarta commons-httpclient
Heritrix
Google AdSense
Mydoyouhike
Unknown robot (identified by ‘spider’)
Voyager
Ocelli
Unknown robot (identified by hit on ‘robots.txt’)
Unknown robot (identified by ‘robot’)
larbin
Hylanda
ZyBorg
ZhuaXia
lanshanbot
Technoratibot
Microsoft URL Control
IRLbot
Unknown robot (identified by ‘bot/’ or ‘bot-’)
Turn It In
Megite
MSIECrawler
msnbot-media
Sogou Spider
NutchCVS
MJ12bot
Kinjabot
Gaisbot
SurveyBot
Ask
StackRambler
Girafabot
T-H-U-N-D-E-R-S-T-O-N-E
Yahoo Feed Seeker
WordPress
UniversalFeedParser
Sphere Scout
findlinks
SBIder
Yahoo-Blogs
FeedValidator
Yahoo-MMCrawler
lwp-trivial
Webdup
Blogslive
IBM Almaden Research Center WebFountain
Openfind data gatherer
BlogPulse ISSpider intelliseek.com
HPPrint
Walhello appie
BlogSearch
ping.blo.gs
Biz360 spider
UP.Browser
topicblogs
Exabot
Snappy
LinkWalker
BlogBridge Service
The World Wide Web Worm
Nutch
The web archive (IA Archiver)
Feedster
YahooSeeker-Testing
Voila
aipbot
PluckFeedCrawler
Everest-Vulcan
NG 2.x (Exalead)
MS SharePoint Portal Server – MS Search 4.0 Robot
Missigua_Locator
boitho.com-dc
Sunrise
Blogshares Spiders
ExactSeek Crawler
Nagios
nicebot
HTTrack off-line browser
Harvest
SandCrawler (Microsoft)
edgeio-retriever
NG 1.x (Exalead)
HTMLParser
Scooter
Y!J Yahoo Japan
arks
Tagyu Agent
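
Several entries in the list above are not named crawlers but catch-all rules: a client is counted as an "Unknown robot" when its User-Agent contains ‘crawl’, ‘spider’, ‘robot’, ‘bot/’ or ‘bot-’, or when it requests ‘robots.txt’. The following is a minimal sketch of how such matching might work. The function name `classify_user_agent`, the `NAMED_BOTS` table, and the lowercase tokens in it are assumptions made for illustration; the original list does not say which substrings are used to detect each named robot.

```python
# Hypothetical token -> display-name table for a few robots from the list.
# The tokens are assumed substrings of the User-Agent header, not taken
# from the original list. More specific tokens come first so that
# "msnbot-media" is not swallowed by "msnbot".
NAMED_BOTS = [
    ("googlebot", "Googlebot"),
    ("baiduspider", "BaiDuSpider"),
    ("msnbot-media", "msnbot-media"),
    ("msnbot", "MSNBot"),
    ("mj12bot", "MJ12bot"),
    ("python-urllib", "Python-urllib"),
]

# Catch-all rules mirroring the "Unknown robot" entries in the list.
GENERIC_PATTERNS = [
    (("bot/", "bot-"), "Unknown robot (identified by 'bot/' or 'bot-')"),
    (("crawl",), "Unknown robot (identified by 'crawl')"),
    (("spider",), "Unknown robot (identified by 'spider')"),
    (("robot",), "Unknown robot (identified by 'robot')"),
]


def classify_user_agent(user_agent, requested_path=""):
    """Return a robot label for a request, or None if it looks like a normal visitor."""
    ua = user_agent.lower()

    # 1. Named robots take priority over the generic rules.
    for token, name in NAMED_BOTS:
        if token in ua:
            return name

    # 2. Fall back to the generic substring rules.
    for tokens, label in GENERIC_PATTERNS:
        if any(token in ua for token in tokens):
            return label

    # 3. Anything that fetches robots.txt is treated as a robot.
    if requested_path == "/robots.txt":
        return "Unknown robot (identified by hit on 'robots.txt')"

    return None


if __name__ == "__main__":
    print(classify_user_agent("Mozilla/5.0 (compatible; Googlebot/2.1)"))
    print(classify_user_agent("ExampleCrawler/1.0"))
    print(classify_user_agent("Mozilla/5.0 Firefox", "/robots.txt"))
```

Checking named robots before the generic rules keeps a well-known crawler from being reported as "Unknown robot" just because its name happens to contain "bot" or "spider".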


3 comments
Louis Han
February 24, 2011, 12:42

What a thorough roundup.

娜娜lei1314
June 8, 2011, 10:07

How come I don't recognize a single one of these? Boo-hoo!
