日志里搜索引擎机器人的名称大全

搜索引擎机器人的名称

Googlebot

google

Baiduspider

BaiDuSpider

msnbot 或 MSNBOT MSNBot

Isaac Ding

P.Arthur

Yahoo!+Slurp

sohu-search

SpiderMan =(yahoo)

Gaisbot

psbot

MSIECrawler

SBL-BOT

Mediapartners-Google

BecomeBot

8fang.net

Gigabot

Spider.NET

Wget

HaoRanSoft

Swoogle

Ultraseek

Java

LinkWalker

Openfind 或Openbot

AIBOT

Nutch

Lycos

Robozilla

crawler

Web Indexing Robot

SurveyBot 2.3 (Whois Source)

FAST-WebCrawler

Scooter

Slurp

Alexa

WISENutbot

IBM_Planetwide

Turn It In

larbin

Jeeves

crawl

Voila

appie

Google AdSense

spider

robot

BBot

NewzCrawler

=============================

以下未验证

--------------------------------------------------------------------------------

AbachoBOT=Abacho.com

abcdatos_botlink=Abcdatos.com

http://www.abcdatos.com/botlink/=Abcdatos.com

AESOP_com_SpiderMan=Aesop.com

ah-ha.com crawler (crawler@ah-ha.com)=ah-ha.com

ia_archiver=Archive.org

Scooter=Altavista.com

Mercator=Altavista.com

Scooter2_Mercator_3-1.0=Altavista.com

roach.smo.av.com-1.0=Altavista.com

Tv_Merc_resh_26_1_D-1.0=Altavista.com

AltaVista-Intranet=Altavista.co.uk

jan.gelin@av.com=Altavista.co.uk

FAST-WebCrawler=alltheweb.com

crawler@fast.no=alltheweb.com

Acoon Robot=acoon.de

antibot=antisearch.net

Atomz=atomz.com

Buscaplus Robi=buscaplus.com

CanSeek/=canseek.ca

support@canseek.ca=canseek.ca

ChristCRAWLER=christcrawler.com

Crawler=crawler.de

admin@crawler.de=crawler.de

DaAdLe.com ROBOT/=daadle.com

RaBot=daum.net

Agent-admin/=daum.net

phortse@hanmail.net=daum.net

contact/jylee@kies.co.kr=kies.co.kr

DeepIndex=deepindex.com

DittoSpyder=ditto.com

Jack=domanova.co.uk

Speedy Spider=entireweb.com

ArchitextSpider=excite.com

ArchitectSpider=excite.com

Arachnoidea=euroseek.net

arachnoidea@euroseek.net=euroseek.net

EZResult=ezresults.com

Fast PartnerSite Crawler=fastsearch.net

FAST Data Search Crawler=fastsearch.net

KIT-Fireball=fireball.de

FyberSearch=fybersearch.com

GalaxyBot=galaxy.com

geckobot=geckobot.com

GenCrawler=gendoor.com

GeonaBot=geona.com

Googlebot=Google.com

googlebot@googlebot.com=Google.com

google=Google.com

moget/2.0=goo.ne.jp

moget@goo.ne.jp=goo.ne.jp

Aranha=girafa.com

Slurp.so/1.0=Yahoo

slurp@inktomi.com=Yahoo

Slurp/2.0j=Yahoo

www.inktomisearch.com=Yahoo

Slurp/2.0-KiteHourly=Yahoo

Slurp/2.0-OwlWeekly=Yahoo

spider@aeneid.com=Yahoo

Slurp/3.0-AU=Yahoo

Toutatis 2.5-2=hoppa.com

Hubater=hubat.com

IlTrovatore-Setaccio=iltrovatore.it

IncyWincy=incywincy.com

UltraSeek=infoseek.com

InfoSeek Sidewinder=infoseek.com

Mole2/1.0=intags.de

webmaster@intags.de=intags.de

MP3Bot=mp3bot.de

C-PBWF-ip3000.com-crawler=ip3000.com

ip3000.com-crawler=ip3000.com

kuloko-bot/0.2=kuloko.com

LNSpiderguy=lexis-nexis.com

NetResearchServer=look.com

MantraAgent=looksmart.com

NetResearchServer=loopimprovements.com

Lycos_Spider_(T-Rex)=lycos.com

JoocerBot=joocer.com

HenryTheMiragoRobot=mirago.co.uk

mozDex/=mozdex.com

MSNBOT/0.1=MSN

Gulliver=northernlight.com

ObjectsSearch/0.01=objectssearch.com

PicoSearch/=picosearch.com

PJspider=portaljuice.com

DIIbot=powerinter.net

nttdirectory_robot=navi.ocn.ne.jp

super-robot@super.navi.ocn.ne.jp=navi.ocn.ne.jp

griffon=super.navi.ocn.ne.jp

griffon@super.navi.ocn.ne.jp=super.navi.ocn.ne.jp

Spider/maxbot.com=maxbot.com

admin@maxbot.com=maxbot.com

gazz/1.0=Unknown Spider

gazz@nttrd.com=Unknown Spider

National
Directory-SuperSpider=nationaldirectory.com

dloader(NaverRobot)/=naver.com

dumrobo(NaverRobot)/=naver.com

Openfind piranha=openfind.com

Shark=openfind.com

robot-response@openfind.com.tw=openfind.com.tw

Openbot/=openfind.com.tw

psbot=picsearch.org

CrawlerBoy=pinpoint.com

ip3000.com=petersnews.com

AlkalineBOT=AlkalineBOT

Fluffy the spider=searchhippo.com

info@searchhippo.com=searchhippo.com

Scrubby/=scrubtheweb.com

asterias=singingfish.com

speedfind ramBot xtreme=speedfind.de

Kototoi/0.1=s.u-tokyo.ac.jp

Searchspider/=searchspider.com

SightQuestBot/=sightquest.com

Spider_Monkey/=spidermonkey.ca

Surfnomore Spider v1.1=surfnomore.com

Robot@SuperSnooper.Com=supersnooper.com

teoma_
agent1=teoma.com

teoma_admin@hawkholdings.com=teoma.com

Teradex_Mapper=mapper.teradex.com

mapper@teradex.com=mapper.teradex.com

ESISmartSpider=travel-finder.com

Spider TraficDublu=traficdublu.ro

Tutorial Crawler=tutorgig.com

UK Searcher Spider=uksearcher.co.uk

Vivante Link Checker=vivante.com

appie=walhello.com

Nazilla=websmostlinked.com

www.WebWombat.com.au=webwombat.com.au

marvin/infoseek=webseek.de

marvin-team@webseek.de=webseek.de

MuscatFerret=webtop.com

WhizBang! Lab=whizbanglabs.com

ZyBorg=wisenut.com

WIRE WebRefiner=wire.co.uk

WSCbot=worldsearchcenter.com

Yandex=yandex.com

Yellopet-Spider=yellowpet.com

Iron33=verno.ueda.info.waseda.ac.jp/

ALink=Link Checkers

AMeta=Link Checker

ASPSearch URL Checker=Link Checker

BlogBot=Link Checker

BMChecker=Link Checker

Bookmark Buddy=Link Checker

Check&Get=Link Checker

CheckWeb=Link Checker

CNET_Snoop=Link Checker

CSE HTML Validator=Link Checker

DRKSpider=Link Checker

DISCo Watchman=Link Checker

DoctorHTML=Link Checker

Email Extractor=Email Extractor

EmailSiphon=Email Extractor

EmailWolf=Email Extractor

FavOrg=Link Checker

Favorites Sweeper=Link Checker

FreshLinks.exe=Link Checker

Funnel Web
Profiler=Link Checker

Html Link Validator=Link Checker

The Informant=Link Checker

The Intraformant=Link Checker

InternetLinkAgent=Link Checker

InternetPeriscope=Link Checker

javElink=Link Checker

jdwhatsnew.cgi=Link Checker

JRTS Check Favorites
Utility=Link Checker

Lambda LinkCheck=Link Checker

LinkLint-checkonly=Link Checker

LinkAlarm=Link Checker

Linkbot=Link Checker

Linkman=Link Checker

LinkProver=Link Checker

Links=Link Checker

LinkScan Server=Link Checker

LinkSweeper=Link Checker

Link Valet Online=Link Checker

LinkVerify Spider=Link Checker

LinkWalker=Link Checker

Morning Paper=Link Checker

MoveAnnouncer=Link Checker

NetLookout=Link Checker

NetMechanic=Link Checker

www.elsop.com=Link Checker

NetMind-Minder=Link Checker

Net
Monitor=Link Checker

Netprospector JavaCrawler=Link Checker

online link validator=Link Checker

Rational SiteCheck=Link Checker

Robozilla=Link Checker

RPT-HTTP
Client=Link Checker

SurfMaster=Link Checker

SyncIT=Link Checker

Watchfire WebXM=Link Checker

WatzNew Agent=Link Checker

WebSite-Watcher=Link Checker

WebTrends Link Analyzer=Link Checker

Weblink Scanner=Link Checker

Xenu's Link Sleuth=Link Checker

W3C_Validator=Link Validator

WDG_Validator/=Link Validator

Tooter=Link Validator

citenikbot/=citenik.co.uk

CLIPS-index=clips-index.imag.fr/

Computer_and_Automation_Research_Institute_Crawler=Research Bot

cosmos=xyleme.com

robot@xyleme.com=xyleme.com

DiaGem/=DiaGem

Digimarc WebReader=digimarc.com

EchO!/2.0=voila.com

FinaleRobot=expressus.com

robot-master@expressus.com=expressus.com

Ideare - SignSite=ideare.com

GentleSpider=research.att.com

Gulper Web Bot=Gulper Web Bot

larbin=Unknown Spider

sebastien.ailleret@inria.fr=inria.fr

ghi@lcs.mit.edu=Unknown Spider

MultiText=MultiText

NEC Research Agent=NEC Research Agent

OntoSpider=OntoSpider

sherlock_spider=sherlock.com.cn

Steeler=Steeler

ru-robot=rutgers.edu

0.1_hseo(at)cs.rutgers.edu=rutgers.edu

WebGather=WebGather

xyro=xyro

xcrawler@inria.fr=Unknown Spider

Zao/0.2=Zao

ADSARobot=ADSARobot

AnswerChase=AnswerChase

ASPSeek=ASPSeek

AVSearch=AVSearch

Checkbot=Checkbot

DaviesBot=DaviesBot

deepweb=deepweb.com

GigaBaz=brainbot.com

GigaBazVStheWeb=brainbot.com

crawler@brainbot.com=brainbot.com

Giskard=oralco.com

InternetSeer=InternetSeer

ipiumBot=ipiumBot

InsumaScout=InsumaScout

Katriona=Katriona

LEIA=LEIA

LexiBot=lexibot.com

metabot=metabot

NetCruiser=NetCruiser

NPBot=nameprotect.com

NetZippy=NetZippy

NZBot=navigationzone.com

Opencola=opencola.com

Oxxbot1=Oxxbot

Pansophica=Pansophica

Phoaks=Phoaks

PICgrabber=PICgrabber

PictureOfInternet=PictureOfInternet

erik@mal
function.org=Unknown Spider

PintaSpider=PintaSpider

PolyBot=PolyBot

Squid=Squid

Sqworm=Sqworm

TaWWWantula=TaWWWantula

TeraCrawl=TeraCrawl

TurnitinBot=turnitin.com

UCmore=ucmore.com

UdmSearch=mnoGoSearch

unlostBot=unlost.com

URLBlaze=urlblaze.net

UrlScope=UrlScope

Vagabondo=Vagabondo

vspider=vspider

WAVETools=WAVETools

Webbandit=Webbandit

Webclipping.com=Webclipping.com

webcollage=webcollage

WebCompass=WebCompass

WebGenie=WebGenie

Web Magnet=Unknown Spider

WebMiner=Unknown Spider

Webpush=Unknown Spider

WebSymmetrix=Unknown Spider

webrank=Unknown Spider

webwasher=Unknown Spider

WhosTalking=Unknown Spider

AnzwersCrawl/2.0=Anzwers

fido/1.0 Harvest/1.4.pl2=Planet Search

GAIS Robot/1.0B2=seednet

Googlebot/1.0=Google.com

Gulliver/1.2=Northern Light

Infoseek Sidewinder/0.9=Infoseek

KIT_Fireball/2.0=Fireball

lwp-trivial/1.27=Search 4 Free

Lycos_Spider_(T-Rex)/3.0=Lycos

Scooter/1.0=AltaVista

Scooter/1.0 scooter@pa.dec.com=AltaVista

Scooter/1.1 (custom)=AltaVista

Scooter/2.0 G.R.A.B. X2.0=AltaVista

Scooter/2.0 G.R.A.B. V1.1.0=AltaVista

search.at V1.2=search.at

inktomi=Inktomi Spider

SwissSearch V1.2=SwissSearch

The Informant=The Informant

Ultraseek=Infoseek

WebCrawler/3.0 Robot libwww/5.0a=WebCrawler

WebCrawler-AddURL/2.0=WebCrawler

WiseWire=WiseWire

WiseWire-Alpha-1.0=WiseWire

WiseWire-Alpha-Spider=WiseWire

WiseWire-Alpha12-Spider971219a=WiseWire

WiseWire-Alpha12-Spider(97122
3a)=WiseWire

WiseWire-HotSpider-1.0=WiseWire

WiseWire-Spider=WiseWire

WiseWire-Spider-1.0=WiseWire

WiseWire-Spider2=WiseWire

WiseWire-Widow-1.0=WiseWire

WiseWire-Widow-1.0r=WiseWire

WiseWire-Widow-1.0-ALPHA12=WiseWire

CherryPickerSE/1.0=Email Extractor

CherryPickerElite/1.0=Email Extractor

Crescent Internet ToolPak HTTP OLE Control v.1.0=Email Extractor

EmailCollector/1.0=Email Extractor

EmailWolf 1.00=Email Extractor

ExtractorPro=Email Extractor

ask jeeves=Ask Jeeves

lycos=Lycos.com

whatuseek=What You Seek

wisenutbot=Looksmart

msnbot=MSN

GigaBlast=Gigablast

Gigabot=Gigablast

archive_org=Archive.org

jeeves=Ask Jeeves

Asterias=Singingfish Spider

Slurp=Inktomi Spider

ZyBorg=LookSmart Bot

baiduspider=Baidu

时间: 2024-07-31 14:31:05

日志里搜索引擎机器人的名称大全的相关文章

[译] 在 Apache 和 Nginx 日志里检测爬虫机器人

本文讲的是[译] 在 Apache 和 Nginx 日志里检测爬虫机器人, 原文地址:Detecting Bots in Apache & Nginx Logs 原文作者:Mark Litwintschik 译文出自:掘金翻译计划 译者:luoyaqifei 校对者:forezp,1992chenlu 在 Apache 和 Nginx 日志里检测爬虫机器人 现在阻止基于 JavaScript 追踪的浏览器插件享有九位数的用户量,从这一事实可以看出,web 流量日志可以成为一个很好的.能够感知有多

通过分析网站日志 了解搜索引擎变化

作为一名站长不但要懂得如何写原创,如何发外链,还要学会分析网站的日志.站长会分析网站的日志,就能了解你的网站在搜索引擎中是否比较重要.通过网站日志你能得到哪些重要的信息,下面来详细说明下: 一.看网站的抓取情况 1.新站刚上线,看看搜索引擎有没有来你网站抓取了; 2.网站收录异常,或者被k,通过日志可以了解搜索引擎是否还有来光顾你的网站; 3.对于网站的问题进行解决,必须要读懂日志; 二.怎么查找网站日志? 一般在FTP一个名为logs的文件夹,不同的服务器可能日志文件命名不一样,不过一定会包含

网站日志里的秘密 分析网站日志有助于SEO

网站日志可以很好的记录访客和蜘蛛的访问情况,通过网站日志可以很好的了解网站的一些状况,这也是为什么现在很多SEO都会去分析网站日志的原因,但是分析网站日志的人不一定完全了解网站日志,下面我就浅谈一下网站日志里的秘密. 分析网站日志当然需要网站日志分析器,当然现在很多人使用免费的网站日志分析器,但是这些网站日志分析器分析出来的东西很有限,所以说很多网站信息也就被影藏了,下面我就以那种付费的网站日志分析器来阐述. 大家通过普通日志分析器一般都是看有没蜘蛛来过,什么蜘蛛,访问时间,访问哪些了页面.访问

proteus-怎样将protues里元件的封装名称改成Altium+Designer里对应的封装名称

问题描述 怎样将protues里元件的封装名称改成Altium+Designer里对应的封装名称 proteus与altium_designer联合使用 怎样将protues里元件的封装名称改成Altium+Designer里对应的封装名称 解决方案 http://zhidao.baidu.com/link?url=wdwYFVJFpQUJxyNmCCGcbEAjrfE1zcEREz5_5SFYsAuZI0Cbip9WSFeZhMrs9mdCTOYG2_03z5T_gBjlRkVgMq

数据库里修改主键名称

问题描述 数据库里修改主键名称 怎么修改表的主键名称? 创建主键时出错了 invalid ALTER TABLE option, 其他的可以创建,这是什么问题?大神 求解 解决方案 你用的什么数据库 解决方案二: 主键名称合法不,主键名称重复超过2次没 解决方案三: 先删除主键约束,在修改字段,之后再加上主键

搜索引擎蜘蛛IP地址大全

各类搜索引擎蜘蛛IP地址大全 百度蜘蛛           220.181.38.177     220.181.19.*     159.226.50.*     202.108.11.*     202.108.22.*     202.108.23.*     202.108.249.*     202.108.250.*     61.135.145.*     61.135.146.* google蜘蛛    216.239.33.*    216.239.35.*      216.

查看网站日志中搜索引擎蜘蛛的来访记录的方法

摘要: 对于很多做网站的新手来说,都没有经过系统地授课进行网络技术和建站知识学习,做网站都是靠自学,遇到难题在论坛发帖提问,更不会懂得网站优化,对于较基本操作通过网站日志 对于很多做网站的新手来说,都没有经过系统地授课进行网络技术和建站知识学习,做网站都是靠自学,遇到难题在论坛发帖提问,更不会懂得网站优化,对于较基本操作--通过网站日志看蜘蛛来访情况都不知到哪里看,怎么看.前两天看到很多人发帖提问,回复者的答案却比较精炼,不具体,提问者还是云里雾里的,现我就以自己的网站来系统地操作一次,提交给大

关于网站IIS日志分析搜索引擎爬虫说明

 iis默认的日志文件在C:\WINDOWS\system32\LogFiles中,下面是Seoer惜缘的服务器日志,通过查看,就可以了解搜索引擎蜘蛛爬行经过,如: 2008-08-19 00:09:12 W3SVC962713505 203.171.226.111 GET /index.html - 80 - 61.135.168.39 Baiduspider+ (+http://www.baidu.com/search/spider.htm) 200 0 64 1.203.171.226.1

IIS日志分析搜索引擎爬虫记录程序

使用注意: 修改iis.php文件中iis日志的绝对路径 例如:$folder="c:/windows/system32/logfiles/站点日志目录/"; //后面记得一定要带斜杠(/). ( 用虚拟空间的不懂查看你的站点绝对路径?上传个探针查看! 直接查看法:http://站点域名/iis.php 本地查看法:把日志下载到本地 http://127.0.0.1/iis.php ) 注意: //站点日志目录,注意该目录必须要有站点用户读取权限! //如果把日志下载到本地请修改143