qzmjef 发表于 2024-8-25 22:08:05

php菜鸟教程:一招教你怎么样识别真假百度蜘蛛


    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">本文介绍PHP<span style="color: black;">怎样</span>识别蜘蛛的真实性,<span style="color: black;">重点</span>是<span style="color: black;">经过</span><span style="color: black;">有些</span>组合手段对搜索引擎蜘蛛的真实性进行判断,让虚假蜘蛛无处可逃。我<span style="color: black;">晓得</span>部分站长为了安全起见,都会屏蔽假人<span style="color: black;">拜访</span>,<span style="color: black;">然则</span>会对蜘蛛敞开<span style="color: black;">前门</span>,<span style="color: black;">倘若</span><span style="color: black;">此时</span>候有人伪装成蜘蛛,对你的网站为所欲为,那岂不是<span style="color: black;">无</span>一点安全感吗?之前我介绍过PHP<span style="color: black;">怎样</span>识<span style="color: black;">拜访</span>者是不是搜索引擎的<span style="color: black;">设备</span>(参考<span style="color: black;">文案</span>《PHP<span style="color: black;">怎样</span>识别蜘蛛:全网最新最全蜘蛛整理》),<span style="color: black;">这儿</span>就不对识别蜘蛛的<span style="color: black;">办法</span>进行<span style="color: black;">仔细</span>介绍了,本文<span style="color: black;">重点</span>是介绍<span style="color: black;">怎样</span>识别蜘蛛的真假。</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="https://mmbiz.qpic.cn/sz_mmbiz_png/fm7RnX9X5QQGqvEa4sw5aBicVl5Xfv4KgnwfTPBetvDMuaKvnTZdODZY3j8GmYzuy6YUvCSb3EWNjZrN0pxpicag/640?wx_fmt=png&amp;from=appmsg&amp;tp=webp&amp;wxfrom=5&amp;wx_lazy=1&amp;wx_co=1" style="width: 50%; margin-bottom: 20px;"></p>
    <h2 style="color: black; text-align: left; margin-bottom: 10px;"><span style="color: black;"><strong style="color: blue;"><span style="color: black;"><span style="color: black;">1、</span>识别蜘蛛</span></strong></span></h2>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">判断蜘蛛的真假之前,<span style="color: black;">咱们</span>先来识别蜘蛛。为后面的真假判断工作做好服务。</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">$tmp=&nbsp;[.$_SERVER.</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">];</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">//获取<span style="color: black;">拜访</span>者UA,<span style="color: black;">这儿</span>在UA数据前后都加了一个]符号,<span style="color: black;">亦</span><span style="color: black;">能够</span><span style="color: black;">不消</span><span style="color: black;">增多</span>。</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">$tmp=&nbsp;strtolower($tmp);</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">$ipAddress=&nbsp;$_SERVER;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">//获取<span style="color: black;">拜访</span>者的IP<span style="color: black;">位置</span>&nbsp;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">if&nbsp;(strpos($tmp,&nbsp;google)&nbsp;!==false)&nbsp;{</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;$bot=&nbsp;谷歌蜘蛛;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">}&nbsp;else&nbsp;if&nbsp;(strpos($tmp,&nbsp;bing)&nbsp;&gt;&nbsp;0)&nbsp;{</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;$bot=&nbsp;必应蜘蛛;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">}&nbsp;else&nbsp;if&nbsp;(strpos($tmp,&nbsp;baidu)&nbsp;&gt;&nbsp;0)&nbsp;{</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;$bot=&nbsp;百度蜘蛛;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">}&nbsp;else&nbsp;if&nbsp;(strpos($tmp,&nbsp;bytespider)&nbsp;&gt;&nbsp;0)&nbsp;{</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;$bot=&nbsp;头条蜘蛛;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">}</p>
    <h2 style="color: black; text-align: left; margin-bottom: 10px;"><span style="color: black;"><strong style="color: blue;"><span style="color: black;">2、</span><span style="color: black;">经过</span>蜘蛛IP<span style="color: black;">位置</span>判断蜘蛛的真假</strong></span></h2>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">以百度蜘蛛为例,网络上<span style="color: black;">非常多</span>大牛说<span style="color: black;">区别</span>IP段的百度蜘蛛<span style="color: black;">表率</span><span style="color: black;">区别</span>的蜘蛛性质,<span style="color: black;">然则</span>该信息并<span style="color: black;">无</span>得到百度的确认,可能大部分是依靠这些大牛<span style="color: black;">经过</span><span style="color: black;">海量</span>数据分析而得,均是经验之谈。我收集了部分百度蜘蛛的IP,分享在下面:</p>

    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">116.179.32.*:新版百度蜘蛛,高权重段,<span style="color: black;">通常</span>抓取<span style="color: black;">文案</span>页。IP 段位于山西阳泉联通。百度云计算阳泉中心<span style="color: black;">位置于</span>世界最大中文搜索引擎百度<span style="color: black;">机构</span>创始人李彦宏的家乡山西省阳泉经济<span style="color: black;">研发</span>区东区。</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">220.181.108.75:重点抓取更新<span style="color: black;">文案</span>的内页达到 90%,8%抓取首页,2%其他。权重 IP 段,爬过的<span style="color: black;">文案</span>或首页基本 24 小时放出来。 220.181.108.77:专用抓首页 IP 权重段,<span style="color: black;">通常</span>返回代码是 304 0 0 <span style="color: black;">表率</span>未更新。</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">220.181.108.80:专用抓取首页 IP 权重段,<span style="color: black;">通常</span>返回代码是 304 0 0 <span style="color: black;">表率</span>未更新。&nbsp;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">220.181.108.82:抓取tag页面。&nbsp;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">220.181.108.83:专用抓取首页 IP 权重段,<span style="color: black;">通常</span>返回代码是 304 0 0 <span style="color: black;">表率</span>未更新。</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">220.181.108.86:专用抓取首页 IP 权重段,<span style="color: black;">通常</span>返回代码是 304 0 0 <span style="color: black;">表率</span>未更新。&nbsp;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">220.181.108.89:专用抓取首页 IP 权重段,<span style="color: black;">通常</span>返回代码是 304 0 0 <span style="color: black;">表率</span>未更新。&nbsp;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">220.181.108.91:属于综合的,<span style="color: black;">重点</span>抓取首页和内页或其他,属于权重 IP 段,爬过的<span style="color: black;">文案</span>或首页基本 24 小时放出来。</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">220.181.108.92:同上 98%抓取首页,可能还会抓取其他 (不<span style="color: black;">指的是</span>内页),属于权重 IP 段此段爬过的<span style="color: black;">文案</span>或首页基本 24 小时放出来。&nbsp;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">220.181.108.94:专用抓取首页 IP 权重段,<span style="color: black;">通常</span>返回代码是 304 0 0 <span style="color: black;">表率</span>未更新。&nbsp;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">220.181.108.93:重点抓取内页,爬过的<span style="color: black;">文案</span>或首页基本 24 小时放出来。</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">220.181.108.95:这个是百度抓取首页的专用 IP,如是 220.181.108 段的话,基本<span style="color: black;">来讲</span>你的网站会天天隔夜快照。</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">...此处省略一万个汉字...</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">这儿</span>就不对蜘蛛的IP段进行全面的展示了,免得把<span style="color: black;">专题</span>扯远了,<span style="color: black;">大众</span>可以去网上<span style="color: black;">自动</span>收集百度蜘蛛的所有IP<span style="color: black;">位置</span>。接着上面的PHP代码:</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">$baiduips=;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">if&nbsp;($bot==&nbsp;百度蜘蛛)&nbsp;{</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;foreach($baiduips&nbsp;as&nbsp;$value)&nbsp;{</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;if&nbsp;($ipAddress==&nbsp;$value)&nbsp;{</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;$bot=&nbsp;百度蜘蛛[真];</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;}&nbsp;else&nbsp;{</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;$bot=&nbsp;百度蜘蛛[假];</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;}</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;}</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">运用</span>IP对蜘蛛的真假性进行判断需要你搜集的IP足够全面和准确,否则会影响<span style="color: black;">咱们</span>对蜘蛛真假判断的准确性。其他蜘蛛的IP需您<span style="color: black;">自动</span>进行收集。</p>
    <h2 style="color: black; text-align: left; margin-bottom: 10px;"><span style="color: black;"><strong style="color: blue;"><span style="color: black;"><span style="color: black;">3、</span><span style="color: black;">经过</span><span style="color: black;">查找</span>蜘蛛IP的DNS记录进行判断</span></strong></span></h2>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">倘若</span>你搜集的蜘蛛IP数据<span style="color: black;">不足</span>全面,但你依然<span style="color: black;">运用</span>了IP<span style="color: black;">位置</span>比对<span style="color: black;">办法</span>来进行判断,<span style="color: black;">咱们</span>还<span style="color: black;">能够</span><span style="color: black;">经过</span><span style="color: black;">查找</span>蜘蛛IP的DNS记录进行判断,<span style="color: black;">增多</span><span style="color: black;">靠谱</span>性。想要<span style="color: black;">查找</span>一个IP的DNS记录,<span style="color: black;">咱们</span><span style="color: black;">能够</span><span style="color: black;">运用</span>PHP内置函数gethostbyaddr()来进行<span style="color: black;">查找</span>,<span style="color: black;">倘若</span><span style="color: black;">无</span><span style="color: black;">查找</span>到IP的DNS记录就会返回原IP。接着上面的PHP代码:</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">if&nbsp;($bot=&nbsp;百度蜘蛛[假])&nbsp;{</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;$domainName=&nbsp;gethostbyaddr($ipAddress);&nbsp;//获取IP的DNS记录&nbsp;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">if&nbsp;(strpos($domainName,&nbsp;baidu.com)&nbsp;!==false)&nbsp;{</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;$bot=&nbsp;百度蜘蛛[真];</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;}&nbsp;else&nbsp;{</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;$bot=&nbsp;百度蜘蛛[假];</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;}</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">}</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">以上代码只是对百度蜘蛛进行了简单的判断,本文分享几个其他的<span style="color: black;">重点</span>蜘蛛的DNS记录:</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;">百度蜘蛛:</strong>DNS记录需<span style="color: black;">包含</span>(baidu.com)</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;">谷歌蜘蛛:</strong>DNS记录需<span style="color: black;">包含</span>(googlebot.com)</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;">必应蜘蛛:</strong>DNS记录需<span style="color: black;">包含</span>(search.msn.com)</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;">头条蜘蛛:</strong>DNS记录需<span style="color: black;">包含</span>(compute.amazonaws.com)</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">以上这四个蜘蛛是分析于我网站的蜘蛛数据,至于同一个蜘蛛<span style="color: black;">是不是</span>还有其他的DNS记录,需要<span style="color: black;">更加多</span>的数据进行分析,<span style="color: black;">或</span>搜索引擎官方的通告。</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">经过</span><span style="color: black;">经过</span><span style="color: black;">查找</span>蜘蛛IP的DNS记录进行判断蜘蛛的真假<span style="color: black;">是不是</span>可行呢?<span style="color: black;">咱们</span><span style="color: black;">能够</span>写个简单的代码进行测试一下:</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">//&nbsp;假设$ip是您想<span style="color: black;">查找</span>的IP<span style="color: black;">位置</span>&nbsp;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">$ipAddress=&nbsp;220.181.108.75;&nbsp;//&nbsp;执行逆向&nbsp;DNS&nbsp;<span style="color: black;">查找</span>&nbsp;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">$domainName=&nbsp;gethostbyaddr($ipAddress);</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">//&nbsp;打印结果</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">if&nbsp;($domainName)&nbsp;{</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">echo&nbsp;"IP<span style="color: black;">位置</span>:".$ipAddress.</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;DNS记录:.$domainName;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">}&nbsp;else&nbsp;{</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;echo&nbsp;"无DNS记录:".$ipAddress;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">}</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">if&nbsp;($ipAddress==&nbsp;gethostbyaddr($ipAddress))&nbsp;{</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;echo&nbsp;无耻!;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">}&nbsp;else&nbsp;{</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">&nbsp;&nbsp;&nbsp;&nbsp;echo&nbsp;牛逼!;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">}&nbsp;?&gt;</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">注释:该代码取了一个<span style="color: black;">咱们</span>前面收集的百度蜘蛛IP中的几个个IP进行测试。</p>

    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;">测试一:百度IP220.181.108.75</strong></p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">测试结果:IP<span style="color: black;">位置</span>:220.181.108.75DNS记录:baiduspider-220-181-108-75.crawl.baidu.com</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="https://mmbiz.qpic.cn/sz_mmbiz_png/fm7RnX9X5QQGqvEa4sw5aBicVl5Xfv4Kge1MEPUtU8JfLjLiaghGlBlibEQgHAWKMHTW2VNUzR7aUTiakARFDaY1Bg/640?wx_fmt=png&amp;from=appmsg&amp;tp=webp&amp;wxfrom=5&amp;wx_lazy=1&amp;wx_co=1" style="width: 50%; margin-bottom: 20px;"></p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;">测试二:百度IP220.181.108.91</strong></p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">测试结果:IP<span style="color: black;">位置</span>:220.181.108.91DNS记录:baiduspider-220-181-108-91.crawl.baidu.com牛逼!</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="https://mmbiz.qpic.cn/sz_mmbiz_png/fm7RnX9X5QQGqvEa4sw5aBicVl5Xfv4KgmfbsFKFaTvks9XiattHLxsibP3ticpZMdeGib8wBQVCiaTrWu39utqBUVkg/640?wx_fmt=png&amp;from=appmsg&amp;tp=webp&amp;wxfrom=5&amp;wx_lazy=1&amp;wx_co=1" style="width: 50%; margin-bottom: 20px;"></p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;">测试三:谷歌蜘蛛IP66.249.71.9</strong></p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">测试结果:IP<span style="color: black;">位置</span>:66.249.71.9DNS记录:crawl-66-249-71-9.googlebot.com牛逼!</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="https://mmbiz.qpic.cn/sz_mmbiz_png/fm7RnX9X5QQGqvEa4sw5aBicVl5Xfv4KgS3EvgLYW87YDXVCSKAYfglHXlM1Tz1U3o4iahcbeVkTHxj1cujCywPQ/640?wx_fmt=png&amp;from=appmsg&amp;tp=webp&amp;wxfrom=5&amp;wx_lazy=1&amp;wx_co=1" style="width: 50%; margin-bottom: 20px;"></p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">可见该<span style="color: black;">办法</span>对<span style="color: black;">有些</span>正规的蜘蛛还是比较适用的。<span style="color: black;">况且</span>比蜘蛛IP比对法省事得多。</p>
    <h2 style="color: black; text-align: left; margin-bottom: 10px;"><span style="color: black;">4、</span>总结</h2>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">本文<span style="color: black;">重点</span>介绍了<span style="color: black;">经过</span>蜘蛛IP<span style="color: black;">位置</span>判断蜘蛛的真假和<span style="color: black;">经过</span><span style="color: black;">查找</span>蜘蛛IP的DNS记录来判断蜘蛛的真假,<span style="color: black;">一般</span><span style="color: black;">状况</span>下,两种发放<span style="color: black;">一块</span>用,<span style="color: black;">能够</span><span style="color: black;">供给</span>准确率。基本上<span style="color: black;">能够</span>排除大部分<span style="color: black;">经过</span>技术手段伪造的假蜘蛛。</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">在<span style="color: black;">这儿</span>我还分析几个<span style="color: black;">反常</span>的UA特征:</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">Wordup-1</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">PycURL</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">curl/7.29.0</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">okhttp</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">python</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">Python-urllib/3.10</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">Dalvik/2.1.0</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">Go-http-client/1.1</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">Mozilla/5.0&nbsp;(Windows&nbsp;NT&nbsp;10.0;&nbsp;WOW64;&nbsp;rv:48.0)&nbsp;Gecko/20100101&nbsp;Firefox/48.0</p>
    <p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">倘若</span>你的UA数据能够匹配这些UA特征,<span style="color: black;">能够</span>直接封禁,<span style="color: black;">经过</span>我<span style="color: black;">自己</span>网站的数据进行分析判断,<span style="color: black;">她们</span>基本<span style="color: black;">无</span>干啥好事。当然UA数据为空的<span style="color: black;">亦</span><span style="color: black;">能够</span>屏蔽掉,基本都是<span style="color: black;">设备</span>。</p>




张露zhang 发表于 2024-9-6 01:33:20

期待你更多的精彩评论,一起交流学习。

nqkk58 发表于 2024-10-2 17:50:37

外链发布论坛学习网络优化SEO。

m5k1umn 发表于 2024-10-3 02:52:51

楼主的文章非常有意义,提升了我的知识水平。
页: [1]
查看完整版本: php菜鸟教程:一招教你怎么样识别真假百度蜘蛛