
如何做搜尋引擎蜘蛛日誌分析
搜尋引擎蜘蛛日誌檔案是一種非常強大但未被站長充分利用的檔案,分析它可以獲取有關每個搜尋引擎如何爬取網站內容的相關資訊點,及檢視搜尋引擎蜘蛛在一段時間內的行為。
IP地址(35) | 伺服器名稱 | 所屬國家 |
---|---|---|
52.112.95.38 | 52.112.95.38 | US |
52.114.14.71 | 52.114.14.71 | SG |
52.114.77.236 | 52.114.77.236 | IE |
52.114.75.216 | 52.114.75.216 | NL |
52.114.32.28 | 52.114.32.28 | JP |
52.114.128.147 | 52.114.128.147 | US |
52.114.142.71 | 52.114.142.71 | US |
52.114.14.102 | 52.114.14.102 | SG |
52.114.75.71 | ? | NL |
52.114.128.37 | 52.114.128.37 | US |
52.115.252.9 | ? | US |
52.114.77.26 | ? | IE |
52.114.32.212 | 52.114.32.212 | JP |
20.53.76.223 | 20.53.76.223 | AU |
52.114.7.6 | 52.114.7.6 | HK |
52.112.112.150 | 52.112.112.150 | US |
52.112.112.120 | 52.112.112.120 | US |
52.112.114.122 | 52.112.114.122 | US |
52.115.248.9 | 52.115.248.9 | US |
52.112.74.60 | 52.112.74.60 | US |
52.112.125.8 | 52.112.125.8 | JP |
52.112.95.132 | 52.112.95.132 | US |
52.112.39.132 | 52.112.39.132 | US |
52.112.39.133 | 52.112.39.133 | US |
52.123.190.60 | 52.123.190.60 | US |
52.123.138.160 | 52.123.138.160 | IE |
52.123.138.164 | 52.123.138.164 | IE |
52.123.138.200 | 52.123.138.200 | IE |
52.123.190.124 | 52.123.190.124 | US |
52.123.138.236 | 52.123.138.236 | IE |
52.112.49.104 | 52.112.49.104 | MY |
52.112.74.61 | 52.112.74.61 | JP |
52.123.190.88 | 52.123.190.88 | US |
52.112.49.112 | 52.112.49.112 | MY |
52.112.103.76 | 52.112.103.76 | FR |
52.123.190.89 | 52.123.190.89 | US |
52.112.103.72 | 52.112.103.72 | FR |
52.123.190.36 | 52.123.190.36 | US |
52.112.95.134 | 52.112.95.134 | US |
52.112.49.197 | 52.112.49.197 | MY |
52.112.95.133 | 52.112.95.133 | US |
52.123.145.90 | 52.123.145.90 | FR |
52.123.138.237 | 52.123.138.237 | IE |
52.112.49.156 | 52.112.49.156 | MY |
52.123.138.201 | 52.123.138.201 | IE |
52.123.190.126 | 52.123.190.126 | US |
52.112.126.96 | 52.112.126.96 | FR |
52.123.190.90 | 52.123.190.90 | US |
52.123.138.202 | 52.123.138.202 | IE |
52.123.190.125 | 52.123.190.125 | US |
52.112.49.196 | 52.112.49.196 | MY |
44.206.124.255 | ec2-44-206-124-255.compute-1.amazonaws.com | US |
44.217.173.33 | ec2-44-217-173-33.compute-1.amazonaws.com | US |
100.29.43.167 | ec2-100-29-43-167.compute-1.amazonaws.com | US |
52.112.103.112 | 52.112.103.112 | FR |
52.112.125.9 | 52.112.125.9 | JP |
44.220.233.222 | ec2-44-220-233-222.compute-1.amazonaws.com | US |
37.187.5.192 | ns3126614.ip-37-187-5.eu | FR |
52.112.49.157 | 52.112.49.157 | MY |
IP地址(4) | 伺服器名稱 | 所屬國家 |
---|---|---|
52.114.128.147 | 52.114.128.147 | US |
52.112.95.38 | 52.112.95.38 | US |
52.114.75.216 | 52.114.75.216 | NL |
52.114.14.71 | 52.114.14.71 | SG |
IP地址(46) | 伺服器名稱 | 所屬國家 |
---|---|---|
3.209.61.81 | ec2-3-209-61-81.compute-1.amazonaws.com | US |
52.114.14.71 | 52.114.14.71 | SG |
52.114.32.212 | 52.114.32.212 | JP |
34.239.74.52 | ec2-34-239-74-52.compute-1.amazonaws.com | US |
52.112.95.38 | 52.112.95.38 | US |
52.114.77.236 | 52.114.77.236 | IE |
52.114.75.216 | 52.114.75.216 | NL |
52.114.75.71 | ? | NL |
34.238.206.169 | ec2-34-238-206-169.compute-1.amazonaws.com | US |
20.53.76.223 | 20.53.76.223 | AU |
這取決於你。數字存檔通常是為了儲存歷史記錄。如果你出於某種原因不想成為歷史記錄的一部分,你可以攔截這型別的蜘蛛爬蟲。
您可以通過在網站的 robots.txt 中設定使用者代理訪問規則來遮蔽 SkypeUriPreview 或限制其訪問許可權。我們建議安裝 Spider Analyser 外掛,以檢查它是否真正遵循這些規則。
# robots.txt # 下列程式碼一般情況可以攔截該代理 User-agent: SkypeUriPreview Disallow: /
您無需手動執行此操作,可通過我們的 Wordpress 外掛 Spider Analyser 來攔截不必要的蜘蛛或者爬蟲。