
如何做搜尋引擎蜘蛛日誌分析
搜尋引擎蜘蛛日誌檔案是一種非常強大但未被站長充分利用的檔案,分析它可以獲取有關每個搜尋引擎如何爬取網站內容的相關資訊點,及檢視搜尋引擎蜘蛛在一段時間內的行為。
IP地址(84) | 伺服器名稱 | 所屬國家 |
---|---|---|
87.117.229.53 | scan08.fgxintel.com | GB |
88.150.220.8 | scan15.fgxintel.com | GB |
78.129.201.60 | scan121.fgxintel.com | GB |
109.169.85.38 | scan01.fgxintel.com | GB |
88.150.240.193 | scan98.fgxintel.com | GB |
176.227.202.15 | scan04.fgxintel.com | GB |
78.129.237.30 | scan123.fgxintel.com | GB |
87.117.234.176 | scan115.fgxintel.com | GB |
88.150.240.30 | scan13.fgxintel.com | GB |
109.169.15.85 | scan129.fgxintel.com | GB |
88.150.240.220 | scan122.fgxintel.com | GB |
78.129.221.11 | scan107.fgxintel.com | GB |
109.169.10.14 | 109.169.10.14 | GB |
109.169.10.8 | 109.169.10.8 | GB |
78.129.237.113 | scan111.fgxintel.com | GB |
78.129.221.10 | scan105.fgxintel.com | GB |
78.129.253.107 | scan20.fgxintel.com | GB |
78.129.212.66 | scan19.fgxintel.com | GB |
109.169.87.58 | scan09.fgxintel.com | GB |
78.129.221.32 | scan113.fgxintel.com | GB |
80.84.57.6 | scan07.fgxintel.com | GB |
109.169.10.7 | 109.169.10.7 | GB |
109.169.10.11 | 109.169.10.11 | GB |
88.150.241.10 | scan101.fgxintel.com | GB |
109.169.10.10 | 109.169.10.10 | GB |
109.169.10.6 | 109.169.10.6 | GB |
109.169.10.2 | 109.169.10.2 | GB |
109.169.87.34 | scan12.fgxintel.com | GB |
109.169.10.3 | 109.169.10.3 | GB |
88.150.230.191 | scan117.fgxintel.com | GB |
78.129.221.12 | scan109.fgxintel.com | GB |
78.129.237.56 | scan126.fgxintel.com | GB |
88.150.241.123 | scan102.fgxintel.com | GB |
88.150.241.127 | scan108.fgxintel.com | GB |
109.169.10.9 | 109.169.10.9 | GB |
176.227.202.14 | scan18.fgxintel.com | GB |
109.169.10.4 | 109.169.10.4 | GB |
109.169.10.12 | 109.169.10.12 | GB |
109.169.86.65 | scan124.fgxintel.com | GB |
109.169.10.13 | 109.169.10.13 | GB |
78.129.237.55 | scan125.fgxintel.com | GB |
109.169.87.7 | scan16.fgxintel.com | GB |
87.117.229.54 | scan17.fgxintel.com | GB |
88.150.240.189 | scan93.fgxintel.com | GB |
87.117.234.60 | scan92.fgxintel.com | GB |
88.150.220.6 | scan10.fgxintel.com | GB |
88.150.241.125 | scan104.fgxintel.com | GB |
88.150.241.126 | scan106.fgxintel.com | GB |
88.150.240.31 | scan14.fgxintel.com | GB |
3.69.48.48 | ec2-3-69-48-48.eu-central-1.compute.amazonaws.com | DE |
109.169.87.13 | scan11.fgxintel.com | GB |
109.169.10.5 | 109.169.10.5 | GB |
18.206.190.112 | ec2-18-206-190-112.compute-1.amazonaws.com | US |
91.213.50.8 | 91.213.50.8 | RU |
83.147.52.42 | 83.147.52.42 | US |
105.110.165.177 | 105.110.165.177 | DZ |
185.195.232.249 | 185.195.232.249 | GB |
54.218.13.219 | ec2-54-218-13-219.us-west-2.compute.amazonaws.com | US |
IP地址(2) | 伺服器名稱 | 所屬國家 |
---|---|---|
78.129.221.11 | scan107.fgxintel.com | GB |
88.150.241.124 | scan103.fgxintel.com | GB |
IP地址(39) | 伺服器名稱 | 所屬國家 |
---|---|---|
109.169.15.78 | scan118.fgxintel.com | GB |
78.129.221.85 | scan116.fgxintel.com | GB |
3.86.51.122 | ec2-3-86-51-122.compute-1.amazonaws.com | US |
88.150.240.30 | scan13.fgxintel.com | GB |
88.150.220.6 | scan10.fgxintel.com | GB |
88.150.241.10 | scan101.fgxintel.com | GB |
109.169.87.7 | scan16.fgxintel.com | GB |
88.150.220.8 | scan15.fgxintel.com | GB |
78.129.237.56 | scan126.fgxintel.com | GB |
109.169.87.34 | scan12.fgxintel.com | GB |
一般不攔截。此類爬蟲通常是網站所有者提交掃描請求才會出現。如果攔截,則無法執行相應的掃描動作。
您可以通過在網站的 robots.txt 中設定使用者代理訪問規則來遮蔽 Foregenix crawler 或限制其訪問許可權。我們建議安裝 Spider Analyser 外掛,以檢查它是否真正遵循這些規則。
# robots.txt # 下列程式碼一般情況可以攔截該代理 User-agent: Foregenix crawler Disallow: /
您無需手動執行此操作,可通過我們的 Wordpress 外掛 Spider Analyser 來攔截不必要的蜘蛛或者爬蟲。