robots.txt:用户代理:Googlebot 不允许:/Google 仍在编制索引

标签 robots.txt googlebot google-index

查看本站的robots.txt:

fr2.dk/robots.txt

内容是:

User-Agent: Googlebot
Disallow: /

那应该告诉谷歌不要索引该网站,不是吗?

如果为真,为什么该网站会出现在谷歌搜索中?

最佳答案

除了必须等待,因为 Google 的索引更新需要一些时间,还要注意,如果您有其他网站链接到您的网站,仅 robots.txt 不足以删除您的网站。

引用 Google 的支持页面 "Remove a page or site from Google's search results" :

If the page still exists but you don't want it to appear in search results, use robots.txt to prevent Google from crawling it. Note that in general, even if a URL is disallowed by robots.txt we may still index the page if we find its URL on another site. However, Google won't index the page if it's blocked in robots.txt and there's an active removal request for the page.



上述文件中还提到了一种可能的替代解决方案:

Alternatively, you can use a noindex meta tag. When we see this tag on a page, Google will completely drop the page from our search results, even if other pages link to it. This is a good solution if you don't have direct access to the site server. (You will need to be able to edit the HTML source of the page).

关于robots.txt:用户代理:Googlebot 不允许:/Google 仍在编制索引,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/4769140/

相关文章:

robots.txt - "/recaptcha/api2/logo_48.png"被谷歌屏蔽

performance - 如何防止Googlebot淹没网站?

googlebot - google bot rel ="nofollow"停止关注多长时间

magento - 通过 Google 获取仅显示 "q"

indexing - 如何索引在数据库中自动创建的页面

google-maps-api-3 - 抱歉,我们这里没有图像... - Google 索引页面 | WMT内容关键词|谷歌机器人

seo - Robots.txt http ://example. com 与 http ://www. example.com

seo - 需要使用同一目录级别的 robots.txt 来阻止子域

seo - 来自早期被谷歌实时索引的网站的页面

seo - 如何为移动端和桌面端不同页面的网站创建站点地图?