タイトル「robots.txt」を検索 - はてなブックマーク

1 - 6 件 / 6件

新着順人気順

絞り込み

検索対象
ブックマーク数
期間
セーフサーチ

robots.txtの検索結果1 - 6 件 / 6件

辻正浩 | Masahiro Tsuji on Twitter: "よくあるrobots.txtの誤りで、致命的なトラブルになる事もあるのにあまり知られていない仕様の紹介で連ツイート。誤りは表に出ることが少ないので日本語で実例紹介を見たことが無いのですが、公共の面も持つサイトでの誤りを発見したので注意喚起意図で実例を紹介します。(続く"
- 443 users
- twitter.com/tsuj
- テクノロジー
- 2022/10/29
- SEO
- あとで読む
- web
- robots.txt
- 開発
- web制作
- トラブル
- google
- webサービス
Google's robots.txt Parser is Now Open Source
- 194 users
- opensource.googleblog.com
- テクノロジー
- 2019/07/02
The latest news from Google on open source releases, major projects, events, and student outreach programs. Originally posted on the Google Webmaster Central Blog For 25 years, the Robots Exclusion Protocol (REP) was only a de-facto standard. This had frustrating implications sometimes. On one hand, for webmasters, it meant uncertainty in corner cases, like when their text editor included BOM char
- クローラー
- google
- あとで読む
- clawler
- robots.txt
- Developers
- library
- web
- OSS
- C++
robots.txtでのnoindexをGoogleが完全にサポート終了、2019年9月1日から
- 83 users
- www.suzukikenichi.com
- テクノロジー
- 2019/07/03
[レベル: 上級] robots.txt の noindex 構文のサポートを終了することを Google は告知しました。 REP のインターネット標準化にともなう決定です。機能していたが未サポートだった robots.txt の noindex クローラのクロールを拒否するために robots.txt では Disallow 構文を用います。 User-agent: * Disallow: /dontcrawl.html Google では、クロールではなくインデックスを拒否するために Noindex 構文が使えていました。 User-agent: Googlebot Noindex: /dontindex.html HTML の head セクションで使える noindex robots meta タグと同じ働きをします。しかし、robots.txt での noindex を G
- seo
- google
- あとで読む
- Web
- 通信
- network
- ネット
- 開発
Google Search Console、「robots.txt によりブロックされましたが、インデックスに登録しました」への対処方法
- 67 users
- u-ff.com
- テクノロジー
- 2020/04/09
Googleがrobots.txtを無視する robots.txtというファイルをブログに設置すると、特定のURLをGoogleがクロールしないように制御できます。 ttps://u-ff.com/korona-kannikensakitto-part1/?replytocom=64 ttps://u-ff.com/korona-kannikensakitto-part3/?replytocom=81 ttps://u-ff.com/crawl-budget/?replytocom=162 ttps://u-ff.com/crawl-budget/?replytocom=166 上記のようなURLへクロールしてほしくなかったので、robots.txtに Disallow: /*?replytocom=* という設定を追加しました。詳しい設定手順は下記をご参照ください。
Googleがウェブサイト管理に欠かせない「robots.txt」のインターネット標準化を推進
- 55 users
- gigazine.net
- テクノロジー
- 2019/07/02
Googleやbingといった検索エンジンがさまざまなサイトの情報を検索できるのは、クローラーと呼ばれるボットが自動的にサイトを巡回するおかげ。このクローラーによるサイト巡回をサイトの管理者側で制御するために必要なのが「robots.txt」と呼ばれるテキストファイルです。20年以上使われながらも正式に標準化されていなかったrobots.txtについて、Googleがインターネット標準化にむけて動き出しています。 draft-rep-wg-topic-00 - Robots Exclusion Protocol https://tools.ietf.org/html/draft-rep-wg-topic-00 Official Google Webmaster Central Blog: Formalizing the Robots Exclusion Protocol Specifica
GitHub - google/robotstxt: The repository contains Google's robots.txt parser and matcher as a C++ library (compliant to C++11).
- 50 users
- github.com/google
- テクノロジー
- 2019/07/01
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
- parser
- C++
- google
- library
- github
- あとで読む