Overview: Block Search Engine Crawling and Indexing
This feature is supported for: Robots.txt, the capability to prevent search engine crawling on some of all shop pages is available in version 3.0 and higher. Adding no index rules to the entire shop is available in all versions of ecommerce.
This feature is NOT supported for: Robots.txt, the capability to prevent search engine crawling on some of all shop pages is not available in any version prior to 3.0.
Key Terms
Robots.txt
Definition from Google
The robots
meta
tag lets you utilize a granular, page-specific approach to controlling how an individual page should be indexed and served to users in Google Search results. Place the robotsmeta
tag in the<head>
section of a given page…The robots
meta
tag instructs search engines not to show the page in search results. The value of thename
attribute (robots
) specifies that the rule applies to all crawlers.
No Index Rules
Definition from Google
noindex
is a rule set with either a<meta>
tag or HTTP response header and is used to prevent indexing content by search engines that support thenoindex
rule, such as Google. When Googlebot crawls that page and extracts the tag or header, Google will drop that page entirely from Google Search results, regardless of whether other sites link to it.
Feature Description
Resorts often have hidden categories and products exclusively for certain guests. To control access and guide customers through the marketing site before shopping, blocking the shop from search engine crawling and indexing is helpful.
Using robots.txt, resorts can set rules to prevent search engines from new crawling of pages. However, if a page has already been crawled and indexed, it may still appear in search results. To avoid this, a "no index" rule can be applied to the entire shop, preventing all shop pages from showing up in search results, regardless of prior crawling and indexing.
For newly created products or categories that shouldn't be in search results, adding them to the "no index" rules in robots.txt before publishing prevents them from appearing in search results. Keeping up with robots.txt rules allows you to keep most shop pages indexable and searchable while ensuring that newly created products and categories stay hidden from search engine results.
For more detailed information on the differences between search engine crawling and indexing, refer to Google's documentation.
NOTE: Organic search accounts for about 1.8% of traffic across all Aspenware sites, so making your shop completely unavailable to search engines could restric traffic and influence metrics.