Pages

Sunday, August 5, 2012

Robots

1) With so much talk about getting into search engines, and achieving high rankings on a particular
   search engine, it may seem crazy to do anything which would prevent this from happening. However, there are times when you have a web site or page(s) within a web site which you don't want to end up in a search engine.

2) It might be a page of family photos, where the URL is given out only to family and friends. It
   might be a resource page for a professor to give his/her students additional information. Or, it
   might be a temporary page to provide information, such as a list of links, but one which you don't
   want to include as part of your site, for fear of adversely impacting your page rank.


3) You may say you don't have to worry about this because you haven't submitted your site to any
   search engines. However, if someone else has put a link on their site to your site, the spiders
   that crawl their site will eventually end up crawling your site.

4) There is code you can put into your HTML which will prevent the bots from spidering your page(s). This code is put in the header of your web page(s) (meaning between the <head> and </head> tags), in the form of a meta tag (and you thought meta tags were for keywords only).

5)Here are four examples of how this code will look;

   <meta name="robots" content="index,follow">

   <meta name="robots" content="noindex,follow">

   <meta name="robots" content="index,nofollow">

   <meta name="robots" content="noindex,nofollow">


6) There are two parameters; "index" and "follow". "Index" refers to whether or not you want that
   particular page to be indexed. "Follow" refers to whether or not you want the spider to follow the
   links on that page. You would decide whether you wanted the page indexed, and whether you wanted the links followed. Based on that decision, you would chose one of the above options. The final code would look something like this:

<head>

<title>New Page</title>

<meta name="robots" content="noindex,nofollow">

</head>

No comments:

Post a Comment