Archives for September 2009

Ways to Help Bing’s Bot Using the Robots.txt File

Robots.txt files can be used to block bots (the most common use) and at the same time specific which files can be crawled. To do this you should use the Allow directive. See example below.

Allow: /private/public.doc
Disallow: /private/

According to Bing if “there is some logical confusion and both Allow and Disallow directives apply to a URL, the Allow directive takes precedent.”

Other things you can or should do with your robots.txt file include:

Wildcards – Wildcards can be used in a variety of ways in robots.txt files such as :

  • Blocking bots from accessing all URLs that contain a specific directory name;
  • Blocking bots from accessing all URLs that end with a specific string regardless of the directory where it is found; and
  • Blocking bots from accessing all URLs that contain a specific line anywhere in their URL string.

All the above can be done using the “*”character, which is used to represent characters appended to the strong of a URL. To filter by file extension the character “$”must be used.

XML Sitemaps – Make sure to add a reference to your sitemap at the end of your robots.txt file to make it easy for bots to make its way through your site. To reference a sitemap use the following syntax:

Sitemap: http://www.your-url.com/sitemap.xml

File format – Make sure to save your robots.txt file in a standard file format such as ASCII or UTF-8.

Validate at Bing’s Webmaster Central – They have an online robots.txt validation tool. If you are not a member then join or at least use other online validation tools.

Source: Prevent a bot from getting “lost in space” (SEM 101)

Filed in: Bing, SEO Tips

by: Noemi

2 Comments

Image Tips from Google

Last post I gave a few tips on how to rank well in Google and Bing’s image search. Here are some more tips straight from Google’s Developer Programs Tech Lead Maile Ohye. According to Maile Ohye when inserting images in your web pages you should:

1. Use large sized images with good aspect ratio – Just in case you don’t know what an aspect ratio is, it is the width of an image divided by its height. The aspect ratio of an image is important because if you use images with unequal aspect ratios Google will prefer to display “better images” as an insert on the web as well as news search results. Using images with standard aspect ratios will affect not only your standing in Image Search but also in Google Web.

2. Use descriptive captions and alt text – I have mentioned this before so I don’t really need to expound on this matter but I just want to reiterate the need to use KEYWORDS in the captions and alt text.

3. Keep the image near the title/heading – If you are using several images in one page keep each image close to the paragraphs/text body it is related to.

4. Keep the images inline and use non-clickable version

5. Use jpeg format – Apparently Google prefers the jpeg format over other formats (e.g. PNG, BMP). I still haven’t figured out why but since they have expressly stated this fact then it is better to just obey.

Filed in: SEO Tips

by: Noemi

1 Comment