Control Which Areas Of Your Site Are Indexed With A Robots.txt File
Sometimes there are areas of your site that you don’t want search engines to index, such as admin files or images. By creating a robots.txt file, you can tell search engine robots which pages on your website should be crawled and, consequently, indexed.
A robots.txt file can be created quickly with Notepad or any other plain-text editor. If you are using WordPress, a sample robots.txt file would be:
User-agent: *
Disallow: /wp-
Disallow: /feed/
Disallow: /trackback/
“User-agent: *” means that all search bots (from Google, Yahoo, MSN and so on) should follow those instructions when crawling your website. Unless your website is complex, you will not need to set different instructions for different spiders.
“Disallow: /wp-” makes sure that search engines will not crawl the WordPress files. This line excludes every file and folder whose path starts with “/wp-” (for example /wp-admin/ and /wp-content/) from crawling, keeping duplicate content and admin files out of the index.
If you are not using WordPress, just replace the Disallow lines with the files or folders on your website that should not be crawled, for instance:
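To see how the prefix matching described above plays out, here is a minimal sketch that feeds the sample rules to Python's standard urllib.robotparser module and asks whether a few paths may be crawled. The helper name is_crawlable, the "Googlebot" agent string and the test paths are illustrative assumptions, not part of the original tip, and real crawlers may interpret some rules slightly differently from Python's parser.

```python
from urllib.robotparser import RobotFileParser

# The sample WordPress rules shown earlier in this post.
SAMPLE_RULES = """\
User-agent: *
Disallow: /wp-
Disallow: /feed/
Disallow: /trackback/
"""


def is_crawlable(path, rules=SAMPLE_RULES, agent="Googlebot"):
    """Return True if `agent` may crawl `path` under `rules` (hypothetical helper)."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network access is needed.
    parser.parse(rules.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    # Illustrative paths only; substitute paths from your own site.
    for path in ["/wp-admin/", "/wp-login.php", "/feed/", "/about/"]:
        status = "allowed" if is_crawlable(path) else "blocked"
        print(path, "->", status)
```

Because “User-agent: *” applies to every bot, the agent name passed in does not change the result here: the first three paths are reported as blocked and /about/ as allowed.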
User-agent: *
Disallow: /images/
Disallow: /cgi-bin/
Disallow: /any other folder to be excluded/
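If you have several folders to exclude, the file can also be generated rather than typed by hand. The sketch below is one possible way to do it; build_robots_txt is a hypothetical helper and the folder names are only examples.

```python
def build_robots_txt(excluded_paths, agent="*"):
    """Return robots.txt text that disallows each path for `agent` (hypothetical helper)."""
    lines = [f"User-agent: {agent}"]
    for path in excluded_paths:
        # Rules are matched against the URL path, so each one must start with "/".
        if not path.startswith("/"):
            path = "/" + path
        lines.append(f"Disallow: {path}")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Example folders only; list the ones that apply to your own site.
    print(build_robots_txt(["/images/", "cgi-bin/"]), end="")
```

Saving the returned text as robots.txt gives a file in the same shape as the example above.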
After you have created the robots.txt file, just upload it to your site’s root directory and you are done!
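The root directory matters because crawlers request the file at a fixed address, /robots.txt on your host, and ignore copies placed in subfolders. As a rough sketch, the helper below (robots_url is a made-up name, and example.com is a placeholder domain) works out where the file must be reachable for any page on a site.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url):
    """Return the robots.txt URL for the site that serves `page_url` (hypothetical helper)."""
    parts = urlsplit(page_url)
    # Keep the scheme and host, replace the path, and drop query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


if __name__ == "__main__":
    # Placeholder URL; use a page from your own site.
    print(robots_url("https://example.com/blog/2008/some-post/?p=1"))
```

Opening the resulting address in a browser after uploading is a quick way to confirm the file is in the right place.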
Thanks to DailyBlogTips for today’s tip


