<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Best Blogs Asia &#187; robots text file</title>
	<atom:link href="http://www.bestblogs.asia/tags/robots-text-file/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bestblogs.asia</link>
	<description>Showcasing Internet Marketing in Asia, Asian Bloggers, giving advice on improving blogs and making money online.</description>
	<lastBuildDate>Wed, 22 Sep 2010 11:31:37 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>How to use Robots.txt</title>
		<link>http://www.bestblogs.asia/robots-file.php</link>
		<comments>http://www.bestblogs.asia/robots-file.php#comments</comments>
		<pubDate>Tue, 21 Apr 2009 03:27:44 +0000</pubDate>
		<dc:creator>Best Blogs</dc:creator>
				<category><![CDATA[Blog Tools]]></category>
		<category><![CDATA[Robots file]]></category>
		<category><![CDATA[robots text file]]></category>
		<category><![CDATA[we are the robots]]></category>

		<guid isPermaLink="false">http://www.bestblogs.asia/?p=415</guid>
		<description><![CDATA[A guide on how to use the robots.txt file effectively and how it can be used to promote your site.]]></description>
			<content:encoded><![CDATA[<p><strong>A robots.txt file is a plain text file placed in your site&#8217;s main (root) directory that gives instructions to the spiders visiting your site.  Depending on the instructions given, certain pages or sections will be &#8220;crawled&#8221; or not crawled.</strong></p>
<h3>Using a Robots File Effectively</h3>
<p>Generally we want as much exposure as possible for our sites, but there is some content that you don&#8217;t want indexed and listed on search engines.  This is where a robots.txt file can be used effectively.</p>
<h4>Definitions</h4>
<p>User-agent: names the bot that the rules following it apply to. * is a wildcard that matches all bots; a specific name such as Googlebot targets Google&#8217;s crawler only.<br />
Disallow: lists the folders or files to exclude from crawling. An empty value excludes nothing, / excludes the whole site, and /foldername/ or /filename excludes just that folder or file.<br />
Allow: works as the opposite of Disallow. It lists content that may be crawled, typically to open up part of a disallowed folder. It was not part of the original robots.txt convention but is widely supported, and major crawlers also accept * as a wildcard in paths.<br />
Request-rate: sets the crawl rate as a pages/seconds ratio. For example, 1/20 means 1 page every 20 seconds. This is a non-standard directive and many crawlers ignore it.<br />
Crawl-delay: sets how many seconds to wait after each successful crawl. This is also non-standard, and support varies between crawlers.<br />
Visit-time: sets the hours between which you want your pages to be crawled. This is a non-standard directive as well.<br />
Sitemap: gives the location of your sitemap file (you must use the complete URL of the file).</p>
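<p>To see how a crawler might read these parameters, here is a minimal sketch using Python&#8217;s standard urllib.robotparser module. The rules and the www.example.com URLs are hypothetical and only serve to exercise the definitions above. Note that site_maps() needs Python 3.8 or later, and that this parser applies the first rule that matches, which is why the Allow line is placed before the broader Disallow line.</p>

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules, written only to exercise the directives defined above.
RULES = """\
User-agent: *
Allow: /private/press/
Disallow: /private/
Crawl-delay: 10
Request-rate: 1/20
Sitemap: http://www.example.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())

# Disallow: /private/ blocks this page for every bot.
notes_ok = parser.can_fetch("*", "http://www.example.com/private/notes.html")

# Allow: /private/press/ comes first, so this page stays crawlable.
press_ok = parser.can_fetch("*", "http://www.example.com/private/press/launch.html")

delay = parser.crawl_delay("*")    # seconds to wait between fetches
rate = parser.request_rate("*")    # named tuple with requests and seconds
sitemaps = parser.site_maps()      # list of Sitemap URLs (Python 3.8+)

print(notes_ok, press_ok, delay, rate.requests, rate.seconds, sitemaps)
```

<p>Different crawlers resolve Allow/Disallow conflicts differently, so treat this as an illustration of the parameters rather than a prediction of how any particular bot behaves.</p>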
<h4>Example</h4>
<p>This is the robots.txt I use on my site:</p>
<pre>
<code>
User-agent: *
Disallow: /cms/feed/
Disallow: */feed/*
Disallow: /feed
Disallow: /cms/wp-content/
Disallow: /cms/wp-plugins/
Disallow: */wp-content/*
Disallow: /cms/wp-content/plugins/
Disallow: /cms/index.php
Sitemap: http://www.bestblogs.asia/sitemap.xml
</code>
</pre>
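<p>The Disallow lines in the example above mix plain path prefixes with * wildcards. The following is a rough sketch of how such rules could be matched against a URL path, assuming the wildcard behaviour used by the major search engines (* matches any run of characters, a trailing $ anchors the end of the path). The helper names rule_matches and is_blocked and the sample paths are made up for illustration; a real crawler also weighs Allow rules, which this sketch leaves out.</p>

```python
import re

def rule_matches(pattern, path):
    """Return True if a robots.txt path pattern matches a URL path."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" where the rule had "*".
    regex = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start only, so plain rules act as prefixes.
    return re.match(regex, path) is not None

# The Disallow rules from the example file above.
DISALLOW = [
    "/cms/feed/", "*/feed/*", "/feed", "/cms/wp-content/", "/cms/wp-plugins/",
    "*/wp-content/*", "/cms/wp-content/plugins/", "/cms/index.php",
]

def is_blocked(path):
    """Return True if any Disallow rule matches the path."""
    return any(rule_matches(rule, path) for rule in DISALLOW)

for sample in ["/robots-file.php", "/tags/robots-text-file/feed/", "/feedback.php"]:
    print(sample, is_blocked(sample))
```

<p>Under these assumptions the article page stays crawlable and the tag feed is blocked. A page such as /feedback.php would be blocked too, because /feed is a prefix rule, which is worth keeping in mind when choosing short paths.</p>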
<p><a href="http://www.bestblogs.asia">Best Blogs Asia</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://www.bestblogs.asia/robots-file.php/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

