<?xml version="1.0" encoding="UTF-8"?><rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
> <channel><title>Comments on: Robot txt mystery unraveled</title> <atom:link href="http://potpolitics.com/2009/07/02/robot-txt-mystery-unraveled/feed/" rel="self" type="application/rss+xml" /><link>http://potpolitics.com/2009/07/02/robot-txt-mystery-unraveled/</link> <description></description> <lastBuildDate>Tue, 16 Mar 2010 16:21:37 +0000</lastBuildDate> <generator>http://wordpress.org/?v=2.9.2</generator> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <item><title>By: Brian</title><link>http://potpolitics.com/2009/07/02/robot-txt-mystery-unraveled/comment-page-1/#comment-5147</link> <dc:creator>Brian</dc:creator> <pubDate>Wed, 08 Jul 2009 17:09:54 +0000</pubDate> <guid
isPermaLink="false">http://potpolitics.com/?p=2637#comment-5147</guid> <description>Kikolani,If the .com and .net have identical content, you should just use a 301 redirect on the .com to avoid the issue you talked about. (I assume you are using virtual hosts?) If you are using apache this is pretty easy to do with htaccess.</description> <content:encoded><![CDATA[<p>Kikolani,</p><p>If the .com and .net have identical content, you should just use a 301 redirect on the .com to avoid the issue you talked about. (I assume you are using virtual hosts?) If you are using apache this is pretty easy to do with htaccess.</p> ]]></content:encoded> </item> <item><title>By: Kikolani</title><link>http://potpolitics.com/2009/07/02/robot-txt-mystery-unraveled/comment-page-1/#comment-5004</link> <dc:creator>Kikolani</dc:creator> <pubDate>Fri, 03 Jul 2009 20:47:39 +0000</pubDate> <guid
isPermaLink="false">http://potpolitics.com/?p=2637#comment-5004</guid> <description>I have a tricky thing I&#039;d like to use the robots for, but haven&#039;t figured out how yet.  Basically, my client has both the .net and .com of his domain.  The site files are all hosted under .com, but .net is the primary domain that shows up in search results.  However, if you type in any page.com, it will pull up the same content as any page.net, which seems like a duplicate content issue.  Can I block the robots from crawling the .com site, even if the files are hosted on the .com site?  I know, pretty odd.~ Kristi
.-= Kikolani´s last blog ..&lt;a href=&quot;http://feedproxy.google.com/~r/kikolani/~3/ctI2AiQ3pf0/fetching-friday-resources-mashup-followfriday-tennis-love.html&quot; rel=&quot;nofollow&quot;&gt;Fetching Friday - Resources Mashup, #FollowFriday, &amp; Some Tennis Love&lt;/a&gt; =-.</description> <content:encoded><![CDATA[<p>I have a tricky thing I&#8217;d like to use the robots for, but haven&#8217;t figured out how yet.  Basically, my client has both the .net and .com of his domain.  The site files are all hosted under .com, but .net is the primary domain that shows up in search results.  However, if you type in any page.com, it will pull up the same content as any page.net, which seems like a duplicate content issue.  Can I block the robots from crawling the .com site, even if the files are hosted on the .com site?  I know, pretty odd.</p><p>~ Kristi<br
/> <span
class="cluv"> Kikolani´s last blog ..<a
href="http://feedproxy.google.com/~r/kikolani/~3/ctI2AiQ3pf0/fetching-friday-resources-mashup-followfriday-tennis-love.html">Fetching Friday &#8211; Resources Mashup, #FollowFriday, &amp; Some Tennis Love</a> <span
class="heart_tip_box"><img
class="heart_tip" alt="My ComLuv Profile" border="0" width="16" height="14" src="http://potpolitics.com/wordpress/wp-content/plugins/commentluv/images/littleheart.gif"/></span></span></p> ]]></content:encoded> </item> <item><title>By: orovo</title><link>http://potpolitics.com/2009/07/02/robot-txt-mystery-unraveled/comment-page-1/#comment-5002</link> <dc:creator>orovo</dc:creator> <pubDate>Fri, 03 Jul 2009 10:12:45 +0000</pubDate> <guid
isPermaLink="false">http://potpolitics.com/?p=2637#comment-5002</guid> <description>i am not feeling very shy for saying that i didn&#039;t know at all about the robot.txt  but after reading your post i got to know some basics &amp; seriously i am still in search of some information about the robot.txt &amp; i am going for the forum discussion !Thanks for your perfect topic as robot .txt!it is very helpful to control bugs from the search engines!</description> <content:encoded><![CDATA[<p>i am not feeling very shy for saying that i didn&#8217;t know at all about the robot.txt  but after reading your post i got to know some basics &amp; seriously i am still in search of some information about the robot.txt &amp; i am going for the forum discussion !Thanks for your perfect topic as robot .txt!it is very helpful to control bugs from the search engines!</p> ]]></content:encoded> </item> <item><title>By: Tycoon Blogger @Make Money Blogging</title><link>http://potpolitics.com/2009/07/02/robot-txt-mystery-unraveled/comment-page-1/#comment-4997</link> <dc:creator>Tycoon Blogger @Make Money Blogging</dc:creator> <pubDate>Fri, 03 Jul 2009 02:46:13 +0000</pubDate> <guid
isPermaLink="false">http://potpolitics.com/?p=2637#comment-4997</guid> <description>That is way over my head.  I think I will outsource this to some one.  Thanks for breaking it down though as I was not familiar with this.
.-= Tycoon Blogger @Make Money Blogging´s last blog ..&lt;a href=&quot;http://tycoonblogger.com/social-media/twitter-voyeurism&quot; rel=&quot;nofollow&quot;&gt;Twitter Voyeurism&lt;/a&gt; =-.</description> <content:encoded><![CDATA[<p>That is way over my head.  I think I will outsource this to some one.  Thanks for breaking it down though as I was not familiar with this.<br
/> <span
class="cluv"> Tycoon Blogger @Make Money Blogging´s last blog ..<a
href="http://tycoonblogger.com/social-media/twitter-voyeurism">Twitter Voyeurism</a> <span
class="heart_tip_box"><img
class="heart_tip" alt="My ComLuv Profile" border="0" width="16" height="14" src="http://potpolitics.com/wordpress/wp-content/plugins/commentluv/images/littleheart.gif"/></span></span></p> ]]></content:encoded> </item> <item><title>By: Ste@DS Bundle</title><link>http://potpolitics.com/2009/07/02/robot-txt-mystery-unraveled/comment-page-1/#comment-4994</link> <dc:creator>Ste@DS Bundle</dc:creator> <pubDate>Thu, 02 Jul 2009 20:50:24 +0000</pubDate> <guid
isPermaLink="false">http://potpolitics.com/?p=2637#comment-4994</guid> <description>Some great tips here, robots.txt has always mystified me.</description> <content:encoded><![CDATA[<p>Some great tips here, robots.txt has always mystified me.</p> ]]></content:encoded> </item> <item><title>By: bbrian017</title><link>http://potpolitics.com/2009/07/02/robot-txt-mystery-unraveled/comment-page-1/#comment-4988</link> <dc:creator>bbrian017</dc:creator> <pubDate>Thu, 02 Jul 2009 12:57:28 +0000</pubDate> <guid
isPermaLink="false">http://potpolitics.com/?p=2637#comment-4988</guid> <description>Great read John! Ironic but I just added a robots.txt file to my social network about 8 weeks ago and since the pr update I got a page rank of 4. I get a lot of spam on the upcoming.php page and told the SE’s to not index of follow that link. I also added other pages that showed duplicate content.This is my file here,# All robots will spider the domain
User-agent: *
Disallow: /templates/
Disallow: /3rdparty/
Disallow: /libs/
Disallow: /modules/
Disallow: /plugins/
Disallow: /internal/
Disallow: /backup/
Disallow: /thickbox/
Disallow: /api/
Disallow: /evb/
Disallow: /avatars/
Disallow: /admin_index.php
Disallow: /admin
Disallow: /login.php
Disallow: /js/
Disallow: /img/
Disallow: /upcoming.phpp.s. I engaged this article!
.-= bbrian017´s last blog ..&lt;a href=&quot;http://www.blogengage.com/story.php?title=7-days-7-colours-thailand--thailand-art-photography&quot; rel=&quot;nofollow&quot;&gt;7 Days 7 Colours Thailand &#124; Thailand Art Photography&lt;/a&gt; =-.</description> <content:encoded><![CDATA[<p>Great read John! Ironic but I just added a robots.txt file to my social network about 8 weeks ago and since the pr update I got a page rank of 4. I get a lot of spam on the upcoming.php page and told the SE’s to not index of follow that link. I also added other pages that showed duplicate content.</p><p>This is my file here,</p><p># All robots will spider the domain<br
/> User-agent: *<br
/> Disallow: /templates/<br
/> Disallow: /3rdparty/<br
/> Disallow: /libs/<br
/> Disallow: /modules/<br
/> Disallow: /plugins/<br
/> Disallow: /internal/<br
/> Disallow: /backup/<br
/> Disallow: /thickbox/<br
/> Disallow: /api/<br
/> Disallow: /evb/<br
/> Disallow: /avatars/<br
/> Disallow: /admin_index.php<br
/> Disallow: /admin<br
/> Disallow: /login.php<br
/> Disallow: /js/<br
/> Disallow: /img/<br
/> Disallow: /upcoming.php</p><p>p.s. I engaged this article!<br
/> <span
class="cluv"> bbrian017´s last blog ..<a
href="http://www.blogengage.com/story.php?title=7-days-7-colours-thailand--thailand-art-photography">7 Days 7 Colours Thailand | Thailand Art Photography</a> <span
class="heart_tip_box"><img
class="heart_tip" alt="My ComLuv Profile" border="0" width="16" height="14" src="http://potpolitics.com/wordpress/wp-content/plugins/commentluv/images/littleheart.gif"/></span></span></p> ]]></content:encoded> </item> </channel> </rss>
<!-- This site's performance optimized by W3 Total Cache. Dramatically improve the speed and reliability of your blog!

Learn more about our WordPress Plugins: http://www.w3-edge.com/wordpress-plugins/

Minified using disk
Page Caching using disk (user agent is rejected)
Database Caching 16/38 queries in 0.112 seconds using disk

Served from: perfora.net @ 2010-03-16 20:24:24 -->