<?xml version="1.0" encoding="utf-8" ?>
<?xml-stylesheet type="text/xsl" href="RSS_xslt_style.asp" version="1.0" ?>
<rss version="2.0" xmlns:WebWizForums="https://syndication.webwiz.net/rss_namespace/">
 <channel>
  <title>Web Wiz Support and Community Forums : Robots.txt file project</title>
  <link>https://forums.webwiz.net/</link>
  <description><![CDATA[This is an XML content feed of; Web Wiz Support and Community Forums : Web Wiz Forums : Robots.txt file project]]></description>
  <copyright>Copyright (c) 2006-2013 Web Wiz Forums - All Rights Reserved.</copyright>
  <pubDate>Mon, 13 Apr 2026 08:58:48 +0000</pubDate>
  <lastBuildDate>Fri, 08 Jul 2005 22:29:54 +0000</lastBuildDate>
  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
  <generator>Web Wiz Forums 12.08</generator>
  <ttl>360</ttl>
  <WebWizForums:feedURL>https://forums.webwiz.net/RSS_post_feed.asp?TID=15698</WebWizForums:feedURL>
  <image>
   <title><![CDATA[Web Wiz Support and Community Forums]]></title>
   <url>https://forums.webwiz.net/forum_images/web_wiz_forums.png</url>
   <link>https://forums.webwiz.net/</link>
  </image>
  <item>
   <title><![CDATA[Robots.txt file project : I wonder if someone could write...]]></title>
   <link>https://forums.webwiz.net/robots-txt-file-project_topic15698_post86651.html#86651</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=19578">radic</a><br /><strong>Subject:</strong> 15698<br /><strong>Posted:</strong> 08&nbsp;July&nbsp;2005 at 10:29pm<br /><br />I wonder if someone could write a similar script for webwiz forums for the google Sitemap.<DIV>&nbsp;</DIV><DIV>Here is a link to the snitz code and I would like to do the same thing with my webwiz forum: <A href="http://forum.snitz.com/forum/topic.asp?TOPIC_ID=58757" target="_blank">http://forum.snitz.com/forum/topic.asp?TOPIC_ID=58757</A></DIV>]]>
   </description>
   <pubDate>Fri, 08 Jul 2005 22:29:54 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/robots-txt-file-project_topic15698_post86651.html#86651</guid>
  </item> 
  <item>
   <title><![CDATA[Robots.txt file project :   dpyers wrote:Something new...]]></title>
   <link>https://forums.webwiz.net/robots-txt-file-project_topic15698_post86516.html#86516</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=19578">radic</a><br /><strong>Subject:</strong> 15698<br /><strong>Posted:</strong> 06&nbsp;July&nbsp;2005 at 8:58am<br /><br /><table width="99%"><tr><td class="BBquote"><img src="forum_images/quote_box.png" title="Originally posted by dpyers" alt="Originally posted by dpyers" style="vertical-align: text-bottom;" /> <strong>dpyers wrote:</strong><br /><br />Something new from google that you may be interested in<BR><A href="http://www.google.com/webmasters/sitemaps/login" target="_blank">https://www.google.com/webmasters/sitemaps/login</A><BR></td></tr></table> <DIV>&nbsp;</DIV><DIV>&nbsp;</DIV><DIV>hahaha, ive been busy working on these for my sites for the last few days since I herd of this. So the point of this post is not really an issue anymore if this Google Sitemap thing works. I got a snitz forum Site Map indexed yesterday and im now working on ones for the rest of my sites... <IMG height=17 alt=Wink src="https://forums.webwiz.net/smileys/smiley2.gif" width=17 align=absMiddle border="0">&nbsp;</DIV><DIV>&nbsp;</DIV><DIV>dpyers, ok I see what your saying now and thanks for that information and that was explained well. <BR></DIV>]]>
   </description>
   <pubDate>Wed, 06 Jul 2005 08:58:14 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/robots-txt-file-project_topic15698_post86516.html#86516</guid>
  </item> 
  <item>
   <title><![CDATA[Robots.txt file project : Something new from google that...]]></title>
   <link>https://forums.webwiz.net/robots-txt-file-project_topic15698_post86489.html#86489</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=9949">dpyers</a><br /><strong>Subject:</strong> 15698<br /><strong>Posted:</strong> 05&nbsp;July&nbsp;2005 at 9:05pm<br /><br />Something new from google that you may be interested in<br><a href="http://www.google.com/webmasters/sitemaps/login" target="_blank">https://www.google.com/webmasters/sitemaps/login</a><br>]]>
   </description>
   <pubDate>Tue, 05 Jul 2005 21:05:56 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/robots-txt-file-project_topic15698_post86489.html#86489</guid>
  </item> 
  <item>
   <title><![CDATA[Robots.txt file project : dpyers is right, the search engines...]]></title>
   <link>https://forums.webwiz.net/robots-txt-file-project_topic15698_post86428.html#86428</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=12115">wistex</a><br /><strong>Subject:</strong> 15698<br /><strong>Posted:</strong> 04&nbsp;July&nbsp;2005 at 7:35pm<br /><br />dpyers is right, the search engines only follow and index links.&nbsp; They don't index any file that noone links directly to.&nbsp; I've never had a problem with Google or any of the others linking to any header or footer files.<DIV>&nbsp;</DIV><DIV>Remember, the bots can't read your directory structure, they can only follow hyperlinks in a webpage.</DIV>]]>
   </description>
   <pubDate>Mon, 04 Jul 2005 19:35:40 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/robots-txt-file-project_topic15698_post86428.html#86428</guid>
  </item> 
  <item>
   <title><![CDATA[Robots.txt file project : The SE bots follow links. They...]]></title>
   <link>https://forums.webwiz.net/robots-txt-file-project_topic15698_post86411.html#86411</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=9949">dpyers</a><br /><strong>Subject:</strong> 15698<br /><strong>Posted:</strong> 04&nbsp;July&nbsp;2005 at 12:25pm<br /><br />The SE bots follow links. They don't actually "walk" a directory tree.You don't link to an include file so the bots never see it.<br>The first time a bot visits, it does a "shallow" scan - usually only 1or two levels deep from your home page. On subsequent visits, it scansdeeper and deeper. It can take months to fully scan a deep site.<br><br>Pages that change frequently are visited more often by the bots. Oncethey get past your main forum page, they'll see pages that changefrewuently and keep coming back more often. I've seen forums that getscanned twice a day.<br><br>One way of getting your site fully indexed wuicker is to include a sitemap on your front page. Another way to get a forum site indexed is toinclude a link to active topics on the home page.<br><br>The robots.txt exclude functions are used to prevent the bots fromfollowing links to those directories - not to prevent walking thedirectories which is something bots can't do if you've turned offdirectory views for your site.<br><br>One of the useful excluding directories and files from robots.txt doesis to keep hidden things that you want hidded from the se's - likekeeping your images out of google.<br><br>Note that robots.txt is only useful for conforming search engines. BadBots either ignore it, or use it to identify areas where "good stuff"might be kept.<br>]]>
   </description>
   <pubDate>Mon, 04 Jul 2005 12:25:56 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/robots-txt-file-project_topic15698_post86411.html#86411</guid>
  </item> 
  <item>
   <title><![CDATA[Robots.txt file project : Ok thanks for the feedback, I&amp;#039;m...]]></title>
   <link>https://forums.webwiz.net/robots-txt-file-project_topic15698_post86400.html#86400</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=19578">radic</a><br /><strong>Subject:</strong> 15698<br /><strong>Posted:</strong> 04&nbsp;July&nbsp;2005 at 9:45am<br /><br />Ok thanks for the feedback, I'm quite surprised that you both think its better to not use the Disallow command for files that you dont want shown on google etc.<DIV>&nbsp;</DIV><DIV>I'm well aware of content is King etc and have been studying SEO quite a bit but there is so much to know. I mean what would be the point of having a file that has no content like an included header or one of the forum files that has no use or could produce an error if landed on?</DIV><DIV>&nbsp;</DIV><DIV>I would much rather have the robot come in and index all the forum posts and boards than letting the robot index all these incuded or funtion files, I think you need to make it easy &amp; clear for them for what they should do with your site&nbsp;and not throw hurdles in their path.&nbsp; <BR><BR>Its also a waste of bandwith although thats not the point.&nbsp;The robots will turn away and not index a site properly if they have to deal with too much junk, they just want the content files, not the includes, not the errors caused be landing on a include file etc. You also dont want these files for everyone to see on google etc, you want users to find the forum posts where the keywords are.</DIV><DIV>&nbsp;</DIV><DIV>Anyway thats just how I see it but very interested to hear more.</DIV><DIV>&nbsp;</DIV><DIV>&nbsp;</DIV>]]>
   </description>
   <pubDate>Mon, 04 Jul 2005 09:45:05 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/robots-txt-file-project_topic15698_post86400.html#86400</guid>
  </item> 
  <item>
   <title><![CDATA[Robots.txt file project : BTW, if you press the space bar...]]></title>
   <link>https://forums.webwiz.net/robots-txt-file-project_topic15698_post86354.html#86354</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=12115">wistex</a><br /><strong>Subject:</strong> 15698<br /><strong>Posted:</strong> 03&nbsp;July&nbsp;2005 at 8:31pm<br /><br />BTW, if you press the space bar after typing a URL, it makes it clickable.&nbsp; Here's a clickable version of the links provided above: <A href="http://www.isapirewrite.com/" target="_blank">http://www.isapirewrite.com/</A> and <A href="http://www.robotstxt.org/wc/norobots.html" target="_blank">http://www.robotstxt.org/wc/norobots.html</A> <DIV>&nbsp;</DIV><DIV>The key to getting your site indexed is content content content.&nbsp; Yes, there are tricks to get Google and other search engines to understand your site better and therefore rank it higher, but content still comes first.&nbsp; On one of my sites, we have <strong>not</strong> used&nbsp;any SEO tricks, and yet we rank it the top 10 on Google and Yahoo! and MSN in some of the appropriate categories.&nbsp; Actually, we do the opposite of what some SEO people say, yet we rank in the top 10.&nbsp; Why?&nbsp; The content is good and people keep coming back, and search engines have ways to figure this out.</DIV><DIV>&nbsp;</DIV><DIV>And excluding files won't help you get indexed, actually it will probably hurt you.&nbsp; The larger your site, the more of an authority you are, and that effects your ranking.&nbsp; So excluding files would probably make you less of an authority since your site will appear smaller to search engines.</DIV>]]>
   </description>
   <pubDate>Sun, 03 Jul 2005 20:31:56 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/robots-txt-file-project_topic15698_post86354.html#86354</guid>
  </item> 
  <item>
   <title><![CDATA[Robots.txt file project : Radic, what makes you think that...]]></title>
   <link>https://forums.webwiz.net/robots-txt-file-project_topic15698_post86336.html#86336</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=18522">Duval</a><br /><strong>Subject:</strong> 15698<br /><strong>Posted:</strong> 03&nbsp;July&nbsp;2005 at 4:04pm<br /><br />Radic, what makes you think that by excluding files you'll get spideredmore thoroughly? Generally the robots.txt exclusion is for non publicareas of your site or with pages that you have concerns about duplicatecontent.<br><br>The single best thing that one can do with a forum to improve searchengine spidering is to rewrite the url's. http://www.isapirewrite.com/<br><br>Here's a link to the exclusion protocol http://www.robotstxt.org/wc/norobots.html <br><br>Please repost any specifics if you run into difficulty.<br>]]>
   </description>
   <pubDate>Sun, 03 Jul 2005 16:04:56 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/robots-txt-file-project_topic15698_post86336.html#86336</guid>
  </item> 
  <item>
   <title><![CDATA[Robots.txt file project : Hi,  I would like to add all...]]></title>
   <link>https://forums.webwiz.net/robots-txt-file-project_topic15698_post86307.html#86307</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=19578">radic</a><br /><strong>Subject:</strong> 15698<br /><strong>Posted:</strong> 02&nbsp;July&nbsp;2005 at 8:59pm<br /><br />Hi,<DIV>&nbsp;</DIV><DIV>I would like to add all of the files from the forum except for the pages like default.asp, forum_topics.asp &amp; forum_posts.asp etc into my site robots.txt file. Instead of doing this from scratch I would like to see if anyone else has one already completed and could share with the community. </DIV><DIV>&nbsp;</DIV><DIV>If you want googlebot etc to&nbsp;index more files and more often then this is vital.</DIV><DIV>&nbsp;</DIV><DIV>If you want traffic and high SEO listings then this is vital.</DIV>]]>
   </description>
   <pubDate>Sat, 02 Jul 2005 20:59:10 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/robots-txt-file-project_topic15698_post86307.html#86307</guid>
  </item> 
 </channel>
</rss>