<?xml version="1.0" encoding="utf-8" ?>
<?xml-stylesheet type="text/xsl" href="RSS_xslt_style.asp" version="1.0" ?>
<rss version="2.0" xmlns:WebWizForums="https://syndication.webwiz.net/rss_namespace/">
 <channel>
  <title>Web Wiz Support and Community Forums : Search Robots?</title>
  <link>https://forums.webwiz.net/</link>
  <description><![CDATA[This is an XML content feed of; Web Wiz Support and Community Forums : Web Wiz Forums : Search Robots?]]></description>
  <copyright>Copyright (c) 2006-2013 Web Wiz Forums - All Rights Reserved.</copyright>
  <pubDate>Wed, 08 Apr 2026 16:12:42 +0000</pubDate>
  <lastBuildDate>Sat, 26 Jun 2004 15:52:45 +0000</lastBuildDate>
  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
  <generator>Web Wiz Forums 12.08</generator>
  <ttl>360</ttl>
  <WebWizForums:feedURL>https://forums.webwiz.net/RSS_post_feed.asp?TID=10867</WebWizForums:feedURL>
  <image>
   <title><![CDATA[Web Wiz Support and Community Forums]]></title>
   <url>https://forums.webwiz.net/forum_images/web_wiz_forums.png</url>
   <link>https://forums.webwiz.net/</link>
  </image>
  <item>
   <title><![CDATA[Search Robots? : google, yahoo, aol, and the other...]]></title>
   <link>https://forums.webwiz.net/search-robots_topic10867_post60472.html#60472</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=9949">dpyers</a><br /><strong>Subject:</strong> 10867<br /><strong>Posted:</strong> 26&nbsp;June&nbsp;2004 at 3:52pm<br /><br /><P>google, yahoo, aol, and the other engines all have forms you can use to submit your site to them.</P><P>You should also submit to dmoz - the open directory project - as many search engines use them as source info&nbsp;- but they do manual checks and it can take months/years before they get around to you.</P><P>If you have a link to your homepage on a related site that google regularly visits, their bot will follow the link to you and index you as well.</P><P>Google will usually to a surface scan of your site within a few days/weeks of submission. They'll to a deep crawl within a few months.</P>]]>
   </description>
   <pubDate>Sat, 26 Jun 2004 15:52:45 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/search-robots_topic10867_post60472.html#60472</guid>
  </item> 
  <item>
   <title><![CDATA[Search Robots? : does your pages have titles, and...]]></title>
   <link>https://forums.webwiz.net/search-robots_topic10867_post60462.html#60462</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=2216">dj air</a><br /><strong>Subject:</strong> 10867<br /><strong>Posted:</strong> 26&nbsp;June&nbsp;2004 at 2:14pm<br /><br />does your pages have titles, and descriptions you can add the followingto the pages aswell that can help, once they been they know when tocome back.<br><pre id="line1">&lt;<span ="start-tag">meta</span><span ="attribute-name"> name</span>=<span ="attribute-value">"expires" </span><span ="attribute-name">content</span>=<span ="attribute-value">"never"</span>&gt;<br>&lt;<span ="start-tag">meta</span><span ="attribute-name"> name</span>=<span ="attribute-value">"robots" </span><span ="attribute-name">content</span>=<span ="attribute-value">"INDEX,FOLLOW"</span>&gt;<br>&lt;<span ="start-tag">meta</span><span ="attribute-name"> name</span>=<span ="attribute-value">"revisit-after" </span><span ="attribute-name">content</span>=<span ="attribute-value">"7 Days"</span>&gt;<br></pre><br>]]>
   </description>
   <pubDate>Sat, 26 Jun 2004 14:14:53 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/search-robots_topic10867_post60462.html#60462</guid>
  </item> 
  <item>
   <title><![CDATA[Search Robots? : i kno this isnt my post and its...]]></title>
   <link>https://forums.webwiz.net/search-robots_topic10867_post60442.html#60442</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=16498">web-geek</a><br /><strong>Subject:</strong> 10867<br /><strong>Posted:</strong> 26&nbsp;June&nbsp;2004 at 4:57am<br /><br /><P>i kno this isnt my post and its not really about the forum, but instead of kicking out the bots is there anyway to encourage them to come to ur site? like feeding mice cheese to get them to go to the mouse trap.</P><P>im jst finding that with all my sites ive had they are neva or i hav neva found out if they hav been crawled or not and they do not appear in search engines.</P>]]>
   </description>
   <pubDate>Sat, 26 Jun 2004 04:57:13 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/search-robots_topic10867_post60442.html#60442</guid>
  </item> 
  <item>
   <title><![CDATA[Search Robots? : The robots.txt file only works...]]></title>
   <link>https://forums.webwiz.net/search-robots_topic10867_post59692.html#59692</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=9949">dpyers</a><br /><strong>Subject:</strong> 10867<br /><strong>Posted:</strong> 15&nbsp;June&nbsp;2004 at 2:48pm<br /><br />The robots.txt file only works for well behaved bots who take the trouble to read and follow it. Bad Bots ignore it and also any robots meta tags.]]>
   </description>
   <pubDate>Tue, 15 Jun 2004 14:48:40 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/search-robots_topic10867_post59692.html#59692</guid>
  </item> 
  <item>
   <title><![CDATA[Search Robots? : you could block the search engines...]]></title>
   <link>https://forums.webwiz.net/search-robots_topic10867_post59673.html#59673</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=6524">Scotty32</a><br /><strong>Subject:</strong> 10867<br /><strong>Posted:</strong> 15&nbsp;June&nbsp;2004 at 10:50am<br /><br />you could block the search engines (robots) from accessing your site with the robot.txt in the root folder<br><br><a href="http://www.google.com/webmasters/3.html#B4" target="_blank">http://www.google.com/webmasters/3.html#B4</a> theres sum info off google about the robot.txt file<br><br>just make a robot.txt to block search engines (robots) from accessing your forum <br>]]>
   </description>
   <pubDate>Tue, 15 Jun 2004 10:50:25 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/search-robots_topic10867_post59673.html#59673</guid>
  </item> 
  <item>
   <title><![CDATA[Search Robots? : Some search robots harvest email...]]></title>
   <link>https://forums.webwiz.net/search-robots_topic10867_post59664.html#59664</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=9949">dpyers</a><br /><strong>Subject:</strong> 10867<br /><strong>Posted:</strong> 15&nbsp;June&nbsp;2004 at 7:58am<br /><br /><P>Some search robots harvest email addresses from pages that are able to be displayed. </P><P>As a rule of thumb, never put your email address on a page.&nbsp;One of the reasons forums like this have Private Mail is so members can email each other without exposing their email address to the world.</P>]]>
   </description>
   <pubDate>Tue, 15 Jun 2004 07:58:02 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/search-robots_topic10867_post59664.html#59664</guid>
  </item> 
  <item>
   <title><![CDATA[Search Robots? : they are exactly what they say...]]></title>
   <link>https://forums.webwiz.net/search-robots_topic10867_post59654.html#59654</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=2216">dj air</a><br /><strong>Subject:</strong> 10867<br /><strong>Posted:</strong> 15&nbsp;June&nbsp;2004 at 6:07am<br /><br /><P>they are exactly what <FONT face="Arial, Helvetica, sans-serif">they</FONT> say they are.</P><P>they search the forums . and getting data from the forum so people from the search engines can search the forum, for an answer for the search word.</P><P>hope that makes sense.</P><P>there is nothing to worry about with them.</P><P>run down in the above</P><P>- u post a message.</P><P>- a search bot comes to your forum and cashes/adds the details of the post to their database.</P><P>- someone on the search engine site (google for example). search for something . </P><P>-t can use the data from the search that the search bot did.</P><P>- to show results from the search on google.</P>]]>
   </description>
   <pubDate>Tue, 15 Jun 2004 06:07:41 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/search-robots_topic10867_post59654.html#59654</guid>
  </item> 
  <item>
   <title><![CDATA[Search Robots? :    Dr Moocowz wrote:Every once...]]></title>
   <link>https://forums.webwiz.net/search-robots_topic10867_post59653.html#59653</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=13756">thekiwi</a><br /><strong>Subject:</strong> 10867<br /><strong>Posted:</strong> 15&nbsp;June&nbsp;2004 at 6:07am<br /><br /><table width="99%"><tr><td class="BBquote"><img src="forum_images/quote_box.png" title="Originally posted by Dr Moocowz" alt="Originally posted by Dr Moocowz" style="vertical-align: text-bottom;" /> <strong>Dr Moocowz wrote:</strong><br /><br /><p>Every once in a while on my forum, I'll see in the Active Users something like the following...</p><p>Guest 1&nbsp;&nbsp; June 14 2004 at 1:19pm&nbsp;&nbsp; June 14 2004 at 1:50pm&nbsp;&nbsp;&nbsp; 31 minutes&nbsp;&nbsp; <strong>Search Robot</strong>&nbsp;&nbsp;&nbsp; <strong>MSN</strong></p><p>I check the server logs and I can't really see anything malicious going on...</p><p>I guess my question is what exactly is a Search Robot, what does itdo, and should I be concerned about it doing something malicious oraccessing something it shouldn't have access to?</p><p>Thanks ahead of time.</p></td></tr></table><br>Search Robots index the internet .... eg GoogleBot for Google etc.<br>If you dont want your site to be indexed (spidered) then add the appropriate info to a robots.txt in the root of your site.<br>]]>
   </description>
   <pubDate>Tue, 15 Jun 2004 06:07:03 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/search-robots_topic10867_post59653.html#59653</guid>
  </item> 
  <item>
   <title><![CDATA[Search Robots? : Every once in a while on my forum,...]]></title>
   <link>https://forums.webwiz.net/search-robots_topic10867_post59609.html#59609</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="https://forums.webwiz.net/member_profile.asp?PF=15085">Dr Moocowz</a><br /><strong>Subject:</strong> 10867<br /><strong>Posted:</strong> 14&nbsp;June&nbsp;2004 at 5:35pm<br /><br /><P>Every once in a while on my forum, I'll see in the Active Users something like the following...</P><P>Guest 1&nbsp;&nbsp; June 14 2004 at 1:19pm&nbsp;&nbsp; June 14 2004 at 1:50pm&nbsp;&nbsp;&nbsp; 31 minutes&nbsp;&nbsp; <strong>Search Robot</strong>&nbsp;&nbsp;&nbsp; <strong>MSN</strong></P><P>I check the server logs and I can't really see anything malicious going on...</P><P>I guess my question is what exactly is a Search Robot, what does it do, and should I be concerned about it doing something malicious or accessing something it shouldn't have access to?</P><P>Thanks ahead of time.</P>]]>
   </description>
   <pubDate>Mon, 14 Jun 2004 17:35:59 +0000</pubDate>
   <guid isPermaLink="true">https://forums.webwiz.net/search-robots_topic10867_post59609.html#59609</guid>
  </item> 
 </channel>
</rss>