vBulletin SEO Forums

SEO

vBulletin Search Engine Optimization

Buy vBSEO Now!
vBSEO 2.0 Style Released vBSEO 3.3.0 GOLD Launched vBSEO's "LiveStats" for Google Analytics vB Sitemap Generator, Version 2.5 Success with vBSEO = 600ore Web Visitors + $1400 in a Day!

SE bots seem to ignore robots.txt exclusion of newreply.php

This is a discussion on SE bots seem to ignore robots.txt exclusion of newreply.php within the General Discussion forums, part of the vBSEO SEO Plugin category; I have installed the Track Guest Visits modification, which shows guest and SE bot activity on a forum. While browsing ...

Go Back   vBulletin SEO Forums > vBSEO SEO Plugin > General Discussion

Enhancing 80 million pages.

  #1  
Old 04-06-2008, 01:36 AM
711 711 is offline
Junior Member
Big Board Administrator
 
Real Name: 711
Join Date: Nov 2007
Posts: 16
SE bots seem to ignore robots.txt exclusion of newreply.php

I have installed the Track Guest Visits modification, which shows guest and SE bot activity on a forum.

While browsing the spider activity page, I noticed lots of request to newreply.php, even though I have that page disallowed in my robots.txt.

Here is a screenshot of what I mean:





Notice all the:

Called with DO = 'newreply'

Here is my current robots.txt content:

HTML Code:
User-agent: *
Disallow: /forums/admincp/
Disallow: /forums/attachment.php
Disallow: /forums/avatar.php
Disallow: /forums/calendar.php
Disallow: /forums/cgi-bin/
Disallow: /forums/clientscript/
Disallow: /forums/cron.php
Disallow: /forums/editpost.php
Disallow: /forums/faq.php
Disallow: /forums/image.php
Disallow: /forums/images/
Disallow: /forums/includes/
Disallow: /forums/install/
Disallow: /forums/ispy.php
Disallow: /forums/joinrequests.php
Disallow: /forums/login.php
Disallow: /forums/member.php
Disallow: /forums/member2.php
Disallow: /forums/misc.php
Disallow: /forums/modcp/
Disallow: /forums/moderator.php
Disallow: /forums/newreply.php
Disallow: /forums/newthread.php
Disallow: /forums/online.php
Disallow: /forums/payments.php
Disallow: /forums/poll.php
Disallow: /forums/postings.php
Disallow: /forums/printthread.php
Disallow: /forums/private.php
Disallow: /forums/private2.php
Disallow: /forums/profile.php
Disallow: /forums/register.php
Disallow: /forums/reputation.php
Disallow: /forums/search.php
Disallow: /forums/sendmessage.php
Disallow: /forums/sendmessage.php?do=
Disallow: /forums/sendtofriend.php
Disallow: /forums/showgroups.php
Disallow: /forums/showpost.php
Disallow: /forums/sitemap/
Disallow: /forums/spy.php
Disallow: /forums/subscription.php
Disallow: /forums/tags/
Disallow: /forums/threadrate.php
Disallow: /forums/upload.php
Disallow: /forums/usercp.php
Disallow: /forums/weeklystats.php
Disallow: /forums/statistics.php
Disallow: /forums/stats.php
Disallow: /forums/infraction.php
Disallow: /forums/ajax.php
Disallow: /forums/arcade.php
 
User-agent: Slurp
Disallow: /gallery/
Crawl-delay: 90
Sitemap: http://www.entropiaforum.com/forums/sitemap_index.xml.gz
Any ideas or suggestions on how to prevent the spiders from visiting newreply.php (or any other unwanted pages) would be appreciated, thank!

Last edited by 711; 04-06-2008 at 06:49 AM. Reason: Fixed image
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Share on Facebook!
Reply With Quote
  #2  
Old 04-06-2008, 05:47 PM
briansol's Avatar
Senior Member
 
Real Name: Brian
Join Date: Apr 2006
Location: Central CT, USA
Posts: 7,090
the robots file is merely a suggestion to the SE. It won't 100% stop anything.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Share on Facebook!
Reply With Quote
  #3  
Old 04-06-2008, 06:30 PM
711 711 is offline
Junior Member
Big Board Administrator
 
Real Name: 711
Join Date: Nov 2007
Posts: 16
Quote:
Originally Posted by briansol View Post
the robots file is merely a suggestion to the SE. It won't 100% stop anything.
True, though I thought Googlebot tended to be more "well-behaved", and usually respected the robots.txt suggestions?
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Share on Facebook!
Reply With Quote
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads

Thread Thread Starter Forum Replies Last Post
Blocking bots in robots.txt - how do they see URLs? Dave Hybrid General Discussion 0 06-20-2007 06:53 PM
Just a test, please ignore Lian Off-Topic & Chit Chat 1 01-16-2007 02:00 AM


All times are GMT -4. The time now is 11:22 PM.


Powered by vBulletin Version 3.8.3
Copyright ©2000 - 2009, Jelsoft Enterprises Ltd.
SEO by vBSEO 3.3.0 ©2009, Crawlability, Inc.