Results 1 to 3 of 3

SE bots seem to ignore robots.txt exclusion of newreply.php

This is a discussion on SE bots seem to ignore robots.txt exclusion of newreply.php within the General Discussion forums, part of the vBSEO SEO Plugin category; I have installed the Track Guest Visits modification, which shows guest and SE bot activity on a forum. While browsing ...

  1. #1
    711
    711 is offline
    Junior Member
    Real Name
    711
    Join Date
    Nov 2007
    Posts
    16
    Liked
    0 times

    SE bots seem to ignore robots.txt exclusion of newreply.php

    I have installed the Track Guest Visits modification, which shows guest and SE bot activity on a forum.

    While browsing the spider activity page, I noticed lots of request to newreply.php, even though I have that page disallowed in my robots.txt.

    Here is a screenshot of what I mean:





    Notice all the:

    Called with DO = 'newreply'

    Here is my current robots.txt content:

    HTML Code:
    User-agent: *
    Disallow: /forums/admincp/
    Disallow: /forums/attachment.php
    Disallow: /forums/avatar.php
    Disallow: /forums/calendar.php
    Disallow: /forums/cgi-bin/
    Disallow: /forums/clientscript/
    Disallow: /forums/cron.php
    Disallow: /forums/editpost.php
    Disallow: /forums/faq.php
    Disallow: /forums/image.php
    Disallow: /forums/images/
    Disallow: /forums/includes/
    Disallow: /forums/install/
    Disallow: /forums/ispy.php
    Disallow: /forums/joinrequests.php
    Disallow: /forums/login.php
    Disallow: /forums/member.php
    Disallow: /forums/member2.php
    Disallow: /forums/misc.php
    Disallow: /forums/modcp/
    Disallow: /forums/moderator.php
    Disallow: /forums/newreply.php
    Disallow: /forums/newthread.php
    Disallow: /forums/online.php
    Disallow: /forums/payments.php
    Disallow: /forums/poll.php
    Disallow: /forums/postings.php
    Disallow: /forums/printthread.php
    Disallow: /forums/private.php
    Disallow: /forums/private2.php
    Disallow: /forums/profile.php
    Disallow: /forums/register.php
    Disallow: /forums/reputation.php
    Disallow: /forums/search.php
    Disallow: /forums/sendmessage.php
    Disallow: /forums/sendmessage.php?do=
    Disallow: /forums/sendtofriend.php
    Disallow: /forums/showgroups.php
    Disallow: /forums/showpost.php
    Disallow: /forums/sitemap/
    Disallow: /forums/spy.php
    Disallow: /forums/subscription.php
    Disallow: /forums/tags/
    Disallow: /forums/threadrate.php
    Disallow: /forums/upload.php
    Disallow: /forums/usercp.php
    Disallow: /forums/weeklystats.php
    Disallow: /forums/statistics.php
    Disallow: /forums/stats.php
    Disallow: /forums/infraction.php
    Disallow: /forums/ajax.php
    Disallow: /forums/arcade.php
     
    User-agent: Slurp
    Disallow: /gallery/
    Crawl-delay: 90
    Sitemap: http://www.entropiaforum.com/forums/sitemap_index.xml.gz
    Any ideas or suggestions on how to prevent the spiders from visiting newreply.php (or any other unwanted pages) would be appreciated, thank!
    Last edited by 711; 04-06-2008 at 07:49 AM. Reason: Fixed image

  2. #2
    Senior Member briansol's Avatar
    Real Name
    Brian
    Join Date
    Apr 2006
    Location
    Central CT, USA
    Posts
    6,981
    Liked
    8 times
    the robots file is merely a suggestion to the SE. It won't 100% stop anything.

  3. #3
    711
    711 is offline
    Junior Member
    Real Name
    711
    Join Date
    Nov 2007
    Posts
    16
    Liked
    0 times
    Quote Originally Posted by briansol View Post
    the robots file is merely a suggestion to the SE. It won't 100% stop anything.
    True, though I thought Googlebot tended to be more "well-behaved", and usually respected the robots.txt suggestions?

Similar Threads

  1. Blocking bots in robots.txt - how do they see URLs?
    By Dave Hybrid in forum General Discussion
    Replies: 0
    Last Post: 06-20-2007, 07:53 PM
  2. Just a test, please ignore
    By Lian in forum Off-Topic & Chit Chat
    Replies: 1
    Last Post: 01-16-2007, 03:00 AM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •