Results 1 to 14 of 14

please validate my robots.txt

This is a discussion on please validate my robots.txt within the General Discussion forums, part of the vBulletin SEO Discussion category; Code: User-agent: gigabot Disallow: / User-agent: * Disallow: /newreply.php Disallow: /newthread.php Disallow: /private.php Disallow: /printthread.php Disallow: /sendmessage.php Disallow: /showpost.php is ...

  1. #1
    Senior Member
    Real Name
    Future
    Join Date
    Mar 2006
    Posts
    579
    Liked
    1 times
    Blog Entries
    1

    Question please validate my robots.txt

    Code:
    User-agent: gigabot
    Disallow: /
    
    User-agent: *
    Disallow: /newreply.php
    Disallow: /newthread.php
    Disallow: /private.php
    Disallow: /printthread.php
    Disallow: /sendmessage.php
    Disallow: /showpost.php
    is this correct ?
    lot of content is indexed in serps with those filenames which leads to duplicate content on same site and lot of error messages generated.

  2. #2
    vBSEO Staff Oleg Ignatiuk's Avatar
    Real Name
    Oleg Ignatiuk
    Join Date
    Jun 2005
    Location
    Belarus
    Posts
    25,744
    Liked
    168 times
    Hello,

    your robots.txt entries are correct.
    By the way, there is a new robots.txt tool available in the Google sitemaps account.

  3. #3
    Member
    Real Name
    Christoph Puetz
    Join Date
    Mar 2006
    Location
    Highlands Ranch, CO - USA
    Posts
    30
    Liked
    0 times
    Are these kind of entries still needed when using VBSEO?

    Chris

  4. #4
    vBSEO Staff Oleg Ignatiuk's Avatar
    Real Name
    Oleg Ignatiuk
    Join Date
    Jun 2005
    Location
    Belarus
    Posts
    25,744
    Liked
    168 times
    Hello Chris,

    yes, since these URLs are not rewritten, the robots.txt exclusions are still valid.

  5. #5
    Member
    Real Name
    Christoph Puetz
    Join Date
    Mar 2006
    Location
    Highlands Ranch, CO - USA
    Posts
    30
    Liked
    0 times
    Thanks. Just updated mine after reading your reply.

    Christoph

  6. #6
    Member
    Real Name
    Trana
    Join Date
    Jan 2006
    Posts
    77
    Liked
    0 times
    Quote Originally Posted by Oleg Ignatiuk
    Hello Chris,

    yes, since these URLs are not rewritten, the robots.txt exclusions are still valid.
    What does excluding these URLS actually do? Why would you want to do this?

  7. #7
    vBSEO Staff Oleg Ignatiuk's Avatar
    Real Name
    Oleg Ignatiuk
    Join Date
    Jun 2005
    Location
    Belarus
    Posts
    25,744
    Liked
    168 times
    Hello Josh,

    this prevents SE from indexing the "noise" pages, focusing the bots on the more valuable content and reducing the load on your server.

  8. #8
    Member
    Real Name
    Trana
    Join Date
    Jan 2006
    Posts
    77
    Liked
    0 times
    OK, I changed mine to:

    User-agent: *
    Disallow: /forum/calendar.php
    Disallow: /forum/editpost.php
    Disallow: /forum/memberlist.php
    Disallow: /forum/misc.php
    Disallow: /forum/newreply.php
    Disallow: /forum/newthread.php
    Disallow: /forum/private.php
    Disallow: /forum/register.php
    Disallow: /forum/report.php
    Disallow: /forum/search.php
    Disallow: /forum/showgroups.php
    Disallow: /forum/usercp.php
    Disallow: /forum/admincp/
    Disallow: /forum/modcp/
    Disallow: /forum/online.php
    Disallow: /forum/subscription.php
    Disallow: /forum/sendtofriend.php
    Disallow: /forum/threadrate.php
    Disallow: /forum/attachment.php
    Disallow: /forum/avatar.php
    Disallow: /forum/faq.php
    Disallow: /forum/usercp.php
    Disallow: /forum/profile.php
    Disallow: /users/
    Disallow: /gallery/
    Disallow: /banner/

    Any problems with doing this?

  9. #9
    vBSEO Staff Oleg Ignatiuk's Avatar
    Real Name
    Oleg Ignatiuk
    Join Date
    Jun 2005
    Location
    Belarus
    Posts
    25,744
    Liked
    168 times
    Hello Josh,

    these pages may have some content important to be indexed: calendar.php, showgroups.php, faq.php. Make sure that it's ok to exclude them at your forums.
    Also, I'm sure it's better to index your /gallery/ (and probably /users/) folders.

  10. #10
    Member
    Real Name
    mal
    Join Date
    Mar 2006
    Posts
    81
    Liked
    0 times
    As a matter of interest, where exactly is the correct place to install the robots.txt file. Each directory or only in the root?

  11. #11
    Senior Member
    Real Name
    Keith Cohen
    Join Date
    Jul 2005
    Location
    Raleigh, NC USA
    Posts
    6,147
    Liked
    13 times
    Only in the root. You can define specific paths to block though, as seen in the previous posts in this thread.

  12. #12
    Senior Member briansol's Avatar
    Real Name
    Brian
    Join Date
    Apr 2006
    Location
    Central CT, USA
    Posts
    6,981
    Liked
    8 times
    I use the following on my site.

    Code:
    User-agent: gigabot
    Disallow: /
    
    User-agent: *
    Disallow: /cgi-bin/
    Disallow: /admincp/
    Disallow: /includes/
    Disallow: /install/
    Disallow: /ioncube/
    Disallow: /mint/
    Disallow: /modcp/
    Disallow: /newreply.php
    Disallow: /newthread.php
    Disallow: /private.php
    Disallow: /printthread.php
    Disallow: /sendmessage.php
    Disallow: /showpost.php

  13. #13
    Member
    Real Name
    Trana
    Join Date
    Jan 2006
    Posts
    77
    Liked
    0 times
    Quote Originally Posted by Oleg Ignatiuk
    Also, I'm sure it's better to index your /gallery/ (and probably /users/) folders.
    If I don't want spiders to crawl my member profiles, should I block the users, members and profile.php?

  14. #14
    vBSEO Staff Oleg Ignatiuk's Avatar
    Real Name
    Oleg Ignatiuk
    Join Date
    Jun 2005
    Location
    Belarus
    Posts
    25,744
    Liked
    168 times
    Quote Originally Posted by trana
    If I don't want spiders to crawl my member profiles, should I block the users, members and profile.php?
    To disallow crawling of members info, you should include:
    Code:
    Disallow: /member.php
    Disallow: /members/
    (assuming your member profile URL format defined in vbseocp is "members/...")

Similar Threads

  1. robots.txt
    By Zenith in forum General Discussion
    Replies: 64
    Last Post: 12-01-2010, 07:52 PM
  2. Temp robots.txt Brand New Forum?
    By rmjvol in forum Pre-Sales Questions
    Replies: 7
    Last Post: 08-26-2006, 01:53 AM
  3. robots.txt
    By croportal in forum General Discussion
    Replies: 1
    Last Post: 02-05-2006, 12:31 PM
  4. robots.txt entries
    By shaochun in forum General Discussion
    Replies: 5
    Last Post: 12-10-2005, 07:18 PM
  5. "should" I use a robots.txt file?
    By drex in forum General Discussion
    Replies: 5
    Last Post: 11-03-2005, 08:47 AM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •