Results 1 to 15 of 65

robots.txt

This is a discussion on robots.txt within the General Discussion forums, part of the vBulletin SEO Discussion category.

  1. #1
    Junior Member
    Join Date
    Dec 2005
    Posts
    15
    Liked
    0 times

    robots.txt

    Hiyas

    Just looking for opinions on robots.txt. Should I disallow items like newthread.php, newreply.php, profile.php etc., or should I just have no robots.txt and let the bots go nuts, or something in between? What do you people do in this regard?

    Cheers

    Z

  2. #2
    ADM
    Senior Member
    Real Name
    Peter Papadopoulos
    Join Date
    Aug 2005
    Location
    Perth, Australia
    Posts
    254
    Liked
    0 times
    Personally I disallow them. Crawling them uses up resources and bandwidth, and it doesn't hurt to disallow them anyway.

    I just didn't like seeing bots trying to access those pages in Who's Online.

    Some search engines seem to ignore nofollow as well.

  3. #3
    Senior Member
    Real Name
    FAA Zooman
    Join Date
    Dec 2005
    Location
    Cumbria, UK
    Posts
    208
    Liked
    0 times
    A bit of a follow-on.

    Now that I have vBSEO, should I leave these in?

    User-agent: *
    Disallow: /printthread
    Disallow: /send message
    Disallow: /register
    Disallow: /login
    Disallow: /new reply
    Disallow: /subscription
    Disallow: /private
    Disallow: /misc
    Disallow: /show post
    Disallow: /report
    Disallow: /new thread
    Disallow: /calendar
    Disallow: /poll
    Disallow: /links
    Disallow: /arcade
    Disallow: /member list
    Disallow: /members
    Disallow: /search
    Disallow: /FAQ
    Disallow: /online
    Disallow: /vb shout
    And are there any on the list that should not be there (or any I have missed)?

    You know, I would rather have only 3 good pages indexed than 100 useless pages plus the 3 good ones.

  4. #4
    ADM
    Senior Member
    Real Name
    Peter Papadopoulos
    Join Date
    Aug 2005
    Location
    Perth, Australia
    Posts
    254
    Liked
    0 times
    I don't think you have the format of the robots.txt correct.

    You have spaces in your Disallow lines, and URLs aren't supposed to contain spaces.

    For an example, look at mine:
    http://www.alanwake.net/robots.txt
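
To illustrate the point about spaces, here is a minimal sketch using Python's standard `urllib.robotparser`. The helper name `is_allowed` and the two sample rule sets are made up for this example, and real crawlers may parse malformed lines differently from Python's parser, so treat it as a rough check rather than a statement about how any particular search engine behaves.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_lines, path, agent="*"):
    """Parse robots.txt lines and report whether `agent` may fetch `path`.

    `is_allowed` is a hypothetical helper for this thread, not part of vBulletin.
    """
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(agent, path)


# Rule written with a space, as pasted in post #3 (assumed sample).
with_space = ["User-agent: *", "Disallow: /new reply"]
# The same rule without the space.
without_space = ["User-agent: *", "Disallow: /newreply"]

# The rule with a space does not match the real script name,
# so the page stays crawlable; the rule without a space blocks it.
print("with space    ->", is_allowed(with_space, "/newreply.php"))
print("without space ->", is_allowed(without_space, "/newreply.php"))
```

With Python's parser, the rule containing a space is read as a path that includes the space, so it never matches `/newreply.php`.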

  5. #5
    Senior Member
    Real Name
    FAA Zooman
    Join Date
    Dec 2005
    Location
    Cumbria, UK
    Posts
    208
    Liked
    0 times
    It was done to be easier on the eyes when I pasted it from Notepad. It's not like that in the doc. But well spotted. Anyway, any feedback on whether to include them all, only some, or more?

  6. #6
    Member
    Join Date
    Nov 2005
    Posts
    53
    Liked
    0 times
    What about this?

    User-agent: *
    Disallow: /calendar.php
    Disallow: /editpost.php
    Disallow: /member.php
    Disallow: /memberlist.php
    Disallow: /misc.php
    Disallow: /newreply.php
    Disallow: /newthread.php
    Disallow: /printthread.php
    Disallow: /private.php
    Disallow: /register.php
    Disallow: /report.php
    Disallow: /search.php
    Disallow: /showgroups.php
    Disallow: /usercp.php
    Disallow: /impressum.php
    Disallow: /admincp/
    Disallow: /modcp/
    Disallow: /online.php
    Disallow: /subscription.php
    Disallow: /sendtofriend.php
    Disallow: /threadrate.php
    Disallow: /poll.php
    Disallow: /attachment.php
    Disallow: /avatar.php
    Disallow: /faq.php
    Disallow: /usercp.php
    Disallow: /profile.php

  7. #7
    Senior Member
    Real Name
    FAA Zooman
    Join Date
    Dec 2005
    Location
    Cumbria, UK
    Posts
    208
    Liked
    0 times
    Very good, quite a few I missed out.

  8. #8
    Member
    Join Date
    Nov 2005
    Posts
    53
    Liked
    0 times
    You can see my robots.txt here:

    http://www.netbusinesstalk.com/robots.txt

    Do you think this is good?

  9. #9
    BamaStangGuy
    Senior Member
    Real Name
    Brent Wilson
    Join Date
    Aug 2005
    Location
    Huntsville, Alabama
    Posts
    2,483
    Liked
    0 times
    Quote, originally posted by Spencer:
        You can see my robots.txt here:

        http://www.netbusinesstalk.com/robots.txt

        Do you think this is good?

    You have usercp in there twice.

    Other than that it is fine.
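
A repeated entry like the usercp one is easy to miss by eye. The sketch below is one possible way to spot repeats; `duplicate_disallows` and the `sample` text are invented for illustration, and as a simplification it ignores which `User-agent` group a rule belongs to, so it only suits single-group files like the ones posted here.

```python
from collections import Counter


def duplicate_disallows(robots_text):
    """Return Disallow paths that appear more than once, in first-seen order.

    Simplification (assumption): User-agent groups are ignored, so the same
    path listed under two different groups would also be reported.
    """
    paths = []
    for raw in robots_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and padding
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "disallow":
            paths.append(value.strip())
    counts = Counter(paths)
    return [path for path, seen in counts.items() if seen > 1]


# Shortened, made-up sample modelled on the list in post #6.
sample = """User-agent: *
Disallow: /usercp.php
Disallow: /poll.php
Disallow: /faq.php
Disallow: /usercp.php
"""

print(duplicate_disallows(sample))
```

A duplicate rule is harmless to crawlers, so this is only about keeping the file tidy.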

  10. #10
    Member
    Join Date
    Nov 2005
    Posts
    53
    Liked
    0 times
    ^^ Fixed.

  11. #11
    Senior Member
    Real Name
    FAA Zooman
    Join Date
    Dec 2005
    Location
    Cumbria, UK
    Posts
    208
    Liked
    0 times
    You don't need the .php part.
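
The reason the extension is optional is that Disallow rules match by URL prefix. The sketch below shows this with Python's standard `urllib.robotparser`; the helper `blocked` and the example paths are assumptions for illustration only. It also shows the flip side: a shorter prefix can block more than intended.

```python
from urllib.robotparser import RobotFileParser


def blocked(rule_path, url_path):
    """Return True if a single 'Disallow: rule_path' rule blocks url_path.

    `blocked` is a hypothetical helper; it builds a one-rule robots.txt
    for all user agents and asks Python's parser about the path.
    """
    parser = RobotFileParser()
    parser.parse(["User-agent: *", "Disallow: " + rule_path])
    return not parser.can_fetch("*", url_path)


# Made-up paths to compare the short and long forms of the rule.
for path in ["/calendar.php", "/calendars/2006/", "/index.php"]:
    print(path,
          "| /calendar blocks:", blocked("/calendar", path),
          "| /calendar.php blocks:", blocked("/calendar.php", path))
```

So `/calendar` covers `calendar.php`, but it would also cover any other URL that happens to start with `/calendar`, which is worth keeping in mind when trimming the rules.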

  12. #12
    racerx
    Junior Member
    Real Name
    Marcelo
    Join Date
    Dec 2005
    Posts
    3
    Liked
    0 times
    You can check whether you have a valid robots.txt at this link:

    http://tool.motoricerca.info/robots-...checkreferer=1

  13. #13
    Senior Member
    Real Name
    Michael
    Join Date
    Oct 2005
    Posts
    1,755
    Liked
    1 times
    Blog Entries
    1
    I see that most of you have a robots.txt file.

    How important is it to have it? I have never had one. I would assume you just create it and stick it in the root.
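
On "stick it in the root": crawlers look for the file at `/robots.txt` on the host itself, not inside a forum subdirectory. A small sketch of where that ends up for a given page, using Python's standard `urllib.parse`; the function name `robots_url` and the example.com address are placeholders, not a real site from this thread.

```python
from urllib.parse import urljoin


def robots_url(page_url):
    """Return the robots.txt location for the host serving page_url.

    `robots_url` is a hypothetical helper; joining with an absolute path
    replaces the page's own path and query with /robots.txt.
    """
    return urljoin(page_url, "/robots.txt")


# Placeholder forum URL, assumed for the example.
print(robots_url("http://www.example.com/forum/showthread.php?t=123"))
```

If the forum lives in a subdirectory such as `/forum/`, the Disallow paths in the root file would need that prefix as well.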

  14. #14
    Senior Member
    Real Name
    Michael
    Join Date
    Oct 2005
    Posts
    1,755
    Liked
    1 times
    Blog Entries
    1
    I did quite a bit of research on robots.txt yesterday, and from most of what I read, it appears important to have the file even if it is one telling robots they can follow anything. Otherwise their requests for it tend to end up as 404 errors or something like that.

    I have since added one to my site that disallows nothing, and my AWStats even shows how many times a spider successfully hit the robots.txt file.

    Maybe this will help my indexing. *crosses fingers*
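
For reference, a file that "disallows nothing" is usually written with an empty Disallow line. The poster's actual file is not shown in the thread, so the content below is an assumption about what such a file looks like; the check uses Python's standard `urllib.robotparser`, and the sample paths are made up.

```python
from urllib.robotparser import RobotFileParser

# Assumed content of an allow-everything robots.txt:
# an empty Disallow value means no path is off limits.
ALLOW_ALL = """\
User-agent: *
Disallow:
"""

parser = RobotFileParser()
parser.parse(ALLOW_ALL.splitlines())

# Every path should be reported as fetchable for any user agent.
for path in ["/", "/showthread.php", "/newreply.php"]:
    print(path, parser.can_fetch("*", path))
```

Note the difference from `Disallow: /`, which blocks the entire site.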

  15. #15
    Member
    Join Date
    Nov 2005
    Posts
    53
    Liked
    0 times
    Can you post your robots.txt in this thread?
