
Robots.txt now supports the sitemap file

  1. #1
    Senior Member
    Real Name
    Mohamed
    Join Date
    Dec 2006
    Posts
    3,891
    Liked
    1 times

    Robots.txt now supports the sitemap file

    Hello,
    I just saw this on the Google blog, and it's really useful since we won't have to submit our sitemaps to every search engine individually anymore:

    it's now easier for you to tell us where your Sitemaps live. We wondered if we could make it so easy that you wouldn't even have to tell us and every other search engine that supports Sitemaps. But how? Well, every website can have a robots.txt file in a standard location, so we decided to let you tell us about your Sitemap in the robots.txt file. All you have to do is add a line like

    Sitemap: http://www.mysite.com/sitemap.xml

    to your robots.txt file. Just make sure you include the full URL, including the http://. That's it. Of course, we still think it's useful to submit your Sitemap through Webmaster tools so you can make sure that the Sitemap was processed without any issues and you can get additional statistics about your site
    Official Google Webmaster Central Blog: What's new with Sitemaps.org?
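    To see what a crawler would pick up from that line, here is a minimal sketch, assuming Python 3.8 or later (where the standard library's `RobotFileParser.site_maps()` is available). The robots.txt body and the helper name `sitemaps_from_robots` are made up for illustration; only the `Sitemap:` line comes from the quote above.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt body; the Sitemap URL is the placeholder from the quote.
ROBOTS_TXT = """\
User-agent: *
Disallow:

Sitemap: http://www.mysite.com/sitemap.xml
"""


def sitemaps_from_robots(text):
    """Return the Sitemap URLs declared in a robots.txt body (empty list if none)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    # site_maps() returns None when the file has no Sitemap line.
    return parser.site_maps() or []


sitemap_urls = sitemaps_from_robots(ROBOTS_TXT)
print(sitemap_urls)
```

    This only shows how Python's own parser reads the file; each search engine uses its own parser, so treat it as a sanity check rather than a guarantee.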

  2. #2
    Senior Member
    Real Name
    Joseph Ward
    Join Date
    Jun 2005
    Posts
    23,847
    Liked
    32 times
    Blog Entries
    9
    Very cool. They've really simplified the process, and I'm sure other search engines will now follow.

  3. #3
    Senior Member
    Real Name
    Nick Le Mouton
    Join Date
    Nov 2005
    Location
    New Zealand
    Posts
    422
    Liked
    14 times
    Looks good. Ask.com and Microsoft Live now also support sitemaps through autodiscovery (as well as Yahoo and Google), which means all the major search engines will soon know about every page of your site!

    Awesome

  4. #4
    Senior Member NeutralizeR's Avatar
    Real Name
    Mavi KARANLIK
    Join Date
    Feb 2006
    Location
    Ankara/TÜRKİYE
    Posts
    310
    Liked
    1 times
    The vBSEO Sitemap Generator's sitemap index file is: sitemap_index.xml.gz

    Is the .gz format supported?

  5. #5
    vBSEO Staff Oleg Ignatiuk's Avatar
    Real Name
    Oleg Ignatiuk
    Join Date
    Jun 2005
    Location
    Belarus
    Posts
    25,689
    Liked
    157 times
    Yes, Google supports gzip compressed sitemaps.
    Oleg Ignatiuk / Crawlability Inc.

  6. #6
    Senior Member NeutralizeR's Avatar
    Real Name
    Mavi KARANLIK
    Join Date
    Feb 2006
    Location
    Ankara/TÜRKİYE
    Posts
    310
    Liked
    1 times
    Quote: Originally Posted by Oleg Ignatiuk
    Yes, Google supports gzip compressed sitemaps.

    I was referring to this:

    "We wondered if we could make it so easy that you wouldn't even have to tell us and every other search engine that supports Sitemaps."

    but I did some research, and gzip seems to be supported by the other search engines too.

    Added my line as Sitemap: http://www.mysite.com/sitemap.xml.gz
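    For reference, a gzip-compressed sitemap index is just the XML run through gzip. Below is a minimal sketch of reading one with the Python standard library. The index content is built in memory so it runs offline, and the child file names (`sitemap_1.xml.gz`, `sitemap_2.xml.gz`) and the function name `list_child_sitemaps` are assumptions for illustration, not what vBSEO actually generates.

```python
import gzip
import xml.etree.ElementTree as ET

# XML namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"

# Hypothetical sitemap index; the child file names are made up.
INDEX_XML = b"""<?xml version="1.0" encoding="UTF-8"?>
<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <sitemap><loc>http://www.mysite.com/sitemap_1.xml.gz</loc></sitemap>
  <sitemap><loc>http://www.mysite.com/sitemap_2.xml.gz</loc></sitemap>
</sitemapindex>
"""


def list_child_sitemaps(gz_bytes):
    """Decompress a gzipped sitemap index and return the <loc> URLs it lists."""
    xml_bytes = gzip.decompress(gz_bytes)
    root = ET.fromstring(xml_bytes)
    return [loc.text.strip() for loc in root.iter(SITEMAP_NS + "loc")]


compressed = gzip.compress(INDEX_XML)  # stands in for sitemap_index.xml.gz
child_sitemaps = list_child_sitemaps(compressed)
print(child_sitemaps)
```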

  7. #7
    Senior Member
    Real Name
    Michael
    Join Date
    Oct 2005
    Posts
    1,755
    Liked
    1 times
    Blog Entries
    1
    Thanks for posting this information.

  8. #8
    Junior Member
    Real Name
    romeo
    Join Date
    Apr 2007
    Posts
    13
    Liked
    0 times
    Hi,

    Please excuse me, I am a newbie.

    1) Where should I put the robots.txt? In the site root or in the forum folder?
    2) I have never made a robots.txt for my vBulletin. How do I create one, and what should I put in it?

    thanks

  9. #9
    Senior Member
    Real Name
    Michael
    Join Date
    Oct 2005
    Posts
    1,755
    Liked
    1 times
    Blog Entries
    1
    The file should go in the site root. It is important to have one because virtually all spiders first check whether this file exists on your site, to see if there are specific instructions for them to follow.

    What you can or should put in there is a debate that will go on for years, and partly depends on what you have in your site that you don't want search engines indexing.

    Here is an example from one of my sites

    Code:
    User-agent: *
    Disallow: /forum/vbseo_sitemap
    Disallow: /forum/admincp
    Disallow: /forum/attachments
    Disallow: /forum/attachment.php
    Disallow: /forum/arcade.php
    Disallow: /forum/calendar.php?do=add
    Disallow: /forum/cron.php
    Disallow: /forum/editpost.php
    Disallow: /forum/login.php
    Disallow: /forum/modcp
    Disallow: /forum/moderator.php
    Disallow: /forum/membermap.php
    Disallow: /forum/newreply.php
    Disallow: /forum/newthread.php
    Disallow: /forum/online.php
    Disallow: /forum/payments.php
    Disallow: /forum/pda
    Disallow: /forum/postings.php
    Disallow: /forum/printthread.php
    Disallow: /forum/private.php
    Disallow: /forum/profile.php
    Disallow: /forum/register.php
    Disallow: /forum/report.php
    Disallow: /forum/reputation.php
    Disallow: /forum/sendtofriend.php
    Disallow: /forum/search.php
    Disallow: /forum/sendmessage.php
    Disallow: /forum/showpost.php
    Disallow: /forum/subscription.php
    Disallow: /forum/threadrate.php
    Disallow: /forum/usercp.php
    Disallow: /forum/spy.php
    Disallow: /forum/tags/
    Here is an example of someone just blocking specific spiders
    Code:
    User-agent: BoardTracker
    Disallow: /
    
    User-agent: Gigabot
    Disallow: /
    Or, you could simply have a robots.txt file that is blank. You don't have to have anything in it, but most believe it is helpful to restrict spiders from going through unimportant files, and/or files that don't need to be seen by the public.

    Having a robots.txt file that is blank simply tells spiders that everything is available to be searched.
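    If you want to test rules like these before uploading them, here is a rough sketch using Python's standard `urllib.robotparser`. It uses a shortened version of the examples above; the crawler name `ExampleBot` and the page `showthread.php` are assumptions for illustration, and real spiders may interpret edge cases differently from Python's parser.

```python
from urllib.robotparser import RobotFileParser


def build_parser(text):
    """Parse a robots.txt body given as a string."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


# Shortened version of the first example: block a few paths for every spider.
BLOCK_PATHS = """\
User-agent: *
Disallow: /forum/admincp
Disallow: /forum/search.php
"""

# Second example: block two specific spiders completely.
BLOCK_SPIDERS = """\
User-agent: BoardTracker
Disallow: /

User-agent: Gigabot
Disallow: /
"""

paths = build_parser(BLOCK_PATHS)
spiders = build_parser(BLOCK_SPIDERS)
blank = build_parser("")  # a blank robots.txt

SITE = "http://www.mysite.com"
admin_allowed = paths.can_fetch("ExampleBot", SITE + "/forum/admincp/index.php")
thread_allowed = paths.can_fetch("ExampleBot", SITE + "/forum/showthread.php")
tracker_allowed = spiders.can_fetch("BoardTracker", SITE + "/forum/")
other_allowed = spiders.can_fetch("ExampleBot", SITE + "/forum/")
blank_allowed = blank.can_fetch("ExampleBot", SITE + "/forum/admincp/index.php")

print(admin_allowed, thread_allowed, tracker_allowed, other_allowed, blank_allowed)
```

    The last check matches the point about a blank file: with no rules at all, nothing is off limits.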

  10. #10
    Senior Member Code Monkey's Avatar
    Real Name
    Code Monkey
    Join Date
    Aug 2006
    Posts
    780
    Liked
    0 times
    You should list your sitemap index file, not your individual sitemaps.

  11. #11
    Junior Member
    Real Name
    romeo
    Join Date
    Apr 2007
    Posts
    13
    Liked
    0 times
    Thanks

    No, I don't have a private forum ...

    So I have made a robots.txt containing only this code:


    Code:
    Sitemap: http://www.mysite.com/forum/vbseo_sitemap/data/sitemap_index.xml.gz
    and put it in my site root:
    www.mysite.com/robots.txt

    So is everything OK?

  12. #12
    Junior Member
    Real Name
    romeo
    Join Date
    Apr 2007
    Posts
    13
    Liked
    0 times
    excuse this duplicate post...

  13. #13
    Senior Member
    Real Name
    Keith Cohen
    Join Date
    Jul 2005
    Location
    Raleigh, NC USA
    Posts
    6,147
    Liked
    12 times
    Quote: Originally Posted by rinka128
    Thanks

    No, I don't have a private forum ...

    So I have made a robots.txt containing only this code:


    Code:
    Sitemap: http://www.mysite.com/forum/vbseo_sitemap/data/sitemap_index.xml.gz
    and put it in my site root:
    www.mysite.com/robots.txt

    So is everything OK?

    No, you should use:

    Sitemap: http://www.mysite.com/forum/sitemap_index.xml.gz

  14. #14
    Senior Member briansol's Avatar
    Real Name
    Brian
    Join Date
    Apr 2006
    Location
    Central CT, USA
    Posts
    6,981
    Liked
    8 times
    Quote: Originally Posted by rinka128
    Thanks

    No, I don't have a private forum ...

    So I have made a robots.txt containing only this code:


    Code:
    Sitemap: http://www.mysite.com/forum/vbseo_sitemap/data/sitemap_index.xml.gz
    and put it in my site root:
    www.mysite.com/robots.txt

    So is everything OK?

    No, you only want to link to the root, as vBSEO has the .htaccess condition to rewrite it to its 'real' location.

    http://www.mysite.com/forum/sitemap_index.xml.gz
    or
    http://www.mysite.com/sitemap_index.xml.gz
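    As a small sketch of how either line could be generated, the helper below builds the `Sitemap:` line from a base URL and refuses anything that is not a full URL, per the "include the http://" advice from the Google post. The function name `sitemap_line` is made up for illustration, and the code assumes the index is served as `sitemap_index.xml.gz` directly under the base URL, as described above.

```python
from urllib.parse import urljoin, urlparse


def sitemap_line(base_url, index_name="sitemap_index.xml.gz"):
    """Build a robots.txt Sitemap line for an index served under base_url."""
    parts = urlparse(base_url)
    # The Sitemap line needs a full URL, including the scheme.
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError("base_url must be a full URL including http:// or https://")
    # A trailing slash makes urljoin append to the folder instead of replacing it.
    if not base_url.endswith("/"):
        base_url += "/"
    return "Sitemap: " + urljoin(base_url, index_name)


forum_line = sitemap_line("http://www.mysite.com/forum")
root_line = sitemap_line("http://www.mysite.com/")
print(forum_line)
print(root_line)
```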

  15. #15
    Junior Member
    Real Name
    romeo
    Join Date
    Apr 2007
    Posts
    13
    Liked
    0 times
    Ok ! thanks :-)
