Results 1 to 5 of 5

Confused about Spiders...

This is a discussion on Confused about Spiders... within the General Discussion forums, part of the vBSEO SEO Plugin category; I am confused about these spiders... Even though my site has been up for like 3 months...301 twice to its ...

  1. #1
    Senior Member
    Real Name
    gotlinks
    Join Date
    Jun 2006
    Posts
    202
    Liked
    5 times

    Confused about Spiders...

    I am confused about these spiders...

    Even though my site has been up for like 3 months...301 twice
    to its new and final domain. The 301 indexing seems to be
    fairly quick, even with the brand new domain, and spiders seem
    to follow the 301 to where the domain is, np...but it all seems
    to be unstable...1 min I will have 5 spiders, the next 20,25,30,
    a few min later, 0 spiders, then it starts all over again, nothing
    appears to be really stable at all...

    Then I have the baidu..I know this spider is from china, but
    what is its purpose on my site if people in china can not
    visit the site since they are not allowed open access...so
    it does not make sense on why it is even there....

    It would seem the other spiders are just as worthless, they could
    be hackers, spammers, scrapers, etc...junk spiders...

    and google/yahoo seem to show up randomly throughout the day...

    ideas?

  2. #2
    vBSEO Staff Brian Cummiskey's Avatar
    Real Name
    Brian Cummiskey
    Join Date
    Jul 2009
    Location
    btwn NYC and Boston
    Posts
    12,789
    Liked
    657 times
    Blog Entries
    2
    Spiders will come and go as they please. There's no controlling them. Since they don't really create a session, it can often appear that more than one is on your site. In reality, it's usually just one spider loading X pages and appearing as X spiders online. Sometimes there are more than one, but most times not.

    All spiders will visit all sites they want to unless told otherwise. You can block baidu the same way you blocked china, via htaccess and/or robots.txt directives.

    All in all, I find blocking spiders to be generally a waste of time. Unless you're peaking resources and every little bit counts, I wouldn't waste any time trying to tame spiders. There's easily 100,000 known bots/spiders for various items and sites and trying to exclude the 'bad' ones is a daunting effort for effectively 0 gain. And finally, if you're at this point of peak resources, your time will be better spent migrating to better/newer/faster hardware.

  3. #3
    Senior Member webmastersitesi's Avatar
    Join Date
    Oct 2007
    Posts
    518
    Liked
    16 times
    Blog Entries
    3
    You can use robots.txt to allow only google yahoo and bign spiders and disallow the rest. You can implement this solution to htaccess but either ways will cost you spend some resources.

    Then I would recommend you check cloudflare system. They have a pretty good and free solution for harmful spiders and bots. They use general blacklist for such bad intentioned spiders and keep them away from reaching your site at dns level. It maybe little bit complicated plase check my final blog post.

  4. #4
    Senior Member
    Real Name
    gotlinks
    Join Date
    Jun 2006
    Posts
    202
    Liked
    5 times
    Quote Originally Posted by Brian Cummiskey View Post
    Spiders will come and go as they please. There's no controlling them. Since they don't really create a session, it can often appear that more than one is on your site. In reality, it's usually just one spider loading X pages and appearing as X spiders online. Sometimes there are more than one, but most times not.

    All spiders will visit all sites they want to unless told otherwise. You can block baidu the same way you blocked china, via htaccess and/or robots.txt directives.

    All in all, I find blocking spiders to be generally a waste of time. Unless you're peaking resources and every little bit counts, I wouldn't waste any time trying to tame spiders. There's easily 100,000 known bots/spiders for various items and sites and trying to exclude the 'bad' ones is a daunting effort for effectively 0 gain. And finally, if you're at this point of peak resources, your time will be better spent migrating to better/newer/faster hardware.
    ok thats good and all...but I do not understand why they visit old forum content like 3 months old,
    rather then bombarding the new daily content, and blog content...I see them mostly visiting
    parts of the forum, rarely do i see any on the VB Blog...

    whadaya know, I see a vBSEO spider...is that really a vBSEO spider?

  5. #5
    vBSEO Staff Brian Cummiskey's Avatar
    Real Name
    Brian Cummiskey
    Join Date
    Jul 2009
    Location
    btwn NYC and Boston
    Posts
    12,789
    Liked
    657 times
    Blog Entries
    2
    The vBSEO spider is used for the linkback/trackback/refback system. If you're seeing one on your site, another vbseo-powered forum likely has linked to you.

    I cannot comment on how or why spiders do what they do. Those questions are probably better asked to the spider owners themselves as I/we have no control over what or why they do.

Similar Threads

  1. Sorry I Am Confused
    By dieselpowered in forum LinkBacks
    Replies: 18
    Last Post: 08-25-2008, 01:54 PM
  2. Ugh... confused somewhat.
    By PicoDeath in forum General Discussion
    Replies: 6
    Last Post: 02-27-2008, 08:33 PM
  3. Want to buy but got confused...
    By Anatoliy in forum Pre-Sales Questions
    Replies: 9
    Last Post: 11-09-2006, 06:33 PM
  4. Confused
    By mototips in forum Pre-Sales Questions
    Replies: 5
    Last Post: 08-17-2006, 03:54 PM
  5. Rather confused
    By Mirzone in forum General Discussion
    Replies: 4
    Last Post: 03-21-2006, 01:12 PM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •