vBulletin SEO Forums


robots.txt timeout?


Forum: vBSEO Google/Yahoo Sitemap > Troubleshooting
  #1  
Old 04-05-2008, 05:31 PM
amenadiel's Avatar
Senior Member
Big Board Administrator
 
Real Name: Felipe CHW
Join Date: Feb 2007
Location: Santiago, Chile
Posts: 163
robots.txt timeout?

For some reason, my sitemap stopped working. It's not vBSEO's fault, because Joomla's sitemap is also reported as broken in Google Webmaster Tools.

Quote:
URL timeout: robots.txt timeout
We encountered an error while trying to access your Sitemap. Please ensure your Sitemap follows our guidelines and can be accessed at the location you provided and then resubmit.
Here's my robots.txt
http://www.chilehardware.com/robots.txt

These are my sitemaps:
VBSEO's: http://www.chilehardware.com/sitemap_index.xml.gz
Joomla's: http://www.chilehardware.com/index.p...=xml&no_html=1


How can a robots.txt be wrong? I edited it two days ago to add a crawl delay for Yahoo Slurp. After noticing that Google was avoiding my site, I removed that line. Now it doesn't work anymore.
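
One way to rule out the rules themselves is to parse the file offline and ask what each crawler is allowed to do. Below is a minimal Python sketch using the standard library's `urllib.robotparser`. The robots.txt content and the example.com URLs are hypothetical, since the actual file from this post is not reproduced here. If the rules check out, a "robots.txt timeout" is more likely a problem fetching the file than a problem with its contents.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: a crawl delay for Yahoo Slurp only,
# plus one generic Disallow rule. Not the poster's real file.
SAMPLE_ROBOTS = """\
User-agent: Slurp
Crawl-delay: 10

User-agent: *
Disallow: /admin/
"""


def check_rules(robots_text, agent, url):
    """Return (allowed, crawl_delay) for one user agent, parsed from text."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser.can_fetch(agent, url), parser.crawl_delay(agent)


if __name__ == "__main__":
    sitemap = "http://www.example.com/sitemap_index.xml.gz"  # placeholder URL
    for agent in ("Googlebot", "Slurp"):
        print(agent, check_rules(SAMPLE_ROBOTS, agent, sitemap))
```

With this sample file, the delay applies only to Slurp and Googlebot is still allowed to fetch the sitemap, which is the behavior the edit was meant to have.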
  #2  
Old 04-05-2008, 07:54 PM
amenadiel's Avatar
Senior Member
Big Board Administrator
 
Real Name: Felipe CHW
Join Date: Feb 2007
Location: Santiago, Chile
Posts: 163
Update: I temporarily removed robots.txt to see what happens.
  #3  
Old 04-05-2008, 08:13 PM
Joe Ward's Avatar
vBSEO Staff
vBSEO Total Customer SupportvBSEO Documenter
 
Real Name: Joseph Ward
Join Date: Jun 2005
Location: Puerto Rico
Posts: 20,190
Blog Entries: 7
I think someone else recently reported strange issues after updating their robots.txt file.
__________________
Joe Ward / Crawlability Inc.
Support Team Launches New DeskPro Powered Tool Enhanced Support at Your Service

vBSEO 3.2.0 Launched - Maximum Overdrive for Your Web Traffic! Over 100 Instant SEO Optimizations

6X Traffic - $1400 in One Day with vBSEO! Imagine What the vBSEO Patent Pending Technology Can Do For You.

  #4  
Old 04-05-2008, 08:18 PM
amenadiel's Avatar
Senior Member
Big Board Administrator
 
Real Name: Felipe CHW
Join Date: Feb 2007
Location: Santiago, Chile
Posts: 163
I thought that perhaps I had uploaded it in binary mode. *shrug* Anyway, I have no robots.txt file right now, so let's see what happens.
  #5  
Old 04-06-2008, 03:32 PM
amenadiel's Avatar
Senior Member
Big Board Administrator
 
Real Name: Felipe CHW
Join Date: Feb 2007
Location: Santiago, Chile
Posts: 163
Ok, I removed robots.txt and now I'm getting this message:

Quote:
Network unreachable: Network unreachable
We encountered an error while trying to access your Sitemap. Please ensure your Sitemap follows our guidelines and can be accessed at the location you provided and then resubmit.
I guess I'm somehow blocking Googlebot from accessing my site, but where should I look for that?
  #6  
Old 04-06-2008, 03:58 PM
Oleg Ignatiuk's Avatar
vBSEO Staff
vBSEO Total Customer SupportvBSEO Documenter
 
Real Name: Oleg Ignatiuk
Join Date: Jun 2005
Location: Belarus
Posts: 21,923
Please check this similar case: Google-Sitemap not loaded (403)
__________________
Oleg Ignatiuk / Crawlability Inc.
  #7  
Old 04-09-2008, 04:13 AM
Senior Member
 
Real Name: Seleno
Join Date: Mar 2007
Posts: 211
Blog Entries: 1
Hi there,
I have the same problem. How can I solve it? Is it really caused by the server?
My sitemaps load and open normally. For a week everything was fine with them and I could see all the URLs, but for the past two days I have been getting this message:
Network unreachable: robots.txt unreachable
We encountered an error while trying to access your Sitemap. Please ensure your Sitemap follows our guidelines and can be accessed at the location you provided and then resubmit.
  #8  
Old 04-09-2008, 05:55 AM
Oleg Ignatiuk's Avatar
vBSEO Staff
vBSEO Total Customer SupportvBSEO Documenter
 
Real Name: Oleg Ignatiuk
Join Date: Jun 2005
Location: Belarus
Posts: 21,923
You should contact your host to find out whether they apply the same filtering.
__________________
Oleg Ignatiuk / Crawlability Inc.
  #9  
Old 04-09-2008, 09:17 AM
amenadiel's Avatar
Senior Member
Big Board Administrator
 
Real Name: Felipe CHW
Join Date: Feb 2007
Location: Santiago, Chile
Posts: 163
I've ruled out the four most common causes:

1.- It's not robots.txt, because I validated, edited and removed it, with no change whatsoever.
2.- It's not my firewall (double-checked, and the logs show status 200 for Googlebot when it tries to crawl).
3.- It's not my host. My server isn't behind any filter or external firewall, only my own.
4.- It's not the DNS, because I moved my domain's DNS to another nameserver and the problem is still there.

It's amazing. It took me five years to get there, and Google took a week to destroy it.
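
The log check in point 2 can be made repeatable by tallying the status codes served to Googlebot. This is a minimal Python sketch that assumes the Apache/nginx "combined" access log format; the poster's actual log format is not stated, and the function name and sample line are made up for illustration.

```python
import re
from collections import Counter

# Assumes "combined" log format: ... "METHOD /path PROTO" STATUS SIZE "REFERER" "AGENT"
LOG_LINE = re.compile(
    r'"\S+ (?P<path>\S+) [^"]*" (?P<status>\d{3}) .*"(?P<agent>[^"]*)"$'
)


def googlebot_statuses(lines):
    """Count (path, status) pairs for requests whose user agent mentions Googlebot."""
    counts = Counter()
    for line in lines:
        match = LOG_LINE.search(line.strip())
        if match and "googlebot" in match.group("agent").lower():
            counts[(match.group("path"), match.group("status"))] += 1
    return counts


if __name__ == "__main__":
    # Illustrative line only; 192.0.2.10 is a documentation address.
    sample = [
        '192.0.2.10 - - [09/Apr/2008:09:00:00 -0400] "GET /robots.txt HTTP/1.1" '
        '200 120 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"'
    ]
    print(googlebot_statuses(sample))
```

Seeing only 200s here shows that the requests reaching the web server are answered. It says nothing about requests dropped before they reach it, and a user agent string alone does not prove the visitor is really Googlebot.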
  #10  
Old 04-09-2008, 09:47 AM
Senior Member
 
Real Name: Seleno
Join Date: Mar 2007
Posts: 211
Blog Entries: 1
OMG, what should we do then?
Did Google remove your site?
  #11  
Old 04-10-2008, 02:01 AM
amenadiel's Avatar
Senior Member
Big Board Administrator
 
Real Name: Felipe CHW
Join Date: Feb 2007
Location: Santiago, Chile
Posts: 163
Nope, it's not a Google ban, nor have we done anything that could count as "black hat SEO techniques". It's just that Google's DNS cannot resolve my domain.

I've tried making Google crawl my site directly by its IP address, and it does, but of course it retrieves a sitemap full of my URLs, which in turn it cannot crawl.
  #12  
Old 04-10-2008, 12:28 PM
Senior Member
 
Real Name: Seleno
Join Date: Mar 2007
Posts: 211
Blog Entries: 1
My host's support told me the same thing at first, but I kept asking them, and then they found that they were blocking Googlebot's IP.
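
When asking a host whether it blocks Googlebot, it helps to confirm that a blocked address really belongs to Google. The usual check is a reverse DNS lookup on the IP, a check that the hostname is under googlebot.com or google.com, and a forward lookup to confirm the name maps back to the same IP. This is a minimal Python sketch of that check; the function name and the injectable resolver parameters are assumptions made here so it can be tried without network access.

```python
import socket

# Domains that genuine Googlebot reverse DNS names are expected to end with.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")


def is_google_crawler(ip, reverse=socket.gethostbyaddr,
                      forward=socket.gethostbyname_ex):
    """Reverse-resolve the IP, check the domain, then forward-resolve to confirm."""
    try:
        host = reverse(ip)[0]  # (hostname, aliases, addresses)
        if not host.lower().endswith(GOOGLE_SUFFIXES):
            return False
        # The forward lookup must list the original IP among its addresses.
        return ip in forward(host)[2]
    except OSError:
        # Lookup failures (socket.herror, socket.gaierror) count as "not verified".
        return False
```

Calling `is_google_crawler("192.0.2.1")` with the default resolvers performs real DNS lookups. Passing your own `reverse` and `forward` functions lets you test the logic offline.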
  #13  
Old 04-11-2008, 09:03 PM
amenadiel's Avatar
Senior Member
Big Board Administrator
 
Real Name: Felipe CHW
Join Date: Feb 2007
Location: Santiago, Chile
Posts: 163
That's not the case here. I'm using an external DNS, and my server is directly connected to the internet.

To make sure, I took a spare domain I have, pointed it to my machine, installed a copy of my site in another folder, and created the vhost. Then I went to Google Webmaster Tools and added that domain. Guess what: it passed with flying colors.

I insist: Google cannot resolve my domain.
  #14  
Old 04-13-2008, 04:10 AM
amenadiel's Avatar
Senior Member
Big Board Administrator
 
Real Name: Felipe CHW
Join Date: Feb 2007
Location: Santiago, Chile
Posts: 163
Edit: Finally, on Saturday, the problem resolved itself out of the blue. Google is back.

I lost a lot of pageviews, but I hope we'll soon catch up to what we had.

All times are GMT -4. The time now is 09:17 PM.

