TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Google bot delays executing JavaScript for days

128 pointsby dbeardslover 12 years ago

8 comments

h2sover 12 years ago
<p><pre><code> &#62; If you're removing code or changing an endpoint, &#62; be careful you don't screw the Google bot, which &#62; might be "viewing" 3-day-old pages on your &#62; altered backend. </code></pre> An interesting proposition. Personally, unless I was operating in some sector where keeping Googlebot happy was key to staying competitive and there was solid evidence it could hurt my page rank, I don't think I'd be prepared to go to this length. Google is doing quite an atypical thing here compared to regular browsers and I'd like to think Google engineers are smart enough to account for this type of thing in the early stages of planning.<p>They have a difficult cache invalidation problem here. The only way to find out if the Javascript in use on a site has changed is by checking if the page HTML has changed. And on top of that, the Javascript can change without any noticeable change to the HTML.
评论 #4686398 未加载
ashrayover 12 years ago
Googlebot also does some other crazy stuff. Like looking at url patterns and then trying out variations.. they're almost trying to sniff URLs!<p>For example if I have a page: www.domain.com/xyz/123<p>Googlebot (without any links to other pages, will actually try URLs like) www.domain.com/xyz/1234 www.domain.com/xyz/122 www.domain.com/xyz/121 and so on...<p>It's crazy how much 'looking around' they do these days!
评论 #4686397 未加载
eliover 12 years ago
I'm not too surprised. I've got Googlebot still requesting old URLs even through there are no incoming links to them (that I know of) and they've been either 404 or 301 redirected for six months. I even tried using 410 Gone instead of 404, but it made no difference.
评论 #4685842 未加载
评论 #4685478 未加载
jes5199over 12 years ago
Your users may be, too. It's not unusual for me to open my sleeping laptop several days later and expect the open web pages to work without refreshing them.
评论 #4685975 未加载
评论 #4689456 未加载
TazeTSchnitzelover 12 years ago
I wonder if it is Google's visual site previews/thumbnails that you get when you click on the arrow at the side of a search result, that are doing this.<p>Perhaps Google fetches the crawled page from the cache and then renders that for the previews?
评论 #4686505 未加载
georgemcbayover 12 years ago
Is this surprising? I'd expect the possibility of this sort of behavior from any system that was vaguely Map-Reduce-y and operated on the scale of data that Google's indexing does.
ericcholisover 12 years ago
I'm wondering if some of the simpler cache-busting tricks would force google update their cache. For example, somescript.js?v=201210221559.
评论 #4685239 未加载
listsover 12 years ago
Did anyone else get really bad font rendering running Chrome on Windows 7?
评论 #4686598 未加载