TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: What do you do when your website seems to be penalised by Google?

41 pointsby mmavnnover 10 years ago
I have a personal blog (mostly dev related); it&#x27;s been going for a while. On a couple of specialist subjects (F# type providers being the main example) some of the posts are reasonably popular and linked to by many other people. Although it&#x27;s a small site, on these subjects it tends to show up in the first page on Bing, Duck Duck Go, etc for searches like &quot;Type provider tutorial&quot; and right at the top if you use a specific phrase (like the title of my most popular post, &quot;Type Providers from the Ground Up&quot;.<p>Google hates it. Basically, however specific the query, my blog never turns up unless you actually put the base url into your query. Ironically, plenty of spam sites&#x27; copies of the posts appear quite high in the search results.<p>What do you do in these types of situations? I&#x27;ve done no SEO beyond writing content, so I&#x27;m pretty sure I&#x27;ve used no &quot;black hat&quot; techniques. I&#x27;ve no ads, no duplicate content. Google webmaster tools claims the site is not blacklisted and that there is nothing wrong with it.<p>It feels wrong and possibly pointless to start again several years down the line with a new url just because Google doesn&#x27;t seem to like the current one; but on the other hand, the lack of organic search results will always be a limit on the readership. For a personal blog this is irritating and disappointing - if I was freelance or this was my company blog, it would be a real and immediate financial hit.<p>Thoughts or advice for people facing this situation?

8 comments

patio11over 10 years ago
This is a hard thing to debug from outside the Googleplex, but you are currently serving a canonical tag:<p>&lt;link rel=&quot;canonical&quot; href=&quot;<a href="http://blog.mavnn.co.uk/type-providers-from-the-ground-up&quot;&gt;" rel="nofollow">http:&#x2F;&#x2F;blog.mavnn.co.uk&#x2F;type-providers-from-the-ground-up&quot;&gt;</a><p>for a URL which cannot possibly return an HTTP 200. (It 301s to a URL with a &#x2F; on the end.)<p>This combination <i>could</i> cause Google to conclude that you have no page which requires inclusion in their main index.
评论 #8873260 未加载
评论 #8873263 未加载
mtbcoderover 10 years ago
Regarding the spam sites, in your RSS feed, you are publishing your full articles. More than likely, the scraper sites are pulling directly from these feeds, publishing quickly and getting Googlebot to see the content before it hits your site (thus receiving attribution). I would suggest:<p>1) Summaries only in RSS feeds. 2) Throttle the RSS feed back by several hours so that your latest article is not listed immediately. 3) Upon publishing, immediately link to the article via all of your social media outlets. 4) When internally linking within articles, use full URL paths and not relative. (If the spam sites are directly pulling your content and not cleaning up, you may be able to get a link back to your site from the scraped content.)<p>When publishing, timing is everything. Just my $0.02 based on my own experiences dealing with spam sites.<p>On a side note, even though we are in the age of HTML5, I would still suggest sticking with one H1 tag per page, if possible.
评论 #8873964 未加载
dredgeover 10 years ago
Early last year Google asked people to report such problems. You probably won&#x27;t see any direct impact from doing so, but there&#x27;s no harm in trying.<p>The form is still live at least: <a href="http://searchengineland.com/google-scraper-tool-185532" rel="nofollow">http:&#x2F;&#x2F;searchengineland.com&#x2F;google-scraper-tool-185532</a>
JVerstryover 10 years ago
First, make sure your sitemap.xml is exhaustive. Then, check the number of indexed pages in Google Webmaster Tool (after a couple of days if you had to update you sitemap). If few pages are indexed, go through this checklist at <a href="https://ligatures.net/content/expertise/site-not-indexing-checklist.html" rel="nofollow">https:&#x2F;&#x2F;ligatures.net&#x2F;content&#x2F;expertise&#x2F;site-not-indexing-ch...</a> to fix possible issues. If your pages are still not displayed in search results, then you are likely another victim of a well-known chicken-and-egg problem for content based sites: you need links for ranking and you need ranking to attract links. Yet, most niche are saturated and you are likely crushed by competition. The only efficient way out is to obtain dofollow backlinks from sites&#x2F;blogs which are: i) Not under your control (i.e., a forum profile link is under your control...) ii) Editorially reviewed iii) Have relevant topics to yours iv) Which are already trusted by Google v) Which are popular Other links won&#x27;t make much of a difference.
borrameover 10 years ago
I have the same situation, without manual actions in webmaster tools my website was wiped from search since 5th Dec 2014.<p>It appears if you search domain.com and if you search site:domain.com but if you search just &quot;domain&quot; it doesn&#x27;t appears and the website has been more than 10 years well indexed.<p>I&#x27;m very worried because I can&#x27;t contact google to know what happens because as I said I don&#x27;t have manual penalties to reconsider and I&#x27;m losing my own users that search for the domain.
rfergieover 10 years ago
Have you registered your site with Google webmaster tools (<a href="https://www.google.com/webmasters/tools/" rel="nofollow">https:&#x2F;&#x2F;www.google.com&#x2F;webmasters&#x2F;tools&#x2F;</a>)?<p>That would be a good first step for seeing if Google are having specific trouble with anything on your site
评论 #8873939 未加载
评论 #8873251 未加载
kevinbowmanover 10 years ago
Presumably all of the spam copies are actually harming your rankings as well, as Google will see those as duplicate content.<p>(Note that I&#x27;m guessing here, I have no particular authority in the area)
franzeover 10 years ago
hi, first things first<p>do not use the word &quot;penalty&quot; - you want to show up for a certain query in google, you think you should show up for it, you don&#x27;t show up for it.<p>that is the issue, nothing else.<p>you formulated a hypothesis: you think your site is &quot;penalised by Google&quot;<p>ok, go to google webmaster tools and verify<p><pre><code> - http:&#x2F;&#x2F;blog.mavnn.co.uk&#x2F; - http:&#x2F;&#x2F;mavnn.co.uk&#x2F; - http:&#x2F;&#x2F;www.mavnn.co.uk&#x2F; </code></pre> check the &quot;Site Messages&quot; navigation point of all these domain variations, if you have a penalty, then there will be a message. pro tip: only ever talk about &quot;penality&quot; if you get a message that says you have a &quot;penality&quot;. (everything else is just SEO b#llshit talk)<p>my guess: there won&#x27;t be such a message.<p>ok, the second quess is the wrong canonical, you already fixed that one. but: if you point a canonical to an HTTP 301 redirect, and the redirect points back to the original URL google will basically ignore the canonical. the canonical could have been the issue, but as <a href="https://www.google.com/search?q=site%3Ablog.mavnn.co.uk%2Ftype-providers-from-the-ground-up&amp;pws=0&amp;hl=en" rel="nofollow">https:&#x2F;&#x2F;www.google.com&#x2F;search?q=site%3Ablog.mavnn.co.uk%2Fty...</a> has been indexed (without ending slash) i doubt it.<p>ok, let&#x27;s look at anything that might be unusual about your site<p>i.e.: your start page <a href="http://www.mavnn.co.uk/" rel="nofollow">http:&#x2F;&#x2F;www.mavnn.co.uk&#x2F;</a><p>basically it consists out of a &quot;Hello World&quot; and a link to a broken URL and a link to a piece of duplicated text.<p>&quot;Hello World&quot; is a typical &quot;this server was just set up, nothing to see here&quot; message.<p>your start page is not indexed (see: site:<a href="http://www.mavnn.co.uk/" rel="nofollow">http:&#x2F;&#x2F;www.mavnn.co.uk&#x2F;</a> )<p>that is strange. let&#x27;s formulate a hypothesis.<p>your startpage communicated a basic &quot;this server has just been set up, nothing to see here&quot; message. google has a) no interrest into indexing such websites b) the webmasters are pretty pissed, if their newly set up servers are indexed in this way, as newly set up servers are usually not very secure, yet<p>additional google sees common subdomains i.e. blog.example.com as part of the main site and not as independent webproperties (yeah, they figured that one out quite some time ago).<p>hypothesis: you communicate via your startsite that your website is not yet - probably - set up and that is why it does not send you traffic.<p>my bet is, that this is the case. why? because you startpage is the one thing that is definitely not ... like other websites our there.<p>fix it, two possibilities: <a href="http://www.mavnn.co.uk/" rel="nofollow">http:&#x2F;&#x2F;www.mavnn.co.uk&#x2F;</a> -&gt; HTTP 301 -&gt; <a href="http://blog.mavnn.co.uk/" rel="nofollow">http:&#x2F;&#x2F;blog.mavnn.co.uk&#x2F;</a><p>or you set up a proper startpage, some text what this is, some links to your other ressources.<p>after you have done one of this, do a fetch-as-googlebot (via google webmaster tools) and click the &quot;submit to index&quot; button.<p>wait two days.<p>if not, test another hypothesis or post in the google webmaster forum, actually google guys dig these kind of errors.