TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

PostgreSQL, Memory and the Cloud

211 pointsby bilalhusainalmost 4 years ago

9 comments

aeyesalmost 4 years ago
Wow, the title of this post is very calm compared to what is actually happening.<p>CloudSQL Postgres is running with a misconfigured OS OOM killer, crashes Postmaster randomly even if memory use is below instance spec. GCP closes this bug report as &quot;Won&#x27;t fix&quot;.<p>This is a priority 1 issue. Seeing a wontfix for this has completely destroyed my trust of their judgement. The bug report states that they have been in contact with support since February.<p>Unbelievable attitude towards fixing production critical problems of their platform affecting all customers.
评论 #27797319 未加载
评论 #27796169 未加载
评论 #27795908 未加载
评论 #27797995 未加载
评论 #27798549 未加载
Winsaucereralmost 4 years ago
Are there any good&#x2F;recommended books or resources for someone who wants to learn how to run postgresql well? E.g, what defaults to change and when, settings for the host OS (such as in the parent linked article), overall tips&#x2F;insights&#x2F;recommendations.
评论 #27799136 未加载
评论 #27798859 未加载
thyrsusalmost 4 years ago
Are there recommendations for learning about Linux kernel memory management? Two anecdata:<p>* I had some compute servers that were up for 200 days. The customers noticed that they were half as fast as identical hardware just booted. Dropping the file system cache (&quot;echo 3 | sudo dd of=&#x2F;proc&#x2F;sys&#x2F;vm&#x2F;drop_cache&quot;) brought the speed back up to the newly deployed servers. WTF? File system caches are supposed to be zero cost discards as soon as processes ask for RAM - but something else is going on. I suspect the kernel is behaving badly with overpopulated RAM management data (TLB entries?), but I don&#x27;t know how to measure that.<p>* If that is actually the problem, then a solution might be to decrease data size by using non-zero hugepages (&quot;cat &#x2F;proc&#x2F;sys&#x2F;vm&#x2F;nr_hugepages&quot;). I&#x27;d love to see recommendations on when to use that.
评论 #27795865 未加载
评论 #27796793 未加载
评论 #27799153 未加载
评论 #27797125 未加载
评论 #27795714 未加载
mnahkiesalmost 4 years ago
I recently managed to crash a GCP cloudsql postgres 12 host running an interactive query that was rather heavy (based on error logs OOM).<p>It surprised me because I had never executed a query and caused the whole host to crash up until that point - now I&#x27;m wondering if this misconfiguration is the cause
renewiltordalmost 4 years ago
Interesting. Also a problem with RDS: <a href="https:&#x2F;&#x2F;stackoverflow.com&#x2F;questions&#x2F;52148675&#x2F;aws-rds-with-postgres-is-oom-killer-configured" rel="nofollow">https:&#x2F;&#x2F;stackoverflow.com&#x2F;questions&#x2F;52148675&#x2F;aws-rds-with-po...</a>
zingaralmost 4 years ago
I&#x27;d like to thank the author for their clear, simple explanation. I haven&#x27;t had to think about allocating memory since university and am not practiced thinking about it in my software but now I feel like I have useful ways to think about why processes just disappear sometimes.
shdhalmost 4 years ago
GCP CloudSQL has a lot of issues. There was one with query insights being enabled causing segfaults on `LEFT JOIN` operations. Its since been patched, but really shitty.
yjftsjthsd-halmost 4 years ago
So are there problems with disabling overcommit? Or is it really that simple (at least for dedicated db hosts)?
评论 #27795199 未加载
评论 #27796606 未加载
评论 #27795066 未加载
dkerstenalmost 4 years ago
A metacomment about the page (rather than the content): the text in the white boxes is almost unreadable for me, the contrast is crazy low.
评论 #27796068 未加载