How long will a 64 bit Transaction-ID last in PostgreSQL?

101 points by vinayan3 over 6 years ago

8 comments

anarazel over 6 years ago
There's no chance we go for 64bit transaction ids on the tuples themselves - the space increase would be far too big. The overhead of tuple headers is already a problem, and xmin/xmax are a significant portion of that.

There were patches, however, that kept an 'epoch' (the upper 32 bits of a 64bit transaction id) at the page level, plus some rewrite logic for when transactions that are too far apart to be represented as an index from a base epoch are about to be present on one page. That would effectively allow 64bit xids.

The in-development zheap storage engine does something roughly akin to that, removing the need to perform freezing when a table becomes older than ~2^31 - safety-window transactions.

The transaction id the system keeps internally is effectively already tracked in a 64bit manner, albeit somewhat over-complicatedly, by keeping an epoch separately (there's a patch likely to land in the next version to just use 64bit there). That's why you can see e.g. txid_current() return 64bit transaction ids.
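A minimal illustration of that last point, assuming a reasonably recent stock PostgreSQL (the actual values will of course differ per cluster):

```sql
-- txid_current() exposes the internal 64-bit counter: the epoch in the upper
-- 32 bits, and the 32-bit xid that tuples actually store in the lower 32 bits.
SELECT txid_current()              AS full_64bit_xid,
       txid_current() >> 32        AS epoch,   -- upper 32 bits
       txid_current() & 4294967295 AS xid32;   -- lower 32 bits, what ends up in xmin/xmax
```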
amarshall over 6 years ago
This seems to be (with coincidental timing) the cause of Mandrill's current outage [1]:

> Mandrill uses a sharded Postgres setup as one of our main datastores. On Sunday, February 3, at 10:30pm EST, 1 of our 5 physical Postgres instances saw a significant spike in writes. The spike in writes triggered a Transaction ID Wraparound issue. When this occurs, database activity is completely halted. The database sets itself in read-only mode until offline maintenance (known as vacuuming) can occur.

> The database is large—running the vacuum process takes a significant amount of time and resources, and there's no clear way to track progress.

[1] https://news.ycombinator.com/item?id=19084525
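The standard way to see this coming is to watch how old each database's oldest unfrozen xid is getting; a minimal monitoring query against the system catalogs (a sketch using stock PostgreSQL columns, nothing Mandrill-specific):

```sql
-- How old each database's oldest unfrozen xid is, relative to the ~2^31 wraparound limit.
SELECT datname,
       age(datfrozenxid) AS oldest_unfrozen_xid_age,
       round(100.0 * age(datfrozenxid) / 2147483647, 1) AS pct_of_wraparound_limit
FROM pg_database
ORDER BY age(datfrozenxid) DESC;
```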
throwawaymath over 6 years ago
The author talks a bit about the architecture of PostgreSQL transactions, touching on lazy transaction ID consumption and vacuuming. Notably, writes require IDs but reads do not, so this is focused on write-heavy workloads.

If you want the basic tl;dr that answers the headline: these IDs will last so long it's almost not worth quantifying. This is an obvious calculation even if you assume ostentatious performance requirements three orders of magnitude greater than the author's:

    2^64 / (86,400 * 1,000,000,000) = 213,503.9

The author uses 1,000,000 writes/second; I prefer 1,000,000,000 since it's more ridiculous. There are 86,400 seconds in a day. It will take you the better part of a millennium to exhaust those IDs, assuming you consume an average of one billion every single second.

The author didn't talk about collisions, but those are worth mentioning because you could even confidently assign these IDs randomly instead of incrementally. Since a collision will occur (in expectation) after 2^63 transactions, you shouldn't even have to worry about a single one occurring (on average) for almost 300 years.

Of course, using 64-bit IDs comes with a nontrivial space increase: the xmin/xmax fields in every tuple header would double in size.

EDIT: Original collision estimate is wrong, see corrections. I took (2^n)/2 = 2^(n-1) as the birthday bound instead of 2^(n/2).
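A quick sanity check of the arithmetic above (my own back-of-the-envelope figures, not part of the original comment), including the corrected birthday bound of 2^(64/2) = 2^32 from the EDIT:

```sql
-- Sequential exhaustion at 1e9 xids/second, and the expected number of random
-- 64-bit ids drawn before the first collision (~2^32, per the corrected birthday bound).
SELECT round((2::numeric ^ 64) / (86400 * 1e9), 1)       AS days_at_1e9_per_sec,   -- ~213,504 days
       round((2::numeric ^ 64) / (86400 * 1e9) / 365, 1) AS years_at_1e9_per_sec,  -- ~585 years
       2::numeric ^ 32                                   AS birthday_bound_draws;  -- ~4.3e9
```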
aaronbwebber over 6 years ago
I highly recommend using flexible-freeze if you run Postgres in production - it does not take very much effort to set up and will almost certainly help you avoid issues with txid wraparound:

https://github.com/pgexperts/flexible-freeze

It just runs `VACUUM FREEZE` when you schedule it (usually daily), starting with the tables closest to hitting a wraparound.
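In spirit, a scheduled freeze job like that boils down to finding the tables with the oldest relfrozenxid and freezing them during a quiet window. A rough manual equivalent (a sketch, not the tool's actual code; the table name is a placeholder):

```sql
-- Tables closest to wraparound, i.e. the ones a nightly freeze job should hit first.
SELECT c.oid::regclass     AS table_name,
       age(c.relfrozenxid) AS xid_age
FROM pg_class c
WHERE c.relkind = 'r'
ORDER BY age(c.relfrozenxid) DESC
LIMIT 10;

-- Then, during the maintenance window, freeze the worst offenders.
VACUUM (FREEZE, VERBOSE) some_big_table;
```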
kostaw over 6 years ago
Just imagine running `VACUUM` on a table that has written 1M rows/second for 300 years, and now you need to vacuum quickly because the transaction ids will wrap around next year...
Thorrez over 6 years ago
What if we interpret Moore's law to say that transaction speed will double every 2 years?
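Taking the question at face value (my own arithmetic, not from the thread): start at the article's 1M xids/second and double every 2 years (~6.31e7 seconds per period at the base rate). Period k then consumes roughly 1e6 * 6.31e7 * 2^k ids, so n periods consume about 1e6 * 6.31e7 * (2^n - 1). Setting that equal to 2^64 and converting n periods to 2n years:

```sql
-- Years until 2^64 xids are consumed if the write rate doubles every 2 years,
-- starting from 1e6 xids/second (one 2-year period ≈ 6.31e7 seconds at the base rate).
SELECT 2 * log(2, (2::numeric ^ 64) / (1e6 * 6.31e7) + 1) AS years_to_exhaustion;  -- ≈ 36
```

Exponential growth shrinks the answer from centuries to a few decades, but it still outlives any single deployment by a wide margin.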
hyperman1 over 6 years ago
What I don't understand is why this only affects Postgres. How do DB2/MSSQL/Oracle handle MVCC? Is their approach superior, or is it a case of trade-offs? Assuming the answer is publicly available.
xurukefi over 6 years ago
I guess people had similar arguments when designing IPv4.