
How to do distributed locking

277 points · by martinkl · over 9 years ago

15 comments

antirez · over 9 years ago
Thanks to Martin for analyzing Redlock; I had been looking forward to an analysis. I don't agree with the two main arguments of the analysis. TL;DR: I think the unique random token of Redlock is enough for check-and-set when the lock holder's work materializes into a database write (note also that Martin's proposed solution requires a linearizable database in order to check token_A_ID > token_B_ID). I also think the system model is very real-world: AFAIK the algorithm does not depend on bounded network delays (I think there is an issue in the analysis; details in my blog post so other people can evaluate), and the assumption that different processes can count relative time, without any absolute clock, with a bounded error (like 10%) is absolutely credible. However, I'm analyzing the analysis in depth right now and writing a blog post with the details, to be posted tomorrow.

Also note that usually when you need a distributed lock, it's because you have no sane way to handle race conditions; otherwise you could start with some weaker and/or optimistic form of locking.
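A minimal sketch of the check-and-set idea being debated, assuming a hypothetical `resources` table with a `lock_token` column and using SQLite as a stand-in for a linearizable store; this is an illustration, not antirez's or Martin's actual scheme:

```python
import sqlite3
import uuid

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE resources (id TEXT PRIMARY KEY, value TEXT, lock_token TEXT)")
conn.execute("INSERT INTO resources VALUES ('res-1', 'old', NULL)")

token = uuid.uuid4().hex  # the unique random token handed out with the lock

# Granting the lock records the token against the resource.
conn.execute("UPDATE resources SET lock_token = ? WHERE id = 'res-1'", (token,))

# The guarded write: a conditional update that becomes a no-op if another
# client has since acquired the lock and overwritten the token.
cur = conn.execute(
    "UPDATE resources SET value = ? WHERE id = 'res-1' AND lock_token = ?",
    ("new", token),
)
if cur.rowcount == 0:
    print("lock was lost; write rejected")  # retry or abort
conn.commit()
```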
dmuth · over 9 years ago
It's a long read, but the gist of this article is "use the right tool for the job". Redis is really good for some things, but in its current implementation, distributed locking is not one of them.

Along with the author of this blog post, I would recommend Zookeeper if you need to obtain a lock in a distributed environment. You can read an analysis of how Zookeeper performs under an (intentionally) partitioned network here:

https://aphyr.com/posts/291-jepsen-zookeeper
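For reference, acquiring a distributed lock via Zookeeper is a few lines with the kazoo client library's Lock recipe; the host and lock path below are placeholders:

```python
from kazoo.client import KazooClient

zk = KazooClient(hosts="127.0.0.1:2181")
zk.start()

lock = zk.Lock("/locks/my-resource", "client-1")  # identifier aids debugging
with lock:  # blocks until the lock is acquired
    pass    # do the work that must not run concurrently

zk.stop()
```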
cryptica · over 9 years ago
If you want a truly parallel system, you cannot share resources between processes. If you need a lock, that implies sharing of resources, which implies limited scalability according to Amdahl's law.

To achieve unlimited scalability, you need to make sure that no single data store is shared between all processes and that the amount of interprocess communication (on a per-process basis) doesn't increase as you add more processes.

The more sharing you have, the less you can scale your system.
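The ceiling this comment invokes can be made concrete: Amdahl's law gives speedup 1 / ((1 - p) + p/n) for parallel fraction p over n processes, so even a small serialized fraction (say, contention on a shared lock) caps the speedup no matter how many processes you add. A quick illustration:

```python
def amdahl_speedup(parallel_fraction: float, n_processes: int) -> float:
    serial = 1.0 - parallel_fraction
    return 1.0 / (serial + parallel_fraction / n_processes)

for n in (10, 100, 1000):
    # With 5% of the work serialized behind shared state, the speedup
    # saturates near 1 / 0.05 = 20x regardless of process count.
    print(n, round(amdahl_speedup(0.95, n), 1))
```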
erentz · over 9 years ago
"If you still don't believe me about process pauses, then consider instead that the file-writing request may get delayed in the network before reaching the storage service. Packet networks such as Ethernet and IP may delay packets arbitrarily, and they do [7]: in a famous incident at GitHub, packets were delayed in the network for approximately 90 seconds [8]. This means that an application process may send a write request, and it may reach the storage server a minute later when the lease has already expired."

This statement confused me. It seems to say that the packets were delayed in the network for 90 seconds before being delivered. From reading the original sources, it actually sounds like packets were discarded by the switches, so the original requests were discarded and the nodes were partitioned for 90 seconds. When the partition was removed, both nodes thought they were the leader and simultaneously requested the other to shut down. Can anyone confirm? Keeping packets delayed in a network for 90 seconds would seem quite difficult (though not impossible, assuming certain bugs).

Edit: On re-reading, I think this is just talking about the network stack in general, not the network itself. A temporary partition may delay delivery of your request until max TCP retries is exceeded on your host; if the partition is recovered before then, your request may arrive later than you intended.
amelius · over 9 years ago
Why not just use the database to handle the "locking" for you? For example, to ensure that an email with ID=123 gets sent only once, check whether "email 123 sent" is already in the database; if not, commit it to the database, wait for the transaction to be committed, and send the email.

Edit: Why is this downvoted? It is a serious question.
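A minimal sketch of this suggestion, using a unique key as the "lock" so that insert and send decision are tied together; the table name and `send_email` helper are hypothetical, and note the article's caveat still applies (crash between commit and send gives at-most-once; sending before commit gives at-least-once):

```python
import sqlite3

conn = sqlite3.connect("outbox.db")
conn.execute("CREATE TABLE IF NOT EXISTS sent_emails (email_id TEXT PRIMARY KEY)")

def send_once(email_id: str) -> None:
    # INSERT OR IGNORE is a no-op if the primary key already exists, so
    # exactly one worker "wins" the claim for a given email_id.
    cur = conn.execute(
        "INSERT OR IGNORE INTO sent_emails (email_id) VALUES (?)", (email_id,)
    )
    conn.commit()
    if cur.rowcount == 1:
        send_email(email_id)  # hypothetical helper that actually sends
    # rowcount == 0 means another worker already claimed this email
```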
sdab · over 9 years ago
"When used as a failure detector, timeouts are just a guess that something is wrong. (If they could, distributed algorithms would do without clocks entirely, but then consensus becomes impossible [10].)"

Having just re-read Lynch's paper, can you explain what you mean here? I didn't see anything explicitly relying on time. It could be there is some implicit usage I didn't see. Additionally, the paper's impossibility result is about "perfectly correct consensus", which applies with and without clocks, and it then has a positive result for "partially correct consensus" (i.e. not deciding a value is a correct result). I'm not sure which you mean when you say "consensus becomes impossible", as it is either already impossible (the perfectly correct protocol) with one faulty process or (to my understanding) not dependent on time (the partially correct protocol).

p.s. great article!
spo81rty · over 9 years ago
For those interested in distributed locking with Redis and C#, check out this blog post we did, which also links to a GitHub project. We use this very heavily. Hope it helps someone.

http://stackify.com/distributed-method-mutexing/
bogdando · over 9 years ago
> In a reasonably well-behaved datacenter environment, the timing assumptions will be satisfied most of the time – this is known as a partially synchronous system [12]. But is that good enough?

Please consider this answer [0] (which I personally understood as: yes, real systems are always partially synchronous): "Asynchronous systems have no fixed upper bounds. In practice, systems tend to exhibit partial synchrony, which is described as one of two models by Dwork and Lynch in Consensus in the Presence of Partial Synchrony. [1]"

[0] http://bravenewgeek.com/category/distributed-systems-2/

[1] http://groups.csail.mit.edu/tds/papers/Lynch/jacm88.pdf
praneshp · over 9 years ago
Interesting document a colleague of mine wrote: https://specs.openstack.org/openstack/openstack-specs/specs/chronicles-of-a-dlm.html
brightball · over 9 years ago
Not directly related, but in a sublink within that piece I found an interesting read about two-phase commits in Postgres.

https://aphyr.com/posts/282-call-me-maybe-postgres
mattiemass · over 9 years ago
Interesting stuff. In all my distributed systems work so far, I've assumed that a distributed lock is a thing to avoid. I really should take another look at them, just as a tool to have at my disposal.
EGreg · over 9 years ago
Why would you ever want to do distributed locking when you can do MVCC via, e.g., vector clocks, possibly with cryptographic signatures?

If you want to acquire some shared resource, then elect an owner for that resource.
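For readers unfamiliar with the MVCC idea mentioned here, a minimal vector-clock sketch (an illustration, not a full MVCC system): each process increments its own entry on a write, and two versions conflict exactly when neither clock dominates the other.

```python
def increment(clock: dict, process: str) -> dict:
    out = dict(clock)
    out[process] = out.get(process, 0) + 1
    return out

def dominates(a: dict, b: dict) -> bool:
    """True if version a has seen everything version b has."""
    return all(a.get(p, 0) >= n for p, n in b.items())

v1 = increment({}, "A")      # {'A': 1}
v2 = increment(v1, "B")      # causally after v1
fork = increment(v1, "C")    # concurrent with v2
assert dominates(v2, v1)
assert not dominates(v2, fork) and not dominates(fork, v2)  # conflict to resolve
```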
waxjar · over 9 years ago
Instead of the fencing token, could the scenario in the blog post be prevented using a good hashing function?

When you perform a write, instead of the token, send a hash of the object previously read. The storage can then compare this against a hash of the resource's current state. If it doesn't match, the lock expired and the write is not accepted.

This would reduce the state to keep track of to the resources themselves.
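A sketch of the proposed check, with a toy in-memory store standing in for the storage service: the writer sends the hash of the version it read, and the store rejects the write if the current state hashes differently. One caveat: if a resource can return to a byte-identical earlier state, the hash check cannot distinguish it (the classic ABA problem), which monotonically increasing fencing tokens avoid.

```python
import hashlib

class Store:
    def __init__(self, value: bytes):
        self.value = value

    def write(self, new_value: bytes, expected_hash: str) -> bool:
        # Reject if the resource changed since the client read it.
        if hashlib.sha256(self.value).hexdigest() != expected_hash:
            return False
        self.value = new_value
        return True

store = Store(b"v1")
read_hash = hashlib.sha256(b"v1").hexdigest()
assert store.write(b"v2", read_hash)      # hash matches; accepted
assert not store.write(b"v3", read_hash)  # stale hash; rejected
```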
hardwaresofton · over 9 years ago
Has anyone used both Zookeeper and etcd in production for management of distributed state?

Generally when I think of this problem I reach for etcd first, not Zookeeper, in the hopes of it being lighter (with a relatively vague definition of "light") and easier to use.
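For comparison with the kazoo example earlier in the thread, the etcd route looks similar with the Python `etcd3` client's lock primitive; the host, port, and lock name below are placeholders:

```python
import etcd3

client = etcd3.client(host="127.0.0.1", port=2379)

# The ttl is a lease: the lock expires if this client dies while holding it.
with client.lock("my-resource", ttl=10):  # blocks until acquired
    pass  # critical section
```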
kang · over 9 years ago
> If they could, distributed algorithms would do without clocks entirely, but then consensus becomes impossible

Consensus can be achieved without clocks using blockchains. Instead of time-locking the resource, a client can broadcast its altered version to the network after making changes. The next client can then start working on top of this changed resource, but since multiple versions might be floating around (due to timing issues in the first place), the longest tree wins. So if a client submits work done on a shorter tree, it is rejected by the network.

This has other issues (it takes longer, and there are reorganization risks), but it does away with clocks altogether by providing a different method of timestamping.

This is similar to the proof-of-work timestamp server that bitcoin uses, but we can do away with proof-of-work because the resource and membership in the network are centralised.