科技回声

5 comments

xmcqdpt2, over 3 years ago

While I agree that people generally think of version control (and git) in terms of diff and patch, I think it's important to note that git itself doesn't have diffs and patches. The fact that git doesn't store diffs is one of its defining characteristics.

https://git-scm.com/book/en/v2/Getting-Started-What-is-Git%3F

It makes git significantly faster (because it doesn't apply a sequence of patches when checking out) and much more robust than a patch-and-diff system.

I think it's an important point here because it plays a bit of a counterpoint to the OP's argument. In fact, the git storage system is very simple precisely because it's content-addressed only and has no notion of history or difference.

https://git-scm.com/book/en/v2/Git-Internals-Git-Objects

In the article, the author mentions Alice and Bob sharing only differences in JSON as a way to simplify changes. If anything, the lesson from git's success should be that the author's approach (the diff/patch way of Subversion etc.) is exactly the wrong one. Communicating only with full, valid JSON documents and showing diffs only as a convenience to the user would be the actual "git way".
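To make the snapshot-versus-patch distinction in this comment concrete, here is a minimal Python sketch. It is not git's actual implementation. A content-addressed store keeps full JSON snapshots keyed by hash, and a diff is derived only on demand for display. The hashing follows git's documented blob scheme (SHA-1 over `blob <size>\0<content>`); `SnapshotStore`, `put`, `get`, and `show_diff` are hypothetical names chosen for illustration.

```python
import difflib
import hashlib
import json


def git_blob_hash(content: bytes) -> str:
    """Hash content the way git hashes a blob: SHA-1 over a header plus the bytes."""
    header = b"blob " + str(len(content)).encode() + b"\0"
    return hashlib.sha1(header + content).hexdigest()


class SnapshotStore:
    """Hypothetical content-addressed store: full documents in, hashes out."""

    def __init__(self):
        self._objects = {}

    def put(self, doc) -> str:
        # Canonical serialization so that equal documents get equal keys.
        data = json.dumps(doc, sort_keys=True, indent=2).encode()
        key = git_blob_hash(data)
        self._objects[key] = data
        return key

    def get(self, key: str):
        return json.loads(self._objects[key])

    def show_diff(self, old_key: str, new_key: str) -> str:
        # The diff is computed from two full snapshots; it is never stored.
        old = self._objects[old_key].decode().splitlines()
        new = self._objects[new_key].decode().splitlines()
        return "\n".join(difflib.unified_diff(old, new, "old", "new", lineterm=""))


store = SnapshotStore()
v1 = store.put({"name": "Alice", "role": "admin"})
v2 = store.put({"name": "Alice", "role": "editor"})
print(store.show_diff(v1, v2))
```

Reading a version back is a single lookup by key, with no chain of patches to replay, which is the speed and robustness argument made above.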

bradjonesca, over 3 years ago

The ability to accommodate CRDTs in the architecture (in the future) is a game changer.

jitl, over 3 years ago

The article discusses the shortcoming of CRDTs: sometimes you do want a conflict that a human can resolve, instead of an algorithm's best guess.

> This conflict can be surfaced to Alice, and Bob can be allowed to go about his business. Could this particular problem be resolved in a purely automatic way with a CRDT? Definitely, but it probably will not result in what you want. Last first will work of course, but then which is more right might need human review, and even worse it might result in both results being interleaved (a likely outcome!).

The article goes on to suggest that with a system of sharing patches, we can synchronize our distributed data stores with more precise semantics, even if we do need human intervention on conflicts sometimes. Part of this is agreeing on patch order:

> We can stack either patch in any order without difficulty. Perhaps we ask Bob and Alice to agree on the application order (using pull / push as is done with git). But maybe we just allow them to apply when they arrive. The answer depends on the workflow.

Do you know what a system that works like pull->rebase->push sounds like to me? If you squint a little? This sounds like operational transforms [1]. Especially if you are considering multiple different patching semantics:

> Which of these you want, however, requires semantic direction of the diff algorithm. While lots of structured diff problems will be solved by the simplest algorithm, ultimately we need to have a schema that helps to direct the meaning of our diffs. String fields might be best line-based, word-based, or perhaps they must always be atomic (as with identifiers).

Each patching semantic is a different type of Operation. Rebasing your local changes before sending your pending patches is the Transform. The main advantage of OT systems over CRDTs is that OT also allows for conflicts and human-in-the-loop conflict resolution.

@josephg built a JSON Operational Transform library [2] that has interesting operations like Move (something diff/patch really struggles with) as well as conflict resolution.

The thing I like about the OT model is that it's pretty easy to nest other approaches inside OT. Want to express 5 different patch semantics? Make an operation type for each. Want to support CRDT as well? Sure, make an operation type called CRDTUpdate that contains whatever delta data the underlying CRDT system would send.

No matter what strategy you pick, remember to fuzz test your distributed sync system for convergence.

[1]: https://en.m.wikipedia.org/wiki/Operational_transformation
[2]: https://github.com/ottypes/json1
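The Operation/Transform split and the fuzz-testing advice can be sketched in a few lines of Python. This is a toy model under stated assumptions, not the json1 API: the operation types `SetField` and `Increment`, the `Conflict` exception, and the `transform` and `fuzz_convergence` functions are all hypothetical names. The model covers only flat documents with integer fields and raises a conflict for human review whenever two concurrent operations on the same field do not commute.

```python
import random
from dataclasses import dataclass


class Conflict(Exception):
    """Raised when two concurrent operations need a human decision."""


@dataclass(frozen=True)
class SetField:
    key: str
    value: int

    def apply(self, doc):
        return {**doc, self.key: self.value}


@dataclass(frozen=True)
class Increment:
    key: str
    delta: int

    def apply(self, doc):
        return {**doc, self.key: doc.get(self.key, 0) + self.delta}


def transform(local, remote):
    """Rebase `local` so it can be applied after `remote` (the Transform step)."""
    if local.key != remote.key:
        return local  # different fields: the operations commute
    if isinstance(local, Increment) and isinstance(remote, Increment):
        return local  # additions commute
    if local == remote:
        return local  # identical writes: nothing to resolve
    raise Conflict(f"{local} vs {remote}")


def fuzz_convergence(rounds=1000, seed=0):
    """Check that both application orders converge whenever no conflict is raised."""
    rng = random.Random(seed)
    keys = ["a", "b"]
    conflicts = 0
    for _ in range(rounds):
        doc = {k: rng.randint(0, 3) for k in keys}
        alice, bob = (
            rng.choice([SetField, Increment])(rng.choice(keys), rng.randint(0, 3))
            for _ in range(2)
        )
        try:
            alice_rebased = transform(alice, bob)
            bob_rebased = transform(bob, alice)
        except Conflict:
            conflicts += 1  # would be surfaced to a human instead of guessed
            continue
        left = alice_rebased.apply(bob.apply(doc))
        right = bob_rebased.apply(alice.apply(doc))
        assert left == right, (doc, alice, bob)
    return conflicts


print(fuzz_convergence(), "conflicts surfaced for human review")
```

A real OT type would also rewrite paths and indices during the transform. The point here is only that each patch semantic becomes its own operation type and that convergence is checked by random testing.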

littlestymaar, over 3 years ago

> web3

oh no.

ninja_daro_yco, over 3 years ago

We already have JSON Patch (http://jsonpatch.com/), and we can send diffs as a JSON Patch document, so I don't understand the purpose of this proposal.
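For readers who have not seen one, a JSON Patch document (RFC 6902) is a list of operations addressed by JSON Pointer paths. The Python sketch below hand-rolls a small subset (`add`, `remove`, `replace`) to show the shape of such a document. It is not a complete or validated implementation, and `apply_patch` and `_resolve` are hypothetical helper names rather than any library's API.

```python
import copy


def _resolve(doc, pointer):
    """Walk a JSON Pointer (RFC 6901) and return (parent container, final token)."""
    if not pointer.startswith("/"):
        raise ValueError("this sketch only handles non-root pointers")
    # Unescape ~1 before ~0, as RFC 6901 requires.
    tokens = [t.replace("~1", "/").replace("~0", "~") for t in pointer[1:].split("/")]
    parent = doc
    for token in tokens[:-1]:
        parent = parent[int(token)] if isinstance(parent, list) else parent[token]
    return parent, tokens[-1]


def apply_patch(doc, patch):
    """Apply a subset of JSON Patch (add, remove, replace) to a copy of `doc`."""
    result = copy.deepcopy(doc)
    for op in patch:
        parent, token = _resolve(result, op["path"])
        kind = op["op"]
        if isinstance(parent, list):
            # "-" addresses the position just past the end of the array.
            index = len(parent) if token == "-" else int(token)
            if kind == "add":
                parent.insert(index, op["value"])
            elif kind == "remove":
                del parent[index]
            elif kind == "replace":
                parent[index] = op["value"]
            else:
                raise ValueError(f"unsupported op: {kind}")
        else:
            if kind == "add":
                parent[token] = op["value"]
            elif kind == "remove":
                del parent[token]
            elif kind == "replace":
                if token not in parent:
                    raise KeyError(token)  # replace requires an existing target
                parent[token] = op["value"]
            else:
                raise ValueError(f"unsupported op: {kind}")
    return result


doc = {"name": "Alice", "tags": ["a", "b"]}
patch = [
    {"op": "replace", "path": "/name", "value": "Bob"},
    {"op": "add", "path": "/tags/-", "value": "c"},
    {"op": "remove", "path": "/tags/0"},
]
print(apply_patch(doc, patch))
```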

What's the Difference: JSON diff and patch
