A Git catastrophe cleaned up

274 pointsby asymmetricover 8 years ago

26 comments

mchermover 8 years ago

The author concludes by saying:>I think I've written before that this profusion of solutions is the sign of a well-designed system. The tools and concepts are powerful, and can be combined in many ways to solve many problems that the designers didn't foresee.I disagree. I consider this to be a failure of Git. The set of different options (normal merge, rebase, filter-branch, etc) is complex and not cleanly orthogonal which makes for a very messy "mental model". Even experienced experts would have difficulty finding the clear, simple way to solve this problem and those less experienced would have little chance of proceeding cleanly.I really wish some tool other than Git had "won" the version-control race; I honestly believe Git to be the worst of the contenders in the most recent generation of version control systems (albeit better than the previous generation in important ways).

评论 #13228864 未加载

评论 #13229037 未加载

评论 #13229353 未加载

评论 #13228855 未加载

评论 #13230194 未加载

评论 #13230577 未加载

评论 #13228831 未加载

评论 #13230426 未加载

评论 #13229190 未加载

评论 #13229589 未加载

评论 #13228936 未加载

评论 #13230453 未加载

评论 #13230420 未加载

评论 #13230658 未加载

评论 #13230118 未加载

peffover 8 years ago

The simplest solution is:<pre><code> # try the merge; you'll get conflicts on those files git merge topic # discard the versions from the topic branch; # you know you already merged those changes in # the funny "git checkout commit", so any differences # are due to changes on master. git checkout --ours new-file-{1,18} # now you are free to fix up any real conflicts # and resolve the merge git commit </code></pre> This has the advantage of representing the true history. You had two lines of development (the original topic, and the "squashed" history created for deployment), and the merge shows them coming together and choosing the deployment-side content.

评论 #13232195 未加载

评论 #13230399 未加载

guomanminover 8 years ago

Participate in Atlassian ResearchMy name is Angela and I do research for Atlassian. I’m kicking off a round of discussions with people who use Git tools. Ideally, I’d like to talk to people that sit on a team of 3 or more. If this is you, I would love to talk to you about your experience with <using> Git tools, or just some of the pain points that are keeping you up at night when doing your jobs.We’ll just need 30 mins of your time, and as a token of my thanks to those that participate, I’d like to offer a US$50 Amazon gift voucher.   If you’re interested, just shoot me an email with your availability over the next few weeks and we can set up a time to chat for 30 minutes. Please also include your timezone so we can schedule a suitable time (as I’m located in San Francisco). Hope to talk to you soon!Cheers,   Angela Guo aguo@atlassian.com

msvalkonover 8 years ago

While an unorthodox merge strategy was used, this is what happens when you hole up in a topic branch for a long time. I bet this would've been easier had they merged smaller commits or PR's to master constantly. If one is afraid of deploying unfinished features, don't make them functional until they are ready. Tie them together once finished. Or did I miss something here?

评论 #13229251 未加载

评论 #13228956 未加载

评论 #13233846 未加载

lmmover 8 years ago

> The next day he wanted to go ahead and merge the front-end changes, but he found himself in “a bit of a pickle”. The merge didn't go forward cleanly, perhaps because of other changes that had been made to master in the meantime. And trying to rebase the branch onto the new master was a complete failure. Many of those 406 commits included various edits to the 18 back-end files that no longer made sense now that the finished versions of those files were in the master branch he was trying to rebase onto.Can one not instead merge master into the feature branch?

评论 #13229835 未加载

评论 #13230515 未加载

soft_dev_personover 8 years ago

Why not just revert the offending commit? It would be a valid blip in history as mistake made and corrected.

评论 #13229539 未加载

评论 #13228709 未加载

评论 #13229766 未加载

评论 #13228813 未加载

fpigover 8 years ago

I don't understand the problem here, why didn't he just do a merge and resolve the 18 conflicts by using the version of the file from master?And the problem wasn't in checkout-add-commit, that is a trivial issue, the WTF here is producing 406 new commits in a branch without ever thinking of merging master back into it or rebasing on master, to avoid having a giant merge later.

rurbanover 8 years ago

400 commits not cleanly applying? Not a big deal. I routinely merge 1000-2000 commits and rebase 30 active branches to that also. The solution is git rerere. It stores all the resolved merge resolutions forever, and cp or rb apply cleanly then, without any trouble. Eg <a href="https://medium.com/@porteneuve/fix-conflicts-only-once-with-git-rerere-7d116b2cec67#.8pj73vnex" rel="nofollow">https://medium.com/@porteneuve/fix-conflicts-only-once-with-...</a>

kazinatorover 8 years ago

> X decided to merge and deploy just the back-end changes, and then, once that was done and appeared successful, to merge the remaining front-end changes.> "What should X have done in the first place to avoid the pickle?"0. (Of course, not develop a 406 patch changeset and then have to pick it apart. Make smaller pushes, frequently.)1. Create a topic branch right there at the tip where the 406 changes are locally committed.2. Then use git's interactive rebase to rewrite this branch such that just the back-end commits are picked first, followed by the front end.3. Make a back-end topic branch from the last back-end commit and test that. If it's cool, master can be rebased to that and pushed to origin/master upstream.4. Test remaining front-end changes, rebase master to them, push.Also:3. a) If back-end changes need fixing, fix them on the back-end-topic branch. Then rebase the original topic to the back end topic to pick up these changes "under" it. (I.e. replay the front end over the new back end, and install as new front end).

评论 #13230754 未加载

评论 #13230639 未加载

jdonaldsonover 8 years ago

Reading this was like watching a traffic accident in slow motion. I could hear myself yelling at the author as if he were a student driver:"Use filter-branch!!! Use filter-branch!!! NOOOOOOOO NOT merge union with manual deletes!!!"But... he went and did it anyways. Honestly, reading back through commit logs, you always find the part where the driver runs off the road, plows through a clearly marked gate, runs on a train track for a mile or two, then merges back onto the main street, carrying part of a mailbox and a deer carcass.You can fault git if you want, but it seems like some of these cases just arrive naturally no matter what cvs is used. It would be great to have a "git education" repo that contains situations just like these to work through... sort of a "drivers ed" for managing a repo.

mjdover 8 years ago

The Reddit discussion of this, though brief, was interesting and to the point.<a href="https://www.reddit.com/r/git/comments/5i3mpz/another_git_catastrophe_cleaned_up_story_of_a/" rel="nofollow">https://www.reddit.com/r/git/comments/5i3mpz/another_git_cat...</a>

评论 #13234151 未加载

guard-of-terraover 8 years ago

Two fun things about git: It is deterministic, and it doesn't delete anything (readily).This means you can't really have a catastrophe.Just git reflog your way out.

评论 #13229130 未加载

评论 #13228792 未加载

评论 #13230982 未加载

评论 #13228872 未加载

krupanover 8 years ago

People in the discussion here saying, somewhat blindly it appears to me, that using mercurial would have avoided this mess. I'm a huge mercurial fan and have dealt with some tricky situations similar to this (but never this situation exactly) and I'm not so sure how mercurial would have handled it. The best I can say is that I've never known even the most adventurous mercurial user to use checkout (actually revert in mercurial) on individual files in that way. Is that something git people do more often?Aside from that, it'd be fun to see how mecurial handles this, but I'm not sure from reading the original post if I could exactly reproduce it.Mercurial would let you do the checkout (revert) trick that started it all. I can imagine it causing merge conflicts as described. Mercurial does let you specify how to resolve merge conflicts for the whole merge, or you can tell it not to resolve conflicts at all and then you can run hg resolve on a file-per-file (or glob of files) basis and tell it to pick default (equivalent of master) for the files you want. I didn't quite follow the git way of doing this he described with .gitattributes, but using hg resolve sounds easier (but neither are things a non-expert user of either tool would know).In the end some other solutions were proposed. I would not recommend using checkout (revert in mercurial) either. I don't know of a filter-branch equivalent in mercurial, but that sounds like a cool way to deal with this. In mercurial I probably would have reached for graft (equivalent of git's cherry pick), which isn't very different from git.

SadWebDeveloperover 8 years ago

The problem with Git is that everyone is trying to use Git as central repository rather than distributed as if it were SVN, personally i blame GitHub for promoting among the new developers the wrong tool for the job causing all this unnecessary drama. Git is the best version control system if and only if the project has a good leader checking everyone merges before commiting and letting everyone knows who is working on what and what parts will affect.

cyberpunkover 8 years ago

> But I couldn't think of anything, so I asked Rik Signes. Rik immediately said that X should have used git-filter-branch to separate the 406 commits into two branches, branch A with just the changes to the 18 back-end files and branch B with just the changes to the other files. (The two branches together would have had more than 406 commits, since a commit that changed both back-end and front-end files would be represented in both branches.) Then he would have had no trouble landing branch A on master and, after it was deployed, landing branch B.Well. Okay. That's a technical solution and it'd work, it's probably no less time consuming than going and fixing the code in a new branch and merging cleanly (every time I end up needing to filter branch stuff I have to RTFM on it, and it takes ages) -- this problem is NOT a technical one though; it's a process one.Why are you landing 400 commits in one go? Half of those were on files which then start causing merge conflicts for your team and wasted a huge amount of your time?Use feature flags, fix your conflicts in branch, don't merge anything into master unless it's using the 'merge' button on github/gitlab/gogs/whatever. And really think/discuss/roundtable about how you're introducing features because it sounds like this is running away from you a bit here..It doesn't need to be this complex, and these kinds of messes can't really be put on the tools -- although git certainly makes it easy to set a lot of things on fire..

评论 #13232229 未加载

forrestthewoodsover 8 years ago

I like Perforce. It may not be perfect. But it's idiot proof."Days since gitastrophe" is a common phrase. There is no Perforce equivalent. You can't blow your leg off. There aren't thousands of "Perforce made easy" blog posts because it's actually easy. There are no "fixing my p4 repo" tales because it never breaks.Thanks Perforce.

评论 #13229826 未加载

评论 #13229721 未加载

评论 #13229890 未加载

评论 #13230695 未加载

cousin_itover 8 years ago

In a corporate environment, I think I prefer a simpler workflow with a plain old centralized VCS and without using any branches at all. As code gets written, each commit goes on the trunk behind a feature flag (which you need anyway). That way each commit can benefit from continuous builds and testing, and other people can notice problems early. Branches would only be used for releases.I've worked like that for years on some pretty big projects, and it never caused complicated problems like in the OP. The only caveat is that you need a strong safety net against breaking the trunk (lots of tests, mandatory code review, etc.)

评论 #13233784 未加载

isaac_is_goatover 8 years ago

Why wouldn't they reintegrate the mainline development branch with the new branch if it was so long lived? And/Or have the new code behind a feature flag so you could potentially have it deployed but disabled? So many ways this could have been avoided with some basic forward thinking...

luosover 8 years ago

So if I get this then he just added the files to master, then tried to work on the topic branch?That really seems weird. I don't think it's git's fault. Also he could have done git merge master --accept-theirs if he really wanted some kind of history but I guess it would be worthless.

chetanahujaover 8 years ago

I humbly (re)submit this for your consideration <a href="https://git-man-page-generator.lokaltog.net/" rel="nofollow">https://git-man-page-generator.lokaltog.net/</a>

marcinkuzminskiover 8 years ago

IMHO it's odd approach to the problem. I'd rather ask the author (who knows best in this case) to split this into two separate parts that can be nicely merged.

bowmessageover 8 years ago

Worth noting that cherry-pick takes commit ID ranges in the form xxxxxx..yyyyyy, it may have simplified the driver.Thanks for the tip re: --keep-redundant-commits!

hellofunkover 8 years ago

The more I read stuff like this, the more I wonder how many problems would just go away for so many people if they used Mercurial instead.

评论 #13228947 未加载

评论 #13230387 未加载

评论 #13228948 未加载

gragasover 8 years ago

>and published the changes to `master`-_-

draw_downover 8 years ago

I hate this shit. Rebasing always causes conflicts and dealing with them is such a giant pain. I get that in this case the designer really brought the pain on themselves but I wish using git didn't require this sort of surgery periodically, which in my experience it does.

评论 #13230378 未加载

scarface74over 8 years ago

I think everyone is overlooking the main point.He said that one guy was only making changes to either the front end or back end code.They should have been two separate repos. one for the front end code and one for the back end code.

评论 #13228837 未加载

评论 #13229009 未加载

评论 #13229547 未加载