科技回声

8 条评论

jakub_g大约 2 个月前

This is super interesting, as I maintain a 1M commits / 10GB size repo at work, and I'm researching ways to have it cloned by the users faster. Basically for now I do a very similar thing manually, storing a "seed" repo in S3 and having a custom script to fetch from S3 instead of doing `git clone`. (It's faster than cloning from GitHub, as apart from not having to enumerate millions of objects, S3 doesn't throttle the download, while GH seem to throttle at 16MiB/s.)Semi-related: I always wondered but never got time to dig into what exactly are the contents of the exchange between server and client; I sometimes notice that when creating a new branch off main (still talking the 1M commits repo), with just one new tiny commit, the amount of data the client sends is way bigger than I expected (tens of MBs). I always assumed the client somehow established with the server that it has a certain sha, and only uploads missing commit, but it seems it's not exactly the case when creating a new branch.

评论 #43383610 未加载

评论 #43383711 未加载

评论 #43390898 未加载

评论 #43385889 未加载

评论 #43385458 未加载

评论 #43385872 未加载

评论 #43383826 未加载

评论 #43391825 未加载

ks2048大约 2 个月前

How much bandwidth and time is wasted cloning the entire history of large projects when people only need single snapshot in a single branch?According to SO, newer versions of git can do,<pre><code> git init git remote add origin <url> git fetch --depth 1 origin <sha1> git checkout FETCH_HEAD</code></pre>

评论 #43383901 未加载

评论 #43383891 未加载

评论 #43384136 未加载

autarch大约 2 个月前

> This has resulted in a contender for the world's smallest open source patch:Hah, got you beat: <a href="https://github.com/eki3z/mise.el/pull/12/files" rel="nofollow">https://github.com/eki3z/mise.el/pull/12/files</a>It's one ASCII character, so a one-byte patch. I don't think you can get smaller than that.

评论 #43385424 未加载

评论 #43383580 未加载

评论 #43383566 未加载

评论 #43385824 未加载

评论 #43383400 未加载

评论 #43424937 未加载

geenat大约 2 个月前

git needs built in handling of large binary files without a ton of hassle, it's all I ask. It'd make git universally applicable to all projects.mercurial had it for ages.svn had it for ages.perforce had it for ages.just keep the latest binary, or last x versions. Let us purge the rest easily.

评论 #43386667 未加载

评论 #43384404 未加载

评论 #43386436 未加载

robertlagrant大约 2 个月前

Nothing to do with the article, but I appreciate the slightly idiosyncratic GitButler YouTube videos that explain how bits of Git work.

评论 #43387506 未加载

andrewshadura大约 2 个月前

Interestingly, Mercurial had solved the bundles more than ten years ago and back then they already worked better than Git's today

评论 #43383635 未加载

评论 #43383827 未加载

评论 #43389710 未加载

评论 #43383634 未加载

mbac32768大约 2 个月前

One consequence of git clone is that if you have mega repos, it kind of ejects everything else from your cache for no win.You'd actually rather special case full clones and instruct the storage layer to avoid adding to the cache for the clone. But this isn't always possible to do.Git bundles seem like a good way to improve the performance of other requests, since they punt off to a CDN and protect the cache.

jedimastert大约 2 个月前

This actually might solve a massive CI problem we've been having...will report back tomorrow

评论 #43383393 未加载

8 条评论

jakub_g大约 2 个月前

评论 #43383610 未加载

评论 #43383711 未加载

评论 #43390898 未加载

评论 #43385889 未加载

评论 #43385458 未加载

评论 #43385872 未加载

评论 #43383826 未加载

评论 #43391825 未加载

ks2048大约 2 个月前

评论 #43383901 未加载

评论 #43383891 未加载

评论 #43384136 未加载

autarch大约 2 个月前

评论 #43385424 未加载

评论 #43383580 未加载

评论 #43383566 未加载

评论 #43385824 未加载

评论 #43383400 未加载

评论 #43424937 未加载

geenat大约 2 个月前

评论 #43386667 未加载

评论 #43384404 未加载

评论 #43386436 未加载

robertlagrant大约 2 个月前

Nothing to do with the article, but I appreciate the slightly idiosyncratic GitButler YouTube videos that explain how bits of Git work.

评论 #43387506 未加载

andrewshadura大约 2 个月前

Interestingly, Mercurial had solved the bundles more than ten years ago and back then they already worked better than Git's today

评论 #43383635 未加载

评论 #43383827 未加载

评论 #43389710 未加载

评论 #43383634 未加载

mbac32768大约 2 个月前

jedimastert大约 2 个月前

This actually might solve a massive CI problem we've been having...will report back tomorrow

评论 #43383393 未加载

Going down the rabbit hole of Git's new bundle-URI

8 条评论

Going down the rabbit hole of Git's new bundle-URI

8 条评论