TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Intern Impact: Brotli compression for Play Store app downloads

313 点作者 abhikandoi2000超过 8 年前

27 条评论

cdnsteve超过 8 年前
NGINX: <a href="https:&#x2F;&#x2F;github.com&#x2F;google&#x2F;ngx_brotli" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;google&#x2F;ngx_brotli</a><p>Apache: <a href="https:&#x2F;&#x2F;lyncd.com&#x2F;2015&#x2F;11&#x2F;brotli-support-apache&#x2F;" rel="nofollow">https:&#x2F;&#x2F;lyncd.com&#x2F;2015&#x2F;11&#x2F;brotli-support-apache&#x2F;</a><p>Node.js: <a href="https:&#x2F;&#x2F;hacks.mozilla.org&#x2F;2015&#x2F;11&#x2F;better-than-gzip-compression-with-brotli&#x2F;" rel="nofollow">https:&#x2F;&#x2F;hacks.mozilla.org&#x2F;2015&#x2F;11&#x2F;better-than-gzip-compressi...</a>
orliesaurus超过 8 年前
So wait, if I understand this article correctly she applied the compression because someone told her to? Or did she research herself and apllied the whole thing? I agree this is a bit like a &quot;we re hiring interns&quot; post
评论 #13581987 未加载
dajohnson89超过 8 年前
The amount of negativity in the comments section here is astounding. How could you not be excited and happy for this promising young woman&#x27;s achievement? No, her work will not put her on the shortlist for a Turing Award. But it is something any engineer should be proud of, and has real impact for millions of users.<p>You have a right to be unimpressed, but if you&#x27;re taking the time to say &quot;So what?&quot; or &quot;This is just a recruiting ad&quot; then you should probably rethink. I never thought I&#x27;d say this, but the negativity here really indicates the kind of latent discrimination that so many URMs &amp; women in tech complain about. I have literally no other explanation for it -- a senior engineer at Google could have implemented this compression and it would still be HN worthy, and nobody would be calling the blog article a fluffy PR piece.
评论 #13581602 未加载
评论 #13581991 未加载
评论 #13581908 未加载
评论 #13581770 未加载
评论 #13582290 未加载
评论 #13581881 未加载
评论 #13581797 未加载
评论 #13581891 未加载
评论 #13581717 未加载
评论 #13582426 未加载
评论 #13581855 未加载
评论 #13581564 未加载
评论 #13582036 未加载
评论 #13581815 未加载
评论 #13582108 未加载
评论 #13581673 未加载
评论 #13582109 未加载
评论 #13582154 未加载
jdcarter超过 8 年前
&gt; her work resulted in saving users an expected 1.5 petabytes (that&#x27;s 1.5 million gigabytes) of data each day.<p>I&#x27;m guessing this is not a measure of data at rest, but data transferred over the network. The couple samples listed on the page ranged from 2.5% improvement to 20.3% (vs. zLib) so I guess they&#x27;re extrapolating that out to all app downloads and updates across the world. Nicely done.<p>More generally, we&#x27;ve seen some great advances in compression lately. I&#x27;ve been using Facebook&#x27;s zStandard [1] for compression in a product I&#x27;m currently working on, and I&#x27;ve been extremely pleased with both its speed and compression ratio. The days of &quot;just use zLib&quot; are coming to a close.<p>[1]: <a href="https:&#x2F;&#x2F;github.com&#x2F;facebook&#x2F;zstd" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;facebook&#x2F;zstd</a>
评论 #13581252 未加载
arenaninja超过 8 年前
Pretty cool that an intern was given this level of confidence. Less data for updating&#x2F;installing applications is good no matter how you slice it
mbesto超过 8 年前
I&#x27;ve worked with a fair number of people that graduated from the Mathematics and Informatics at Babeș-Bolyai University. I&#x27;m generally very impressed by them, and is just another data point of areas of that world that get overlooked.
Syzygies超过 8 年前
Can we get her to work for DropBox? Every time my iPad GoodReader syncs my 1,000+ papers, it has to check every file. It boggles the mind that they don&#x27;t support some version of change records.
评论 #13581140 未加载
评论 #13581852 未加载
bhouston超过 8 年前
I bet switching to LZMA would have saved even more. LZMA beats Brotli nearly every time. zStandard would likely have worked better as well. Brotli is very slow to compress.
评论 #13581627 未加载
评论 #13581569 未加载
评论 #13581597 未加载
评论 #13583058 未加载
okreallywtf超过 8 年前
There isn&#x27;t much information but this reads more like an advertisement for google internships than anything else. Not to denigrate her work, she could very well be brilliant and have gone above and beyond, but from how it reads they could be blowing it up to make it seem like every intern has a huge impact and you could too! Either way good for her, but not sure why this is so high up on HN.
评论 #13581049 未加载
评论 #13581066 未加载
评论 #13581154 未加载
评论 #13581132 未加载
iamleppert超过 8 年前
This compression technique seems to be based on the fact they have previous installation of an app that can be diffed and patched, so it wouldn&#x27;t receive any benefit from first installations, only updates. But still might be worth it for many applications. I remember I investigated a way to send and apply diffs of javascript code (using a js version of patch) and store in the browser using localstorage. However, at the time the performance wasn&#x27;t good enough when compared in an end to end benchmark.<p>However, this has got me wondering as a general corollary for application delivery...would it just make more sense to use something like a well-pruned and compact git repo, and make the connections over HTTP with gzip compression? I&#x27;m not sure how space efficient the git repo is but may seem like an interesting project. I&#x27;m wary of using any Google technology, open source or not if it can be done yourself in an afternoon.<p>Does such a thing even exist?
评论 #13586957 未加载
jknoepfler超过 8 年前
The phrasing makes it clear that this is not intended to wow a tech audience. It&#x27;s a Google ad to parents, or something.
Yuioup超过 8 年前
Mathematicians are the true programmers. I wish I was one.
评论 #13581498 未加载
jedc超过 8 年前
&quot;Google Student Blog&quot; &#x2F;&#x2F; &quot;Google news and updates especially for students&quot;<p>Important context for this blog post and the comments in this thread.
bluedino超过 8 年前
On the other end of the spectrum, how much more energy has been used by the millions of Android phones uncompressing the app, applying the patch, and re-compressing the data?
bricss超过 8 年前
Take a look at LZ5 algorithm -&gt; <a href="https:&#x2F;&#x2F;github.com&#x2F;inikep&#x2F;lz5" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;inikep&#x2F;lz5</a>
sp332超过 8 年前
This page is consistently crashing my Firefox content process. I&#x27;m running 51.0.1 64-bit on Win10. Anyone else having this problem?
评论 #13580991 未加载
评论 #13580899 未加载
评论 #13580878 未加载
jfasi超过 8 年前
I&#x27;ve seen replies about how this is a &quot;simple library swap&quot; and so doesn&#x27;t deserve the attention it recognition it has received. As some who works at Google but not anywhere remotely near this project, but with experience in similar projects, I&#x27;d like to shed some light on why this isn&#x27;t a simple library swap, and seems from far away to have been both a tremendous accomplishment and a wonderful learning experience.<p>First off, there is no such thing as a library swap at Google. Our codebase is quite large. Like shockingly overwhelmingly large. Executing a change like this is almost certainly not a case of &quot;swapping out one configuration line for another.&quot; It requires writing new code, testing it appropriately, updating any integration tests, updating documentation, etc. But the real fun starts when you&#x27;re done coding...<p>There&#x27;s the issue of frontend and backend. Serving Brotli-compressed data is great, but what if you&#x27;re app doesn&#x27;t support it? If you&#x27;re lucky, this will be handled by the underlying network layer but then you have to deal with...<p>Rollout. I don&#x27;t know how many servers are dedicated to app updates, but I imagine it&#x27;s a lot. I also imagine they&#x27;re distributed geographically, across regions and probably even continents. Getting all those servers to support new features is a delicate, time consuming process where any misstep <i>will</i> result in users noticing. It&#x27;s not coding, but that&#x27;s why it&#x27;s called &quot;software engineering&quot; and not &quot;coding engineering.&quot; But then once you&#x27;re servers are all up and running you have to deal with...<p>Versioning. Updating backend servers is bad enough, but at least you control them. What about that zoo of Android versions out in the wild? How do you ensure they all support this changes? Short answer: you don&#x27;t. You design a strategy that will allow the rollout to happen gradually over a period of time, and closely monitor it to make sure nothing unintended is happening.<p>Then how do you turn down the old feature? When do you turn it down? You need to build and properly use instrumentation to determine the safest time to kill off the old feature. Or you could never kill it and commit to paying the cost in perpetuity. That&#x27;s a design decision, and not a trivial one.<p>But, odds are you&#x27;re not the only feature being rolled out. You have to anticipate&#x2F;deal with potential interactions with other features, rollbacks of other people&#x27;s work, etc.<p>I could go on, but I think I&#x27;ve already demonstrated why this is by no means a trivial accomplishment, even for a full time engineer. Add to this the fact that every intern has to race against the clock to get ramped up on their project, making something of this complexity and with this large an impact happen deserves applause.<p>I should add, I&#x27;m speaking as myself here and not representing Google in any way.
MtL超过 8 年前
Makes you wonder how much they&#x27;d save by using Courgette, like the Chrome team does.
mnml_超过 8 年前
thats like 50 million dollars a year (in egress cost)
jordache超过 8 年前
she didn&#x27;t create a compression algorithm.<p>More akin to enabling GZIP in IIS...
评论 #13582444 未加载
jonatron超过 8 年前
1.5M GB = 1.5 PB?
PedroBatista超过 8 年前
Please fix the title, its 1.5 PB, not GB
评论 #13581199 未加载
评论 #13581754 未加载
demonshalo超过 8 年前
So she did this for free? :D please tell me this is a paid internship!
评论 #13580892 未加载
评论 #13580928 未加载
评论 #13580901 未加载
评论 #13580917 未加载
diimdeep超过 8 年前
Click bait.
评论 #13581742 未加载
divbit超过 8 年前
I don&#x27;t know much about gigabytes, but that seems like a lot<p>edit: (I&#x27;m guessing the downvotes are because I phrased it like a meme, but to clarify, this was a genuine compliment in response to a &#x27;look what this person did&#x27; type post- it&#x27;s inspiring stuff)
painted超过 8 年前
So she used a compression algorithm developed by other googlers? So what?<p>Don&#x27;t get me wrong, I&#x27;m sure she did a lot of work for it, but looks like a lot of people would have been able to do that, there is nothing innovative in what she did, right?
评论 #13581305 未加载
评论 #13581151 未加载
评论 #13581928 未加载
评论 #13582106 未加载
评论 #13581474 未加载
评论 #13581624 未加载
4twilight超过 8 年前
Why she doesn&#x27;t wear a Pied Piper t-shirt? However, I&#x27;m more interested if the Erick&#x27;s position is still vacant in the venture? (Jin Yang&#x27;s will work as well for first 2 years, I suppose).