Pretty cool, but a number of the questions are totally unknowable.<p>For instance, the question about web requests to Google: depending on your internet connection, you've got more than an order of magnitude difference in the outcome.<p>In the question about SSD performance, the only hint we have is that the computer has "an SSD", but a modern PCIe SSD like the one in the new MacBook Pro is over 10 times faster than the SSDs we got just 5 years ago.<p>The question about JSON/msgpack parsing is really just about the implementation. Is the Python msgpack library pure Python, or is the work of the entire unpackb() call done in C?<p>The bcrypt question depends entirely on the number of rounds. The default happens to be 12. Had the default been 4, the answer would have been 1000 hashes a second instead of 3. Is the Python md5 library written in C? If so, the program is indistinguishable from piping data to md5sum from bash. Otherwise it's going to be at least an order of magnitude slower.<p>So I liked these exercises, but I liked the C questions best, because there you can look at the code and figure out how much work the CPU/disk is doing. Questions that can be reduced to "what language is this Python library written in?" aren't as insightful.
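To see how much the cost factor dominates, here's a minimal sketch (assuming the bcrypt PyPI package; the cost is exponential, so each extra round doubles the work):<p><pre><code>import time
import bcrypt

# The cost parameter is exponential: rounds=12 does 2**12
# key-expansion iterations, rounds=4 only 2**4.
for rounds in (4, 12):
    salt = bcrypt.gensalt(rounds=rounds)
    start = time.perf_counter()
    n = 0
    while time.perf_counter() - start < 1.0:
        bcrypt.hashpw(b"hello", salt)
        n += 1
    print("rounds=%d: ~%d hashes/second" % (rounds, n))</code></pre>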
Yes, modern computers are fast. How fast?<p>The speed of light is about 300,000 km/s. That translates to roughly 1 ns per foot (yeah, I mix up my units... I'm Canadian...)<p>THUS, a computer with a clock speed of 2 GHz will be able to execute, on a single core/thread, about 4 (four !) single-clock instructions between the moment photons leave your screen, and the moment they arrive into your eye 2 feet (roughly) later.<p>_That_ should give you an idea of how fast modern computers really are.<p>And I _still_ wait quite a bit when starting up Microsoft Word.
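If you want to check that arithmetic, a quick back-of-the-envelope in Python (the 2 GHz clock and 2-foot viewing distance are the parent's numbers):<p><pre><code>c = 299_792_458            # speed of light, m/s
distance = 2 * 0.3048      # 2 feet, in metres
travel_time = distance / c # ~2 ns for the photons to reach your eye
clock_hz = 2e9             # 2 GHz
print(travel_time * clock_hz)  # ~4 cycles, i.e. ~4 single-clock instructions</code></pre>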
If, like me, you spend most of your time in high-level, garbage collected "scripting" languages, it's really worth spending a little time writing a few simple C applications from scratch. It is <i>astonishing</i> how fast a computer is without the overhead most modern languages bring in.<p>That overhead adds tons of value, certainly. I still use higher level languages most of the time. But it's useful to have a sense of how fast you <i>could</i> make some computation go if you really needed to.
Alternatively, this could be titled "do you know how much your computer <i>could</i> do in a second but isn't because of bad design choices, overengineered bloated systems, and dogmatic adherence to the 'premature optimisation' myth?"<p>Computers are fast, but not if all that speed is wasted.<p>A recent related article: <a href="https://news.ycombinator.com/item?id=13940014" rel="nofollow">https://news.ycombinator.com/item?id=13940014</a>
Be careful what conclusions you attempt to draw from examples when you aren't sure what exactly is happening. These examples are actually very wrong and misleading.<p>Take, for example, the first code snippet about how many loops you can run in 1 second. The OP fails to realize that since the loop isn't producing anything that is actually used, the compiler is free to optimize it out. You can see that that's exactly what it does here: <a href="https://godbolt.org/g/NWa5yZ" rel="nofollow">https://godbolt.org/g/NWa5yZ</a> All it does is call strtol and then exit. It isn't even running a loop.
More impressively, sum.c could likely go an order of magnitude or so faster when optimized.<p>> Friends who do high performance networking say it's possible to get network roundtrips of 250ns (!!!),<p>Well, stuff like InfiniBand is less a network and more like a bus (e.g. RDMA, atomic ops like fetch-and-add or CAS).<p>> write_to_memory.py<p>Is also interesting, because this is dominated by inefficiencies in the API and implementation, and not actually limited by the memory subsystem.<p>> msgpack_parse.py<p>Again, a large chunk goes into inefficiencies, not so much the actual work. This is a common pattern in highly abstracted software. msgpack-c mostly works at >200 MB/s or so (obviously a lot faster if you have lots of RAWs or STRs and little structure). Funnily enough, if you link against it and traverse stuff, a lot of time is spent doing traversals and not the actual unpacking (in some analyses I've seen a ~1/3 - 2/3 split). So the cost of abstraction also bites here.<p>If you toy around with ZeroMQ, you can see that you'll be able to send around 3 million msg/s between threads (PUSH/PULL) from C or C++, around 300k using pyzmq (this factor of 10 is sometimes called "interpreter tax"), but only around 7000 or so if you try to send Python objects using send_pyobj (which uses Pickle). That's a factor of 430.
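Here's a rough sketch of the pyzmq side of that measurement (assuming the pyzmq package; the inproc endpoint name and message count are mine — swap send(b"x") for send_pyobj(...) to watch the Pickle cliff):<p><pre><code>import threading
import time
import zmq

N = 100_000
ctx = zmq.Context()

def consumer():
    pull = ctx.socket(zmq.PULL)
    pull.bind("inproc://bench")  # inproc: bind before connect
    for _ in range(N):
        pull.recv()
    pull.close()

t = threading.Thread(target=consumer)
t.start()
time.sleep(0.1)  # give the PULL socket time to bind

push = ctx.socket(zmq.PUSH)
push.connect("inproc://bench")
start = time.perf_counter()
for _ in range(N):
    push.send(b"x")
t.join()
print("%.0f msg/s" % (N / (time.perf_counter() - start)))
push.close()
ctx.term()</code></pre>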
What an excellent teaching pattern - you're far more likely to remember what you learned if you first stop to think and record your own guess, and this is excellent UI and UX for doing that routinely and inline.
This is awesome. The real lesson here is: when you make a thing, compare its performance to these kinds of expected numbers, and if you're not within the same order of magnitude speed-wise, you've probably screwed up somewhere.<p>My favorite writeups are the ones that gloat about achieving hundreds of pages served per second per server. That's terrible, and hardly anyone today even seems to realize it.
Don't some of these examples run in O(1) time because the value computed in the loop isn't used? E.g., in the first example 0 is returned instead of the sum.<p>Obviously we are talking about real-world C compilers with real-world optimizations, so presumably we'd also have to consider whether the loop is executed at all?
That's nothing. Here's code that does 77 GFLOPS on a single Broadwell x86 core. Yes, that's 77 billion operations per second.<p><a href="http://pastebin.com/hPayhGXP" rel="nofollow">http://pastebin.com/hPayhGXP</a>
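That figure is plausible from first principles; a back-of-the-envelope sketch (assuming single-precision AVX2 FMA and a ~2.4 GHz sustained clock, both assumptions on my part):<p><pre><code>fma_ports = 2       # Broadwell can issue 2 FMAs per cycle
lanes = 8           # 8 single-precision floats per 256-bit AVX register
flops_per_fma = 2   # a fused multiply-add counts as two FLOPs
clock_ghz = 2.4     # assumed sustained clock speed
print(fma_ports * lanes * flops_per_fma * clock_ghz, "GFLOPS peak")  # 76.8</code></pre>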
This reminds me of "Latency Numbers Every Programmer Should Know"<p><a href="https://gist.github.com/jboner/2841832" rel="nofollow">https://gist.github.com/jboner/2841832</a><p>Edit: Just realized halfway through that there's already a link to this from their page!
Hard to believe there are 124 comments here and nobody has brought up Grace Hopper's talk[0][1] yet. With good humour she gives an example of various devices' latencies, and a simple tool for comprehending the cost and the orders of magnitude involved.<p><pre><code> [0] short - https://www.youtube.com/watch?v=JEpsKnWZrJ8
[1] long - https://www.youtube.com/watch?v=ZR0ujwlvbkQ</code></pre>
I'm curious to see the data collected on guesses. Some were quite difficult to guess, like hashes per second with bcrypt without knowing the cost factor, but I guess we can assume some sane default.<p>I would have really liked to see all these numbers in C, and in other languages for that matter. Perhaps add a dropdown box to select the language from a handful of options?
One second on what?<p>A Core i7? A Raspberry Pi? A weird octo-core dual-speed ODROID? An old i915-based Celeron? My cell phone? An Arduino?<p>"Your computer" has meant all the above to me, just in the last few weeks. The author's disinclination to describe the kind of hardware this code is running on -- other than "a new laptop" -- strikes me as kind of odd.
This reminds me of this email from LuaJIT's list:

Computers are fast, or, a moment of appreciation for LuaJIT
<a href="https://groups.google.com/forum/#!msg/snabb-devel/otVxZOj9dLA/rgCojUohBGMJ" rel="nofollow">https://groups.google.com/forum/#!msg/snabb-devel/otVxZOj9dL...</a>
Brilliant! I'd like to see those numbers summarized somewhere though, a bit like the latency numbers every programmer should know: <a href="https://gist.github.com/jboner/2841832" rel="nofollow">https://gist.github.com/jboner/2841832</a> (visual: <a href="https://i.imgur.com/k0t1e.png" rel="nofollow">https://i.imgur.com/k0t1e.png</a>)
I've come across this "article" before; I feel like I remember it under a different title, like "language speed differences" or something. Or maybe that's another article by the same author/site/format.
The grep example should search for a single character. Grep can skip bytes (Boyer-Moore-style), so longer search strings are actually faster to search for. On my machine, I see 22%-35% more time taken if I change "grep blah" to "grep b".
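Easy to reproduce; a minimal sketch (assuming some large text file, here hypothetically named words.txt):<p><pre><code>import subprocess
import time

def time_grep(pattern, path="words.txt", runs=5):
    # Take the best of several runs to reduce noise.
    best = float("inf")
    for _ in range(runs):
        start = time.perf_counter()
        subprocess.run(["grep", "-c", pattern, path],
                       stdout=subprocess.DEVNULL)
        best = min(best, time.perf_counter() - start)
    return best

print('grep "b":    %.3fs' % time_grep("b"))
print('grep "blah": %.3fs' % time_grep("blah"))</code></pre>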
Or "how fast can one of my 8 CPU cores run a for loop?" To put that in perspective: all 8 cores together give me about 40gflops. I have 2 GPUs that each give me more than 5000gflops.
Anyone care to rewrite these in C#? I'm really surprised at how fast these Python scripts are, and I'd like to see a comparison with equivalent tasks in C# to see where it stands.
Was disappointed to find that nearly all the examples were Python and shell script. I'm not interested in knowing random trivia about how slow various interpreters are.
Well, my computer won't display an image apparently inserted with JavaScript, although it <i>could</i> if I wanted to grant execute privileges on it to computers-are-fast.github.io<p>Does anyone have a link to the image(s)?