Do you have any plans to better distinguish between noise and regressions? I run a similar performance testing infrastructure for Chakra, and found that comparing against the previous run makes the results noisy. That means more manual review of results, which gets old fast.

What I do now is run a script that averages results from the preceding 10 runs and compares that to the average of the following 5 runs to see if the regression is consistent or anomalous. If the regression is consistent, the script automatically files a bug in our tracker (roughly sketched below).

There is still some noise in the results, but it cuts down on those one-off issues.
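For the curious, that windowed comparison is easy to sketch in Python. The 10/5 window sizes match the description above; the threshold value and the `fetch_results`/`file_bug` helpers are hypothetical stand-ins for whatever your results store and bug tracker actually expose:

```python
def is_consistent_regression(timings, threshold=0.05):
    """Average the 10 runs before a suspect change and the 5 runs
    after it, and flag the change only if the slowdown is consistent.

    `timings` is a list of benchmark results ordered oldest-first,
    with the suspect change landing between the two windows.
    """
    if len(timings) < 15:
        return False  # not enough history to judge
    before = timings[-15:-5]  # the 10 preceding runs
    after = timings[-5:]      # the 5 following runs
    baseline = sum(before) / len(before)
    current = sum(after) / len(after)
    # Only a sustained slowdown beyond the noise threshold counts.
    return (current - baseline) / baseline > threshold

# Hypothetical wiring -- fetch_results() and file_bug() stand in for
# whatever your results store and tracker actually expose:
#
#     if is_consistent_regression(fetch_results("some-benchmark")):
#         file_bug("some-benchmark regressed consistently")
```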
For those wanting to do similar tracking of benchmarks across commits, I've found Airspeed Velocity to be quite nice (https://readthedocs.org/projects/asv). It allows (but doesn't require) benchmarks to be kept separate from the project's repo, can track different configurations separately (e.g. using alternative compilers, dependencies, flags, etc.), keeps results from different machines separated, generates JSON data and HTML reports, performs step detection to find regressions, etc. (a minimal benchmark is sketched below).

It was intended for use with Python (virtualenv or anaconda), but I created a plugin (http://chriswarbo.net/projects/nixos/asv_benchmarking.html) which allows using Nix instead, so we can provide any commands/tools/build-products we like in the benchmarking environment (so far I've used it successfully with projects written in Racket and Haskell).
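To give a flavour: asv discovers benchmarks in a benchmarks/ directory by naming convention (time_* methods are timed, peakmem_* report peak memory, and setup() runs outside the timed region). The file and class names here are just placeholders:

```python
# benchmarks/bench_example.py -- discovered by asv via naming
# conventions: time_* methods are timed, peakmem_* report peak
# memory use, and setup() runs before each measurement, outside
# the timed region.

class TimeSuite:
    def setup(self):
        # Build the input once per measurement so allocation cost
        # doesn't pollute the timings.
        self.data = list(range(100_000, 0, -1))

    def time_sort(self):
        sorted(self.data)

    def peakmem_sort(self):
        sorted(self.data)
```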
How do you determine the baseline load of the test machine in order to qualify the correctness of the benchmark?

Assuming the compiling and testing are done in the cloud, how do you ensure the target platform (processor) doesn't change, and that you aren't being subjected to neighbors who are stealing RAM bandwidth or CPU cache resources from your VM and impacting the results?
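For what it's worth, one way to qualify a session (not necessarily what lolbench does) is to pin the expected CPU model and time a fixed canary workload before benchmarking, rejecting the run if either looks off. A rough Linux-only Python sketch; EXPECTED_CPU, REFERENCE_SECONDS, and MAX_DEVIATION are made-up values you'd record once on a known-quiet machine:

```python
import time

# Made-up reference values: record these on a known-quiet machine.
EXPECTED_CPU = "Intel(R) Xeon(R) CPU E5-2676 v3 @ 2.40GHz"
REFERENCE_SECONDS = 0.42   # canary time on the quiet machine
MAX_DEVIATION = 0.10       # reject sessions more than 10% slower

def cpu_model():
    # Linux-specific: read the processor model from /proc/cpuinfo.
    with open("/proc/cpuinfo") as f:
        for line in f:
            if line.startswith("model name"):
                return line.split(":", 1)[1].strip()
    return "unknown"

def canary_time(trials=5):
    """Time a fixed CPU-bound workload several times and keep the
    minimum, the sample least contaminated by scheduler/neighbor
    noise."""
    samples = []
    for _ in range(trials):
        start = time.perf_counter()
        sum(i * i for i in range(2_000_000))  # fixed busywork
        samples.append(time.perf_counter() - start)
    return min(samples)

def host_is_quiet():
    # Refuse to benchmark if the VM landed on a different processor,
    # or if the canary suggests neighbors are stealing cache or
    # memory bandwidth.
    if cpu_model() != EXPECTED_CPU:
        return False
    return canary_time() <= REFERENCE_SECONDS * (1 + MAX_DEVIATION)
```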
The "More Like Rocket Science Rule of Software Engineering" has been WebKit policy for a while: <a href="https://web.archive.org/web/20061011203328/http://webkit.org/projects/performance/index.html" rel="nofollow">https://web.archive.org/web/20061011203328/http://webkit.org...</a> (now at <a href="https://webkit.org/performance/" rel="nofollow">https://webkit.org/performance/</a>).
This project looks awesome, but as a complete aside:

How long do we expect it to take before "automagically" completely replaces "automatically" in English?

I am guessing less than a decade to go now.
Can I suggest you consider putting https://github.com/anp/lolbench/issues/1 into the README.md file, so people can easily see where to look for some TODO items?