科技回声

7 条评论

It would be nice if .NET Core profiling was a bit easier on Linux, Microsoft has a shell script[1] to do profiling but it requires Windows only tools.They don't ship Crossgen with the Linux packages, and you have to manually generate the .NET runtime symbols.I've gotten things like FlameGraphs working using BCC profile[2], but it took quite a bit of work.[1]: <a href="https://raw.githubusercontent.com/dotnet/corefx-tools/master/src/performance/perfcollect/perfcollect" rel="nofollow">https://raw.githubusercontent.com/dotnet/corefx-tools/master...</a> [2]: <a href="https://github.com/iovisor/bcc/blob/master/tools/profile.py" rel="nofollow">https://github.com/iovisor/bcc/blob/master/tools/profile.py</a>

评论 #18914849 未加载

kevingadd超过 6 年前

A tip related to the throw inlining tip: One way to get more consistent/effective inlining is to split the complex 'slow paths' out of your functions into helper functions. For example, let's say you have a cached operation with cache hit and cache miss paths:<pre><code> void GetValue (string key, out SomeBigType result) { if (_cache.TryGetValue(key, out result)) return; result = new SomeBigType(key, ...); _cache[key] = result; } </code></pre> In most scenarios this function might not get inlined, because the cache miss path makes the function bigger. If you use the aggressive inlining attribute you might be able to convince the JIT to inline it, but once the function gets bigger it doesn't inline anymore.However, if you pull the cache miss out:<pre><code> void GetValue (string key, out SomeBigType result) { if (_cache.TryGetValue(key, out result)) return; GetValue_Slow(key, out result); } void GetValue_Slow (string key, out SomeBigType result) { result = new SomeBigType(key, ...); _cache[key] = result; } </code></pre> You will find that in most cases, GetValue is inlined and only GetValue_Slow produces a function call. This is especially true in release builds and you can observe it in the built-in Visual Studio profiler or by looking at method disassembly.(Keep in mind that many debuggers - including VS's - will disable JIT optimization if you start an application under the debugger or attach to it. You can disable this.)This tip applies to both desktop .NET Framework and .NET Core, in my testing (netcore is generally better at inlining, though!) If you're writing any performance-sensitive paths in a library I highly recommend doing this. It can make the code easier to read in some cases anyway.

gameswithgo超过 6 年前

One of the tips is to avoid Linq, which many .NET developers are hesitant to do. I made a library that lets you use Linq style convenience functions without a performance hit in many cases:<a href="https://github.com/jackmott/LinqFaster" rel="nofollow">https://github.com/jackmott/LinqFaster</a>

评论 #18915742 未加载

评论 #18914789 未加载

评论 #18915027 未加载

评论 #18914743 未加载

评论 #18920349 未加载

评论 #18914498 未加载

评论 #18918650 未加载

zamalek超过 6 年前

> Reduce branching & branch mispredictionI wrote a parser for a "formalized" URI (it looked somewhat like OData). This parser was being invoked millions of times and was adding minutes to an operation - it dominated the profile at something like 30% CPU time. It started off something like this:<pre><code> int state = State_Start; for (var i = 0; i < str.Length; i++) { var c = str[i]; switch (state) { case State_Start: /* Handle c for this state. */ /* Update state if a new state is reached. */ } } </code></pre> Hardly rocket science, a clear-as-day miniature state machine. VTune was screaming about the switch, so I changed it to this:<pre><code> for (var i = 0; i < str.Length; i++) { for (; i < str.Length; i++) { var c = str[i]; /* Handle c for this state. */ /* Break if a new state is reached. */ } for (; i < str.Length; i++) { var c = str[i]; /* Handle c for this state. */ /* Break if a new state is reached. */ } } </code></pre> The new profile put the function at < 0.1% of CPU time. This is something that the "premature optimization crowd" (who tend to partially quote Knuth concerning optimization) get wrong: death by a thousand cuts. A single branch in the source (it ends up being more in machine code) was costing 30% performance.

评论 #18914060 未加载

评论 #18914091 未加载

GordonS超过 6 年前

> Mark classes as sealed by defaultPlease, no! This shouldn't be the default - it's a constant bugbear of mine where I want to extend a class from a library, and I can't because it's been sealed for no good reason.

评论 #18914572 未加载

评论 #18914605 未加载

评论 #18914387 未加载

评论 #18914504 未加载

评论 #18914665 未加载

评论 #18914326 未加载

评论 #18914700 未加载

blinkingled超过 6 年前

> JIT won't inline functions that throwSeriously? Never had to worry about that in Java land. What would be the reason for this?

评论 #18915907 未加载

评论 #18914753 未加载

jermaustin1超过 6 年前

So in other words extreme tuning = do the opposite of what you probably did!

7 条评论

rhinoceraptor超过 6 年前

评论 #18914849 未加载

kevingadd超过 6 年前

gameswithgo超过 6 年前

评论 #18915742 未加载

评论 #18914789 未加载

评论 #18915027 未加载

评论 #18914743 未加载

评论 #18920349 未加载

评论 #18914498 未加载

评论 #18918650 未加载

zamalek超过 6 年前

评论 #18914060 未加载

评论 #18914091 未加载

GordonS超过 6 年前

评论 #18914572 未加载

评论 #18914605 未加载

评论 #18914387 未加载

评论 #18914504 未加载

评论 #18914665 未加载

评论 #18914326 未加载

评论 #18914700 未加载

blinkingled超过 6 年前

> JIT won't inline functions that throwSeriously? Never had to worry about that in Java land. What would be the reason for this?

评论 #18915907 未加载

评论 #18914753 未加载

jermaustin1超过 6 年前

So in other words extreme tuning = do the opposite of what you probably did!

Performance Tuning for .NET Core

7 条评论

Performance Tuning for .NET Core

7 条评论