TechEcho

13 comments

6renover 13 years ago

I wonder how much JIT compilation would help, without any hand-optimization? e.g. it'd do the initial inlining.I've been amazed at Java's speedup over multiple runs: dead-slow on startup, then improving rapidly over the next 10-20 runs, and even keeps improving slowly after that. It's a bit magical. Much (all?) of that JIT tech should be applicable to Python, I'd think.BTW: link is to the comments, not the story

评论 #3153705 未加载

mhdover 13 years ago

I'm just amazed that it took ten iterations until regexps were even considered, especially considering that the set '[a-zA-Z0-9]' is involved…

评论 #3154978 未加载

padobsonover 13 years ago

What performance gain could be had from using cStringIO as described here:<a href="http://www.skymind.com/~ocrow/python_string/" rel="nofollow">http://www.skymind.com/~ocrow/python_string/</a>It seemed like the concatenation was the primary bottleneck in this case.Also, its worth noting that percentage gains on performance have huge cost savings on infrastructure at scale. That's why blogs like this are valuable because the user experience improves while the cost to provide it is reduced.

评论 #3154984 未加载

Jabblesover 13 years ago

In C you could replace<pre><code> if x in WHITELIST </code></pre> with<pre><code> if ((x >= '0' && x <= '9') || (x >= 'A' && x <= 'Z') || (x >= 'a' && x <= 'z')) </code></pre> which I suspect would be much faster than any hash-table implementation. Also I believe that should work for UTF-8 as well as ascii. I realise that this makes it harder to expand the whitelist. I'm not very familiar with python, is there something similar that could be done?

评论 #3153695 未加载

评论 #3153710 未加载

评论 #3154707 未加载

评论 #3153910 未加载

评论 #3154638 未加载

cool-RRover 13 years ago

I wonder whether it'll be more efficient to just have a big dict mapping every character to what it should look like post-escape. (e.g. {'a': 'a', '(': '&#(;', ... })Then in your loop you're only making dict lookups.

评论 #3154989 未加载

评论 #3154689 未加载

评论 #3153807 未加载

pdhborgesover 13 years ago

It would be nice if the author profiled the code instead of just measuring the test templates time.

wladimirover 13 years ago

Interesting story, with a very good speedup. Though I personally wouldn't be this patient, and would write a simple function like this in cython or even the C API directly (especially as the Python code drifts further from idiomatic with each step...).

评论 #3154548 未加载

phren0logyover 13 years ago

At the risk of exposing my ignorance, I thought>Other “common wisdom”, like using locals instead of globals, yields relatively little gain.this advice was typically more related to avoiding collisions with variable names, rather than performance?

评论 #3153693 未加载

评论 #3153699 未加载

maceover 13 years ago

This is slightly better, IMHO: <a href="http://www.python.org/doc/essays/list2str.html" rel="nofollow">http://www.python.org/doc/essays/list2str.html</a>Some best practices when optimizing CPython code:* Re-evaluate your algorithm (an inefficient quicksort is still faster than an optimized bubblesort)* Use Python functions and constructs implemented in C (ex. most builtins, list comprehensions)* Move loops from outside functions to inside (function call overhead is high)* Use try/except to handle uncommon cases rather than using conditional checks in a loop.* Eliminate dots (attribute lookup) in tight loops (create a local alias if needed)See also: <a href="http://wiki.python.org/moin/PythonSpeed/PerformanceTips" rel="nofollow">http://wiki.python.org/moin/PythonSpeed/PerformanceTips</a>

dgrantover 13 years ago

Why is string interpolation in Python so slow?

评论 #3157374 未加载

jamwtover 13 years ago

Clean C > Ugly Python, if you're really going to force the issue out of Python's comfort zone (readability).

MostAwesomeDudeover 13 years ago

Dude, use PyPy. Seriously. Please. Sacrificing readability for this kind of work is not good.

评论 #3155295 未加载

captain-asshatover 13 years ago

I haven't written any python in a while, but a 15% speedup from inlining a function call? Really? This reinforces my preference for statically typed languages.

评论 #3155008 未加载

13 comments

6renover 13 years ago

评论 #3153705 未加载

mhdover 13 years ago

I'm just amazed that it took ten iterations until regexps were even considered, especially considering that the set '[a-zA-Z0-9]' is involved…

评论 #3154978 未加载

padobsonover 13 years ago

评论 #3154984 未加载

Jabblesover 13 years ago

评论 #3153695 未加载

评论 #3153710 未加载

评论 #3154707 未加载

评论 #3153910 未加载

评论 #3154638 未加载

cool-RRover 13 years ago

评论 #3154989 未加载

评论 #3154689 未加载

评论 #3153807 未加载

pdhborgesover 13 years ago

It would be nice if the author profiled the code instead of just measuring the test templates time.

wladimirover 13 years ago

评论 #3154548 未加载

phren0logyover 13 years ago

评论 #3153693 未加载

评论 #3153699 未加载

maceover 13 years ago

dgrantover 13 years ago

Why is string interpolation in Python so slow?

评论 #3157374 未加载

jamwtover 13 years ago

Clean C > Ugly Python, if you're really going to force the issue out of Python's comfort zone (readability).

MostAwesomeDudeover 13 years ago

Dude, use PyPy. Seriously. Please. Sacrificing readability for this kind of work is not good.

评论 #3155295 未加载

captain-asshatover 13 years ago

I haven't written any python in a while, but a 15% speedup from inlining a function call? Really? This reinforces my preference for statically typed languages.

评论 #3155008 未加载

A Python Optimization Anecdote

13 comments

A Python Optimization Anecdote

13 comments