Some good alternatives for when you just want quick and easy parallelization:

Joblib [0]: 'Embarrassingly parallel for loops'. You basically write a generator and get multicore processing on it. Pretty straightforward; there's a quick sketch below.

Multiprocess [1]: a fork of the multiprocessing module. The biggest benefit to me (the one time I used it) is that it's easier to create shared data structures across processes, which I couldn't do with multiprocessing.Pool. Last week I had to run a really long graph calculation that only needed to return a result for a very, very small fraction of the arguments. Joblib stored all of the null results as None, which tore through my RAM after billions of relatively fast calculations and then crashed. Using a shared dictionary with multiprocess.Pool and imap_unordered instead, I got multicore processing while adding items to the dict only when the right conditions were met and discarding the Nones. RAM use was minimal; see the second sketch below.
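A minimal joblib sketch, assuming a toy square() function in place of the real work:

    from joblib import Parallel, delayed

    def square(x):
        return x * x

    # n_jobs=-1 uses all available cores; the generator
    # expression feeds one delayed call per argument
    results = Parallel(n_jobs=-1)(delayed(square)(i) for i in range(1000))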
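And a minimal sketch of the shared-dict approach. The check() function and its "rare hit" condition are made up for illustration; multiprocess mirrors the stdlib multiprocessing API, so Manager and Pool behave the same way here:

    from functools import partial
    from multiprocess import Manager, Pool

    def check(n, hits):
        # stand-in for the real graph calculation
        value = n * n
        if n % 100000 == 0:   # the rare 'right conditions'
            hits[n] = value   # keep only the hits in the shared dict
        # everything else falls through and returns None

    if __name__ == '__main__':
        manager = Manager()
        hits = manager.dict()  # shared dict, writable from every worker
        with Pool() as pool:
            # drain the stream of Nones without ever accumulating them
            for _ in pool.imap_unordered(partial(check, hits=hits),
                                         range(1000000), chunksize=1000):
                pass
        print(len(hits), 'hits kept')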
[0]: https://pythonhosted.org/joblib/index.html

[1]: https://pypi.python.org/pypi/multiprocess