i've been thinking about a closely related feature in a different context: adding block arguments, as in smalltalk or ruby or especially lobster, to a language more like c, with static types and stack allocation<p>i think this would be a good fit for (among other things) clu-like iterators and imgui libraries, where you often want to do something like<p><pre><code> submenu("&Edit") {
command("&Cut") { clip_cut(getSelection()); }
...
}
</code></pre>
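for contrast, here is a minimal c sketch (invented api, purely illustrative) of how this pattern has to be written without block arguments: each block becomes a named callback, and its state travels through a context pointer<p><pre><code> #include &lt;stdio.h&gt;

 /* invented imgui-style api, purely for illustration: without block
    arguments, each block is hoisted into a named function and its state
    has to travel through a void* context parameter */
 typedef void body_fn(void *ctx);

 static void submenu(const char *label, body_fn *body, void *ctx) {
     printf("begin submenu %s\n", label);  /* stand-in for real ui work */
     body(ctx);                            /* closure-style indirect call */
     printf("end submenu %s\n", label);    /* cleanup after the body returns */
 }

 static void command(const char *label, body_fn *action, void *ctx) {
     printf("  command %s\n", label);      /* a real library would run action on click */
     (void)action; (void)ctx;
 }

 static void cut_action(void *ctx) { (void)ctx; /* clip_cut(getSelection()); */ }

 static void edit_menu_body(void *ctx) {
     command("&Cut", cut_action, ctx);
     /* ... */
 }

 int main(void) {
     submenu("&Edit", edit_menu_body, NULL);
     return 0;
 }
</code></pre>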
this is especially useful in a context where you're heap-allocating sparingly or not at all, because the subroutine taking the block argument can stack-allocate some resource, pass it to the block, and deallocate it once the block returns; python context managers and win32 paint messages are two cases where people commonly do this sort of thing, but things like save-excursion, with-output-file, transactional memory, and gsave/grestore also provide motivation<p>the conventional way to do this is to package the block up into a closure and then invoke it with a full-fledged function call, using a calling convention that supports closures. but i suspect a more relaxed and efficient approach is an asymmetric coroutine calling convention, in which the callee yields control back to its caller at the entry point of the block, and the block then resumes the callee when it finishes. so instead of merely dividing registers into callee-saved and call-clobbered, as subroutine calling conventions do, we would divide them into three classes: registers that are callee-saved on return but that, at a yield, hold callee values the block must restore before resuming; caller coroutine context registers, which are preserved across both return and yield; and call-clobbered registers. in many cases you also need a way for the block to safely force an early exit from the callee<p>this allows the caller's local variables to be in registers its blocks can use without further ado, or at least indexed off of such a register, while allowing the yield and resume operations to be, in many cases, each just a single machine instruction. and it does not require heap allocation<p>as an example of taking this to the point of absurdity, here's an untested subroutine for iterating over a nul-terminated string passed in r0 with a block passed in r1, using a hypothetical coroutine convention which passes at least r4 through from its caller to its blocks<p><pre><code> itersz: push {r6, r7, r8, lr}    @ r8 is pushed only to keep the stack 8-byte aligned
         mov r7, r0               @ r7: walking string pointer
         mov r6, r1               @ r6: block address, kept out of call-clobbered r1
 1:      ldrb r0, [r7], #1        @ load the next byte, post-incrementing the pointer
         cbz r0, 1f               @ nul terminator: done
         blx r6                   @ yield the byte in r0 to the block
         b 1b
 1:      pop {r6, r7, r8, pc}     @ restore and return to the original caller
</code></pre>
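for comparison, here is a rough c sketch of the same iteration under the conventional closure-style convention described above; each byte costs a full indirect call, and the visitor's state travels through a context pointer instead of riding along in a pass-through register like r4 (all names invented for illustration)<p><pre><code> #include &lt;stdio.h&gt;
 #include &lt;stddef.h&gt;

 /* conventional-convention equivalent of itersz: visit each byte of a
    nul-terminated string through a callback plus a context pointer */
 typedef void byte_block_fn(void *ctx, unsigned char byte);

 static void itersz_c(const char *s, byte_block_fn *block, void *ctx) {
     while (*s)                            /* the ldrb / cbz pair above */
         block(ctx, (unsigned char)*s++);  /* the blx "yield" */
 }

 /* trivial visitor: count bytes, keeping its state behind the context
    pointer rather than in a pass-through register like r4 */
 static void count_byte(void *ctx, unsigned char byte) {
     (void)byte;
     ++*(size_t *)ctx;
 }

 int main(void) {
     size_t n = 0;
     itersz_c("hello", count_byte, &n);
     printf("%zu bytes\n", n);             /* prints: 5 bytes */
     return 0;
 }
</code></pre>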
and here is another untested subroutine which uses it to calculate a string hash<p><pre><code> hashsz: push {r4, r5, r9, lr}    @ save the caller's pass-through r4, which we clobber as the hash accumulator
         movs r4, #53             @ seed the hash in the pass-through register r4
         adr r1, 1f               @ address of the block below
         adds r1, #1              @ set the thumb bit so the blx in itersz stays in thumb state
         bl itersz                @ iterate; itersz yields each byte to the block in r0
         mov r0, r4               @ return the accumulated hash
         pop {r4, r5, r9, pc}
 1:      eor r4, r0, r4, ror #27  @ the block: mix the byte into the hash
         bx lr                    @ resume itersz
</code></pre>
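in c, the hash being computed here is roughly the following (my transcription of the assembly above, with rotr32 standing in for the ror #27 operand)<p><pre><code> #include &lt;stdint.h&gt;
 #include &lt;stdio.h&gt;

 /* helper for the "ror #27" operand: rotate a 32-bit word right by k bits */
 static uint32_t rotr32(uint32_t x, unsigned k) {
     return (x >> k) | (x << (32u - k));
 }

 /* transcription of hashsz: seed 53, then for each byte
    h = byte ^ rotr(h, 27), matching "eor r4, r0, r4, ror #27" */
 static uint32_t hashsz_c(const char *s) {
     uint32_t h = 53;
     while (*s)
         h = (unsigned char)*s++ ^ rotr32(h, 27);
     return h;
 }

 int main(void) {
     printf("%08x\n", (unsigned)hashsz_c("hello"));
     return 0;
 }
</code></pre>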
even in this case, where both the iteration and the visitor block are utterly trivial, the runtime overhead per item (compared to putting them in the same subroutine) is evidently extremely modest; my estimate is 7 cycles per byte rather than 4 cycles per byte on in-order hardware with simple branch prediction, so the added cost is on the order of 1 ns per byte on the hardware russ used as his reference. for anything more complex the overhead should be insignificant<p>it's less general than the mechanism russ proposes here (it doesn't solve the celebrated samefringe problem), but it's also an order of magnitude more efficient, because the yield and resume operations are less work than a full-fledged closure call, though still more work than, say, decrementing a register and jumping if nonzero