Back to basics: Why we chose long-polling over websockets

258 点作者 lunarcave4 个月前

28 条评论

Animats4 个月前

Long polling has some problems of its own.Second Life has an HTTPS long polling channel between client and server. It's used for some data that's too bulky for the UDP connection, not too time sensitive, or needs encryption. This has caused much grief.On the client side, the poller uses libcurl. Libcurl has timeouts. If the server has nothing to send for a while, libcurl times out. The client then makes the request again. This results in a race condition if the server wants to send something between timeout and next request. Messages get lost.On top of that, the real server is front-ended by an Apache server. This just passes through relevant requests, blocking the endless flood of junk HTTP requests from scrapers, attacks, and search engines. Apache has a timeout, and may close a connection that's in a long poll and not doing anything.Additional trouble can come from middle boxes and proxy servers that don't like long polling.There are a lot of things out there that just don't like holding an HTTP connection open. Years ago, a connection idle for a minute was fine. Today, hold a connection open for ten seconds without sending any data and something is likely to disconnect it.The end result is an unreliable message channel. It has to have sequence numbers to detect duplicates, and can lose messages. For a long time, nobody had discovered that, and there were intermittent failures that were not understood.In the original article, the chart section labelled "loop" doesn't mention timeout handling. That's not good. If you do long polling, you probably need to send something every few seconds to keep the connection alive. Not clear what a safe number is.

评论 #42604888 未加载

评论 #42605065 未加载

评论 #42604963 未加载

评论 #42609799 未加载

评论 #42604648 未加载

ipnon4 个月前

Articles like this make me happy to use Phoenix and LiveView every day. My app uses WebSockets and I don’t think about them at all.

评论 #42603144 未加载

评论 #42602514 未加载

评论 #42602188 未加载

评论 #42603710 未加载

评论 #42601696 未加载

评论 #42605047 未加载

评论 #42601552 未加载

评论 #42601771 未加载

评论 #42604362 未加载

CharlieDigital4 个月前

Would there be any technical benefit to this over using server sent events (SSE)?Both are similar in that they hold the HTTP connection open and have the benefit of being simply HTTP (the big plus here). SSE (at least to me) feels like it's more suitable for some use cases where updates/results could be streamed in.A fitting use case might be where you're monitoring all job IDs on behalf of a given client. Then you could move the job monitoring loop to the server side and continuously yield results to the client.

评论 #42601006 未加载

评论 #42600661 未加载

评论 #42601166 未加载

评论 #42605884 未加载

yuliyp4 个月前

I think this article is tying a lot of unrelated decisions to "Websocket" vs "Long-polling" when they're actually independent. A long-polling server could handle a websocket client with just a bit of extra work to handle keep-alive.For the other direction, to support long-polling clients if your existing architecture is websockets which get data pushed to them by other parts of the system, just have two layers of servers: one which maintains the "state" of the connection, and then the HTTP server which receives the long polling request can connect to the server that has the connection state and wait for data that way.

评论 #42603939 未加载

评论 #42604469 未加载

wereHamster4 个月前

Unrelated to the topic in the article…<pre><code> await new Promise(resolve => setTimeout(resolve, 500)); </code></pre> In Node.js context, it's easier to:<pre><code> import { setTimeout } from "node:timers/promises"; await setTimeout(500);</code></pre>

评论 #42604620 未加载

评论 #42604538 未加载

baumschubser4 个月前

I like long polling, it’s easy to understand from start to finish and from client perspective it just works like a very slow connection. You have to keep track of retries and client-side cancelled connections to have one but only one (and the right one) of requests at hand to answer to.One thing that seems clumsy in the code example is the loop that queries the data again and again. Would be nicer if the data update could also resolve the promise of the response directly.

评论 #42601555 未加载

评论 #42601386 未加载

rednafi4 个月前

Neither Server-Sent Events nor WebSockets have replaced all use cases of long polling reliably. The connection limit of SSE comes up a lot, even if you’re using HTTP/2. WebSockets, on the other hand, are unreliable as hell in most environments. Also, WS is hard to debug, and many of our prod issues with WS couldn’t even be reproduced locally.Detecting changes in the backend and propagating them to the right client is still an unsolved problem. Until then, long polling is surprisingly simple and a robust solution that works.

评论 #42604779 未加载

评论 #42608774 未加载

imglorp4 个月前

Since the article mentioned Postgres by name, isn't this a case for using its asynchronous notification features? Servers can LISTEN to a channel and PG can TRIGGER and NOTIFY them when the data changes.No polling needed, regardless of the frontend channel.

评论 #42604305 未加载

评论 #42604484 未加载

bigbones4 个月前

I don't know how meaningful it is any more, but with long polling with a short timeout and a gracefully ended request (i.e. chunked encoding with an eof chunk sent rather than disconnection), the browser would always end up with one spare idle connection to the server, making subsequent HTTP requests for other parts of the UI far more likely to be snappier, even if the app has been left otherwise idle for half the dayI guess at least this trick is still meaningful where HTTP/2 or QUIC aren't in use

bartvk4 个月前

Refreshing to be reminded of a relatively simple alternative to websockets. For a short time, I worked at a now-defunct startup which had made the decision for websockets. It was an app that would often be used on holiday so testing was done on hotel and restaurant wifi. Websockets made that difficult.

评论 #42601167 未加载

peheje4 个月前

What about HTTP/2 Multiplexing, how does it hold up against long-polling and websockets?I have only tried it briefly when we use gRPC: <a href="https://grpc.io/docs/what-is-grpc/core-concepts/#server-streaming-rpc" rel="nofollow">https://grpc.io/docs/what-is-grpc/core-concepts/#server-stre...</a>Here it's easy to specify that a endpoint is a "stream", and then the code-generation tool gives all tools really to just keep serving the client with multiple responses. It looks deceptively simple. We already have setup auth, logging and metrics for gRPC, so I hope it just works off of that maybe with minor adjustments. But I'm guessing you don't need the gRPC layer to use HTTP/2 Multiplexing?

评论 #42603702 未加载

评论 #42606050 未加载

评论 #42608521 未加载

vouwfietsman4 个月前

The points mentioned against websockets are mostly fud, I've used websockets in production for a very heavy global data streaming application, and I would respond the following to the "upsides" of not using websockets:> Observability Remains UnchangedActually it doesn't, many standard interesting metrics will break because long-polling is not a standard request either.> Authentication SimplicitySure, auth is different than with http, but not more difficult. You can easily pass a token.> Infrastructure CompatibilityI'm sure you can find firewalls out there where websockets are blocked, however for my use case I have never seen this reported. I think this is outdated, for sure you don't need "special proxy configurations or complex infrastructure setups".> Operational SimplicityRestarts will drop any persistent connection, state can be both or neither in WS or in LP, it doesn't matter what you use.> Client implementationIt mentions "no special WebSocket libraries needed" and also "It works with any HTTP client". Guess what, websockets will work with any websocket client! Who knew!Finally, in the conclusion:> For us, staying close to the metal with a simple HTTP long polling implementation was the right choiceCalling simple HTTP long polling "close to the metal" in comparison to websockets is weird. I wouldn't be surprised if websockets scale much better and give much more control depending on the type of data, but that's besides the point. If you want to use long polling because you prefer it, go ahead. Its a great way to stick to request/response style semantics that web devs are familiar with. Its not necessary to regurgitate a bunch of random hearsay arguments that may influence people in the wrong way.Try to actually leave the reader with some notion of when to use long polling vs when to use websockets, not a post-hoc justification of your decision based on generalized arguments that do not apply.

评论 #42604588 未加载

vitus4 个月前

I would appreciate if the article spent more time actually discussing the benefits of websockets (and/or more modern approaches to pushing data from server -> browser) and why the team decided those benefits were not worth the purported downsides. I could see the same simplicity argument being applied to using unencrypted HTTP/1.1 instead of HTTP/2, or TCP Reno instead of CUBIC.The section at the end talking about "A Case for Websockets" really only rehashes the arguments made in "Hidden Benefits of Long-Polling" stating that you need to reimplement these various mechanisms (or just use a library for it).My experience in this space is from 2011, when websockets were just coming onto the scene. Tooling / libraries were much more nascent, websockets had much lower penetration (we still had to support IE6 in those days!), and the API was far less stable prior to IETF standardization. But we still wanted to use them when possible, since they provided much better user experience (lower latency, etc) and lower server load.

bob10294 个月前

I think we could have the best of both worlds. E.g.:<a href="https://socket.io/docs/v4/how-it-works/#upgrade-mechanism" rel="nofollow">https://socket.io/docs/v4/how-it-works/#upgrade-mechanism</a>

评论 #42604936 未加载

LeicaLatte4 个月前

Long polling is my choice for simple, reliable and plug and play like interfaces. HTTP requests tend to be standard and simplify authentication as well. Systems with frequent but not constant updates are ideal. Text yes. Voice maybe not.Personal Case Study: I built mobile apps which used Flowise assistants for RAG and found websockets compeletely out of line with the rest of my system and interactions. Suddenly I was fitting a round peg in a square hole. I switched to OpenAI assistants and their polling system felt completely "natural" to integrate.

feverzsj4 个月前

Why not just use chunked encoding and get rid of extra requests.

Cort3z4 个月前

I think they are mixing some problems here. They could probably have used their original setup with Postgres NOTIFY+triggers in stead of polling, and only have one "pickup poller" to catch any missed events/jobs. In my opinion transaction medium should not be linked to how the data is manage internally, but I know from experience that this separation is often hard to achieve in practice.

emilio13374 个月前

The article does discuss a lot of mixed concepts. I would prefer one process polling new jobs/state and one process handling http connections/websockets. Hence no flooding the database and completely scalable from the client side. The database process pushes everything downstream via some queue while the other process/server handles those and sends them to respective clients

sgarland4 个月前

The full schema isn’t listed, but the indices don’t make sense to me.(id, cluster_id) sounds like it could / should be the PKIf the jobs are cleared once they’ve succeeded, and presumably retried if they’ve failed or stalled, then the table should be quite small; so small, that a. The query planner is unlikely to use the partial index on (status) b. The bloat from the rapidity of DELETEs likely overshadows the live tuple size.

DougN74 个月前

I implemented a long polling solution in desktop software over 20 years ago and it’s still working great. It can even be used as a tunnel to stream RDP sessions, through which YouTube can play without a hiccup. Big fan of long polling, though I admit I didn’t get a chance to try web sockets back then.

评论 #42603453 未加载

k__4 个月前

Half-OT:What's the most resource efficient way to push data to clients over HTTP?I can send data to a server via HTTP request, I just need a way to notify a client about a change and would like to avoid polling for it.I heard talk about SSE, WebSockets, and now long-polling.Is there something else?What requires the least resources on the server?

评论 #42602058 未加载

gloosx4 个月前

>Our system handles hundreds of worker nodes constantly polling our PostgreSQL-backed control plane for new jobsDoes everybody poll their PosgreSQL to get new rows in real-time? This is really weird, there are trigger functions and notifications.

评论 #42619902 未加载

amelius4 个月前

> Corporate firewalls blocking WebSocket connections was one of our other worries. Some of our users are behind firewalls, and we don't need the IT headache of getting them to open up WebSockets.Don't websockets look like ordinary https connections?

评论 #42603721 未加载

评论 #42602500 未加载

mojuba4 个月前

Can someone explain why TTL = 60s is a good choice? Why not more, or less?

评论 #42604295 未加载

justinl334 个月前

yeah, authentication complexity with WebSockets is severely underappreciated. We ran into major RBAC headaches when clients needed to switch between different privilege contexts mid-session. Long polling with standard HTTP auth patterns eliminates this entire class of problems.

评论 #42601481 未加载

tguvot4 个月前

Another reason: there is a patent troll suing companies over usage of websockets.

sneak4 个月前

Given long polling, I have never ever understood why websockets became a thing. I’ve never implemented them and never will, it’s a protocol extension where none is necessary.

评论 #42604656 未加载

valenterry4 个月前

Using websockets with graphql, I feel like a lot of the challenges are then already solved. From the post:- Observability: WebSockets are more stateful, so you need to implement additional logging and monitoring for persistent connections: solved with graphql if the existing monitoring is already sufficient.- Authentication: You need to implement a new authentication mechanism for incoming WebSocket connections: solved with graphql.- Infrastructure: You need to configure your infrastructure to support WebSockets, including load balancers and firewalls: True, firewalls need to be updated.- Operations: You need to manage WebSocket connections and reconnections, including handling connection timeouts and errors: normally already solved by the graphql library. For errors, it's basically the same though.- Client Implementation: You need to implement a client-side WebSocket library, including handling reconnections and state management: Just have to use a graphql library that comes with websocket support (I think most of them do) and configure it accordingly.

评论 #42601188 未加载

评论 #42601334 未加载

评论 #42604580 未加载