kdunglas 2 days ago

A while ago I created Mercure: an open pub-sub protocol built on top of SSE that is a replacement for WebSockets-based solutions such as Pusher. Mercure is now used by hundreds of apps in production.

At the core of Mercure is the hub. It is a standalone component that maintains persistent SSE (HTTP) connections to the clients, and it exposes a very simple HTTP API that server apps and clients can use to publish. POSTed updates are broadcast to all connected clients over SSE. This makes SSE usable even with technologies that cannot maintain persistent connections, such as PHP and many serverless providers.

Mercure also adds nice features to SSE such as a JWT-based authorization mechanism, the ability to subscribe to several topics over a single connection, an event history, automatic state reconciliation in case of network issues…

I maintain an open-source hub written in Go (technically, a module for the Caddy web server) and a SaaS version is also available.

Docs and code are available on https://mercure.rocks

  • apitman 2 days ago

    The site mentions battery efficiency specifically. I'm curious: what features does Mercure provide in that direction?

    • pests 2 days ago

      It comes down to all the extra bytes sent and processed (local and remote, and in flight) by long polling. SSE events are small, while other methods might require multiple packets and needless headers throughout the stack, for example.

  • whateveracct a day ago

    I've used Mercure before at a startup of mine. Self hosted. Worked great! And still works to this day (I haven't actively worked on that startup for years myself.)

  • tonyhart7 2 days ago

    It's cool, but it's in Go. Do you know of other implementations in Rust?

dugmartin 2 days ago

It doesn’t mention the big drawback of SSE as spelled out in the MDN docs:

“Warning: When not used over HTTP/2, SSE suffers from a limitation to the maximum number of open connections, which can be especially painful when opening multiple tabs, as the limit is per browser and is set to a very low number (6).”

  • atombender 2 days ago

    One of my company's APIs uses SSE, and it's been a big support headache for us, because many people are behind corporate firewalls that don't do HTTP/2 or HTTP/3, and people often open many tabs at the same time. It's unfortunately not possible to detect client-side whether the limit has been reached.

    Another drawback of SSE is lack of authorization header support. There are a few polyfills (like this one [1]) that simulate SSE over fetch/XHR, but it would be nice to not need to add the bloat.

    [1] https://github.com/EventSource/eventsource

    • fitsumbelay 2 days ago

      I hate to suggest a solution before testing it myself, so apologies in advance, but I have a hunch that the Broadcast Channel API can help you detect browser tab opens on the client side. New tabs won't connect to the event source and will instead listen for the localStorage updates that the first loaded tab makes.

      https://www.google.com/search?q=can+I+use+BroadcastChannel+A...

      The problem in this case is how to handle the first tab closing and how to re-assign which tab then becomes the new "first" tab that connects to the event source, but it may be a low-LOE problem to solve.

      Again, apologies for suggesting unproven solutions, but at the same time I'm interested in the feedback this gets, to see if it's near the right track.

      • nchmy a day ago

        Rather than Broadcast Channel, you can use the Web Locks API (https://developer.mozilla.org/en-US/docs/Web/API/Web_Locks_A...). This handles the leader election pretty seamlessly because all tabs try to become the leader and only one actually can.

        This library (https://github.com/pubkey/broadcast-channel) from the fantastic RxDB JavaScript DB library uses Web Locks with a fallback to Broadcast Channel. But Web Locks is supported on ~96% of browsers, so it's probably safe to just use it exclusively now.
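
The leader-election trick described above can be sketched in a few lines. This is a hypothetical helper, not RxDB's actual API; the lock manager is injectable so the logic can be exercised outside a browser (in a page you would pass `navigator.locks`):

```javascript
// Hypothetical helper, assuming the Web Locks API. Every tab calls this;
// the browser grants the named exclusive lock to exactly one of them.
function electLeader(name, onLeader, locks = globalThis.navigator?.locks) {
  return locks.request(name, () => {
    onLeader(); // only the winning tab runs this: open the EventSource here
    return new Promise(() => {}); // never resolves: hold the lock until the tab dies
  });
}
```

The losing tabs' `request` calls simply stay pending; when the leader's tab closes, the browser releases the lock and promotes one waiter, which also addresses the "first tab closed" problem mentioned upthread.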

      • b4ckup 2 days ago

        This sounds like a good use case for using a service worker. All tabs talk to the service worker and the worker is the single instance that talks to the backend and can use only one connection. Maybe there are some trade offs for using SSE in web workers, I'm not sure.

        • Keithamus 2 days ago

          BroadcastChannel is a better solution for a couple of reasons. Service Workers are better at intercepting network requests and returning items from a cache, there’s some amount of additional effort to do work outside of that. The other thing is they’re a little more difficult to set up. A broadcast channel can be handled in a couple lines of code, easily debuggable as they run on the main thread, and they’re more suited to the purpose.

          • b4ckup a day ago

            I disagree. You can just postMessage to communicate with the service worker and therefore I imagine the code using broadcast channel to be actually quite similar. About debugging, service workers are easily debuggable, though not on the main thread as you already mentioned.

          • fitsumbelay 2 days ago

            Agreed. Workers were one of my first thoughts too, but I think BroadcastChannel delivers with much lower LOE.

    • leni536 2 days ago

      Supposedly websockets (the protocol) support authorization headers, but often there are no APIs for that in websocket libraries, so people just abuse the subprotocols header in the handshake.

      • apitman 2 days ago

        I don't think the problem is libraries. Browsers don't support this.

        • leni536 2 days ago

          Sure, I didn't mean to distinguish browsers and the JS websocket API and websocket libraries in other languages.

    • robocat 2 days ago

      Presumably you try SSE, and on failure fallback to something else like WebSockets?

      Push seems to require supporting multiple communication protocols to avoid failure modes specific to one protocol - and libraries are complex because of that.

      • mardifoufs 2 days ago

        But then why not just use websockets?

        • virtue3 2 days ago

          From what I understand websockets are great until you have to load balance them. And then you learn why they aren’t so great.

          • hamandcheese 2 days ago

            My understanding is that the hard part about scaling WebSockets is that they are stateful, long-lived connections. That is also true of SSE. Is there some other aspect of WebSockets that makes them harder to scale than SSE?

            I guess with WebSockets, if you choose to send messages from the client to the server, then you have some additional work that you wouldn't have with SSE.

          • com2kid 2 days ago

            I've scaled websockets before, it isn't that hard.

            You need to scale up before your servers become overloaded, so that new connections get routed to the newly brought-up server. It is a different mentality than scaling stateless services, but it isn't super duper hard.

            • hooli_gan 2 days ago

              Can you suggest some resources to learn more about Websocket scaling? Seems like an interesting topic

              • com2kid 2 days ago

                Honestly, I just flipped the right bits in the AWS load balancer (maintaining persistent connections, which is the first thing you are told to do when googling AWS load balancers and WebSockets) and set up the instance scaler to trigger based on "# open connections / num servers > threshold".

                Ideally it is based on the rate of incoming connections, but so long as you leave enough headroom when doing the stupid simple scaling rule you should be fine. Just ensure new instances don't take too long to start up.

    • nchmy 2 days ago

      FYI, the dev of that library created a new, better Event Source client

      https://github.com/rexxars/eventsource-client

      • atombender 2 days ago

        Yes, I know. We both work at Sanity, actually! The reason I didn't mention it was that the newer library isn't a straight polyfill; it offers a completely different interface with async support and so on.

  • jesprenj 2 days ago

    You can easily multiplex data over one connection/event stream. You can design your app so that it only uses one event stream for all the events it needs to receive.
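
A sketch of that multiplexing idea: all topics share one stream, and handlers are keyed on the SSE `event` field (names are illustrative):

```javascript
// Route messages from one shared stream to per-topic handlers.
function makeDispatcher() {
  const handlers = new Map();
  return {
    on(topic, fn) { handlers.set(topic, fn); },
    // In a browser you would wire each topic to one shared EventSource:
    //   es.addEventListener(topic, e => dispatch(topic, e.data));
    dispatch(topic, data) {
      const fn = handlers.get(topic);
      if (fn) fn(JSON.parse(data)); // unknown topics are silently dropped
    },
  };
}
```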

    • raggi 2 days ago

      This, it works well in a service worker for example.

      • nikeee 2 days ago

        How does this work with a service worker? I've only managed to do this via SharedWorker (which is not available on Chrome on Android).

        • raggi 2 days ago

          You can just open a stream in the service worker and push events via postMessage and friends.

          Another nice thing to do is to wire up a simple filesystem monitor for all your cached assets that pushes path and timestamp events to the service worker whenever they change; then the service worker can refresh affected clients too (with only a little work this is endgame livereload, if you're not constrained by your environment).

  • RadiozRadioz 2 days ago

    That is a very low number. I can think of many reasons why one would end up with more. Does anyone know why it is so low?

    • raggi 2 days ago

      The number was set while Apache was dominant and common deployments would get completely tanked by a decent number of clients opening more connections than this. c10k was a thing once; these days c10m is relatively trivial.

    • apitman 2 days ago

      Historical reasons. The HTTP/1.1 spec actually recommends limiting clients to 2 connections per domain. That said, I'm not sure why it's still so low. I would guess mostly to avoid unintended side effects of changing it.

      • gsnedders 2 days ago

        > The HTTP/1.1 spec actually recommends limiting clients to 2 connections per domain.

        This is no longer true.

        From RFC 9112 § 9.4 (https://httpwg.org/specs/rfc9112.html#rfc.section.9.4):

        > Previous revisions of HTTP gave a specific number of connections as a ceiling, but this was found to be impractical for many applications. As a result, this specification does not mandate a particular maximum number of connections but, instead, encourages clients to be conservative when opening multiple connections.

        • apitman 2 days ago

          If this were a MUST, would it have required a version bump from 1.1?

      • dontchooseanick 2 days ago

        Because you're supposed to use a single connection with HTTP pipelining for all your resources [1].

        When index.html loads 4 CSS and 5 JS files, those 10 resources needed 10 connections in HTTP/1.0, with 10 TLS negotiations (unless one resource loaded fast and you could reuse its released connection).

        With HTTP/1.1 pipelining you open only one connection, including a single TLS negotiation, and ask for all 10 resources.

        So why not only 1 per domain? IIRC it's because the first resource, index.html, may take a lot of time to complete, and head-of-line concerns suggest using a connection other than the 'main' one, more or less. So basically 2 are sufficient.

        [1] https://en.m.wikipedia.org/wiki/HTTP_pipelining

        • immibis 2 days ago

          HTTP pipelining isn't used by clients.

    • giantrobot 2 days ago

      Because 30 years ago server processes often (enough) used inetd or served a request with a forked process. A browser hitting a server with a bunch of connections, especially over slow network links where the connection would be long lived, could swamp a server. Process launches were expensive and could use a lot of memory.

      While server capacity in every dimension has increased, the low connection count for browsers has remained. But even today it's still a bit of a courtesy to not spam a server with a hundred simultaneous connections. If the server implicitly supports tons of connections via HTTP/2, that's one thing, but it's not polite to abuse HTTP/1.1 servers.

    • foota 2 days ago

      Probably because without http/2 each would require a TCP connection, which could get expensive.

  • SahAssar 2 days ago

    There is little reason to not use HTTP/2 these days unless you are not doing TLS. I can understand not doing HTTP/3 and QUIC, but HTTP/2?

    • jiggawatts 2 days ago

      Corporate proxy servers often downgrade connections to HTTP/1.1 because of inertia and lazy vendors.

      • SahAssar 2 days ago

        To do that they need to MITM and tamper with the inner protocol.

        In my experience this is quite rare. Some MITM proxies analyze the traffic, restrict which ciphers can be used, block non-dns udp (and therefore HTTP/3), but they don't usually downgrade the protocol from HTTP/2 to HTTP/1.

        • geoffeg 2 days ago

          That hasn't been my experience at large corporations. They usually have a corporate proxy which only speaks HTTP 1.1, intercepts all HTTPS, and doesn't support websockets (unless you ask for an exception) and other more modern HTTP features.

        • arccy 2 days ago

          "tamper" sounds much more involved than what they (their implementation) probably do: the proxy decodes the http request, potentially modifies it, and uses the decoded form to send a new request using their client, which only speaks http/1

          • mrngm 20 hours ago

            That is already tampering with the request. It's not simply forwarded to the requested destination, but intercepted, interpreted, and resent... with a different client than the original source.

        • dilyevsky 2 days ago

          That’s exactly what they’re doing and it’s still very common in private networks

  • nhumrich 2 days ago

    HTTP/2 is controllable by you, since it's supported in every browser. So the way to fix this limitation is to use HTTP/2.

    • jillesvangurp 2 days ago

      Yes, use a proper load balancer that can do that. And use Http3 which is also supported by all relevant browsers at this point. There's no good reason to build new things on top of old things.

  • k__ 2 days ago

    And over HTTP/2 and 3 they are efficient?

    • apitman 2 days ago

      HTTP/2+ only uses a single transport connection (TCP or QUIC) per server, and multiplexes over that. So there's essentially no practical limit.

      • toomim 2 days ago

        Except that browsers add a limit of ~100 connections even with HTTP/2, for no apparently good reason.

piccirello 2 days ago

I utilized SSE when building automatic restart functionality[0] into Doppler's CLI. Our API server would send down an event whenever an application's secrets changed. The CLI would then fetch the latest secrets to inject into the application process. (I opted not to directly send the changed secrets via SSE, as that would necessitate rechecking the access token that was used to establish the connection, lest we send changed secrets to a recently deauthorized client.)

I chose SSE over websockets because the latter required pulling additional dependencies into our Golang application, and we truly only needed server->client communication.

One issue we ran into that hasn't been discussed is HTTP timeouts. Some load balancers close an HTTP connection after a certain timeout (e.g. 1 hour) to prevent connection exhaustion. You can usually extend this timeout, but it has to be explicitly configured. We also found that our server had to send intermittent "ping" events to prevent either Cloudflare or Google Cloud Load Balancing from closing the connection, though I don't remember how frequently these were sent. Otherwise, SSE worked great for our use case.

[0] https://docs.doppler.com/docs/automatic-restart

  • apitman 2 days ago

    Generally you're going to want to send ping events pretty regularly (I'd default to every 15-30 seconds depending on application) whether you're using SSE, WebSockets, or something else. Otherwise if the server crashes the client might not know the connection is no longer live.
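
On the wire, such a ping can be just an SSE comment line, which EventSource silently discards. A minimal sketch of the framing (function names are illustrative):

```javascript
// Lines starting with ":" are SSE comments; clients ignore them,
// so they make cheap keepalives that still reset proxy idle timers.
function ssePing() {
  return ': ping\n\n'; // comment line + blank-line terminator
}

// Single-line data only in this sketch; multi-line payloads need
// one "data:" line per segment.
function sseEvent(data, { event, id } = {}) {
  let out = '';
  if (id !== undefined) out += `id: ${id}\n`;
  if (event !== undefined) out += `event: ${event}\n`;
  return out + `data: ${data}\n\n`;
}
```

A server would then do something like `setInterval(() => res.write(ssePing()), 15000)` per connection, with `res` being whatever writable stream your framework exposes.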

    • robocat 2 days ago

      What do you do for mobile phones: using data/radio for pings would kill the battery?

      After locking the phone, how is the ping restarted when the phone is unlocked? Or backgrounding the browser/app?

      • erinaceousjones 2 days ago

        The way I've implemented SSE is to make use of the fact it can also act like HTTP long-polling when the GET request is initially opened. The SSE events can be given timestamps or UUIDs and then subsequent requests can include the last received ID or the time of the last received event, and request the SSE endpoint replay events up until the current time.

        You could also add a ping with a client-requestable interval, e.g. 30 seconds (for foreground app) and 5 minutes or never (for backgrounded app), so the TCP connection is less frequently going to cause wake events when the device is idle. As client, you can close and reopen your connection when you choose, if you think the TCP connection is dead on the other side or you want to reopen it with a new ping interval.

        Tradeoff of `?lastEventId=` - your SSE serving thing needs to keep a bit of state, like having a circular buffer of up to X hours worth of events. Depending on what you're doing, that may scale badly - like if your SSE endpoint is multiple processes behind a round-robin load balancer... But that's a problem outside of whether you're choosing to use SSE, websockets or something else.
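
The `?lastEventId=` replay scheme with a bounded buffer might look like this sketch (sizes and names are illustrative, and a real server would need to signal a full resync once an id has fallen off the buffer):

```javascript
// Bounded event history for Last-Event-ID style replay.
class EventHistory {
  constructor(capacity = 1000) {
    this.capacity = capacity;
    this.events = []; // [{id, data}], oldest first
    this.nextId = 1;
  }
  push(data) {
    const evt = { id: this.nextId++, data };
    this.events.push(evt);
    if (this.events.length > this.capacity) this.events.shift(); // drop oldest
    return evt;
  }
  // Everything the client missed since lastId. If lastId already fell off
  // the buffer, this silently returns only what is still held.
  since(lastId) {
    return this.events.filter(e => e.id > lastId);
  }
}
```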

        To be honest, if you're worrying about mobile drain, the most battery efficient thing I think anyone can do is admit defeat and use one of the vendor locked-in things like firebase (GCM?) or apple's equivalent notification things: they are using protocols which are more lightweight than HTTP (last I checked they use XMPP same as whatsapp?), can punch through firewalls fairly reliably, batch notifications from many apps together so as to not wake devices too regularly, etc etc...

        Having every app keep their own individual connections open to receive live events from their own APIs sucks battery in general, regardless of SSE or websockets being used.

    • sabareesh 2 days ago

      Yeah, with Cloudflare you need to do it every 30 seconds, as the timeout is 60 seconds.

      • loloquwowndueo 2 days ago

        Then why not do it every 59 seconds :)

        • virtue3 2 days ago

          You’d probably want to do it every 29 seconds in case a ping fails to send/deliver.

  • Xenoamorphous 2 days ago

    I also used SSE 6 or so years ago, and had the same issue with our load balancer. A bit hacky, but what I did was set a timer that would periodically send a single colon character (which is the comment delimiter, IIRC) to the client. Is that what you meant by "ping"?

apitman 2 days ago

> Perceived Limitations: The unidirectional nature might seem restrictive, though it's often sufficient for many use cases

For my use cases the main limitations of SSE are:

1. Text-only, so if you want to do binary you need to do something like base64

2. Browser connection limits for HTTP/1.1, i.e. you can only have ~6 connections per domain[0]

Connection limits aren't a problem as long as you use HTTP/2+.

Even so, I don't think I would reach for SSE these days. For less latency-sensitive and data-use sensitive applications, I would just use long polling.

For things that are more performance-sensitive, I would probably use fetch with ReadableStream body responses. On the server side I would prefix each message with a 32bit integer (or maybe a variable length int of some sort) that gives the size of the message. This is far more flexible (by allowing binary data), and has less overhead compared to SSE, which requires 7 bytes ("data:" + "\n\n") of overhead for each message.

[0]: https://stackoverflow.com/a/985704
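
A sketch of that length-prefixed framing, assuming a big-endian 32-bit length followed by the payload; the decoder is incremental, since fetch hands you arbitrary chunk boundaries:

```javascript
// Frame = 4-byte big-endian length + payload bytes.
function encodeFrame(payload /* Uint8Array */) {
  const out = new Uint8Array(4 + payload.length);
  new DataView(out.buffer).setUint32(0, payload.length); // big-endian by default
  out.set(payload, 4);
  return out;
}

// Feed arbitrary chunks in; get back an array of complete frames.
function makeFrameDecoder() {
  let buf = new Uint8Array(0);
  return function feed(chunk) {
    const merged = new Uint8Array(buf.length + chunk.length);
    merged.set(buf);
    merged.set(chunk, buf.length);
    buf = merged;
    const frames = [];
    while (buf.length >= 4) {
      const len = new DataView(buf.buffer, buf.byteOffset).getUint32(0);
      if (buf.length < 4 + len) break; // wait for more bytes
      frames.push(buf.slice(4, 4 + len));
      buf = buf.slice(4 + len);
    }
    return frames;
  };
}
```

On the client you would pump `response.body.getReader()` chunks through `feed`; the same decoder works for any byte stream.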

  • nhumrich 2 days ago

    ReadableStream appears to be SSE without any defined standards for chunk separation. In practice, how is it any different from using SSE? It appears to use the same concept.

    • tomsmeding 2 days ago

      Presumably, ReadableStream does not auto-reconnect.

Tiberium 2 days ago

One thing I dislike with regard to SSE, which is not its fault but probably a side effect of its perceived simplicity: lots of developers do not actually use proper implementations and instead just parse the data chunks with a regex, or something of the sort! This is bad because SSE, for example, supports comments (": text") in streams, which most of those hand-rolled implementations don't handle.

For example, my friend used an LLM proxy that sends keepalive/queue data as SSE comments (just for debugging mainly), but it didn't work for Gemini, because someone at Google decided to parse SSE with a regex: https://github.com/google-gemini/generative-ai-js/blob/main/... (and yes, if the regex doesn't match the complete line, the library will just throw an error)
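
For reference, a spec-shaped parser doesn't need much more code than a regex. This sketch processes the stream line by line, ignores comment lines, and accumulates multi-line `data:` fields, the two cases hand-rolled parsers usually miss:

```javascript
// Minimal incremental SSE parser (sketch, not a full spec implementation).
function makeSseParser(onEvent) {
  let buffer = '';
  let data = [], event, id;
  return function feed(chunk) {
    buffer += chunk;
    let i;
    while ((i = buffer.indexOf('\n')) !== -1) {
      const line = buffer.slice(0, i).replace(/\r$/, '');
      buffer = buffer.slice(i + 1);
      if (line === '') { // blank line: dispatch the accumulated event
        if (data.length) onEvent({ event, id, data: data.join('\n') });
        data = []; event = undefined; id = undefined;
      } else if (line.startsWith(':')) {
        // comment / keepalive line: ignored per spec
      } else {
        const colon = line.indexOf(':');
        const field = colon === -1 ? line : line.slice(0, colon);
        let value = colon === -1 ? '' : line.slice(colon + 1);
        if (value.startsWith(' ')) value = value.slice(1); // strip one leading space
        if (field === 'data') data.push(value);
        else if (field === 'event') event = value;
        else if (field === 'id') id = value;
      }
    }
  };
}
```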

deaf_coder 2 days ago

The part where it says:

> SSE works seamlessly with existing HTTP infrastructure:

I'd be careful with that assumption. I have tried using SSE through a 3rd-party load balancer at my work, and it didn't work that well. Because SSE is long-lived and doesn't normally close immediately, this load balancer kept collecting bytes from the server and didn't forward them until the server closed the connection, effectively making SSE useless. I had to use WebSockets instead to get around this limitation of the load balancer.

  • jpc0 2 days ago

    I had a similar issue at one point but if I remember correctly I just had to have my webserver send the header section without closing the connection.

    Usually things would just get streamed through but for some reason until the full header was sent the proxy didn't forward and didn't acknowledge the connection.

    Not saying that is your issue but definitely was mine.

    • sudhirj 2 days ago

      Not entirely. If a load balancer is set to buffer say 4kb of data all the time, your SSE is stuck until you close the connection.

      I think there is a HTTP/2 flush instruction, but no load balancer is obligated to handle it and your SSE library might not be flushing anyway.

      • deaf_coder 2 days ago

        In my case with this load balancer, I think it's just badly written. I think it is set to hold ALL data until the server ends the connection. I have tried leaving my SSE open to send over a few megabytes worth of data and the load balancer never forwarded it at all until I commanded the server to close the connection.

        The dev who wrote that code probably didn't think too much about memory efficiency of proxying HTTP connections or case of streaming HTTP connections like SSE.

  • motorest 2 days ago

    > SSE works seamlessly with existing HTTP infrastructure:

    To stress how important it is to correct this error: even Mozilla's introductory page on server-sent events prominently displays a big red warning box stating that SSE suffers from the browser's hard limit on open connections when not used over HTTP/2.

    https://developer.mozilla.org/en-US/docs/Web/API/Server-sent...

    Edit: I just saw the issue pointed out further down in the discussion

    https://news.ycombinator.com/item?id=42511562

  • Igor_Wiwi 2 days ago

    thanks, updated the article with your comment

recursivedoubts 2 days ago

https://data-star.dev is a hypermedia-oriented front end library built entirely around the idea of streaming hypermedia responses via SSE.

It was developed using Go & NATS as backend technologies, but works with any SSE implementation.

Worth checking out if you want to explore SSE and what can be achieved w/it more deeply. Here is an interview with the author:

https://www.youtube.com/watch?v=HbTFlUqELVc

  • andersmurphy 2 days ago

    +1 for recommending data-star. The combination of idiomorph (thank you), SSE and signals is fantastic for making push based and/or multiplayer hypermedia apps.

  • sudodevnull 2 days ago

    Datastar author here, happy to answer any questions!

RevEng 2 days ago

OpenAI's own documentation notes how difficult it is to work with SSE and says to just use their library instead. My team wrote our own parser for these streaming events from an OpenAI-compatible LLM server. The streaming format is awful. The double-newline block separator also shows up in a bunch of our text, making parsing a nightmare. The "data:" signifier is slightly better, but when working with scientific software it still occurs too often. Instead we've had to rely on the totally-not-reliable fact that the server returns each event as a separate packet and the receiving end can be set up to return each packet in the stream.

The suggestions I've found online for how to deal with the newline issue are to fold together consecutive newlines, but this loses formatting of some documents and otherwise means there is no way to transmit data verbatim. That might be fine for HTML or other text formats where newlines are pretty much optional, but it sucks for other data types.

I'm happy to have something like SSE but the protocol needs more time to cook.

  • skue 10 hours ago

    > I'm happy to have something like SSE but the protocol needs more time to cook.

    Just how well done do you like your protocols? SSE has been part of the WHATWG standard for almost 20 years.

    Every protocol requires some sort of data encoding. For SSE you need to either restrict yourself to payloads that can never conflict with the message structure (e.g. an enumeration of short strings to indicate different sorts of events), or you need to encode the data.

    It sounds like you are trying to send raw, unencoded data and are surprised that it sometimes conflicts with the message structure. Well of course it does! You can’t blame the protocol for that.
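
Concretely, the encoding the spec expects for newline-bearing payloads is field splitting: one `data:` line per segment, rejoined with `\n` on the client. A sketch (carriage returns are the one thing that still won't round-trip, since the spec treats a bare CR as a line ending too):

```javascript
// Encode an arbitrary (LF-only) text payload as SSE data lines.
function encodeSseData(payload) {
  return payload.split('\n').map(line => `data: ${line}`).join('\n') + '\n\n';
}

// The matching join, as EventSource does internally.
function decodeSseData(framed) {
  return framed
    .split('\n')
    .filter(line => line.startsWith('data:'))
    .map(line => line.slice(5).replace(/^ /, '')) // drop field name + one space
    .join('\n');
}
```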

    • RevEng 8 hours ago

      Every other protocol I've used has a standard way to encode arbitrary data, but especially text data, usually using some kind of escape sequence. SSE does not.

      Just because it has been around for a long time does not mean it is well thought out or complete.

hamandcheese 2 days ago

I tried implementing SSE in a web project of mine recently, and was very surprised when my website totally stopped working when I had more than 6 tabs open.

It turns out Firefox counts SSE connections against the six-connections-per-host limit, and gives absolutely no useful feedback that it's blocking subsequent requests due to this limit (I don't remember the precise error code and message anymore, but it left me clueless for a while). It was only when I stared at the lack of corresponding server-side logs that it clicked.

I don't know if this same problem happens with websockets or not.

ramon156 2 days ago

They're underrated when they work™

Currently at work I'm having issues because:

- Auth between an embedded app and JavaScript's EventSource is not working, so I have to resort to a Microsoft package which doesn't always work.
- Not every tunnel is fond of keep-alive (Cloudflare), so I had to switch to ngrok (until I found out they have a limit of 20k requests).

I know this isn't the protocol's fault, and I'm sure there's something I'm missing, but my god is it frustrating.

_caw 2 days ago

> SSE works seamlessly with existing HTTP infrastructure

This is false. SSE is not supported on many proxies, and isn't even supported on some common local proxy tooling.

anshumankmr 2 days ago

SSE is not underrated. In fact, it's used by OpenAI for streaming completions. It's just not always needed, unlike the very obvious use cases for normal REST APIs and WebSockets.

It was a pain to figure out how to get it to work in a ReactJS codebase I was working on at the time, and from what I remember Axios didn't support it then, so I had to use native fetch to get it to work.

  • jpc0 2 days ago

    How long ago was this?

    I seem to remember not having too many issues with useEffect and context for this.

    Maybe the issue is that you wanted to implement it in a single React component, when in reality you should treat it like any other state library: it's something long-lived that should live outside of React and pass data into React.

    • anshumankmr 2 days ago

      Pretty much a year and a half back (I think it was March 2023). We had a really complicated setup back then: since I couldn't put our OpenAI client key on the client side, I wrote an endpoint to call OpenAI's GPT-3.5 API and then send that back to the front end to get the "typewriter" effect they had on their frontend. It was quite broken back then, because sometimes random artifacts would pop up in responses, and some chunks arrived concatenated, requiring me to write some really convoluted deserializing logic.

cle 2 days ago

> Automatic Reconnection

I wouldn't characterize this as "automatic", you have to do a lot of manual work to support reconnection in most cases. You need to have a meaningful "event id" that can be resumed on a new connection/host somewhere else with the Last-Event-Id header. The plumbing for this event id is the trivial part IMO. The hard part is the server-side data synchronization, which is left as an exercise for the reader.

Also, God help you if your SSE APIs have side effects. If the API call is involved in a sequence of side-effecting steps then you'll enter a world of pain by using SSE. Use regular HTTP calls or WebSockets. (Mostly b/c there's no cancellation ack, so retries are often racy.)

schmichael 2 days ago

I’ve never understood the use of SSE over ndjson. Built-in browser support for SSE might be nice, but it seems fairly easy to handle ndjson, and for non-browser consumers ndjson is almost assuredly easier to handle. ndjson works over any transport, from HTTP/0.9 to HTTP/3 to raw TCP or Unix sockets, or any reliable transport protocol.
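
For comparison, an incremental ndjson reader is only a few lines (a sketch; the same logic works on browser fetch chunks or a raw socket):

```javascript
// ndjson: one JSON document per line. Feed arbitrary text chunks;
// complete lines are parsed, the trailing partial line is buffered.
function makeNdjsonReader(onRecord) {
  let buffer = '';
  return function feed(chunk) {
    buffer += chunk;
    const lines = buffer.split('\n');
    buffer = lines.pop(); // keep the incomplete tail for the next chunk
    for (const line of lines) {
      if (line.trim() !== '') onRecord(JSON.parse(line));
    }
  };
}
```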

  • apitman 2 days ago

    Manually streaming a XHR and parsing the messages is significantly more work, and you lose the built-in browser API. But if you use a fetch ReadableStream with TLV messages I'm sold.

junto 13 hours ago

This is why SignalR in .NET land is so popular in that ecosystem. It automatically falls back to SSE if web sockets aren’t available. In essence giving totally transparent real time communication.

programmarchy 2 days ago

Great post. I discovered SSE when building a chatbot and found out it’s what OpenAI used rather than WebSockets. The batteries-included automatic reconnection is huge, and the format is surprisingly human readable.

upghost 2 days ago

Does anyone have a good trick for figuring out when the client side connection is closed? I just kill the connection on the server every N minutes and force the client to reconnect, but it's not exactly graceful.

Secondly, on iOS mobile, I've noticed that the EventSource seems to fall asleep at some point and not wake up when you switch back to the PWA. Does anyone know what's up with that?

  • nhumrich 2 days ago

    The socket closes. Most languages bubble this back up to you as a connection-closed exception. In the Python async world, it would be a CancelledError.

    • upghost a day ago

      I haven't seen a library that does that yet, including Python. Usually you just throw messages into the void. Do you know of a specific library that does that?

      • nhumrich a day ago

        I use starlette + SSE

        • upghost a day ago

          Thanks, I will check that out. If they properly handle client disconnects I'll pull this in.

  • jesprenj 2 days ago

    Send a dummy event and see if you get an ACK in response. Depends on the library you're using.

    • upghost 2 days ago

      There's no ack on a raw SSE stream, unfortunately -- unless you mean send an event and expect the client to issue an HTTP request to the server like a keepalive?

      • jauco 2 days ago

        There should be an ACK on the TCP packet (IIRC it's not a literal ACK but something like it), and the server should handle a timeout on that as the connection being "closed", which can be returned to the connection opener.

        You might want to look into timeouts or error callbacks on your connection library/framework.

        • upghost 2 days ago

          Interesting, hadn't checked at the TCP level. Will need to look into that.

          • jauco 2 days ago

            I remembered wrong. In most circumstances a tcp connection will be gracefully terminated by sending a FIN message. The timeout I talked about is on an ACK for a keepalive message. So after x time of not receiving a keepalive message the connection is closed. This handles cases where a connection is ungracefully dropped.

            All this is done at the kernel level, so at the application level you should be able to just verify if the connection is open by trying a read from the socket.

            • upghost 2 days ago

              Thanks for clarifying, that would've sent me on a long wild goose chase. Most libraries only provide some sort of channel to send messages to. They generally do not indicate any FIN or ACK received at the TCP level.

              If anyone knows any library or framework in any language that solves this problem, I'd love to hear about it.
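
To illustrate what that kernel-level behavior looks like from application code, here is a self-contained toy using only the Python standard library (this is a raw TCP demo of the mechanism, not a real SSE server): the server pushes one SSE-framed event and then blocks on a read; when the client closes, the read returns an empty bytestring, which is the "connection closed" signal.

```python
import asyncio


async def handle(reader: asyncio.StreamReader,
                 writer: asyncio.StreamWriter) -> bool:
    # Push one SSE-framed event, then block on the read side: a
    # graceful client close (FIN) surfaces as a zero-length read.
    writer.write(b"data: hello\n\n")
    await writer.drain()
    data = await reader.read(1)
    client_gone = data == b""
    writer.close()
    await writer.wait_closed()
    return client_gone


async def main() -> bool:
    done = asyncio.Event()
    result = {}

    async def on_connect(reader, writer):
        result["client_gone"] = await handle(reader, writer)
        done.set()

    server = await asyncio.start_server(on_connect, "127.0.0.1", 0)
    port = server.sockets[0].getsockname()[1]

    # Simulated client: connect, read the one event, then close.
    reader, writer = await asyncio.open_connection("127.0.0.1", port)
    await reader.readexactly(len(b"data: hello\n\n"))
    writer.close()
    await writer.wait_closed()

    await asyncio.wait_for(done.wait(), timeout=5)
    server.close()
    await server.wait_closed()
    return result["client_gone"]


client_gone = asyncio.run(main())
print("server saw the disconnect:", client_gone)
```

A framework that surfaces disconnects is ultimately just propagating this read result (or a write failure) up to the handler.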

henning 2 days ago

They are handy for implementing simple ad-hoc hot reloading systems as well. E.g. you can have whatever file watcher you are using call an API whenever a file of interest changes, and that API sends an event to listening clients on the frontend. You can also trigger an event at boot time after restarting the backend following an API change. Then you just add a dev-only snippet to your base template that reloads the page or whatever. It's better than nothing if your stack doesn't support hot reload out of the box, doesn't take much code, and doesn't require any additional project dependencies. It's not as sophisticated as React environments that reload only the component that changed and do a full page refresh only when needed, but it still gives a nice, responsive feeling when paired with tools that recompile your backend when it changes.
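
As a rough illustration of the shape of this (standard library only; the paths, port, and event name are made up, and a real setup would want more error handling): a watcher POSTs to /notify, and every page holding /events open receives a reload event.

```python
import queue
import threading
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer

subscribers = []  # one Queue per connected client
subscribers_lock = threading.Lock()


class Handler(BaseHTTPRequestHandler):
    def do_GET(self):
        if self.path != "/events":
            self.send_error(404)
            return
        self.send_response(200)
        self.send_header("Content-Type", "text/event-stream")
        self.send_header("Cache-Control", "no-cache")
        self.end_headers()
        q = queue.Queue()
        with subscribers_lock:
            subscribers.append(q)
        try:
            while True:
                msg = q.get()  # block until a change is broadcast
                self.wfile.write(f"event: reload\ndata: {msg}\n\n".encode())
                self.wfile.flush()
        except (BrokenPipeError, ConnectionResetError):
            pass  # the tab was closed
        finally:
            with subscribers_lock:
                subscribers.remove(q)

    def do_POST(self):
        if self.path != "/notify":
            self.send_error(404)
            return
        with subscribers_lock:
            for q in subscribers:
                q.put("changed")
        self.send_response(204)
        self.end_headers()


def serve(port: int = 8000) -> ThreadingHTTPServer:
    # ThreadingHTTPServer gives each open SSE connection its own thread.
    server = ThreadingHTTPServer(("127.0.0.1", port), Handler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server
```

The dev-only snippet in the base template is then just a `new EventSource("/events")` with a listener on the reload event that calls `location.reload()`.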

aniketchopade 2 days ago

I had some trouble implementing this when the server has to wait for another endpoint (a webhook) to feed output to the browser. During request processing (thread 1) I had to store the SSE context in a native Java object, to be retrieved later when the webhook (thread 2) is called. But then, with multiple service instances, you wouldn't know which instance had stored it, so the webhook had to publish something that the others subscribe to.

yu3zhou4 2 days ago

I had no idea they existed until I began to use APIs serving LLM outputs. In my experience they work pretty well for that purpose. WebSockets are an alternative for this, I suppose.

ksajadi 2 days ago

I'm curious how everyone deals with the HTTP/2 requirement between the backend servers and the load balancer. By default, HTTP/2 requires TLS, which means either no SSL termination at the load balancer or an SSL cert generated per server, with a different one for the front-end load balancer. This all seems very inefficient.

  • kcb 2 days ago

    Not sure how widespread this is but AWS load balancers don't validate the backend cert in any way. So I just generate some random self signed cert and use it everywhere.

  • nhumrich 2 days ago

    You don't need http2 on the actual backend. All limitations for SSE/http1 are browser level. Just downgrade to http1 from the LB to backend, even without SSL. As long as LB to browser is http2 you should be fine.

    • ksajadi 2 days ago

      Isn't that going to affect the multiplexing of multiple SSE connections?

      • nhumrich a day ago

        No. That's all handled in the browser and load balancer.

      • seabrookmx 2 days ago

        I don't think so?

        HTTP 1.1 still supports Keep-Alive.

fitsumbelay 2 days ago

Finding use cases for SSE and reading about others doing the same brings me great joy. Very easy to set up -- you just set 2 or 3 response headers and off you go.

I have a hard time imagining the tech's limits outside of testing scenarios, so some of the examples brought up here are interesting.

benterix 2 days ago

The topic is interesting but the ChatGPT style of presenting information as bullet points is tiring.

Tiberium 2 days ago

Also, another day, another mostly AI-written article on HN's top page :)

  • emmanueloga_ 2 days ago

    It’s funny how HN has a mix of people who think AGI is just around the corner, people trying to build/sell stuff that uses LLMs, and others who can’t stand LLM-generated content. Makes me wonder how much overlap there is between these groups.

    • remram 2 days ago

      Those are not incompatible positions at all. You can think great AI is around the corner and still dislike today's not-great AI writing.

    • Tiberium 2 days ago

      I don't have anything against LLMs (I use them daily myself), but publishing content that's largely AI-generated without a disclaimer just feels dishonest to me. It also bothers me when people don't spend at least some effort to make the style more natural, e.g. those bullet point lists in the article that Claude loves so much.

  • slow_typist 2 days ago

    What makes you think the article is AI-written?

    • Tiberium 2 days ago

      I've just spent too much time with different LLMs, and for example Claude really loves such bullet point lists. The article is full of them.

      The whole structure with numbered sections really gives it away, humans don't write blog posts like that.

      • abnercoimbre a day ago

        I remember reading the post and feeling it was inorganic.

        After your comment I re-read everything and you're correct: this is a very common template for Claude type answers.

        LLM-generated comments on HN are against the guidelines. Would the same apply to blog post submissions?

        • slow_typist 17 hours ago

          Well if it applied, someone should train a model that tells LLM and human content apart for automated rejection.

whatever1 2 days ago

Can Django with vanilla gunicorn do this?

  • mfalcao 2 days ago

    Yes, I’ve done it using StreamingHttpResponse. You’ll want to use an asynchronous worker type though.

crowdyriver 2 days ago

http streaming is even more underrated.

lakomen 2 days ago

No they're not. They're limited to 6 connections per browser per domain on HTTP/1.1. That matters because nginx can't reverse proxy HTTP/2 or higher, so you end up with very weird behavior; essentially you can't use nginx with SSE.

https://developer.mozilla.org/en-US/docs/Web/API/Server-sent...

Edit: I see someone already posted about that

condiment 2 days ago

So it’s WebSockets, only instead of the web server needing to handle the protocol upgrade, you just piggyback on HTTP with an in-band protocol.

I’m not sure this makes sense in 2024. Pretty much every web server supports WebSockets at this point, and so do all of the browsers. You can easily impose the constraint on your code that communication through a WebSocket is unidirectional. And the capability to broadcast a message to all subscribers is going to be deceptively complex, no matter how you broadcast it.

  • realPubkey 2 days ago

    Yes, most servers support WebSockets. But unfortunately many proxies and firewalls do not, especially in big company networks. Suggesting that my users use SSE for my database replication stream solved most of their problems. Also, setting up an SSE endpoint is like 5 lines of code, whereas WebSockets require much more, and you also have to do things like pings to ensure automatic reconnection. SSE with the JavaScript EventSource API has all you need built in:

    https://rxdb.info/articles/websockets-sse-polling-webrtc-web...

    • the_mitsuhiko 2 days ago

      SSE also works well on HTTP/3 whereas web sockets still don’t.

      • apitman 2 days ago

        I don't see much point in WebSockets for HTTP/3. WebTransport will cover everything you would need it for, and more.

        • the_mitsuhiko 2 days ago

          That might very well be but the future is not today.

          • apitman 2 days ago

            But why add it to HTTP/3 at all? HTTP/1.1 hijacking is a pretty simple process. I suspect HTTP/3 would be significantly more complicated. I'm not sure that effort is worth it when WebTransport will make it obsolete.

            • the_mitsuhiko 2 days ago

              It was added to HTTP/2 as well and there is an RFC. (Though a lot of servers don’t support it even on HTTP/2)

              My point is mostly that SSE works well and is supported, and that is a meaningful benefit today.

            • leni536 2 days ago

              To have multiple independent websocket streams, without ordering requirements between streams.

    • deaf_coder 2 days ago

      Going slightly off on a tangent here: do FaaS cloud providers like AWS, Cloudflare, etc. support SSE?

      Last time I checked, they don't really support it.