I'm not sure what qualifies as "stateful", but

> *Fewer dependencies to fail and take down the service.*

No logging? No metrics? No monitoring? (And *yes*, you'd think those shouldn't take down the stack if they went offline. I'd agree. And yet, I've witnessed that failure mode multiple times. In one, a call to Sentry was synchronous & a hard-fail, so when Sentry went down, that service 500'd. In another, syslog couldn't push logs out to the logging service, as that was *very* down, having been inadvertently deleted by someone who ran "terraform apply", didn't read the plan, & then said "make it so"; syslog then responded to the logging service being down by logging that error to a local file. Repeatedly. As fast as it possibly could. The disk fills. The disk is full. The service fails.)

I've also seen our alerting provider have an outage *during an outage we were having*, & thus not send pages for our outage, leaving me to wonder how I'd just rolled a 1 on the SRE d20 and which god I'd angered. Also, who watches the watchmen?

> *A common pitfall is to introduce something like ElasticSearch, only to realize a few months later that no one knows how to run it.*

Yeah, I've seen that exact pit fallen into.

No DNS? A global Cloudflare outage == fun.

No certificates?

I've seen certs fail in so many different ways. Not getting renewed, of course; that's your table-stakes "welcome to certs!" failure mode. Certs get renewed, but an *allegedly semver-compatible* upgrade changed the defaults, required extensions didn't get included, and the client rejected the cert. I've seen a service which watches certs to make sure they don't expire (see the outage earlier in this paragraph!) have an outage (which, b/c it's monitoring, wasn't customer-visible) because a tool issued a malformed cert (…by… default…) that the monitor failed to parse (as it was malformed). Oh, and then the LE cross-signing expiration took out an Azure service that wasn't ready for it, a service from a third party of ours that wasn't ready for it, *and* our CI system, b/c several tools were out of date, including *an up-to-date system on Debian that was theoretically "supported"*… but still shipped an ancient crypto library riddled with bugs in its path validation.

> *Okay fine, S3 too, but that's a different animal.*

*Is it?* I've seen S3 have outages too, & bring a service down with it. (There really wasn't a choice there; S3 was the service's backing store, & without it, the service was truly screwed.)

But of course, all this is to say I violently agree with the article's core point: think carefully about each dependency, as each one carries a very real production cost.

(I've recently been considering changing my title to SRE because I have done very little in the way of SWE recently…)
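
(For what it's worth, the Sentry failure above has a pretty mechanical fix: make the reporting path fail open so the error reporter being down can never become *your* outage. A minimal sketch of that shape in Python; report_error, dsn_url, and the timeout value are made up for illustration, not the actual code involved:)

    import logging
    import requests

    log = logging.getLogger(__name__)
    REPORT_TIMEOUT = 0.5  # seconds; never block a request on the error reporter

    def report_error(payload: dict, dsn_url: str) -> None:
        # Hypothetical wrapper: bounded by a timeout, and any failure to reach
        # the reporting service is swallowed and noted locally instead of
        # propagating back into the request path.
        try:
            requests.post(dsn_url, json=payload, timeout=REPORT_TIMEOUT)
        except requests.RequestException:
            log.warning("error reporter unreachable; dropping event", exc_info=True)

(Better still is handing the event to a background queue, but the load-bearing part is the same: timeout plus fail-open.)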