Shipping Multi-Tenant SaaS Using Postgres Row-Level Security

254 pointsby capikialmost 3 years ago

22 comments

mkurzalmost 3 years ago

Be aware when using RLS with views: By default the RLS policy will be executed with the permissions of the owner of the view instead with the permissions of the user executing the current query. This way it can easily happen that the RLS policy will be bypassed because the owner of the view is a admin account or the same account that owns the underlying table (see the the gotchas section of the original post).However, upcoming PostgreSQL 15 adds support for security invoker views: <a href="https://github.com/postgres/postgres/commit/7faa5fc84bf46ea6c543993cffb8be64dff60d25" rel="nofollow">https://github.com/postgres/postgres/commit/7faa5fc84bf46ea6...</a> That means you can then define the security_invoker attribute when creating a view and this "... causes the underlying base relations to be checked against the privileges of the user of the view rather than the view owner" (see <a href="https://www.postgresql.org/docs/15/sql-createview.html" rel="nofollow">https://www.postgresql.org/docs/15/sql-createview.html</a>) PG15 beta 1 release notes: <a href="https://www.postgresql.org/about/news/postgresql-15-beta-1-released-2453/" rel="nofollow">https://www.postgresql.org/about/news/postgresql-15-beta-1-r...</a>

评论 #32245314 未加载

评论 #32246224 未加载

bearjawsalmost 3 years ago

This is such a killer feature in PG, my new job uses it and it makes audits of our tenancy model dead simple.Coming from a SaaS company that used MySQL, we would get asked by some customers how we guarantee we segmented their data, and it always ended at the app layer. One customer (A fortune 10 company) asked if we could switch to SQL Server to get this feature...Our largest customers ask how we do database multi-tenant and we point to our SDLC + PG docs and they go 'K'.

评论 #32242789 未加载

评论 #32242777 未加载

评论 #32245160 未加载

评论 #32247247 未加载

simonwalmost 3 years ago

I don't fully understand the performance implications here.Say I was using this for a blog engine, and I wanted to run this SQL query:<pre><code> select * from entries; </code></pre> But I actually only want to get back entries that my current user is allowed to view - where author_id = 57 for example.Would PostgreSQL automatically turn the above query into the equivalent of this:<pre><code> select * from entries where author_id = 57; </code></pre> And hence run quickly (assuming there's an index on that author_id column)?Or would it need to run an additional SQL query check for every single row returned by my query to check row permissions, adding up to a lot of extra overhead?

评论 #32243159 未加载

lmeyerovalmost 3 years ago

We were looking at RLS, various ABAC integrated frameworks (casbin, ..), and zanzibar clones late last year --* RLS is super appealing. Long-term, the architecture just makes so much more sense than bringing in additional maintenance/security/perf/etc burdens. So over time, I expect it to hollow out how much the others need to do, reducing them just to developer experience & tools (policy analysis, db log auditing, ...). Short-term, I'd only use it for simple internal projects because cross-tenant sharing is so useful in so many domains (esp if growing a business), and for now, RLS seems full of perf/expressivity/etc. footguns. So I wouldn't use for a SaaS unless something severely distinct tenant like payroll, and even then, I'd have a lot of operational questions before jumping in.* For the needed flexibility and app layer controls, we took the middle of casbin, though others tools emerging to. Unlike the zanzibar style tools that bring another DB + runtime + ..., casbin's system of record is our existing system of record. Using it is more like a regular library call than growing the dumpster fire that is most distributed systems. Database backups, maintenance, migrations, etc are business as usual, no need to introduce more PITAs here, and especially not a vendor-in-the-middle with proprietary API protocols that we're stuck with ~forever as a dependency.* A separate managed service might make zanzibar-style OK in some cases. One aspect is ensuring the use case won't suffer the view problem. From there, it just comes down to governance & risk. Auth0 being bought by Okta means we kind of know what it'll look like for awhile, and big cloud providers have growing identity services, which may be fine for folks. Startup-of-the-month owning parts of your control plane is scarier to me: if they get hacked, go out of business, get acquired by EvilCorp or raise $100M in VC and jack up prices, etc.There's a lot of innovation to do here. A super-RLS postgres startup is on my list of easily growable ideas :)On a related note: We're doing a bunch of analytics work on how to look at internal+customer auth logs -- viz, anomaly detection, and supervised behavioral AI -- so if folks are into things like looking into account take overs & privilege escalations / access abuse / fraud in their own logs, would love to chat!

jzelinskiealmost 3 years ago

As the developer of an external authorization system (full disclosure)[0], I feel obligated to chime in the critiques of external authorization systems in this article. I don't think they're far off base, as we do recommend RLS for use cases like what the article covers, but anyways, here's my two cents:1+2: Cost + Unnecessary complexity: this argument can be used against anything that doesn't fit the given use case. There's no silver bullet for any choice of solution. You should only adopt the solution that makes the most sense for you and vendors should be candid about when they wouldn't recommend adopting their solution -- it'd be bad for both the users and reputation of the solution.3: External dependencies: That depends on the toolchain. Integration testing against SpiceDB is easier than Postgres, IMO [1]. SpiceDB integration tests can run fully parallelized and can also model check your schema so that you're certain there are no flaws in your design. In practice, I haven't seen folks write tests to assert that their assumptions about RLS are maintained over time. The last place you want invariants to drift is authorization code.4: Multi-tenancy is core to our product: I'm not sure I'm steel-manning this point, but I'll do my best. Most companies do not employ authorization experts and solutions worth their salt should support modeling multi-tenant use cases in a safe way. SpiceDB has a schema language with idioms and recommendations to implement functionality like multi-tenancy, but still leaves it in the hands of developers to construct the abstraction that matches their domain[2].[0]: <a href="https://github.com/authzed/spicedb" rel="nofollow">https://github.com/authzed/spicedb</a>[1]: <a href="https://github.com/authzed/examples/tree/main/integration-testing" rel="nofollow">https://github.com/authzed/examples/tree/main/integration-te...</a>[2]: <a href="https://docs.authzed.com/guides/schema" rel="nofollow">https://docs.authzed.com/guides/schema</a>

评论 #32244299 未加载

shaicolemanalmost 3 years ago

We're currently using the schema-per-tenant, and it's working very well for us:* No extra operational overhead, it's just one database* Allows to delete a single schema, useful for GDPR compliance* Allows to easily backup/restore a single schema* Easier to view and reason about the data from an admin point of view* An issue in a single tenant doesn't affect other tenants* Downtime for maintenance is shorter (e.g. database migration, non-concurrent REINDEX, VACUUM FULL, etc.)* Less chance of deadlocks, locking for updates, etc.* Allows easier testing and development by subsetting tenants data* Smaller indexes, more efficient joins, faster table scans, more optimal query plans, etc. With row level security, every index needs to be a compound index* Easy path to sharding per tenant if needed. Just move some schemas to a different DB* Allows to have shared data and per-tenant data on the same database. That doesn't work with the tenant-per-database approachThere are a few cons, but they are pretty minor compared to the alternative approaches:* A bit more code to deal in the tenancy, migrations, etc. We opted to write our own code rather than use an existing solution* A bit more hassle when dealing with PostgreSQL extensions . It's best to install extensions into a separate extensions schema* Possible caching bugs so you need to namespace the cache, and clear the query cache when switching tenant* The security guarantees of per tenant solution aren't perfect, so you need to ensure you have no SQL injection vulnerabilities

评论 #32244793 未加载

评论 #32243839 未加载

andy_pppalmost 3 years ago

I find adding loads of stuff to Postgres exciting and fun, but I want all of my logic in the code in GitHub, rather that floating around in my global data store. Has anyone thought about a data layer that allows you to define this stuff programmatically rather than in SQL but then it configures your data layer to work like this. Not necessarily an ORM but more a business logic layer that compiles everything down to use features like this. Or maybe even a data layer that is a set of programmatic building blocks that works as described?

评论 #32245733 未加载

评论 #32245694 未加载

评论 #32267242 未加载

评论 #32247369 未加载

uhoh-itsmaciekalmost 3 years ago

>Another issue we caught during testing was that some requests were being authorized with a previous request’s user id.This is the terrifying part about RLS to me: having to rely on managing the user id as part of the database connection session seems like an easy way to shoot yourself in the foot (especially when combined with connection pooling). Adding WHERE clauses everywhere isn't great, but at least it's explicit.That said, I've never used RLS, and I am pretty curious: it does seem like a great solution other than that one gotcha.

评论 #32245765 未加载

sgarmanalmost 3 years ago

Am I right in my understanding that EVERY request that comes in to their api creates a new connection to the database? What about reusing connections with connection pools or one level up using pgbouncer or thing. Can you actually use RLS while reusing connections?

评论 #32243682 未加载

评论 #32243587 未加载

评论 #32243516 未加载

ishanralmost 3 years ago

I use RLS quite heavily for my app Sudopad (<a href="https://sudopad.com" rel="nofollow">https://sudopad.com</a>) and it has been working quite well so far.One gotcha specific to Supabase (where I run the backend) is because there is no anonymous login in Supabase, turning on RLS and using database functions marked as security definers are the way to go. Otherwise there is no easy way of stopping a 'select * from x' since some rows might not have a user_id if they are anonymous and I still want people to access the row if they know a specific primary key uuid.

mglalmost 3 years ago

Row-level security is always a tricky and hard to enforce assumption as this not how we relational databases really.Much bigger fan of the approach described here:Scalability, Allocation, and Processing of Data for Multitenancy<a href="https://stratoflow.com/data-scalability-allocation-processing-multitenancy/" rel="nofollow">https://stratoflow.com/data-scalability-allocation-processin...</a>

fswdalmost 3 years ago

I use this for a startup in a re-write of their solution. It simplifies my queries and mutations, and security concerns. It also drammatically reduces the complexity of my code. There's also ROLES (Guest/Public user, Authenticated, Admin) and combinding the roles with Row Level Security.I like it so much I don't want to go back!

ei8thsalmost 3 years ago

I needed this two years ago, i was looking at this but couldn't figure out how to do it with a existing db connection pool to reuse connections. I might be migrating to this soon so that things will be more isolated from the tenants.

a-dubalmost 3 years ago

this is cool. next up, row level encryption with private information retrieval methods for enabling queries and searches homomorphically (on data encrypted by the client that the service provider never has a key for).

评论 #32247079 未加载

xsrealityalmost 3 years ago

Curious how advanced authorization (like ABAC) can be implemented with RLS. For example returning resources that are accessible to the team I belong to within the tenant.

spacemanmattalmost 3 years ago

If I were leaning into RLS today I would do it through PostgREST

评论 #32244848 未加载

andrewstuartalmost 3 years ago

I once implemented RLS/Postgres for Django.It worked pretty well.The basic mechanism was to intercept all outbound SQL queries and wrap them in postgres environment variables that set up the RLS.

santa_boyalmost 3 years ago

This is awesome. Are there any similar features that can be implemented with Mariadb? One of my favorite products is integrated with Mariadb.

kache_almost 3 years ago

Context aware data access is really cool. And hard :)

jtwebmanalmost 3 years ago

What about stop using ORM abstractions as an option then it is much harder to forget needed filters?

nbevansalmost 3 years ago

Using RLS to implement multi-tenancy is a terrible idea. Just deploy a database per tenant. It's not hard. Why overcomplicate it?

评论 #32243262 未加载

评论 #32243396 未加载

评论 #32245174 未加载

评论 #32245682 未加载

评论 #32244165 未加载

paxysalmost 3 years ago

Is having to write "SELECT [...] WHERE user_id=<123>" really considered a security hole? Isn't that how like every service in existence operates? Coming up with complicated auth systems and patterns just because you are scared you will accidentally skip that WHERE clause seems bizarre to me.

评论 #32243747 未加载

评论 #32245007 未加载

评论 #32243927 未加载

评论 #32243705 未加载

评论 #32243774 未加载

评论 #32244179 未加载

评论 #32243700 未加载