It’s a great article exploring the idea, but the premise and arguments leading to it are somewhat weak, imo. First, views aren’t “fragile”. I may be wrong here, but it feels like the article tries to squeeze that claim in along with some abstract ORM issues.<p>Second, “anti-pattern” is a very technical rating of this phenomenon. Business logic and its databases may contain data that is deletable, conditionally deletable, or never deletable, all at the same time (and in the same table). Soft deletion isn’t a blanket rule. For example, you hard-delete generated records because they are a cached projection of higher-level data. You may hard-delete drafts the way you trash your notes, but you never hard-delete a signed contract/declaration.
The main problem I have is that the article takes a performance/development lens to soft deletes, and only pays lip service to the objectives you're trading performance off against ... namely data retention / disaster recovery / audit requirements.<p>* availability / recovery - soft deletes provide the best RPO/RTO in archival / lifecycle planning<p>* auditability / compliance - much easier to achieve with 1 system than with 2 or 3<p>* security - see above<p>You certainly can achieve these objectives with CDC / snapshotting / warehousing / archival practices, but the soft delete pattern has its place at the application layer <i>in spite of performance</i>, which the article only begrudgingly acknowledges.
How about a separate, schema-wise identical "deleted_x" table that you "move" deleted entities to? Can't get much more explicit than that, and it still enables whatever joins you'd like on historical deleted data.
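A minimal sketch of that, assuming Postgres and a hypothetical users table:<p><pre><code>-- mirror the live table's schema, including indexes and constraints
CREATE TABLE deleted_users (LIKE users INCLUDING ALL);

-- "delete" = move the row atomically in one statement
WITH removed AS (
  DELETE FROM users WHERE id = $1 RETURNING *
)
INSERT INTO deleted_users SELECT * FROM removed;</code></pre><p>One caveat: foreign keys pointing at users won't follow the moved row, so joins on historical data have to target deleted_users explicitly.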
Sigh, as always in tech, the answer to "is soft delete appropriate" is - "it depends".<p>Do you want to support reversible deletion in the business logic sense? Soft delete is a trivial way to do this.<p>Do you want to support business logic deletion in a normalised schema while retaining other records that relate to that entity for auditing requirements? Probably worth looking into soft delete first.<p>Of course at large entity counts, soft delete can impact performance, but that's usually a rather large entity count, and then you can start considering approaches like a delete log or denormalisation.<p>Afraid of throwing away data you worry you might need later but don't have an existing use case for right now? There are better ways to data hoard, and you should regularly analyse how often that hoarded data is actually accessed before your data lake turns into a data swamp.
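For reference, the trivial version is just a nullable timestamp column plus, in Postgres at least, a partial index so queries over live rows don't pay for the dead ones (orders and its columns are hypothetical here):<p><pre><code>ALTER TABLE orders ADD COLUMN deleted_at timestamptz;

-- "delete" and its trivial reversal
UPDATE orders SET deleted_at = now() WHERE id = $1;
UPDATE orders SET deleted_at = NULL  WHERE id = $1;

-- index only the live rows so their queries stay fast
CREATE INDEX orders_live_idx ON orders (customer_id)
  WHERE deleted_at IS NULL;</code></pre>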
I am surprised the article didn’t mention the obvious fix for soft deletes’ potential performance issues - have a job run regularly that archives soft-deleted data older than X units of time.<p>This allows for undoing a soft delete while still getting rid of soft-deleted rows eventually.
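E.g. something along these lines as a scheduled job, sketched in Postgres with an arbitrary 90-day cutoff and a pre-created orders_archive table:<p><pre><code>-- move soft-deleted rows older than 90 days to the archive
WITH expired AS (
  DELETE FROM orders
  WHERE deleted_at < now() - interval '90 days'
  RETURNING *
)
INSERT INTO orders_archive SELECT * FROM expired;</code></pre>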
“Soft-Delete pattern (deleted_at column) or any other pattern adding $event_at column to a DB table, contains an ad hoc, informally-specified, bug-ridden, slow implementation of half of Event Sourcing.”<p>— Greenspun's tenth rule of programming
Soft deletion is not an anti-pattern. In real software you need the ability to delete items while they still exist in historical records, for analytics, historical data integrity, etc.<p>Soft delete is the only way to make this possible without horrible kludges.
I remember how easy it used to be to drop an entire Firestore collection with one click. Yes, when deleting your production data is one click away (the delete button was right next to the filter button!), it’s very natural to be afraid. Thankfully Google has improved a lot of these interfaces with a deletion confirmation prompt, but can you see where the fear originates?
Isn't soft-delete just a variant of having a lifecycle? The article tries to distinguish it by saying that the lifecycle pattern is implemented at the app-layer instead of the database layer, but isn't their criticism of soft-delete that the app-layer has to deal with it?<p>Maybe a better recommendation is to give guidelines for implementing soft-delete?
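One such guideline might be to model deletion as an explicit lifecycle state the app already understands, rather than a special flag every query has to remember. A sketch in Postgres, with hypothetical names:<p><pre><code>CREATE TYPE record_state AS ENUM ('active', 'archived', 'deleted');

CREATE TABLE documents (
  id    bigserial PRIMARY KEY,
  body  text NOT NULL,
  state record_state NOT NULL DEFAULT 'active'
);

-- "deletion" is just another state transition
UPDATE documents SET state = 'deleted' WHERE id = $1;</code></pre>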
The choices are:<p>A. Move deleted data to (an)other table(s): users, deleted_users<p>B. Read from a scope, view, or materialized view but update a raw table: deleted bool or deleted_at datetime<p>C. Sprinkle conditionals everywhere live data is desired: deleted bool or deleted_at datetime<p>There is no one "the way" for all use-cases.
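Option B, sketched in Postgres with hypothetical names, at least keeps the conditional in exactly one place:<p><pre><code>-- the raw table keeps everything
CREATE TABLE users_raw (
  id         bigserial PRIMARY KEY,
  email      text NOT NULL,
  deleted_at timestamptz
);

-- reads go through a view that hides deleted rows
CREATE VIEW users AS
  SELECT id, email FROM users_raw
  WHERE deleted_at IS NULL;</code></pre>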
For various reasons I was involved a lot with databases in the past 5 years (more than usual), and I don't remember seeing soft delete implemented anywhere, but lifecycle is used almost everywhere. I think soft delete may be an indicator of bad design.
We make a B2B application that's installed on-prem for a lot of customers.<p>We do hard deletes on most things, mainly due to legacy reasons, and almost every week we get a request to restore data that a user deleted but later realized they needed.<p>And quite often the user realizes this after a week or more, at which point the only option is for the user to ask their IT to restore the DB from a backup so we can extract the data from there.<p>So adding soft deletes to the tables users commonly do mistaken deletes from is something we're planning on doing.<p>I don't see how the alternatives given in the article would work for us. For example, few of our customers even have a data warehouse. Our current DB doesn't support temporal tables, though we are migrating to MSSQL, which does, so that might be an option soon. Though it's unclear how well that works with 3-4 levels of child tables, which would also need to be temporal, especially since we need to do hard deletes due to GDPR etc., and we have customers who work 24/7, so regular downtime is not tolerated. And users will have active locks against these key tables; not sure how that'll work out with the schema changes needed for hard deletes.
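For what it's worth, the MSSQL temporal setup we're eyeing looks roughly like this (table names hypothetical):<p><pre><code>CREATE TABLE dbo.Contracts (
  Id        int IDENTITY PRIMARY KEY,
  Body      nvarchar(max) NOT NULL,
  ValidFrom datetime2 GENERATED ALWAYS AS ROW START NOT NULL,
  ValidTo   datetime2 GENERATED ALWAYS AS ROW END NOT NULL,
  PERIOD FOR SYSTEM_TIME (ValidFrom, ValidTo)
)
WITH (SYSTEM_VERSIONING = ON
      (HISTORY_TABLE = dbo.ContractsHistory));</code></pre><p>The catch for the GDPR hard deletes is that rows can only be removed from the history table after setting SYSTEM_VERSIONING = OFF, which is exactly where the 24/7 concern bites.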
But what about data overwrites? This is basically the same as deleting data, since information will be destroyed. Using soft delete is a somewhat naive solution if there is no mechanism for restoring overwritten data.
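A sketch of one such mechanism, assuming Postgres and a hypothetical documents table: a trigger that copies the old row into a history table before every update.<p><pre><code>CREATE TABLE documents (
  id   bigserial PRIMARY KEY,
  body text NOT NULL
);

-- same columns, no constraints needed
CREATE TABLE documents_history (LIKE documents);

CREATE FUNCTION save_old_row() RETURNS trigger AS $$
BEGIN
  -- archive the pre-update version of the row
  INSERT INTO documents_history SELECT OLD.*;
  RETURN NEW;
END;
$$ LANGUAGE plpgsql;

CREATE TRIGGER documents_audit
  BEFORE UPDATE ON documents
  FOR EACH ROW EXECUTE FUNCTION save_old_row();</code></pre>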