You can use lifecycle policies to delete it for free, but it's best to confirm that with support. I'm not saying this is a great solution (maybe it's intentionally hidden), but at least there is a way.<p><a href="https://stackoverflow.com/questions/59170391/s3-lifecycle-expiration-do-object-expiry-deletes-cost-money-for-sia-objects" rel="nofollow">https://stackoverflow.com/questions/59170391/s3-lifecycle-ex...</a>
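For reference, the lifecycle configuration for expiring everything in a bucket is short. This is a sketch: the rule ID and the one-day windows are my choices, and if the bucket is versioned you also need the noncurrent-version rule or the "deleted" objects stick around as noncurrent versions.

```json
{
  "Rules": [
    {
      "ID": "expire-everything",
      "Status": "Enabled",
      "Filter": { "Prefix": "" },
      "Expiration": { "Days": 1 },
      "NoncurrentVersionExpiration": { "NoncurrentDays": 1 }
    }
  ]
}
```

Apply it with `aws s3api put-bucket-lifecycle-configuration --bucket <bucket> --lifecycle-configuration file://expire.json` and wait a day or so; as the linked question discusses, expiration itself appears to be free, but confirm with support before relying on that.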
Not only is it a problem that deleting a bucket costs money; if you have a big bucket with many deeply nested files, it can also take a really long time to clean up using the AWS command line.<p>I ran into this with a bucket full of EMR log files a few years ago and had to resort to some pretty crazy command-line hackery, plus an EC2 machine with lots of cores, to get through it. This is a write-up I did in case anyone else runs into this issue.<p><a href="https://gist.github.com/michael-erasmus/6a5acddcb56548874ffe780e19b7701d" rel="nofollow">https://gist.github.com/michael-erasmus/6a5acddcb56548874ffe...</a>
Per-object costs can be tricky with S3 -- it's easy to mentally round costs less than 1/10th of a penny to zero, and then look up a few years later and realize you have hundreds of millions of things and can't afford to do anything with them.<p>When this bit us on a project I made a tool to solve our particular problem, which tars files, writes csv indexes, and can fetch individual files from the tars if need be.[1] Running on millions of files was janky enough that I also ended up scripting an orchestrator to repeatedly attempt each step of the pipeline.[2] Not tested on data other than ours but could be a useful starting point.<p>[1] <a href="https://github.com/harvard-lil/s3mothball" rel="nofollow">https://github.com/harvard-lil/s3mothball</a>
[2] <a href="https://github.com/harvard-lil/mothball_pipeline" rel="nofollow">https://github.com/harvard-lil/mothball_pipeline</a>
And deleting your AWS account will keep billing you [1] if you don’t delete all resources first.<p>AWS is designed to extract dollars from big enterprise contracts.<p>Also interesting from the article, this poor soul on StackOverflow was trying to figure out how to delete a bucket that would cost him $20,000 [2]. Can’t delete, can’t close.<p>[1] <a href="https://www.reddit.com/r/aws/comments/j5nh4w/ive_deleted_my_account_but_amazon_keeps_billing/" rel="nofollow">https://www.reddit.com/r/aws/comments/j5nh4w/ive_deleted_my_...</a><p>[2] <a href="https://stackoverflow.com/questions/54255990/cheapest-way-to-delete-2-billion-objects-from-s3-ia" rel="nofollow">https://stackoverflow.com/questions/54255990/cheapest-way-to...</a>
Pricing of AWS services makes me uneasy in general. Take S3 as an example: you go to the pricing page and you have several tabs with dozens of entries, which makes it difficult to calculate exactly how much you will pay. I might be simple-minded, but I prefer clearly defined plans with predetermined limits - you know exactly what it costs each month and what you get, and if you need more, you just switch to a higher plan, with no risk of nasty (and often expensive) surprises like those mentioned in the article.
Yup. And uploading / downloading large objects from S3 incurs tons of requests, because the S3 client does parallel chunking along with a small number of other control requests. That client works on the same premise as an SFTP client.<p>It’s amazing how often it retries.<p>Example from the Go SDK: <a href="https://github.com/aws/aws-sdk-go/blob/main/service/s3/s3manager/download.go#L303" rel="nofollow">https://github.com/aws/aws-sdk-go/blob/main/service/s3/s3man...</a>.
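To put rough numbers on the chunking, here's a sketch of the request-count math. The 5 MiB part size is, I believe, the default in the SDK download managers, and ~$0.0004 per 1,000 GETs is the Standard-tier price as of writing; both are assumptions worth checking against your SDK config and the pricing page.

```python
# Rough request-count math for a chunked S3 download (a sketch).
PART_SIZE = 5 * 1024 * 1024      # 5 MiB per ranged GET (assumed SDK default)
GET_PRICE = 0.0004 / 1000        # assumed ~$0.0004 per 1,000 GET requests

def requests_for(object_bytes: int) -> int:
    # One ranged GET per part, rounded up; retries only add to this.
    return -(-object_bytes // PART_SIZE)

size = 50 * 1024**3              # a 50 GiB object
gets = requests_for(size)
print(f"{gets} GET requests, ~${gets * GET_PRICE:.4f}")
```

So a single 50 GiB download is on the order of ten thousand GETs before any retries, which is why request charges show up even on "just a few objects" workloads.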
The first thing everyone who tries using cloud services should learn: <i>everything</i> costs money. Even the service that tells you how much it costs: <a href="https://aws.amazon.com/aws-cost-management/pricing/" rel="nofollow">https://aws.amazon.com/aws-cost-management/pricing/</a>
This post finally got my ass in gear to cancel an account that I thought I had closed but was still charging me a few dollars a month.<p>I spun up an AWS instance to practice, and once I was done I thought I had closed everything down.<p>Turns out I had just stopped my micro instances, not terminated them. I also hadn't released my IP address, and there was still a snapshot of the tiny db I had created floating around. The documentation was a little confusing, so after I went through it I spent half an hour chatting with a support rep to make sure everything was completely shut down. After next month my last bill should go through and I should be free and clear; unfortunately, I can't just pay it all off now.<p>This was mostly my fault for letting it go on for so long, but I hate how you can still be charged if you don't follow some very specific steps. If an account is closed, it should absolutely terminate all services still running on that account and then send you the final bill.
How much of this is a problem in practice?<p>I think in practice, S3 data is often indexed using other DBs, e.g. DynamoDB, Postgres, MySQL, etc. Can't this index be used to enumerate all S3 URLs? I am of course simplifying this a lot.
Stories like this make me extremely hesitant to try AWS. I was about to try S3 for a static site I was working on this weekend, but I think I'm gonna stick with Netlify or DigitalOcean instead after reading this.
> .5¢ per 1000 items LISTed seems insanely expensive considering how cheaply you can transfer terabytes of data with S3.<p>Correction: I misread - .5¢ per 1,000,000 items LISTed<p><pre><code> .5¢ per 1000 LIST operations
LIST operations max out at 1000 items
</code></pre>
Still a little pricey, but way less so than I'd imagined.<p>Do they make a lot of money off of charging for basic operations? It seems like you could make the whole pricing structure a lot more friendly by only charging for bandwidth use. I guess when you're as dominant as S3, you don't need to care about friendly pricing structures.<p>Charging for basic operations like that is weird, it's akin to a service charging people per number of clicks on a website.
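Putting the corrected figures together (a sketch; $0.005 per 1,000 LIST requests and the 1,000-item LIST cap are the numbers from the correction above):

```python
# Cost to enumerate every object in a bucket via LIST (a sketch).
LIST_PRICE_PER_1000 = 0.005   # $ per 1,000 LIST requests
ITEMS_PER_LIST = 1000         # each LIST returns at most 1,000 keys

def cost_to_enumerate(num_objects: int) -> float:
    lists_needed = -(-num_objects // ITEMS_PER_LIST)  # ceiling division
    return lists_needed * LIST_PRICE_PER_1000 / 1000

print(cost_to_enumerate(1_000_000))     # a million objects
print(cost_to_enumerate(100_000_000))   # a hundred million objects
```

So enumerating a million objects is about half a cent, and even a hundred million is around fifty cents - annoying, but not the bucket-sized bills elsewhere in the thread.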
> In 2021, anyone who comes across this question may benefit to know that AWS console now provides an empty button.<p>Source: <a href="https://stackoverflow.com/a/67834172" rel="nofollow">https://stackoverflow.com/a/67834172</a>
> you can also get an export of all objects in a bucket using S3 Inventory and run the output through AWS Batch in order to delete those objects<p>"S3 Batch Operations" sends S3 requests based on a CSV file, which can but does not have to come from S3 Inventory. But S3 Batch Operations supports only a subset of APIs, and that subset does not include DeleteObject(s). [0]<p>An AWS Batch job could run a container which sends DeleteObjects requests, but only when triggered by a job queue, which seems redundant here.<p>If I can't use an expiration lifecycle policy because I need a selection of objects not matching a prefix or object tags, I would run something with `s5cmd rm` [1]. Alternatively, roll your own Go program which parses the CSV and sends many DeleteObjects requests in parallel goroutines.<p>0. <a href="https://docs.aws.amazon.com/AmazonS3/latest/userguide/batch-ops-operations.html" rel="nofollow">https://docs.aws.amazon.com/AmazonS3/latest/userguide/batch-...</a><p>1. <a href="https://github.com/peak/s5cmd#delete-multiple-s3-objects" rel="nofollow">https://github.com/peak/s5cmd#delete-multiple-s3-objects</a>
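The roll-your-own approach is only a few lines in any SDK language. Here's a hedged Python sketch rather than Go: it assumes the object key is the second column of a default S3 Inventory CSV (where keys are URL-encoded), and batches keys 1,000 at a time, which is DeleteObjects' per-request limit.

```python
import csv
from itertools import islice
from urllib.parse import unquote_plus

def key_batches(csv_path, batch_size=1000):
    """Yield lists of up to 1,000 decoded keys from an S3 Inventory CSV.

    Assumes the default inventory layout: bucket in column 0, the
    URL-encoded object key in column 1.
    """
    with open(csv_path, newline="") as f:
        keys = (unquote_plus(row[1]) for row in csv.reader(f))
        while batch := list(islice(keys, batch_size)):
            yield batch

def delete_all(bucket, csv_path):
    # Deferred import so the batching logic above is testable offline.
    import boto3
    s3 = boto3.client("s3")
    for batch in key_batches(csv_path):
        s3.delete_objects(
            Bucket=bucket,
            Delete={"Objects": [{"Key": k} for k in batch], "Quiet": True},
        )
```

For real parallelism you'd fan the batches out to a thread pool, but even serially this is one request per 1,000 objects, which is roughly what `s5cmd rm` does for you with far less ceremony.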
They have an example of some person almost paying $20k on transition fees. In my early days of AWS, I racked up $90k on S3 transition fees. Thankfully, AWS forgave it.
Would the S3 inventory help here? That would allow you to get the list of all files (albeit on a delay similar to the lifecycle rule approach), which you could process offline to generate the DELETEs.<p><a href="https://docs.aws.amazon.com/AmazonS3/latest/userguide/storage-inventory.html" rel="nofollow">https://docs.aws.amazon.com/AmazonS3/latest/userguide/storag...</a>
Ok, pretty obvious, but if you don't know what you are storing inside your bucket, how are you accessing your objects in the first place?<p>If your use case is storing random things you don't know the path of, maybe it's the wrong product to use.
> AWS is "eventually consistent" within most services, and S3 is no exception<p>Nowadays, it (¿almost?) is. <a href="https://aws.amazon.com/s3/consistency/" rel="nofollow">https://aws.amazon.com/s3/consistency/</a>:<p><i>“After a successful write of a new object, or an overwrite or delete of an existing object, any subsequent read request immediately receives the latest version of the object”</i><p>I think that says that deletes are immediately visible, too, but they phrase it weirdly, as, after a delete, there is no latest version of the object.<p>Also, I don’t think buckets are objects in this sense, so the caveat in the article stands.
><i>The wait is often hours until AWS released a bucket name (since bucket names are globally unique, not just within your account).</i><p>I think last time I did this, the wait time was pretty much exactly 60 minutes.
Anyone have suggestions for S3 alternatives for storing many files sized 50-500 MB each? They are mostly long audio files, and there is an external index as well.
It's silly that they won't just let you delete the whole bucket, but this is actually pretty cheap, though.<p>Based on some quick maths, deleting a million files would only cost you like $5.<p>P.S. Again, it's silly they do this, and I'm probably greatly underestimating how these costs can add up for mid to large orgs.
> Deleting a bucket won't let you re-create that bucket immediately.<p>This is partially incorrect. I can recreate it immediately in the same account, but in a different account I need to wait ~1 hour.
Most things on AWS cost money and AWS makes pricing incredibly complex and opaque...where the monthly bill is usually the first way people find out about these things. While it is likely no consolation, S3 is by far one of the most complex AWS products pricing-wise with different object storage types each with their own rates, request costs with different rates for GET/PUT/POST/etc which this post mentions, and transit/egress fees.<p>I work on <a href="https://www.vantage.sh/" rel="nofollow">https://www.vantage.sh/</a> which helps teams get visibility on their cloud costs which may be helpful to folks here as well on this topic.