Not that I want to advocate for Kubernetes, but this is just a stepping stone for another blog post in a few years, "How managed services in AWS got us fired!", detailing how, when AWS changed its pricing/strategy, the companies had no way out of their design choices.<p>The "vendor agnostic" approach was the right call to make at that point in time. Sure, every business is unique and some cases fit better with a hands-off approach (let's pay for the convenience of AWS taking care of managed offerings). But it is a fallacy to think there is no cost to pay for that decision.<p>The cost of operating your vendor-agnostic infrastructure is replaced by your team now needing to learn the intricacies of AWS: IAM, AWS's way of networking, backups, etc. Those operational needs don't just go away; they just become "easier" and more constrained to AWS's way of doing things.<p>As a consultant, one must know where to draw the line and recommend the appropriate route.
For fun, I run a bare metal k8s cluster with my blog and other projects on it. My last three nights have been spent fighting bugs: volumes not attaching, nginx magically configuring itself incorrectly, and a whole bunch of other crap. It just magically started happening, and crap like this seems to happen at least once a month. It’s to the point where I spend at least one night a week babysitting the cluster.<p>I don’t have to pay someone else to handle this, but if I did, I would get rid of k8s in a heartbeat. I’ve seen a devops team of only a few people manage tens of thousands of traditional servers, but I doubt such a small team could handle a k8s cluster of the same size.<p>I’m considering moving back to traditional architecture for my blog and other projects. K8s has been fun, but there’s too much magic everywhere.
I can confirm that maintaining a Kubernetes cluster is a full-time job. Due to its design, there are a lot of moving parts even in the most minimal deployments.<p>Low-key, I hate touching Google-created projects: technically sound on paper, but in practice a guaranteed usability disaster.
This post touches a chord. Systems like Kubernetes and Kafka are inherently complicated. My previous company moved off AWS onto bare metal and installed a k8s cluster on it. No offense to whoever architected it; we had multi-country infra, and it made sense to chase the cost advantages of cheaper alternate providers.<p>We got a lot of critical infra running on those clusters, and then tech debt slowly started accumulating. Clusters have to get updated, older DNS versions in k8s are slow, and the networking strained (older Weave versions were bursting at the seams when traffic exploded as more applications were onboarded). SRE teams get overwhelmed, and the constant requests for adding PVCs (Kafka & C* were on k8s) took a toll; a sketch of that kind of PVC request follows below.
Sanity prevailed in the end and there was a decision to move to hosted PaaS infra. I no longer work there; I'm just reminiscing about what we went through.<p>Though a "cloud-independent" solution will save pennies, it will definitely drown dollars in personnel costs and uptime/SLA.<p>History repeats itself, because we don't learn from our mistakes (ours or others').
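For readers who haven't lived this: the "constant requests for adding PVCs" mentioned above are essentially tickets to create objects like the one in this minimal sketch, written with the official `kubernetes` Python client. The namespace, names, storage class, and size are invented for illustration, not details from the comment.

```python
# Hypothetical example of a PVC request for a Kafka broker's data volume.
# Namespace, names, storage class, and size are illustrative only.
from kubernetes import client, config

config.load_kube_config()  # or config.load_incluster_config() when run inside the cluster

pvc = client.V1PersistentVolumeClaim(
    metadata=client.V1ObjectMeta(name="kafka-broker-0-data", namespace="kafka"),
    spec=client.V1PersistentVolumeClaimSpec(
        access_modes=["ReadWriteOnce"],
        storage_class_name="fast-local",  # assumed storage class
        resources=client.V1ResourceRequirements(requests={"storage": "500Gi"}),
    ),
)

client.CoreV1Api().create_namespaced_persistent_volume_claim(namespace="kafka", body=pvc)
```

Each of these claims ties back to provisioning, capacity planning, and backup questions, which is why they pile up on an SRE team.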
Very funny article. We are spending 2 people full time (out of 4) trying to build on AWS services. It is really a mess and costs a lot of human resources.
Having run k8s and Kafka in a previous job (I left before I got the sack), this article rings completely true.<p>New shop: Lambda and EventBridge = life is good.
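For contrast with the k8s setups discussed above, "Lambda and EventBridge" amounts to roughly this: a handler plus a rule pointing at it. This is only a sketch; the function name, schedule, and ARN are made up, and a real deployment would more likely be wired with CloudFormation/CDK/Terraform than raw boto3 calls.

```python
# Hypothetical sketch of a Lambda handler driven by an EventBridge schedule.
# Names, ARN, and schedule are illustrative only.
import json
import boto3

def handler(event, context):
    # Lambda entry point: EventBridge delivers the event as a plain dict.
    print(json.dumps(event))
    return {"status": "ok"}

def wire_up_schedule(lambda_arn: str) -> None:
    events = boto3.client("events")
    events.put_rule(Name="nightly-job", ScheduleExpression="rate(1 day)")
    events.put_targets(
        Rule="nightly-job",
        Targets=[{"Id": "nightly-job-lambda", "Arn": lambda_arn}],
    )
    # The function also needs a resource-based policy allowing
    # events.amazonaws.com to invoke it (lambda:AddPermission).
```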
I am happy to see people talking about this.<p>Every time I try to point out that systems are getting more and more complicated and call for simplicity, I am pushed away.<p>I've heard you are not taken seriously if you don't use a well-established cloud provider or similar.<p>Truth be told, there are not many projects you do or will work on that need this kind of thing.<p>We thought the cloud would help us with a lot of things, but at what cost? And by cost I mean stress, data protection, money, etc.
This is a good example of people making questionable practical decisions based on good principles. Yes, in principle it'd be good to be agnostic of your cloud provider. But there are a few issues with it. Firstly, you're going to work <i>so much harder</i> doing what you want to do on AWS while avoiding the things AWS wants you to do. They're not dumb! You don't want to be locked in? They really want you locked in, and they're going to work very hard to make that happen. Secondly, the likelihood of you ever actually making the decision to leave is extremely low, so you're paying all these costs for what is at best a theoretical risk. And finally, even if you do everything perfectly and never depend on anything uniquely Amazon... leaving is still going to suck! It's still going to be a huge amount of work to migrate away!
There seem to be two camps: those who think k8s is a godsend, and those who think it's the devil incarnate. We fall into the former.<p>We run Rancher across a couple of bare metal clusters and it's been a mostly amazing experience (ca. 3 years). The only issues we had were with Rancher-specific bugs, but those have been resolved and for the most part our infra is pretty autonomous. We do all HA at the application layer, so local NVMe as opposed to network storage. This means Patroni, Redis Sentinel/Cluster, etc. But it broadly just works. Maybe we're not big enough to bump into issues, but I couldn't imagine migrating to the labyrinth of vendor lock-in masquerading as cloud services.<p>What am I missing? Why is our experience so wildly different from others'?
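To make the "HA at the application layer" point concrete, here is a minimal sketch of a Sentinel-aware client using redis-py. The Sentinel hostnames and the "mymaster" service name are assumptions, not details from the comment; the idea is that clients discover the current master themselves, so failover needs no shared network storage underneath.

```python
# Hypothetical example: the application asks Sentinel who the current Redis
# master is, so a failover is handled without any network-attached volume.
from redis.sentinel import Sentinel

sentinel = Sentinel(
    [("redis-sentinel-0", 26379),
     ("redis-sentinel-1", 26379),
     ("redis-sentinel-2", 26379)],
    socket_timeout=0.5,
)

master = sentinel.master_for("mymaster", socket_timeout=0.5)   # writes go here
replica = sentinel.slave_for("mymaster", socket_timeout=0.5)   # reads can go here

master.set("greeting", "hello")
print(replica.get("greeting"))
```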
I don't particularly like Kubernetes at all, and while I like Kafka, it's definitely overkill for the kind of system discussed in the article. But I gotta say, 87%? What the hell? I had >98% uptime with the first Kafka tooling I ever built, and that was 3 nodes on machines shared with the ZooKeepers, producers/consumers split across three cities, and processing 10x the traffic they're talking about; maintained by <i>just me</i> on medium-range Hetzner boxes.<p>It feels like there's something deeper hiding here, more along the lines of "our developers really don't / can't care about how the software is operating in production."
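For scale, the durability of a small 3-broker setup like the one described mostly comes down to topic replication plus producer acknowledgements. Here is a rough sketch with kafka-python; the broker hostnames and topic name are invented, and the topic itself would be created with replication.factor=3.

```python
# Hypothetical producer for a small 3-broker cluster; hostnames and topic are made up.
from kafka import KafkaProducer

producer = KafkaProducer(
    bootstrap_servers=["kafka-1:9092", "kafka-2:9092", "kafka-3:9092"],
    acks="all",   # wait for all in-sync replicas before acknowledging a write
    retries=5,    # retry transient broker failures instead of dropping messages
)

producer.send("events", b'{"kind": "page_view"}')
producer.flush()
```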