Datadog has turned into an ever loving piece of shit. I am sick of their sales team grabbing us by the ankles and "Accidentally" charging for services we don't use. Now, this morning they changed something with their AWS integration that is causing 10X the API calls against our accounts (and thus, 10X guardduty costs on our end analyzing those API requests).<p>I have had enough of their bullshit and want to find a new vendor. What is everyone using for cloud scale logging and monitoring these days that isn't one of Splunk (cost prohibitive), Sumo-Logic (Also assholes), or Datadog?<p>It would be nice to just self host a service that stored logs in S3 using iceberg format. However, I don't think such a tool exists.
We self-host on our own open-source product, but it helps to have a streaming data platform as a product. Not too difficult to write a connector-sink to S3. <a href="https://www.fluvio.io/" rel="nofollow">https://www.fluvio.io/</a><p>The caveat is that this is a general purpose data streaming product, so it isn't that opinionated or customized for logging. We still use logging custom frontends for browsing (e.g. grafana), but for core logging/monitoring/billing and critical alerts, it's just Fluvio.<p>Been looking at the <a href="https://coroot.com" rel="nofollow">https://coroot.com</a> ebpf collectors to see if we can replace our sidecar sources recently.
Datadog is a monstrosity, they do so many things, most of which you don't know or don't care about, so which ones do you need?<p>We ended up going for HyperDX, just because I really favour fast log searches and simple alerts straight to Slack, but many of these Clickhouse-powered observability suites like Signoz, BetterStack, etc will do the job. Just depends how much you want to pay.<p>Also, I hope you're on OTEL already, since it makes changing providers the most easy task ever.
I like SigNoz, it isn't as good as Datadog yet, but I can see it growing to be just as good, with a very small bill in comparison to DD, and a more sustainable revenue model.<p>They need to focus a lot on APM. Most people pay Datadog because of APM. The rest can be done quite well by using ELK stack or any other tools.
Datadog’s antics are a headache, but I'm stuck with Splunk due to corporate compliance and security demands. We keep the costs in check by funneling our logs through Cribl. In my experience, most companies end up juggling Splunk, Wazuh, and some custom SecOps setups—no silver bullet, just a series of compromises that keep the auditors happy.
I totally understand your frustration with Datadog's issues. How about the combination of Prometheus + Grafana? Prometheus is for monitoring data collection and Grafana is for visual display. They are open - source and free, allowing for flexible customization. Do you have any specific requirements for monitoring metrics and visualization?
I've tried highlight.io (I'm not affiliated) which I found quite nice, there is an open-source version, although I tried to spin it and it was complex to get it working, but I'm using their cloud version for now
Check out SigNoz, <a href="https://github.com/signoz/signoz">https://github.com/signoz/signoz</a><p>Has metrics, logs and traces in a single app and built natively on OpenTelemetry<p>Disclaimer : I am a maintainer