Show HN: Open-source APM with support for tracing, metrics, and logs

112 pointsby vmihailencoover 2 years ago

Uptrace is an all-in-one tool that supports distributed tracing, metrics, and logs. It uses OpenTelelemetry observability framework to collect data and ClickHouse database to store it.You can ingest data using OpenTelemetry Protocol (OTLP), Vector Logs, and Zipkin API. You can also use OpenTelemetry Collector to collect Prometheus metrics or receive data from Jaeger, X Ray, Apache, PostgreSQL, MySQL and many more.The latest Uptrace release introduces support for OpenTelemetry Metrics which includes:- User interface to build table-based and grid-based dashboards.- Pre-built dashboard templates for Golang, Redis, PostgreSQL, MySQL, and host metrics.- Metrics monitoring aka alerting rules inspired by Prometheus.- Notifications via email/Slack/PagerDuty using AlertManager integration.There are 2 quick ways to try Uptrace:- Using the Docker container - <a href="https://github.com/uptrace/uptrace/tree/master/example/docker" rel="nofollow">https://github.com/uptrace/uptrace/tree/master/example/docke...</a>- Using the public demo - <a href="https://app.uptrace.dev/play" rel="nofollow">https://app.uptrace.dev/play</a>I will be happy to answer your questions in the comments.

14 comments

derN3rdover 2 years ago

Nice to see so many new projects in the area of APM in the last few months.We recently tried Signoz and Grafana Tempo and while I can't say something about uptrace yet (will definitely try it out) I want to list some pros and cons about them.Grafana TempoPros:- Easy and smooth integration into our existing Grafana instance, no additional frontend needed- No new storage engine needed (No additional Clickhouse, Postgres, etc) as it saves its data to S3- Supports OTLPCons:- Search is limited by param size and unique params (as its baked to be indexed)- Ingestion is not in real time, but configurable (time to finish span)Signoz:Pros:- Supports OTLP- Integrates Logs and Metrics within the same service (for Grafana you need Loki then)- Supports real time queryingCons:- Uses new storage engines (or extends the software stack) with adding ClickHouse- Adds an additional frontend (might not be relevant for everyone)- Doesn't provide SSO yet, so you need to manage users differentlyInteresting to see, that UpTrace also chose ClickHouse (btw I love ClickHouse!)Some questions:- Can I easily disable certain features? (e.g. alerting)- Is there support for SSO for self-hosted installation?- Are there any recommendations for scaling (e.g. benchmarks) on how many spans/s are supported on what hardware?Thanks in advance!

评论 #32734211 未加载

评论 #32748809 未加载

评论 #32736346 未加载

vmihailencoover 2 years ago

You can ingest data using OpenTelemetry Protocol (OTLP), Vector Logs, and Zipkin API. You can also use OpenTelemetry Collector to collect Prometheus metrics or receive data from Jaeger, X Ray, Apache, PostgreSQL, MySQL and many more.The latest Uptrace release introduces support for OpenTelemetry Metrics which includes:- User interface to build table-based and grid-based dashboards.- Pre-built dashboard templates for Golang, Redis, PostgreSQL, MySQL, and host metrics.- Metrics monitoring aka alerting rules inspired by Prometheus.- Notifications via email/Slack/PagerDuty using AlertManager integration.There are 2 quick ways to try Uptrace:- Using the Docker container - <a href="https://github.com/uptrace/uptrace/tree/master/example/docker" rel="nofollow">https://github.com/uptrace/uptrace/tree/master/example/docke...</a>- Using the public demo - <a href="https://app.uptrace.dev/play" rel="nofollow">https://app.uptrace.dev/play</a>I will be happy to answer your questions in the comments.

评论 #32733945 未加载

bovermyerover 2 years ago

I see lots of new tracing options these days, and that seems to have taken over the "APM" term.I still have yet to see new profiling options. When I think of APM, I think of CPU profiling and automatic instrumentation of black box systems, not request tracing. I should be able to see which function calls are slow/problematic, without having to add code to the application.

评论 #32736454 未加载

评论 #32744648 未加载

jonasdevopsover 2 years ago

Before you try, please make sure you are comfortable with their license - <a href="https://github.com/uptrace/uptrace/blob/master/LICENSE" rel="nofollow">https://github.com/uptrace/uptrace/blob/master/LICENSE</a> (Business Source License 1.1), which as License says "The Business Source License (this document, or the “License”) is not an Open Source license"

评论 #32735372 未加载

评论 #32734590 未加载

KronisLVover 2 years ago

This seems like a pretty cool project!Currently using Apache Skywalking myself, because it's reasonably simple to get up and running, as well as integrate with some of the more popular stacks: <a href="https://skywalking.apache.org/" rel="nofollow">https://skywalking.apache.org/</a>I do wonder how ClickHouse (which Uptrace uses) would compare with something like ElasticSearch (which is used by Skywalking and some others) and how badly/well an attempt to use something like MariaDB/MySQL/PostgreSQL for a similar workload would actually go.I mean, something like Matomo Analytics already uses a traditional RDBMS for storing its data, albeit it might be an order of magnitude or two off from the typical APM solution.

评论 #32735766 未加载

tylergetsayover 2 years ago

I think the log interface should be optimized for keyboard navigation and larger screens. On my 4k monitor it only takes up 1/2 the width and only shows 10 lines at a time, id expect closer to ~100

评论 #32736300 未加载

tmd83over 2 years ago

I wonder if anyone can answer some question on distributed tracing for me.The difference between old days of APM vs. tracing as I understand is two things.1. Originally APM was single process and it was language aware, usually do sampling stacktrace to find where times are being taken and some very well know place to instrument for exact timing say response time or query time.Tracers are more working by instrumenting methods of framework/servers/runtime at well known point and getting the timing. In man ways it's a lot more coarse as it might know of a hot loop that I have in my code. But it can trace very well with exact timing at framework boundary like web, cache, db etc.2. The APM were primarily single process and couldn't really show a different service/process which doesn't work in a micro-service/distributed world.The way I understand it is that Tracers would allow me to narrow down to the service/component very easily. Whether I can find out why that component is slow might not be as easy (not sure what granularity tracing happens inside a component).I wonder if this understanding of mine is correct.The second thing I am really unsure about is sampling and overhead. What's the usual overhead of a single tracing (I know it's variable) but generally are they more expensive at a single request level. Also do they usually sample and is there a good/recommend way to sample this. I forgot exactly who but (probably NewRelic) was saying they collect all traces (like every request?) and discard if they are not anomalous (to save on storage). But does that mean taking a trace is very cheap? And is that end of the request sampling decision something that's common or that's a totally unique capability some have.

评论 #32735683 未加载

PeterZaitsevover 2 years ago

False Advertising!BSL Licensed is not Open Source. To Be fair Utrace restrictions are relatively light but it is still Source Available project not Open Source

nik736over 2 years ago

Nice! Exactly what I've been looking for, will give it a try for sure. Sentry eats a lot of resources so I was looking for an alternative.

评论 #32734335 未加载

edf13over 2 years ago

Looks nice... I'm a bit out of touch in this space but my last solution for similar would be Datadog. How does this compare?

评论 #32734333 未加载

xyzzy_plughover 2 years ago

I've been out of the loop for a while but...> OpenTelemetry Protocol (OTLP)> OTLP> OLTPI'm going back to bed.

xferover 2 years ago

Anyways to export dashboard for public viewing, maybe even static image? It looks like all drawing is done client side at present.

评论 #32737889 未加载

ram_rarover 2 years ago

can you elaborate more on why clickhouse for backend? And what challenges if any are you facing with clickhouse?

wdbover 2 years ago

How does it compare to Opstrace? (www.opstrace.com)