TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: Homelab Monitoring Setup with Grafana

155 点作者 conor_f将近 2 年前

17 条评论

BrandoElFollito将近 2 年前
I self host for years about 30 services, out of these 3 are vital (bitwarden, home assistant and pihole).<p>I work in IT, I am a geek so I tried a few monitoring systems and wrote two myself.<p>Then I realized that I have self-sustaining, 24&#x2F;7 monitoring agents: wife and children.<p>I gave up trying to have the right stack and just wait for them to yell.<p>Seriously: it works great and it made me wonder WHY I am trying to monitor. Turns out this is more for the fun, discovery of tools than a real need at home.
评论 #36231363 未加载
评论 #36231659 未加载
评论 #36232144 未加载
sjsdaiuasgdia将近 2 年前
This confirms to me what I suspected when I was trying to determine whether to host my own Grafana stack or use the Grafana Cloud free tier - that I&#x27;d end up spending a ton of time fiddling with a constellation of services I didn&#x27;t actually care about that I could spend on the projects and services I do care about.<p>I&#x27;ve not found it too hard to stay within the limits of the free tier. The 10 dashboards limit is the main one that actually constrains me, but I just put more stuff on each dashboard and live with the scrolling. The free retention is not great but it&#x27;s good enough for my purposes.
评论 #36227178 未加载
评论 #36231449 未加载
评论 #36227011 未加载
评论 #36227067 未加载
评论 #36226549 未加载
bovermyer将近 2 年前
I&#x27;m in the process of building out a Grafana stack (Prometheus, Loki, Tempo, Mimir, Grafana) for my day job right now.<p>...and also for one of my side projects, OSRBeyond.<p>It&#x27;s easy to get overwhelmed by all the moving pieces, but it&#x27;s also a lot of _fun_ to set up.
评论 #36227598 未加载
评论 #36227622 未加载
评论 #36227080 未加载
adql将近 2 年前
I&#x27;ve found VictoriaMetrics all-in-one binary to be perfect size for home at the very least for metrics gathering.<p>Supports Prometheus querying and few other formats for ingesting so any knowledge bout &quot;how to get data into prometheus&quot; applies pretty much 1:1 + their own vmagent is pretty advanced. Not related to company in any way, just a happy user.<p><a href="https:&#x2F;&#x2F;victoriametrics.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;victoriametrics.com&#x2F;</a>
评论 #36227343 未加载
评论 #36231850 未加载
conor_f将近 2 年前
Hey everyone, this is a post I&#x27;ve been working on the past few months about setting up my own monitoring stack with Grafana for my home server.<p>I&#x27;d love your feedback on how this process could be easier for me, some resources on learning the Grafana query languages, and general comments.<p>Thanks for taking the time to read + engage!
评论 #36227345 未加载
tacker2000将近 2 年前
I have been using Zabbix to monitor my servers for the last years, since I wanted something simple and this Grafana&#x2F;Prometheus stack always scared me because, as the OP says, of the amount of “moving parts”.<p>Zabbix has been quite solid and has lots of templates for different servers (linux, windows, etc), triggers and can also monitor docker containers (although i never tried that).<p>The only thing Zabbix cant do well is log file monitoring, so I am considering something like an ELK stack as an addition.
评论 #36228449 未加载
评论 #36234682 未加载
评论 #36229530 未加载
评论 #36232412 未加载
shrx将近 2 年前
Mildly related: can anyone recommend a time series database that supports easy aggregation by week (with the ability to configure the start of the week) and month? I&#x27;m looking for something to switch from InfluxDB which I&#x27;m currently using. The linked article is using Prometheus which also doesn&#x27;t appear to support this functionality.
评论 #36233257 未加载
评论 #36232119 未加载
majkinetor将近 2 年前
Is there anything easier for logs? Basically glorified ripgrep?
评论 #36240989 未加载
评论 #36237300 未加载
whalesalad将近 2 年前
check out netdata if y&#x27;all haven&#x27;t already - incredible software
评论 #36241112 未加载
codetrotter将近 2 年前
I recently set up packet loss monitoring on a Raspberry Pi, using Prometheus for logging and graphing.<p><a href="https:&#x2F;&#x2F;video.nstr.no&#x2F;w&#x2F;hjTH3Vggn2fvpTrQitMmVP" rel="nofollow">https:&#x2F;&#x2F;video.nstr.no&#x2F;w&#x2F;hjTH3Vggn2fvpTrQitMmVP</a><p>I would like to set up Grafana and more monitoring as well, on some of my other machines. But for now this is what I have :D
czzzzz将近 2 年前
Shameless plug for AppScope (<a href="https:&#x2F;&#x2F;github.com&#x2F;criblio&#x2F;appscope">https:&#x2F;&#x2F;github.com&#x2F;criblio&#x2F;appscope</a>) which is designed for exactly this. Capturing observability data from processes in your environment without code modification, and shipping the data off to tools like grafana for monitoring.
评论 #36231893 未加载
hardwaresofton将近 2 年前
Has anyone had lots of trouble configuring Grafana via YAML from the documentation? A lot of it is kind of hard to follow.<p>I&#x27;ve found that the ability to (pre)configure Grafana <i>without</i> clicking around in it is pretty difficult.
评论 #36233371 未加载
guybedo将近 2 年前
shameless plug for uptimeFunk (<a href="https:&#x2F;&#x2F;uptimefunk.com" rel="nofollow">https:&#x2F;&#x2F;uptimefunk.com</a>) that i soft launched a some time ago. I wanted some uptime monitoring with nice ui and a few advanced features that i didn&#x27;t find anywhere: - monitoring mongo db&#x2F;replicaset status<p>- monitoring sql databases with basic sql queries<p>- monitoring host cpu, ram and disk usage<p>- monitoring docker containers<p>- and being able to monitor all of this through ssh tunnels because not all my services are on the internet
评论 #36237616 未加载
shashasha2将近 2 年前
We&#x27;ve been using nagios and munin for years, this stack is rock solid. We added recently ELK. This feels overkill, heavyweight and fragile.
评论 #36227910 未加载
评论 #36228612 未加载
artisin将近 2 年前
I went down the Grafana rabbit hole, and without a doubt, it&#x27;s a fantastic tool. It can handle just about any kind of data you throw at it, and when it comes to visualizing time series data, it&#x27;s second to none. That said, it&#x27;s a slog to set up and configure, but once finished, I had a beautiful dashboard for my home media server, and life was good. Unfortunately, a few months later, I was forced to upgrade and lacked the time to reconfigure Grafana. So, as a stopgap, I installed Netdata... fast-forward two years, and today I still haven&#x27;t reconfigured Grafana, nor do I plan to.<p>For my use case, a home media server, Netdata turned out to be way simpler to set up, and, most importantly, way less of a hassle&#x2F;dink-around. It&#x27;s a basic plug-and-play operation with auto-discovery. While the dashboard isn&#x27;t nearly as beautiful or configurable, it gets the job done and provides everything I pretty much need or want. It offers a quick overview, historical metrics (over a year of data) to analyze trends or spot potential issues, and push&#x2F;email notifications if something goes awry.<p>If you decide to go down this route, there are two major items:<p>1. You&#x27;ll need to configure the dbengine[1] database to save and store historical metric data. However, I found the dbengine configuration documentation to be a bit confusing, so I&#x27;ll spare you the trouble - just use this Jupyter Notebook[2]. If needed, adjust the input, run it, scroll down, and you&#x27;ll see a summary of the number of days, the maximum dbengine size, and the yaml config, which you can copy, paste, and voila.<p>2. If you&#x27;re hoarding data, you&#x27;ll probably want to set up smartmontools&#x2F;smartd[3] in a separate Docker container for better disk monitoring metrics. However, I think you can enable hddtemp[4] with Netdata through the config if you don&#x27;t want or need the extra hassle. You can have Netdata to query this smartd container, but with a handful of disks, it ends up timing out frequently, so I found it&#x27;s best to simply set up smartd&#x2F;smartd.conf to log out the smartd data independently. Then all you need to do is tell Netdata where to find the smartd_log[5], and Netdata handles the rest.<p>Boom, home media server metrics with historical data, done. It still takes a bit of time to set up, but way less than Grafana. Anywho, hopefully, this saves you from wasting as much time as I did. And if you&#x27;re looking for a smartd reference, shoot me a reply, and I&#x27;ll tidy up and share my Docker config&#x2F;scripts and notes.<p>[1] <a href="https:&#x2F;&#x2F;learn.netdata.cloud&#x2F;docs&#x2F;typical-netdata-agent-configurations&#x2F;optimizing-metrics-database&#x2F;database-modes-for-parent-child-setups#choose-your-database-mode" rel="nofollow">https:&#x2F;&#x2F;learn.netdata.cloud&#x2F;docs&#x2F;typical-netdata-agent-confi...</a> [2] <a href="https:&#x2F;&#x2F;colab.research.google.com&#x2F;github&#x2F;andrewm4894&#x2F;netdata-storage-calculator&#x2F;blob&#x2F;main&#x2F;calculator.ipynb#scrollTo=XNOCVoIMBR8xe" rel="nofollow">https:&#x2F;&#x2F;colab.research.google.com&#x2F;github&#x2F;andrewm4894&#x2F;netdata...</a> [3] <a href="https:&#x2F;&#x2F;www.smartmontools.org&#x2F;wiki" rel="nofollow">https:&#x2F;&#x2F;www.smartmontools.org&#x2F;wiki</a> [4] <a href="https:&#x2F;&#x2F;github.com&#x2F;vitlav&#x2F;hddtemp">https:&#x2F;&#x2F;github.com&#x2F;vitlav&#x2F;hddtemp</a> [5] <a href="https:&#x2F;&#x2F;learn.netdata.cloud&#x2F;docs&#x2F;data-collection&#x2F;storage,-mount-points-and-filesystems&#x2F;hardware-storage&#x2F;s.m.a.r.t.-attributes#configuration" rel="nofollow">https:&#x2F;&#x2F;learn.netdata.cloud&#x2F;docs&#x2F;data-collection&#x2F;storage,-mo...</a>
评论 #36233635 未加载
revskill将近 2 年前
Just push to github and people will contribute the rest for you. Easy!
Demmme将近 2 年前
With 40 containers I would go kubernetes and with Kube stack you basically have this up and running in 5 minutes.<p>Aligning metric endpoints for fine-tuning.<p>Add tracing to it in a few more clicks