TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: Monitoring historical server resource usage on linux?

11 点作者 apollo超过 15 年前
I want to be able to find performance bottlenecks after usage spikes, so I would like a tool which monitors and logs cpu, memory, disk io over time on each of my servers. These are ubuntu servers.<p>If you have any other suggestions for tools I can use to help find bottlenecks, I'd appreciate that too; I'm new to this.

12 条评论

ezmobius超过 15 年前
Collectd, I swear by it, running it on thousands of hosts without issues.
评论 #859605 未加载
评论 #859680 未加载
评论 #859615 未加载
jbellis超过 15 年前
There's a ton of projects, none of which is truly easy to deploy, all of which kinda suck, and all of which store data as RRDs (which is the worst possible solution except for all the others, as the saying goes).<p>This is an area that could use some new blood, but apparently the existing solutions work barely well enough.
lg超过 15 年前
You probably know about sar, but a lot of people may not; it can get advanced with all the options, but for simple cpu spot checks:<p><pre><code> sar -u X Y </code></pre> prints Y lines of utilization info, one every X seconds.
rv77ax超过 15 年前
<a href="http://www.solarisinternals.com/wiki/index.php/Dim_STAT" rel="nofollow">http://www.solarisinternals.com/wiki/index.php/Dim_STAT</a><p>&#60;quote&#62; <i>All STAT data are collected from standard Solaris or Linux programs (vmstat, iostat, etc.) or some special (like psSTAT for users/processes activity) and saved in MySQL database. Collected data are accessed via Web interface and can be presented in several manner (interactive or static graphs, text, HTML tables).</i><p><i>dim_STAT can be used for On-Line monitoring one or several hosts on the same time. As well, data may be easily post loaded from output files of stat commands and analyzed in the same manner. At any time collecting from new stat commands may be added to the tool (via Add-On interface) and enlarge your view on application workload, RDBMS, your personal STAT program, etc.</i> &#60;/quote&#62;
neondiet超过 15 年前
Where I'm working at the moment we use Zabbix to monitor system availability and collect performance and trend data for 184 servers [that generate 8701 monitored items of data]. The Zabbix folk do all their development on Ubuntu and it's their preferred platform; but it works just as well on Redhat, CentOS, etc and has client agents for a wide variety of platforms.
jws超过 15 年前
I keep munin, <a href="http://munin.projects.linpro.no/" rel="nofollow">http://munin.projects.linpro.no/</a> installed on all my servers. When things begin to look unwell, it lets you look back to see what resource, temperature, voltage, or activity might have changed.<p>It is alleged to be easy to add your own data points to it as well.
评论 #859988 未加载
评论 #859698 未加载
carl_超过 15 年前
I use zabbix over ~&#60;1000 physical and virtual hosts for historical resource usage and alerting.<p>Cacti for network traffic monitoring (and alerting using thold plugin).<p>Smokeping for network latency and availability monitoring and alerting.
duskwuff超过 15 年前
Ganglia is pretty good, although it's oriented more towards detecting problems than diagnosing them post-mortem.
josephruscio超过 15 年前
Are you looking only at free solutions? Or is proprietary software an option?
评论 #859610 未加载
ApolloRising超过 15 年前
You can try Munin, MRTG, or CACTI
brianr超过 15 年前
Cacti.
maelstrom超过 15 年前
rrdtool