TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: How does your team organize/manage their runbooks?

1 点作者 bullcitydev将近 4 年前
I&#x27;m a member of a 15 person team who are responsible for several production services. We try to write runbooks for any incident that occurs and tie them to our alerts (DataDog), but I&#x27;ve noticed a couple of problems with how we manage them:<p>1. Runbooks become out of date. This may be because no one on the team feels like they &#x27;own&#x27; our runbooks (even though I feel like they should be everyones responsibility to keep up with).<p>2. We keep our runbooks in a GitHub repository for our team, but they are nested under several subfolders depending on the service they reference. This makes them hard to find during an incident (especially if you are woken up at 3am). GitHub search is not the best for this usecase it seems.<p>I&#x27;m wondering how other teams manage their runbooks and keep them up to date and easily discoverable? I know there are tools out there like Rundeck (PagerDuty) and what was VictorOps (now Splunk Oncall or something), but these seem to focus on Runbook Automation which is not what we want. I don&#x27;t want to fully automate our runbooks, but simply make them easily discoverable when we need them and some how encourage keeping them up to date.<p>Any ideas&#x2F;feedback would be greatly appreciated!

暂无评论

暂无评论