TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: Versioning and Pipeline for ML Research

1 点作者 mfazalul超过 6 年前
Being an applied ML PhD student, I often find myself having multiple versions of files and scripts towards the end of each research cycle. When I&#x27;m at the point of writing a paper for submission, it gets difficult to back-track and find the earlier results or have difficulty running the earlier experiments as files might have changed. I&#x27;ve been trying to find a way to solve this problem, but the closest I could get to a solution was amie.ai. Due to the product still being in early release, it isn&#x27;t quite what I&#x27;m looking for. The use of Jupyter Notebook does not help much either. I wrote up the requirements as to what might work to solve this problem (https:&#x2F;&#x2F;github.com&#x2F;fazalul92&#x2F;ProtoPypelines), but wanted to make sure if there is anything currently available that can solve it.<p>To be more specific, at the very least the following features would be helpful: 1) A tree structure to organize the different changes or &quot;flows&quot; within the code. 2) Being able to execute from a cell&#x2F;line of code such that all the preceding lines of code will execute. (only the nodes leading up to that cell&#x2F;line) 3) Versioning of files so that new child-nodes grow from the point at which the change was made.<p>So my question is to those who do rapid prototyping and&#x2F;or ML research (or anyone else who might have found themselves in similar situations), What do you do to keep track of your earlier work?

暂无评论

暂无评论