TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: Local fine tuning for Mistral and SDXL, GPU mem/latency optimization

36 点作者 lewq超过 1 年前
100% bootstrapped new startup. It lets you fine tune Mistral-7B and SDXL. In particular, for the LLM fine tuning we implemented a dataprep pipeline that turns websites&#x2F;pdfs&#x2F;doc files into question-answer pairs for training the small LLM using an big LLM.<p>It includes a GPU scheduler that can do finegrained GPU memory scheduling (Kubernetes can only do whole-GPU, we do it per-GB of GPU memory to pack both inference and fine tuning jobs into the same fleet) to fit model instances into GPU memory to optimally trade off user facing latency with GPU memory utilization<p>It&#x27;s a pretty simple stack of control plane and a fat container that runs anywhere you can get hold of a GPU (e.g. runpod).<p>Architecture: <a href="https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;architecture" rel="nofollow noreferrer">https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;architecture</a><p>Demo walkthrough showing runner dashboard: <a href="https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;overview" rel="nofollow noreferrer">https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;overview</a><p>Run it yourself: <a href="https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;controlplane" rel="nofollow noreferrer">https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;controlplane</a><p>Discord: <a href="https:&#x2F;&#x2F;discord.gg&#x2F;VJftd844GE" rel="nofollow noreferrer">https:&#x2F;&#x2F;discord.gg&#x2F;VJftd844GE</a><p>Please roast me!

2 条评论

leblancfg超过 1 年前
This is huge. Luke I think you have a winner here, this is great. Can&#x27;t wait to try it over the holidays.<p>If I can be cheeky, be sure to repost over the coming days at different hours – you&#x27;re likely to spawn more traffic that way =)
评论 #38732513 未加载
lewq超过 1 年前
Some resources:<p>Demo: <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=Ym4nPSzfer0" rel="nofollow noreferrer">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=Ym4nPSzfer0</a><p>About Helix<p>Helix is a generative AI platform that you can run on our cloud or deploy in your own data center or cloud account. It provides an easy-to-use interface to using open source AI that&#x27;s accessible to everyone.<p>Under the hood, it uses the best open source models and includes a GPU scheduler that can fit model instances into GPU memory to optimally trade off user facing latency with GPU memory utilization.<p>If you think this is cool, please vote for us on <a href="https:&#x2F;&#x2F;www.producthunt.com&#x2F;posts&#x2F;helix-5" rel="nofollow noreferrer">https:&#x2F;&#x2F;www.producthunt.com&#x2F;posts&#x2F;helix-5</a> today.<p>Docs: <a href="https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;overview" rel="nofollow noreferrer">https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;overview</a><p>Architecture: <a href="https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;architecture" rel="nofollow noreferrer">https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;architecture</a><p>Things to try with LLM fine-tuning using Helix:<p>- <a href="https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;papers" rel="nofollow noreferrer">https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;papers</a><p>- <a href="https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;engaging-content" rel="nofollow noreferrer">https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;engaging-content</a><p>- <a href="https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;insights-data" rel="nofollow noreferrer">https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;insights-data</a><p>- <a href="https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;website-content" rel="nofollow noreferrer">https:&#x2F;&#x2F;docs.helix.ml&#x2F;docs&#x2F;website-content</a><p>Sample sessions for SDXL:<p>- <a href="https:&#x2F;&#x2F;app.tryhelix.ai&#x2F;session&#x2F;e1b50789-a209-46c8-aa60-4d097af1aa8b" rel="nofollow noreferrer">https:&#x2F;&#x2F;app.tryhelix.ai&#x2F;session&#x2F;e1b50789-a209-46c8-aa60-4d09...</a><p>- <a href="https:&#x2F;&#x2F;app.tryhelix.ai&#x2F;session&#x2F;cc6004cd-111b-48ae-9a8c-d651f3ed45c8" rel="nofollow noreferrer">https:&#x2F;&#x2F;app.tryhelix.ai&#x2F;session&#x2F;cc6004cd-111b-48ae-9a8c-d651...</a><p>- <a href="https:&#x2F;&#x2F;app.tryhelix.ai&#x2F;session&#x2F;d50db369-4ffa-4a49-88dd-1cff05fee947" rel="nofollow noreferrer">https:&#x2F;&#x2F;app.tryhelix.ai&#x2F;session&#x2F;d50db369-4ffa-4a49-88dd-1cff...</a>