TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: Open Source AI Playground for Prompt Engineers

5 点作者 theanirudh4 个月前
Hi HN,<p>We&#x27;re a small team building AI tutors out of India, and as you might guess, this means we spend a ton of time writing, testing, and refining prompts for LLMs. When we started out, we were using the OpenAI playground but things became tedious when we wanted to compare responses from different models. We tried a bunch of other playgrounds but found them lacking in some features so we built our own.<p>Quick Links:<p>Github: <a href="https:&#x2F;&#x2F;github.com&#x2F;supernova-app&#x2F;ai-playground">https:&#x2F;&#x2F;github.com&#x2F;supernova-app&#x2F;ai-playground</a><p>Hosted demo: <a href="http:&#x2F;&#x2F;playground.getsupernova.ai" rel="nofollow">http:&#x2F;&#x2F;playground.getsupernova.ai</a><p>Demo video: <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=I01_t75FT-c" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=I01_t75FT-c</a><p>TLDR:<p>Main features are:<p>- Monaco editor for writing prompts.<p>- Variable support in prompts {{}}.<p>- Syntax highlighting for tags like XML.<p>- Generate multiple completions with same model.<p>- Chat with multiple models simultaneously.<p>- Save prompt and conversations as JSON.<p>- Easy to self host.<p>Key Features:<p>1. Monaco Editor for Writing Prompts<p>When we were working on long, detailed prompts, writing them in plain text felt clunky and error-prone. Small issues—like missing a tag or having weird formatting—could break things.<p>So, we integrated the Monaco editor (used in VS Code). It gives us:<p>- Line numbers (so we don&#x27;t get lost in long prompts).<p>- White space detection to catch formatting issues early.<p>- Syntax highlighting for tags like XML.<p>- Code folding to collapse parts of a prompt we&#x27;re not actively working on.<p>These might sound like small things, but they&#x27;ve been a huge help when we&#x27;re dealing with large, complex prompts that need constant tweaking.<p>2. Variable Support for Dynamic Prompts<p>You can define placeholders in your prompt using double curly braces ({{ }}) and fill them in via a friendly UI.<p>3. Testing for Consistency Across Completions<p>One of the hardest parts of building AI tutors has been ensuring reliable outputs. Even when a prompt seems fine, it can fail unexpectedly—or worse, it works sometimes but not always.<p>To address this, we made it easy to generate multiple completions from the same model at once. This lets us quickly see:<p>- If the prompt is consistently producing good results.<p>- Where the AI might misinterpret our intent.<p>For example, we&#x27;d often run 5–6 completions to see if the AI consistently understood our instructions, rather than getting lucky once or twice.<p>4. Comparing Models Side by Side<p>The main reason we built the playground in the first place. You can set up API keys for multiple providers and see how different models handle the same task. This helped us:<p>- Optimize prompts for specific models.<p>- Choose the best model for a particular use case.<p>5. Saving Conversations as Test Cases<p>Another pain point was testing how a prompt or conversation would evolve over time. Sometimes, we&#x27;d go back and forth with the AI to simulate real user interactions, but we had no way to save that conversation for future reference.<p>Now, with the playground, we can save these conversations as test cases. Here&#x27;s how it works: 1. We create a conversation (or simulate a long interaction). 2. At any point, we can save it as a JSON file. 3. The JSON includes the full conversation, the system prompt, and any variables we used.<p>We then use this JSON file and use it in our code for running test cases or run evals.<p>6. Simple Self-Hosting<p>Finally, we wanted to make sure the playground was easy for others to set up. The only dependencies are:<p>- A Postgres database.<p>- API keys for the AI providers you want to use.<p>It supports Google login, so you can setup OAuth and can restrict access to only your domain.<p>The app is open source and we are running a hosted version of it here: <a href="http:&#x2F;&#x2F;playground.getsupernova.ai" rel="nofollow">http:&#x2F;&#x2F;playground.getsupernova.ai</a>.<p>You can check out the repo here: <a href="https:&#x2F;&#x2F;github.com&#x2F;supernova-app&#x2F;ai-playground">https:&#x2F;&#x2F;github.com&#x2F;supernova-app&#x2F;ai-playground</a>. It&#x27;s easy to self-host, and we&#x27;re actively working on new features.<p>If you give it a try, let us know what you think! Feedback, feature ideas, and contributions are all welcome.

暂无评论

暂无评论