Show HN: Open Source, Splitscreen Prompt Engineering

2 points by BenGuz over 1 year ago
Hey HN!

In my experiments with prompt engineering, I ran into a problem: most prompts I'm designing can't be quantitatively tested because they don't have a right/wrong answer (e.g. providing essay feedback, deciphering corporate-speak in meeting minutes). That means I can't run evals, super powerful tools like ChainForge[1] are too high-overhead, and running one prompt at a time in ChatGPT... sucks.

I built Prompt Octopus[2] to evaluate as many prompts as I want, side by side, and it's sped up my workflow dramatically. You can plug in an API key online or self-host (I added Python + Node.js boilerplates in the repo). Click the Octopus icon in the top right to change your model type, see your history, and change the number of prompt-response boxes you're working with. I'm open sourcing it here and want your feedback, both on the UX and the self-hosting experience!

This week I hope to add diff checking, batch API calls to speed things up, and options to add more LLMs.

[1] https://chainforge.ai/
[2] https://promptoctopus.com
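For readers who want a feel for the side-by-side idea, here is a minimal sketch in Python. It is not Prompt Octopus's actual code: `fake_model`, `run_side_by_side`, and the `{input}` placeholder are all hypothetical names made up for illustration, and the model call is a stub you would swap for a real API client. It also uses plain concurrent calls, which is an assumption and not the same thing as a provider's batch endpoint.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict


def fake_model(prompt: str) -> str:
    """Hypothetical stand-in for a real LLM API call; it just echoes the prompt."""
    return f"[response to] {prompt}"


def run_side_by_side(
    templates: Dict[str, str],
    user_input: str,
    call_model: Callable[[str], str] = fake_model,
) -> Dict[str, str]:
    """Fill each prompt template with the same input and collect responses by label."""
    prompts = {label: t.format(input=user_input) for label, t in templates.items()}
    # Issue the calls concurrently so N variants take roughly as long as one.
    with ThreadPoolExecutor(max_workers=max(1, len(prompts))) as pool:
        futures = {label: pool.submit(call_model, p) for label, p in prompts.items()}
        return {label: f.result() for label, f in futures.items()}


if __name__ == "__main__":
    variants = {
        "terse": "Give one-sentence feedback on this essay: {input}",
        "detailed": "Give paragraph-level feedback on this essay: {input}",
    }
    for label, response in run_side_by_side(variants, "My essay text").items():
        print(f"{label}: {response}")
```

Keeping responses keyed by variant label makes it easy to lay them out next to each other and judge them by eye, which is the point when there is no right/wrong answer to score against.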

no comments
