TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: High-performance GenAI engine now open source

22 点作者 fryz大约 1 个月前
Hey HN<p>After one too many customer firedrills regarding hallucinating or insecure AI models, we built a system to catch these issues before they reached production. The Arthur Engine has been running in Fortune 100 to AI Native Start-Ups over the past two years, putting security controls around more than 10 billion tokens in production every month. We&#x27;re now opening up this service to developers, enabling you to leverage enterprise-grade solutions to provide guardrails and evals as a service, all for free.<p>Get it on Github (<a href="https:&#x2F;&#x2F;github.com&#x2F;arthur-ai&#x2F;arthur-engine">https:&#x2F;&#x2F;github.com&#x2F;arthur-ai&#x2F;arthur-engine</a>) to start evaluating your models today<p>Highlights of Arthur&#x27;s Engine include:<p>* Built for speed and scale: It performs well with p90 latencies of sub-second well over 100+ RPS<p>* Made for full lifecycle support: Ideal for pre-production validation, real-time guardrails, and post-production monitoring.<p>* Ease of use: It is designed to be easy for anyone to run and deploy whether you&#x27;re working on it locally during development, or you&#x27;re deploying it within a horizontally-scaling architecture for large-scale workloads.<p>* Unification of generative and traditional AI: The Arthur AI Engine can be used to evaluate a diverse range of models from LLMs and Agentic AI systems to binary classifiers, regression models, recommender systems, forecasting models, and more.<p>* Content-specific guardrail and detection features: Ranging from toxicity and hallucination detection to sensitive data (like PII, keyword&#x2F;regex and custom rules) and prompt injection.<p>* Customizability: Plug in your own models or integrate with other model or guardrail providers with ease, and tailor the system to match your specific needs.<p>Having been first-hand witnesses to the lack of adequate AI monitoring tools and the general under delivery of Gen AI systems in production, we believe that such a capability shouldn&#x27;t be exclusive to big-budget organizations. Our mission is to make AI better, for everyone, and we believe by opening up this tool we can help more people get to that goal.<p>Check out our GitHub repo for examples and directions on how to use the Arthur AI Engine for various purposes such as validation during development, real-time guardrails or performance troubleshooting using enriched logging data. (<a href="https:&#x2F;&#x2F;github.com&#x2F;arthur-ai&#x2F;engine-examples">https:&#x2F;&#x2F;github.com&#x2F;arthur-ai&#x2F;engine-examples</a>)<p>We can’t wait to see what you build<p>— Zach and Team Arthur

10 条评论

serguei大约 1 个月前
We&#x27;ve been ramping up our gen ai usage for the last ~month at Upsolve and it&#x27;s becoming a huge pain. There are already a million solutions for observability out there, but I like that this one is open source and can detect hallucinations<p>Thanks for open sourcing and sharing, excited to try this out!!
评论 #43785998 未加载
kacperek0大约 1 个月前
Cool, I&#x27;m running few GenAI automations, but they&#x27;re rather unsupervisored. So I&#x27;m gonna try it and check how they&#x27;re doing.
Lupita___大约 1 个月前
Thanks for sharing! This looks perfect for teams getting started with monitoring for all model types -- excited to try it out!
iabouhashish大约 1 个月前
Very excited to be trying this out! The examples look very useful and excited to tie it up with other open source solutions
pierniki大约 1 个月前
Yoo! Hopefully no more &quot;oops our AI just leaked the system prompt&quot; moments thanks to these guardrails!
madeleinelane大约 1 个月前
Love this. More transparency + better tooling is exactly what AI needs right now. Excited to give it a try.
Gabriel_h大约 1 个月前
Interesting, AI needs much better guardrails and monitoring!
jdbtech大约 1 个月前
Looks great! How does the system detect hallucinations?
评论 #43785973 未加载
评论 #43784575 未加载
vparekh1995大约 1 个月前
Excited to get hands on with this. I&#x27;ve had too many sleepless nights trying to figure out how to track when my agents were hallucinating.
cipherchain111大约 1 个月前
Very cool!