TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Building AI Products–Part I: Back-End Architecture

162 点作者 rafaelferreira5 个月前

11 条评论

xrd5 个月前
This article is written by an engineer, first and foremost.<p>Many of the APIs or LLM extensions provided by AI companies are written by ML engineers that do not have Phil&#x27;s decades of experience in distributed systems, databases and networking. That is evident after reading this article; the first time I&#x27;ve seen a coherent discussion of the tools and tradeoffs when building agentic systems.<p>I&#x27;ve struggled to actually build something useful with the &quot;agentic&quot; systems and tools out there (and I&#x27;ve tried a lot). Deep down I&#x27;ve felt intimidated by the dozens of new terms the docs use, and after reflection, those tech marketing pieces give the vibe that they are written primarily by AI and told to be colorful and not clear and precise. These solutions from billion dollar valued companies must to present &quot;brand new&quot; ideas to justify their valuations. We should know better: everything builds on the shoulders of decades of research and discovery. If you see something flying high in the clouds (and not standing on the shoulders of giants), it is sure to fall back to earth soon.<p>A great read. I&#x27;m very excited about Outropy.
评论 #42575217 未加载
评论 #42522113 未加载
svilen_dobrev5 个月前
&gt; durable workflows<p>This is what long-running-transactions of the past became.. and slowly cover all their ground (initially Cadence by Uber, then Temporal). Zillions of little flows that can go through their FSMs at any speed, (milli)seconds-or-days-or-months-or-whenever.<p>i wonder though, how much some further developments like Cloudflare&#x27;s durable objects, or similar recently announced Rivet actions [1] would simplify (or, complicate) matters, esp. in this &quot;agentic&quot; case ?<p><a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=42472519">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=42472519</a>
nikolayasdf1235 个月前
&gt; Agents are not Microservices<p>&gt; Agents naturally align with OOP principles: they maintain encapsulated state (their memory), expose methods (their tools and decision-making capabilities via inference pipelines), and communicate through message passing<p>it does sound like a service (memory=db,methods+messages=api). it is just the level of isolation&#x2F;deployment you need<p>UPD: also, how come your services share database layer (?), maybe problems in scaling are not due to Agents at all? do you have scaling issues even without agents? would not be surprised! classic rule form Amazon 2002 API mandate by Bezos &quot;no shared db between services. all communication happens over exposed interfaces and over network&quot;.
评论 #42575625 未加载
评论 #42522030 未加载
theptip5 个月前
Great breakdown of the &quot;architectural decision log&quot; for the evolution of this system.<p>&gt; This model broke down when we added backpressure and resilience patterns to our agents. We faced new challenges: what happens when the third of five LLM calls fails during an agent’s decision process? Should we retry everything? Save partial results and retry just the failed call? When do we give up and error out?”<p>&gt; We first looked at ETL tools like Apache Airflow. While great for data engineering, Airflow’s focus on stateless, scheduled tasks wasn’t a good fit for our agents’ stateful, event-driven operations.<p>&gt; I’d heard great things about Temporal from my previous teams at DigitalOcean. It’s built for long-running, stateful workflows, offering the durability and resilience we needed out of the box.<p>I would also have reached for workflow engines here. But I wonder if Actor frameworks might actually be the sweet spot; something like Erlang&#x27;s distributed actor model could be a good fit. I&#x27;m not familiar with a good distributed Actor framework for Python but there&#x27;s of course Elixir, Actix, Akka in other stacks.<p>Coming from the other direction, I&#x27;m not surprised that Airflow isn&#x27;t fit for this purpose, but I wonder if one of the newer generation of ETL engines like Dagster would work? Maybe the workflow here just involves too many pipelines (one per customer per Agent, I suppose), and too many Sensor events (each Slack message would get materialized, not sure if that&#x27;s excessive). Could be a fairly substantial overhaul to the architecture vs. Temporal, but I&#x27;d be interested to know if anyone has experimented with this option for AI workflows.
karmasimida5 个月前
I don&#x27;t see AI system too special in terms of back-end engineering, except maybe for agentic system, things are inherently stateful.<p>But considering how limited RPM&#x2F;TPM with regards mainstream LLMs, states saving&#x2F;loading is hardly the bottleneck I feel.
评论 #42575658 未加载
upghost5 个月前
Color me impressed. These guys get it right because they treat LLMs like what they are -- tools with a specific use, not anthropomorphized pets. (although I did groan a bit at the &quot;AI Chief of Staff&quot; moniker).<p>It&#x27;s extremely refreshing to hear an actual engineering conversation around LLMs that doesn&#x27;t sound like it came out of the pages of an undergraduate alchemy notebook.
AYBABTME5 个月前
I came to the same conclusion about Temporal for these types of things. Interactive stuff that touches 1 DB? Do it in the API. Needs to coordinate &gt;1 thing? Temporal.<p>Orchestrating a bunch of LLM calls is a perfect fit for Temporal.
评论 #42575635 未加载
iandanforth5 个月前
Thanks for the excellent article. It&#x27;s hard to find these step by step architecture evolution retrospectives. A great reference for other startups going though a similar journey!
jarbus5 个月前
Great article, really enjoyed how they described what they initially tried, where it struggled, and why their current solution works better.
xwowsersx5 个月前
Well written. It&#x27;s a rare pleasure to hear a discussion about LLMs grounded in real engineering, free from the fanciful notions often found in all the other spam out there.
asah5 个月前
crazy idea: could quantum entangled communication help soften CAP ? (e.g. by allowing limited communication between partitions)