TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Beyond Text: On-Demand UI Generation for Better Conversational Experiences

77 点作者 fka7 天前

12 条评论

sheo7 天前
I think that the example in the article is not a good usecase for this technology. It would be better, cheaper and less error prone to have prebuilt forms that LLM can call like tools, at least for things like changing shipping address<p>Shipping forms usually need verification of addresses, sometimes they even include a map<p>Especially if on the other end data that would be inputted in this form, would be stored in the traditional DB<p>Much better usecase would be use it in something, that is dynamic by nature. For example, advanced prompt generator for image generation models (sliders for size of objects in a scene; dropdown menus with variants of backgrounds or style, instead of usual lists)
评论 #44009287 未加载
jFriedensreich7 天前
I was working on exactly this in gpt 3 days and still believe ad hoc generation of super specifc and contextual relevant UIs will solve a lot of problems and friction that purely textual or speech based conversational interfaces pose especially if the UI elements like sliders provide some form of live feedback of their effect and are possible to scroll back to or pin and make changes anytime.
评论 #44006257 未加载
评论 #44009045 未加载
ActionHank7 天前
I really believe this is the future.<p>Conversations are error prone and noisy.<p>UI distills down the mode of interaction into something defined and well understood by both parties.<p>Humans have been able to speak to each other for a long time, but we fill out forms for anything formal.
评论 #44008426 未加载
评论 #44006013 未加载
评论 #44005809 未加载
wddlz7 天前
Related to this: Here is some recently published research we did at Microsoft Research on generating UX for prompt refinements based on the user prompt and other context (case study: <a href="https:&#x2F;&#x2F;www.iandrosos.me&#x2F;promptly.html" rel="nofollow">https:&#x2F;&#x2F;www.iandrosos.me&#x2F;promptly.html</a>, paper link also in intro).<p>We found it lowered barriers to providing context to AI, improved user perception of control over AI, and provided users guidance for steering AI interactions.
joshstrange7 天前
Related, it’s crazy to me that OpenAI hasn’t already done something like this for Deep Research.<p>After your initial question, it always follows up asking some clarifying questions, but it’s completely up to the user to format their responses and I always wonder if people are sloppy if the LLM gets confused. It would make much more sense for OpenAI to break out each question and have a dedicated answer box. That way the user’s response can be consistent and there’s less of a chance they make a mistake or forget to answer a question.
评论 #44006374 未加载
评论 #44004520 未加载
jmull7 天前
This seems much worse than the typical pre-AI mechanism of navigating to and clicking on a &quot;Change Delivery Address&quot; button.<p>I don&#x27;t know why you wouldn&#x27;t develop whatever forms you wanted to support upfront and make them available to the agent (and hopefully provide old-fashioned search). You can still use AI to develop and maintain the forms. Since the output can be used as many times as you want, you can probably use more expensive&#x2F;capable models to develop the forms rather than cheaper&#x2F;faster but less capable models that you&#x27;re probably limited to for customer service.
exe347 天前
I was hoping to do this over IRC but never got around to implementing it. I hate the idea of implementing a whole website&#x2F;chat system, when they already exist. I&#x27;d like to use it for my (currently in-existent) home automation communication.
评论 #44004409 未加载
banga7 天前
Semantic clarity of written prose is hard, but this approach seems like making it easier for the machines rather than the other way around.
maxcan7 天前
Video isn&#x27;t loading.
评论 #44004291 未加载
aatd867 天前
that&#x27;s not a very innovative idea or even better UX. I think that the future wil have to do with voice commands and mcps will be the backend, exposing capabilities.
评论 #44005653 未加载
评论 #44006450 未加载
utku13377 天前
looks very useful
casey27 天前
If it could have been done it would have by now
评论 #44004413 未加载
评论 #44004733 未加载