Built a tool for transforming unstructured data into structured outputs using language models (with 100% adherence).<p>If you're facing problems getting GPT to adhere to a schema (JSON, XML, etc.) or a regex, need to bulk-process some unstructured data, or want to generate synthetic data, check it out.<p>We run our own tuned model (you can self-host if you want), so we're able to exercise incredibly fine-grained control over text generation.<p>Repository: <a href="https://github.com/automorphic-ai/trex">https://github.com/automorphic-ai/trex</a><p>Playground: <a href="https://automorphic.ai/playground" rel="nofollow noreferrer">https://automorphic.ai/playground</a>
The more I work with LLMs, the more I realize that their true power lies not in the unstructured text they can generate, but in structured output. There are two approaches to achieving this:<p>1. LMQL/guidance/JSONformer/OP's post<p>2. finetuning the model to understand function calls and their (potentially) JSON schemas.<p>There was a comment here about OpenAI's approach (finetuning a model to understand function calls) which raised a good point: since finetuning is often forgetful (knowledge the model previously learnt gets partially overwritten), it's not clear whether OpenAI's approach has made GPT-4 less capable than it was before. Not to mention that you're still dealing with a statistical process (an LLM), not a locked-in algorithm that generates the desired schema 100% of the time.<p>Which brings me to the other approach: steering the LLM's output __as it is generating tokens__, which is what LMQL does. This results in lower token usage (you don't send the function schema as part of your prompt/message to OpenAI) and 100% accuracy, because the token probabilities themselves are modified (e.g., 0% chance of any character except ":" after the closing double quotation mark of a key).
I think the references to this being a “tool” that you can “self-host if you want” are a little disingenuous, especially after seeing that the linked GitHub project doesn’t mention the fact that it’s just a thin wrapper client making requests to a remote server until you’re halfway through the README. The product might be great, but introducing it to the community in this way doesn’t foster much trust in your company from the perspective of a potential customer.<p>The only reference I can find to this being a self-hosted model is a blurb in the GitHub README saying “If you'd like to self-host this in your own cloud, email us”. Sure, I can email my OpenAI/Microsoft rep and self-host GPT-4 in my own cloud for enough money too, but that doesn’t change the fact that the primary business model is SaaS. Just be up-front about this fact in community posts, rather than obfuscating it. Your website does a great job with that.
We're using a similar approach with OpenAI. The user defines a schema using zod and calls a prompt. We then use OpenAI functions behind the scenes to parse the answer into the shape the user wants. Add JSON schema validation on top, and we can be sure the response conforms to our schema. More details and examples can be found in this blog post: <a href="https://wundergraph.com/blog/beyond_functions_seamlessly_build_ai_enhanced_apis_with_openai" rel="nofollow noreferrer">https://wundergraph.com/blog/beyond_functions_seamlessly_bui...</a>
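The parse-then-validate step described above can be sketched in stdlib Python. The comment's actual stack is zod + TypeScript; the schema, validator, and sample response below are illustrative stand-ins, and the validator covers only a tiny subset of JSON Schema (types and required keys), not the full spec.

```python
import json

# Hypothetical target shape; in the comment's stack this would be a zod
# schema, converted to JSON Schema for the OpenAI function definition.
SCHEMA = {
    "type": "object",
    "required": ["name", "age"],
    "properties": {"name": {"type": "string"}, "age": {"type": "integer"}},
}

PY_TYPES = {"string": str, "integer": int, "object": dict}

def validate(obj, schema):
    """Check type, required keys, and property types (tiny JSON Schema subset)."""
    if not isinstance(obj, PY_TYPES[schema["type"]]):
        return False
    for key in schema.get("required", []):
        if key not in obj:
            return False
    for key, sub in schema.get("properties", {}).items():
        if key in obj and not isinstance(obj[key], PY_TYPES[sub["type"]]):
            return False
    return True

# Function-call arguments come back from the model as a JSON string:
# parse them, then validate before trusting the shape.
raw = '{"name": "Ada", "age": 36}'  # stand-in for the model's response
parsed = json.loads(raw)
assert validate(parsed, SCHEMA)
```

The validation layer matters because function calling alone is still a statistical process: the model usually emits the requested shape, but the explicit check is what turns "usually" into a guarantee at the application boundary.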
What are the benefits over <a href="https://github.com/microsoft/guidance/">https://github.com/microsoft/guidance/</a> ?
Looking at the playground, it appears the few shot examples in the prompt and CFG are duplicative. What is the relationship between the two?<p>When you say in another comment that using OpenAI functions to output JSON is a waste of tokens, how are you generating the JSON output? And why do your prompts then include few shot examples of JSON objects?
What's the difference between self-hosting this and manually running <a href="https://github.com/r2d4/parserllm">https://github.com/r2d4/parserllm</a>?
Oh, it's 100% predictable alright: predictably garbage. The default example picks the wrong "number" for height (whatever that's assumed to be), and if you try to redefine height as, say, "height in total inches", it still gets it wrong.