TechEcho

17 comments

gdbabout 2 years ago

(I work at OpenAI.)This document is a preview of the underlying format consumed by ChatGPT models. As an API user, today you use our higher-level API (<a href="https://platform.openai.com/docs/guides/chat" rel="nofollow">https://platform.openai.com/docs/guides/chat</a>). We'll be opening up direct access to this format in the future, and want to give people visibility into what's going on under the hood in the meanwhile!

评论 #34989493 未加载

评论 #34989984 未加载

评论 #34994587 未加载

评论 #34989539 未加载

评论 #34991720 未加载

评论 #34994477 未加载

评论 #34989813 未加载

评论 #34995151 未加载

explaininjsabout 2 years ago

Is it just me or is this the least intuitive format imaginable? The type def is something like:<pre><code> type Message = string type Speaker = 'system' | 'user' | 'assistant' | 'system name=example_user' | 'system name=example_assistant' type CML = ('\n' | '${Speaker}\n${Message}' | {token: '<im_start>'|'<im_end>'})[] </code></pre> I'd expect something more like...<pre><code> type Message = string type Speaker = 'system' | 'user' | 'assistant' | 'example_user' | 'example_assistant' type CML = {message: Message, speaker: Speaker}[]</code></pre>

评论 #34989199 未加载

creatonezabout 2 years ago

Why do you think Bing Chat is going in a slightly different direction and not using this format exactly?Their prompt - <a href="https://old.reddit.com/r/bing/comments/11bd91j/release_of_the_whole_initial_prompt_of_bing_chat/" rel="nofollow">https://old.reddit.com/r/bing/comments/11bd91j/release_of_th...</a>Wouldn't it be better to unify around ChatML, for the sake of all future training data being consistent? I thought it was strange that they used <|im_start|> but not the rest of the ChatML syntax.(There is always the possibility that they are using it, but the AI has hallucinated a slightly different syntax when repeating it)

评论 #34994769 未加载

评论 #34990050 未加载

评论 #34990347 未加载

transitivebsabout 2 years ago

Here is a leaked google doc with a lot more details: <a href="https://docs.google.com/document/d/1mYBAIilR8IcIfzvIfrsayAU_XJJ-w5Oi6zYY53g0LFs/edit?usp=sharing" rel="nofollow">https://docs.google.com/document/d/1mYBAIilR8IcIfzvIfrsayAU_...</a>This was copied from OpenAI alpha documentation.

staticautomaticabout 2 years ago

In what way is this a “list of dicts”? It’s an implicitly structured list of dict, str pairs. And it is WEIRD.

评论 #34989774 未加载

评论 #34994155 未加载

interleaveabout 2 years ago

I just rewired our project from <|im_start|><|im_end|> to use the { "role" : "user", "content" : "Hi!" } format.Naming-wise, the JSON format is not ChatML, right? Does it have a name yet?(This is what I'm working off of: <a href="https://platform.openai.com/docs/api-reference/chat/create" rel="nofollow">https://platform.openai.com/docs/api-reference/chat/create</a>)

cancelselfabout 2 years ago

ChatML documents consists of a sequence of messages. Each message contains a header and contents. The current version (ChatML v0) can be represented with a JSON format.

sterlindabout 2 years ago

huh neat. a couple weeks back I had a conversation with ChatGPT where I asked it if it had special control tokens it could emit, and after a lot of coaxing I got it to tell me about <|im_end|>. I wasn't sure if it hallucinated it but I guess not?

mirekrusinabout 2 years ago

Do you have to prefix every user query with a long list of examples to create context (and be billed for it extra for every query)?Or there is a way to create context with some prelude and then use it for subsequent queries?Let's say I want to create quick help for SQL, where the prelude would be schema snapshot and some examples.Do I need to flood every user query with this long prefix of sql schema snapshot with examples?I don't want one user conversatio to interfere with other user query.

评论 #34992275 未加载

culiaoabout 2 years ago

What is the best way for a response to come back simply as an integer? So asking, what is the avg. cost of a bagel in new york? I've been able to do it by asking for no text - but wondering if there is an alternative i'm missing.

评论 #34995082 未加载

ariveroabout 2 years ago

Amusingly when I asked bing about this format hallucinated that there was also extra tokens such as <|mood|> to set style and tome. Take note @gdb :-D

mark_and_sweepabout 2 years ago

I wonder why they are not using XML.

barefegabout 2 years ago

What is the plan to solve injections using this lower level representation?

bacchusracineabout 2 years ago

Can I use ChatGPT to rewrite everything to this format?

CMLababout 2 years ago

It is worth discussing whether the domain specific language design for ChartML is necessary for users.

raldiabout 2 years ago

What does "im" stand for?

评论 #34989265 未加载

评论 #34989193 未加载

评论 #34989210 未加载

born-jreabout 2 years ago

> This gives an opportunity to mitigate and eventually solve injectionsNo more jailbreaks!!!!

评论 #34993400 未加载

17 comments

gdbabout 2 years ago

评论 #34989493 未加载

评论 #34989984 未加载

评论 #34994587 未加载

评论 #34989539 未加载

评论 #34991720 未加载

评论 #34994477 未加载

评论 #34989813 未加载

评论 #34995151 未加载

explaininjsabout 2 years ago

评论 #34989199 未加载

creatonezabout 2 years ago

评论 #34994769 未加载

评论 #34990050 未加载

评论 #34990347 未加载

transitivebsabout 2 years ago

staticautomaticabout 2 years ago

In what way is this a “list of dicts”? It’s an implicitly structured list of dict, str pairs. And it is WEIRD.

评论 #34989774 未加载

评论 #34994155 未加载

interleaveabout 2 years ago

cancelselfabout 2 years ago

ChatML documents consists of a sequence of messages. Each message contains a header and contents. The current version (ChatML v0) can be represented with a JSON format.

sterlindabout 2 years ago

mirekrusinabout 2 years ago

评论 #34992275 未加载

culiaoabout 2 years ago

评论 #34995082 未加载

ariveroabout 2 years ago

Amusingly when I asked bing about this format hallucinated that there was also extra tokens such as <|mood|> to set style and tome. Take note @gdb :-D

mark_and_sweepabout 2 years ago

I wonder why they are not using XML.

barefegabout 2 years ago

What is the plan to solve injections using this lower level representation?

bacchusracineabout 2 years ago

Can I use ChatGPT to rewrite everything to this format?

CMLababout 2 years ago

It is worth discussing whether the domain specific language design for ChartML is necessary for users.

raldiabout 2 years ago

What does "im" stand for?

评论 #34989265 未加载

评论 #34989193 未加载

评论 #34989210 未加载

born-jreabout 2 years ago

> This gives an opportunity to mitigate and eventually solve injectionsNo more jailbreaks!!!!

评论 #34993400 未加载

ChatML: ChatGPT API expects a structured format, called Chat Markup Language

17 comments

ChatML: ChatGPT API expects a structured format, called Chat Markup Language

17 comments