
Is Grok Basically Just an OpenAI Wrapper?

82 points · by bundie · over 1 year ago

11 comments

minimaxir · over 1 year ago
An xAI engineer replied:

> The issue here is that the web is full of ChatGPT outputs, so we accidentally picked up some of them when we trained Grok on a large amount of web data. This was a huge surprise to us when we first noticed it. For what it's worth, the issue is very rare and now that we're aware of it we'll make sure that future versions of Grok don't have this problem. Don't worry, no OpenAI code was used to make Grok.

https://twitter.com/ibab_ml/status/1733558576982155274
andy_xor_andrew · over 1 year ago
It's far, far more likely that Grok was simply trained on data that includes text generated by GPT.

This is super common in the open-source/local AI space. Many models are trained on output from GPT. The better models filter out anything that mentions GPT or OpenAI. It seems Grok is not one of the better models.
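A minimal sketch of the kind of filtering these comments describe: drop any training sample that self-identifies as ChatGPT or carries OpenAI's boilerplate phrasing. The phrase list and function names below are illustrative assumptions, not any lab's actual pipeline.

```python
import re

# Illustrative phrase list (an assumption for this sketch, not any lab's real filter).
CONTAMINATION_PATTERNS = [
    r"\bChatGPT\b",
    r"\bOpenAI\b",
    r"as an AI language model",
]
_CONTAMINATION_RE = re.compile("|".join(CONTAMINATION_PATTERNS), re.IGNORECASE)


def is_contaminated(sample: str) -> bool:
    """Return True if a training sample looks like leaked ChatGPT output."""
    return bool(_CONTAMINATION_RE.search(sample))


def filter_dataset(samples: list[str]) -> list[str]:
    """Keep only samples that don't mention GPT/OpenAI boilerplate."""
    return [s for s in samples if not is_contaminated(s)]


if __name__ == "__main__":
    data = [
        "Grok is a chatbot built by xAI.",
        "I'm sorry, but as an AI language model developed by OpenAI, I can't do that.",
    ]
    print(filter_dataset(data))  # only the first sample survives
```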
sschueller · over 1 year ago
I believe it is. I've had people send me screenshots where it refers to itself as ChatGPT.
ronsor · over 1 year ago
This isn't really that newsworthy. Grok was probably trained on ChatGPT logs, just like the litany of other chat-oriented LLMs that are also open source. They still should've filtered out the OpenAI canned responses (there are datasets with that filter).
berkes · over 1 year ago
What's Grok? (I dropped off Twitter/X the moment the inmates started running that asylum again. It appears to have something to do with X?)
LorenDB · over 1 year ago
I had a self-hosted Llama 2 call itself Bard the other day, so I'd take this with a grain of salt - or maybe even the whole shaker.
alsodumb · over 1 year ago
A simple explanation is that Grok probably used the GPT-3.5 or GPT-4 API to generate synthetic data, most likely for RLHF.
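A hedged sketch of what that kind of synthetic-data generation could look like, assuming the `openai` Python SDK (v1.x) with an OPENAI_API_KEY in the environment; the seed prompts, model name, and output format are illustrative assumptions, not a claim about xAI's actual setup.

```python
# Sketch only: generate synthetic (prompt, completion) pairs with the OpenAI API,
# the kind of data a lab might later use for supervised fine-tuning or RLHF.
# Assumes the `openai` Python SDK v1.x and OPENAI_API_KEY set in the environment.
from openai import OpenAI

client = OpenAI()  # picks up OPENAI_API_KEY automatically

# Hypothetical seed prompts, purely illustrative.
SEED_PROMPTS = [
    "Explain what a large language model is in two sentences.",
    "Politely refuse a request for personalized medical advice.",
]


def generate_synthetic_pairs(prompts, model="gpt-4"):
    """Call the chat completions endpoint and return (prompt, completion) pairs."""
    pairs = []
    for prompt in prompts:
        resp = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": prompt}],
        )
        pairs.append((prompt, resp.choices[0].message.content))
    return pairs


if __name__ == "__main__":
    for prompt, completion in generate_synthetic_pairs(SEED_PROMPTS):
        print(f"{prompt}\n-> {completion[:100]}...\n")
```

Data produced this way inherits OpenAI's phrasing and canned refusals, which is exactly why the filtering step discussed above matters.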
hn_throwaway_99 · over 1 year ago
I feel like the comments defending Twitter (I'm still calling it that until they change their domain) are half missing the point. True, it's not necessarily an "OpenAI API wrapper", but the fact that it was trained on ChatGPT's logs basically means it's still an "OpenAI wrapper" of sorts, and it's going to be inferior in nearly every way (I always thought a huge problem for LLMs going forward was the risk that they would be "contaminated" with non-human training data).

I can't think of any reason anybody would use Grok over ChatGPT besides political tribe signalling.
simonw · over 1 year ago
Grok is a new model. Its training data included enough examples of OpenAI-generated content that it occasionally spits out text like this, which makes it look like it's by OpenAI.

This is a common issue across all sorts of other alternative models too. It's not particularly surprising.
qarl · over 1 year ago
I wonder why they decided not to filter out references to OpenAI and ChatGPT.
gafage · over 1 year ago
It makes no sense to me that one would train a chatbot on ChatGPT conversations and not filter out strings that literally say "openai" and "chatgpt". Extreme incompetence.