The Full Story of Large Language Models and RLHF

108 points by pk3 about 2 years ago

7 comments

galaxyLogic about 2 years ago
> LLMs with coding abilities could be employed to create sophisticated malware with unprecedented ease.

If that is possible, then shouldn't it also be possible to ask the AI to find vulnerabilities and code remediations for them?

So AI could be used to find all possible code vulnerabilities and then work out how to neutralize them? This would advance software security in general.

In other words, AI could be used like a microscope, discovering tiny defects in our software which are not visible to the naked eye. Like a microscope that detects viruses and thus allows us to guard against them. Like a COVID test.
shakes about 2 years ago
Does anyone know what happens if you do transfer learning in addition to scaling? It feels like people used to use transfer learning in lieu of scaling, and I haven't wrapped my head around how they work together.
chaxor about 2 years ago
One important bit that is often left out is that ChatGPT is not the first model to use RLHF to train LLMs.

As is typical in the AI field, Deepmind was key in the development of the process. Deepmind's Sparrow came out just before ChatGPT (regarding language modeling with RLHF), and much of the RLHF work was explored in their robotics/agent work just prior to its application to language.

OpenAI was integral in PPO, but it's important to know and understand that it isn't ChatGPT or OpenAI alone leading these advancements.
runnerup about 2 years ago
I found this to be a particularly lucid writeup of the past 5 years of advancement in LLMs. I sent it to some undergrads to read.
paulrchds about 2 years ago
I have been meaning to get a better overview of LLMs; this was a useful article.
sharemywin about 2 years ago
"From Giant Stochastic Parrots to Preference-Tuned Models"

I found this sub-title quite interesting.
1024core about 2 years ago
Word to the wise: the RLHF part comes 80% of the way down.