TechEcho

6 comments

For additional context:- Some more details on the building (and challenges) of the leaderboard <a href="https://livablesoftware.com/biases-llm-leaderboard/" rel="nofollow">https://livablesoftware.com/biases-llm-leaderboard/</a>- The tests used in the backend: <a href="https://github.com/SOM-Research/LangBiTe">https://github.com/SOM-Research/LangBiTe</a>

shikon7about 1 year ago

Rather than assessing whether the LLM has biases, the leaderboard seems to assess whether the LLM affirms the tester’s biases.Not that I blame them, as it’s probably impossible to define what a “no bias” exactly means.

Terrettaabout 1 year ago

Gene Roddenberry would like to have a word about the bias of the testers.

评论 #39804724 未加载

rastignackabout 1 year ago

Here are the (heavily biased and dishonest) prompts:<a href="https://github.com/SOM-Research/LangBiTe/blob/main/langbite/resources/prompts.csv">https://github.com/SOM-Research/LangBiTe/blob/main/langbite/...</a>

评论 #39806179 未加载

评论 #39805263 未加载

评论 #39806876 未加载

评论 #39805081 未加载

评论 #39805731 未加载

评论 #39805212 未加载

salesynerdabout 1 year ago

GPT-4 seems to be the least biased of all the LLMs. As a newbie to the field, does it mean that OpenAI have the most "balanced" data and/or does it do a great job in training their model? If the training is the secret sause of success, will it make sense for these companies to share their "best" data with each other?

评论 #39805327 未加载

评论 #39806861 未加载

djohnstonabout 1 year ago

Lazy, derivative, failing to account for any nuance and falling back to the same tired leftist talking points. This eval set could better be called “Am I the little parrot my master wants me to be?”The best LLMs will be the ones that don’t conform to this canned drivel, so presumably the bottom of the leaderboard is where to look. Thanks!

评论 #39809143 未加载

6 comments

softmodelingabout 1 year ago

shikon7about 1 year ago

Terrettaabout 1 year ago

Gene Roddenberry would like to have a word about the bias of the testers.

评论 #39804724 未加载

rastignackabout 1 year ago

评论 #39806179 未加载

评论 #39805263 未加载

评论 #39806876 未加载

评论 #39805081 未加载

评论 #39805731 未加载

评论 #39805212 未加载

salesynerdabout 1 year ago

评论 #39805327 未加载

评论 #39806861 未加载

djohnstonabout 1 year ago

评论 #39809143 未加载

LLM leaderboard focusing on assessing their biases

6 comments

LLM leaderboard focusing on assessing their biases

6 comments