LLMs know more than what they say

148 points by nqnielsen 9 months ago

5 comments

sanxiyn 9 months ago
I believe the paper to be cited is "The Internal State of an LLM Knows When It's Lying", published last year: https://arxiv.org/abs/2304.13734
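A rough sketch of that paper's core idea, for readers who don't follow the link: train a small classifier (a probe) on the LLM's hidden-layer activations to predict whether a statement is true or false. The model name, layer choice, and the tiny statement set below are illustrative placeholders, not the paper's actual setup.

```python
# Probe an LLM's hidden states for truthfulness -- a toy illustration only.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from sklearn.linear_model import LogisticRegression

model_name = "gpt2"  # placeholder; the paper works with larger LLMs
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, output_hidden_states=True)
model.eval()

def last_token_state(text: str, layer: int = -1) -> torch.Tensor:
    """Hidden state of the final token at the chosen layer."""
    with torch.no_grad():
        out = model(**tok(text, return_tensors="pt"))
    return out.hidden_states[layer][0, -1]

# A handful of labeled statements (1 = true, 0 = false); a real probe needs far more.
statements = [
    ("The capital of France is Paris.", 1),
    ("The capital of France is Rome.", 0),
    ("Water boils at 100 degrees Celsius at sea level.", 1),
    ("Water boils at 10 degrees Celsius at sea level.", 0),
]
X = torch.stack([last_token_state(s) for s, _ in statements]).numpy()
y = [label for _, label in statements]

probe = LogisticRegression(max_iter=1000).fit(X, y)
print(probe.predict(X))  # how well the probe separates true from false activations
```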
autokad 9 months ago
If what I understand is correct, that they project the LLM's internal activations into meaningful linear directions derived from contrasting examples, I guess this is similar to how we began to derive a lot more value from embeddings by using the embedding values for various things.
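A minimal sketch of the contrast-direction idea described in that comment (my reading of it, not necessarily the article's exact recipe): average the hidden activations of two contrasting sets of examples, use the difference of means as a direction, and score new activations by projecting onto it. The random arrays below are toy stand-ins for real per-example hidden states.

```python
# Derive a linear direction from contrasting examples and project activations onto it.
import numpy as np

def contrast_direction(pos_acts: np.ndarray, neg_acts: np.ndarray) -> np.ndarray:
    """Unit vector pointing from the mean negative activation to the mean positive one."""
    d = pos_acts.mean(axis=0) - neg_acts.mean(axis=0)
    return d / np.linalg.norm(d)

def project(acts: np.ndarray, direction: np.ndarray) -> np.ndarray:
    """One scalar score per activation: how far it lies along the direction."""
    return acts @ direction

# Toy stand-ins for per-example hidden states, shape (n_examples, hidden_dim).
rng = np.random.default_rng(0)
hidden_dim = 16
pos = rng.normal(loc=+1.0, size=(8, hidden_dim))   # e.g. activations on one kind of example
neg = rng.normal(loc=-1.0, size=(8, hidden_dim))   # e.g. activations on the contrasting kind

direction = contrast_direction(pos, neg)
scores = project(rng.normal(size=(3, hidden_dim)), direction)
print(scores)  # higher score = further toward the "positive" side of the contrast
```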
uiDevofNW 9 months ago
This is a stupid argument. I wish the author understood an ounce of how LLMs work. Of course they know more than what they say. That's because LLMs are nothing but probabilistic structures. They mix and match and provide a probabilistic approach. Therefore, they are always making a choice between multiple options.

I wish there was a global mandatory course before these substacky authors write for fame.
tarasglek 9 months ago
This looks cool, but I'm confused as to how this is surfaced in your product; llama-8 is not present in your model list.

I thought maybe you offer hallucination detection, but I also don't see that. RAG evals also not visible.
nqnielsen 9 months ago
And how to use LLM interpretability research for applied evaluation