We explored a novel method to gauge the significance of tokens in prompts given to large language models, without needing direct model access. Essentially, we just did an ablation study on the prompt using cosine similarity of the embeddings as the measure. We got surprisingly promising results when comparing this really simple approach to integrated gradients. Curious to hear thoughts from the community!
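For anyone curious, the core loop is roughly this (a simplified sketch, not our exact code: `embed` stands in for whatever text-embedding model you plug in, and splitting on whitespace stands in for real tokenization):

    import numpy as np

    def cosine(a, b):
        return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

    def token_importance(prompt, embed):
        # Naive whitespace "tokens"; a real tokenizer would differ.
        tokens = prompt.split()
        full = embed(prompt)
        scores = []
        for i in range(len(tokens)):
            ablated = " ".join(tokens[:i] + tokens[i + 1:])
            # The further the embedding moves when a token is dropped,
            # the more important that token is scored.
            scores.append(1.0 - cosine(full, embed(ablated)))
        return scores

Integrated gradients needs gradient access to the model itself, which is what makes the embedding-only version attractive when all you have is an API.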
Very interesting research!

Given that you're using cosine similarity of text embeddings to approximate the influence of individual tokens in a prompt, how does this approach fare in capturing higher-order interactions between tokens, something that Integrated Gradients (allegedly) is designed to account for? Are there specific scenarios where the cosine similarity method might fall short in capturing the nuances that Integrated Gradients can reveal?
What do you consider to be an “average length” prompt? How about a “long” prompt? You mention those in the text, and I’m curious about the token-length thresholds you’re seeing before performance degrades, and whether that varies when higher-importance tokens are distributed across the length versus clustered at the beginning.
Super cool! Tried it on a prompt of mine that compressed information using emojis, but those were given a low importance score. Switched the emojis out for plain text, which gets a higher importance score, and I'm seeing better results.