科技回声

vinay427 about 1 year ago

For those interested in playing with or doing research using model internals, the TransformerLens [1] project appears to be the leading open-source tooling in this area. It supports loading dozens of different models, adding hooks, displaying activations in a format compatible with CircuitsVis [2], and other (mechanistic) interpretability work.

[1] https://github.com/neelnanda-io/TransformerLens

[2] https://github.com/alan-cooney/CircuitsVis

knlb2022 about 1 year ago

I gave a short talk on how to really push hooks for logging intermediate values (including capturing gradients from torch & fx scripted modules) that may be useful: https://static.sched.com/hosted_files/pytorch2023/40/Intermediate%20Logging%20_%20PyTorch%20Conference%202023.pdf
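
For readers who have not used hooks before, here is a minimal sketch of logging intermediate values with PyTorch's `nn.Module.register_forward_hook`. It is not taken from the talk above. The toy model, the layer sizes, and the names `activations` and `make_hook` are assumptions made purely for illustration.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical toy model; the layer sizes are arbitrary.
model = nn.Sequential(
    nn.Linear(4, 8),
    nn.ReLU(),
    nn.Linear(8, 2),
)

# Maps a layer label to the output that layer produced on the last forward pass.
activations = {}

def make_hook(name):
    # A forward hook is called as hook(module, inputs, output) after forward().
    def hook(module, inputs, output):
        # detach() so the stored tensor does not keep the autograd graph alive.
        activations[name] = output.detach()
    return hook

# Register one hook per layer and keep the handles so they can be removed later.
handles = [
    layer.register_forward_hook(make_hook(f"{idx}:{type(layer).__name__}"))
    for idx, layer in enumerate(model)
]

x = torch.randn(3, 4)
with torch.no_grad():
    y = model(x)

# Remove the hooks once done so later forward passes are not logged.
for handle in handles:
    handle.remove()

shapes = {name: tuple(act.shape) for name, act in activations.items()}
print(shapes)
```

Keeping the handles returned by `register_forward_hook` and calling `remove()` on them avoids leaving stale hooks attached to the model, which is an easy mistake to make in notebooks.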

jph00 about 1 year ago

FYI, for anyone interested in creating and using hooks to better understand what's happening in your model, I created a free lesson covering that:

https://course.fast.ai/Lessons/lesson17.html

jey about 1 year ago

(2020)

Intermediate Activations – the forward hook (2020)

4 comments
