At 5 cents per neuron with 4o-mini, the descriptions are pretty satisfying.

"we fine-tune Llama-3.1-8B-Instruct to directly predict per-token activations ... [this] allows us to use smaller models, and the task of directly predicting the output (integer from 0-10) gets rid of the extra tokens, making the prompt much shorter."
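
For anyone curious what that setup could look like in practice, here's a minimal sketch of a single fine-tuning record for the "directly predict an integer 0-10 per token" task. The field names, prompt wording, and quantization are my own guesses for illustration, not the paper's actual format:

```python
# Sketch (my reconstruction, not the paper's code): the prompt carries the
# neuron explanation plus the token sequence, and the target is only the
# per-token activations quantized to integers 0-10, so no extra tokens.

import json

def quantize(activations, max_activation):
    """Map raw activations onto the 0-10 integer scale (assumed scheme)."""
    return [round(10 * max(a, 0.0) / max_activation) for a in activations]

def make_example(explanation, tokens, activations, max_activation):
    prompt = (
        "Neuron explanation: " + explanation + "\n"
        "Tokens: " + " ".join(tokens) + "\n"
        "Predict the activation (0-10) for each token:"
    )
    target = " ".join(str(a) for a in quantize(activations, max_activation))
    # Chat-style record usable with most SFT trainers for Llama-3.1-8B-Instruct.
    return {
        "messages": [
            {"role": "user", "content": prompt},
            {"role": "assistant", "content": target},
        ]
    }

if __name__ == "__main__":
    ex = make_example(
        explanation="fires on words related to cold weather",
        tokens=["The", "snow", "fell", "on", "the", "beach"],
        activations=[0.1, 8.2, 1.3, 0.0, 0.2, 0.5],
        max_activation=9.0,
    )
    print(json.dumps(ex, indent=2))
```

The point of the integer-only target is exactly what the quote says: the assistant output is just a short string of digits, so both training and inference prompts stay much shorter than a full simulation-style transcript.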