6 点作者 aubanel4 个月前

1 comment

The multi-step agent API they've produced feels so much like reinforcement learning. Neat!<p><pre><code> memory = [user_defined_task] while llm_should_continue(memory): # this loop is the multi-step part action = llm_get_next_action(memory) # this is the tool-calling part observations = execute_action(action) memory += [action, observations]</code></pre>

Hugging Face releases a barebones library for agents

1 comment

Hugging Face releases a barebones library for agents

1 comment