智能体 (Agents)

Smolagents 是一个实验性的 API，随时可能更改。由于 API 或底层模型容易发生变化，智能体返回的结果可能会有所不同。(Smolagents is an experimental API which is subject to change at any time. Results returned by the agents can vary as the APIs or underlying models are prone to change.)

要了解有关智能体和工具的更多信息，请务必阅读入门指南。此页面包含底层类的 API 文档。(To learn more about agents and tools make sure to read the introductory guide. This page contains the API docs for the underlying classes.)

智能体 (Agents)

我们的智能体继承自 MultiStepAgent，这意味着它们可以分多个步骤执行操作，每个步骤都包含一个思考，然后是一个工具调用和执行。在本概念指南中阅读更多内容。(Our agents inherit from MultiStepAgent, which means they can act in multiple steps, each step consisting of one thought, then one tool call and execution. Read more in this conceptual guide.)

我们提供两种类型的智能体，基于主要的 Agent 类。(We provide two types of agents, based on the main Agent class.)

CodeAgent 是默认智能体，它用 Python 代码编写其工具调用。(CodeAgent is the default agent, it writes its tool calls in Python code.)
ToolCallingAgent 以 JSON 格式编写其工具调用。(ToolCallingAgent writes its tool calls in JSON.)

两者都需要在初始化时提供参数 model 和工具列表 tools。(Both require arguments model and list of tools tools at initialization.)

智能体类别 (Classes of agents)

class smolagents.MultiStepAgent

< 源代码 (source) >

( tools: typing.List[smolagents.tools.Tool] model: typing.Callable[[typing.List[typing.Dict[str, str]]], smolagents.models.ChatMessage] prompt_templates: typing.Optional[smolagents.agents.PromptTemplates] = None max_steps: int = 20 add_base_tools: bool = False verbosity_level: LogLevel = <LogLevel.INFO: 1> grammar: typing.Optional[typing.Dict[str, str]] = None managed_agents: typing.Optional[typing.List] = None step_callbacks: typing.Optional[typing.List[typing.Callable]] = None planning_interval: typing.Optional[int] = None name: typing.Optional[str] = None description: typing.Optional[str] = None provide_run_summary: bool = False final_answer_checks: typing.Optional[typing.List[typing.Callable]] = None )

参数 (Parameters)

tools (list[Tool]) — 智能体可以使用的工具 (Tool)。
model (Callable[[list[dict[str, str]]], ChatMessage]) — 将生成智能体操作的模型 (Model that will generate the agent’s actions.)。
prompt_templates (PromptTemplates, 可选 (optional)) — Prompt 模板 (Prompt templates)。
max_steps (int, 默认 20) — 智能体解决任务可以采取的最大步骤数 (Maximum number of steps the agent can take to solve the task.)。
tool_parser (Callable, 可选 (optional)) — 用于解析来自 LLM 输出的工具调用的函数 (Function used to parse the tool calls from the LLM output.)。
add_base_tools (bool, 默认 False) — 是否将基本工具添加到智能体的工具中 (Whether to add the base tools to the agent’s tools.)。
verbosity_level (LogLevel, 默认 LogLevel.INFO) — 智能体日志的详细程度级别 (Level of verbosity of the agent’s logs.)。
grammar (dict[str, str], 可选 (optional)) — 用于解析 LLM 输出的语法 (Grammar used to parse the LLM output.)。
managed_agents (list, 可选 (optional)) — 智能体可以调用的受管智能体 (Managed agents that the agent can call.)。
step_callbacks (list[Callable], 可选 (optional)) — 将在每个步骤调用的回调函数 (Callbacks that will be called at each step.)。
planning_interval (int, 可选 (optional)) — 智能体运行规划步骤的间隔 (Interval at which the agent will run a planning step.)。
name (str, 可选 (optional)) — 仅当为受管智能体时才需要 - 此智能体可以被调用的名称 (Necessary for a managed agent only - the name by which this agent can be called.)。
description (str, 可选 (optional)) — 仅当为受管智能体时才需要 - 此智能体的描述 (Necessary for a managed agent only - the description of this agent.)。
provide_run_summary (bool, 可选 (optional)) — 当作为受管智能体调用时是否提供运行摘要 (Whether to provide a run summary when called as a managed agent.)。
final_answer_checks (list, 可选 (optional)) — 在返回最终答案以检查有效性之前要运行的 Callable 列表 (List of Callables to run before returning a final answer for checking validity.)。

智能体类，它使用 ReAct 框架逐步解决给定的任务：在未达到目标之前，智能体将执行一个操作循环（由 LLM 给出）和观察（从环境中获得）。(Agent class that solves the given task step by step, using the ReAct framework: While the objective is not reached, the agent will perform a cycle of action (given by the LLM) and observation (obtained from the environment).)

extract_action

< 源代码 (source) >

( model_output: str split_token: str )

参数 (Parameters)

model_output (str) — LLM 的输出 (Output of the LLM)
split_token (str) — 操作的分隔符 (Separator for the action)。应与系统提示中的示例匹配 (Should match the example in the system prompt.)。

从 LLM 输出中解析操作 (Parse action from the LLM output)

from_folder

< 源代码 (source) >

( folder: typing.Union[str, pathlib.Path] **kwargs )

参数 (Parameters)

folder (str 或 Path) — 保存智能体的文件夹 (The folder where the agent is saved.)。
**kwargs — 将传递给智能体 init 的其他关键字参数 (Additional keyword arguments that will be passed to the agent’s init.)。

从本地文件夹加载智能体。(Loads an agent from a local folder.)

from_hub

< 源代码 (source) >

( repo_id: str token: typing.Optional[str] = None trust_remote_code: bool = False **kwargs )

参数 (Parameters)

repo_id (str) — Hub 上仓库的名称，您的工具定义于此仓库中。
token (str, 可选) — 用于在 hf.co 上标识您的令牌。如果未设置，将使用运行 huggingface-cli login 时生成的令牌（存储在 ~/.huggingface 中）。
trust_remote_code(bool, 可选，默认为 False) — 此标志表示您理解运行远程代码的风险，并且您信任此工具。如果不将其设置为 True，则从 Hub 加载工具将失败。
kwargs (附加关键字参数，可选) — 将拆分为两部分的附加关键字参数：所有与 Hub 相关的参数（例如 cache_dir、revision、subfolder）将在下载代理文件时使用，其他参数将传递给其 init 方法。

加载 Hub 上定义的代理。

从 Hub 加载工具意味着您将下载该工具并在本地执行它。始终在运行时加载工具之前检查您要下载的工具，就像您在使用 pip/npm/apt 安装软件包时一样。

smolagents

智能体 (Agents)