介绍三家新的无服务器推理提供商：Hyperbolic、Nebius AI Studio 和 Novita 🔥

发布于 2025 年 2 月 18 日

在 GitHub 上更新

100

我们激动地宣布，Hugging Face Hub 新增了三家杰出的无服务器推理提供商：Hyperbolic、Nebius AI Studio 和 Novita。这些提供商加入了我们不断壮大的生态系统，扩展了直接在 Hub 模型页面上进行无服务器推理的广度和能力。它们也无缝集成到我们的客户端 SDK (包括 JS 和 Python) 中，让您可以非常轻松地通过您喜欢的提供商使用各种各样的模型。

这些合作伙伴加入了我们现有的提供商行列，其中包括 Together AI、Sambanova、Replicate、fal 和 Fireworks.ai。

新的合作伙伴支持了一大批新模型：DeepSeek-R1、Flux.1 以及许多其他模型。在下方可以找到它们支持的所有模型

我们非常期待看到您将使用这些新的提供商创造出什么！

工作原理

在网站 UI 中

在您的用户帐户设置中，您可以

为您已注册的提供商设置您自己的 API 密钥。如果未设置自定义密钥，您的请求将通过 HF 进行路由。
按偏好顺序排列提供商。这适用于模型页面中的小组件和代码片段。

如上所述，调用推理 API 有两种模式

自定义密钥（调用直接发送到推理服务提供商，使用您自己的相应提供商的 API 密钥）
通过 HF 路由（在这种情况下，您不需要提供商的令牌，费用将直接计入您的 HF 帐户，而不是提供商的帐户）

模型页面会展示第三方推理服务提供商（与当前模型兼容的，并按用户偏好排序）

通过客户端 SDK

通过 Python，使用 huggingface_hub

以下示例展示了如何使用 Hyperbolic 作为推理提供商来运行 DeepSeek-R1。您可以使用 Hugging Face 令牌通过 Hugging Face 自动路由，或者如果您有自己的 Hyperbolic API 密钥，也可以使用它。

从源代码安装 huggingface_hub (参见说明)。官方支持将很快在 v0.29.0 版本中发布。

from huggingface_hub import InferenceClient

client = InferenceClient(
    provider="hyperbolic",
    api_key="xxxxxxxxxxxxxxxxxxxxxxxx"
)

messages = [
    {
        "role": "user",
        "content": "What is the capital of France?"
    }
]

completion = client.chat.completions.create(
    model="deepseek-ai/DeepSeek-R1", 
    messages=messages, 
    max_tokens=500
)

print(completion.choices[0].message)

以下是如何使用在 Nebius AI Studio 上运行的 FLUX.1-dev 从文本提示生成图像

from huggingface_hub import InferenceClient

client = InferenceClient(
    provider="nebius",
    api_key="xxxxxxxxxxxxxxxxxxxxxxxx"
)

# output is a PIL.Image object
image = client.text_to_image(
    "Bob Marley in the style of a painting by Johannes Vermeer",
    model="black-forest-labs/FLUX.1-schnell"
)

要切换到不同的提供商，您只需更改提供商名称，其他所有内容保持不变

from huggingface_hub import InferenceClient

client = InferenceClient(
-	provider="nebius",
+   provider="hyperbolic",
    api_key="xxxxxxxxxxxxxxxxxxxxxxxx"
)

通过 JS，使用 @huggingface/inference

import { HfInference } from "@huggingface/inference";

const client = new HfInference("xxxxxxxxxxxxxxxxxxxxxxxx");

const chatCompletion = await client.chatCompletion({
    model: "deepseek-ai/DeepSeek-R1",
    messages: [
        {
            role: "user",
            content: "What is the capital of France?"
        }
    ],
    provider: "novita",
    max_tokens: 500
});

console.log(chatCompletion.choices[0].message);