Llama

Access 7 Llama models through Mastra's model router. Authentication is handled automatically through the `LLAMA_API_KEY` environment variable.

Learn more in the Llama documentation.

.env
LLAMA_API_KEY=your-api-key
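
Because the key is read from the environment, a missing variable tends to surface only when the first request fails. The following is a minimal sketch of a startup check. The helper `requireEnv` is hypothetical and not part of Mastra; it takes the environment as a parameter so it can be exercised without a real key.

```typescript
// Hypothetical helper (not part of Mastra): fail fast when a required
// environment variable is missing or blank.
type Env = Record<string, string | undefined>;

function requireEnv(name: string, env: Env): string {
  const value = env[name];
  if (value === undefined || value.trim() === "") {
    throw new Error(`Missing required environment variable: ${name}`);
  }
  return value;
}

// In a Node.js app you would pass process.env instead of a literal object:
//   const apiKey = requireEnv("LLAMA_API_KEY", process.env);
const apiKey = requireEnv("LLAMA_API_KEY", { LLAMA_API_KEY: "your-api-key" });
```

Running a check like this at startup turns a confusing authentication failure into an explicit configuration error.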

src/mastra/agents/my-agent.ts
import { Agent } from "@mastra/core/agent";

const agent = new Agent({
  id: "my-agent",
  name: "My Agent",
  instructions: "You are a helpful assistant",
  model: "llama/cerebras-llama-4-maverick-17b-128e-instruct"
});

// Generate a response
const response = await agent.generate("Hello!");

// Stream a response
const stream = await agent.stream("Tell me a story");
for await (const chunk of stream) {
  console.log(chunk);
}

info

Mastra calls Llama through its OpenAI-compatible `/chat/completions` endpoint, so some provider-specific features may not be available. Check the Llama documentation for details.
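
To make this note more concrete, here is a rough sketch of the general shape of an OpenAI-style chat completion request. The base URL is the one that appears in the custom headers example on this page. The helper name `buildChatRequest`, the `Bearer` authorization scheme, and the idea that the upstream API expects the model ID without the `llama/` router prefix are assumptions for illustration, not confirmed by the Mastra or Llama documentation. The function only builds the request; it does not send it.

```typescript
// Sketch only: builds (but does not send) an OpenAI-style chat completion request.
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface ChatRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

function buildChatRequest(
  baseUrl: string,
  key: string,
  routerModelId: string,
  messages: ChatMessage[],
): ChatRequest {
  // Assumption: "llama/" is a Mastra routing prefix that is dropped upstream.
  const model = routerModelId.replace(/^llama\//, "");
  return {
    // Strip trailing slashes so the path is joined exactly once.
    url: baseUrl.replace(/\/+$/, "") + "/chat/completions",
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      // Assumption: standard OpenAI-style bearer token authentication.
      Authorization: `Bearer ${key}`,
    },
    body: JSON.stringify({ model, messages }),
  };
}

const chatRequest = buildChatRequest(
  "https://api.llama.com/compat/v1/",
  "your-api-key",
  "llama/llama-3.3-70b-instruct",
  [{ role: "user", content: "Hello!" }],
);
```

Anything that does not fit this request shape, such as provider-only parameters, is the kind of feature that may not be reachable through the router.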

Models

7 available models
Model	Context	Input $/1M	Output $/1M
`llama/cerebras-llama-4-maverick-17b-128e-instruct`	128K	—	—
`llama/cerebras-llama-4-scout-17b-16e-instruct`	128K	—	—
`llama/groq-llama-4-maverick-17b-128e-instruct`	128K	—	—
`llama/llama-3.3-70b-instruct`	128K	—	—
`llama/llama-3.3-8b-instruct`	128K	—	—
`llama/llama-4-maverick-17b-128e-instruct-fp8`	128K	—	—
`llama/llama-4-scout-17b-16e-instruct-fp8`	128K	—	—
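
Model IDs are long and easy to mistype, so it can help to validate them before constructing an agent. The sketch below copies the seven IDs from the table above into a constant and derives a type guard from it. The names `LLAMA_MODELS`, `LlamaModelId`, and `isLlamaModelId` are hypothetical and not exported by Mastra.

```typescript
// The seven router model IDs listed in the table above.
const LLAMA_MODELS = [
  "llama/cerebras-llama-4-maverick-17b-128e-instruct",
  "llama/cerebras-llama-4-scout-17b-16e-instruct",
  "llama/groq-llama-4-maverick-17b-128e-instruct",
  "llama/llama-3.3-70b-instruct",
  "llama/llama-3.3-8b-instruct",
  "llama/llama-4-maverick-17b-128e-instruct-fp8",
  "llama/llama-4-scout-17b-16e-instruct-fp8",
] as const;

// Union of the literal strings above.
type LlamaModelId = (typeof LLAMA_MODELS)[number];

// Type guard: narrows an arbitrary string to a known model ID.
function isLlamaModelId(id: string): id is LlamaModelId {
  return (LLAMA_MODELS as readonly string[]).indexOf(id) !== -1;
}
```

A guard like this is useful when the model ID comes from user input or configuration rather than from a literal in the source.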

Advanced Configuration

Custom Headers

src/mastra/agents/my-agent.ts
import { Agent } from "@mastra/core/agent";

const agent = new Agent({
  id: "custom-agent",
  name: "Custom Agent",
  instructions: "You are a helpful assistant",
  model: {
    url: "https://api.llama.com/compat/v1/",
    id: "llama/cerebras-llama-4-maverick-17b-128e-instruct",
    apiKey: process.env.LLAMA_API_KEY,
    headers: {
      "X-Custom-Header": "value"
    }
  }
});

Dynamic Model Selection

src/mastra/agents/my-agent.ts
import { Agent } from "@mastra/core/agent";

const agent = new Agent({
  id: "dynamic-agent",
  name: "Dynamic Agent",
  instructions: "You are a helpful assistant",
  model: ({ requestContext }) => {
    const useAdvanced = requestContext.task === "complex";
    return useAdvanced
      ? "llama/llama-4-scout-17b-16e-instruct-fp8"
      : "llama/cerebras-llama-4-maverick-17b-128e-instruct";
  }
});
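
The selection logic can also be pulled out into a plain function, which makes it testable without creating an agent. This is a sketch under stated assumptions: `selectModel`, `MODEL_BY_TASK`, and `DEFAULT_MODEL` are hypothetical names, and the task-to-model mapping simply mirrors the example above.

```typescript
// Fallback used when the task is missing or not in the lookup table.
const DEFAULT_MODEL = "llama/cerebras-llama-4-maverick-17b-128e-instruct";

// Hypothetical lookup table: task name -> router model ID.
const MODEL_BY_TASK = new Map<string, string>([
  ["complex", "llama/llama-4-scout-17b-16e-instruct-fp8"],
]);

// Pure function: returns the model for a task, or the default.
function selectModel(task: string | undefined): string {
  if (task === undefined) {
    return DEFAULT_MODEL;
  }
  return MODEL_BY_TASK.get(task) ?? DEFAULT_MODEL;
}
```

Keeping the mapping in a table means new task types can be added without touching the agent definition, and the model callback only needs to forward the task value to `selectModel`.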
