AI agents in Pipelines
Buildkite supports AI agents in two complementary ways: you can use agents to build and maintain your pipelines, and you can run agents inside your CI pipeline steps.
Build with agents
Give your AI coding agent the context and tools it needs to work with Buildkite effectively.
Skills
Buildkite skills encode expert knowledge about Buildkite. These include YAML configuration patterns, migration strategies, CLI usage, and API patterns, all formatted for AI coding agents like Claude Code, Cursor, and GitHub Copilot.
Install them to avoid re-explaining Buildkite conventions in every session. Available skills cover:
- Pipeline configuration and step types
- Migrating from other CI platforms
- Running preflight builds against local changes
- Agent runtime commands
- The Buildkite CLI and REST/GraphQL APIs
See Getting started with coding agents for the full skill list and installation instructions.
MCP server
The Buildkite MCP server uses the Model Context Protocol (MCP) to connect your AI agent to the Buildkite REST API in real time. Your agent can inspect build state, read logs, trigger runs, and iterate on pipeline configuration using live data.
Docs as context
Every Buildkite docs page is available in Markdown format—append .md to any URL (for example, /docs/pipelines/getting-started.md). Per-section llms.txt files are available for loading entire topic areas into your agent's context at once.
See Getting started with coding agents for the full list of llms.txt URLs.
Use agents in CI
Run AI agents as steps in your pipelines to automate analysis, summarize failures, and connect AI capabilities to your build workflows.
Agentic steps with model providers
Model providers connect LLMs directly into pipeline steps, giving agents access to build logs, artifacts, security policies, and real-time pipeline data. This is Buildkite's native approach to running agentic steps.
Agent steps authenticate using the existing $BUILDKITE_AGENT_ACCESS_TOKEN—no separate API key is required when using a Buildkite hosted token (available on Pro and Enterprise plans). Teams that manage their own credentials can use Bring Your Own Token instead.
steps:
- label: ":anthropic: Analyze test failures"
command: |
curl "$BUILDKITE_AGENT_ENDPOINT/ai/anthropic" \
-H "Authorization: Bearer $BUILDKITE_AGENT_ACCESS_TOKEN" \
-H "Content-Type: application/json" \
-d '{
"model": "claude-opus-4-5",
"messages": [{"role": "user", "content": "Analyze these test failures and suggest fixes..."}]
}'
Only Anthropic models are currently supported via model providers. See Model providers for the full configuration reference, including usage tracking in Organization Settings > Usage > Model Providers.
Plugins
For quick AI integration without custom scripting, Buildkite plugins can add failure analysis and log summarization to any pipeline step:
| Plugin | LLM provider | Description | |||
|---|---|---|---|---|---|
| Plugin | claude-summarize | LLM provider | Anthropic Claude | Description | Analyzes build failures, identifies root causes, and posts suggested fixes as build annotations |
| Plugin | bedrock-summarize | LLM provider | AWS Bedrock | Description | Same failure analysis pattern using AWS Bedrock LLMs; supports injecting project context via agent_file
|
| Plugin | chatgpt-analyzer | LLM provider | OpenAI | Description | Build log analysis and summarization using OpenAI models |
All three plugins support trigger: on-failure to run only when a step fails, analysis_level: step or build to scope the analysis, and custom_prompt to add project-specific context.
Example using the bedrock-summarize plugin:
steps:
- label: "🧪 Run tests"
command: "bundle exec rspec"
plugins:
- buildkite/bedrock-summarize#v1.0.0:
trigger: on-failure
analysis_level: step
model: "anthropic.claude-3-5-sonnet-20241022-v2:0"
custom_prompt: "This is a Ruby on Rails app. Focus on database and authentication errors."
The claude-summarize plugin is no longer actively maintained. For Anthropic model integration, Buildkite recommends using model providers instead.