# Baz Agents

Baz runs AI agents, including coding agents and code review agents, to evaluate pull requests. Rather than assessing changes on a per-file basis, these agents consider the entire repository and its external context. The codebase is split into indexable units, and embeddings with similarity measures are used to retrieve relevant code and tests. Agents perform agentic code analysis and optional runtime inspection, yielding structured findings that are shared as pull request comments and CI check results.
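As a rough illustration (not Baz's actual implementation), embedding-based retrieval over indexed code chunks can be sketched as follows; the chunk representation and ranking are simplified assumptions:

```python
# Illustrative sketch: rank indexed code chunks by cosine similarity
# to a query embedding (e.g. an embedding of the PR diff).
import math

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def top_k_chunks(query_embedding, indexed_chunks, k=3):
    """Return the k chunk ids most similar to the query embedding.

    indexed_chunks is a list of (chunk_id, embedding) pairs.
    """
    scored = [(cosine_similarity(query_embedding, emb), chunk_id)
              for chunk_id, emb in indexed_chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [chunk_id for _, chunk_id in scored[:k]]
```

In practice such lookups run against a vector database rather than in-memory lists, but the ranking principle is the same.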

{% embed url="https://www.youtube.com/watch?v=RNQ1o9aU0is" %}

### Coding Agents

*Coding agents* are a new class of agents that compose code. They run in an ephemeral, secure, sandboxed environment and can be tasked with fixing issues discovered during review.

{% tabs %}
{% tab title="Fixer" %}
Accelerates the code review cycle by letting suggested fixes be committed directly to a PR, eliminating manual edits and context switching.

**What it does**

* Proposes small, safe edits to address verified issues and can apply them to the PR so reviewers and authors see a working suggestion in-place.
* Focuses on self-contained fixes that are low risk to apply automatically.

**High level guidance**

* Only apply fixes that are clearly correct and scoped to the change. Avoid risky changes that require design or product decisions.
* Keep the suggested changes minimal and accompanied by a short rationale so reviewers can accept or tweak the suggestion quickly.

**Tools it uses**\
Tools that gather code and diff context, tools that produce a patch or suggestion, and tools that safely create commit suggestions against the PR.

**Context it consumes**\
PR diff, related files needed to justify the fix, and any metadata that explains intent (PR title or ticket). The agent favors fixes that can be validated by the changed code alone.

**How it behaves**\
Runs a quick verification workflow: build a minimal justification for the change, prepare a patch, and surface the patch as a suggested commit. The agent favors simplicity and high confidence fixes.

**How to Trigger**\
Once Baz Fixer is enabled and configured, you can trigger fixes directly from your PR in two ways:

* Apply fix on a single comment  \
  Each Baz review comment includes a checkbox: **“Apply fix with Baz”**. Selecting it will generate a commit that addresses that specific finding.
* Fix all comments in a PR\
  If your PR contains multiple Baz comments, you will see a **“Fix all”** option in the PR description. Selecting it will generate a separate commit for each open Baz comment in the PR.
* Auto-fix all PR comments\
  You can configure Baz Fixer to automatically apply fixes for specific repositories and authors. When a PR matches the configured combination, Baz will automatically generate commits that address all Baz findings in the PR. This configuration is available in the Fixer agent drawer.

**Reviewing fixing session logs**

* To review whether a fix was successful, open the Fixer **agent drawer** and click **“See logs”** for the relevant sandbox environment.
* This opens the **Fixing Sessions** page, where each PR shows its fixing sessions. Each session includes logs such as tools invoked, agent responses, and final session status.
{% endtab %}

{% tab title="SRE Agent" %}
The SRE Agent identifies reliability, performance, and operational risks in your codebase using observability tools, and automatically generates fixes as pull requests for your team to review.

**What it does**

When an issue is detected, the agent analyzes the risk, generates a fix, and opens a pull request with clear context. It focuses on reliability issues such as retries and error handling, performance bottlenecks like inefficient queries, and observability gaps including missing logs or metrics.

**How it works**

The agent combines repository context with production telemetry from observability tools like Datadog. By correlating code with real system behavior, it identifies risky patterns and generates fixes aligned with production usage, delivered as pull requests.

**Setup instructions**

* The AI SRE Agent requires write access to repositories to open pull requests, and an active observability tool (e.g. Datadog) to provide production context.
* Datadog Setup
  * Service Mapping must be configured for source code integration. The application key must include `error_tracking_read` and `apm_read` permissions. Adding `logs_read_data` is recommended for deeper analysis.
  * See [Service Mapping for Source Code Integration](https://docs.datadoghq.com/source_code/service-mapping) for setup instructions.
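As an illustrative sketch (assuming Datadog's v2 scoped application keys API; the key name is hypothetical), an application key carrying the permissions above would be created with a request body like:

```json
{
  "data": {
    "type": "application_keys",
    "attributes": {
      "name": "baz-sre-agent",
      "scopes": ["error_tracking_read", "apm_read", "logs_read_data"]
    }
  }
}
```

An unscoped application key inherits all of its owner's permissions, so explicitly scoping the key to these reads is the more conservative setup.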
{% endtab %}
{% endtabs %}

### Code Review Agents

Reviews are our general-purpose code review agent class. Each is individually scoped, contextualized, and steered to discover, analyze, and fix coding issues in a specific engineering sub-domain. Combined with memories derived from user feedback on Baz's pull request comments, each agent is both tightly focused and finely tuned to your codebase's unique requirements.

{% tabs %}
{% tab title="Spec Reviewer" %}
Ensures implemented code and design align with documented requirements, identifying gaps or deviations early.

**What it does**

* Extracts explicit requirements from tickets and designs and validates whether the implementation satisfies those requirements.
* Produces a verdict for each requirement with evidence: met, partially met, or not met.

**High level guidance**

* Keep extraction strictly ticket-driven: only record requirements explicitly stated in the source materials.
* Validate each requirement using code and, when available, preview environments or design artifacts.

**Tools it uses**\
Tools that fetch ticket and design artifacts, tools that help get context from code and diffs, visual comparison helpers when preview environments are available, and evidence capture tools.

**Context it consumes**\
Ticket text and attachments, design files, PR diff, optional preview environment snapshots, and prior specifications for consistency.

**Activation note**\
Connect your integrations to activate this agent. When design or preview integrations are present, the agent will include visual validation as part of the verdict.
{% endtab %}

{% tab title="AI Coding Guidelines" %}
Ensures AI-generated code follows consistent, high-quality standards aligned with your engineering practices.

**What it does**

* Applies organization coding conventions and quality expectations to AI-produced output.
* Produces guardrails and standard phrasing developers can copy to align model behavior.

**High level guidance**

* Emphasize consistency and predictability. Encourage minimal, well-documented suggestions and require evidence when changes affect public contracts.

**Tools it uses**\
Policy and style templates, and context tools that map repository expectations.

See [skills-and-instructions](https://docs.baz.co/agents/skills-and-instructions "mention") for more details.
{% endtab %}
{% endtabs %}

#### Code Correctness

{% tabs %}
{% tab title="Logical Bugs" %}
Identifies logical inconsistencies, flawed conditionals, and edge cases that could produce unexpected behavior.

**What it does**

* Highlights incorrect logic, incomplete implementations, missing steps, and unintended side effects.
* Gives concrete traces and examples of failing execution paths.

**High level guidance**

* Compare the implementation with PR intent or ticket context to determine whether behavior is intentional. Prioritize concrete, reproducible issues.

**Tools it uses**\
Tools that map code flows and help extract execution traces along with code and diff exploration utilities.

**Context it consumes**\
PR title and ticket context, diff hunks, and the code paths needed to trace complete execution from input through output.
{% endtab %}

{% tab title="Breaking Changes" %}
Detects changes that alter or remove existing functionality and could break dependent APIs or features.

**What it does**

* Finds contract or API surface changes and ties them to consumers that would fail.
* Produces actionable findings with exact locations and suggested mitigations.

**High level guidance**

* Only label a change as breaking when there is direct evidence showing a consumer or contract is affected. Avoid hypothetical statements.

**Tools it uses**\
Tools that discover API surface and contract definitions, tools that help find consumers across the repo, and diff/context tools to produce evidence.

**Context it consumes**\
PR diff, public API/type definitions, API docs if present, and consumer client code references.
{% endtab %}

{% tab title="Type Inconsistency" %}
Ensures variables and functions use appropriate data types to prevent type-related errors.

**What it does**

* Flags type changes that could cause runtime or integration problems, especially where code interfaces with external systems.
* Recommends specific, actionable type fixes or mitigations.

**High level guidance**

* In strongly typed modules prefer conservative assumptions about types, but call out clear inconsistencies that impact external contracts.

**Tools it uses**\
Tools that help get type and API context from code, diff comparators, and repo search helpers.

**Context it consumes**\
PR diff, type definitions and usages, and any module metadata that clarifies language and dependency expectations.
{% endtab %}
{% endtabs %}

#### Code Quality and Correctness

{% tabs %}
{% tab title="Naming and Typos" %}
Finds unclear identifiers and obvious spelling mistakes that reduce code clarity.

**What it does**

* Flags non-descriptive or incorrect names and typos in code and comments.

**High level guidance**

* Be conservative with stylistic nitpicks. Avoid enforcing strict naming conventions that conflict with the repo style.
{% endtab %}

{% tab title="Code Dedup and Conventions" %}
Detects duplicated logic and enforces existing team patterns and conventions.

**What it does**

* Finds repeated code and suggests refactors that follow team patterns.
* Encourages reuse and clearer abstractions aligned with local conventions.

**High level guidance**

* Prefer refactors that are small and safe for the current change. Avoid large architectural rewrites in a single suggestion.
{% endtab %}

{% tab title="Conciseness" %}
Proposes simpler, more idiomatic code that keeps readability and correctness.

**What it does**

* Suggests idiomatic patterns appropriate for the repository language and flags overly verbose constructs.

**High level guidance**

* Focus on substantial improvements that reduce complexity while preserving behavior. Do not suggest changes for very small or trivial lines.
{% endtab %}

{% tab title="Code Hygiene" %}
Ensures code is tidy, well organized, and follows agreed style rules to improve maintainability.

**What it does**

* Flags commented out code, obvious clutter, and structural issues that impede readability.
* Avoids false positives for accepted patterns such as deliberate multi-line strings or documented TODOs.

**High level guidance**

* Recommend cleanup only when it improves maintainability and does not change intent. Keep suggestions pragmatic and minimal.
{% endtab %}
{% endtabs %}

#### Security Best Practices

{% tabs %}
{% tab title="Basic Security Patterns" %}
Identifies common security anti-patterns like unsanitized inputs, PII exposure, and injection vectors.

**What it does**

* Flags hardcoded secrets, PII leaks in logs, risky SQL or command usage, and missing input validation.

**High level guidance**

* When calling out PII or secrets, specify exact locations and avoid hedging language. Avoid flagging framework handled behavior or test placeholders.
{% endtab %}

{% tab title="REST API Best Practices" %}
Ensures backend APIs follow modern REST conventions and sound design.

**What it does**

* Looks at route names, HTTP method usage, versioning, parameter patterns, and resource hierarchy.
* Focuses specifically on backend endpoints and their server-side implementations.

**High level guidance**

* Only flag server side endpoint implementations. Do not flag client-side calls or client library usage.
{% endtab %}
{% endtabs %}

## FAQ

<details>

<summary>What do Baz’s default agents do?</summary>

They analyze change requests for naming, typing, logic bugs, outdated comments, log errors, etc., using a combination of AI, parsing, and repository context.

</details>

<details>

<summary>How does Baz scale efficiently on large codebases?</summary>

Baz divides code into manageable chunks, reprocesses only changed files, stores embeddings in a vector database for similarity search, and filters by organization/repo to maintain performance.

</details>

<details>

<summary><strong>Can I disable some default reviewer checks?</strong></summary>

Yes. Organization admins can deactivate specific agents or modify their scope.

</details>
