Skip to content

Agents & models

Six model roles exist, each backed by a different system prompt, tool set, or runtime lane. The construction model is the simplest way to understand the split: one role drafts the blueprint, one keeps the job coherent, one performs trade work, one inspects, one enforces deterministic checks, and one enriches project context.

RoleResponsibilityTypical model tier
specSurveys missing context and drafts the task blueprint in exploring.High — reasoning-heavy.
coordinatorKeeps the project/domain plan coherent and makes promotion/rejection decisions.High.
workerPerforms bounded trade work against the accepted blueprint.Mid-to-high — coding-specific.
reviewerInspects completed work against the blueprint + rubric.Mid.
gateCheckerRuns deterministic checks and custom gates.Low — mostly shell/parse.
contextIndexerSummarizes code purpose, contracts, canonical abstractions, and read-next guidance for the Corpus Map.Mid — code-aware, structured, fast enough for repeated indexing.

Assigning models

yaml
models:
  spec: deepseek-ai/DeepSeek-V4-Flash
  coordinator: deepseek-ai/DeepSeek-V4-Flash
  worker: Qwen/Qwen3.5-35B-A3B
  reviewer: deepseek-ai/DeepSeek-V4-Flash
  gateChecker: deepseek-ai/DeepSeek-V4-Flash
  contextIndexer: zai-org/GLM-4.6

Model IDs resolve against Guildhall's bundled model catalog.

Guildhall's public open-model recommendations live in Open model recommendations. Those notes are based on saved development replay prompts and treated as a starting point, not a permanent ranking.

For OpenAI-compatible providers, Guildhall also sends model API hints when they make the run cheaper or more reliable. Stable agent turns get a prompt-cache identity when one is available. JSON-only helper calls can request strict response_format schemas. Reasoning controls are selected from the agent role and model id instead of being sprinkled through prompts. Guildhall deliberately does not set paid service_tier priority unless a future configuration adds an explicit opt-in.

Cognitive profiles

Each role has a CognitiveProfile declaring what it needs from a model — context window, tool use, reasoning. The coordinator uses these to validate model assignments at boot.

Permission modes

Agents run under a permission mode:

  • plan — no side effects; agents can read but not write.
  • default — standard allow-list; destructive operations prompt.
  • full_auto — unrestricted; use only in sandboxed worktrees.

remediation_autonomy and worktree_isolation interact with permission mode to decide how aggressively agents may self-recover.

Persistence

Agent conversations and checkpoints are snapshotted under ~/.guildhall/data/projects/<project-hash>/ per project. Shared project state stays in ./.guildhall/, but raw conversation history is local so it does not balloon commits.

Released under the FLL-1.2 License.