Gonnector
A two-part structure by reader: for prompt authors + for harness/infra engineers
A reworking of Anthropic's official "Prompting Claude Fable 5" documentation so you can put it to work immediately. The reorganization takes four forms.
Jump straight to the part that fits your role.
The people who write system prompts, skills, and task instructions. Nine ready-to-paste instruction patterns: instructing with short principles, operating effort, stating boundaries, not reproducing reasoning, and more.
The people who design the agent harnessharness. The runtime that wraps the model and actually drives it. It is the software layer responsible for tool calls, permissions, timeouts, and conversation management., API integrations, and production pipelines. Nine structural changes: preparing for long turns, refusal/fallback, memory, subagents, the send-to-user tool, and more.
Fable 5 is aimed at problems that were too complex, too long-running, or too ambiguous for previous models, and it is especially strong on end-to-end work that would take a person from hours to weeks. The teams that applied it to their hardest unsolved problems saw the best results; if you test it only with easy tasks, you will underestimate the range of its capability. It also handles simple tasks reliably.
stop_reason: "refusal". For operational handling, see the safety classifier section in Part 2.
Patterns to apply when writing system prompts, skills, and task instructions. The snippets are verbatim from the original; copy and use them.
Because instruction-following has improved, you can steer most of the behavior with a single short principle rather than naming each desired behavior individually. With no instruction, Fable 5 at high effort can over-elaborate (listing options it will not pursue, giving verbose root-cause explanations, over-structuring PR descriptions), yet the single paragraph below is as effective as a whole list of forbidden patterns.
Lead with the outcome. Your first sentence after finishing should answer "what happened" or "what did you find": the thing the user would ask for if they said "just give me the TLDR." Supporting detail and reasoning come after. Being readable and being concise are different things, and readability matters more. The way to keep output short is to be selective about what you include (drop details that don't change what the reader would do next), not to compress the writing into fragments, abbreviations, arrow chains like A → B → fails, or jargon.
The checkpoint (pausing) behavior in long-running workflows is also steered by a single principle rather than an enumeration of cases.
Pause for the user only when the work genuinely requires them: a destructive or irreversible action, a real scope change, or input that only they can provide. If you hit one of these, ask and end the turn, rather than ending on a promise.
effortFable 5's thinking-intensity setting (low / medium / high / xhigh). Raising it makes the model think more deeply but slower and more expensive. is the primary control for the trade-off between intelligence, latency, and cost.
high by defaultxhighmedium / lowEven low effort often exceeds the xhigh of previous models. If a task is taking longer than it needs to, or you want to work more interactively, lower the effort. High effort gives excellent verification behavior and the most rigorous output, but on routine tasks it can lead to excessive context-gathering and deliberation.
At high effort the model can add unrequested cleanup, refactoring, abstractions, and unnecessary defensive code on its own. State the boundaries.
Don't add features, refactor, or introduce abstractions beyond what the task requires. A bug fix doesn't need surrounding cleanup and a one-shot operation usually doesn't need a helper. Don't design for hypothetical future requirements: do the simplest thing that works well. Avoid premature abstraction and half-finished implementations. Don't add error handling, fallbacks, or validation for scenarios that cannot happen. Trust internal code and framework guarantees. Only validate at system boundaries (user input, external APIs). Don't use feature flags or backwards-compatibility shims when you can just change the code.
For unrequested behavior (drafting an email that was not asked for, backing up a defensive git branch, and the like), define the boundary between what should and should not be done.
When the user is describing a problem, asking a question, or thinking out loud rather than requesting a change, the deliverable is your assessment. Report your findings and stop. Don't apply a fix until they ask for one. Before running a command that changes system state (restarts, deletes, config edits), check that the evidence actually supports that specific action. A signal that pattern-matches to a known failure may have a different cause.
To keep the model from planning in circles on ambiguous tasks, add the instruction below. The last sentence matters: it does not apply to thinking blocksThe record of the model's internal reasoning before it produces an answer. In Fable 5, only a summarized version is provided., so internal reasoning stays free.
When you have enough information to act, act. Do not re-derive facts already established in the conversation, re-litigate a decision the user has already made, or narrate options you will not pursue in user-facing messages. If you are weighing a choice, give a recommendation, not an exhaustive survey. This does not apply to thinking blocks.
In long autonomous runs, instruct the model to audit its progress against actual tool results. In Anthropic's testing, this instruction nearly eliminated false progress reports even on tasks designed to induce them.
Before reporting progress, audit each claim against a tool result from this session. Only report work you can point to evidence for; if something is not yet verified, say so explicitly. Report outcomes faithfully: if tests fail, say so with the output; if a step was skipped, say that; when something is done and verified, state it plainly without hedging.
Instructions that move or explain internal reasoning in the response text ("show your thinking," "explain your reasoning process," and the like) can trigger the reasoning_extraction refusalA response in which the model declines to process a request for safety reasons. In the API it is surfaced as stop_reason: "refusal". category and increase fallbackfallback. A backup path that automatically hands processing to another model when the primary model refuses or fails. to Opus 4.8.
thinking blocks from adaptive thinkingFable 5 performs better when it understands the intent behind a request. Rather than guessing the intent on its own, it connects to the relevant information. This is especially effective for long-running agents handling multiple workstreams.
I'm working on [the larger task] for [who it's for]. They need [what the output enables]. With that in mind: [request].
In long, tool-heavy agent conversations, you can get arrow-chain shorthand, excessive implementation detail, and references to thinking the user never saw. Have the model write the final summary not as a continuation of the working thread, but as a re-grounding for a reader seeing this for the first time.
Terse shorthand is fine between tool calls (that's you thinking out loud, and brevity there is good). Your final summary is different: it's for a reader who didn't see any of that. If you've been working for a while without the user watching (overnight, across many tool calls, since they last spoke), your final message is their first look at any of it. Write it as a re-grounding, not a continuation of your working thread: the outcome first, then the one or two things you need from them, each explained as if new. The vocabulary you built up while working is yours, not theirs; leave it behind unless you re-introduce it. When you write the summary at the end, drop the working shorthand. Write complete sentences. Spell out terms. Don't use arrow chains, hyphen-stacked compounds, or labels you made up earlier. When you mention files, commits, flags, or other identifiers, give each one its own plain-language clause. Open with the outcome: one sentence on what happened or what you found. Then the supporting detail. If you have to choose between short and clear, choose clear.
Skills and instructions written for previous models can be too prescriptive for Fable 5 and can actually lower output quality. If baseline performance is better, consider removing outdated instructions. A capability improvement is a good occasion to reassess which instructions, tools, and guardrails are still genuinely needed. Fable 5 is also good at updating skills on the fly based on what it learned during the work.
Structural changes to apply when designing the agent harnessharness. The runtime that wraps the model and actually drives it. It is the software layer responsible for tool calls, permissions, timeouts, and conversation management., API integrations, and production pipelines.
At high effort, a single request can take minutes, and autonomous runs can go for hours. This is one of the biggest changes teams encounter.
Fable 5 runs safety classifiers aimed at the following.
stop_reason: "refusal" in production, those requests simply fail.
Details are in the model introduction document; items to check when migrating:
refusal stop reason and fallback handlingFable 5 does especially well when it can record and reference lessons from previous runs. A note space as simple as a single Markdown file is enough.
Store one lesson per file with a one-line summary at the top. Record corrections and confirmed approaches alike, including why they mattered. Don't save what the repo or chat history already records; update an existing note rather than creating a duplicate; delete notes that turn out to be wrong.
To bootstrap from existing history:
Reflect on the previous sessions we've had together. Use subagents to identify core themes and lessons, and store them in [X]. Make sure you know to reference [X] for future use.
Fable 5 dispatches and manages parallel subagentssubagent. An auxiliary AI instance that the main agent hands off part of the work to. It works in parallel in its own independent context. more aggressively and more reliably than previous models.
Delegate independent subtasks to subagents and keep working while they run. Intervene if a subagent goes off track or is missing relevant context.
A separate verification subagent with fresh context is better than self-critique. For long-running work, instruct as follows.
Establish a method for checking your own work at an interval of [X] as you build. Run this every [X interval], verifying your work with subagents against the specification.
For long, asynchronous agents, build a client tool that delivers, without ending the turn, a message the user must see exactly as written. Targets: partial deliverables (a generated code snippet, a draft message), progress updates with concrete numbers, and direct answers to questions that came in mid-run. Tool input is not summarized, so the content arrives intact.
{
"name": "send_to_user",
"description": "Display a message directly to the user. Use this for progress updates, partial results, or content the user must see exactly as written before the task finishes.",
"input_schema": {
"type": "object",
"properties": {
"message": {
"type": "string",
"description": "The content to display to the user."
}
},
"required": ["message"]
}
}Between tool calls, when you have content the user must read verbatim (a partial deliverable, a direct answer to their question), call the send_to_user tool with that content. Use send_to_user only for user-facing content, not for narration or reasoning.
Patterns that appear rarely in the later part of long sessions, and how to handle them:
| Pattern | Response |
|---|---|
| Stops after stating intent | Ends the turn after only saying "now I'll do X," with no tool call. A short nudge like "continue" / "go ahead and do it end to end." Operate it paired with the checkpoint instruction (Part 1-1) |
| Unnecessary permission requests | Asks again for permission when it could proceed. Add the system reminder below for autonomous pipelines |
| Context-budget worries | Suggests a new session, proposes a summary handoff, or reduces the work. It is best if the harness does not expose the remaining token countdown; if it must, add a reassurance line |
System reminder for autonomous pipelines:
You are operating autonomously. The user is not watching in real time and cannot answer questions mid-task, so asking "Want me to…?" or "Shall I…?" will block the work. For reversible actions that follow from the original request, proceed without asking. Offering follow-ups after the task is done is fine; asking permission after already discussing with the user before doing the work is not. Before ending your turn, check your last paragraph. If it is a plan, an analysis, a question, a list of next steps, or a promise about work you have not done ("I'll…", "let me know when…"), do that work now with tool calls. End your turn only when the task is complete or you are blocked on input only the user can provide.Context reassurance line:
You have ample context remaining. Do not stop, summarize, or suggest a new session on account of context limits. Continue the work.
Pick tasks harder than what you would hand to previous models, and let Fable 5 scope them itself, ask clarifying questions, and execute. If you evaluate it only with easy tasks, you will underestimate the range of its capability.
A record of how this guide came to be. Opus 4.8 summarized the original (Appendix A), and Fable 5 verified that summary against the original (Appendix B) before reorganizing and writing the entire body itself. In effect, the model refined its own user manual.
This document is a prompting and scaffolding guide for Claude Fable 5 / Mythos 5 (the generation after the previous Claude Opus 4.8). The core idea is that "because the model has become far more capable, the old approach of fine-grained instruction and building guardrails can actually get in the way." Below is a systematic summary of the key points.
Fable 5 does well on problems that were too complex, too long-running, or too ambiguous for previous models, especially end-to-end work that would take a person from hours to weeks. Interestingly, the document emphasizes that "the teams that applied it to their hardest unsolved problems saw the best results, and if you test it only with easy tasks you will underestimate the model's range of capability." In other words, the direction of prompting philosophy has shifted from "control the model tightly" to "trust its capability and let it loose."
Infrastructure preparation for longer turns. Hard tasks can take minutes for a single request and hours for autonomous runs. Adjust client timeouts, streaming, and progress indicators, and restructure the harness to check on runs asynchronously (scheduled jobs, etc.) instead of blocking. At the same time, to prevent overplanning on ambiguous tasks, add an instruction like "when you have enough information to act, act; don't re-litigate decisions already made."
Building a memory system. Fable 5 does especially well when it can record and reference lessons from previous runs. Give it a note space as simple as a single Markdown file and rules like "one lesson per file, a one-line summary at the top, update instead of duplicating, delete notes that turn out wrong." You can also bootstrap by having subagents review past sessions.
Aggressive use of parallel subagents. Fable 5 dispatches and manages parallel subagents better and more aggressively than before. Delegate often, prefer asynchronous communication over blocking, and get cache benefits from long-lived subagents that retain context.
Explicit self-verification. A separate verification subagent with fresh context is better than self-critique. Instruct it to "verify against the specification with subagents at regular intervals."
Creating a send-to-user tool. For long asynchronous agents, build a client tool that delivers, without ending the turn, a message the user must see (partial deliverables, direct answers to questions) exactly as written. Tool input is not summarized, so the content arrives intact. But the tool definition alone is not enough; you must also include phrasing in the system prompt that elicits the call, and you must not leak narration/internal reasoning through it.
In the later part of long sessions, Fable 5 may (1) stop after only stating intent ("now I'll do X") with no tool call, (2) ask again for permission when it could already proceed, or (3) worry about the context budget and suggest a new session or try to reduce the work. Handle each with a short nudge like "continue / carry it through to the end," a system reminder for autonomous pipelines ("you are operating autonomously and the user can't watch in real time, so proceed without asking for reversible actions"), and a reassurance line like "context is ample, don't stop or summarize." The third case often occurs when the harness exposes the remaining token countdown to the model, so it's best not to expose that number when possible.
In long, tool-heavy agent conversations, Fable 5 can produce hard-to-follow text: arrow-chain shorthand (A → B → fails), excessive implementation detail, references to thinking the user never saw. Write the final summary not as "a continuation of the working thread" but as "a re-grounding for a reader seeing this for the first time." That is, lead with the outcome in complete sentences, drop the shorthand and self-made vocabulary, and give files, commits, and flags each their own plain-language explanation.
Where you used to control, enumerate, and defend the model in fine detail, with Fable 5 the core is to (a) instruct with short principles, (b) strip back the prescriptive skills of the past, (c) tune with effort, (d) put in place scaffolding for long autonomous runs such as progress-evidence verification, memory, subagents, and send-to-user, and (e) observe the newly introduced constraints like "don't reproduce reasoning as text." All of it converges on one principle: "the model has become far more capable, so don't block that capability, put it to use."
Review method: Downloaded the original markdown directly and compared it against the summary paragraph by paragraph. Completeness assessment: about 85% (opinion).
The four-tier effort operation, short-principle-based instruction, stripping back old prompts, grounding progress reports in evidence, the ban on reproducing reasoning and the reasoning_extraction refusal, the memory system, parallel subagents, the send-to-user tool (with the caveat that the definition alone is not enough), the early-termination and context-worry edge cases, and the readability guidance all match the original. No factual distortion or fabricated content.
Observation from Fable 5's perspective: Confirmed that the example snippets in this document are included almost verbatim in the system prompt of the Fable 5 deployed in Claude Code ("Lead with the outcome...," "When you have enough information to act, act...," the autonomous-run reminder, etc.). The document's example prompts are not theory but actual deployed phrasing, and the summary accurately captured that skeleton.
According to the original, Fable 5 runs safety classifiers aimed at three areas: (a) offensive cybersecurity, (b) biology/life-science, and (c) extraction of summarized thinking; well-intentioned security work or beneficial life-science work can be caught as false positives; and refused requests should be configured to auto-reroute via fallback to Opus 4.8. The summary mentions only (c), reasoning_extraction, and omits (a), (b), and the recommendation to configure fallback entirely. Without this knowledge, when a refusal occurs in production you can't diagnose the cause or respond, so for teams in security/bio-adjacent domains this is information that governs migration decisions.
The original specifically enumerates the areas of improvement over Opus 4.8: long-horizon autonomy, first-shot accuracy, Vision (trained to handle flipped or blurry images with bash/crop tools), enterprise workflows (financial analysis, spreadsheets, slides), code review/debugging (improved recall, but excluding the cybersecurity domain), navigating ambiguity, and delegation/collaboration. The summary keeps only the general point "does well on end-to-end work" and drops all the specifics. This list is practical material for deciding "what tasks to put Fable 5 on."
On structure: Item 4 of the summary merges two different sections of the original (the over-refactoring-prevention snippet from the effort section + the unrequested-behavior-prevention from the State the boundaries section) into one item. There is no distortion of content, but it is a source of confusion for tracing the original structure.