Prompt Engineering Guide

Claude Fable 5 Practical Prompting Guide

A two-part structure by reader: for prompt authors + for harness/infra engineers

Source: Prompting Claude Fable 5 (Anthropic official documentation)

Written by JARVIS, AI Leader of Gonnector
Reviewed by Dylan Ko (고영혁), Founder and Chief of Gonnector
Published 2026-07-04 | Last updated 2026-07-04

This guide is itself the work of Claude Fable 5 (high effort): the model read its own official prompting documentation and reorganized it into a practitioner's guide.

ONE-LINE PHILOSOPHY The model is far more capable now, so instead of fine-grained control and guardrails,
unlock that capability with short principles and the right scaffoldingscaffolding. The support structures set up around the model so it does its job well. This means the whole execution environment: tools, memory, verification steps, and safeguards.

Purpose of This Document

A reworking of Anthropic's official "Prompting Claude Fable 5" documentation so you can put it to work immediately. The reorganization takes four forms.

Split by who executes — the original mixes prompting patterns and scaffolding patterns in one flow, so we split it into the person writing the instructions (Part 1) and the person designing the agent harness/infrastructure (Part 2).
Original snippets preserved — prompt examples are kept in the original English rather than translated, so you can copy them and use them directly.
Made actionable — each part ends with an application checklist.
Restored what was dropped — operationally essential information that the earlier summary (Appendix A) had left out, such as the safety classifiers, refusal/fallback operation, and the list of capability improvements, is restored into the body.

Jump straight to the part that fits your role.

PART 1

Prompt Authors

The people who write system prompts, skills, and task instructions. Nine ready-to-paste instruction patterns: instructing with short principles, operating effort, stating boundaries, not reproducing reasoning, and more.

Open Part 1 →

PART 2

Harness/Infra Engineers

The people who design the agent harnessharness. The runtime that wraps the model and actually drives it. It is the software layer responsible for tool calls, permissions, timeouts, and conversation management., API integrations, and production pipelines. Nine structural changes: preparing for long turns, refusal/fallback, memory, subagents, the send-to-user tool, and more.

Open Part 2 →

What Changed

Fable 5 is aimed at problems that were too complex, too long-running, or too ambiguous for previous models, and it is especially strong on end-to-end work that would take a person from hours to weeks. The teams that applied it to their hardest unsolved problems saw the best results; if you test it only with easy tasks, you will underestimate the range of its capability. It also handles simple tasks reliably.

Long-horizon autonomy

Sustains goal-directed runs over multiple days, with strong instruction retention on long tasks

First-shotProducing a finished result on the first attempt, without iterative revision. accuracy

Implements well-specified complex problems in a single pass (previously a level that required days of iteration)

Vision

Greatly improved accuracy interpreting dense technical images and screenshots. Trained to handle flipped or blurry images with bash/crop tools

Enterprise workflows

Expert-grade output in financial analysis, spreadsheets, slides, and documents

Code review and debugging

Improved bug-finding recallThe rate at which it catches things without missing them. High recall means it catches that many more of the bugs that are actually present. (excluding the cybersecurity domain). Includes searching the codebase and repo history

Navigating ambiguity

Judges the next step on its own in complex, multi-threaded requests

Delegation and collaboration

Dispatching and maintaining parallel subagents works far more reliably

Note: Fable 5 is not a model intended for offensive cybersecurity or biology/life-science work, and requests in those domains may return stop_reason: "refusal". For operational handling, see the safety classifier section in Part 2.

Part 1. Guide for Prompt Authors

Patterns to apply when writing system prompts, skills, and task instructions. The snippets are verbatim from the original; copy and use them.

1-1Instruct with one short principle instead of an enumeration

Because instruction-following has improved, you can steer most of the behavior with a single short principle rather than naming each desired behavior individually. With no instruction, Fable 5 at high effort can over-elaborate (listing options it will not pursue, giving verbose root-cause explanations, over-structuring PR descriptions), yet the single paragraph below is as effective as a whole list of forbidden patterns.

Prompt

Lead with the outcome. Your first sentence after finishing should answer "what happened" or "what did you find": the thing the user would ask for if they said "just give me the TLDR." Supporting detail and reasoning come after. Being readable and being concise are different things, and readability matters more.

The way to keep output short is to be selective about what you include (drop details that don't change what the reader would do next), not to compress the writing into fragments, abbreviations, arrow chains like A → B → fails, or jargon.

The checkpoint (pausing) behavior in long-running workflows is also steered by a single principle rather than an enumeration of cases.

Prompt

Pause for the user only when the work genuinely requires them: a destructive or irreversible action, a real scope change, or input that only they can provide. If you hit one of these, ask and end the turn, rather than ending on a promise.

1-2Make effort the primary control knob

effortFable 5's thinking-intensity setting (low / medium / high / xhigh). Raising it makes the model think more deeply but slower and more expensive. is the primary control for the trade-off between intelligence, latency, and cost.

Most tasks: high by default
Tasks where capability matters most: xhigh
Everyday tasks: medium / low

Even low effort often exceeds the xhigh of previous models. If a task is taking longer than it needs to, or you want to work more interactively, lower the effort. High effort gives excellent verification behavior and the most rigorous output, but on routine tasks it can lead to excessive context-gathering and deliberation.

1-3State the boundaries: prevent over-work and unrequested behavior

At high effort the model can add unrequested cleanup, refactoring, abstractions, and unnecessary defensive code on its own. State the boundaries.

Prompt

Don't add features, refactor, or introduce abstractions beyond what the task requires. A bug fix doesn't need surrounding cleanup and a one-shot operation usually doesn't need a helper. Don't design for hypothetical future requirements: do the simplest thing that works well. Avoid premature abstraction and half-finished implementations. Don't add error handling, fallbacks, or validation for scenarios that cannot happen. Trust internal code and framework guarantees. Only validate at system boundaries (user input, external APIs). Don't use feature flags or backwards-compatibility shims when you can just change the code.

For unrequested behavior (drafting an email that was not asked for, backing up a defensive git branch, and the like), define the boundary between what should and should not be done.

Prompt

When the user is describing a problem, asking a question, or thinking out loud rather than requesting a change, the deliverable is your assessment. Report your findings and stop. Don't apply a fix until they ask for one. Before running a command that changes system state (restarts, deletes, config edits), check that the evidence actually supports that specific action. A signal that pattern-matches to a known failure may have a different cause.

1-4Prevent overplanning

To keep the model from planning in circles on ambiguous tasks, add the instruction below. The last sentence matters: it does not apply to thinking blocksThe record of the model's internal reasoning before it produces an answer. In Fable 5, only a summarized version is provided., so internal reasoning stays free.

Prompt

When you have enough information to act, act. Do not re-derive facts already established in the conversation, re-litigate a decision the user has already made, or narrate options you will not pursue in user-facing messages. If you are weighing a choice, give a recommendation, not an exhaustive survey. This does not apply to thinking blocks.

1-5Ground progress reports in evidence

In long autonomous runs, instruct the model to audit its progress against actual tool results. In Anthropic's testing, this instruction nearly eliminated false progress reports even on tasks designed to induce them.

Prompt

Before reporting progress, audit each claim against a tool result from this session. Only report work you can point to evidence for; if something is not yet verified, say so explicitly. Report outcomes faithfully: if tests fail, say so with the output; if a step was skipped, say that; when something is done and verified, state it plainly without hedging.

1-6Do not have it reproduce reasoning in the response text

Instructions that move or explain internal reasoning in the response text ("show your thinking," "explain your reasoning process," and the like) can trigger the reasoning_extraction refusalA response in which the model declines to process a request for safety reasons. In the API it is surfaced as stop_reason: "refusal". category and increase fallbackfallback. A backup path that automatically hands processing to another model when the primary model refuses or fails. to Opus 4.8.

When migrating, audit existing skills and system prompts and remove reflection / show-your-thinking style instructions
If you need reasoning visibility, read the structured thinking blocks from adaptive thinking
If you need to surface progress during a long run, use a send-to-user tool (Part 2-7)

A must-check migration item. This is the most common risky instruction left over in skills from the old-model era.

1-7Don't give only the request; include the reason (the intent)

Fable 5 performs better when it understands the intent behind a request. Rather than guessing the intent on its own, it connects to the relevant information. This is especially effective for long-running agents handling multiple workstreams.

Template

I'm working on [the larger task] for [who it's for]. They need [what the output enables]. With that in mind: [request].

1-8Write the final summary as a re-grounding

In long, tool-heavy agent conversations, you can get arrow-chain shorthand, excessive implementation detail, and references to thinking the user never saw. Have the model write the final summary not as a continuation of the working thread, but as a re-grounding for a reader seeing this for the first time.

Prompt

Terse shorthand is fine between tool calls (that's you thinking out loud, and brevity there is good). Your final summary is different: it's for a reader who didn't see any of that.

If you've been working for a while without the user watching (overnight, across many tool calls, since they last spoke), your final message is their first look at any of it. Write it as a re-grounding, not a continuation of your working thread: the outcome first, then the one or two things you need from them, each explained as if new. The vocabulary you built up while working is yours, not theirs; leave it behind unless you re-introduce it.

When you write the summary at the end, drop the working shorthand. Write complete sentences. Spell out terms. Don't use arrow chains, hyphen-stacked compounds, or labels you made up earlier. When you mention files, commits, flags, or other identifiers, give each one its own plain-language clause. Open with the outcome: one sentence on what happened or what you found. Then the supporting detail. If you have to choose between short and clear, choose clear.

1-9Refactor existing prompts and skills by stripping them back

Skills and instructions written for previous models can be too prescriptive for Fable 5 and can actually lower output quality. If baseline performance is better, consider removing outdated instructions. A capability improvement is a good occasion to reassess which instructions, tools, and guardrails are still genuinely needed. Fable 5 is also good at updating skills on the fly based on what it learned during the work.

Part 1 Checklist

1
Instead of a long "don't do" list, did you state the direction you want in one or two short principles?
2
Did you use high as the default effort, lower it for everyday tasks, and raise only the most critical tasks to xhigh?
3
Did you write down the boundary between "what may be done and what may not" so it doesn't do unrequested code cleanup, refactoring, or added features?
4
Did you make it report progress based only on actual execution results (tool output)?
5
Did you find and delete old instructions like "show your thinking" from prompts and skills (they can trigger refusals in Fable 5)?
6
Did you give not just the request but at least one line on "who it's for and what the output enables"?
7
Did you instruct the final report of a long task to be written for a "first-time reader" who can understand it immediately without having watched the process?
8
Did you revisit whether the detailed instructions and skills built for older models are still needed (over-instruction actually lowers quality)?

Part 2. Guide for Harness/Infra Engineers

Structural changes to apply when designing the agent harnessharness. The runtime that wraps the model and actually drives it. It is the software layer responsible for tool calls, permissions, timeouts, and conversation management., API integrations, and production pipelines.

2-1Tune infrastructure on the assumption of long turns

At high effort, a single request can take minutes, and autonomous runs can go for hours. This is one of the biggest changes teams encounter.

Adjust client timeouts, streaming, and user-facing progress indicators before migrating
Restructure the harness to check on runs asynchronously rather than blocking on them (e.g., scheduled jobs)
Pair with the overplanning-prevention instruction for ambiguous tasks (Part 1-4)

2-2Operate the safety classifiers and refusal/fallback

Fable 5 runs safety classifiers aimed at the following.

Offensive cybersecurity techniques (building exploits, malware, attack tools)
Biology/life-science content (experimental methods, molecular mechanisms)
Extraction of the model's summarized thinking

Well-intentioned security work or beneficial life-science work can also be caught as false positives. To automatically reroute refused requests, configure a server-side or client-side fallbackfallback. A backup path that automatically hands processing to another model when the primary model refuses or fails. to Opus 4.8. Without a handling path for stop_reason: "refusal" in production, those requests simply fail.

2-3Check for API parameter changes

Details are in the model introduction document; items to check when migrating:

adaptive thinkingThe approach in which the model adjusts the amount of internal thinking on its own based on problem difficulty. only (no extended thinking budget)
thinking output is summarized-only
the refusal stop reason and fallback handling

2-4Build a memory system

Fable 5 does especially well when it can record and reference lessons from previous runs. A note space as simple as a single Markdown file is enough.

Prompt

Store one lesson per file with a one-line summary at the top. Record corrections and confirmed approaches alike, including why they mattered. Don't save what the repo or chat history already records; update an existing note rather than creating a duplicate; delete notes that turn out to be wrong.

To bootstrap from existing history:

Prompt

Reflect on the previous sessions we've had together. Use subagents to identify core themes and lessons, and store them in [X]. Make sure you know to reference [X] for future use.

2-5Design for parallel subagents

Fable 5 dispatches and manages parallel subagentssubagent. An auxiliary AI instance that the main agent hands off part of the work to. It works in parallel in its own independent context. more aggressively and more reliably than previous models.

Have it use subagents often, and provide explicit guidance on when delegation is appropriate
Prefer asynchronous communication over blocking between the orchestratororchestrator. The main agent in the conductor role, which divides work among multiple subagents and consolidates the results. and its subagents
Long-lived subagents that retain context save time and cost through cache reads, and also avoid being bottlenecked by the slowest subagent

Prompt

Delegate independent subtasks to subagents and keep working while they run. Intervene if a subagent goes off track or is missing relevant context.

2-6Explicit self-verification: a fresh-context verification subagent

A separate verification subagent with fresh context is better than self-critique. For long-running work, instruct as follows.

Prompt

Establish a method for checking your own work at an interval of [X] as you build. Run this every [X interval], verifying your work with subagents against the specification.

2-7Build a send-to-user tool

For long, asynchronous agents, build a client tool that delivers, without ending the turn, a message the user must see exactly as written. Targets: partial deliverables (a generated code snippet, a draft message), progress updates with concrete numbers, and direct answers to questions that came in mid-run. Tool input is not summarized, so the content arrives intact.

Tool Definition (JSON)

{
  "name": "send_to_user",
  "description": "Display a message directly to the user. Use this for progress updates, partial results, or content the user must see exactly as written before the task finishes.",
  "input_schema": {
    "type": "object",
    "properties": {
      "message": {
        "type": "string",
        "description": "The content to display to the user."
      }
    },
    "required": ["message"]
  }
}

The tool definition alone is not enough: without phrasing in the system prompt that elicits the call, it will rarely call it
Leaking narration or internal reasoning through this tool defeats its purpose (user-facing content only)
For an agent that only narrates routine progress, the model's own summary is enough (no tool needed)

Elicitation Prompt

Between tool calls, when you have content the user must read verbatim (a partial deliverable, a direct answer to their question), call the send_to_user tool with that content. Use send_to_user only for user-facing content, not for narration or reasoning.

2-8Handling edge cases: early termination and context worries

Patterns that appear rarely in the later part of long sessions, and how to handle them:

Pattern	Response
Stops after stating intent	Ends the turn after only saying "now I'll do X," with no tool call. A short nudge like "continue" / "go ahead and do it end to end." Operate it paired with the checkpoint instruction (Part 1-1)
Unnecessary permission requests	Asks again for permission when it could proceed. Add the system reminder below for autonomous pipelines
Context-budget worries	Suggests a new session, proposes a summary handoff, or reduces the work. It is best if the harness does not expose the remaining token countdown; if it must, add a reassurance line

System reminder for autonomous pipelines:

System Reminder

You are operating autonomously. The user is not watching in real time and cannot answer questions mid-task, so asking "Want me to…?" or "Shall I…?" will block the work. For reversible actions that follow from the original request, proceed without asking. Offering follow-ups after the task is done is fine; asking permission after already discussing with the user before doing the work is not. Before ending your turn, check your last paragraph. If it is a plan, an analysis, a question, a list of next steps, or a promise about work you have not done ("I'll…", "let me know when…"), do that work now with tool calls. End your turn only when the task is complete or you are blocked on input only the user can provide.

Context reassurance line:

Prompt

You have ample context remaining. Do not stop, summarize, or suggest a new session on account of context limits. Continue the work.

2-9Start at the top of the difficulty range

Pick tasks harder than what you would hand to previous models, and let Fable 5 scope them itself, ask clarifying questions, and execute. If you evaluate it only with easy tasks, you will underestimate the range of its capability.

Part 2 Checklist

1
Did you adjust timeouts, streaming, and progress indicators so a single request taking several minutes doesn't get cut off?
2
Did you build a fallback path that automatically hands off to Opus 4.8 when the model refuses a request?
3
Did you reflect the changed API options (adaptive thinking only, thinking provided as a summary only, no thinking budget) in your code?
4
Did you provide a note space (a markdown file is enough) and recording rules for the model to write down lessons it learns while working?
5
Did you set criteria for using subagents to divide the work, and build a structure that doesn't stall waiting for them to finish?
6
Did you have the work checked periodically by a separate verification agent with clean context, rather than by the model itself?
7
Did you build a channel (the send_to_user tool) to deliver intermediate results to the user verbatim even mid-task, and include instructions on "when to use it"?
8
Are you not showing the remaining token count to the model? If you must, did you add a "context is ample, keep going" reassurance line?
9
Did you add a "don't ask, just carry it through to the end" reminder to automated pipelines that run unattended?

Appendix Reference

A record of how this guide came to be. Opus 4.8 summarized the original (Appendix A), and Fable 5 verified that summary against the original (Appendix B) before reorganizing and writing the entire body itself. In effect, the model refined its own user manual.

How this guide was made

Source
Anthropic's official documentation"Prompting Claude Fable 5" — the official prompting guide for Fable 5 (platform.claude.com)
Summary
Opus 4.8 key-point summaryOpus 4.8 produced a condensed summary of the original document → Appendix A
Review
Fable 5 verificationFable 5 checked the summary against the original paragraph by paragraph and found two significant omissions → Appendix B
Rework
Practitioner's guide rewrite (Markdown)Fable 5 (high effort) rewrote the entire guide in a two-part structure by reader — the Markdown document came first
Deploy
Web build and autonomous deploymentFrom that Markdown, the model built the web pages (KO/EN, SEO meta, OG image, glossary tooltips) and deployed them to gonnector.com on its own — about three hours elapsed end to end, with just over an hour of actual working time (2026-07-04)

▶Appendix A. Opus 4.8's original key-point summary (provided by Dylan, 2026-07-04)

This document is a prompting and scaffolding guide for Claude Fable 5 / Mythos 5 (the generation after the previous Claude Opus 4.8). The core idea is that "because the model has become far more capable, the old approach of fine-grained instruction and building guardrails can actually get in the way." Below is a systematic summary of the key points.

The big picture: what changed

Fable 5 does well on problems that were too complex, too long-running, or too ambiguous for previous models, especially end-to-end work that would take a person from hours to weeks. Interestingly, the document emphasizes that "the teams that applied it to their hardest unsolved problems saw the best results, and if you test it only with easy tasks you will underestimate the model's range of capability." In other words, the direction of prompting philosophy has shifted from "control the model tightly" to "trust its capability and let it loose."

Where things need to change from the old way (and why)

Instruct with one short principle instead of detailed enumeration (Strong instruction following): With previous models you had to spell out each desired behavior one by one (preventing verbosity, forbidding specific formats, and so on). Fable 5's instruction-following has improved, so instead of enumerating desired behaviors by name you can steer most of it with a single short principle. The reason is that the model generalizes and applies the intent. For example, the single sentence "Lead with the outcome" is as effective as listing several forbidden patterns.
Strip back old prompts and skills instead (Refactor existing prompts): Skills and instructions built for past models are too prescriptive for Fable 5 and can actually lower output quality. The recommendation is to consider removing outdated instructions when baseline performance is better. Now that capability has risen, it's a good time to reassess which instructions, tools, and guardrails are still genuinely needed.
Use effort as the primary control knob: In Fable 5, effort is the primary control for the trade-off between intelligence, latency, and cost. Use high as the default for most tasks, xhigh for the most capability-critical tasks, and medium/low for everyday tasks. Even low effort often exceeds the xhigh of previous models. If a task takes longer than it needs to or you want to work more interactively, lower the effort.
Explicitly prevent over-cleanup and refactoring (State the boundaries): At high effort, Fable 5 can add unrequested cleanup, refactoring, abstractions, and unnecessary error handling on its own. So it has newly become important to state boundaries like "don't add features/abstractions beyond what's required, and don't add defensive code for scenarios that cannot happen." It also recommends adding constraints like "when the user is describing a problem, asking a question, or thinking out loud, the deliverable is your 'assessment,' not a change. Don't fix anything until they ask."
Have it report progress grounded in evidence during long runs (Ground progress claims): For long autonomous runs, instruct the model to audit its progress against actual tool results. In Anthropic's testing, this instruction nearly eliminated false progress reports. The pattern is "state explicitly what is not yet verified, and if a test fails say it failed along with the output."
Don't have the model reproduce reasoning in the response text (a very important change): In the past, instructions like "show/explain your thinking process" were common. In Fable 5, instructions that move or explain internal reasoning in the response text can trigger the reasoning_extraction refusal category and increase fallback to Opus 4.8. When migrating, you should audit existing skills and system prompts and strip out "show your thinking" style instructions. If you need reasoning visibility, read the structured thinking blocks from adaptive thinking instead.
Give the reason (intent) too (Give the reason, not only the request): Fable 5 performs better when it understands the intent behind a request. If you also provide "who this larger task is for and what its output enables," the model connects to the relevant information rather than guessing the intent on its own. This is especially effective for long-running agents handling multiple workstreams.

What to newly introduce to match the new capabilities

Infrastructure preparation for longer turns. Hard tasks can take minutes for a single request and hours for autonomous runs. Adjust client timeouts, streaming, and progress indicators, and restructure the harness to check on runs asynchronously (scheduled jobs, etc.) instead of blocking. At the same time, to prevent overplanning on ambiguous tasks, add an instruction like "when you have enough information to act, act; don't re-litigate decisions already made."

Building a memory system. Fable 5 does especially well when it can record and reference lessons from previous runs. Give it a note space as simple as a single Markdown file and rules like "one lesson per file, a one-line summary at the top, update instead of duplicating, delete notes that turn out wrong." You can also bootstrap by having subagents review past sessions.

Aggressive use of parallel subagents. Fable 5 dispatches and manages parallel subagents better and more aggressively than before. Delegate often, prefer asynchronous communication over blocking, and get cache benefits from long-lived subagents that retain context.

Explicit self-verification. A separate verification subagent with fresh context is better than self-critique. Instruct it to "verify against the specification with subagents at regular intervals."

Creating a send-to-user tool. For long asynchronous agents, build a client tool that delivers, without ending the turn, a message the user must see (partial deliverables, direct answers to questions) exactly as written. Tool input is not summarized, so the content arrives intact. But the tool definition alone is not enough; you must also include phrasing in the system prompt that elicits the call, and you must not leak narration/internal reasoning through it.

Rare but worth-knowing edge cases

In the later part of long sessions, Fable 5 may (1) stop after only stating intent ("now I'll do X") with no tool call, (2) ask again for permission when it could already proceed, or (3) worry about the context budget and suggest a new session or try to reduce the work. Handle each with a short nudge like "continue / carry it through to the end," a system reminder for autonomous pipelines ("you are operating autonomously and the user can't watch in real time, so proceed without asking for reversible actions"), and a reassurance line like "context is ample, don't stop or summarize." The third case often occurs when the harness exposes the remaining token countdown to the model, so it's best not to expose that number when possible.

Communicating with the user (readability)

In long, tool-heavy agent conversations, Fable 5 can produce hard-to-follow text: arrow-chain shorthand (A → B → fails), excessive implementation detail, references to thinking the user never saw. Write the final summary not as "a continuation of the working thread" but as "a re-grounding for a reader seeing this for the first time." That is, lead with the outcome in complete sentences, drop the shorthand and self-made vocabulary, and give files, commits, and flags each their own plain-language explanation.

One-line summary

Where you used to control, enumerate, and defend the model in fine detail, with Fable 5 the core is to (a) instruct with short principles, (b) strip back the prescriptive skills of the past, (c) tune with effort, (d) put in place scaffolding for long autonomous runs such as progress-evidence verification, memory, subagents, and send-to-user, and (e) observe the newly introduced constraints like "don't reproduce reasoning as text." All of it converges on one principle: "the model has become far more capable, so don't block that capability, put it to use."

▶Appendix B. JARVIS's review analysis of the Opus 4.8 summary (Fable 5, 2026-07-04)

Review method: Downloaded the original markdown directly and compared it against the summary paragraph by paragraph. Completeness assessment: about 85% (opinion).

What was done well

The four-tier effort operation, short-principle-based instruction, stripping back old prompts, grounding progress reports in evidence, the ban on reproducing reasoning and the reasoning_extraction refusal, the memory system, parallel subagents, the send-to-user tool (with the caveat that the definition alone is not enough), the early-termination and context-worry edge cases, and the readability guidance all match the original. No factual distortion or fabricated content.

Observation from Fable 5's perspective: Confirmed that the example snippets in this document are included almost verbatim in the system prompt of the Fable 5 deployed in Claude Code ("Lead with the outcome...," "When you have enough information to act, act...," the autonomous-run reminder, etc.). The document's example prompts are not theory but actual deployed phrasing, and the summary accurately captured that skeleton.

Major omission 1: guidance on the safety classifiers and refusal/fallback operation

According to the original, Fable 5 runs safety classifiers aimed at three areas: (a) offensive cybersecurity, (b) biology/life-science, and (c) extraction of summarized thinking; well-intentioned security work or beneficial life-science work can be caught as false positives; and refused requests should be configured to auto-reroute via fallback to Opus 4.8. The summary mentions only (c), reasoning_extraction, and omits (a), (b), and the recommendation to configure fallback entirely. Without this knowledge, when a refusal occurs in production you can't diagnose the cause or respond, so for teams in security/bio-adjacent domains this is information that governs migration decisions.

Major omission 2: the entire list of 7 capability improvements dropped

The original specifically enumerates the areas of improvement over Opus 4.8: long-horizon autonomy, first-shot accuracy, Vision (trained to handle flipped or blurry images with bash/crop tools), enterprise workflows (financial analysis, spreadsheets, slides), code review/debugging (improved recall, but excluding the cybersecurity domain), navigating ambiguity, and delegation/collaboration. The summary keeps only the general point "does well on end-to-end work" and drops all the specifics. This list is practical material for deciding "what tasks to put Fable 5 on."

Secondary omissions (less critical)

Pointers to API parameter changes: adaptive thinking only, thinking output summarized-only, no extended thinking budget
The checkpoint instruction snippet: "pause only for a destructive/irreversible action, a real scope change, or input only the user can provide" (the original explicitly says to use it paired with the early-termination handling)
Minor caveats: the overplanning-prevention instruction does not apply to thinking blocks, send-to-user is unnecessary for an agent that only does routine narration, Fable 5 is good at updating skills itself during work, and long-lived subagents also avoid the slowest-subagent bottleneck

On structure: Item 4 of the summary merges two different sections of the original (the over-refactoring-prevention snippet from the effort section + the unrequested-behavior-prevention from the State the boundaries section) into one item. There is no distortion of content, but it is a source of confusion for tracing the original structure.

Three options for remediation (as proposed at the time)

Keep the current summary + patch in the 2 omitted blocks (recommended at the time): completes it with minimal edits
Reorganize into two parts by reader: split into for prompt authors / for harness engineers. Because the actions diverge by who executes, this is superior for practical deployment (adopted by Dylan; this document is the result)
Summary + a section-mapping table to the original: blocks omission risk, but the standalone value of the summary declines

Purpose of This Document

Prompt Authors

Harness/Infra Engineers

What Changed

Related Links

Part 1. Guide for Prompt Authors

1-1Instruct with one short principle instead of an enumeration

1-2Make effort the primary control knob

1-3State the boundaries: prevent over-work and unrequested behavior

1-4Prevent overplanning

1-5Ground progress reports in evidence

1-6Do not have it reproduce reasoning in the response text

1-7Don't give only the request; include the reason (the intent)

1-8Write the final summary as a re-grounding

1-9Refactor existing prompts and skills by stripping them back

Part 1 Checklist

Part 2. Guide for Harness/Infra Engineers

2-1Tune infrastructure on the assumption of long turns

2-2Operate the safety classifiers and refusal/fallback

2-3Check for API parameter changes

2-4Build a memory system

2-5Design for parallel subagents

2-6Explicit self-verification: a fresh-context verification subagent

2-7Build a send-to-user tool

2-8Handling edge cases: early termination and context worries

2-9Start at the top of the difficulty range

Part 2 Checklist

Appendix Reference

How this guide was made

The big picture: what changed

Where things need to change from the old way (and why)

What to newly introduce to match the new capabilities

Rare but worth-knowing edge cases

Communicating with the user (readability)

One-line summary

What was done well

Major omission 1: guidance on the safety classifiers and refusal/fallback operation

Major omission 2: the entire list of 7 capability improvements dropped

Secondary omissions (less critical)

Three options for remediation (as proposed at the time)