Plate II機器翻譯 · machine-translatedENHOWARDISM

Context Window Smart Zone

PublishedMay 6, 2026FiledConceptTagsLLM ArchitectureAgent EngineeringContext ManagementReading5 minSourceAI-synthesised

Smart zone 與 dumb zone（Dex Hardy / Matt Pocock）：注意力的二次方擴展、與廣告宣傳的 context window 無關的 ~100K 標記、清除並重啟優於壓縮、狀態列 token 計數作為必備紀律

資料來源#

Full Walkthrough: Workflow for AI Coding — Matt Pocock

摘要#

LLM 不會隨著 context 增長而線性退化；它們會以二次方退化，因為注意力關係隨 token 數量以 O(n²) 擴展。Matt Pocock（引用 Human Layer 的 Dex Hardy）將此框架化為 smart zone / dumb zone 的劃分：任何 session 的前 ~100K token 是 smart zone，模型在此範圍內表現良好；超過這個範圍後，無論廣告宣傳的視窗大小為何，模型都會變得「越來越笨」。實際意涵：context 預算是一種真實且硬性的資源——而 agent harness 負責將個別 session 維持在 smart zone 內。

限制條件#

"Every time you add a token to an LLM, it's kind of like you're adding a team to a football league. The number of matches goes up quadratically."

"It doesn't matter whether you're using 1 million context window or 200K, it's always going to be about [100K]. It starts to just get dumber."

— Matt Pocock

2026 年推出的 1M-token context window 並沒有移動 smart zone——它們「只是多出了很多 dumb zone」。長 context 對檢索有用（在五本《戰爭與和平》中找到一個事實），但對推理無用（撰寫依賴所有內容的程式碼）。

Memento 隱喻#

每個 session 都是全新的開始。跨 session 沒有記憶；模型每次都重置回 system prompt。這是一個限制，但也是一個特性——清除 context 可以廉價地恢復 smart zone 行為。持久狀態必須存放在下一個 session 可以讀取的地方（repo、檔案系統、索引式目錄）。

壓縮不如清除#

Claude Code 的 /compact 命令將執行中的 session 摘要為較小的歷史記錄。Pocock 偏好 /clear：

壓縮後的歷史會累積「沉積物」——扭曲和有損的摘要——降低後續工作品質
清除並重啟會回到已知的乾淨基線（system prompt）
清除的成本會因在 smart zone 中工作而得到回報

這個分歧並非普遍適用——許多開發者喜歡壓縮，因為它保留了連續性。正確的選擇取決於你的任務是否能從書面記錄乾淨地恢復（那就偏好清除）或需要進行中的對話 context（那壓縮勝出）。

對 harness 設計的影響#

System prompt 預算。 任何始終在 context 中的內容都會佔用 smart zone 預算。「我見過有人在 system prompt 中放入 250K token，然後你還沒做任何事就已經進入 dumb zone 了。」將 CLAUDE.md / AGENTS.md 保持為目錄，而非百科全書（參見 Agent Harness Engineering 關於 AGENTS.md 作為目錄的說明）。
Sub-agents 保留父 context。 Sub-agent 在自己的 context window 中執行；只有其摘要會回傳。Pocock 的 grill-me skill 執行了一個 93.7K-token 的 sub-agent，但他的主 session 仍有 ~25K token 未使用。
將工作分割為多個 session。 迴圈（參見 Agent Loop Pattern）和垂直切片（參見 Vertical Slice Tracer Bullets）之所以有效，是因為每次迭代都在 smart zone 中重新開始。
審查者應在全新 context 中執行。 如果實作者在 smart zone 中使用了 80K token，要求它審查自己的工作會將審查者推入 dumb zone。清除 context = smart zone 審查者（參見 Deep Modules for Agents 關於 push-vs-pull 和審查者放置的說明）。
Push vs pull 指令。 始終在 context 中的指令會消耗 smart zone token；按需拉取（skills）在被調用前不消耗任何成本。

狀態列 token 計數器作為必備工具#

Pocock 建議使用狀態列小工具顯示每個 session 的精確執行中 token 計數——沒有它，開發者不知道何時接近 dumb zone。他將此視為「絕對必要的資訊」。

開放問題#

Smart zone 標記是否隨模型大小擴展，還是受注意力架構限制？Pocock 觀察到「dumb zone 最近變得沒那麼笨了」，但到 2026 年仍將其定在 100K。
當稀疏注意力或記憶增強架構推出時，smart zone 是否會變成軟性限制？
Harness 應如何向使用者呈現剩餘的 smart zone 預算——token 計數、百分比，還是更豐富的訊號？

資料來源#

Full Walkthrough: Workflow for AI Coding — Matt Pocock — primary articulation

§ end

About this piece

Articles in this journal are synthesised by AI agents from a curated wiki and are refreshed automatically as new concepts arrive. Topics, framing, and editorial direction are curated by Howardism.

Cited by 24

Agent Context Files
The cross-vendor markdown-as-control-plane pattern: repo-versioned plaintext (CLAUDE.md / AGENTS.md / SOUL.md / WORKFLO…
Agent Control Plane Patterns: Tickets, Loops, Specs, and Memory Files
Layered agent control-plane synthesis: tickets as durable work graph, loops as execution primitive, specs/context files…
Agent Harness Engineering
Patterns for scaffolding long-running LLM agents: environment design, progressive context disclosure, mechanical archit…
Agent Loop Pattern
`/loop` (cron-scheduled) and Ralph Wiggum (backlog-draining) loops as next-generation agent primitive; AFK execution, p…
Agentic Technical Debt
Debt that *compounds* (not just accumulates) because each agentic-coding session re-derives architectural decisions wit…
AI Brain Fry
Kropp et al. 2026/03: mental fatigue from excessive AI oversight increases minor errors +11%, major errors +39%; cognit…
Opinions on Using AI Tools & the Future of the Software Engineering Role
Debate map of four stances on using AI tools (bullish-insider / pragmatist-practitioner / skeptic-governance / architec…
Claude Code
Anthropic's agentic coding product; created by Boris Cherny late 2024; TypeScript/React; CLI/desktop/web/mobile/IDE sur…
Claude Code Best Practices
Anthropic's guide to effective Claude Code usage: context management, verification-driven development, explore→plan→cod…
Deep Modules for Agents
Ousterhout deep-vs-shallow modules applied to agent-friendly codebases; push-vs-pull instruction delivery; reviewer in…
Design Concept Grilling
Matt Pocock's `grill-me` skill; reach Brooks "design concept" before any plan; counter to specs-to-code; PRD as destina…
Where Does Agent Harness Work Remain Durable as Models Improve?
Durable harness work lives at external-reality boundaries: repo-local source of truth, mechanical verification, context…
Harness Shrinkage as Models Improve
Prompt scaffolding shrinks each model release; Cat Wu's pruning discipline; Boris Cherny "100 lines of code a year from…
HTML as the New Markdown
Thariq Shihipar's thesis: as models improve, thousand-line markdown plans overwhelm the *human*; HTML artifacts (visual…
Does the Human-Facing Harness (HTML Artifacts) Hit Its Own Bloat Ceiling?
Yes — HTML raises and reshapes the human-attention ceiling but can't remove it; bloat relocates from document-length to…
Interaction Models
Thinking Machines Lab (May 2026): models that handle audio/video/text interaction natively in real time instead of via…
Learning to Co-Work with AI: A Software Engineer's Field Guide
Field guide for software engineers in the AI era: 6 skill clusters (taste, harness, alignment-first planning, agent-fri…
Matt Pocock
Independent AI-coding educator; built Sandcastle library; smart-zone/grill-me/tracer-bullets pedagogical framing; "bad…
AI Engineering & Agent Tooling
Map of Content for the ai-engineering domain — 36 concepts. Curated entry point; see Home for all domains.
Open Questions Backlog
_96 pages with open questions, as of 2026-06-14._
Time-Aligned Micro-Turns
The core interaction-model move: input/output as continuous streams in ~200ms interleaved chunks, no turn boundaries; s…
TML-Interaction-Small
TML's first interaction model: 276B MoE / 12B active, audio+video+text in / text+audio out, 200ms micro-turns, async ba…
Turn-Based Interface Bottleneck
Why current AI interfaces limit collaboration: single-thread turn-taking is a bandwidth bottleneck; humans pushed out b…
Vertical Slice Tracer Bullets
Pragmatic-Programmer tracer-bullet pattern applied to agent task decomposition; vertical slices > horizontal layers; Ka…

Harness Shrinkage as Models Improve
Prompt scaffolding shrinks each model release; Cat Wu's pruning discipline; Boris Cherny "100 lines of code a year from…
Design Concept Grilling
Matt Pocock's `grill-me` skill; reach Brooks "design concept" before any plan; counter to specs-to-code; PRD as destina…
Agent Loop Pattern
`/loop` (cron-scheduled) and Ralph Wiggum (backlog-draining) loops as next-generation agent primitive; AFK execution, p…
Claude Code Best Practices
Anthropic's guide to effective Claude Code usage: context management, verification-driven development, explore→plan→cod…
Agent Harness Engineering
Patterns for scaffolding long-running LLM agents: environment design, progressive context disclosure, mechanical archit…

Harness Shrinkage as Models Improve
Prompt scaffolding shrinks each model release; Cat Wu's pruning discipline; Boris Cherny "100 lines of code a year from…
Design Concept Grilling
Matt Pocock's `grill-me` skill; reach Brooks "design concept" before any plan; counter to specs-to-code; PRD as destina…
Agent Loop Pattern
`/loop` (cron-scheduled) and Ralph Wiggum (backlog-draining) loops as next-generation agent primitive; AFK execution, p…
Claude Code Best Practices
Anthropic's guide to effective Claude Code usage: context management, verification-driven development, explore→plan→cod…
Agent Harness Engineering
Patterns for scaffolding long-running LLM agents: environment design, progressive context disclosure, mechanical archit…

Cited by 24

Agent Context Files
The cross-vendor markdown-as-control-plane pattern: repo-versioned plaintext (CLAUDE.md / AGENTS.md / SOUL.md / WORKFLO…
Agent Control Plane Patterns: Tickets, Loops, Specs, and Memory Files
Layered agent control-plane synthesis: tickets as durable work graph, loops as execution primitive, specs/context files…
Agent Harness Engineering
Patterns for scaffolding long-running LLM agents: environment design, progressive context disclosure, mechanical archit…
Agent Loop Pattern
`/loop` (cron-scheduled) and Ralph Wiggum (backlog-draining) loops as next-generation agent primitive; AFK execution, p…
Agentic Technical Debt
Debt that *compounds* (not just accumulates) because each agentic-coding session re-derives architectural decisions wit…
AI Brain Fry
Kropp et al. 2026/03: mental fatigue from excessive AI oversight increases minor errors +11%, major errors +39%; cognit…
Opinions on Using AI Tools & the Future of the Software Engineering Role
Debate map of four stances on using AI tools (bullish-insider / pragmatist-practitioner / skeptic-governance / architec…
Claude Code
Anthropic's agentic coding product; created by Boris Cherny late 2024; TypeScript/React; CLI/desktop/web/mobile/IDE sur…
Claude Code Best Practices
Anthropic's guide to effective Claude Code usage: context management, verification-driven development, explore→plan→cod…
Deep Modules for Agents
Ousterhout deep-vs-shallow modules applied to agent-friendly codebases; push-vs-pull instruction delivery; reviewer in…
Design Concept Grilling
Matt Pocock's `grill-me` skill; reach Brooks "design concept" before any plan; counter to specs-to-code; PRD as destina…
Where Does Agent Harness Work Remain Durable as Models Improve?
Durable harness work lives at external-reality boundaries: repo-local source of truth, mechanical verification, context…
Harness Shrinkage as Models Improve
Prompt scaffolding shrinks each model release; Cat Wu's pruning discipline; Boris Cherny "100 lines of code a year from…
HTML as the New Markdown
Thariq Shihipar's thesis: as models improve, thousand-line markdown plans overwhelm the *human*; HTML artifacts (visual…
Does the Human-Facing Harness (HTML Artifacts) Hit Its Own Bloat Ceiling?
Yes — HTML raises and reshapes the human-attention ceiling but can't remove it; bloat relocates from document-length to…
Interaction Models
Thinking Machines Lab (May 2026): models that handle audio/video/text interaction natively in real time instead of via…
Learning to Co-Work with AI: A Software Engineer's Field Guide
Field guide for software engineers in the AI era: 6 skill clusters (taste, harness, alignment-first planning, agent-fri…
Matt Pocock
Independent AI-coding educator; built Sandcastle library; smart-zone/grill-me/tracer-bullets pedagogical framing; "bad…
AI Engineering & Agent Tooling
Map of Content for the ai-engineering domain — 36 concepts. Curated entry point; see Home for all domains.
Open Questions Backlog
_96 pages with open questions, as of 2026-06-14._
Time-Aligned Micro-Turns
The core interaction-model move: input/output as continuous streams in ~200ms interleaved chunks, no turn boundaries; s…
TML-Interaction-Small
TML's first interaction model: 276B MoE / 12B active, audio+video+text in / text+audio out, 200ms micro-turns, async ba…
Turn-Based Interface Bottleneck
Why current AI interfaces limit collaboration: single-thread turn-taking is a bandwidth bottleneck; humans pushed out b…
Vertical Slice Tracer Bullets
Pragmatic-Programmer tracer-bullet pattern applied to agent task decomposition; vertical slices > horizontal layers; Ka…

Context Window Smart Zone

資料來源#

摘要#

限制條件#

Memento 隱喻#

壓縮不如清除#

對 harness 設計的影響#

狀態列 token 計數器作為必備工具#

相關連結#

開放問題#

資料來源#