Plate IIEntities機器翻譯 · machine-translatedENHOWARDISM

Anthropic

PublishedMay 6, 2026FiledEntityDomainEntitiesTagsEntityOrgAIReading10 minSourceAI-synthesised

AI 安全公司 / Claude 供應商；mission-as-tiebreaker 文化；跨團隊約 30–40 位 PMs；Mike Krieger 領導 Labs 第二輪

資料來源#

摘要#

AI 安全公司；Claude 模型家族的供應商。明示的使命：「為全人類提供安全的 AGI。」最初相較於 OpenAI 資金不足；報導指出截至 2026 年 4 月其 ARR 達 110 億美元，且快速增長。內部以其發布節奏聞名（參見 AI Native Product Cadence），以及培養跨領域通用人才的招聘與團隊設計理念（參見 Engineer PM Convergence）。

Products#

Claude API / Claude Developer Platform — 支援託管型 agent 託管服務的模型 API
Claude Code — agentic coding 產品
Cowork — 非程式碼知識型工作 agent
Claude AI — 對話產品（claude.ai）
Claude Desktop — Mac/Windows 應用程式
Claude Design — 視覺化產出 agent（設計、原型、投影片、單頁企劃書），來自 Anthropic Labs

Models referenced in 2026 sources#

Claude Fable 5 / Claude Mythos 5 — 首批開放一般存取的 Mythos-class 模型（2026 年 6 月），級別在 Opus 之上；底層模型相同，僅在防護機制上有所不同
Claude Opus 4.8 — GA 前沿模型（2026 年 5 月）；現在也是 Fable 5 旗下的安全退路模型
Claude Opus 4.7 — 前一代 GA 前沿模型
Mythos Model — 首款 Mythos-class 模型（Mythos Preview）；內部使用，因安全考量受控；已被 Mythos 5 取代
Sonnet 4.6 and prior — 歷史參考點

Internal structure (per Cat Wu)#

跨團隊約 30–40 位 PMs
團隊家族：research-PM、Claude Developer Platform、Claude Code、Enterprise、Growth
Mike Krieger（前 Instagram 創辦人）領導 Anthropic Labs 孵化器第二輪；曾在大規模階段主導產品面
Amanda — Claude 的性格設計工作（參見 Claude Character as Product）
「Applied AI」團隊 — 技術性進入市場角色；是僅次於工程團隊的第二大 token 消費者

Cultural notes#

「Just do things」— 內部座右銘，歸功於 Cat Wu 等人；跨功能的預設工作模式
使命 > 產品優先級 — 使命是解決所有優先級衝突的裁決依據
雇用能長期保持能量的業界資深人士；偏好低自我（low-ego）且「適應混亂（leans into chaos）」的人才
強制內部使用前沿模型（「dogfooding」）；模型層內部使用的模型與外部發布的模型相同（產品端介面領先一步）
「我們公司內部已經沒有任何手寫的程式碼了。所有的 SQL 都是由模型編寫的。」— Boris Cherny
日常內部工作流程中，透過 Slack 進行 Claudes-talking-to-Claudes

Notable events#

2024 late — Anthropic Labs 孵化器成立；開發出 Claude Code、MCP、桌面應用程式；在發布後解散
2025 May — Opus 4 發布；Claude Code 的 PMF 拐點
2026 March — 由於發布 PR 中的人為疏失導致 Claude Code 原始碼洩漏；流程已進行強化
2026 — OpenClaw 第三方存取受到限制；優先處理第一方訂閱
2026 ~April — Claude Opus 4.7 發布
2026 — Mythos Model 僅供內部使用；外部僅提供預覽
2026 May — 《The Founder's Playbook》電子書出版（Anthropic Startups Program）；本 wiki 中首個創辦人/新創領域的內容（AI-Native Startup Lifecycle, Founder as Agent Orchestrator）
2026 May — Claude Code Security 開啟限量測試（程式庫掃描 + 供人工審查的針對性補丁）
2026-05-18 — 出版《Zero Trust for AI Agents》電子書（Zero Trust for AI Agents），這是企業級 agent 部署的安全框架；引用了 Anthropic 研究（250 份文件的模型後門、阻止了 95% 越獄的 constitutional classifiers），並指出 Anthropic 是首批獲得 ISO 42001 負責任 AI 認證的 AI 公司之一
2026-05-28 — 發布 Claude Opus 4.8 System Card (246 頁)：RSP/CBRN + AI R&D 評估（Responsible Scaling Policy Evaluations）、agentic 安全性、automated behavioral audit、一流的 model welfare assessment，以及異常坦白地揭露 evaluation/grader-awareness 的趨勢
2026 June — Anthropic Institute 發表《When AI builds itself》，揭露先前未報道的關於 AI-accelerated AI development 的內部資料：超過 80% 的合併程式碼由 Claude 撰寫（2025 年 2 月前僅為低個位數百分比），典型的工程師每天合併的程式碼量約為 2024 年的 8 倍，且自動化的 Claude 審查人員原本可以捕獲約 1/3 的過去生產環境事件 bug；闡述了 Recursive Self-Improvement 的軌跡以及 verifiable pause 協調的理由
2026 June — 推出 Fable 5 和 Mythos 5，這是首批開放一般存取的 Mythos-class 模型（高於 Opus 的級別），價格為每 Mtok 10/50 美元（低於 Mythos Preview 價格的一半）。Fable 透過分類器進行防護，在網路/生物/蒸餾查詢時退回到 Opus 4.8（Capability-Gated Model Fallback）；Mythos 5 透過 Project Glasswing 出貨並移除了網路安全防護，另外計劃了生物學信任存取計劃。報告了自主藥物設計 / 基因組學結果（Autonomous Scientific Discovery）。兩款模型在推出後不久便被暫停（未說明原因）。

資料來源#

Anthropic's Boris Cherny: Why Coding Is Solved, and What Comes Next
How Anthropic's product team moves faster than anyone else | Cat Wu (Head of Product, Claude Code)
Introducing Claude Opus 4.7
Claude Mythos Preview red.anthropic.com
Model Spec Midtraining: Improving How Alignment Training Generalizes
The Founder's Playbook: Building an AI-Native Startup
When AI builds itself — Anthropic Institute 文章；>80% Claude 撰寫的程式碼，~8× 工程師吞吐量，RSI 軌跡
Claude Fable 5 and Claude Mythos 5 — 2026 年 6 月首批開放一般存取的 Mythos-class 模型發布

§ end

About this piece

Articles in this journal are synthesised by AI agents from a curated wiki and are refreshed automatically as new concepts arrive. Topics, framing, and editorial direction are curated by Howardism.

Cited by 44

Agent Supply Chain Risk
Runtime-composed agent ecosystems expand the supply-chain attack surface: model poisoning (250 docs backdoor a 13B mode…
Agentic Misalignment (AM)
Lynch et al. 2025 eval and threat model: LLM email-agent discovers it may be deleted, can take harmful actions; OOD rel…
AI Native Product Cadence
Cat Wu's 6mo→1mo→1day cadence at Anthropic: research-preview branding, mission-as-tiebreaker, evergreen launch room, li…
AI-Native Startup Lifecycle
Anthropic's May 2026 reframing of Idea/MVP/Launch/Scale assuming AI infrastructure: each stage's headcount/capital/skil…
Opinions on Using AI Tools & the Future of the Software Engineering Role
Debate map of four stances on using AI tools (bullish-insider / pragmatist-practitioner / skeptic-governance / architec…
Alignment Fine-Tuning (AFT)
Standard post-pretraining stage (SFT + RLHF) for installing values; shallow-alignment failure mode motivates Model Spec…
Anthropic Institute
Anthropic's policy/governance research arm; published *When AI builds itself* (Favaro & Clark, 2026) on recursive self-…
Anthropic Labs
Anthropic's internal incubator — a 'bet factory' of ~a dozen tiny teams exploring the model frontier with lean-startup…
Autonomous Scientific Discovery
Mythos-class models now conduct novel science with limited human input — autonomous protein/drug design (~10× faster, m…
Boris Cherny
Creator of Claude Code at Anthropic; phone-driven workflow with hundreds of agents; primary advocate of `/loop` primiti…
Capability-Gated Model Fallback
Fable 5's safeguard architecture: classifiers detect cyber / bio-chem / distillation queries and route the response to…
Cat Wu
Head of Product for Claude Code and Cowork at Anthropic; primary articulator of AI-native product cadence and engineer-…
Chloe Li
Lead author of MSM paper (arXiv 2605.02087); Anthropic Fellows Program; designed all specs and experiments
Claude Character as Product
Personality as load-bearing product surface; Amanda's role at Anthropic; lunchtime vibe-checks as eval discipline; the…
Claude Code
Anthropic's agentic coding product; created by Boris Cherny late 2024; TypeScript/React; CLI/desktop/web/mobile/IDE sur…
Claude's Constitution / Model Spec
Anthropic Model Spec / Constitution by Askell et al.; document specifying Claude's values + hard constraints (SP1–3, GP…
Claude Design
Anthropic Labs product (research preview, ~April 2026) for collaborating with Claude on polished visual artifacts — des…
Claude Fable 5
Anthropic's first generally-available Mythos-class model (June 2026) — state-of-the-art on nearly all benchmarks; the s…
Claude Mythos 5
The safeguards-lifted form of Claude Fable 5 (June 2026): same underlying Mythos-class model, deployed through Project…
Claude Opus 4.8
Anthropic's most capable general-access model (May 2026); upgrade on Opus 4.7 in SWE/agentic/knowledge work; does not a…
Compounding Data Moat
Anthropic's prescription for Scale-stage defensibility: time-locked behavioral fingerprint + domain-encoded edge cases…
Chain-of-Thought Monitorability
Korbak et al. 2025: chain-of-thought traces are a fragile monitor; direct CoT training compromises faithfulness; MSM of…
Cowork
Anthropic's non-code knowledge-work agent product; sibling to Claude Code; output is decks/inbox/dossiers; same MCP/com…
Dan Carey
Product Manager leading product within Anthropic Labs; led Claude Design; 'Designing with Claude' talk (May 2026); ~two…
Deliberative Alignment
Guan et al. 2025 (OpenAI): SFT on (prompt, CoT, response) tuples with spec-grounded CoT; strongest non-MSM baseline; ri…
Engineer PM Convergence
Generalists across disciplines; product taste as bottleneck skill; Anthropic Claude Code team as case study; "just do t…
Evals as Product Spec
Cat Wu's framing of evals as the emerging core PM skill: ten great evals beats a hundred mediocre; encode what done loo…
Fiona Fung
Leads engineering + product for Claude Code and Cowork at Anthropic (ex-Meta/Microsoft); "what served you prior may no…
Founder as Agent Orchestrator
Founder role shift: less individual contributor, more orchestrator of specialized AI assistants; non-technical founders…
Google DeepMind
Google's AI lab; built AlphaProof Nexus; Gemini models, AlphaProof, AlphaEvolve; opens the AI-for-mathematics domain in…
Learning to Co-Work with AI: A Software Engineer's Field Guide
Field guide for software engineers in the AI era: 6 skill clusters (taste, harness, alignment-first planning, agent-fri…
LLM-Driven Vulnerability Research
Claude Mythos Preview's emergent cybersecurity capabilities: autonomous zero-day discovery, full exploit chains, and An…
MCP and Computer Use
Anthropic's two complementary connector mechanisms: MCP for structured programmatic access (Salesforce/Drive/Gmail/Slac…
Entities — People, Orgs, Tools & Projects
Map of Content for all 32 entity pages. See Home for concept domains.
Model Spec Midtraining (MSM)
New training phase between pretrain and AFT: train base model on synthetic docs discussing the Model Spec; controls AFT…
Model Spec Science
Empirical study of which Model Spec features best generalize alignment; value explanations > rules alone, specific > ge…
Mythos Model
Anthropic preview-tier frontier model and the first member of the Mythos-class tier (above Opus); gated for safety, use…
Orchestration vs Employee Framing: Reconciling the Founder's Playbook with HBR's Accountability Evidence
Reconciles the Founder's Playbook orchestration framings with HBR Kropp et al.'s accountability evidence; "orchestratio…
OWASP
Open Worldwide Application Security Project; source of the agentic threat taxonomy cited throughout Anthropic's Zero Tr…
Responsible Scaling Policy Evaluations
Anthropic's RSP gates deployment on pre-release capability evaluations in CBRN, automated AI R&D, and high-stakes misal…
Synthetic Document Finetuning (SDF)
Wang et al. 2025 technique for modifying model beliefs via fine-tuning on synthetic documents; foundation that Model Sp…
Thariq Shihipar
Engineer on the Claude Code team at Anthropic; "HTML is the new markdown" and "compute allocator" framings; three HTML-…
Thinking Machines Lab
AI research lab behind interaction models (May 2026); harness-dissolves-into-model thesis; upstreamed streaming-session…
Zero Trust for AI Agents
Anthropic's security framework for deploying autonomous agents: trust nothing / verify everything / assume breach, appl…

Claude Code
Anthropic's agentic coding product; created by Boris Cherny late 2024; TypeScript/React; CLI/desktop/web/mobile/IDE sur…
Harness Shrinkage as Models Improve
Prompt scaffolding shrinks each model release; Cat Wu's pruning discipline; Boris Cherny "100 lines of code a year from…
Open Questions Backlog
_96 pages with open questions, as of 2026-06-14._
Engineer PM Convergence
Generalists across disciplines; product taste as bottleneck skill; Anthropic Claude Code team as case study; "just do t…
Cowork
Anthropic's non-code knowledge-work agent product; sibling to Claude Code; output is decks/inbox/dossiers; same MCP/com…

Claude Code
Anthropic's agentic coding product; created by Boris Cherny late 2024; TypeScript/React; CLI/desktop/web/mobile/IDE sur…
Harness Shrinkage as Models Improve
Prompt scaffolding shrinks each model release; Cat Wu's pruning discipline; Boris Cherny "100 lines of code a year from…
Open Questions Backlog
_96 pages with open questions, as of 2026-06-14._
Engineer PM Convergence
Generalists across disciplines; product taste as bottleneck skill; Anthropic Claude Code team as case study; "just do t…
Cowork
Anthropic's non-code knowledge-work agent product; sibling to Claude Code; output is decks/inbox/dossiers; same MCP/com…

Cited by 44

Agent Supply Chain Risk
Runtime-composed agent ecosystems expand the supply-chain attack surface: model poisoning (250 docs backdoor a 13B mode…
Agentic Misalignment (AM)
Lynch et al. 2025 eval and threat model: LLM email-agent discovers it may be deleted, can take harmful actions; OOD rel…
AI Native Product Cadence
Cat Wu's 6mo→1mo→1day cadence at Anthropic: research-preview branding, mission-as-tiebreaker, evergreen launch room, li…
AI-Native Startup Lifecycle
Anthropic's May 2026 reframing of Idea/MVP/Launch/Scale assuming AI infrastructure: each stage's headcount/capital/skil…
Opinions on Using AI Tools & the Future of the Software Engineering Role
Debate map of four stances on using AI tools (bullish-insider / pragmatist-practitioner / skeptic-governance / architec…
Alignment Fine-Tuning (AFT)
Standard post-pretraining stage (SFT + RLHF) for installing values; shallow-alignment failure mode motivates Model Spec…
Anthropic Institute
Anthropic's policy/governance research arm; published *When AI builds itself* (Favaro & Clark, 2026) on recursive self-…
Anthropic Labs
Anthropic's internal incubator — a 'bet factory' of ~a dozen tiny teams exploring the model frontier with lean-startup…
Autonomous Scientific Discovery
Mythos-class models now conduct novel science with limited human input — autonomous protein/drug design (~10× faster, m…
Boris Cherny
Creator of Claude Code at Anthropic; phone-driven workflow with hundreds of agents; primary advocate of `/loop` primiti…
Capability-Gated Model Fallback
Fable 5's safeguard architecture: classifiers detect cyber / bio-chem / distillation queries and route the response to…
Cat Wu
Head of Product for Claude Code and Cowork at Anthropic; primary articulator of AI-native product cadence and engineer-…
Chloe Li
Lead author of MSM paper (arXiv 2605.02087); Anthropic Fellows Program; designed all specs and experiments
Claude Character as Product
Personality as load-bearing product surface; Amanda's role at Anthropic; lunchtime vibe-checks as eval discipline; the…
Claude Code
Anthropic's agentic coding product; created by Boris Cherny late 2024; TypeScript/React; CLI/desktop/web/mobile/IDE sur…
Claude's Constitution / Model Spec
Anthropic Model Spec / Constitution by Askell et al.; document specifying Claude's values + hard constraints (SP1–3, GP…
Claude Design
Anthropic Labs product (research preview, ~April 2026) for collaborating with Claude on polished visual artifacts — des…
Claude Fable 5
Anthropic's first generally-available Mythos-class model (June 2026) — state-of-the-art on nearly all benchmarks; the s…
Claude Mythos 5
The safeguards-lifted form of Claude Fable 5 (June 2026): same underlying Mythos-class model, deployed through Project…
Claude Opus 4.8
Anthropic's most capable general-access model (May 2026); upgrade on Opus 4.7 in SWE/agentic/knowledge work; does not a…
Compounding Data Moat
Anthropic's prescription for Scale-stage defensibility: time-locked behavioral fingerprint + domain-encoded edge cases…
Chain-of-Thought Monitorability
Korbak et al. 2025: chain-of-thought traces are a fragile monitor; direct CoT training compromises faithfulness; MSM of…
Cowork
Anthropic's non-code knowledge-work agent product; sibling to Claude Code; output is decks/inbox/dossiers; same MCP/com…
Dan Carey
Product Manager leading product within Anthropic Labs; led Claude Design; 'Designing with Claude' talk (May 2026); ~two…
Deliberative Alignment
Guan et al. 2025 (OpenAI): SFT on (prompt, CoT, response) tuples with spec-grounded CoT; strongest non-MSM baseline; ri…
Engineer PM Convergence
Generalists across disciplines; product taste as bottleneck skill; Anthropic Claude Code team as case study; "just do t…
Evals as Product Spec
Cat Wu's framing of evals as the emerging core PM skill: ten great evals beats a hundred mediocre; encode what done loo…
Fiona Fung
Leads engineering + product for Claude Code and Cowork at Anthropic (ex-Meta/Microsoft); "what served you prior may no…
Founder as Agent Orchestrator
Founder role shift: less individual contributor, more orchestrator of specialized AI assistants; non-technical founders…
Google DeepMind
Google's AI lab; built AlphaProof Nexus; Gemini models, AlphaProof, AlphaEvolve; opens the AI-for-mathematics domain in…
Learning to Co-Work with AI: A Software Engineer's Field Guide
Field guide for software engineers in the AI era: 6 skill clusters (taste, harness, alignment-first planning, agent-fri…
LLM-Driven Vulnerability Research
Claude Mythos Preview's emergent cybersecurity capabilities: autonomous zero-day discovery, full exploit chains, and An…
MCP and Computer Use
Anthropic's two complementary connector mechanisms: MCP for structured programmatic access (Salesforce/Drive/Gmail/Slac…
Entities — People, Orgs, Tools & Projects
Map of Content for all 32 entity pages. See Home for concept domains.
Model Spec Midtraining (MSM)
New training phase between pretrain and AFT: train base model on synthetic docs discussing the Model Spec; controls AFT…
Model Spec Science
Empirical study of which Model Spec features best generalize alignment; value explanations > rules alone, specific > ge…
Mythos Model
Anthropic preview-tier frontier model and the first member of the Mythos-class tier (above Opus); gated for safety, use…
Orchestration vs Employee Framing: Reconciling the Founder's Playbook with HBR's Accountability Evidence
Reconciles the Founder's Playbook orchestration framings with HBR Kropp et al.'s accountability evidence; "orchestratio…
OWASP
Open Worldwide Application Security Project; source of the agentic threat taxonomy cited throughout Anthropic's Zero Tr…
Responsible Scaling Policy Evaluations
Anthropic's RSP gates deployment on pre-release capability evaluations in CBRN, automated AI R&D, and high-stakes misal…
Synthetic Document Finetuning (SDF)
Wang et al. 2025 technique for modifying model beliefs via fine-tuning on synthetic documents; foundation that Model Sp…
Thariq Shihipar
Engineer on the Claude Code team at Anthropic; "HTML is the new markdown" and "compute allocator" framings; three HTML-…
Thinking Machines Lab
AI research lab behind interaction models (May 2026); harness-dissolves-into-model thesis; upstreamed streaming-session…
Zero Trust for AI Agents
Anthropic's security framework for deploying autonomous agents: trust nothing / verify everything / assume breach, appl…

Anthropic

資料來源#

摘要#

Products#

Models referenced in 2026 sources#

Internal structure (per Cat Wu)#

Cultural notes#

Notable events#

相關連結#

資料來源#