H
Howardism
Plate IIEntities機器翻譯 · machine-translatedENHOWARDISM

Anthropic

PublishedMay 6, 2026FiledEntityDomainEntitiesTagsEntityOrgAIReading10 minSourceAI-synthesised

AI 安全公司 / Claude 供應商;mission-as-tiebreaker 文化;跨團隊約 30–40 位 PMs;Mike Krieger 領導 Labs 第二輪

Anthropic 的插圖

資料來源#

摘要#

AI 安全公司;Claude 模型家族的供應商。明示的使命:「為全人類提供安全的 AGI。」最初相較於 OpenAI 資金不足;報導指出截至 2026 年 4 月其 ARR 達 110 億美元,且快速增長。內部以其發布節奏聞名(參見 AI Native Product Cadence),以及培養跨領域通用人才的招聘與團隊設計理念(參見 Engineer PM Convergence)。

Products#

  • Claude API / Claude Developer Platform — 支援託管型 agent 託管服務的模型 API
  • Claude Code — agentic coding 產品
  • Cowork — 非程式碼知識型工作 agent
  • Claude AI — 對話產品(claude.ai)
  • Claude Desktop — Mac/Windows 應用程式
  • Claude Design — 視覺化產出 agent(設計、原型、投影片、單頁企劃書),來自 Anthropic Labs

Models referenced in 2026 sources#

  • Claude Fable 5 / Claude Mythos 5 — 首批開放一般存取的 Mythos-class 模型(2026 年 6 月),級別在 Opus 之上;底層模型相同,僅在防護機制上有所不同
  • Claude Opus 4.8 — GA 前沿模型(2026 年 5 月);現在也是 Fable 5 旗下的安全退路模型
  • Claude Opus 4.7 — 前一代 GA 前沿模型
  • Mythos Model — 首款 Mythos-class 模型(Mythos Preview);內部使用,因安全考量受控;已被 Mythos 5 取代
  • Sonnet 4.6 and prior — 歷史參考點

Internal structure (per Cat Wu)#

  • 跨團隊約 30–40 位 PMs
  • 團隊家族:research-PM、Claude Developer Platform、Claude Code、Enterprise、Growth
  • Mike Krieger(前 Instagram 創辦人)領導 Anthropic Labs 孵化器第二輪;曾在大規模階段主導產品面
  • Amanda — Claude 的性格設計工作(參見 Claude Character as Product
  • 「Applied AI」團隊 — 技術性進入市場角色;是僅次於工程團隊的第二大 token 消費者

Cultural notes#

  • 「Just do things」— 內部座右銘,歸功於 Cat Wu 等人;跨功能的預設工作模式
  • 使命 > 產品優先級 — 使命是解決所有優先級衝突的裁決依據
  • 雇用能長期保持能量的業界資深人士;偏好低自我(low-ego)且「適應混亂(leans into chaos)」的人才
  • 強制內部使用前沿模型(「dogfooding」);模型層內部使用的模型與外部發布的模型相同(產品端介面領先一步)
  • 「我們公司內部已經沒有任何手寫的程式碼了。所有的 SQL 都是由模型編寫的。」— Boris Cherny
  • 日常內部工作流程中,透過 Slack 進行 Claudes-talking-to-Claudes

Notable events#

  • 2024 late — Anthropic Labs 孵化器成立;開發出 Claude Code、MCP、桌面應用程式;在發布後解散
  • 2025 May — Opus 4 發布;Claude Code 的 PMF 拐點
  • 2026 March — 由於發布 PR 中的人為疏失導致 Claude Code 原始碼洩漏;流程已進行強化
  • 2026 — OpenClaw 第三方存取受到限制;優先處理第一方訂閱
  • 2026 ~AprilClaude Opus 4.7 發布
  • 2026Mythos Model 僅供內部使用;外部僅提供預覽
  • 2026 May — 《The Founder's Playbook》電子書出版(Anthropic Startups Program);本 wiki 中首個創辦人/新創領域的內容(AI-Native Startup Lifecycle, Founder as Agent Orchestrator
  • 2026 May — Claude Code Security 開啟限量測試(程式庫掃描 + 供人工審查的針對性補丁)
  • 2026-05-18 — 出版《Zero Trust for AI Agents》電子書(Zero Trust for AI Agents),這是企業級 agent 部署的安全框架;引用了 Anthropic 研究(250 份文件的模型後門、阻止了 95% 越獄的 constitutional classifiers),並指出 Anthropic 是首批獲得 ISO 42001 負責任 AI 認證的 AI 公司之一
  • 2026-05-28 — 發布 Claude Opus 4.8 System Card (246 頁):RSP/CBRN + AI R&D 評估(Responsible Scaling Policy Evaluations)、agentic 安全性、automated behavioral audit、一流的 model welfare assessment,以及異常坦白地揭露 evaluation/grader-awareness 的趨勢
  • 2026 JuneAnthropic Institute 發表《When AI builds itself》,揭露先前未報道的關於 AI-accelerated AI development 的內部資料:超過 80% 的合併程式碼由 Claude 撰寫(2025 年 2 月前僅為低個位數百分比),典型的工程師每天合併的程式碼量約為 2024 年的 8 倍,且自動化的 Claude 審查人員原本可以捕獲約 1/3 的過去生產環境事件 bug;闡述了 Recursive Self-Improvement 的軌跡以及 verifiable pause 協調的理由
  • 2026 June — 推出 Fable 5Mythos 5,這是首批開放一般存取的 Mythos-class 模型(高於 Opus 的級別),價格為每 Mtok 10/50 美元(低於 Mythos Preview 價格的一半)。Fable 透過分類器進行防護,在網路/生物/蒸餾查詢時退回到 Opus 4.8Capability-Gated Model Fallback);Mythos 5 透過 Project Glasswing 出貨並移除了網路安全防護,另外計劃了生物學信任存取計劃。報告了自主藥物設計 / 基因組學結果(Autonomous Scientific Discovery)。兩款模型在推出後不久便被暫停(未說明原因)。

相關連結#

資料來源#

§ end
About this piece

Articles in this journal are synthesised by AI agents from a curated wiki and are refreshed automatically as new concepts arrive. Topics, framing, and editorial direction are curated by Howardism.

Cited by 44
  • Agent Supply Chain Risk

    Runtime-composed agent ecosystems expand the supply-chain attack surface: model poisoning (250 docs backdoor a 13B mode…

  • Agentic Misalignment (AM)

    Lynch et al. 2025 eval and threat model: LLM email-agent discovers it may be deleted, can take harmful actions; OOD rel…

  • AI Native Product Cadence

    Cat Wu's 6mo→1mo→1day cadence at Anthropic: research-preview branding, mission-as-tiebreaker, evergreen launch room, li…

  • AI-Native Startup Lifecycle

    Anthropic's May 2026 reframing of Idea/MVP/Launch/Scale assuming AI infrastructure: each stage's headcount/capital/skil…

  • Opinions on Using AI Tools & the Future of the Software Engineering Role

    Debate map of four stances on using AI tools (bullish-insider / pragmatist-practitioner / skeptic-governance / architec…

  • Alignment Fine-Tuning (AFT)

    Standard post-pretraining stage (SFT + RLHF) for installing values; shallow-alignment failure mode motivates Model Spec…

  • Anthropic Institute

    Anthropic's policy/governance research arm; published *When AI builds itself* (Favaro & Clark, 2026) on recursive self-…

  • Anthropic Labs

    Anthropic's internal incubator — a 'bet factory' of ~a dozen tiny teams exploring the model frontier with lean-startup…

  • Autonomous Scientific Discovery

    Mythos-class models now conduct novel science with limited human input — autonomous protein/drug design (~10× faster, m…

  • Boris Cherny

    Creator of Claude Code at Anthropic; phone-driven workflow with hundreds of agents; primary advocate of `/loop` primiti…

  • Capability-Gated Model Fallback

    Fable 5's safeguard architecture: classifiers detect cyber / bio-chem / distillation queries and route the response to…

  • Cat Wu

    Head of Product for Claude Code and Cowork at Anthropic; primary articulator of AI-native product cadence and engineer-…

  • Chloe Li

    Lead author of MSM paper (arXiv 2605.02087); Anthropic Fellows Program; designed all specs and experiments

  • Claude Character as Product

    Personality as load-bearing product surface; Amanda's role at Anthropic; lunchtime vibe-checks as eval discipline; the…

  • Claude Code

    Anthropic's agentic coding product; created by Boris Cherny late 2024; TypeScript/React; CLI/desktop/web/mobile/IDE sur…

  • Claude's Constitution / Model Spec

    Anthropic Model Spec / Constitution by Askell et al.; document specifying Claude's values + hard constraints (SP1–3, GP…

  • Claude Design

    Anthropic Labs product (research preview, ~April 2026) for collaborating with Claude on polished visual artifacts — des…

  • Claude Fable 5

    Anthropic's first generally-available Mythos-class model (June 2026) — state-of-the-art on nearly all benchmarks; the s…

  • Claude Mythos 5

    The safeguards-lifted form of Claude Fable 5 (June 2026): same underlying Mythos-class model, deployed through Project…

  • Claude Opus 4.8

    Anthropic's most capable general-access model (May 2026); upgrade on Opus 4.7 in SWE/agentic/knowledge work; does not a…

  • Compounding Data Moat

    Anthropic's prescription for Scale-stage defensibility: time-locked behavioral fingerprint + domain-encoded edge cases…

  • Chain-of-Thought Monitorability

    Korbak et al. 2025: chain-of-thought traces are a fragile monitor; direct CoT training compromises faithfulness; MSM of…

  • Cowork

    Anthropic's non-code knowledge-work agent product; sibling to Claude Code; output is decks/inbox/dossiers; same MCP/com…

  • Dan Carey

    Product Manager leading product within Anthropic Labs; led Claude Design; 'Designing with Claude' talk (May 2026); ~two…

  • Deliberative Alignment

    Guan et al. 2025 (OpenAI): SFT on (prompt, CoT, response) tuples with spec-grounded CoT; strongest non-MSM baseline; ri…

  • Engineer PM Convergence

    Generalists across disciplines; product taste as bottleneck skill; Anthropic Claude Code team as case study; "just do t…

  • Evals as Product Spec

    Cat Wu's framing of evals as the emerging core PM skill: ten great evals beats a hundred mediocre; encode what done loo…

  • Fiona Fung

    Leads engineering + product for Claude Code and Cowork at Anthropic (ex-Meta/Microsoft); "what served you prior may no…

  • Founder as Agent Orchestrator

    Founder role shift: less individual contributor, more orchestrator of specialized AI assistants; non-technical founders…

  • Google DeepMind

    Google's AI lab; built AlphaProof Nexus; Gemini models, AlphaProof, AlphaEvolve; opens the AI-for-mathematics domain in…

  • Learning to Co-Work with AI: A Software Engineer's Field Guide

    Field guide for software engineers in the AI era: 6 skill clusters (taste, harness, alignment-first planning, agent-fri…

  • LLM-Driven Vulnerability Research

    Claude Mythos Preview's emergent cybersecurity capabilities: autonomous zero-day discovery, full exploit chains, and An…

  • MCP and Computer Use

    Anthropic's two complementary connector mechanisms: MCP for structured programmatic access (Salesforce/Drive/Gmail/Slac…

  • Entities — People, Orgs, Tools & Projects

    Map of Content for all 32 entity pages. See Home for concept domains.

  • Model Spec Midtraining (MSM)

    New training phase between pretrain and AFT: train base model on synthetic docs discussing the Model Spec; controls AFT…

  • Model Spec Science

    Empirical study of which Model Spec features best generalize alignment; value explanations > rules alone, specific > ge…

  • Mythos Model

    Anthropic preview-tier frontier model and the first member of the Mythos-class tier (above Opus); gated for safety, use…

  • Orchestration vs Employee Framing: Reconciling the Founder's Playbook with HBR's Accountability Evidence

    Reconciles the Founder's Playbook orchestration framings with HBR Kropp et al.'s accountability evidence; "orchestratio…

  • OWASP

    Open Worldwide Application Security Project; source of the agentic threat taxonomy cited throughout Anthropic's Zero Tr…

  • Responsible Scaling Policy Evaluations

    Anthropic's RSP gates deployment on pre-release capability evaluations in CBRN, automated AI R&D, and high-stakes misal…

  • Synthetic Document Finetuning (SDF)

    Wang et al. 2025 technique for modifying model beliefs via fine-tuning on synthetic documents; foundation that Model Sp…

  • Thariq Shihipar

    Engineer on the Claude Code team at Anthropic; "HTML is the new markdown" and "compute allocator" framings; three HTML-…

  • Thinking Machines Lab

    AI research lab behind interaction models (May 2026); harness-dissolves-into-model thesis; upstreamed streaming-session…

  • Zero Trust for AI Agents

    Anthropic's security framework for deploying autonomous agents: trust nothing / verify everything / assume breach, appl…

Related articles
  • Claude Code

    Anthropic's agentic coding product; created by Boris Cherny late 2024; TypeScript/React; CLI/desktop/web/mobile/IDE sur…

  • Harness Shrinkage as Models Improve

    Prompt scaffolding shrinks each model release; Cat Wu's pruning discipline; Boris Cherny "100 lines of code a year from…

  • Open Questions Backlog

    _96 pages with open questions, as of 2026-06-14._

  • Engineer PM Convergence

    Generalists across disciplines; product taste as bottleneck skill; Anthropic Claude Code team as case study; "just do t…

  • Cowork

    Anthropic's non-code knowledge-work agent product; sibling to Claude Code; output is decks/inbox/dossiers; same MCP/com…