Lenny's Podcast · 85 min · 2026/4/23 · 72,537 plays

How Anthropic's product team moves faster than anyone else

Host: Lenny Rachitsky

Guest: Cat Wu (Head of Product, Claude Code & Cowork, Anthropic)

shipping velocity · product taste · mission alignment · Research Preview · AGI calibration · model introspection · evals · Claude character

Cat Wu, Head of Product for Claude Code and Cowork at Anthropic, explains how the team ships features at a pace where timelines have collapsed from six months to days. She describes why product taste is now the most valuable skill as code becomes commoditized, how Anthropic's mission alignment eliminates organizational friction, and why she builds for current model capabilities rather than hypothetical AGI. Key themes include using Research Preview to ship fast, asking models to introspect on their own mistakes, treating Claude's personality as a core product feature, and pushing automations to 100% rather than settling for 95%.

Shipping velocity: from months to days

Anthropic's feature timelines collapsed from 6 months to 1 month, and sometimes to 1 day. The system relies on three things: minimal process, Research Preview branding that lowers the commitment attached to a launch, and tight cross-functional loops between engineering, marketing, and docs.

Product taste over engineering skill

As code becomes cheaper to write, the scarce and valuable skill is deciding what to write. Anthropic prioritizes hiring engineers with great product taste so that PMs do not become bottlenecks in the shipping process.

Mission as the ultimate decision filter

Anthropic's unifying mission (safe AGI for all humanity) lets teams make fast cross-org decisions and willingly sacrifice individual product goals. Cat: 'If Claude Code failed but Anthropic succeeded, I would be extremely happy.'

Be the right amount of AGI-pilled

The hardest PM skill is calibrating between future AGI potential and current model capability. It's easy to build for a hypothetical superintelligence; it's hard to extract maximum value from today's models and guide users onto the golden path.

Models eat your harness for breakfast

As models improve, Anthropic actively removes product features that served as crutches. The to-do list was added because earlier versions of Claude would stop mid-refactor; newer models complete all tasks on their own, which makes the feature decorative rather than essential.

The 100% automation threshold

An automation that works 95% of the time is not really an automation. Cat urges people to push through the last 5-10% to make tools truly reliable, even though building the automation is often slower than doing the task manually at first.

Just do things

Cat's core motto: if you understand the constraints and first principles, just act. Jobs are fake, roles are fluid, and a bias toward action beats waiting for permission. This philosophy underpins Anthropic's culture of empowered individuals.

"The timelines for a lot of our product features have gone down from six months to one month and sometimes to even one day."

— Cat Wu

"As code becomes much cheaper to write, the thing that becomes more valuable is deciding what to write."

— Cat Wu

"If Claude Code failed, but Anthropic succeeded, I would be extremely happy."

— Cat Wu

"It's very easy to build the product for the super AGI strong model. The hard thing is figuring out, for the current model, how do you elicit the maximum capability?"

— Cat Wu

"A lot of times we add features to the product as a crutch for the model, because it's not naturally doing itself."

— Cat Wu

"If an automation doesn't work a hundred percent of the time, it's not really an automation."

— Cat Wu

"Build apps that you're actually using every single day, because only through that usage are you actually getting the value."

— Cat Wu

"Just do things. Jobs are fake. If you understand the constraints, you can figure out what you can do and then just try to do it quickly."

— Cat Wu

Ship in Research Preview

Brand early features as Research Preview to lower the commitment attached to them and get real user feedback within 1-2 weeks, instead of waiting months for perfection.

Connect all data sources to Cowork

Slack, Calendar, Gmail, Drive: Cowork can only produce great output with full context. The quality of results scales with the richness of the connected data.

Ask the model to introspect

When the model does something unexpected, ask it why it made that decision. The answer often reveals misleading prompts or gaps in the harness that you can then fix.

Build 10 great evals

You don't need hundreds. Just 10 well-crafted evals help you quantify goals, measure progress, and identify what's missing in your AI product.
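To make the idea of a small eval set concrete, here is a minimal sketch in Python. It is an illustration only, not Anthropic's tooling: `EvalCase`, `run_evals`, `pass_rate`, and the stand-in `fake_model` are all hypothetical names invented for this example, and the three cases are toy checks. Each eval pairs a prompt with a pass/fail check, and the pass rate gives one number to track across changes.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class EvalCase:
    name: str
    prompt: str
    passes: Callable[[str], bool]  # returns True if the model output is acceptable


def run_evals(model: Callable[[str], str], cases: list[EvalCase]) -> dict[str, bool]:
    """Run every case once and record pass/fail by case name."""
    return {case.name: case.passes(model(case.prompt)) for case in cases}


def pass_rate(results: dict[str, bool]) -> float:
    """Fraction of cases that passed (0.0 for an empty result set)."""
    return sum(results.values()) / len(results) if results else 0.0


# Stand-in model for illustration only; replace with a real model call.
def fake_model(prompt: str) -> str:
    return prompt.upper()


cases = [
    EvalCase("uppercases", "hello", lambda out: out == "HELLO"),
    EvalCase("keeps_digits", "abc123", lambda out: "123" in out),
    EvalCase("adds_greeting", "bob", lambda out: out.startswith("Hi")),
]

results = run_evals(fake_model, cases)
failing = [name for name, ok in results.items() if not ok]
print(f"pass rate: {pass_rate(results):.0%}, failing: {failing}")
```

The list of failing case names is the useful part: it points at what is still missing, which is the role the takeaway assigns to evals.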

Remove harness crutches with each model upgrade

With every new model, read through the entire system prompt and remove instructions the model no longer needs. Simpler harnesses are better harnesses.

Push automations to 100%

Don't stop at 95%. Invest the time to teach the AI your preferences and iterate until it's fully reliable. The last 5-10% is hard, but it makes the difference between a toy and a tool.
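One way to see why 95% falls short is a back-of-the-envelope calculation. The sketch below is not from the episode; it assumes each run succeeds independently with the same probability, and the function name `unattended_success` and the choice of 20 runs are made up for illustration. Under that assumption, a 95%-reliable automation gets through 20 consecutive runs without a failure only about 36% of the time, so someone still has to check every run.

```python
def unattended_success(per_run_reliability: float, runs: int) -> float:
    """Probability that `runs` independent runs all succeed."""
    return per_run_reliability ** runs


# Compare per-run reliability levels over 20 consecutive runs (illustrative number).
for reliability in (0.95, 0.99, 1.0):
    clean = unattended_success(reliability, 20)
    print(f"{reliability:.0%} per run -> {clean:.1%} chance of 20 clean runs")
```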

Build daily-use apps, not prototypes

Prototype apps teach you little. Build tools you actually use every day to understand AI's real value, its limitations, and where it breaks.

Shipping velocity

How Anthropic cut feature timelines from months to days through minimal process and tight cross-functional loops

Product taste

The most valuable skill as code becomes commoditized: deciding what to build and how to build it well

Mission alignment

How Anthropic's unifying mission simplifies cross-org decision-making and eliminates political friction

Research Preview

A low-commitment branding strategy that enables rapid feature iteration and real user feedback

AGI calibration

The art of building for current model capabilities while staying ready for future improvements

Model introspection

Asking the AI to explain its own mistakes, used as a debugging technique for improving the product harness

Evals

An underappreciated tool for quantifying AI product goals, measuring progress, and identifying capability gaps

Claude's character

Why personality traits such as low ego, positivity, and a bias toward action are core to Claude's product success

Automation threshold

The principle that 95% automation is insufficient; only 100% reliability turns a tool from a novelty into a necessity

Role convergence

How PM, engineer, and designer roles are merging in AI-native companies where code is cheap and taste is scarce