软件层的退场与重构:AI-Native 时代,你的产品下一个用户是另一个 Agent

The Retreat and Restructuring of the Software Layer: In the AI-Native Era, Your Next User Is Another Agent

Research #AI-Native#LAAS#MCP#Software 3.0#Karpathy#Agent#SaaS#本地AI#交互范式#产业转型
更新于
🇨🇳 中文

原文软件层的退场与重构(完整版 Google Doc)

结论先行(BLUF):软件层不是在”消失”,而是在被重构为 agent 的调度内核。三个命题必须分开讨论:代码不会消失(神经网络权重本身也是代码,只是不再由人手写);可见 GUI 与”以应用为单位”的交付正在解构(这正在发生);SaaS 的座席计费模式正被”成果计费”替代(争议最大,但已有早期实证)。


Software 3.0:意图取代功能按钮

Andrej Karpathy 在 2025 年 6 月 16 日的 YC AI Startup School 演讲中系统提出了 Software 3.0 框架:LLM 是新一代操作系统,上下文窗口即 RAM,模型权重即 CPU,提示词即编程。

“LLMs are a new kind of computer, and you program them in English.” —— Andrej Karpathy, YC AI Startup School 2025

最小交互单位从”功能按钮”升格为**“意图表达”**——小饭店老板拍张菜单照说一句话,直达精修海报成品;中间所有工具链(OCR、前端、API)被收敛进 LLM 调度内核,传统脚手架整体消失。

Karpathy 同时给出了对软件公司最重要的判断:

“LLMs are the new primary consumer/manipulator of digital information. Build for agents.” —— Andrej Karpathy, AI Startup School 2025


交互范式:六级台阶,L4 是过渡,L5 是终局

级别形态代表产品现状
L1App 图标 + 触屏点选iOS/Android存量主流
L2Chatbot + 单步工具调用ChatGPT plugins (2023)已普及
L3对话 + 生成式 UIClaude Artifacts, ChatGPT Canvas正在普及
L4Operator 代点鼠标Anthropic Computer Use, OpenAI OperatorDemo→早期落地
L5Agent ↔ 服务协议直连MCP / A2A / NLWeb事实标准形成期
L6意图执行 + 极简验证层愿景

关键判断:L4”截图+点击”只是过渡——一旦服务端普遍提供 MCP/A2A 接口,AI 就没必要再点鼠标。Stripe、Notion、Linear、Asana、Intercom 已发布官方 MCP server,绕过 GUI 直接对话。Anthropic 于 2024-11-25 发布 MCP,OpenAI、Google、Microsoft 在 2025 年 3–5 月相继原生支持,18 个月内成为事实标准。


本地 AI(LAAS):三层架构,不是纯本地

LAAS 的三个真问题成立:隐私与数据主权、低延迟与离线能力、token 经济成本。但更精确的答案不是”纯本地”,而是:

端侧小模型 + 可验证云(Apple PCC 模式)+ 协议互通

Apple Private Cloud Compute 是行业第一个把”云端隐私可证明”工程化的尝试——硬件用 Apple Silicon 服务器,运行时不存储用户数据,外部可远程验证。端侧能力边界(2026 年中):

  • 手机(iPhone 17 Pro 等):1B–4B 模型,摘要、改写、单步工具调用
  • 笔记本(MacBook Air M4 / Copilot+ PC):3B–8B,短链 agent、IDE 内补全
  • 工作站(Mac Studio M3 Ultra):30B–70B,接近 Claude Code 级体验

超长上下文(>1M tokens)、复杂 reasoning、多 agent 协作调度仍须云端。


商业模式转轨:成果计费正在发生

Foundation Capital 2024 年估算:全球服务市场约 2.4 万亿美元,软件市场约 4000 亿——即每 1 美元软件支出对应 6 美元服务支出。若 AI 能将服务相当部分”软件化”,市场重估将极其剧烈。

早期实证:

  • Sierra(CEO Bret Taylor):按”每解决一次客服对话”计费,而非座席数。详见 τ²-bench 方法论
  • Salesforce Agentforce:每次对话 2 美元;Benioff 公开宣布 2025 年全年停止招聘软件工程师
  • Cognition Labs Devin:以 ACU(Agent Compute Unit)计费

三条值得长期下注的方向

① 协议层公民:把核心能力暴露为 MCP server——这是 agent 时代的”SEO”,决定你的服务是否对 agent 可见。今天没有 MCP server,等于被 agent 时代”看不见”。参见 Google A2A Protocol

② 领域数据飞轮:通用模型同质化是必然,但”你拥有而别人没有的领域数据 + 持续生成的高质量行为 trace”,是 LAAS 时代真正的护城河。Anthropic Economic Index 显示计算机与数学类任务占 Claude 使用的 37.2%——领域专精仍有巨大空间。

③ 意图设计与品牌信任:技术执行力被 AI 抹平后,用户会把意图交给”他最信任的品牌”。品牌的角色将从”营销表层”重回”产品核心”——这是对”craftsmanship 不可替代”(DHH 的持续论点)的另一面诠释。


结语

软件不会消失,软件的”边界”会消失。当代码本身被工具化,“理解一个复杂系统、定义清楚要做什么、并对结果负责” 这件事比写代码本身更稀缺,也更有价值。

对软件从业者的实际建议:

  1. 立刻把核心能力暴露为 MCP server——你产品的下一个主用户是另一个 agent
  2. 把 PRD/Spec 当作一等代码工件管理,它将成为新研发流程的源头
  3. 新增 agent threat model:把 prompt injection、tool poisoning 加入安全审查清单(Invariant Labs 2025-04 MCP 安全报告
  4. 认真对待”Operator 类是过渡形态”——战略押注在”服务对 agent 友好”而非”AI 替人点鼠标”

主要参考

  • Karpathy, “Software Is Changing (Again)”, YC AI Startup School 2025-06-16 — 链接
  • Anthropic, Model Context Protocol 发布 2024-11-25 — 链接
  • Anthropic, “Building Effective Agents” 2024-12-19 — 链接
  • Apple, Private Cloud Compute 技术说明 2024-06 — 链接
  • Google, A2A Protocol 发布 2025-04-09 — 链接
  • Foundation Capital, “AI is leading a service as software paradigm shift” 2024-04-19 — 链接
  • METR, Measuring AI Ability to Complete Long Tasks 2025-03 — 链接
  • Simon Willison, “Not all AI-assisted programming is vibe coding” 2025-03-19 — 链接
  • Anthropic Economic Index 2025-02 — 链接
  • Geoffrey Litt, “Malleable Software in the Age of LLMs” 2023 — 链接
  • Maggie Appleton, “Home-Cooked Software and Barefoot Developers” 2024 — 链接
  • Sierra, τ²-bench — 链接
  • Apollo Research, Scheming Reasoning Evaluations 2024-12 — 链接
  • Invariant Labs, MCP Tool Poisoning Attacks 2025-04 — 链接

© 2026 Author: Mycelium Protocol. 本文采用 CC BY 4.0 授权——欢迎转载和引用,须注明作者姓名及原文链接,不得去除署名后以原创发布。

🇬🇧 English

BLUF: The software layer isn’t disappearing — it’s being restructured into an agent orchestration kernel. Three propositions must be separated: code won’t disappear (neural network weights are also code, just no longer hand-written by humans); visible GUI and app-as-delivery-unit are being deconstructed (this is happening now); SaaS seat-based billing is being replaced by outcome-based billing (most contested, but with early empirical proof).


Software 3.0: Intent Replaces Feature Buttons

Andrej Karpathy’s YC AI Startup School keynote (2025-06-16) systematically introduced the Software 3.0 framework: LLMs are the new OS — context window is RAM, model weights are CPU, prompting is programming.

The minimum interaction unit has upgraded from “feature button” to “intent expression” — a restaurant owner photographs a menu and says one sentence, directly producing a polished social media post; all intermediate tooling (OCR, frontend, APIs) collapses into the LLM orchestration kernel.

“LLMs are the new primary consumer/manipulator of digital information. Build for agents.”


Interaction Paradigms: Six Levels, L4 Is Transitional, L5 Is the Endgame

LevelFormRepresentative ProductsStatus
L1App icon grid + touchiOS/AndroidExisting mainstream
L2Chatbot + single tool callsChatGPT plugins (2023)Widespread
L3Chat + Generative UIClaude Artifacts, ChatGPT CanvasSpreading
L4Operator “click-for-me” agentsComputer Use, OpenAI OperatorDemo → early deployment
L5Agent ↔ Service protocol direct connectMCP / A2A / NLWebDe facto standard forming
L6Intent execution + minimal verificationVision

Key judgment: L4 “screenshot+click” is transitional — once services universally provide MCP/A2A interfaces, AI has no reason to click mice. Stripe, Notion, Linear, Asana, Intercom have already released official MCP servers, bypassing GUI for direct dialogue. Anthropic released MCP on 2024-11-25; OpenAI, Google, and Microsoft natively supported it within 18 months — a rare instance of an open standard achieving de facto monopoly so quickly.


Local AI (LAAS): Three-Layer Architecture, Not Pure Local

LAAS’s three genuine problems are valid: privacy/data sovereignty, low-latency/offline capability, unsustainable token economics. But the accurate answer isn’t “pure local” — it’s:

On-device small models + verifiable cloud (Apple PCC model) + protocol interoperability

Apple’s Private Cloud Compute is the industry’s first engineering attempt at “provably private cloud computation.” On-device capability ceilings (mid-2026):

  • Phone (iPhone 17 Pro): 1B–4B models, summarization, single-step tool calls
  • Laptop (MacBook Air M4): 3B–8B, short-chain agents, IDE completion
  • Workstation (Mac Studio M3 Ultra): 30B–70B, near Claude Code-level experience

Ultra-long context (>1M tokens), complex reasoning, and multi-agent orchestration still require cloud.


Business Model Transition: Outcome Billing Is Happening

Foundation Capital estimates: global services market ~$2.4T, software market ~$400B — every $1 software spend corresponds to $6 service spend. If AI can “software-ize” significant portions of services, market repricing will be extreme.

Early proof: Sierra (CEO Bret Taylor) bills per “resolved customer service conversation,” not per seat. Salesforce Agentforce charges $2 per conversation; Benioff publicly announced halting all software engineer hiring in 2025.


Three Long-Term Investment Directions

① Protocol-layer citizenship: Expose core capabilities as MCP servers — this is the “SEO” of the agent era. No MCP server today = invisible to agents tomorrow.

② Domain data flywheel: General models will commoditize. “Domain data you own that others don’t + high-quality behavior traces you continuously generate” is the moat of the LAAS era.

③ Intent design and brand trust: When technical execution is leveled by AI, users give their intent to the brand they trust most. Brand returns from “marketing surface” to “product core.”


Key References

  • Karpathy, YC AI Startup School 2025 — link
  • Anthropic, MCP release 2024-11 — link
  • Anthropic, “Building Effective Agents” — link
  • Apple, Private Cloud Compute — link
  • Google, A2A Protocol — link
  • Foundation Capital, “Service as Software” — link
  • METR, Long Task Measurement 2025-03 — link
  • Invariant Labs, MCP Tool Poisoning — link

© 2026 Author: Mycelium Protocol. Licensed under CC BY 4.0 — free to share and adapt with attribution. You must credit the author and link to the original; removing attribution and republishing as original is not permitted.

💬 评论与讨论

使用 GitHub 账号登录后发表评论