X List 每小时精选 | 2026-05-23 00:00

生成时间：2026-05-23 00:00 (Asia/Shanghai)
筛选范围：最新 50 条推文
精选推文数：12 篇高质量内容

精选推文 2: Greg Brockman - 模型不再是产品

作者: Greg Brockman (@gdb) ✅
发布时间: 2026-05-22 11:51 (北京时间)
互动数据: ❤️ 4,912 | 🔄 296 | 💬 517 | 👁️ 426,350

推文原文:

the model alone is no longer the product

核心要点:
OpenAI 联创 Greg Brockman 的简短推文引发了 AI 行业范式转变的思考：当模型能力趋于同质化，真正的产品价值将转向应用层、用户体验和垂直场景的深度整合。

灵感启发:

基础模型正在变成"水电煤"一样的通用基础设施
差异化竞争将聚焦在编排层、Agent 框架、用户交互
这解释了为什么 OpenAI 在大力推 ChatGPT 和 API 生态

可实践建议:
如果你正在做 AI 应用，不要把核心竞争力押在模型调用上，而是思考如何构建独特的数据飞轮、用户工作流或垂直场景理解。

社交媒体文案:

即刻版:
OpenAI联创一句话点破行业真相：“模型本身不再是产品” 🤯 当GPT-4、Claude、Gemini差距越来越小，真正的战场在应用层、在用户体验、在垂直场景的深度整合。做AI应用的创业者们，该重新思考护城河了 💡 #OpenAI #AI产品 #创业思考

参考链接: https://x.com/gdb/status/2057670776803996110

Twitter/X版:
“The model alone is no longer the product” - Greg Brockman 这句话道出了AI行业的范式转移。基础模型正在商品化，真正的价值在应用层 🔮 #OpenAI #AI #ProductStrategy

参考链接: https://x.com/gdb/status/2057670776803996110

精选推文 3: llama.cpp WebGPU 后端发布

作者: Georgi Gerganov (@ggerganov) ✅
发布时间: 2026-05-22 11:42 (北京时间)
互动数据: ❤️ 367 | 🔄 69 | 💬 9 | 👁️ 32,765

推文原文:

Highlighting the new WebGPU backend in llama.cpp/ggml

The work to bring full-fledged WebGPU support in llama.cpp started about an year and a half ago. It has been lead by @reeselevine and team at USCS.

For more information, checkout the interactive blog and paper in the quoted post. Here are 2 excerpts from the paper, summarizing the implemented software architecture.

核心要点:
llama.cpp 正式支持 WebGPU 后端，历时一年半开发完成。这意味着可以在浏览器中直接运行本地大模型，利用 GPU 加速，数据完全不出本地。

灵感启发:

浏览器端 LLM 推理将开启新的应用形态
隐私保护+端侧推理的组合对 B2B 场景极具吸引力
WebGPU 的普及将缩小 Web 与 Native 的性能差距

可实践建议:
关注 llama.cpp WebGPU 的进展，对于需要数据隐私保护的场景（如医疗、金融），浏览器端本地推理将成为可行方案。

社交媒体文案:

即刻版:
重大突破！llama.cpp 正式支持 WebGPU 🚀 这意味着你可以在浏览器里直接跑本地大模型，GPU加速且数据不出本机！历时一年半开发，由UCSC团队主导。浏览器端AI推理的时代要来了 🔥 #llamacpp #WebGPU #边缘计算 #AI

参考链接: https://x.com/ggerganov/status/2057668450076520811

Twitter/X版:
llama.cpp now supports WebGPU! Run local LLMs in your browser with GPU acceleration. No data leaves your computer 🌐🤖 #WebGPU #llamacpp #EdgeAI

参考链接: https://x.com/ggerganov/status/2057668450076520811

精选推文 4: AI 补贴时代结束？

作者: Daniel Jeffries (@Dan_Jeffries1) ✅
发布时间: 2026-05-22 13:54 (北京时间)
互动数据: ❤️ 51 | 🔄 11 | 💬 8 | 👁️ 14,334

推文原文:

Closed and gated surveillance economy SaaS systems and models will get replaced by open source in two tiers:

The models themselves

The everything-ultra-app harness

If an American open source champion does not emerge, the world will run on whoever produces the best open models and open software that is strong, stable, affordable, customizable, expandable and adaptable to both individual and business needs.

We don’t need another WinTel dynasty in the agent era, or an everything is “free” so we can monitor you, censor you and spy on you era.

As AI becomes our interface to the world it needs to be open and people powered.

核心要点:
转发评论指出：Microsoft 因 token 计费成本过高取消了内部 Claude Code 许可证，Uber CTO 警告 AI 预算在4个月内耗尽。AI 补贴时代正在结束，开源模型和软件将成为替代方案。

灵感启发:

Token-based 定价让企业开始正视 AI 的真实成本
开源生态可能因成本压力而加速发展
闭源 SaaS 的"免费"模式面临可持续性挑战

可实践建议:
企业应开始评估开源模型的 TCO（总拥有成本），并建立成本监控机制，避免预算失控。

社交媒体文案:

即刻版:
AI补贴时代要结束了？Microsoft都嫌Claude Code太贵取消内部许可，Uber的AI预算4个月烧光 😱 Token计费让企业开始正视AI的真实成本。Daniel Jeffries说得对：我们需要开源的AI生态，而不是另一个监控资本主义时代的WinTel王朝 🔓 #AI成本 #开源 #科技趋势

参考链接: https://x.com/Dan_Jeffries1/status/2057701696298471566

Twitter/X版:
The AI subsidy era is ending. Microsoft canceled Claude Code licenses due to token costs. Time to seriously evaluate open-source alternatives 🔓💰 #AI #OpenSource #CostOptimization

参考链接: https://x.com/Dan_Jeffries1/status/2057701696298471566

精选推文 5: AGI 是否已经存在？

作者: nic carter (@nic_carter) ✅
发布时间: 2026-05-22 21:14 (北京时间)
互动数据: ❤️ 286 | 🔄 27 | 💬 76 | 👁️ 18,132

推文原文:

The “it’s not AGI because machine intelligence is jagged” is dumb cope.

It’s obviously AGI. If you had a friend who had a 130 IQ, could write production code flawlessly, could write academic papers of a high research caliber, pass any exam in any field with flying colors, create a sophisticate LBO model, draw technical diagrams perfectly, compose poetry in any language, and could find solutions to significant unsolved mathematical problems, you would call that person a world historical genius…

核心要点:
一篇长文论证 AGI 已经存在：如果以人类标准来看，能写代码、发论文、解数学题的 AI 无疑具备通用智能。“锯齿状智能"的批评是双重标准。

灵感启发:

AGI 的定义之争本质是哲学问题：什么是"通用”？
人类对 AI 的标准往往高于对人类的标准
智能 vs 意识/自主性需要区分

可实践建议:
与其纠结 AGI 定义，不如关注 AI 在具体任务上的实际能力边界，以及如何安全有效地利用这些能力。

社交媒体文案:

即刻版:
AGI已经存在了吗？nic carter这篇长文说得挺透彻 🤔 如果一个朋友智商130、能写代码发论文解数学题，我们肯定叫他天才。但对AI我们就要求"不能犯错"、“要有创造力”。这双重标准是不是有点问题？不管怎样，AI的能力边界在快速扩展，这比争论定义更重要 🧠 #AGI #AI思考

参考链接: https://x.com/nic_carter/status/2057812428528685142

Twitter/X版:
“AGI obviously exists and has existed for a while” - provocative take on why we apply double standards to AI vs human intelligence. Worth a read 🤔 #AGI #AI

参考链接: https://x.com/nic_carter/status/2057812428528685142

精选推文 6: Vector Policy Optimization (VPO) - RL 新突破

作者: Ryan Bahlous-Boldi (@RyanBoldi) / MIT NLP (@nlp_mit) ✅
发布时间: 2026-05-22 23:33 (北京时间)
互动数据: ❤️ 43 | 🔄 12 | 💬 4 | 👁️ 1,849

推文原文:

Your RL post-training may be sabotaging your LLM’s test-time scaling!

Conventional RL pretends that you can collapse all reward signals upfront into a single scalar reward.
We introduce Vector Policy Optimization (VPO), which natively maximizes vector-valued rewards, boosting test time search performance, even on the original scalar.

核心要点:
MIT 提出 Vector Policy Optimization (VPO)，突破传统 RL 将多维度奖励压缩为单一标量的局限，直接优化向量值奖励，显著提升测试时的搜索性能。

灵感启发:

人类决策本就涉及多目标权衡，单一奖励函数过于简化
向量奖励可能让模型学会更 nuanced 的推理策略
这对 test-time scaling 有重要意义

可实践建议:
关注 VPO 论文和代码发布，如果你在训练 RLHF 或 post-training 模型，考虑多目标优化方法。

社交媒体文案:

即刻版:
MIT新研究VPO有点意思 🧠 传统RL把多维奖励压成单一数字，但真实世界的偏好本来就是多维的！Vector Policy Optimization直接优化向量奖励，让LLM在测试时的搜索能力更强。这可能是post-training的新方向 🔬 #强化学习 #LLM #AI研究

参考链接: https://x.com/RyanBoldi/status/2057847412819906658

Twitter/X版:
Vector Policy Optimization (VPO) from MIT: stop collapsing multi-dimensional rewards into a scalar. Native vector-valued rewards boost test-time search performance 📊 #RL #LLM #MachineLearning

参考链接: https://x.com/RyanBoldi/status/2057847412819906658

精选推文 7: DeepSeek 降价背后的真相

作者: Teortaxes (@teortaxesTex) ✅
发布时间: 2026-05-22 21:35 (北京时间)
互动数据: ❤️ 111 | 🔄 2 | 💬 12 | 👁️ 7,834

推文原文:

So… how long did it even work at “original price”? 3 days? 2?
“promotional discount ends with the new Official Full Price equal to the discount price”. Only at DeepSeek.

核心要点:
DeepSeek V4-Pro 的"原价"只维持了几天，促销折扣结束后，新的"官方全价"直接等于折扣价。这是典型的中国式互联网打法：先高价锚定，再永久低价占领市场。

灵感启发:

定价策略也是产品策略的一部分
国产模型在成本控制和工程优化上确实领先
价格战将加速行业洗牌

可实践建议:
对于依赖大模型 API 的产品，建议多模型备份策略，不要把鸡蛋放在一个篮子里。

社交媒体文案:

即刻版:
DeepSeek这手玩得妙啊 😏 “原价"只卖了3天，促销结束后的"新官方全价"直接等于折扣价。输入3元/百万token，这定价策略教科书级别。国产大模型不仅在技术上追赶，商业打法也越来越成熟 💰 #DeepSeek #商业策略 #AI

参考链接: https://x.com/teortaxesTex/status/2057817507386306744

Twitter/X版:
DeepSeek’s pricing strategy: “original price” lasted 3 days, then the “new official full price” = the discounted price. Clever market positioning 💡 #DeepSeek #PricingStrategy

参考链接: https://x.com/teortaxesTex/status/2057817507386306744

精选推文 8: Yohei 发布首篇论文 - “The Log is the Agent”

作者: Yohei (@yoheinakajima) ✅
发布时间: 2026-05-22 21:43 (北京时间)
互动数据: ❤️ 6 | 🔄 1 | 💬 0 | 👁️ 855

推文原文:

@gdb @jonathanrlarkin just dropped a relevant paper :)

引用推文:

babyagi has ~200 citations, but 0 papers… i just published my first paper on arXiv 😆

“The Log is the Agent: Event-Sourced Reactive Graphs for Auditable, Forkable Agentic Systems”

the case for agents that coordinate through persistent replayable state — no conversation loops, no workflows, no A2A — with auditability, forking, and causal lineage built in.

核心要点:
BabyAGI 作者 Yohei 发布首篇论文，提出"日志即 Agent"架构：通过持久化、可重放的状态（Event-Sourced）实现 Agent 协调，无需对话循环、工作流或 A2A 协议，内置可审计、分叉和因果追溯能力。

灵感启发:

Agent 架构正在从"对话式"向"事件溯源式"演进
可审计性和因果追溯对生产环境至关重要
这可能是 Agent 系统架构的新范式

可实践建议:
如果你正在设计 Agent 系统，推荐阅读这篇论文，考虑事件溯源架构对可维护性和可扩展性的长期价值。

社交媒体文案:

即刻版:
BabyAGI作者Yohei发论文了！🎉 “The Log is the Agent” - 提出用事件溯源架构替代对话循环和工作流，Agent通过持久化状态协调，内置可审计和因果追溯。这可能是Agent系统架构的新方向 📜 #BabyAGI #Agent架构 #论文

参考链接: https://x.com/yoheinakajima/status/2057819568488501542

Twitter/X版:
BabyAGI’s Yohei published his first paper: “The Log is the Agent” - Event-sourced reactive graphs for auditable, forkable agentic systems. A new architectural paradigm 📝 #Agent #Architecture #Research

参考链接: https://x.com/yoheinakajima/status/2057819568488501542

精选推文 9: AI 与宗教信仰的关联研究

作者: Tim Soret (@timsoret) ✅
发布时间: 2026-05-22 22:45 (北京时间)
互动数据: ❤️ 8 | 🔄 0 | 💬 2 | 👁️ 835

推文原文:

Love this. It’s one thing I got quite wrong in my early extrapolations. Already adjusted in The Last Night btw. There is a location & several characters designed to explore this topic.

引用推文:

In one of the largest religion surveys ever conducted, perhaps surprisingly, people who are actively religious are the ones who have the highest confidence in AI right now

核心要点:
一项大规模宗教调查显示，积极参与宗教活动的人对 AI 的信心最高。这可能与宗教群体对"非人类智能"的接受度更高有关。

灵感启发:

AI 接受度与宗教/文化背景可能存在相关性
技术扩散需要考虑文化因素
游戏《The Last Night》正在探索这一主题

可实践建议:
如果你在做 AI 产品的用户研究，可以考虑宗教/文化背景作为细分维度。

社交媒体文案:

即刻版:
有趣的研究发现 🧐 大规模调查显示，积极参与宗教活动的人对AI的信心最高！这可能与宗教背景让人更容易接受"非人类智能"有关。Tim Soret的游戏《The Last Night》也在探索这个主题 🎮 #AI研究 #宗教 #文化

参考链接: https://x.com/timsoret/status/2057835129419047075

Twitter/X版:
Surprising finding: people who are actively religious have the highest confidence in AI. Cultural acceptance of non-human intelligence? 🤔 #AI #Research #Culture

参考链接: https://x.com/timsoret/status/2057835129419047075

精选推文 10: Bryan Johnson 的 N=2 实验

作者: Tim Soret (@timsoret) ✅
发布时间: 2026-05-22 22:02 (北京时间)
互动数据: ❤️ 19 | 🔄 0 | 💬 1 | 👁️ 1,171

推文原文:

The fact that Bryan & Kate have to do this on their own, that no government or company in the world has invested just a few millions / year to optimize & study the health of a few volunteers in order to advance humanity is crazy.

It’s such an obvious low-hanging fruit.

核心要点:
评论 Bryan Johnson 启动的"女性版 Blueprint"项目：Kate Tolo 将成为史上被测量最全面的女性，投入200万美元/年，测试女性专属的长寿方案。令人惊讶的是，没有政府或大公司愿意投资这类研究。

灵感启发:

N=1/N=2 医学研究正在填补传统 RCT 的空白
女性健康研究长期被忽视（FDA曾禁止女性参与临床试验16年）
个人数据驱动的健康管理可能是未来趋势

可实践建议:
关注 Bryan Johnson 和 Kate Tolo 的实验数据发布，这可能是预防医学和长寿研究的宝贵资源。

社交媒体文案:

即刻版:
Bryan Johnson的"女性版Blueprint"启动了！Kate Tolo将成为史上被测量最全面的女性 📊 投入200万美元/年，测试女性专属长寿方案。讽刺的是，没有政府或大公司愿意投资这类研究，而这明明是人类进步的"低垂果实” 🍎 #长寿科技 #健康 #N1医学

参考链接: https://x.com/timsoret/status/2057824398187319522

Twitter/X版:
Bryan Johnson’s “female Blueprint” - Kate Tolo will be the most measured woman in history. $2M/year investment in female-specific longevity research. Surprising no gov/corp funds this 🔬 #Longevity #HealthTech

参考链接: https://x.com/timsoret/status/2057824398187319522

精选推文 11: Claude 的幽默感超越人类

作者: tokenbender (@tokenbender) ✅
发布时间: 2026-05-22 23:34 (北京时间)
互动数据: ❤️ 11 | 🔄 0 | 💬 1 | 👁️ 355

推文原文:

I find it hilarious that Claude has a better sense of humor than many humans now.
It is tough being a human in the world where you’re ai-mogged on art, humor and maths.

核心要点:
作者感叹 Claude 的幽默感已经超过很多人类，在 AI 被艺术、幽默和数学领域"碾压"的时代，做人类越来越难了。

灵感启发:

AI 在创造性任务上的表现正在超越预期
“AI 不会幽默"的论断可能需要重新审视
人类独特价值的重新定义

可实践建议:
尝试用 Claude 进行创意写作或头脑风暴，你可能会惊喜于其幽默感和创造力。

社交媒体文案:

即刻版:
Claude的幽默感已经比很多人类强了 😂 当AI在艺术、幽默、数学全面超越人类，我们的独特价值在哪里？也许该重新定义"人类优势"了 🤔 #Claude #AI幽默 #创意AI

参考链接: https://x.com/tokenbender/status/2057847597730300172

Twitter/X版:
“Claude has a better sense of humor than many humans now” - AI is mogging us on art, humor, and maths. Time to redefine human uniqueness? 🤖😂 #Claude #AI #Creativity

参考链接: https://x.com/tokenbender/status/2057847597730300172

精选推文 12: Modern LLMs 百位数乘法无需工具

作者: Teortaxes (@teortaxesTex) ✅
发布时间: 2026-05-22 22:12 (北京时间)
互动数据: ❤️ 60 | 🔄 0 | 💬 4 | 👁️ 3,631

推文原文:

Modern LLMs can do multiplication of 100-digit numbers without tools. So much for “embers of autoregression”. Just scale the COT bro

核心要点:
现代 LLM 可以在没有工具的情况下完成 100 位数的乘法运算，这推翻了"自回归模型只能做短程推理"的论断。只要扩大 CoT（思维链）规模，模型就能处理复杂计算。

灵感启发:

CoT scaling 可能是解锁 LLM 推理能力的关键
“自回归局限性"的论断需要重新审视
推理时计算（test-time compute）的重要性

可实践建议:
在需要精确计算的场景，给模型足够的推理时间和空间（long CoT），可能比外挂工具更有效。

社交媒体文案:

即刻版:
LLM现在能心算100位数乘法了！🧮 Teortaxes这句"Just scale the COT bro"说得好 - 只要给够思维链长度，自回归模型也能做复杂推理。推理时计算（test-time compute）可能是下一个突破点 💡 #LLM #推理能力 #CoT

参考链接: https://x.com/teortaxesTex/status/2057826903721951273

Twitter/X版:
Modern LLMs can multiply 100-digit numbers without tools. “Just scale the COT bro” - test-time compute might be the key to unlocking reasoning 🧮 #LLM #Reasoning #ChainOfThought

参考链接: https://x.com/teortaxesTex/status/2057826903721951273

汇总统计

指标	数值
总推文数	50
精选推文数	12
平均点赞数	312
平均转发数	36
平均评论数	56

主题分布:

AI 模型与定价策略: 4 篇
技术突破与开源: 3 篇
AI 哲学与思考: 3 篇
健康科技与长寿: 1 篇
其他: 1 篇

本报告由 X List 每小时精选自动生成
数据来源: X List (ID: 1597115448146898944)
生成时间: 2026-05-23 00:00 (Asia/Shanghai)