X List V2 精选 · 2025-05-31 00:00

每3小时精选高质量推文 | 共筛选 100 条，精选 12 篇

精选推文 1

作者: @GaryMarcus (Gary Marcus, MIT PhD and NYU Professor Emeritus)
发布时间: 2025-05-30 11:47 (北京时间)
互动数据: 1,216 likes · 244 retweets · 254 replies

推文原文

I honestly think Elon’s best days are behind him:

BYD is crushing Tesla in EVs.
Waymo is crushing Tesla in AVs.
Anthropic, Openai, and Google are crushing Xai on AI.
The SpaceX S-1 is so ridiculous the Danish pension fund blacklisted it.
He is not the man he was.

核心要点

Gary Marcus 对 Elon Musk 的商业帝国进行了系统性批判，指出 Tesla 在 EV 领域被 BYD 超越、在自动驾驶领域被 Waymo 领先，xAI 也落后于 Anthropic/OpenAI/Google，SpaceX 的 IPO 文件更是被丹麦养老基金拉黑。

灵感启发

思维模型: 竞争优势的动态性 - 即使是行业领军者，也可能在多线作战中失去焦点。这提醒我们关注技术公司的"护城河"是否在持续加深，还是被多方围攻而稀释。

可实践建议

定期审视你关注的技术公司/产品的竞争格局变化，不要只关注单一指标（如股价），要看市场份额、技术创新、用户口碑等多维度的真实竞争力。

社交媒体文案

🟠 即刻版:
Elon Musk 的商业帝国正在多线失守？🤔

BYD 在电动车领域碾压 Tesla，Waymo 在自动驾驶赛道领先，xAI 也追不上 OpenAI/Claude 的步伐…Gary Marcus 这波分析挺狠的 💀

技术竞争没有永远的王者，只有持续的进化 🔄

https://x.com/GaryMarcus/status/2060568761909092844

🔵 Twitter/X版:
Gary Marcus 对 Elon Musk 商业帝国的系统性批判：Tesla 被 BYD 超越、Waymo 领先自动驾驶、xAI 落后、SpaceX IPO 被拉黑。

技术竞争没有永远的王者 📉

https://x.com/GaryMarcus/status/2060568761909092844

精选推文 2

作者: @omarsar0 (elvis)
发布时间: 2025-05-30 23:52 (北京时间)
互动数据: 30 likes · 5 retweets · 4 replies

推文原文

Increasingly, HTML Artifacts are becoming a core part of how I work with AI agents.

Long-horizon agent sessions need a better way to surface insights about what work it has done.

This may not be obvious right now, but as you start to let your agent work on dynamic workflows, large codebases, long-running loops (e.g., using /goal), and deep research tasks, you need a good way to present results. Chat window is not it.

You also don’t want to just trust everything the agents do. Artifacts help provide an important verification layer, which in turn enables important decision-making.

I like HTML artifacts because I can just ask the agent to produce as many of them (and in whatever form) as I need to verify the work and make sense out of everything. I even built a nice tab system for my artifacts. They are great for continual learning and research.

I use HTML artifacts for logging, tracking experiments, brainstorming, managing my inbox, code reviews, agent session management, deep research, writing, reading, and so much more.

I believe @karpathy wrote about this somewhere: As we move on to more advanced applications of AI agents and outputs get more complex, we will start to find the need for even more advanced forms of interactions with AI, including interactive neural videos/simulations.

核心要点

elvis 分享了使用 HTML Artifacts 与 AI Agent 协作的深度实践经验，强调长时程 Agent 会话需要更好的结果呈现方式，Artifacts 提供了重要的验证层和决策支持。

灵感启发

思维模型: 人机协作的界面进化 - 从简单的对话式交互到结构化、可视化的 Artifact 交互。这暗示着 AI 交互范式的转变：从"问-答"到"委托-验证-迭代"。

可实践建议

尝试让 AI Agent 生成 HTML 格式的报告/仪表板来展示工作成果，而不是仅仅依赖对话窗口。可以包含：任务进度、关键发现、待确认决策点等结构化信息。

社交媒体文案

🟠 即刻版:
AI Agent 工作流的下一个进化点？🤖

elvis 分享了一个超实用的模式：用 HTML Artifacts 作为 Agent 的输出载体，而不是依赖聊天窗口 💡

日志追踪、实验记录、代码审查、深度研究…都可以可视化呈现。这给了人类一个"验证层"，避免盲目信任 AI 🛡️

@karpathy 也说过：AI 输出越复杂，我们越需要更高级的交互形式 🎯

https://x.com/omarsar0/status/2060751120587497720

🔵 Twitter/X版:
HTML Artifacts 正在成为 AI Agent 协作的核心组件。

长时程 Agent 会话需要更好的结果呈现方式，Artifacts 提供了验证层和决策支持。

从对话式交互到结构化可视化，这是 AI 协作范式的进化 🔄

https://x.com/omarsar0/status/2060751120587497720

精选推文 3

作者: @ClementDelangue (clem 🤗)
发布时间: 2025-05-30 23:43 (北京时间)
互动数据: 46 likes · 13 retweets · 7 replies

推文原文

AI safety can’t happen behind closed doors! Super cool to see that the @AISecurityInst is releasing its evals, datasets, and models in the open on @huggingface, so researchers everywhere can scrutinize, reproduce, and build on them.

核心要点

HuggingFace CEO 强调 AI 安全不能闭门造车，AISecurityInst 在 HuggingFace 上开源了其评估标准、数据集和模型，让全球研究者可以审查、复现和扩展。

灵感启发

思维模型: 安全的开源悖论 - 传统认为安全应该保密，但 AI 安全的特殊性在于需要集体智慧来发现和修复问题。开源安全评估反而能加速安全标准的建立。

可实践建议

如果你在做 AI 相关的安全研究，考虑开源你的评估方法和数据集。安全不是隐藏问题，而是让问题被发现和解决。

社交媒体文案

🟠 即刻版:
AI 安全不能闭门造车！🔓

HuggingFace CEO 分享好消息：AISecurityInst 把评估标准、数据集、模型都开源了 🎉

全球研究者可以一起审查、复现、改进。这才是解决 AI 安全问题的正确姿势 - 集众人之力 💪

https://x.com/ClementDelangue/status/2060749008641970465

🔵 Twitter/X版:
AI safety can’t happen behind closed doors.

AISecurityInst 在 HuggingFace 开源了 evals、datasets 和 models，让全球研究者可以 scrutinize、reproduce 和 build on them。

Open approach to safety 🛡️

https://x.com/ClementDelangue/status/2060749008641970465

精选推文 4

作者: @emollick (Ethan Mollick)
发布时间: 2025-05-30 22:55 (北京时间)
互动数据: 121 likes · 4 retweets · 18 replies

推文原文

I think Epoch does a great job benchmarking, but I continue to believe that open weights models are much more fragile, especially out-of-distribution, than their benchmarks indicate. Vibe-wise, I don’t think they were only 3 months behind last year or only 4 months behind today.

核心要点

Ethan Mollick 对 Epoch AI 的基准测试提出质疑，认为开源模型在实际应用中的鲁棒性（特别是分布外场景）比基准显示的要差，实际差距可能大于 3-4 个月。

灵感启发

思维模型: 基准测试 vs 真实世界 - 基准测试往往是在受控环境下进行的，而真实应用场景更加复杂和混乱。这提醒我们关注"纸面性能"与"实际表现"之间的差距。

可实践建议

在选择 AI 模型时，不要只看 benchmark 分数，要在自己的实际数据和使用场景下进行测试。特别是开源模型，要注意其在边界情况下的表现。

社交媒体文案

🟠 即刻版:
开源模型真的只落后 3-4 个月吗？🤔

Ethan Mollick 提出一个关键观点：基准测试 ≠ 真实表现

开源模型在分布外场景下可能更脆弱。Vibe 上感觉差距比数字显示的更大 📊

选模型别只看 benchmark，要在自己的场景实测 💡

https://x.com/emollick/status/2060736941453189622

🔵 Twitter/X版:
Open weights models are more fragile than benchmarks indicate, especially out-of-distribution.

The gap between open and proprietary models might be larger than 3-4 months in real-world usage.

Benchmarks ≠ Real performance 📊

https://x.com/emollick/status/2060736941453189622

精选推文 5

作者: @NVIDIAAI (NVIDIA AI)
发布时间: 2025-05-30 06:00 (北京时间)
互动数据: 491 likes · 61 retweets · 25 replies

推文原文

Hours of video, now searchable by your agent.

We just released a new set of agent skills and modular architecture for the Metropolis Blueprint for Video Search and Summarization, eliminating the need for manual configuration of multiple microservices.

Load the skills into a compatible coding agent and it deploys the stack, turning hours of footage into searchable, actionable intelligence through a chat interface. Ask in plain language and get back clips, summaries, and answers.

核心要点

NVIDIA 发布了新的 Agent Skills 和模块化架构，让开发者可以通过自然语言对话界面搜索和总结数小时的视频内容，无需手动配置多个微服务。

灵感启发

思维模型: 视频内容的 Agent 化 - 视频数据正在从"被动存储"转向"主动可查询的知识库"。这意味着未来的视频内容管理将完全改变，从文件夹浏览变成对话式检索。

可实践建议

如果你有大量的视频内容需要管理，关注这类视频搜索 Agent 技术。未来视频库将像数据库一样可被查询，而不再是线性浏览。

社交媒体文案

🟠 即刻版:
NVIDIA 新发布：视频也能被 Agent 搜索了！🎥🤖

几小时的视频素材，现在可以用自然语言对话来搜索、总结、提取关键片段 💡

不用手动配置微服务，直接加载 Agent Skills 就能部署

视频内容管理要变天了 🌊

https://x.com/NVIDIAAI/status/2060481312511623513

🔵 Twitter/X版:
NVIDIA releases new agent skills for video search and summarization.

Hours of footage → searchable via chat interface. Natural language queries return clips, summaries, and answers.

No manual microservice configuration needed 🎥🤖

https://x.com/NVIDIAAI/status/2060481312511623513

精选推文 6

作者: @gdb (Greg Brockman, President & Co-Founder @OpenAI)
发布时间: 2025-05-30 05:19 (北京时间)
互动数据: 716 likes · 37 retweets · 48 replies

推文原文

codex for managing the codex UI:

引用推文: “If you ever get tired of managing your Codex threads, just let Codex manage itself! Codex can now create threads, search them, organize them, pin the important ones, and spin up worktrees for parallel tasks.”

核心要点

OpenAI 展示了 Codex 的自我管理能力：Codex 现在可以创建、搜索、组织线程，置顶重要对话，并为并行任务创建工作区。这是 Agent 自我管理的早期形态。

灵感启发

思维模型: 递归式 Agent 能力 - Agent 开始管理 Agent，这是向更高级自主系统演进的关键一步。当工具能够管理自身状态时，人类从"操作者"转变为"监督者"。

可实践建议

尝试让 AI Agent 帮你管理工作流程，比如自动整理聊天记录、归档已完成任务、提醒待办事项。把重复性的管理工作委托给 Agent。

社交媒体文案

🟠 即刻版:
Codex 开始自己管理自己了！🤯

OpenAI 展示了一个有趣的功能：Codex 可以创建、搜索、组织线程，还能置顶重要对话、为并行任务创建工作区

这是 Agent 自我管理的早期形态 - 工具开始自己管理自己的状态 🔄

人类从"操作者"变成"监督者" 👀

https://x.com/gdb/status/2060486309886443787

🔵 Twitter/X版:
Codex managing Codex: create threads, search, organize, pin important ones, spin up worktrees for parallel tasks.

Early form of agent self-management 🤖🔄

https://x.com/gdb/status/2060486309886443787

精选推文 7

作者: @bilawalsidhu (Bilawal Sidhu)
发布时间: 2025-05-30 08:28 (北京时间)
互动数据: 596 likes · 68 retweets · 20 replies

推文原文

i like big splats and i cannot lie.

you can now compress, tile and stream city scale 3d gaussian splats – glTF has an official 3DGS extension now too.

this is what the future of google earth looks like. no more broccoli trees. no more melted powerlines. immaculate ground level detail that actually holds up.

核心要点

3D Gaussian Splatting 技术取得突破：现在可以压缩、切片和流式传输城市规模的 3DGS，glTF 也加入了官方 3DGS 扩展。这意味着 Google Earth 级别的精细 3D 地图即将成为现实。

灵感启发

思维模型: 空间计算的临界点 - 当 3D 内容可以高效压缩和流式传输时，空间计算应用将迎来爆发。从"西兰花树"到"精确细节"，这是从 demo 到 production 的关键跨越。

可实践建议

如果你有 AR/VR 或地图相关的项目，开始关注 3D Gaussian Splatting 技术。这可能是下一代空间应用的基础设施。

社交媒体文案

🟠 即刻版:
3D 地图要变天了！🗺️✨

Bilawal Sidhu 分享：城市规模的 3D Gaussian Splatting 现在可以压缩、切片、流式传输了！glTF 也加入了官方扩展

Google Earth 的未来：告别"西兰花树"，迎来精确到地面的细节 🌳➡️🏙️

空间计算的基础设施正在成熟 🚀

https://x.com/bilawalsidhu/status/2060518632547877359

🔵 Twitter/X版:
City-scale 3D Gaussian Splatting is here.

Compress, tile, stream. glTF now has official 3DGS extension.

The future of Google Earth: no more broccoli trees, no more melted powerlines. Immaculate ground-level detail 🗺️✨

https://x.com/bilawalsidhu/status/2060518632547877359

精选推文 8

作者: @ajambrosino (Andrew Ambrosino, Codex @OpenAI)
发布时间: 2025-05-30 06:20 (北京时间)
互动数据: 253 likes · 12 retweets · 22 replies

推文原文

and more!:

new subagent identicons

search previous chat content, not just titles

codex can now better control the app itself

核心要点

Codex 应用更新：新增子代理标识图标、支持搜索聊天内容（不仅是标题）、Codex 可以更好地控制应用本身。这些改进指向更强的 Agent 自主能力。

灵感启发

思维模型: Agent 应用的控制权转移 - 从"人类操作工具"到"Agent 操作应用"。搜索聊天内容意味着 Agent 有了"记忆检索"能力，这是长期任务执行的基础。

可实践建议

在使用 AI 工具时，注意利用其记忆和搜索功能。让 Agent 能够回顾之前的对话内容，可以显著提升长时程任务的连贯性。

社交媒体文案

🟠 即刻版:
Codex 又更新了！🆕

子代理有了专属标识
可以搜索聊天内容（不只是标题）
能更好地控制应用本身

Agent 的记忆检索能力越来越强，长时程任务执行会更连贯 🧠🔗

https://x.com/ajambrosino/status/2060486346796576898

🔵 Twitter/X版:
Codex updates:

Subagent identicons
Search chat content, not just titles
Better app self-control

Agent memory retrieval improving 🧠🔗

https://x.com/ajambrosino/status/2060486346796576898

精选推文 9

作者: @kyleichan (Kyle Chan, Research Fellow @BrookingsInst)
发布时间: 2025-05-30 15:23 (北京时间)
互动数据: 118 likes · 39 retweets · 6 replies

推文原文

The Asian AI boom is masking other problems

引用推文: “If you exclude semiconductors and AI servers, Taiwanese exports have actually fallen by 40% since 2022. In South Korea, non-AI exports have stagnated and Japan’s industry is in decline.”

核心要点

Kyle Chan 指出亚洲 AI 繁荣正在掩盖其他结构性问题：如果不算半导体和 AI 服务器，台湾出口自 2022 年以来实际下降了 40%，韩国非 AI 出口停滞，日本产业衰退。

灵感启发

思维模型: 结构性失衡的隐藏 - 当一个行业（AI/半导体）高速增长时，容易让人忽视其他产业的衰退。这是典型的"结构性转型期"特征。

可实践建议

关注宏观经济数据时要细分看：整体增长可能掩盖结构性问题。对于投资或职业选择，要考虑"AI 繁荣"之外的产业机会和风险。

社交媒体文案

🟠 即刻版:
AI 繁荣正在掩盖什么？🤔

Kyle Chan 的数据很扎心：不算半导体和 AI 服务器，台湾出口自 2022 年跌了 40%，韩国非 AI 出口停滞，日本产业衰退…

结构性转型期的典型特征：一个行业狂飙，其他行业失血 🩸

投资/择业要看到全貌 👀

https://x.com/kyleichan/status/2060622996549603453

🔵 Twitter/X版:
The Asian AI boom is masking other problems.

Excluding semiconductors and AI servers:

Taiwan exports: -40% since 2022
South Korea non-AI exports: stagnant
Japan industry: in decline

Structural transition pains 📉

https://x.com/kyleichan/status/2060622996549603453

精选推文 10

作者: @omarsar0 (elvis)
发布时间: 2025-05-30 11:12 (北京时间)
互动数据: 92 likes · 13 retweets · 18 replies

推文原文

In a few months, people will start to realize how fundamentally important MCP for agents is.

It’s not even about connecting tools. There are many ways to do that.

It’s about the types of abstraction it already enables. My new self-improving system, enabled through agent-to-agent interaction, is all powered by MCPs.

This was not an accident. I ran my entire orchestrator through a self-improving loop with clear criteria/goal, and it came up with all kinds of interesting ways (mostly powered by MCP tools) on how to enable complex interactions, versioning, eval workflows, communications, tools, etc.

Something new could always emerge, but I think the protocol itself will be crucial and necessary for all the advancements ahead.

MCP is the future. And I am glad a lot of it is built in the open.

核心要点

elvis 强调 MCP (Model Context Protocol) 对 Agent 生态的根本性重要性。他认为 MCP 的价值不仅是连接工具，而是它启用的抽象层 - Agent 之间的复杂交互、版本控制、评估工作流等。

灵感启发

思维模型: 协议层创新 - 就像 HTTP 协议催生了互联网，MCP 可能成为 Agent 时代的"HTTP"。协议的价值在于标准化接口，让不同系统能够无缝协作。

可实践建议

如果你正在构建 Agent 系统，开始学习和使用 MCP。这可能是未来 Agent 生态的基础设施标准。

社交媒体文案

🟠 即刻版:
MCP 可能是 Agent 时代的 HTTP 🌐

elvis 分享了他用 MCP 构建自改进系统的经验：Agent 之间的复杂交互、版本控制、评估工作流…

MCP 不只是连接工具，而是提供了标准化的抽象层 🛠️

协议层创新往往催生生态爆发 💥

https://x.com/omarsar0/status/2060559932533989719

🔵 Twitter/X版:
MCP is the future of agent infrastructure.

Not just about connecting tools - it’s about the abstractions it enables: agent-to-agent interaction, versioning, eval workflows.

Protocol layer innovation 🌐

https://x.com/omarsar0/status/2060559932533989719

精选推文 11

作者: @fchollet (François Chollet, Creator of Keras and ARC-AGI)
发布时间: 2025-05-30 20:12 (北京时间)
互动数据: 213 likes · 23 retweets · 53 replies

推文原文

The end will begin when humanity turns away from humanity

核心要点

François Chollet 的这句简短但深刻的推文，暗示了 AI 发展的终极风险：当人类开始放弃人性、放弃人类特有的价值和能力时，“终结"就开始了。

灵感启发

思维模型: 技术异化的临界点 - 技术的终极风险不是机器取代人类，而是人类主动放弃自己的独特性。保持"人性"可能是对抗技术失控的最后防线。

可实践建议

在追求 AI 效率的同时，保留那些” inefficient but human"的活动：面对面的深度交流、手工创作、无目的的思考。这些"低效"活动可能是保持人性的关键。

社交媒体文案

🟠 即刻版:
“终结的开始，是人类背弃人性之时” 💭

Keras 创始人 François Chollet 的这句话值得深思

AI 的终极风险可能不是机器取代人类，而是我们主动放弃自己的独特性 🎭

在追求效率的同时，保留那些"低效但有人性"的活动吧 ✨

https://x.com/fchollet/status/2060695962600009969

🔵 Twitter/X版:
“The end will begin when humanity turns away from humanity”

The ultimate risk of AI may not be replacement, but humans voluntarily abandoning what makes us human.

Preserve the inefficient but human 💭

https://x.com/fchollet/status/2060695962600009969

精选推文 12

作者: @c_valenzuelab (Cristóbal Valenzuela, Co-Founder & Co-CEO @runwayml)
发布时间: 2025-05-30 19:50 (北京时间)
互动数据: 70 likes · 5 retweets · 15 replies

推文原文

These judgments are a solid example of cultural essentialism. iow, the belief that there is a fixed essence of “real cinema” or “true art”

Every generation internalizes the standards of its artistic community and then experiences those standards as self evidently correct rather than socially learned.

Basically:

A group of people/artists develops certain conventions, eg: “good films have three act structures”,“cinema should be shot on film”, “art should be representational”, “animation should only be done with certain software”

These conventions prove useful in some contexts

Over time, people forget that they were choices made by particular humans in particular historical circumstances

Opinions are not facts. Don’t mistake accumulated traditions and conventions for laws of nature

核心要点

Runway CEO 对"文化本质主义"的批判：每一代人都把艺术社区的标准内化为"理所当然"，却忘记这些只是特定历史情境下人类的选择。观点不是事实，不要把传统当成自然法则。

灵感启发

思维模型: 传统的去神秘化 - 许多我们认为"本该如此"的规则，其实只是历史的偶然产物。AI 生成内容引发的争议，本质是新旧创作范式的冲突。

可实践建议

面对新技术（如 AI 创作）时，审视自己的抵触是否来自"文化本质主义"。问自己：这个标准是基于实际效果，还是仅仅因为"向来如此"？

社交媒体文案

🟠 即刻版:
“观点不是事实，不要把传统当成自然法则” 🎨

Runway CEO 这波对"文化本质主义"的批判很到位

每一代人都把艺术标准当成"理所当然"，却忘记这只是特定历史的选择

AI 创作引发的争议，本质是新旧范式的碰撞 💥

保持开放，审视自己的预设 🧠

https://x.com/c_valenzuelab/status/2060690322552991749

🔵 Twitter/X版:
“Opinions are not facts. Don’t mistake accumulated traditions and conventions for laws of nature.”

Cultural essentialism: every generation internalizes artistic standards as “self-evident” while forgetting they were historically contingent choices.

AI art debates = paradigm clash 💥

https://x.com/c_valenzuelab/status/2060690322552991749

📈 主题分布

主题	数量
AI Agent / 工具	5 篇
AI 安全/伦理	3 篇
技术趋势/产业	2 篇
空间计算/3D	1 篇
创作/艺术	1 篇

🔗 数据来源

X List ID: 1578456227805564928
筛选标准: 点赞>10 或转发>5 或评论>5 或内容深度>100字
生成时间: 2025-05-31 00:30 CST

本报告由 AI Agent 自动生成，如有遗漏或错误，请以原文为准。