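(CVPR 2026, first author) WebChain: Why GUI Agents Still Haven't Really Learned to "Use a Browser"
Many systems appear to operate web pages, yet they are not actually using the browser through its GUI. WebChain targets the data problem behind this capability gap.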
Technical articles, research notes, and thoughts.
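The core question WebFactory sets out to answer: language models already understand web pages, so why do they still struggle to complete tasks reliably in real GUI environments?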
We will open source the largest real human trajectory dataset for web agents.
Agents are replacing traditional interaction paradigms, shifting from passive response tools to active decision-and-execution systems. This raises new questions for GUI agent technology itself and for how we understand future human-computer interaction.
We need to strip away the semantic bubble of capital markets and return to the original proposition defined by reinforcement learning: how to achieve true Markov Decision Processes in uncertain environments.
The current AI wave is defined by 'content generation,' but the real paradigm shift is happening beyond 'content.' GUI Agent is not a new form of content output, but a Side Effect manufacturing system. It marks AI's transition from probabilistically 'describing the world' to deterministically 'changing the world.'