Research & Projects
Published papers, open-source projects, and tools.
Publications
WebChain: A Large-Scale Human-Annotated Dataset of Real-World Web Interaction Traces
Sicheng Fan, Rui Wan, Yifei Leng, Gaoning Liang, Li Ling, Yanyi Shang, Dehan Kong
IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2026
WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web Agents
Sicheng Fan, Qingyun Shi, Shengze Xu, Shengbo Cai, Tieyong Zeng, Li Ling, Yanyi Shang, Dehan Kong
International Conference on Learning Representations, 2026
Open Source
WebChain
ActiveWebChain is the largest open-source dataset of human-annotated trajectories on real-world websites, comprising 31,725 trajectories with 318,000 steps. First-author paper at CVPR 2026.
WebFactory
ActiveWebFactory is an automated RL training pipeline for GUI web agents, eliminating unsafe live web interactions and expensive human-annotated datasets. First-author paper at ICLR 2026.
WebClone
ActiveWeb Agent evaluation environment. Provides offline controllable website cloning for reproducible AI Agent testing, supporting batch data generation and standardized evaluation.
CafeMeet
ActiveSmart meeting spot recommendation system. Using AI and map data analysis to intelligently recommend the best cafés for group meetings, considering ratings, distance, ambiance, and transportation.