Bowen Wang

I'm a

2nd

year Ph.D. student at the University of Hong Kong, with deep focus and passion on NLP and fundamental model-based agent. I'm fortunate to get advised by Prof. Tao Yu  and be a part of the XLANG Lab  and HKU NLP Lab. Previously I got my bachlor's degree of Computer Science and Technology at Tsinghua University  focusing on Human-Computer Interaction (HCI), advised by Prof. Chun Yu and Prof. Yuanchun Shi.
My long-term research goal is to build autonomous GUI agents that can 1) understand open-ended natural language (or even multi-modal) instructions; 2) observe the wild GUI environment (OS, Mobile and more) with grounded knowledge; 3) generate executable steps (e.g. atomic actions, system shortcuts, codes, etc.) iteratively to finalize the task.
My research focuses on building general autonomous agents. I have primarily worked on computer-use agents (CUA), developing comprehensive evaluation frameworks and open foundations through supervised fine-tuning. My current research interests lie in digital agentic models, particularly in post-training methodologies for creating genuinely generalizable and intelligent agents that can reliably operate across diverse digital environments.

Experiences

  1. Company
    Moonshot AI
    Role
    Research Intern
    Date

Education

  1. Company
    The University of Hong Kong
    Role
    Ph.D. Student
    Date
  2. Company
    National University of Singapore
    Role
    Research Assistant
    Date
  3. Company
    Tsinghua University
    Role
    B.E. in Computer Science
    Date

Computer Agent Arena: Evaluating Computer-Use Agents via Crowdsourcing from Real Users

Bowen Wang*, Xinyuan Wang*, Jiaqi Deng*, Tianbao Xie, Ryan Li, Yanzhe Zhang, Zicheng Gong, Gavin Li, Toh Jing Hua, Ion Stoica, Wei-Lin Chiang, Diyi Yang, Yu Su, Yi Zhang, Zhiguo Wang, Victor Zhong, Tao Yu

OpenCUA: Open Foundations for Computer-Use Agents

Xinyuan Wang*, Bowen Wang*, Dunjie Lu, Junlin Yang, Tianbao Xie, Junli Wang, Jiaqi Deng, Xiaole Guo, Yiheng Xu, Chen Henry Wu, Zhennan Shen, Zhuokai Li, Ryan Li, Xiaochuan Li, Junda Chen, Boyuan Zheng, Peihang Li, Fangyu Lei, Ruisheng Cao, Yeqiao Fu, Dongchan Shin, Martin Shin, Jiariu Hu, Yuyan Wang, Jixuan Chen, Yuxiao Ye, Danyang Zhang, Yipu Wang, Heng Wang, Diyi Yang, Victor Zhong, Y. Charles, Zhilin Yang, Tao Yu

NeurIPS'25 (Spotlight)COLM'25 AIA Workshop (Best Paper Award)[Website][Paper][Code]

Kimi-VL Technical Report

Kimi Team: Angang Du, Bohong Yin, Bowei Xing, Bowen Qu, Bowen Wang, Cheng Chen, Chenlin Zhang, Chenzhuang Du, Chu Wei, Congcong Wang, Dehao Zhang, Dikang Du, Dongliang Wang, Enming Yuan, Enzhe Lu, Fang Li, Flood Sung, Guangda Wei, Guokun Lai, Han Zhu, Hao Ding, Hao Hu, Hao Yang, et al.

Technical Report[Paper]

AutoTask: Executing Arbitrary Voice Commands by Exploring and Learning from Mobile GUI

Bowen Wang*, Lihang Pan*, Chun Yu, Yuxuan Chen, Xiangyu Zhang, Yuanchun Shi

ArXiv[Paper]

Interaction Proxy Manager: Semantic Model Generation and Run-time Support for Reconstructing Ubiquitous User Interfaces of Mobile Services

Tian Huang, Chun Yu, Weinan Shi, Bowen Wang, David Yang, Yihao Zhu, Zhaoheng Li, Yuanchun Shi

IMWUT 22'[Paper]