SWE-Bench Mobile: Can Large Language Model Agents Develop Industry-Level Mobile Applications?
Muxin Tian*†, Zhe Wang*, Blair Yang, Zhenwei Tang, Kunlun Zhu, Honghua Dong, Hanchen Li, Xinni Xie, Guangjing Wang, Jiaxuan You
arXiv preprint, 2026 (Under Review)
Where LLM Agents Fail and How They Can Learn From Failures
Kunlun Zhu*†, Muxin Tian*, Zijia Liu*, Bingxuan Li*, Yingxuan Yang, Jiaxun Zhang, Pengrui Han, Qipeng Xie, Fuyang Cui, Weijia Zhang, Xiaoteng Ma, Xiaodong Yu, Gowtham Ramesh, Jialian Wu, Zicheng Liu, Pan Lu, James Zou, Jiaxuan You
arXiv preprint, 2025 (Under Review)
OasisSimp: An Open-source Asian-English Sentence Simplification Dataset
Hannah Liu*, Muxin Tian*†, Iqra Ali, Haonan Gao, Qiaoyiwen Wu, Blair Yang, Uthayasanker Thayasivam, En-Shiun Annie Lee, Pakawat Nakwijit, Surangika Ranathunga, Ravi Shekhar
LREC, 2026 (Oral)