Homepage

Welcome to my homepage!

  • I am currently an Algorithm Engineer specializing in the pretraining and post training of large language model (LLM) and multimodal LLMs. Previously, I obtained my Master’s degree in Computer Engineering from the National University of Singapore (NUS).
  • 🎓I completed my undergraduate degree at Nanjing University of Aeronautics and Astronautics (NUAA) from 2018 to 2022. In 2022, I had the opportunity to work as a community intern in the HUAWEI MindSpore division during my stay in Durham, UK from May 2023 to July 2023.
  • 🎞️ Update 2023.8 Glad to join in SHOWLAB @NUS, where impressive and innovative works are been made!
  • News: July 2024 Start an internship at Tencent WXG.
  • Video GUI got accepted by Neurips 2024 (DB track Spotlight). Congrats.
  • GUI Action Narrator got accepted by ACMMM 2025.

Research Interests

  • Vision and Language
  • LL(V)M based Agent
  • GUI Understanding

Reseach Works

  • AssitGUI (CVPR 2024) Pioneer AI Assistant for Graphical User Interface (GUI) which can assist users in completing complex tasks, boosting human productivity. More details are available at arxiv

  • VideoGUI (Neurips 2024 DB) Evaluating Ai agent comprehensively. Available at Link.

  • GUI Action Narrator (ACMMM 2025) How Visual language model narrate the learn from human demonstrations in GUI? Available at Link.

Internships

  • Tencent WXG Guangzhou From July 2024 to Oct 2024

Current updates

Served as reviewer in Neurips 25 DB track