Homepage

Welcome to my homepage!

  • 🎓I am currently pursuing my MSc in Computer Engineering at the National University of Singapore (NUS). I completed my undergraduate degree at Nanjing University of Aeronautics and Astronautics (NUAA) from 2018 to 2022. In 2022, I had the opportunity to work as a community intern in the HUAWEI MindSpore division during my stay in Durham, UK from May 2023 to July 2023.
  • 🎞️ Update 2023.8 Glad to join in SHOWLAB @NUS, where impressive and innovative works are been made!
  • News: July 2024 Start an internship at Tencent WXG.
  • Datasets and annotations of GUI Action Narrator are released.
  • Video GUI got accepted by Neurips 2024 (DB track Spotlight). Congrats.

Research Interests

  • Vision and Language
  • LL(V)M based Agent
  • GUI Understanding

Reseach Works

  • AssitGUI (CVPR 2024) Pioneer AI Assistant for Graphical User Interface (GUI) which can assist users in completing complex tasks, boosting human productivity. More details are available at arxiv

  • GUI Action Narrator How Visual language model can tell the action of the user in Graphic User Interface through consecutive screenshots? Available at Link.

  • VideoGUI (Neurips 2024 DB) Evaluating Ai agent comprehensively. Available at Link.

Internships

  • Tencent WXG Guangzhou From July 2024 to Nov 2024