Homepage

Welcome to my homepage!

  • 🎓I am currently pursuing my MSc in Computer Engineering at the National University of Singapore (NUS). I completed my undergraduate degree at Nanjing University of Aeronautics and Astronautics (NUAA) from 2018 to 2022. In 2022, I had the opportunity to work as a community intern in the HUAWEI MindSpore division during my stay in Durham, UK from May 2023 to July 2023.
  • 🎞️ Update 2023.8 Glad to join in SHOWLAB @NUS, where impressive and innovative works are been made!
  • News: July 2024 Start an internship at Tencent WXG.
  • Datasets and annotations of GUI Action Narrator are released.
  • Video GUI got accepted by Neurips 2024 (DB track Spotlight). Congrats.

Research Interests

  • Vision and Language
  • LL(V)M based Agent
  • GUI Understanding

Publications at a Glance

  • CVPR 2024 accepted paper, collaborator Pioneer AI Assistant for Graphical User Interface (GUI) which can assist users in completing complex tasks, boosting human productivity. More details are available at arxiv




  • GUI Action Narrator How Visual language model can tell the action of the user in Graphic User Interface through consecutive screenshots? Available at Link.






Internships

  • Tencent WXG Guangzhou From July 2024 to Nov 2024