Homepage
Welcome to my homepage!
- 🎓I am currently pursuing my MSc in Computer Engineering at the National University of Singapore (NUS). I completed my undergraduate degree at Nanjing University of Aeronautics and Astronautics (NUAA) from 2018 to 2022. In 2022, I had the opportunity to work as a community intern in the HUAWEI MindSpore division during my stay in Durham, UK from May 2023 to July 2023.
- 🎞️ Update 2023.8 Glad to join in SHOWLAB @NUS, where impressive and innovative works are been made!
- News: July 2024 Start an internship at Tencent WXG.
- Datasets and annotations of GUI Action Narrator are released.
- Video GUI got accepted by Neurips 2024 (DB track Spotlight). Congrats.
Research Interests
- Vision and Language
- LL(V)M based Agent
- GUI Understanding
Publications at a Glance
CVPR 2024 accepted paper, collaborator Pioneer AI Assistant for Graphical User Interface (GUI) which can assist users in completing complex tasks, boosting human productivity. More details are available at arxiv
GUI Action Narrator How Visual language model can tell the action of the user in Graphic User Interface through consecutive screenshots? Available at Link.
Internships
- Tencent WXG Guangzhou From July 2024 to Nov 2024