I’m a second-year master’s student at Zhejiang University, supervised by Prof. Tao Jin. I have published several first-author papers at top AI conferences, including ICLR, ICML, ACL, and ACM MM. Previously, I interned with the Social Computing Group at Microsoft Research Asia (MSRA), where I worked on streaming video understanding under the mentorship of Jianxun Lian.
My research focuses on Multimodal Large Language Models, especially applications of Vision-Language Models and effective fine-tuning strategies. Recently, I’ve been particularly interested in streaming video understanding, which aims to let models continuously interpret live video streams with strong temporal reasoning and timely responses. My long-term goal is to build a truly user-friendly AI assistant that is reliable, practical, and proactive: one that can understand visual content, communicate naturally, and help users accomplish real-world tasks with a consistently solid experience.
📖 Education
- 2024.09 - 2027.06 (now), Master’s Student, Software School, Software Engineering, Zhejiang University.
- 2020.09 - 2024.06, Undergraduate, Software College, Software Engineering (International (English)), Northeastern University.
- 2017.09 - 2020.06, No.3 Middle School of Wuhan.
🎖 Honors and Awards
- 2026.02 MSRA Stars of Tomorrow Award.
- 2025.10 National Scholarship.
📝 Publications (First Author)
Streaming Video Understanding

Proact-VL: A Proactive VideoLLM for Real-Time AI Companions
Weicai Yan, Yuhong Dai, Qi Ran, Haodong Li, Wang Lin, Hao Liao, Xing Xie, Tao Jin, Jianxun Lian
Parameter-Efficient Fine-Tuning

Text-Guided Multi-Scale Frequency Representation Adaptation
Weicai Yan, Xinhua Ma, Wang Lin, Tao Jin

Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision
Weicai Yan, Wang Lin, Zirun Guo, Ye Wang, Fangming Feng, Xiaoda Yang, Zehan Wang, Tao Jin
- Code.
- Prompt Visualization.


Low-rank Prompt Interaction for Continual Vision-Language Retrieval
Weicai Yan, Ye Wang, Wang Lin, Zirun Guo, Zhou Zhao, Tao Jin
- Code.
💻 Internships
- 2025.08 - 2026.02, MSRA, Social Computing Group, Beijing.
Services
Reviewer: ACM MM 2025, ICLR 2025, ICLR 2026