EN | 中文
Rongyao Fang

Rongyao Fang 方荣耀方荣耀

Research Scientist研究员

Qwen VL TeamQwen VL 团队
Alibaba Group阿里巴巴集团

Email:邮箱: rongyaofang@gmail.com

[Google Scholar]    [GitHub]    [CV]

Biography个人简介

I am currently a Research Scientist at Alibaba Qwen VL Team, working on unified multimodal large models that integrate visual understanding and generation, as well as agentic approaches that leverage VLMs to orchestrate visual creation. My research is driven by a passion for Artificial General Intelligence (AGI), with a focus on building omni-modal foundation models that can perceive, reason, and create across vision, language, and beyond.

I obtained my Ph.D. from the Multimedia Laboratory (MMLab) of The Chinese University of Hong Kong (CUHK) in 2025, fortunate to be supervised by Prof. Hongsheng Li. I also worked closely with Prof. Xihui Liu.

Previously, I was a visiting scholar at MIT CSAIL, advised by Prof. Dina Katabi. I obtained my B.Eng. degree from Shanghai Jiao Tong University, where I was ranked 1st/157 and advised by Prof. Bingbing Ni.

我目前在阿里巴巴 Qwen VL 团队担任研究员,主要从事统一多模态大模型研究,推动视觉理解与视觉生成的一体化能力提升,同时探索以视觉语言模型为核心的智能体驱动视觉创作。我的研究以通用人工智能(AGI)为长期目标,关注全模态基础模型,让模型能够在视觉、语言等不同模态间实现统一的感知、推理与生成。

我于2025年在香港中文大学多媒体实验室(MMLab)获得博士学位,导师为李鸿升教授。博士期间,我也与刘希慧教授保持紧密合作。

此前,我曾在麻省理工学院计算机科学与人工智能实验室(MIT CSAIL)担任访问学者,导师为 Prof. Dina Katabi。本科毕业于上海交通大学信息工程专业,排名第1/157名,导师为倪冰冰教授

News最新动态

Education教育背景

Publications学术论文

(* indicates equal contribution)(* 表示同等贡献)

Experience工作经历

Selected Awards所获荣誉