I'm currently working as a Senior Research Scientist at
Canva
,
with a focus on Image Generation and Multimodal LLMs.
I completed my Master's degree at Nankai University,
where I was under the supervision of
Ming-Ming Cheng.
Please feel free to contact me at
(📮: zzhang🥳mail🔅nankai🔅edu🔅cn)
is now live on
, an AI-driven graphic design generation system for multi-layer and editable compositions with strong visual appeal.
was accepted by AAAI 2025. We have unleashed the potential of MLLM in graphic design.
, an awesome MLLM for Referential Dialogue.
Available on
Layout to Design generates complete and visually appealing designs from user-provided text, assets, and an initial layout, helping users quickly create polished posters, product displays, and promotional content.
Masked Region Transformer for Layered Image
Generation and Editing at Scale
CVPR 2026  
[Paper]
CreatiDesign: A Unified Multi-Conditional Diffusion Transformer
for Creative Graphic Design
ICLR 2026  
[Repo]
[Project page]
[Paper]
[bib]
Decomposition of Graphic Design with Unified Multimodal Model
ICML 2025  
[Repo Coming Soon]
Gradient-Induced Co-Saliency Detection
ECCV 2020  
[PDF]
[Project]
[Code]
[Short Video]
[Long Video]
[Slides]
[中译版]
[bib]