Zhao Zhang's Publications

Preprint Papers

    CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation
    Zhao Zhang, Yutao Cheng, Dexiang Hong, Maoke Yang, Gonglei Shi, Lei Ma, Hui Zhang, Jie Shao, Xinglong Wu
    arXiv 2025   [Repo] [Paper]

    Shikra: Unleashing Multimodal LLM’s Referential Dialogue Magic
    Keqin Chen, Zhao Zhang*, Weili Zeng, Richong Zhang, Feng Zhu, Rui Zhao
    arXiv   [PDF] [Code] [bib]


    Image Harmonization by Matching Regional References
    Ziyue Zhu, Zhao Zhang, Zheng Lin, Ruiqi Wu, Chunle Guo
    arXiv   [PDF] [Code] [Chinese version] [bib]


Journal Papers

2025

    RelationLMM: Large Multimodal Model as Open and Versatile Visual Relationship Generalist
    Chi Xie, Shuang Liang, Jie Li, Zhao Zhang, Feng Zhu, Rui Zhao
    TPAMI 2025   [Paper] [bib]

    User-Oriented Interactive Style Transfer
    Zheng Lin, Zhao Zhang, Kang-Rui Zhang, Bo Ren, Ming-Ming Cheng
    CVMJ 2025   [PDF] [Code] [Chinese version] [bib]

2023

    Co-Salient Object Detection with Co-Representation Purification
    Ziyue Zhu*, Zhao Zhang*, Zheng Lin, Xing Sun, Ming-Ming Cheng
    TPAMI 2023   [PDF] [Code] [Chinese version] [bib]

2022

    Sequential Interactive Image Segmentation
    Zheng Lin, Zhao Zhang, Zi-Yue Zhu, Deng-Ping Fan, Xia-Lei Liu
    CVMJ 2022   [PDF] [Code] [Chinese version]

2021

    Bilateral Attention Network for RGB-D Salient Object Detection
    Zhao Zhang, Zheng Lin, Jun Xu, Wenda Jin, Shao-Ping Lu, Deng-Ping Fan
    TIP 2021   [PDF] [Code] [bib]

2020

    Rethinking RGB-D Salient Object Detection: Models, Datasets, and Large-Scale Benchmarks
    Deng-Ping Fan, Zheng Lin, Zhao Zhang, Menglong Zhu, Ming-Ming Cheng
    TNNLS 2020   [PDF] [Code] [Project] [bib]


Conference Papers

2026

    Masked Region Transformer for Layered Image Generation and Editing at Scale
    Zhicong Tang, Jingye Chen, Zhao Zhang, Mohan Zhou, Yuchi Liu, Yifan Pu, Yalong Bai, Ethan Smith, Yuhui Yuan
    CVPR 2026   [Paper]

    CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design
    Hui Zhang, Dexiang Hong, Maoke Yang, Yutao Cheng, Zhao Zhang, Jie Shao, Xinglong Wu, Zuxuan Wu, Yu-Gang Jiang
    ICLR 2026   [Repo] [Project page] [Paper] [bib]

    Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens
    Qingsong Xie, Zhao Zhang, Zhe Huang, Yanhao Zhang, Haonan Lu, Zhenyu Yang
    CVPR 2026   [PDF] [Project] [bib]

2025

    Decomposition of Graphic Design with Unified Multimodal Model
    Hui Nie, Zhao Zhang, Yutao Cheng, Maoke Yang, Gonglei Shi, Qingsong Xie, Jie Shao, Xinglong Wu
    ICML 2025   [Repo Coming Soon]

    Graphic Design with Large Multimodal Model
    Yutao Cheng*, Zhao Zhang*, Maoke Yang*, Hui Nie, Chunyuan Li, Xinglong Wu, Jie Shao
    AAAI 2025   [PDF] [Project] [bib]

2024

    Link-Context Learning for Multimodal LLMs
    Yan Tai, Weichen Fan, Zhao Zhang, Feng Zhu, Rui Zhao, Ziwei Liu
    CVPR 2024   [PDF] [Code] [bib]


2023

    Described Object Detection: Liberating Object Detection with Flexible Expressions
    Chi Xie*, Zhao Zhang*, Yixuan Wu, Feng Zhu, Rui Zhao, Shuang Liang
    NeurIPS 2023   [PDF] [Code] [bib]

    Advancing Referring Expression Segmentation Beyond Single Image
    Yixuan Wu*, Zhao Zhang*, Chi Xie, Feng Zhu, Rui Zhao
    ICCV 2023   [PDF] [Code] [bib]

2022

    PAC-Net: Highlight Your Video via History Preference Modeling
    Hang Wang, Penghao Zhou, Chong Zhou, Zhao Zhang, Xing Sun
    ECCV 2022   [PDF] [bib]

    Multi-Mode Interactive Image Segmentation
    Zheng Lin, Zhao Zhang*, Ling-Hao Han, Shao-Ping Lu
    ACM MM 2022   [PDF] [Code] [Chinese version]

    KnifeCut: Refining Thin Part Segmentation with Cutting Lines
    Zheng Lin, Zheng-Peng Duan, Zhao Zhang, Chunle Guo, Ming-Ming Cheng
    ACM MM 2022 (Oral)   [PDF] [Code] [Chinese version]

    FocusCut: Diving into a Focus View in Interactive Segmentation
    Zheng Lin, Zheng-Peng Duan, Zhao Zhang, Chun-Le Guo, Ming-Ming Cheng
    CVPR 2022 (Oral)   [PDF] [Code] [Chinese version] [bib]

2020

    Interactive Image Segmentation with First Click Attention
    Zheng Lin, Zhao Zhang, Lin-Zhuo Chen, Ming-Ming Cheng, Shao-Ping Lu
    CVPR 2020   [PDF] [Code] [Project] [bib]

2018

    Low Resolution Face Recognition and Reconstruction via Deep Canonical Correlation Analysis
    Zhao Zhang, Yun-Hao Yuan, Xiao-bo Shen, Yun Li
    ICASSP 2018   [PDF] [bib]

2017

    Face Hallucination and Recognition Using Kernel Canonical Correlation Analysis
    Zhao Zhang, Yun-Hao Yuan, Yun Li, Bin Li, Ji-Peng Qiang
    ICONIP 2017 (Oral)   [PDF] [Slides] [bib]

    Supervised Deep Canonical Correlation Analysis for Multiview Feature Learning
    Yan Liu, Yun Li, Yun-Hao Yuan, Ji-Peng Qiang, Min Ruan, Zhao Zhang
    ICONIP 2017   [PDF] [bib]