Zhao Zhang 's Publications

Preprint Papers

    Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens
    Qingsong Xie, Zhao Zhang, Zhe Huang, Yanhao Zhang, Haonan Lu, Zhenyu Yang
    ArXiv   [PDF] [Project] [bib]

    Shikra: Unleashing Multimodal LLM’s Referential Dialogue Magic
    Keqin Chen, Zhao Zhang*, Weili Zeng, Richong Zhang, Feng Zhu, Rui Zhao
    arXiv   [PDF] [Code] [bib]


    Image Harmonization by Matching Regional References
    Ziyue Zhu, Zhao Zhang, Zheng Lin, Ruiqi Wu, Chunle Guo
    arXiv   [PDF] [Code] [中译版] [bib]


Journal Papers

2025

    RelationLMM: Large Multimodal Model as Open and Versatile Visual Relationship Generalist
    Chi Xie, Shuang Liang, Jie Li, Zhao Zhang, Feng Zhu, Rui Zhao
    TPAMI 2025   [Paper] [bib]

    User-Oriented Interactive Style Transfer
    Zheng Lin, Zhao Zhang, Kang-Rui Zhang, Bo Ren, Ming-Ming Cheng
    CVMJ 2025   [PDF] [Code] [中译版] [bib]

2023

    Co-Salient Object Detection with Co-Representation Purification
    Ziyue Zhu*, Zhao Zhang*, Zheng Lin, Xing Sun, Ming-Ming Cheng
    TPAMI 2023   [PDF] [Code] [中译版] [bib]

2022

    Sequential Interactive Image Segmentation
    Zheng Lin, Zhao Zhang, Zi-Yue Zhu, Deng-Ping Fan, Xia-Lei Liu
    CVMJ 2022   [PDF] [Code] [中译版]

2021

    Bilateral Attention Network for RGB-D Salient Object Detection
    Zhao Zhang, Zheng Lin, Jun Xu, Wenda Jin, Shao-Ping Lu, and Deng-Ping Fan
    TIP 2021   [PDF] [Code] [bib]

2020

    Rethinking RGB-D Salient Object Detection: Models, Datasets, and Large-Scale Benchmarks
    Deng-Ping Fan, Zheng Lin, Zhao Zhang, Menglong Zhu, Ming-Ming Cheng
    TNNLS 2020   [PDF] [Code] [Project] [bib]


Conference Papers

2025

    Decomposition of Graphic Design with Large Multimodal Model
    Hui Nie, Zhao Zhang, Yutao Cheng, Maoke Yang, Gonglei Shi, Qingsong Xie, Jie Shao, Xinglong Wu
    ICML 2025   [Repo Coming Soon]

    Graphic Design with Large Multimodal Model
    Yutao Cheng*, Zhao Zhang*, Maoke Yang*, Hui Nie, Chunyuan Li, Xinglong Wu, Jie Shao
    AAAI 2025   [PDF] [Project] [bib]

2024

    Link-Context Learning for Multimodal LLMs
    Yan Tai, Weichen Fan, Zhao Zhang, Feng Zhu, Rui Zhao, Ziwei Liu
    CVPR 2024   [PDF] [Code] [bib]


2023

    Described Object Detection: Liberating Object Detection with Flexible Expressions
    Chi Xie*, Zhao Zhang*, Yixuan Wu, Feng Zhu, Rui Zhao, Shuang Liang
    NeurlPS 2023   [PDF] [Code] [bib]

    Advancing Referring Expression Segmentation Beyond Single Image
    Yixuan Wu*, Zhao Zhang*, Chi Xie, Feng Zhu, Rui Zhao
    ICCV 2023   [PDF] [Code] [bib]

2022

    PAC-Net: Highlight Your Video via History Preference Modeling
    Hang Wang, Penghao Zhou, Chong Zhou, Zhao Zhang, Xing Sun
    ECCV 2022   [PDF] [bib]

    Multi-Mode Interactive Image Segmentation
    Zheng Lin, Zhao Zhang*, Ling-Hao Han, Shao-Ping Lu
    ACM MM 2022   [PDF] [Code] [中译版]

    KnifeCut: Refining Thin Part Segmentation with Cutting Lines
    Zheng Lin, Zheng-Peng Duan, Zhao Zhang, Chunle Guo, Ming-Ming Cheng
    ACM MM 2022 (Oral)   [PDF] [Code] [中译版]

    FocusCut: Diving into a Focus View in Interactive Segmentation
    Zheng Lin, Zheng-Peng Duan, Zhao Zhang, Chun-Le Guo, Ming-Ming Cheng
    CVPR 2022 (Oral)   [PDF] [Code] [中译版] [bib]

2020

    Interactive Image Segmentation with First Click Attention
    Zheng Lin, Zhao Zhang, Lin-Zhuo Chen, Ming-Ming Cheng, Shao-Ping Lu
    CVPR 2020   [PDF] [Code] [Project] [bib]

2018

    Low Resolution Face Recognition and Reconstruction via Deep Canonical Correlation Analysis
    Zhao Zhang Yun-Hao Yuan, Xiao-bo Shen, Yun Li
    ICASSP 2018   [PDF] [bib]

2017

    Face Hallucination and Recognition Using Kernel Canonical Correlation Analysis
    Zhao Zhang Yun-Hao Yuan, Yun Li, Bin Li, Ji-Peng Qiang
    ICONIP 2017 (Oral)   [PDF] [Slides] [bib]

    Supervised Deep Canonical Correlation Analysis for Multiview Feature Learning
    Yan Liu, Yun Li, Yun-Hao Yuan, Ji-Peng Qiang, Min Ruan, Zhao Zhang
    ICONIP 2017   [PDF] [bib]