2026

  1. DistDF: Time-Series Forecasting Needs Joint-Distribution Wasserstein Alignment
    Hao Wang, Licheng Pan, Yuan Lu, Zhixuan Chu, Xiaoxi Li, Shuting He, Zhichao Chen, Haoxuan Li, Qingsong Wen and Zhouchen Lin
    International Conference on Learning Representations (ICLR), 2026
  2. Quadratic Direct Forecast for Training Multi-Step Time-Series Forecast Models
    Hao Wang, Licheng Pan, Yuan Lu, Zhichao Chen, Tianqiao Liu, Shuting He, Zhixuan Chu, Qingsong Wen, Haoxuan Li and Zhouchen Lin
    International Conference on Learning Representations (ICLR), 2026
  3. Monocular Normal Estimation via Shading Sequence Estimation
    Zhen Li, Xiaotian Ma, Minghua Hu, Yuqian Zhao, Yinqiang Yu, Qijian Zheng, Chang Liu, Xudong Jiang and Song Bai
    International Conference on Learning Representations (ICLR), 2026
    Oral, Acceptance Rate 1.1%, Corresponding author
  4. FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting
    Yitong Yang, Yinglin Wang, Changshuo Wang, Huajie Wang and Shuting He
    AAAI Conference on Artificial Intelligence (AAAI), 2026
    Corresponding author
  5. SplitFlux: Learning to Decouple Content and Style from a Single Image
    Yitong Yang, Yinglin Wang, Changshuo Wang, Yongjun Zhang, Ziyang Chen and Shuting He
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2026
    Corresponding author
  6. GREx: Generalized Referring Expression Segmentation, Comprehension, and Generation
    Henghui Ding, Chang Liu, Shuting He, Xudong Jiang and Yu-Gang Jiang
    International Journal of Computer Vision (IJCV), 2026
    Corresponding author

2025

  1. MeViS: A Multi-Modal Dataset for Referring Motion Expression Video Segmentation
    Henghui Ding, Chang Liu, Shuting He, Kaining Ying, Xudong Jiang, Chen Change Loy and Yu-Gang Jiang
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
    Corresponding author
  2. ReferSplat: Referring Segmentation in 3D Gaussian Splatting
    Shuting He, Guangquan Jie, Changshuo Wang, Yun Zhou, Shuming Hu, Guanbin Li and Henghui Ding
    International Conference on Machine Learning (ICML), 2025
    Oral, Acceptance Rate 1.0%
  3. SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation
    Shiqi Huang, Shuting He, Huaiyuan Qin and Bihan Wen
    IEEE International Conference on Computer Vision (ICCV), 2025
    Highlight, Acceptance Rate 5.0%
  4. ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation
    Shiqi Huang, Shuting He and Bihan Wen
    AAAI Conference on Artificial Intelligence (AAAI), 2025
  5. GroundFlow: A Plug-in Module for Temporal Reasoning on 3D Point Cloud Sequential Grounding
    Zijun Lin, Shuting He, Cheston Tan and Bihan Wen
    IEEE International Conference on Computer Vision (ICCV), 2025
  6. HRSeg: High-Resolution Visual Perception and Enhancement for Reasoning Segmentation
    Weihuang Lin, Yiwei Ma, Xiaoshuai Sun, Shuting He, Jiayi Ji, Liujuan Cao and Rongrong Ji
    ACM International Conference on Multimedia (ACM MM), 2025
  7. GlFoMR: A Glance-then-Focus Multimodal Reasoning Framework for Diagram Question Answering Number
    Yaxian Wang, Bifan Wei, Jun Liu, Lingling Zhang, Shuting He, Jun Li and Qika Lin
    International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
  8. Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension
    Yaxian Wang, Henghui Ding, Shuting He, Xudong Jiang, Bifan Wei and Jun Liu
    AAAI Conference on Artificial Intelligence (AAAI), 2025
  9. Iterative Missing Data Imputation with Model Form Adaptation and Non-Missing Feature Supervision
    Hao Wang, Zhengnan Li, Zhichao Chen, Xu Chen, Shuting He, Guangyi Liu, Haoxuan Li and Zhouchen Lin
    Annual Conference on Neural Information Processing Systems (NeurIPS), 2025
  10. Looking Clearer with Text: A Hierarchical Context Blending Network for Occluded Person Re-Identification
    Changshuo Wang, Shuting He, Meiqing Wu, Siew-Kei Lam, Prayag Tiwari and Xingyu Gao
    IEEE Transactions on Information Forensics and Security (TIFS), 2025
  11. Point Clouds Meets Physics: Dynamic Acoustic Field Fitting Network for Point Cloud Understanding
    Changshuo Wang, Shuting He, Xiang Fang, Jiawei Han, Zhonghang Liu, Xin Ning, Weijun Li and Prayag Tiwari
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025
  12. Reasoning Beyond Points: A Visual Introspective Approach for Few-Shot 3D Segmentation
    Changshuo Wang, Shuting He, Xiang Fang, Zhijian Hu, Jia-Hong Huang, Yixian Shen and Prayag Tiwari
    Annual Conference on Neural Information Processing Systems (NeurIPS), 2025
  13. Seeing the Overlooked: Bio-Visual Inspired Weak Saliency Feedback Transformer for Person Re-identification
    Changshuo Wang, Shuting He, Xiang Fang, Fangzhe Nan and Prayag Tiwari
    ACM International Conference on Multimedia (ACM MM), 2025
  14. Taylor Series-Inspired Local Structure Fitting Network for Few-shot Point Cloud Semantic Segmentation
    Changshuo Wang, Shuting He, Xiang Fang, Meiqing Wu, Siew Kei Lam and Prayag Tiwari
    AAAI Conference on Artificial Intelligence (AAAI), 2025
  15. Prompt-Softbox-Prompt: A Free-Text Embedding Control for Image Editing
    Yitong Yang, Yinglin Wang, Tian Zhang, Jing Wang and Shuting He
    ACM International Conference on Multimedia (ACM MM), 2025
    Corresponding author

2024

  1. Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
    Shuting He and Henghui Ding
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
  2. Region Generation and Assessment Network for Occluded Person Re-Identification
    Shuting He, Weihua Chen, Kai Wang, Hao Luo, Fan Wang, Wei Jiang and Henghui Ding
    IEEE Transactions on Information Forensics and Security (TIFS), 2024
  3. RefMask3D: Language-Guided Transformer for 3D Referring Segmentation
    Shuting He and Henghui Ding
    ACM International Conference on Multimedia (ACM MM), 2024
  4. SegPoint: Segment Any Point Cloud via Large Language Model
    Shuting He and Henghui Ding
    European Conference on Computer Vision (ECCV), 2024
  5. TIP
    VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search
    Shuting He, Hao Luo, Wei Jiang, Xudong Jiang and Henghui Ding
    IEEE Transactions on Image Processing (TIP), 2024
  6. Dual-head Genre-instance Transformer Network for Arbitrary Style Transfer
    Meichen Liu, Shuting He, Songnan Lin and Bihan Wen
    ACM International Conference on Multimedia (ACM MM), 2024
  7. Referring Image Editing: Object-level Image Editing via Referring Expressions
    Chang Liu, Xiangtai Li and Henghui Ding
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
  8. Context-Aware Integration of Language and Visual References for Natural Language Tracking
    Yanyan Shao, Shuting He, Qi Ye, Yuchao Feng, Wenhan Luo and Jiming Chen
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024

2023

  1. MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
    Henghui Ding, Chang Liu, Shuting He, Xudong Jiang and Chen Change Loy
    IEEE International Conference on Computer Vision (ICCV), 2023
  2. MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
    Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Philip H. S. Torr and Song Bai
    IEEE International Conference on Computer Vision (ICCV), 2023
  3. VLT: Vision-language Transformer and Query Generation for Referring Segmentation
    Henghui Ding, Chang Liu, Suchen Wang and Xudong Jiang
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
  4. Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation
    Shuting He, Henghui Ding and Wei Jiang
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
  5. TIP
    Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation
    Shuting He, Xudong Jiang, Wei Jiang and Henghui Ding
    IEEE Transactions on Image Processing (TIP), 2023
  6. Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation
    Shuting He, Henghui Ding and Wei Jiang
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
  7. GRES: Generalized Referring Expression Segmentation
    Chang Liu, Henghui Ding and Xudong Jiang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
  8. TIP
    Multi-modal Mutual Attention and Iterative Interaction for Referring Image Segmentation
    Chang Liu, Henghui Ding, Yulun Zhang and Xudong Jiang
    IEEE Transactions on Image Processing (TIP), 2023

2022

  1. TIP
    Deep Interactive Image Matting with Feature Propagation
    Henghui Ding, Hui Zhang, Chang Liu and Xudong Jiang
    IEEE Transactions on Image Processing (TIP), 2022
  2. TMM
    Instance-specific Feature Propagation for Referring Segmentation
    Chang Liu, Xudong Jiang and Henghui Ding
    IEEE Transactions on Multimedia (TMM), 2022

2021

  1. Vision-language Transformer and Query Generation for Referring Segmentation
    Henghui Ding, Chang Liu, Suchen Wang and Xudong Jiang
    IEEE/CVF International Conference on Computer Vision (ICCV), 2021
  2. TransReID: Transformer-based Object Re-Identification
    Shuting He, Hao Luo, Pichao Wang, Fan Wang, Hao Li and Wei Jiang
    IEEE International Conference on Computer Vision (ICCV), 2021