Selected Publications

All publications on [Google Scholar]

  • VidToMe: Video Token Merging for Zero-Shot Video Editing
    Xirui Li, Chao Ma, Xiaokang Yang, and Ming-Hsuan Yang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  • SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction
    Pin Tang, Zhongdao Wang, Guoqing Wang, Jilai Zheng, Xiangxuan Ren, Bailan Feng, and Chao Ma
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  • DiffusionTrack: Point Set Diffusion Model for Visual Object Tracking
    Fei Xie, Zhongdao Wang, and Chao Ma
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  • Single-Model and Any-Modality for Video Object Tracking
    Zongwei Wu, Jilai Zheng, Xiangxuan Ren, Florin-Alexandru Vasluianu, Chao Ma, Danda Pani Paudel, Luc Van Gool, and Radu Timofte
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  • Domain Prompt Learning with Quaternion Networks
    Qinglong Cao, Zhengqin Xu, Yuntian Chen, Chao Ma, and Xiaokang Yang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  • Monocular Identity-Conditioned Facial Reflectance Reconstruction
    Xingyu Ren, Jiankang Deng, Yuhao Cheng, Jia Guo, Chao Ma, Yichao Yan, Wenhan Zhu, and Xiaokang Yang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  • Lightning NeRF: Efficient Hybrid Scene Representation for Autonomous Driving
    Junyi Cao, Zhichao Li, Naiyan Wang, and Chao Ma
    IEEE International Conference on Robotics and Automation (ICRA), 2024
    [paper] [code]

  • Frame Fusion with Vehicle Motion Prediction for 3D Object Detection
    Xirui Li, Feng Wang, Naiyan Wang, and Chao Ma
    IEEE International Conference on Robotics and Automation (ICRA), 2024
    [paper]

  • LERE: Learning-Based Low-Rank Matrix Recovery with Rank Estimation
    Zhengqin Xu, Yulun Zhang, Chao Ma, Yichao Yan, Zelin Peng, Shoulie Xie, Shiqian Wu, and Xiaokang Yang
    AAAI Conference on Artificial Intelligence (AAAI), 2024
    [paper] [code]

  • Domain-Controlled Prompt Learning
    Qinglong Cao, Zhengqin Xu, Yuntian Chen, Chao Ma, and Xiaokang Yang
    AAAI Conference on Artificial Intelligence (AAAI), 2024
    [paper] [code]

  • Prompt Learning with Quaternion Networks
    Boya Shi, Zhengqin Xu, Shuai Jia, and Chao Ma
    International Conference on Learning Representations (ICLR), 2024
    [paper]

  • Adapting Pretrained Large-Scale Vision Models for Face Forgery Detection (Best Paper Award)
    Lantao Wang and Chao Ma
    International Conference on Multimedia Modeling (MMM), 2024
    [paper] [award]

  • ProtoTransfer: Cross-Modal Prototype Transfer for Point Cloud Segmentation
    Pin Tang, Hai-Ming Xu, and Chao Ma
    IEEE/CVF International Conference on Computer Vision (ICCV), 2023
    [paper]

  • 3D-Aware Face Swapping
    Yixuan Li, Chao Ma, Yichao Yan, Wenhan Zhu, and Xiaokang Yang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
    [paper] [code]

  • Improving Fairness in Facial Albedo Estimation via Visual-Textual Cues
    Xingyu Ren, Jiankang Deng, Chao Ma, Yichao Yan, and Xiaokang Yang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
    [paper]

  • VideoTrack: Learning to Track Objects via Video Transformer
    Fei Xie, Lei Chu, Jiahao Li, Yan Lu, and Chao Ma
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
    [paper]

  • PlenVDB: Memory Efficient VDB-Based Radiance Fields for Fast Training and Rendering
    Han Yan, Celong Liu, Chao Ma, and Xing Mei
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
    [paper] [code]

  • SmartAssign: Learning A Smart Knowledge Assignment Strategy for Deraining and Desnowing
    Yinglong Wang, Chao Ma, and Jianzhuang Liu
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
    [paper] [code]

  • UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird’s-Eye View
    Shengchao Zhou, Weizhou Liu, Chen Hu, Shuchang Zhou, and Chao Ma
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
    [paper]

  • Adv-Attribute: Inconspicuous and Transferable Adversarial Attack on Face Recognition
    Shuai Jia, Bangjie Yin*, Taiping Yao, Shouhong Ding, Chunhua Shen, Xiaokang Yang, and Chao Ma*
    Conference on Neural Information Processing Systems (NeurIPS), 2022
    [paper]

  • PillarNet: Real-Time and High-Performance Pillar-based 3D Object Detection
    Guangsheng Shi, Ruifeng Li*, and Chao Ma*
    European Conference on Computer Vision (ECCV), 2022
    [paper] [code]

  • AiATrack: Attention in Attention for Transformer Visual Tracking
    Shenyuan Gao, Chunluan Zhou, Chao Ma, Xinggang Wang, and Junsong Yuan
    European Conference on Computer Vision (ECCV), 2022
    [paper] [code]

  • LIFT: Learning 4D LiDAR Image Fusion Transformer for 3D Object Detection
    Yihan Zeng, Da Zhang, Chunwei Wang, Zhenwei Miao, Ting Liu, Xin Zhan, Dayang Hao, and Chao Ma*
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022
    [paper] [supp]

  • End-to-End Reconstruction-Classification Learning for Face Forgery Detection
    Junyi Cao, Chao Ma*, Taiping Yao, Shen Chen, Shouhong Ding, and Xiaokang Yang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022
    [paper] [supp] [code]

  • Exploring Frequency Adversarial Attacks for Face Forgery Detection
    Shuai Jia, Chao Ma*, Taiping Yao, Bangjie Yin, Shouhong Ding, and Xiaokang Yang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022
    [paper]

  • Unsupervised Sounding Object Localization with Bottom-Up and Top-Down Attention
    Jiayin Shi and Chao Ma*
    IEEE Winter Conference on Applications of Computer Vision (WACV), 2022
    [paper] [code]

  • Learning Transferable Features for Point Cloud Detection via 3D Contrastive Co-training
    Yihan Zeng, Chunwei Wang, Yunbo Wang, Hang Xu, Chaoqiang Ye, Zhen Yang, and Chao Ma*
    Conference on Neural Information Processing Systems (NeurIPS), 2021
    [paper] [supp]

  • Learning to Track Objects from Unlabeled Videos
    Jilai Zheng, Chao Ma*, Houwen Peng, and Xiaokang Yang
    IEEE/CVF International Conference on Computer Vision (ICCV), 2021
    [paper] [supplement] [code]

  • Cross-Modal 3D Object Detection and Tracking for Auto-Driving
    Yihan Zeng, Chao Ma*, Ming Zhu, Zhiming Fan, and Xiaokang Yang
    IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021
    [paper] [slide] [video]

  • PointAugmenting: Cross-Modal Augmentation for 3D Object Detection
    Chunwei Wang, Chao Ma*, Ming Zhu, and Xiaokang Yang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021
    [paper] [supp] [code]

  • IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for Visual Object Tracking
    Shuai Jia, Yibing Song, Chao Ma*, and Xiaokang Yang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021
    [paper] [code]

  • Multi-Decoding Deraining Network and Quasi-Sparsity Based Training
    Yinglong Wang, Chao Ma*, and Bing Zeng*
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021
    [paper]

  • On Perceptual Lossy Compression: The Cost of Perceptual Reconstruction and An Optimal Training Framework
    Zeyu Yan, Fei Wen, Rendong Ying, Chao Ma, and Peilin Liu
    International Conference on Machine Learning (ICML), 2021
    [paper] [code]

  • Robust Online Tracking via Contrastive Spatio-Temporal Aware Network
    Siyuan Yao, Hua Zhang, Wenqi Ren, Chao Ma, Xiaoguang Han, and Xiaochun Cao
    IEEE Transactions on Image Processing (TIP), 2021
    [paper]

  • Deep Object Tracking with Shrinkage Loss
    Xiankai Lu, Chao Ma*, Jianbing Shen, Xiaokang Yang, Ian Reid, and Ming-Hsuan Yang
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
    [paper] [code]

  • Learning Recurrent Memory Activation Networks for Visual Tracking
    Shi Pu, Yibing Song, Chao Ma, Honggang Zhang, and Ming-Hsuan Yang
    IEEE Transactions on Image Processing (TIP), 2021
    [paper]

  • Cross-Modality 3D Object Detection
    Ming Zhu, Chao Ma*, Pan Ji, and Xiaokang Yang
    IEEE Winter Conference on Applications of Computer Vision (WACV), 2021
    [paper] [code]

  • Robust Tracking against Adversarial Attacks
    Shuai Jia, Chao Ma*, Yibing Song, and Xiaokang Yang
    European Conference on Computer Vision (ECCV), 2020
    [paper] [code]

  • Semantic Equivalent Adversarial Data Augmentation for Visual Question Answering
    Ruixue Tang, Chao Ma*, Wei Emma Zhang, Qi Wu, and Xiaokang Yang
    European Conference on Computer Vision (ECCV), 2020
    [paper] [code]

  • Rethinking Image Deraining via Rain Streaks and Vapors
    Yinglong Wang, Yibing Song, Chao Ma, and Bing Zeng
    European Conference on Computer Vision (ECCV), 2020
    [paper] [code]

  • Unsupervised Deep Representation Learning for Real-Time Tracking
    Ning Wang, Wengang Zhou, Yibing Song, Chao Ma, Wei Liu, and Houqiang Li
    International Journal of Computer Vision (IJCV), 2020
    [paper] [code]

  • Real-Time Correlation Tracking Via Joint Model Compression and Transfer
    Ning Wang, Wengang Zhou, Yibing Song, Chao Ma, and Houqiang Li
    IEEE Transactions on Image Processing (TIP), 2020
    [paper] [code]

  • Semi-Supervised 3D Face Representation Learning From Unconstrained Photo Collections
    Zhongpai Gao, Juyong Zhang, Yudong Guo, Chao Ma, Guangtao Zhai, and Xiaokang Yang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshop, 2020 (Best Paper Award)
    [paper]

  • Robust Visual Tracking via Hierarchical Convolutional Features
    Chao Ma, Jia-Bin Huang, Xiaokang Yang, and Ming-Hsuan Yang
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
    [paper] [code]

  • Target-Aware Deep Tracking
    Xin Li, Chao Ma, Baoyuan Wu, Zhenyu He, and Ming-Hsuan Yang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019
    [paper] [code]

  • Unsupervised Deep Tracking
    Ning Wang, Yibing Song, Chao Ma, Wengang Zhou, Wei Liu, and Houqiang Li
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019
    [paper] [code]

  • Depth-Aware Video Frame Interpolation
    Wenbo Bao, Wei-Sheng Lai, Chao Ma, Xiaoyun Zhang, Zhiyong Gao, and Ming-Hsuan Yang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019
    [paper] [code]

  • See More, Know More: Unsupervised Video Object Segmentation with Co-Attention Siamese Networks
    Xiankai Lu, Wenguan Wang, Chao Ma, Jianbing Shen, Ling Shao, and Fatih Porikli
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019
    [paper] [code]

  • Adaptive Correlation Filters with Long-Term and Short-Term Memory for Object Tracking
    Chao Ma, Jia-Bin Huang, Xiaokang Yang, and Ming-Hsuan Yang
    International Journal of Computer Vision (IJCV), 2018
    [paper] [code]

  • Visual Question Answering with Memory Augmented Networks
    Chao Ma, Chunhua Shen, Anthony Dick, Qi Wu, Peng Wang, Anton van den Hengel, and Ian Reid
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018
    [paper]

  • Deep Attentive Tracking via Reciprocative Learning
    Shi Pu, Yibing Song, Chao Ma, Honggang Zhang, and Ming-Hsuan Yang
    Advances in Neural Information Processing Systems (NeurIPS), 2018
    [paper] [code]

  • Deep Regression Tracking with Shrinkage Loss
    Xiankai Lu, Chao Ma*, Bingbing Ni, Xiaokang Yang, Ian Reid, and Ming-Hsuan Yang (The first two authors have equal contributions)
    European Conference on Computer Vision (ECCV), 2018
    [paper] [code]

  • VITAL: VIsual Tracking via Adversarial Learning
    Yibing Song, Chao Ma*, Xiaohe Wu, Lijun Gong, Linchao Bao, Wangmeng Zuo, Chunhua Shen, Rynson Lau, and Ming-Hsuan Yang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018
    [paper] [code]

  • CREST: Convolutional RESidual Learning for Visual Tracking
    Yibing Song, Chao Ma, Lijun Gong, Jiawei Zhang, Rynson Lau, and Ming-Hsuan Yang
    IEEE/CVF International Conference on Computer Vision (ICCV), 2017
    [paper] [code]

  • Video Segmentation via Multiple Granularity Analysis
    Rui Yang, Bingbing Ni, Chao Ma, Yi Xu, and Xiaokang Yang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2017
    [paper]

  • Learning a No-reference Quality Metric for Single-Image Super-Resolution
    Chao Ma, Chih-Yuan Yang, Xiaokang Yang, and Ming-Hsuan Yang
    Computer Vision and Image Understanding (CVIU), 2017
    [paper] [code]

  • Person Re-Identification via Recurrent Feature Aggregation
    Yichao Yan, Bingbing Ni, Zhichao Song, Chao Ma, Yan Yan, and Xiaokang Yang
    European Conference on Computer Vision (ECCV), 2016
    [paper] [code]

  • When Correlation Filters Meet Convolutional Neural Networks for Visual Tracking
    Chao Ma, Yi Xu, Bingbing Ni, and Xiaokang Yang
    IEEE Signal Processing Letters (SPL), 2016
    [paper]

  • Sketch Retrieval via Local Dense Stroke Features
    Chao Ma, Xiaokang Yang, Chongyang Zhang, Xiang Ruan, and Ming-Hsuan Yang
    Image and Vision Computing (IVC), 2016
    [paper]

  • Hierarchical Convolutional Features for Visual Tracking
    Chao Ma, Jia-Bin Huang, Xiaokang Yang, and Ming-Hsuan Yang
    IEEE/CVF International Conference on Computer Vision (ICCV), 2015
    [paper] [code]

  • Long-term Correlation Tracking
    Chao Ma, Xiaokang Yang, Chongyang Zhang, and Ming-Hsuan Yang
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2015
    [paper] [code]

  • Learning A Temporally Invariant Representation for Visual Tracking (Top 10% Paper)
    Chao Ma, Xiaokang Yang, Chongyang Zhang, and Ming-Hsuan Yang
    IEEE International Conference on Image Processing (ICIP), 2015
    [paper] [code]

  • Single Image Super-Resolution: A Benchmark
    Chih-Yuan Yang, Chao Ma, and Ming-Hsuan Yang
    European Conference on Computer Vision (ECCV), 2014
    [paper] [code]

  • Sketch Retrieval via Dense Stroke Features
    Chao Ma, Xiaokang Yang, Chongyang Zhang, Xiang Ruan, and Ming-Hsuan Yang
    British Machine Vision Conference (BMVC), 2013
    [paper]