Publications

(2024). Masked AutoDecoder is Effective Multi-Task Vision Generalist. In Conference on Computer Vision and Pattern Recognition (CVPR), 2024.

PDF Cite

(2024). FreGS: 3D Gaussian Splatting with Progressive Frequency Regularization. In Conference on Computer Vision and Pattern Recognition (CVPR), 2024.

PDF Cite

(2024). Weakly Supervised Monocular 3D Detection with a Single-View Image. In Conference on Computer Vision and Pattern Recognition (CVPR), 2024.

PDF Cite

(2024). Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining. In Conference on Computer Vision and Pattern Recognition (CVPR), 2024.

PDF Cite

(2024). Mitigating object hallucinations in large vision-language models through visual contrastive decoding. In Conference on Computer Vision and Pattern Recognition (CVPR), 2024.

PDF Cite

(2024). Efficient Test-Time Adaptation of Vision-Language Models. In Conference on Computer Vision and Pattern Recognition (CVPR), 2024.

PDF Cite

(2024). Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking with Transformer. In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024.

PDF Cite

(2024). Vision-Language Models for Vision Tasks: A Survey. In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024.

PDF Cite

(2024). One-Shot Action Recognition via Multi-Scale Spatial-Temporal Skeleton Matching. In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024.

PDF Cite

(2023). Self-Supervised 3D Action Representation Learning with Skeleton Cloud Colorization. In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023.

PDF Cite

(2024). LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors. In International Conference on Learning Representations (ICLR), 2024.

PDF Cite

(2024). Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion. In International Journal of Computer Vision (IJCV), 2024.

PDF Cite

(2024). Modeling Continuous Motion for 3D Point Cloud Object Tracking. In AAAI Conference on Artificial Intelligence (AAAI), 2024.

PDF Cite

(2023). POCE: Pose-Controllable Expression Editing. In IEEE Transactions on Image Processing (TIP), 2023.

PDF Cite

(2023). Cross-Domain Facial Expression Recognition via Contrastive Warm up and Complexity-aware Self-Training. In IEEE Transactions on Image Processing (TIP), 2023.

PDF Cite

(2023). Domain Adaptive LiDAR Point Cloud Segmentation with 3D Spatial Consistency. In IEEE Transactions on Multimedia (TMM), 2023.

PDF Cite

(2023). Online Map Vectorization for Autonomous Driving: A Rasterization Perspective. In Thirty-Seventh Conference on Neural Information Processing Systems (NeurIPS), 2023.

PDF Cite

(2023). Bridging Semantic Gaps for Language-Supervised Semantic Segmentation. In Thirty-Seventh Conference on Neural Information Processing Systems (NeurIPS), 2023.

PDF Cite

(2023). 3D Open-vocabulary Segmentation with Foundation Models. In Thirty-Seventh Conference on Neural Information Processing Systems (NeurIPS), 2023.

PDF Cite Code

(2023). Pose-Free Neural Radiance Fields via Implicit Pose Regularization. In IEEE International Conference on Computer Vision (ICCV), 2023.

PDF Cite

(2023). Domain Generalization via Balancing Training Difficulty and Model Capability. In IEEE International Conference on Computer Vision (ICCV), 2023.

PDF Cite

(2023). WaveNeRF: Wavelet-based Generalizable Neural Radiance Fields. In IEEE International Conference on Computer Vision (ICCV), 2023.

PDF Cite

(2023). Black-box Unsupervised Domain Adaptation with Bi-directional Atkinson-Shiffrin Memory. In IEEE International Conference on Computer Vision (ICCV), 2023.

PDF Cite

(2023). Multimodal Image Synthesis and Editing: The Generative AI Era. In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023.

PDF Cite Code

(2023). Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations. In Pattern Recognition (PR), 2023.

PDF Cite

(2023). 3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds. In Conference on Computer Vision and Pattern Recognition (CVPR), 2023.

PDF Cite Code

(2023). KD-DLGAN: Data Limited Image Generation via Knowledge Distillation. In Conference on Computer Vision and Pattern Recognition (CVPR), 2023.

PDF Cite

(2023). StyleRF: Zero-shot 3D Style Transfer of Neural Radiance Fields. In Conference on Computer Vision and Pattern Recognition (CVPR), 2023.

PDF Cite Code

(2023). Hierarchical Mask Calibration for Unified Domain Adaptive Panoptic Segmentation. In Conference on Computer Vision and Pattern Recognition (CVPR), 2023.

PDF Cite

(2023). FAC: 3D Representation Learning via Foreground Aware Feature Contrast. In Conference on Computer Vision and Pattern Recognition (CVPR), 2023.

PDF Cite Code

(2023). Regularized Vector Quantization for Tokenized Image Synthesis. In Conference on Computer Vision and Pattern Recognition (CVPR), 2023.

PDF Cite

(2023). Backdoor Attacks Against Deep Image Compression via Adaptive Frequency Trigger. In Conference on Computer Vision and Pattern Recognition (CVPR), 2023.

PDF Cite

(2023). Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors. In Conference on Computer Vision and Pattern Recognition (CVPR), 2023.

PDF Cite Code

(2023). DA-DETR: Domain adaptive detection transformer by hybrid attention. In Conference on Computer Vision and Pattern Recognition (CVPR), 2023.

PDF Cite

(2023). I2C2W: Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition. In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023.

PDF Cite

(2023). Unsupervised Point Cloud Representation Learning with Deep Neural Networks: A Survey. In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023.

PDF Cite Code

(2022). PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds. In Thirty-Sixth Conference on Neural Information Processing Systems (NeurIPS), 2022.

PDF Cite Code

(2022). Masked Generative Adversarial Networks are Data-Efficient Generation Learners. In Thirty-Sixth Conference on Neural Information Processing Systems (NeurIPS), 2022.

PDF Cite

(2022). Meta-DETR: Few-shot object detection via unified image-level meta-learning. In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022.

PDF Cite Code

(2022). Auto-regressive Image Synthesis with Integrated Quantization. In European Conference on Computer Vision (ECCV) (Oral), 2022.

PDF Cite

(2022). Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting. In European Conference on Computer Vision (ECCV) (Oral), 2022.

PDF Cite

(2022). Domain Adaptive Video Segmentation via Temporal Pseudo Supervision. In European Conference on Computer Vision (ECCV), 2022.

PDF Cite

(2022). Bi-level feature alignment for versatile image translation and manipulation. In European Conference on Computer Vision (ECCV), 2022.

PDF Cite

(2022). Contextual Text Block Detection towards Scene Text Understanding. In European Conference on Computer Vision (ECCV), 2022.

PDF Cite Dataset

(2022). VMRF: View Matching Neural Radiance Fields. In 30th ACM International Conference on Multimedia (ACM Multimedia), 2022.

PDF Cite

(2022). Towards Counterfactual Image Manipulation via CLIP. In 30th ACM International Conference on Multimedia (ACM Multimedia), 2022.

PDF Cite

(2022). Music-to-Dance Generation with Optimal Transport. In International Joint Conference on Artificial Intelligence (IJCAI), 2022.

PDF Cite

(2022). Modulated Contrast for Versatile Image Synthesis. In Conference on Computer Vision and Pattern Recognition (CVPR), 2022.

PDF Cite Code

(2022). PTTR: Relational 3D Point Cloud Object Tracking with Transformer. In Conference on Computer Vision and Pattern Recognition (CVPR), 2022.

PDF Cite

(2022). Spectral Unsupervised Domain Adaptation for Visual Recognition. In Conference on Computer Vision and Pattern Recognition (CVPR), 2022.

PDF Cite

(2022). Category Contrast for Unsupervised Domain Adaptation in Visual Tasks. In Conference on Computer Vision and Pattern Recognition (CVPR), 2022.

PDF Cite

(2022). Marginal Contrastive Correspondence for Guided Image Generation. In Conference on Computer Vision and Pattern Recognition (CVPR) (Oral), 2022.

PDF Cite

(2022). Fourier Document Restoration for Robust Document Dewarping and Recognition. In Conference on Computer Vision and Pattern Recognition (CVPR), 2022.

PDF Cite

(2022). Unbiased Subclass Regularization for Semi-Supervised Semantic Segmentation. In Conference on Computer Vision and Pattern Recognition (CVPR), 2022.

PDF Cite Code

(2022). Accelerating DETR Convergence via Semantic-Aligned Matching. In Conference on Computer Vision and Pattern Recognition (CVPR), 2022.

PDF Cite Code

(2022). GMLight: Lighting Estimation via Geometric Distribution Approximation. In IEEE Transactions on Image Processing (TIP), 2022.

PDF Cite Code

(2022). Investigating Pose Representations and Motion Contexts Modeling for 3D Motion Prediction. In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022.

PDF Cite

(2022). Detection and Rectification of Arbitrary Shaped Scene Texts by using Text Keypoints and Links. In Pattern Recognition, 2022.

PDF Cite

(2022). Learning Disentangled Representation Implicitly via Transformer for Occluded Person Re-Identification. In IEEE Transactions on Multimedia (TMM), 2022.

PDF Cite

(2022). SynLiDAR: Learning From Synthetic LiDAR Sequential Point Cloud for Semantic Segmentation. In AAAI Conference on Artificial Intelligence (AAAI), 2022.

PDF Cite Code

(2022). GenCo: Generative Co-training on Data-Limited Image Generation. In AAAI Conference on Artificial Intelligence (AAAI), 2022.

PDF Cite

(2022). Multi-Level Adversarial Network for Domain Adaptive Semantic Segmentation. In Pattern Recognition, 2022.

PDF Cite

(2021). Model adaptation: Historical contrastive learning for unsupervised domain adaptation without source data. In Thirty-Fifth Conference on Neural Information Processing Systems (NeurIPS), 2021.

PDF Cite

(2021). Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning. In IEEE International Conference on Computer Vision (ICCV), 2021.

PDF Cite

(2021). Unsupervised domain adaptive 3d detection with multi-level consistency. In IEEE International Conference on Computer Vision (ICCV), 2021.

PDF Cite

(2021). WaveFill: A Wavelet-based Generation Network for Image Inpainting. In IEEE International Conference on Computer Vision (ICCV) (Oral), 2021.

PDF Cite

(2021). Sparse Needlets for Lighting Estimation with Spherical Transport Loss. In IEEE International Conference on Computer Vision (ICCV), 2021.

PDF Cite Code

(2021). Dual Learning Music Composition and Dance Choreography. In ACM International Conference on Multimedia (ACM MM), 2021.

PDF Cite

(2021). Domain Adaptive Video Segmentation via Temporal Consistency Regularization. In IEEE International Conference on Computer Vision (ICCV), 2021.

PDF Cite Code

(2021). Diverse Image Inpainting with Bidirectional and Autoregressive Transformers. In ACM International Conference on Multimedia (ACM MM), 2021.

PDF Cite

(2021). RDA: Robust Domain Adaptation via Fourier Adversarial Attacking. In IEEE International Conference on Computer Vision (ICCV), 2021.

PDF Cite

(2021). Unbalanced Feature Transport for Exemplar-based Image Translation. In Conference on Computer Vision and Pattern Recognition (CVPR), 2021.

PDF Cite Code

(2021). FSDR: Frequency Space Domain Randomization for Domain Generalization. In Conference on Computer Vision and Pattern Recognition (CVPR), 2021.

PDF Cite

(2021). Cross-view regularization for domain adaptive panoptic segmentation. In Conference on Computer Vision and Pattern Recognition (CVPR) (Oral), 2021.

PDF Cite Code

(2021). Uncertainty-Aware Unsupervised Domain Adaptation in Object Detection. In IEEE Transactions on Multimedia (TMM), 2021.

PDF Cite Code

(2021). Self-Guided Adaptation: Progressive Representation Alignment for Domain Adaptive Object Detection. In IEEE Transactions on Multimedia (TMM), 2021.

PDF Cite

(2021). Scale variance minimization for unsupervised domain adaptation in image segmentation. In Pattern Recognition, 2021.

PDF Cite Code

(2021). Visual Navigation With Multiple Goals Based on Deep Reinforcement Learning. In IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021.

PDF Cite

(2021). Matching on Sets: Conquer Occluded Person Re-identification Without Alignment. In AAAI Conference on Artificial Intelligence (AAAI), 2021.

PDF Cite

(2021). FPS-Net: A Convolutional Fusion Network for Large-Scale LiDAR Point Cloud Segmentation. In ISPRS Journal of Photogrammetry and Remote Sensing, 2021.

PDF Cite

(2021). EMLight: Lighting Estimation via Spherical Distribution Approximation. In AAAI Conference on Artificial Intelligence (AAAI), 2021.

PDF Cite Code

(2021). Brain MRI super-resolution using coupled-projection residual network. In Neurocomputing, 2021.

PDF Cite

(2020). Multiple Expert Brainstorming for Domain Adaptive Person Re-identification. In European Conference on Computer Vision (ECCV), 2020.

PDF Cite Code

(2020). LEED: Label-Free Expression Editing via Disentanglement. In European Conference on Computer Vision (ECCV), 2020.

PDF Cite

(2020). Contextual-Relation Consistent Domain Adaptation for Semantic Segmentation. In European Conference on Computer Vision (ECCV), 2020.

PDF Cite

(2020). Collaborative Learning of Gesture Recognition and 3D Hand Pose Estimation with Multi-Order Feature Analysis. In European Conference on Computer Vision (ECCV) (Spotlight), 2020.

PDF Cite

(2020). AMLN: Adversarial-based Mutual Learning Network for Online Knowledge Distillation. In European Conference on Computer Vision (ECCV), 2020.

PDF Cite

(2020). A Similarity Inference Metric for RGB-Infrared Cross-Modality Person Re-identification. In International Joint Conference on Artificial Intelligence (IJCAI), 2020.

PDF Cite

(2020). Synergistic 2D/3D Convolutional Neural Network for Hyperspectral Image Classification. In Remote Sensing, 2020.

PDF Cite

(2020). Suppressing Uncertainties for Large-Scale Facial Expression Recognition. In Conference on Computer Vision and Pattern Recognition (CVPR), 2020.

PDF Cite Code

(2020). Cascade EF-GAN: Progressive Facial Expression Editing with Local Focuses. In Conference on Computer Vision and Pattern Recognition (CVPR), 2020.

PDF Cite

(2020). AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-identification. In Conference on Computer Vision and Pattern Recognition (CVPR), 2020.

PDF Cite

(2020). Salient Object Detection by Fusing Local and Global Contexts. In IEEE Transactions on Multimedia (TMM), 2020.

PDF Cite

(2020). Part-aware Progressive Unsupervised Domain Adaptation for Person Re-Identification. In IEEE Transactions on Multimedia (TMM), 2020.

PDF Cite

(2019). Single-Image Dehazing via Compositional Adversarial Network. In IEEE Transactions on Cybernetics (TCYB), 2019.

PDF Cite

(2019). MSR: Multi-Scale Shape Regression for Scene Text Detection. In International Joint Conference on Artificial Intelligence (IJCAI), 2019.

PDF Cite

(2019). GA-DAN: Geometry-Aware Domain Adaptation Network for Scene Text Detection and Recognition. In IEEE International Conference on Computer Vision (ICCV), 2019.

PDF Cite

(2019). Exploring the Task Cooperation in Multi-goal Visual Navigation. In International Joint Conference on Artificial Intelligence (IJCAI), 2019.

PDF Cite

(2019). Towards Natural and Accurate Future Motion Prediction of Humans and Animals. In Conference on Computer Vision and Pattern Recognition (CVPR), 2019.

PDF Cite Code

(2019). Spatial Fusion GAN for Image Synthesis. In Conference on Computer Vision and Pattern Recognition (CVPR), 2019.

PDF Cite

(2019). ESIR: End-to-end Scene Text Recognition via Iterative Image Rectification. In Conference on Computer Vision and Pattern Recognition (CVPR), 2019.

PDF Cite

(2019). SS-HCNN: Semi-Supervised Hierarchical Convolutional Neural Network for Image Classification. In IEEE Transactions on Image Processing (TIP), 2019.

PDF Cite

(2019). CAD-Net: A Context-Aware Detection Network for Objects in Remote Sensing Imagery. In IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2019.

PDF Cite Code

(2019). A pooling based scene text proposal technique for scene text reading in the wild. In Pattern Recognition, 2019.

PDF Cite

(2019). Attention driven person re-identification. In Pattern Recognition, 2019.

PDF Cite

(2018). Superpixel Guided Deep-Sparse-Representation Learning for Hyperspectral Image Classification. In IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2018.

PDF Cite

(2018). S-CNN: Subcategory-Aware Convolutional Networks for Object Detection. In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2018.

PDF Cite

(2018). Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes. In European Conference on Computer Vision (ECCV), 2018.

PDF Cite

(2018). Accurate Scene Text Detection through Border Semantics Awareness and Bootstrapping. In European Conference on Computer Vision (ECCV), 2018.

PDF Cite

(2018). YoTube: Searching Action Proposal via Recurrent and Static Regression Networks. In IEEE Transactions on Image Processing (TIP), 2018.

PDF Cite