Publications

2024

  1. ANAH: Analytical Annotation of Hallucinations in Large Language Models
    Ziwei Ji, Yuzhe Gu, Wenwei Zhang, and 3 more authors
    In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL) 2024
  2. T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step
    Zehui Chen, Weihua Du, Wenwei Zhang, and 8 more authors
    In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL) 2024
  3. Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
    Zehui Chen, Kuikun Liu, Qiuchen Wang, and 5 more authors
    In Findings of the Association for Computational Linguistics: ACL 2024
  4. LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
    Xi Chen, Songyang Zhang, Qibing Bai, and 2 more authors
    In Findings of the Association for Computational Linguistics: ACL 2024
  5. MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark
    Hongwei Liu, Zilong Zheng, Yuxuan Qiao, and 7 more authors
    In Findings of the Association for Computational Linguistics: ACL 2024
  6. Differential Model Scaling using Differential Topk
    Kai Liu, Ruohui Wang, Jianfei Gao, and 1 more author
    In International Conference on Machine Learning (ICML) 2024
  7. Can AI Assistants Know What They Don’t Know?
    Qinyuan Cheng, Tianxiang Sun, Xiangyang Liu, and 6 more authors
    In International Conference on Machine Learning (ICML) 2024
  8. Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks
    Chonghua Wang, Haodong Duan, Songyang Zhang, and 2 more authors
    In NAACL 2024
  9. BotChat: Evaluating LLMs’ Capabilities of Having Multi-Turn Dialogues
    Haodong Duan, Jueqi Wei, Chonghua Wang, and 5 more authors
    In NAACL findings 2024
  10. PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models
    Yiming Zhang, Zhening Xing, Yanhong Zeng, and 2 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024
  11. Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text
    Junshu Tang, Yanhong Zeng, Ke Fan, and 4 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024
  12. EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
    Tai Wang, Xiaohan Mao, Chenming Zhu, and 11 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024
  13. OMG-Seg: Is One Model Good Enough For All Segmentation?
    Xiangtai Li, Haobo Yuan, Wei Li, and 6 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024
  14. Towards language-driven video inpainting via multimodal large language models
    Jianzong Wu, Xiangtai Li, Chenyang Si, and 8 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024
  15. From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models
    Rongjie Li, Songyang Zhang, Dahua Lin, and 2 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024
  16. RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation
    Peng Lu, Tao Jiang, Yining Li, and 3 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024

2023

  1. Segment Any Point Cloud Sequences by Distilling Vision Foundation Models
    Youquan Liu, Lingdong Kong, Jun Cen, and 5 more authors
    Advances in Neural Information Processing Systems (NeurIPS) 2023
  2. Improving Pixel-based MIM by Reducing Wasted Modeling Capability
    Yuan Liu, Songyang Zhang, Jiacheng Chen, and 3 more authors
    In Proceedings of the IEEE International Conference on Computer Vision (ICCV) 2023
  3. Robo3d: Towards robust and reliable 3d perception against corruptions
    Lingdong Kong, Youquan Liu, Xin Li, and 6 more authors
    In Proceedings of the IEEE International Conference on Computer Vision (ICCV) 2023
  4. Multimodal-gpt: A vision and language model for dialogue with humans
    Tao Gong, Chengqi Lyu, Shilong Zhang, and 7 more authors
    arXiv preprint arXiv:2305.04790 2023
  5. TG-VQA: Ternary Game of Video Question Answering
    Hao Li, Peng Jin, Zesen Cheng, and 5 more authors
    In International Joint Conference on Artificial Intelligence (IJCAI) 2023
  6. RIFormer: Keep Your Vision Backbone Effective But Removing Token Mixer
    Jiahao Wang, Songyang Zhang, Yong Liu, and 6 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023
  7. Dense Distinct Query for End-to-End Object Detection
    Shilong Zhang, Jiaqi Wang, Jiangmiao Pang, and 4 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023
  8. Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object Detection
    Xinjiang Wang, Xingyi Yang, Shilong Zhang, and 6 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023
  9. StructToken: Rethinking Semantic Segmentation with Structural Prior
    Fangjian Lin, Zhanhao Liang, Sitong Wu, and 3 more authors
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) 2023
  10. Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation
    Zhao Yang, Jiaqi Wang, Yansong Tang, and 3 more authors
    Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 2023
  11. Boosting Point Clouds Rendering via Radiance Mapping
    Xiaoyang Huang, Yi Zhang, Bingbing Ni, and 3 more authors
    Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 2023

2022

  1. RTMDet: An Empirical Study of Designing Real-Time Object Detectors
    Chengqi Lyu, Wenwei Zhang, Haian Huang, and 5 more authors
    arXiv preprint arXiv:2212.07784 2022
  2. Deliberated Domain Bridging for Domain Adaptive Semantic Segmentation
    Lin Chen, Zhixiang Wei, Xin Jin, and 4 more authors
    Advances in Neural Information Processing Systems (NeurIPS) 2022
  3. Group R-CNN for Weakly Semi-supervised Object Detection with Points
    Shilong Zhang, Zhuoran Yu, Liyang Liu, and 3 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022
  4. TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognition
    Haodong Duan, Nanxuan Zhao, Kai Chen, and 1 more author
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022
  5. Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation
    Xiangtai Li, Wenwei Zhang, Jiangmiao Pang, and 4 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022
  6. Dense Siamese Network
    Wenwei Zhang, Jiangmiao Pang, Kai Chen, and 1 more author
    European Conference on Computer Vision (ECCV) 2022
  7. Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks
    Haodong Duan, Yue Zhao, Kai Chen, and 2 more authors
    In European Conference on Computer Vision (ECCV) Workshop 2022
  8. OCSampler: Compressing Videos to One Clip with Single-step Sampling
    Jintao Lin, Haodong Duan, Kai Chen, and 2 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022
  9. LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
    Zhao Yang, Jiaqi Wang, Yansong Tang, and 3 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022
  10. Pyskl: Towards good practices for skeleton action recognition
    Haodong Duan, Jiaqi Wang, Kai Chen, and 1 more author
    In Proceedings of the 30th ACM International Conference on Multimedia 2022
  11. MMRotate: A Rotated Object Detection Benchmark Using PyTorch
    Yue Zhou, Xue Yang, Gefan Zhang, and 9 more authors
    In Proceedings of the 30th ACM International Conference on Multimedia 2022
  12. Revisiting Skeleton-based Action Recognition
    Haodong Duan, Yue Zhao, Kai Chen, and 2 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022

2021

  1. Few-Shot Object Detection via Association and DIscrimination
    Yuhang Cao, Jiaqi Wang, Ying Jin, and 4 more authors
    Advances in Neural Information Processing Systems (NeurIPS) 2021
  2. MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding
    Zhanghui Kuang, Hongbin Sun, Zhizhong Li, and 10 more authors
    In Proceedings of the 29th ACM International Conference on Multimedia 2021
  3. K-Net: Towards Unified Image Segmentation
    Wenwei Zhang, Jiangmiao Pang, Kai Chen, and 1 more author
    Advances in Neural Information Processing Systems (NeurIPS) 2021
  4. Temporal ROI Align for Video Object Recognition
    Tao Gong, Kai Chen, Xinjiang Wang, and 5 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 2021
  5. Towards Balanced Learning for Instance Recognition
    Jiangmiao Pang, Kai Chen, Qi Li, and 5 more authors
    International Journal of Computer Vision (IJCV) 2021
  6. CARAFE++: Unified Content-Aware ReAssembly of FEatures
    Jiaqi Wang, Kai Chen, Rui Xu, and 3 more authors
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2021

2020

  1. Seesaw Loss for Long-Tailed Instance Segmentation
    Jiaqi Wang, Wenwei Zhang, Yuhang Zang, and 7 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020
  2. Positional Encoding as Spatial Inductive Bias in GANs
    Rui Xu, Xintao Wang, Kai Chen, and 2 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020
  3. Feature Pyramid Grids
    Kai Chen, Yuhang Cao, Chen Change Loy, and 2 more authors
    arXiv preprint arXiv:2004.03580 2020
  4. Side-Aware Boundary Localization for More Precise Object Detection
    Jiaqi Wang, Wenwei Zhang, Yuhang Cao, and 6 more authors
    In European Conference on Computer Vision (ECCV) 2020
  5. Prime sample attention in object detection
    Yuhang Cao, Kai Chen, Chen Change Loy, and 1 more author
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020

2019

  1. Libra R-CNN: Towards Balanced Learning for Object Detection
    Jiangmiao Pang, Kai Chen, Jianping Shi, and 3 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019
  2. Region Proposal by Guided Anchoring
    Jiaqi Wang*, Kai Chen*, Shuo Yang, and 2 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019
  3. Hybrid task cascade for instance segmentation
    Kai Chen, Jiangmiao Pang, Jiaqi Wang, and 9 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019
  4. MMDetection: Open MMLab Detection Toolbox and Benchmark
    Kai Chen, Jiaqi Wang*, Jiangmiao Pang*, and 22 more authors
    arXiv preprint arXiv:1906.07155 2019
  5. CARAFE: Content-Aware ReAssembly of FEatures
    Jiaqi Wang, Kai Chen, Rui Xu, and 3 more authors
    In Proceedings of the IEEE International Conference on Computer Vision (ICCV) 2019

2018

  1. Optimizing Video Object Detection via a Scale-Time Lattice
    Kai Chen, Jiaqi Wang, Shuo Yang, and 4 more authors
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2018

2017

  1. Video Object Segmentation with Re-identification
    Xiaoxiao Li, Yuankai Qi, Zhe Wang, and 6 more authors
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshop 2017
  2. Discover and Learn New Objects from Documentaries
    Kai Chen, Hang Song, Chen Change Loy, and 1 more author
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017