Projects

SeaBird: Segmentation in Bird’s View with Dice Loss Improves Monocular 3D Detection of Large Objects
Abhinav Kumar, Yuliang Guo, Xinyu Huang, Liu Ren and Xiaoming Liu

Monocular 3D detectors achieve remarkable performance on cars and smaller objects. However, their performance drops on larger objects, leading to fatal accidents. Some attribute the failures to training data scarcity or receptive field requirements of large objects. In this paper, we highlight this understudied problem of generalization to large objects ...
Continue reading
Keywords: 3D Object Detection, Image Segmentation
DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection
Abhinav Kumar, Garrick Brazil, Enrique Corona, Armin Parchami, Xiaoming Liu

Modern neural networks use building blocks such as convolutions that are equivariant to arbitrary 2D translations. However, these vanilla blocks are not equivariant to arbitrary 3D translations in the projective manifold. Even then, all monocular 3D detectors use vanilla blocks to obtain the 3D coordinates, a task for which the ...
Continue reading
Keywords: 3D Object Detection
GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection
Abhinav Kumar, Garrick Brazil, Xiaoming Liu

Modern 3D object detectors have immensely benefited from the end-to-end learning idea. However, most of them use a post-processing algorithm called Non-Maximal Suppression (NMS) only during inference. While there were attempts to include NMS in the training pipeline for tasks such as 2D object detection, they have been less widely ...
Continue reading
Keywords: 3D Object Detection

Publications

2025

CHARM3R: Towards Unseen Camera Height Robust Monocular 3D Detector
Abhinav Kumar, Yuliang Guo, Zhihao Zhang, Xinyu Huang, Liu Ren, Xiaoming Liu
In Proceeding of International Conference on Computer Vision (ICCV 2025), Honolulu, Hawaii, Oct. 2025
Bibtex

RICCARDO: Radar Hit Prediction and Convolution for Camera-Radar 3D Object Detection
Yunfei Long, Abhinav Kumar, Xiaoming Liu, Daniel Morris
In Proceeding of IEEE Computer Vision and Pattern Recognition (CVPR 2025), Nashville, TN, Jun. 2025
Bibtex | arXiv

2024

RePLAy: Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry
Shengjie Zhu*, Girish Chandar Ganesan*, Abhinav Kumar, Xiaoming Liu
In Proceeding of European Conference on Computer Vision (ECCV 2024), Milan, Italy, Oct. 2024
Bibtex | PDF | arXiv

SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects
Abhinav Kumar, Yuliang Guo, Xinyu Huang, Liu Ren, Xiaoming Liu
In Proceeding of IEEE Computer Vision and Pattern Recognition (CVPR 2024), Seattle, WA, Jun. 2024
Bibtex | PDF | arXiv | Supplemental | Project Webpage | Code

2023

PrObeD: Proactive Object Detection Wrapper
Vishal Asnani, Abhinav Kumar, Suya You, Xiaoming Liu
In Proceeding of Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023), New Orleans, LA, Dec. 2023
Bibtex | PDF | arXiv | Supplemental | Code

Tame a Wild Camera: In-the-Wild Monocular Camera Calibration
Shengjie Zhu, Abhinav Kumar, Masa Hu, Xiaoming Liu
In Proceeding of Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023), New Orleans, LA, Dec. 2023
Bibtex | arXiv | Code | Video

RADIANT: RADar Image Association NeTwork for 3D Object Detection
Yunfei Long, Abhinav Kumar, Daniel Morris, Xiaoming Liu, Marcos Paul Gerardo Castro, Punarjay Chakravarty
In Proceeding of Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI), Washington, D.C., Feb. 2023
Bibtex | PDF

2022

DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection
Abhinav Kumar, Garrick Brazil, Enrique Corona, Armin Parchami, Xiaoming Liu
In Proceeding of European Conference on Computer Vision (ECCV 2022), Tel-Aviv, Israel, Oct. 2022
Bibtex | PDF | arXiv | Supplemental | Project Webpage | Code

2021

GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection
Abhinav Kumar, Garrick Brazil, Xiaoming Liu
In Proceeding of IEEE Computer Vision and Pattern Recognition (CVPR 2021), Nashville, TN, Jun. 2021
Bibtex | PDF | arXiv | Supplemental | Project Webpage | Code | Video

2020

LUVLi Face Alignment: Estimating Landmarks’ Location, Uncertainty, and Visibility Likelihood
Abhinav Kumar*, Tim K. Marks*, Wenxuan Mou*, Ye Wang, Michael Jones, Anoop Cherian, Toshiaki Koike-Akino, Xiaoming Liu, Chen Feng
In Proceeding of IEEE Computer Vision and Pattern Recognition (CVPR 2020), Seattle, WA, Jun. 2020
Bibtex | PDF | arXiv | Supplemental | Code | Video | Dataset

2019

UGLLI Face Alignment:Estimating Uncertainty with Gaussian Log-Likelihood Loss
Abhinav Kumar*, Tim K.Marks*, Wenxuan Mou*, Chen Feng, Xiaoming Liu
In Proceeding of International Conference on Computer Vision Workshops (ICCVW 2019), Seoul, Korea, Oct. 2019 (Best Oral Presentation)
Bibtex | PDF | Poster