3D Object Detection

SeaBird: Segmentation in Bird’s View with Dice Loss Improves Monocular 3D Detection of Large Objects
Abhinav Kumar, Yuliang Guo, Xinyu Huang, Liu Ren and Xiaoming Liu

Monocular 3D detectors achieve remarkable performance on cars and smaller objects. However, their performance drops on larger objects, leading to fatal accidents. Some attribute the failures to training data scarcity or receptive field requirements of large objects. In this paper, we highlight this understudied problem of generalization to large objects ...
Keywords: 3D Object Detection, Image Segmentation
DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection
Abhinav Kumar, Garrick Brazil, Enrique Corona, Armin Parchami, Xiaoming Liu

Modern neural networks use building blocks such as convolutions that are equivariant to arbitrary 2D translations. However, these vanilla blocks are not equivariant to arbitrary 3D translations in the projective manifold. Even then, all monocular 3D detectors use vanilla blocks to obtain the 3D coordinates, a task for which the ...
Keywords: 3D Object Detection
Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image
Feng Liu, Xiaoming Liu

Inferring 3D locations and shapes of multiple objects from a single 2D image is a long-standing objective of computer vision. Most of the existing works either predict one of these 3D properties or focus on solving both for a single object. One fundamental challenge lies in how to learn an ...
Keywords: 3D Object Detection, 3D Shape Reconstruction, Generic Object 3D Reconstruction
GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection
Abhinav Kumar, Garrick Brazil, Xiaoming Liu

Modern 3D object detectors have immensely benefited from the end-to-end learning idea. However, most of them use a post-processing algorithm called Non-Maximal Suppression (NMS) only during inference. While there were attempts to include NMS in the training pipeline for tasks such as 2D object detection, they have been less widely ...
Keywords: 3D Object Detection
Kinematic 3D Object Detection in Monocular Video
Garrick Brazil, Gerard Pons-Moll, Xiaoming Liu, Bernt Schiele

Perceiving the physical world in 3D is fundamental for self-driving applications. Although temporal motion is an invaluable resource to human vision for detection, tracking, and depth perception, such features have not been thoroughly utilized in modern 3D object detectors. In this work, we propose a novel method for monocular video-based ...
Keywords: 3D Object Detection
M3D-RPN: Monocular 3D Region Proposal Network for Object Detection
Garrick Brazil, Xiaoming Liu

Understanding the world in 3D is a critical component of urban autonomous driving. Generally, the combination of expensive LiDAR sensors and stereo RGB imaging has been paramount for successful 3D object detection algorithms, whereas monocular image-only methods experience drastically reduced performance. We propose to reduce the gap by reformulating the ...
Keywords: 3D Object Detection

2025

CHARM3R: Towards Unseen Camera Height Robust Monocular 3D Detector
Abhinav Kumar, Yuliang Guo, Zhihao Zhang, Xinyu Huang, Liu Ren, Xiaoming Liu
In Proceedings of International Conference on Computer Vision (ICCV 2025), Honolulu, Hawaii, Oct. 2025
Bibtex

RICCARDO: Radar Hit Prediction and Convolution for Camera-Radar 3D Object Detection
Yunfei Long, Abhinav Kumar, Xiaoming Liu, Daniel Morris
In Proceedings of IEEE Computer Vision and Pattern Recognition (CVPR 2025), Nashville, TN, Jun. 2025
Bibtex | arXiv

2024

SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects
Abhinav Kumar, Yuliang Guo, Xinyu Huang, Liu Ren, Xiaoming Liu
In Proceedings of IEEE Computer Vision and Pattern Recognition (CVPR 2024), Seattle, WA, Jun. 2024
Bibtex | PDF | arXiv | Supplemental | Project Webpage | Code

2023

RADIANT: RADar Image Association NeTwork for 3D Object Detection
Yunfei Long, Abhinav Kumar, Daniel Morris, Xiaoming Liu, Marcos Paul Gerardo Castro, Punarjay Chakravarty
In Proceedings of Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023), Washington, D.C., Feb. 2023
Bibtex | PDF

2022

DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection
Abhinav Kumar, Garrick Brazil, Enrique Corona, Armin Parchami, Xiaoming Liu
In Proceedings of European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, Oct. 2022
Bibtex | PDF | arXiv | Supplemental | Project Webpage | Code

2021

Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image
Feng Liu, Xiaoming Liu
In Proceedings of Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021), Virtual, Dec. 2021
Bibtex | PDF | arXiv | Supplemental | Project Webpage | Code | Video

GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection
Abhinav Kumar, Garrick Brazil, Xiaoming Liu
In Proceedings of IEEE Computer Vision and Pattern Recognition (CVPR 2021), Nashville, TN, Jun. 2021
Bibtex | PDF | arXiv | Supplemental | Project Webpage | Code | Video

2020

Kinematic 3D Object Detection in Monocular Video
Garrick Brazil, Gerard Pons-Moll, Xiaoming Liu, Bernt Schiele
In Proceedings of European Conference on Computer Vision (ECCV 2020), Virtual, Aug. 2020
Bibtex | PDF | arXiv | Supplemental | Project Webpage | Code | Video

2019

M3D-RPN: Monocular 3D Region Proposal Network for Object Detection
Garrick Brazil, Xiaoming Liu
In Proceedings of International Conference on Computer Vision (ICCV 2019), Seoul, South Korea, Oct. 2019 (Oral presentation)
Bibtex | PDF | arXiv | Project Webpage | Code | Video