GauCho: Gaussian Distributions with Cholesky Decomposition for Oriented Object Detection

1Stony Brook University, 2Federal University of Rio Grande do Sul
Conference on Computer Vision and Pattern Recognition (CVPR) 2025

Comparison

Image

FCOS-Baseline

Gemini

FCOS-GauCho

GPT4

FCOS-Baseline

MistralOCR

FCOS-GauCho

Abstract

Oriented Object Detection (OOD) has received increased attention in the past years, being a suitable solution for detecting elongated objects in remote sensing analysis. In particular, using regression loss functions based on Gaussian distributions has become attractive since they yield simple and differentiable terms. However, existing solutions are still based on regression heads that produce Oriented Bounding Boxes (OBBs), and the known problem of angular boundary discontinuity persists. In this work, we propose a regression head for OOD that directly produces Gaussian distributions based on the Cholesky matrix decomposition. The proposed head, named GauCho, theoretically mitigates the boundary discontinuity problem and is fully compatible with recent Gaussian-based regression loss functions. Furthermore, we advocate using Oriented Ellipses (OEs) to represent oriented objects, which relates to GauCho through a bijective function and alleviates the encoding ambiguity problem for circular objects. Our experimental results show that GauCho can be a viable alternative to the traditional OBB head, achieving results comparable to or better than state-of-the-art detectors for the challenging dataset DOTA.

Description of the image

Approach

GauCho detects oriented objects with typical representations or Oriented Ellipses. It relies on a novel regression head designed to directly predict the parameters of 2D Gaussian distributions through the Cholesky decomposition of their covariance matrices, which theoretically mitigates the boundary discontinuity problem.

Description of the image

Results

We adapted different OOD approaches to add the GauCho head: FCOS, RetinaNet, R3Det, and RoI-Transformer. Result below used ResNet-50 (R-50) backbone as default for all detectors. For all detectors, we generated results using different Gaussian-based loss functions: GWD, KLD and ProbIoU.

Description of the image

We achieved SOTA results in DOTAv1.

Description of the image

BibTeX

@InProceedings{Marques_2025_CVPR,
    author    = {Marques, José Henrique Lima and Murrugarra-Llerena, Jeffri and Jung, Claudio R.},
    title     = {GauCho: Gaussian Distributions with Cholesky Decomposition for Oriented Object Detection},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2025},
    pages     = {3593-3602}
}