ETH Computational Intelligence Lab 2025: Uncertainty-Aware Ensemble for Monocular Depth Estimation 2

In this work, we explored whether the per- formance of the state-of-the-art models on the Monocular Depth Estimation task can be im- proved by averaging multiple models’ predictions based on the model’s uncertainty. To verify this assumption, we fine-tuned a Dense Prediction Transformer model modified to predict uncer- tainty using Gaussian Negative Log Likelihood loss. Then, we trained multiple experts by further fine-tuning the aforementioned model for differ- ent room types. To combine their predictions, we trained a separate meta-model that takes the mod- els’ uncertainty into account. We showed that this approach decreases siRMSE compared to the performance of the base model. Additionally, we identified that this approach is sensitive to hyper- parameters and can suffer from mode collapse. Finally, our results show that the meta-model reg- ularly combines the ensemble’s individual predic- tions in a manner consistent with the intuition of the expert’s room-type domain knowledge and generalizes to ambiguous images.

The GitHub repository is at Computational Intelligence Lab 2025 Project Repository and the corresponding report at Computational Intelligence Lab 2025 Project Report

Share on

Bluesky Facebook LinkedIn X (formerly Twitter)

Martin Wertich

Share on