Projects

Selected research and engineering work across machine learning theory, large-scale model training, and uncertainty-aware modeling. The projects below combine mathematical analysis, empirical evaluation, and systems-oriented implementation.

ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning

Type: Research Project | Year: 2026 | Topics: RLHF, Active Learning, Preference Learning

Development of ActiveUltraFeedback, a RLHF pipeline for Preference Data Generation utilizing Active Learning

Paper Code Dataset

Bachelor Thesis: Exploring the Hidden Structures of Attention Layers in Transformer Models through the Lens of Gaussian Distributions (July 01, 2024)

Type: Bachelor's Thesis | Year: 2024 | Topics: Transformer Theory, Attention Mechanisms, Random Matrix Theory

Analyzing the mathematics of attention layers through random matrix theory and a finite-dimensional Gaussian approximation

Thesis

ETH Large-Scale AI Engineering 2026: Mamba, DeltaNet and Torch DDP

Type: Course Project | Year: 2026 | Topics: State Space Models, Distributed Training, Performance Engineering

Custom implementation of the DeltaNet and Mamba SSM, comparing performance/throughput against a Transformer baseline, and testing distributed data parallel (DDP) schemes

Code Report

ETH Computational Intelligence Lab 2025: Uncertainty-Aware Ensemble for Monocular Depth Estimation

Type: Course Project | Year: 2025 | Topics: Computer Vision, Uncertainty Estimation, Ensemble Modeling

Fine-tuning a mixture-of-experts meta-model for monocular depth estimation using epistemic uncertainty estimates