ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning
Development of ActiveUltraFeedback, a RLHF pipeline for Preference Data Generation utilizing Active Learning 
Development of ActiveUltraFeedback, a RLHF pipeline for Preference Data Generation utilizing Active Learning 
Analyzing the mathematics of attention layers through random matrix theory and a finite-dimensional Gaussian approximation 
Custom implementation of the DeltaNet and Mamba SSM, comparing performance/throughput against a Transformer baseline, and testing distributed data parallel (DDP) schemes 
Fine-tuning a mixture-of-experts meta-model for monocular depth estimation using epistemic uncertainty estimates 
Published in International Conference on Machine Learning (ICML), 2026
* Equal contribution
ACTIVEULTRAFEEDBACK introduces a modular active learning pipeline that uses uncertainty-aware reward estimates to select informative response pairs, reducing the amount of preference data needed for strong downstream performance.
Published:
This is a description of your talk, which is a markdown file that can be all markdown-ified like any other post. Yay markdown!
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.