ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning
Development of ActiveUltraFeedback, a RLHF pipeline for Preference Data Generation utilizing Active Learning 
Selected research and engineering work across machine learning theory, large-scale model training, and uncertainty-aware modeling. The projects below combine mathematical analysis, empirical evaluation, and systems-oriented implementation.
Development of ActiveUltraFeedback, a RLHF pipeline for Preference Data Generation utilizing Active Learning 
Analyzing the mathematics of attention layers through random matrix theory and a finite-dimensional Gaussian approximation 
Custom implementation of the DeltaNet and Mamba SSM, comparing performance/throughput against a Transformer baseline, and testing distributed data parallel (DDP) schemes 
Fine-tuning a mixture-of-experts meta-model for monocular depth estimation using epistemic uncertainty estimates 