Profile

Intro

I'm a final year Ph.D. student in the Machine Learning Lab at the University of Freiburg, supervised by Prof. Frank Hutter. Prior to starting my Ph.D studies, I received my masters degree from ETH Zurich in 2021, where I worked on efficient generative models at the Computer Vision Lab, supervised by, Dr. Zhiwu Huang and Dr. Suryansh Kumar. I was awarded the prestigious ESOP scholarship for my masters studies. I received my Bachelor's degree from Vellore Institute of Technology (VIT). During my Ph.D. studies I have also interned at the Applied Sciences Group (ASG) at Microsoft Cambridge, where I worked on 2-bit quantization and finetuning, supervised by Dr. Pashmina Cameron and Dr. James Hensman.

My research focuses on automating and optimizing foundation model inference-particularly for large language models (LLMs) and vision models-to facilitate inference efficiency in their real-world applications. While training these models incurs a significant but one-time expense, the costs of inference can escalate quickly over the model's lifecycle. To address this, my work develops automated techniques for pruning, quantization, and knowledge distillation, reducing the manual effort in tuning these methods. Ultimately, my goal is to make foundation models more accessible and sustainable across diverse domains. As an open-source effort towards this goal, I develop and maintain the library whittle, with several others.

My recent research interests include the following topics:

  • Automated Foundation Model Compression
  • Efficient and Multi-Objective Neural Architecture Search




  • Recent Preprints

    (*: equal contribution)