I am an ELLIS PhD student supervised by Prof. Iryna Gurevych at the UKP Lab, TU Darmstadt and co-supervised by Prof. Amartya Sanyal at the University of Copenhagen. Previously, I spent two great years in IIIT Hyderabad, India with Prof. Ponnurangam Kumaraguru.

I am broadly interested in the privacy and safety of large language models. I currently work on contextual privacy failures during fine-tuning, auditing unlearning in representations, differentially private alignment, and evaluation for multi-agent systems. Outside of research, I enjoy cricket and travelling.


Selected Papers

Privacy Collapse: Benign Fine-Tuning Can Break Contextual Privacy in Language Models
MASEval: A Framework-Agnostic Evaluation Library for Multi-Agent Systems
Responsible Evaluation of AI for Mental Health
Auditing Language Model Unlearning via Information Decomposition
Differentially Private Steering for Large Language Model Alignment
Socratic Reasoning Improves Positive Text Rewriting
From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences
An Unsupervised, Geometric and Syntax-aware Quantification of Polysemy
SyMCoM - Syntactic Measure of Code Mixing A Study Of English-Hindi Code-Mixing
HLDC: Hindi Legal Documents Corpus

Invited Talks

2026: Guest Lecture in the RSAI course at IIITH

2025: Invited talk at Dagstuhl Seminar

2025: Guest Lecture in the DL4NLP Course at TU Darmstadt


Media Coverage

Jan 2026: Privacy Collapse was listed in the top exciting papers by AI World.

Oct 2025: Our work on safety in Indian legal data was covered by The Hindu.


Reviewing

ACL Rolling Review (ACL, EMNLP, NAACL, EACL), AAAI 2025, NeurIPS 2025, ICML MUGen workshop, WiNLP workshop, LLMSec workshop, CLPsych workshop


Student Advising

  • Are You Sure?: Uncertainty Estimation in LLM Judges - Patrick Gantner (MSc)
  • Investigating Privacy Leakage and its Mitigation in Activation Editing of LLMs - Michail Moroz (MSc)
  • Enhancing LLM Reasoning Capabilities on Therapeutic Interventions - Alicia Gleichmann (BSc)