Hi, I'm a 5th-year Computational Linguistics and Computer Science dual degree student at IIIT Hyderabad. I work on mechanistic interpretability of language models from a linguistics lens at the Language Technologies Research Centre. Currently I'm working as an intern at the data integration team of Qube Research & Technologies, a quantitative hedge fund.

std::multiset

Using C++ multisets for logarithmic search and removal.

Investigated how multilingual transformers (XLM-RoBERTa, mBERT, IndicBERT) internally encode grammatical information from the Paninian framework across seven Indian languages. Used diagnostic probing to analyze layer-wise encoding patterns, finding syntactic dependencies peak in middle-upper layers (6-9) while lexical features emerge earlier.

Explored multi-hop QA using teacher-student distillation for reasoning chain transfer. Built a graph traversal RAG pipeline that achieved 15% accuracy boost by teaching smaller models logical paths.

Experimented with Racket-based tensor DSL, reimagining ML ops through Lisp's compositional philosophy. Built computation graph auto-differentiation from scratch using functional paradigms.

Nanopass compiler in Racket with graph-coloring register allocation. Implemented tuple support and garbage collection using interference graphs for optimal register usage.