Hi! I'm a Computer Science PhD student at Columbia University, advised by Vishal Misra and Dan Rubenstein.
I study retrieval-augmented LLMs—how models use retrieved evidence to reason and act under real-world
constraints, and the retrieval systems that enable them.
Research interests: agentic LLMs, long-horizon
planning, external tool use, and interpretability.
We are building evaluation pipelines that chain open and declassified documents to estimate when sensitive facts can be inferred (“mosaic risk”). Early benchmarks and red‑team query suites are underway.
Adapting Decision Transformers for long‑horizon marketing actions under a strict global spending budget. We explore safe‑RL style preference tuning and counterfactual evaluation on logged trajectories to optimize retention per dollar.
A lightweight dashboard that streams token logits, probabilities, and entropy during inference to spot when/where models learn concepts, experience mode collapse, or forget. We study links to curriculum design and catastrophic-drift debugging.
We propose ClusterSC to mitigate noise and the curse of dimensionality in disaggregate-level synthetic control by uncovering latent donor subgroups. Results: theoretical guarantees and significant MSE improvement on synthetic and real-world datasets.
We introduced an open-ended benchmark suite for embodied agent research, built on Minecraft and backed by a web-scale knowledge base. My role: built multimodal data pipeline for Minecraft Wiki and Reddit. I am highly grateful for this early project that inspired me to pursue agentic AI research.