I’m a researcher working on LLMs at Anthropic.

Before this, I worked on reinforcement learning at Google DeepMind. I co-led AlphaProof, where we got an LLM to teach itself enough math to get a silver medal at the International Mathematical Olympiad, almost cracking the IMO grand challenge. I also worked on post-training Gemini.

Before DeepMind, I worked on improving Google’s search ranking algorithm with deep learning.

I live in San Francisco. You can find me on X (Twitter), or email me at rishicomplex@gmail.com.