I am a senior staff research scientist at Google DeepMind, where I lead evaluation for Gemini / Bard. I also drove workstreams that enabled #1 on LMSYS via frontier evaluation and post-training research. Prior to Gemini, my most notable work was in infrastructure (Mesh TensorFlow, Tensor2Tensor, TensorFlow Probability, Edward), modeling (Image Transformer, Automatic Differentiation Variational Inference), a