Chief Scientist @ Alibaba DAMO Academy (达摩院) Previously Principal Research Scientist @ Google Brain (Mountain View)
I focus on Large Scale Representation Learning, Generative AI, and Reasoning. My code runs on TPU pods and Apsara clusters, optimizing the boundary between silicon and neuron.
Leading the AliceMind (Language Technology Lab) & M6 foundational model architecture. We are teaching machines to understand the nuances of humanity.
- Algorithms: Transformer, MoE (Mixture of Experts), Distributed SGD
- Frameworks: JAX, TensorFlow, PyTorch (Custom Kernels)
- Infrastructure: Borg, Kubernetes, Apsara (飞天)




