Yineng Zhang zhyncs

🔭 Lead Software Engineer at Baseten, focusing on model performance optimization
💼 Previously at Meituan, specialized in GPU inference using TensorFlow/TensorRT for CTR and PyTorch for LLMs. Earlier at Baidu, familiar with bRPC and Babylon
💻 Open Source: Team Member at LMSYS Org, core developer of SGLang, committer for FlashInfer and LMDeploy
👀 Check out my talk on SGLang at GPU MODE or CAMEL-AI Hackathon
🚀 DeepSeek V3 Related: SGLang Day One Support, Latent Space Podcast, The New York Times First Article, Second Article
📫 Contact: [email protected] | Telegram
📄 More: LinkedIn | Homepage
🙌 The best way to contact me is via the SGLang Slack. We're looking for open-source enthusiasts and learners to help grow the SGLang project and community.
☕ If you want to chat on Google Meet, schedule a time through my Calendly. Please include a brief self-introduction and your discussion topic on Calendly. I will decide whether to proceed based on the circumstances. Thank you for understanding.

Provide feedback