- Lead Software Engineer at Baseten, focusing on model performance optimization
- Previously at Meituan, specialized in GPU inference using TensorFlow/TensorRT for CTR and PyTorch for LLMs. Earlier at Baidu, familiar with bRPC and Babylon
- Open Source: Team Member at LMSYS Org, core developer of SGLang, committer for FlashInfer and LMDeploy
- Check out my talk on SGLang at GPU MODE or CAMEL-AI Hackathon
- DeepSeek V3 Related: SGLang Day One Support, Latent Space Podcast, The New York Times First Article, Second Article
- Contact: [email protected] | Telegram
- More: LinkedIn | Homepage
- The best way to contact me is via the SGLang Slack. We're looking for open-source enthusiasts and learners to help grow the SGLang project and community.
- If you want to chat on Google Meet, schedule a time through my Calendly. Please include a brief self-introduction and your discussion topic on Calendly. I will decide whether to proceed based on the circumstances. Thank you for understanding.
zhyncs
Pinned
- sgl-project/sglang (Public): SGLang is a fast serving framework for large language models and vision language models.
- flashinfer-ai/flashinfer (Public): FlashInfer: Kernel Library for LLM Serving