SGLang
A fast serving framework for large language models
An open-source serving framework and runtime for large language models and vision-language models, focused on fast inference and structured generation.

Recent stories
0 linked stories
No linked stories yet.