DwarfStar
Small native inference engine optimized for DeepSeek V4 Flash
Open-source native inference engine optimized for DeepSeek V4 Flash, with support for DeepSeek V4 PRO on very high-memory machines. It provides model loading, prompt rendering, tool calling, KV cache handling, a server API, a CLI, GGUF/imatrix generation, and quality/speed testing.
