⚙️
LLM Inference & Serving
115 tools
Inference runtimes, model serving platforms, fine-tuning infra, and GPU/accelerator providers for LLMs.
OpenRouter
OpenRouter, Inc.
The unified interface for AI models.
44 stories
vLLM
vLLM Project
Easy, fast, and cheap LLM serving
26 stories
SGLang
LMSYS Corp.
A fast serving framework for large language models
19 stories
Ollama
Ollama Inc.
Get up and running with large language models locally
14 stories
AI Studio
Google
Build with Gemini in AI Studio
13 stories
Amazon Bedrock
Amazon Web Services
Build and scale generative AI applications with foundation models.
5 stories
Grok Build
xAI
Build with Grok
5 stories
OpenAI Platform
OpenAI
Build with OpenAI APIs and tools.
5 stories
llama.cpp
Georgi Gerganov
C/C++ LLM inference for local execution.
4 stories
Together AI
Together AI
The AI Acceleration Cloud
4 stories
Unsloth
Unsloth AI
Fast, memory-efficient LLM fine-tuning
4 stories
Claude
Anthropic
The AI assistant from Anthropic
3 stories
fal
fal
Serverless AI inference platform
3 stories
OpenAI API
OpenAI
Build with OpenAI models via API.
3 stories
Baseten
Baseten
Deploy and serve AI models
2 stories
DFlash
Z Lab
DFlash
2 stories
Hugging Face Hub
Hugging Face
The platform for models, datasets, and Spaces.
2 stories
Nous Portal
Nous Research
AI portal from Nous Research.
2 stories
NVIDIA NIM
NVIDIA
Inference microservices for AI models
2 stories
Baidu Qianfan
Baidu
Baidu's large-model development platform
1 story
Claude Console
Anthropic
Claude API console
1 story
Claude Platform on AWS
Anthropic
Claude on AWS
1 story
Decoupled DiLoCo
Google DeepMind
Decentralized, communication-efficient model training.
1 story
Diffusers
Hugging Face
The go-to library for state-of-the-art diffusion models.
1 story
Factory Router
Factory
AI routing service
1 story
FlashQLA
Alibaba Cloud
FlashQLA
1 story
Gemini Live API
Google
Build live, multimodal experiences with Gemini.
1 story
Google Cloud
Google
Cloud computing for building, deploying, and scaling applications on Google infrastructure.
1 story
llama.cpp
ggml.ai
Inference of LLMs in C/C++
1 story
LM Studio
Element Labs, Inc.
Discover, download, and run local LLMs.
1 story
Miles RL Training
RadixArk
RL training software
1 story
NVIDIA RTX Spark
NVIDIA
NVIDIA RTX Spark
1 story
OpenAI Guaranteed Capacity
OpenAI
Guaranteed access to OpenAI capacity
1 story
Prime Intellect
Prime Intellect
Decentralized AI infrastructure
1 story
Tile Kernels
Hangzhou DeepSeek Artificial Intelligence Co., Ltd.
Tile-based GPU kernels from DeepSeek.
1 story
Zyphra Cloud
Zyphra Technologies Inc.
Zyphra Cloud
1 story
Zyphra Inference
Zyphra
Zyphra's hosted inference service
1 story
AFM Playground
Arcee AI
Playground for AFM
0 stories
AI Gateway
Vercel
Gateway to AI models
0 stories
AI/ML API
AI/ML API
Unified API for AI models
0 stories
Arcee
Arcee AI
Enterprise AI platform for language-model workflows
0 stories
Baidu AI Studio LLM API
Baidu
API service for Baidu AI Studio
0 stories
BytePlus
BytePlus Pte. Ltd.
Enterprise technology platform
0 stories
Cerebras
Cerebras Systems
Hosted AI inference platform
0 stories
Claude Code Router
musistudio
Open-source Claude Code router
0 stories
CloudMatrix-Infer
Huawei Cloud
AI inference service
0 stories
Coding Agents
Baseten
Build and run coding agents on Baseten
0 stories
Conway
Conway Research
Conway by Conway Research
0 stories
Core AI PyTorch Extensions
Core AI
PyTorch extensions from Core AI.
0 stories
DeepClaude
Asterisk
DeepSeek + Claude
0 stories
DeepEP
DeepSeek
Communication library for expert parallelism.
0 stories
DeepGEMM
DeepSeek
Efficient FP8 GEMM library
0 stories
DeepSeek-Reasonix
DeepSeek
Unverified target name
0 stories
DGX Spark
NVIDIA
Personal AI supercomputer
0 stories
DwarfStar
antirez
Antirez-associated software project
0 stories
Exo
Exo Labs
Distributed inference across devices.
0 stories
Fireworks AI
Fireworks.ai, Inc.
AI inference and model deployment platform
0 stories
FlashLib
FlashML-org
FlashLib
0 stories
FlashMLA
DeepSeek
Fast MLA decoding
0 stories
ForgeTrain
OpenBMB
Training tool for large-language-model workflows
0 stories
Gemini Omni
Google
Google's AI assistant product context
0 stories
Google AI Edge Gallery
Google LLC
Explore and run AI models on-device.
0 stories
GPT4All
Nomic AI
Run large language models locally on your machine
0 stories
Groq
Groq
Fast AI inference
0 stories
gstack
gstack
Software product
0 stories
GuideLLM
Red Hat
Benchmarking tool for LLM serving systems
0 stories
Hugging Face
Hugging Face
The AI community building the future.
0 stories
Interfaze
JigsawStack, Inc.
Interfaze by JigsawStack
0 stories
LangSmith LLM Gateway
LangChain
LLM gateway for LangSmith
0 stories
Lightning AI
Lightning AI, Inc.
Build, train, and deploy AI products in the cloud.
0 stories
LiteLLM
BerriAI
Open-source LLM gateway
0 stories
llmster
Element Labs, Inc.
llmster
0 stories
Meta CLI
Meta
Meta CLI for developer workflows
0 stories
Mirage
Crisp
AI for customer support
0 stories
Mistral Studio
Mistral AI
Studio for building and managing AI workflows and agents.
0 stories
ModellixAI
ModellixAI
Provisional catalog entry for the ModellixAI product name.
0 stories
ModelScope
Alibaba Cloud
Open-source model community and platform
0 stories
Modular
Modular
AI platform
0 stories
Mooncake
KVCache.ai
KV-cache software for LLM serving
0 stories
Multimodal Max
Modular
Multimodal AI service
0 stories
NeMo-RL
NVIDIA
Reinforcement learning for LLM post-training
0 stories
Novita Sandbox
Novita AI
Novita AI sandbox for testing AI services.
0 stories
NVIDIA DGX 8xB200
NVIDIA
8x B200 GPU DGX system
0 stories
Open Generative AI
Muapi
Generative AI software product
0 stories
Open Responses
Open Responses
Open Responses
0 stories
OpenAI API
OpenAI
Developer API for OpenAI models
0 stories
OptiLLM
Algorithmic SuperIntelligence Labs
AI/LLM optimization tool
0 stories
OrcaRouter
Continuum AI
AI model routing platform.
0 stories
Ostris Cloud
Ostris, LLC
Cloud software from Ostris, LLC.
0 stories
PaddlePaddle AI Studio
Baidu, Inc.
Baidu's AI development platform for PaddlePaddle.
0 stories
parakeet.cpp
Frikallo
C++ Parakeet implementation
0 stories
Pocket TTS
Kyutai
Kyutai text-to-speech product
0 stories
Prime Intellect Lab
Prime Intellect
Prime Intellect Lab
0 stories
PrismML
PrismML
PrismML software product
0 stories
Rosalind Biodefense
OpenAI
Biodefense-focused OpenAI tool
0 stories
RTX Spark
NVIDIA Corporation
Spark-branded NVIDIA product
0 stories
RunPod
Runpod, Inc.
Cloud GPU platform for AI
0 stories
SiMa.ai
SiMa.ai
AI software platform
0 stories
sparkrun
Spark Arena
sparkrun
0 stories
Thinking Machines
Thinking Machines Lab
Software product associated with Thinking Machines Lab
0 stories
TileLang
Tile-AI
A language and toolchain for efficient GPU kernel development
0 stories
TileLang-Ascend
Tile-AI
TileLang support for Ascend
0 stories
Together Fine-Tuning
Together AI
Fine-tune open models on Together AI.
0 stories
Train Models
TrainEngine.ai
AI model training platform.
0 stories
train-sentence-transformers
Hugging Face
Train sentence-transformer models in a Hugging Face Space
0 stories
Unsloth AI
Unsloth
Fine-tune LLMs faster.
0 stories
Unsloth Studio
Unsloth
Unsloth's studio for language-model workflows.
0 stories
Vast.ai
Vast.ai
GPU cloud marketplace
0 stories
Venice API
Venice.ai
Developer API for Venice.ai models
0 stories
Vertex AI
Google Cloud
A unified machine learning platform for building and using generative AI
0 stories
vLLM Omni
vLLM Project
Multimodal inference and serving platform
0 stories
vLLM-factory
vLLM Project
vLLM ecosystem software
0 stories
xAI
xAI
AI platform from xAI
0 stories
xAI CLI
xAI
Terminal access to xAI services
0 stories
ZenMux
AI Force Singapore Pte. Ltd.
Unverified software product.
0 stories