⚙️
LLM Inference & Serving
115 tools
Inference runtimes, model serving platforms, fine-tuning infra, and GPU/accelerator providers for LLMs.
AI Studio
Google
Build with Gemini
5 stories
Grok Build
xAI
Build with Grok
5 stories
fal
fal
Serverless AI inference platform
3 stories
Gemini Omni
Google
Google's AI assistant product
3 stories
Ollama
Ollama Inc.
Get up and running with large language models, locally.
2 stories
OpenRouter
OpenRouter, Inc.
The unified interface for AI models.
2 stories
BytePlus
BytePlus Pte. Ltd.
Enterprise software platform
1 story
Claude
Anthropic
The AI assistant from Anthropic
1 story
Claude Console
Anthropic
Claude API console
1 story
Hugging Face Hub
Hugging Face
The platform for models, datasets, and Spaces.
1 story
LM Studio
Element Labs, Inc.
Run AI models locally on your computer.
1 story
NVIDIA RTX Spark
NVIDIA
NVIDIA-branded software product
1 story
xAI
xAI
AI platform from xAI
1 story
AFM Playground
Arcee AI
Playground for AFM
0 stories
AI Gateway
Vercel
A single gateway for AI model access and routing.
0 stories
AI/ML API
AI/ML API
Unified API for AI models
0 stories
Amazon Bedrock
Amazon Web Services
Build and scale generative AI applications with foundation models.
0 stories
Arcee
Arcee AI
Enterprise AI platform for language-model workflows
0 stories
Baidu AI Studio LLM API
Baidu
API service for Baidu AI Studio
0 stories
Baidu Qianfan
Baidu
Baidu's large-model development platform
0 stories
Baseten
Baseten
Deploy and serve AI models
0 stories
Cerebras
Cerebras Systems
Hosted AI inference platform
0 stories
Claude Code Router
musistudio
Open-source Claude Code router
0 stories
Claude Platform on AWS
Anthropic
Claude on AWS
0 stories
CloudMatrix-Infer
Huawei Cloud
AI inference service
0 stories
Coding Agents
Baseten
Build and run coding agents.
0 stories
Conway
Conway Research
Conway by Conway Research
0 stories
Core AI PyTorch Extensions
Core AI
PyTorch extensions from Core AI
0 stories
Decoupled DiLoCo
Google DeepMind
Decentralized, communication-efficient model training.
0 stories
DeepClaude
Asterisk
AI service combining Claude and DeepSeek-style reasoning
0 stories
DeepEP
DeepSeek
Communication library for expert parallelism.
0 stories
DeepGEMM
DeepSeek
Efficient FP8 GEMM library
0 stories
DeepSeek-Reasonix
DeepSeek
Unverified target name
0 stories
DFlash
Z Lab
DFlash
0 stories
DGX Spark
NVIDIA
Personal AI supercomputer
0 stories
Diffusers
Hugging Face
Diffusion models in PyTorch and beyond.
0 stories
DwarfStar
antirez
Antirez-associated software project
0 stories
Exo
Exo Labs
Distributed inference across devices.
0 stories
Factory Router
Factory
AI request routing for Factory
0 stories
Fireworks AI
Fireworks.ai, Inc.
AI inference and model deployment platform
0 stories
FlashLib
FlashML-org
FlashLib
0 stories
FlashMLA
DeepSeek
Fast MLA decoding
0 stories
FlashQLA
Alibaba Cloud
FlashQLA
0 stories
ForgeTrain
OpenBMB
Training tool for large-language-model workflows
0 stories
Gemini Live API
Google
Build live, multimodal experiences with Gemini.
0 stories
Google AI Edge Gallery
Google LLC
Explore and run AI models on-device.
0 stories
Google Cloud
Google
Cloud computing for building, deploying, and scaling applications on Google infrastructure.
0 stories
GPT4All
Nomic AI
Run large language models locally.
0 stories
Groq
Groq
Fast AI inference
0 stories
gstack
gstack
Software product
0 stories
GuideLLM
Red Hat
Benchmarking tool for LLM serving systems
0 stories
Hugging Face
Hugging Face
AI platform for models, datasets, and apps.
0 stories
Interfaze
JigsawStack, Inc.
Interfaze by JigsawStack
0 stories
LangSmith LLM Gateway
LangChain
LLM gateway for LangSmith
0 stories
Lightning AI
Lightning AI, Inc.
Build, train, and deploy AI apps
0 stories
LiteLLM
BerriAI
All LLM APIs in the OpenAI format
0 stories
llama.cpp
ggml.ai
Inference of LLMs in C/C++
0 stories
llama.cpp
Georgi Gerganov
Port of LLaMA to C/C++
0 stories
llmster
Element Labs, Inc.
llmster
0 stories
Meta CLI
Meta
Meta CLI for developer workflows
0 stories
Miles RL Training
RadixArk
RL training software
0 stories
Mirage
Crisp
AI for customer support
0 stories
Mistral Studio
Mistral AI
Studio for building AI applications on Mistral AI
0 stories
ModellixAI
ModellixAI
Provisional catalog entry for the ModellixAI product name.
0 stories
ModelScope
Alibaba Cloud
Open-source model community and platform
0 stories
Modular
Modular
AI platform
0 stories
Mooncake
KVCache.ai
KV-cache software for LLM serving
0 stories
Multimodal Max
Modular
Multimodal AI service
0 stories
NeMo-RL
NVIDIA
Reinforcement learning for LLM post-training
0 stories
Nous Portal
Nous Research
AI portal from Nous Research.
0 stories
Novita Sandbox
Novita AI
Novita AI sandbox for testing AI services.
0 stories
NVIDIA DGX 8xB200
NVIDIA
8x B200 GPU DGX system
0 stories
NVIDIA NIM
NVIDIA
Inference microservices for AI models
0 stories
Open Generative AI
Muapi
Generative AI software product
0 stories
Open Responses
Open Responses
Open Responses
0 stories
OpenAI API
OpenAI
Build with OpenAI models.
0 stories
OpenAI API
OpenAI
Developer API for OpenAI models
0 stories
OpenAI Guaranteed Capacity
OpenAI
Reserved capacity for OpenAI models and APIs
0 stories
OpenAI Platform
OpenAI
Build with OpenAI APIs and tools.
0 stories
OptiLLM
Algorithmic SuperIntelligence Labs
AI/LLM optimization tool
0 stories
OrcaRouter
Continuum AI
AI model routing platform.
0 stories
Ostris Cloud
Ostris, LLC
Cloud-based software product from Ostris, LLC.
0 stories
PaddlePaddle AI Studio
Baidu, Inc.
Baidu's AI development platform for PaddlePaddle.
0 stories
parakeet.cpp
Frikallo
C++ Parakeet implementation
0 stories
Pocket TTS
Kyutai
Kyutai text-to-speech product
0 stories
Prime Intellect
Prime Intellect
Decentralized AI infrastructure
0 stories
Prime Intellect Lab
Prime Intellect
Prime Intellect Lab
0 stories
PrismML
PrismML
PrismML software product
0 stories
Rosalind Biodefense
OpenAI
OpenAI biodefense software product.
0 stories
RTX Spark
NVIDIA Corporation
NVIDIA software product
0 stories
RunPod
Runpod, Inc.
Cloud GPU platform for AI
0 stories
SGLang
LMSYS Corp.
A fast serving framework for large language models
0 stories
SiMa.ai
SiMa.ai
Edge AI platform
0 stories
sparkrun
Spark Arena
sparkrun
0 stories
Thinking Machines
Thinking Machines Lab
Official site
0 stories
Tile Kernels
Hangzhou DeepSeek Artificial Intelligence Co., Ltd.
Tile-based GPU kernels from DeepSeek.
0 stories
TileLang
Tile-AI
A language and toolchain for efficient GPU kernel development
0 stories
TileLang-Ascend
Tile-AI
TileLang support for Ascend
0 stories
Together AI
Together AI
The AI Acceleration Cloud
0 stories
Together Fine-Tuning
Together AI
Fine-tune open models on Together AI.
0 stories
Train Models
TrainEngine.ai
AI model training platform.
0 stories
train-sentence-transformers
Hugging Face
Train sentence-transformers models on Hugging Face
0 stories
Unsloth
Unsloth AI
Fast, easy, and cheap fine-tuning for LLMs.
0 stories
Unsloth AI
Unsloth
Fine-tune LLMs faster.
0 stories
Unsloth Studio
Unsloth
Fine-tuning and model management platform
0 stories
Vast.ai
Vast.ai
GPU cloud marketplace
0 stories
Venice API
Venice.ai
Developer API for Venice.ai models
0 stories
Vertex AI
Google Cloud
A unified machine learning platform for building and using generative AI
0 stories
vLLM
vLLM Project
Fast and easy-to-use library for LLM inference and serving.
0 stories
vLLM Omni
vLLM Project
Multimodal inference and serving platform
0 stories
vLLM-factory
vLLM Project
vLLM ecosystem software
0 stories
xAI CLI
xAI
Terminal access to xAI services
0 stories
ZenMux
AI Force Singapore Pte. Ltd.
Unverified software product.
0 stories
Zyphra Cloud
Zyphra Technologies Inc.
Cloud AI access
0 stories
Zyphra Inference
Zyphra
Hosted inference service
0 stories