[ MENU ]
LIVE
────────
·
──:──:── UTC
about
⌘
K
search
?
help
[ MENU ]
All niches
Niche
inference
Inference & Serving
[
TOOLS_TRACKED
]
36
[
ACCELERATING
]
0
[
DYING
]
0
[
AVG_VELOCITY
]
0.47
/10
See our pick → Best Inference & Serving
[
ACCELERATING
]
Accelerating
0
No accelerating tools right now.
[
STABLE
]
Stable
10
Tool
Velocity
Trend 30d
Δ 7d
Stars
Class
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
4.28
↑ +841
84k
Stable
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
2.30
↑ +82
43k
Stable
google-ai-edge/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
2.22
↑ +97
36k
Stable
jd-opensource/xllm
A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.
1.46
↑ +18
1.4k
Stable
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
0.85
↑ +27
9.7k
Stable
Avarok-Cybersecurity/atlas
Pure Rust Inference Engine
0.78
↑ +18
529
Stable
xorbitsai/inference
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
0.77
↑ +19
9.4k
Stable
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
0.65
↑ +7
13k
Stable
GeeeekExplorer/nano-vllm
Nano vLLM
0.53
↑ +81
14k
Stable
bentoml/BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
0.52
↑ +9
8.7k
Stable
LIVE
────────
·
──:──:── UTC
about
⌘
K
search
?
help