Niche: inference
Inference & Serving
[TOOLS_TRACKED] 36
[ACCELERATING] 0
[DYING] 2
[AVG_VELOCITY] 0.04/10
[ACCELERATING] Accelerating: 0
No accelerating tools right now.
[STABLE] Stable: 10
| Tool | Description | Velocity | Δ 7d | Stars | Class |
|---|---|---|---|---|---|
| ray-project/ray | Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. | 2.25 | ↑ +80 | 42k | Stable |
| GeeeekExplorer/nano-vllm | Nano vLLM | 1.17 | ↑ +109 | 13k | Stable |
| OpenRLHF/OpenRLHF | An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL) | 0.92 | ↑ +34 | 9.5k | Stable |
| bentoml/BentoML | The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more! | 0.90 | ↑ +18 | 8.6k | Stable |
| xorbitsai/inference | Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API. | 0.65 | ↑ +12 | 9.3k | Stable |
| NVIDIA/kvpress | LLM KV cache compression made easy | 0.64 | ↑ +9 | 1.1k | Stable |
| Lightning-AI/litgpt | 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. | 0.50 | ↑ +10 | 13k | Stable |
| stas00/ml-engineering | Machine Learning Engineering Open Book | 0.29 | ↑ +47 | 18k | Stable |
| SafeAILab/EAGLE | Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25). | 0.21 | ↑ +14 | 2.3k | Stable |
| beam-cloud/beta9 | Ultrafast serverless GPU inference, sandboxes, and background jobs | 0.16 | ↑ +8 | 1.6k | Stable |
[STALLING] Stalling and dying: 5
| Tool | Description | Velocity | Δ 7d | Stars | Class |
|---|---|---|---|---|---|
| jd-opensource/xllm | A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators. | 0.92 | ↑ +7 | 1.3k | Stalling |
| Tiiny-AI/PowerInfer | High-speed Large Language Model Serving for Local Deployment | 0.10 | ↑ +28 | 9.4k | Dying |
| adithya-s-k/AI-Engineering.academy | Mastering Applied AI, One Concept at a Time | 0.09 | ↑ +4 | 2.2k | Stalling |
| meta-llama/llama-cookbook | Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services | 0.06 | ↑ +18 | 18k | Dying |
| RunanywhereAI/runanywhere-sdks | Production ready toolkit to run AI locally | 0.05 | ↑ +1 | 10k | Stalling |