Tool profile

intel/auto-round

stalling

A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility with vLLM, SGLang, and Transformers.

diffusers, gguf, int4, llms, mxfp4, nvfp4, omni, quantization

Velocity score

0.51 / 10

[STARS]

1.5k

[FORKS]

149

[CONTRIBUTORS]

[LAST_COMMIT]

today


Is intel/auto-round still actively maintained?

Score breakdown

669 / 1000

inference · intel/auto-round

Velocity 50%

Adoption 30%

Maintenance 15%

Community 5%

[CODE_GROWTH]

769

[INSTALL_VEL]

498

[ACTIVITY]

595

[COMMUNITY_SIGNAL]

918

Terminal score: 0–1000 raw, weighted across 4 dimensions. Public score: 0–10 normalized (shown in the 30-day stars chart above).
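
As a rough check on how the terminal score could be assembled, the sketch below takes the four sub-scores and the four weights listed in the breakdown and computes their weighted sum. The pairing of each sub-score with a dimension (CODE_GROWTH with Velocity, INSTALL_VEL with Adoption, ACTIVITY with Maintenance, COMMUNITY_SIGNAL with Community) is an assumption based on their order on the page, and the function name `terminal_score` is hypothetical. The page does not state how the 0–10 public score is normalized, so that step is not reproduced here.

```python
# Weights from the score breakdown, expressed as fractions of 1.
WEIGHTS = {
    "velocity": 0.50,
    "adoption": 0.30,
    "maintenance": 0.15,
    "community": 0.05,
}

# Sub-scores (0-1000) from the page. Matching each one to a
# dimension is an assumption based on the order they appear in.
SUB_SCORES = {
    "velocity": 769,     # [CODE_GROWTH]
    "adoption": 498,     # [INSTALL_VEL]
    "maintenance": 595,  # [ACTIVITY]
    "community": 918,    # [COMMUNITY_SIGNAL]
}


def terminal_score(sub_scores, weights):
    """Return the weighted sum of the 0-1000 sub-scores."""
    return sum(weights[name] * sub_scores[name] for name in weights)


raw = terminal_score(SUB_SCORES, WEIGHTS)
print(round(raw))  # prints 669
```

Under that assumed pairing, the weighted sum rounds to 669, which agrees with the terminal score shown in the breakdown.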