Pinned

  1. vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python · 62.1k stars · 11k forks

  2. llm-compressor Public

    Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

    Python · 2.2k stars · 276 forks

  3. recipes Public

    Common recipes to run vLLM

    Jupyter Notebook · 206 stars · 73 forks

Repositories

Showing 10 of 26 repositories
  • vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python · 62,069 stars · Apache-2.0 · 11,034 forks · 1,892 issues (27 need help) · 1,247 pull requests · Updated Nov 5, 2025
  • tpu-inference Public

    TPU inference for vLLM, with unified JAX and PyTorch support.

    Python · 143 stars · Apache-2.0 · 21 forks · 8 issues (1 needs help) · 55 pull requests · Updated Nov 5, 2025
  • vllm-gaudi Public

    Community maintained hardware plugin for vLLM on Intel Gaudi

    Python · 15 stars · Apache-2.0 · 62 forks · 0 issues · 61 pull requests · Updated Nov 5, 2025
  • vllm-ascend Public

    Community maintained hardware plugin for vLLM on Ascend

    Python · 1,315 stars · Apache-2.0 · 536 forks · 657 issues (8 need help) · 188 pull requests · Updated Nov 5, 2025
  • vllm-spyre Public

    Community maintained hardware plugin for vLLM on Spyre

    Python · 37 stars · Apache-2.0 · 26 forks · 6 issues · 16 pull requests · Updated Nov 5, 2025
  • ci-infra Public

    This repo hosts code for vLLM CI & Performance Benchmark infrastructure.

    HCL · 24 stars · Apache-2.0 · 44 forks · 0 issues · 20 pull requests · Updated Nov 5, 2025
  • vllm-xpu-kernels Public

    The vLLM XPU kernels for Intel GPU

    C++ · 11 stars · Apache-2.0 · 13 forks · 0 issues · 5 pull requests · Updated Nov 5, 2025
  • recipes Public

    Common recipes to run vLLM

    Jupyter Notebook · 206 stars · Apache-2.0 · 73 forks · 6 issues · 5 pull requests · Updated Nov 5, 2025
  • semantic-router Public

    Intelligent Router for Mixture-of-Models

    Rust · 2,154 stars · Apache-2.0 · 277 forks · 93 issues (17 need help) · 30 pull requests · Updated Nov 5, 2025
  • production-stack Public

    vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

    Python · 1,905 stars · Apache-2.0 · 313 forks · 89 issues (3 need help) · 58 pull requests · Updated Nov 5, 2025