Skip to content

Popular repositories Loading

  1. any-precision-llm any-precision-llm Public

    [ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs

    Python 123 7

  2. flashTP flashTP Public

    Torch-native C++/CUDA library to accelerate tensor-product layers in MLIPs

    Cuda 53 4

  3. Ginex Ginex Public

    Ginex: SSD-enabled Billion-scale Graph Neural Network Training on a Single Machine via Provably Optimal In-memory Caching

    Python 41 8

  4. flashneuron flashneuron Public

    C++ 40 6

  5. OpenDNN OpenDNN Public

    OpenDNN: An Open-source, cuDNN-like Deep Learning Primitive Library

    C++ 26 5

  6. DecDEC DecDEC Public

    [OSDI 2025] DecDEC: A Systems Approach to Advancing Low‑Bit LLM Quantization

    Python 21 3

Repositories

Showing 10 of 78 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…