Skip to content
@IST-DASLab

IST Austria Distributed Algorithms and Systems Lab

Popular repositories Loading

  1. gptq gptq Public

    Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

    Python 2.3k 193

  2. marlin marlin Public

    FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

    Python 1k 86

  3. sparsegpt sparsegpt Public

    Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

    Python 866 116

  4. PanzaMail PanzaMail Public

    Python 297 19

  5. qmoe qmoe Public

    Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

    Python 280 24

  6. llmq llmq Public

    Quantized LLM training in pure CUDA/C++.

    C++ 238 14

Repositories

Showing 10 of 78 repositories
  • ISTA-DASLab-Optimizers-CUDA Public

    This repository contains the code for CUDA kernels that support `ISTA-DASLab-Optimizers` project

    IST-DASLab/ISTA-DASLab-Optimizers-CUDA’s past year of commit activity
    0 MIT 0 0 0 Updated Feb 4, 2026
  • MatGPTQ Public

    MatGPTQ is a one-shot quantization technique that quantizes a model to multiple bit-widths, which can be served in different environments by leveraging custom kernel support.

    IST-DASLab/MatGPTQ’s past year of commit activity
    Python 2 MIT 0 1 0 Updated Feb 4, 2026
  • behemoth Public

    Library for creating synthetic tabular data and converting it to sentences, for training models.

    IST-DASLab/behemoth’s past year of commit activity
    Python 0 Apache-2.0 0 0 0 Updated Feb 2, 2026
  • Quartet-II Public

    Quartet II Official Code

    IST-DASLab/Quartet-II’s past year of commit activity
    Python 33 3 0 0 Updated Feb 2, 2026
  • WUSH Public
    IST-DASLab/WUSH’s past year of commit activity
    1 Apache-2.0 0 0 0 Updated Feb 2, 2026
  • DASH Public

    Official implementation of DASH optimizer

    IST-DASLab/DASH’s past year of commit activity
    0 MIT 0 1 0 Updated Feb 2, 2026
  • llmq Public

    Quantized LLM training in pure CUDA/C++.

    IST-DASLab/llmq’s past year of commit activity
    C++ 238 Apache-2.0 14 0 0 Updated Jan 20, 2026
  • Quartet Public
    IST-DASLab/Quartet’s past year of commit activity
    Jupyter Notebook 119 MIT 12 2 0 Updated Jan 8, 2026
  • local_platinum_bench Public

    This repo allows you to run Platinum Bench evals via vLLM.

    IST-DASLab/local_platinum_bench’s past year of commit activity
    Python 0 CC-BY-4.0 0 0 1 Updated Dec 19, 2025
  • MoE-Quant Public

    Code for data-aware compression of DeepSeek models

    IST-DASLab/MoE-Quant’s past year of commit activity
    Python 70 10 4 1 Updated Dec 11, 2025

Top languages

Loading…