Skip to content
@IST-DASLab

IST Austria Distributed Algorithms and Systems Lab

Popular repositories Loading

  1. gptq gptq Public

    Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

    Python 2.3k 195

  2. marlin marlin Public

    FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

    Python 1k 87

  3. sparsegpt sparsegpt Public

    Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

    Python 877 120

  4. PanzaMail PanzaMail Public

    Python 301 22

  5. qmoe qmoe Public

    Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

    Python 280 24

  6. llmq llmq Public

    Quantized LLM training in pure CUDA/C++.

    C++ 244 14

Repositories

Showing 10 of 84 repositories
  • CloverLM Public

    🍀 Codebase for CloverLM

    IST-DASLab/CloverLM’s past year of commit activity
    Python 6 MIT 0 0 0 Updated Apr 2, 2026
  • nanochat_optimizers Public Forked from karpathy/nanochat

    The best ChatGPT that $100 can buy.

    IST-DASLab/nanochat_optimizers’s past year of commit activity
    Python 0 MIT 6,823 0 0 Updated Mar 27, 2026
  • IST-DASLab/ISTA-DASLab-Optimizers’s past year of commit activity
    Python 14 MIT 0 0 0 Updated Mar 25, 2026
  • Quartet-II Public

    Quartet II Official Code

    IST-DASLab/Quartet-II’s past year of commit activity
    Python 65 8 0 1 Updated Mar 23, 2026
  • Quartet Public
    IST-DASLab/Quartet’s past year of commit activity
    Jupyter Notebook 120 MIT 12 2 0 Updated Mar 18, 2026
  • GridSearcher Public

    GridSearcher simplifies running grid searches for machine learning projects in Python, emphasizing parallel execution and GPU scheduling without dependencies on SLURM or other workload managers.

    IST-DASLab/GridSearcher’s past year of commit activity
    Python 5 Apache-2.0 0 0 0 Updated Mar 17, 2026
  • llmq Public

    Quantized LLM training in pure CUDA/C++.

    IST-DASLab/llmq’s past year of commit activity
    C++ 244 Apache-2.0 14 0 1 Updated Mar 6, 2026
  • ant Public Forked from gvlassis/ant

    🐜 Research-friendly Deep Learning framework

    IST-DASLab/ant’s past year of commit activity
    Python 0 MIT 1 0 0 Updated Mar 3, 2026
  • FP-Quant Public
    IST-DASLab/FP-Quant’s past year of commit activity
    Python 104 17 12 3 Updated Feb 26, 2026
  • Trion Public

    Code for Trion, a faster implementation of Dion (low-rank Muon) via Dynamic Column Selection based on a fixed projection matrix.

    IST-DASLab/Trion’s past year of commit activity
    0 0 0 0 Updated Feb 23, 2026

Top languages

Loading…

Most used topics

Loading…