Pull requests: intel/auto-round

Pull requests list

Continuously optimize AutoScheme RAM consumption
#1703 opened Apr 17, 2026 by lvliang-intel Contributor
2 of 9 tasks
chore: add shared agent config layout
#1700 opened Apr 17, 2026 by yiliu30 Contributor
support model_free WOQ quantization
#1699 opened Apr 17, 2026 by xin3he Contributor
4 of 9 tasks
Fix Qwen Omni quantization model issue for long form audio generation
#1698 opened Apr 17, 2026 by lvliang-intel Contributor
2 of 9 tasks
update mtp quant for special cases
#1691 opened Apr 16, 2026 by xin3he Contributor
2 of 9 tasks
Fix module.to("meta") for models with plain Tensors
#1688 opened Apr 15, 2026 by yiliu30 Contributor
1 of 9 tasks
rename scheme INT8_W8A8 to INT8
#1687 opened Apr 15, 2026 by thuang6 Contributor
4 of 9 tasks
Feats: Quantize/save/evaluate the Wan-AI/WAN2.2 models in w4a16 format
#1678 opened Apr 14, 2026 by lvliang-intel Contributor
2 of 9 tasks
quick fix: gptqmodel no longer includes gptqmodel_marlin_kernels
#1671 opened Apr 9, 2026 by xin3he Contributor Draft
1 of 9 tasks
[step 1]support variable block input shapes for gemma4
#1656 opened Apr 3, 2026 by wenhuach21 Contributor
2 of 9 tasks
add support for gemma4 model
#1655 opened Apr 3, 2026 by n1ck-guo Contributor
1 of 9 tasks
fix gguf issue in alg_ext.py
#1649 opened Apr 2, 2026 by wenhuach21 Contributor
9 tasks
inplace hadamard
#1641 opened Mar 31, 2026 by wenhuach21 Contributor Draft
3 tasks
[Draft] Support TurboQuant KV-cache quantization
#1634 opened Mar 27, 2026 by lvliang-intel Contributor Draft
2 of 9 tasks
Support ByteDance-Seed/BAGEL-7B-MoT quantization in w4a16 format
#1633 opened Mar 27, 2026 by lvliang-intel Contributor
2 of 9 tasks
feat: add --dry-run estimation mode
#1592 opened Mar 22, 2026 by mvanhorn
[Step1 ]new architecture for auto_round api/new engineering ready only add when the PR is ready to merge
#1542 opened Mar 13, 2026 by n1ck-guo Contributor
1 of 9 tasks
0.12.0
[N4Landing]update draft
#1538 opened Mar 12, 2026 by wenhuach21 Contributor
9 tasks
Enhance llmc CI on GPU and XPU
#1483 opened Mar 2, 2026 by chensuyue Contributor
1 of 9 tasks
0.13.0
Add asym for XPU backend.
#1316 opened Jan 22, 2026 by luoyu-intel Contributor Draft
[WIP][refactor quantizers][step 1] refactor rtn and tuning
#1278 opened Jan 14, 2026 by n1ck-guo Contributor