f1f98e369f
This change adds support for 2:4 sparsity when using Marlin quantization. The 2:4 kernel is used when: * The quantizer is `marlin`; * the quantizer checkpoint format is `marlin_24`. Fixes #2098. |
||
---|---|---|
.. | ||
base.h | ||
mem.h | ||
mma.h |