Convert Python Code to Apk

spbitnet — Sparse-BitNet Inference on Consumer GPUs

Custom CUDA kernels for accelerating 1.58-bit ternary LLM inference with 2:4 structured sparsity on NVIDIA Ampere GPUs. Implements the core ideas from Sparse-BitNet (Zhang et al., March 2026) with ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

spbitnet — Sparse-BitNet Inference on Consumer GPUs

Trending now