Matrix-vector multiplication implemented in off-the-shelf DRAM for Low-Bit LLMs May 5, 2025 by appcompact