Abstract: Transformers are at the core of modern AI nowadays. They rely heavily on matrix multiplication and require efficient acceleration due to their substantial memory and computational ...
Abstract: Multiplication is a fundamental operation in neural network models. However, signed multibit multiplication and accumulation (MAC) pose significant challenges, primarily due to the ...