[Paper] Emulation of Complex Matrix Multiplication based on the Chinese Remainder Theorem
Modern computing architectures feature low-precision matrix multiplication units that achieve substantially higher throughput than their high-precision counterp...
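
To illustrate the idea named in the title, the following is a minimal sketch of an exact complex integer matrix product assembled from modular products via the Chinese Remainder Theorem. It is not the paper's algorithm: the function names (`matmul_mod`, `crt_symmetric`, `complex_matmul_crt`), the use of four real products per modulus, and the choice of moduli are all assumptions made for illustration, and plain Python integers stand in for the low-precision hardware units.

```python
from math import prod


def matmul_mod(A, B, p):
    """Multiply integer matrices after reducing every entry modulo p.

    Reducing the inputs first keeps each operand in [0, p), which is the
    property a low-precision unit would rely on (assumption of this sketch).
    """
    n, k, m = len(A), len(B), len(B[0])
    Ap = [[a % p for a in row] for row in A]
    Bp = [[b % p for b in row] for row in B]
    return [[sum(Ap[i][t] * Bp[t][j] for t in range(k)) % p
             for j in range(m)] for i in range(n)]


def crt_symmetric(residues, moduli):
    """Return the unique integer x with x = r_i (mod p_i) and |x| <= P // 2."""
    P = prod(moduli)
    x = 0
    for r, p in zip(residues, moduli):
        q = P // p
        x += r * q * pow(q, -1, p)  # pow(q, -1, p) is the inverse of q mod p
    x %= P
    return x - P if x > P // 2 else x


def complex_matmul_crt(Ar, Ai, Br, Bi, moduli):
    """Compute (Ar + i*Ai) @ (Br + i*Bi) exactly for integer matrices.

    The moduli must be pairwise coprime and their product must exceed twice
    the largest absolute entry of the true result.
    """
    n, m = len(Ar), len(Br[0])
    real_res, imag_res = [], []
    for p in moduli:
        rr, ii = matmul_mod(Ar, Br, p), matmul_mod(Ai, Bi, p)
        ri, ir = matmul_mod(Ar, Bi, p), matmul_mod(Ai, Br, p)
        # Real part: Ar*Br - Ai*Bi; imaginary part: Ar*Bi + Ai*Br (all mod p).
        real_res.append([[(rr[i][j] - ii[i][j]) % p for j in range(m)]
                         for i in range(n)])
        imag_res.append([[(ri[i][j] + ir[i][j]) % p for j in range(m)]
                         for i in range(n)])
    Cr = [[crt_symmetric([R[i][j] for R in real_res], moduli)
           for j in range(m)] for i in range(n)]
    Ci = [[crt_symmetric([R[i][j] for R in imag_res], moduli)
           for j in range(m)] for i in range(n)]
    return Cr, Ci
```

For example, `complex_matmul_crt(Ar, Ai, Br, Bi, [251, 241, 239])` recovers the exact product as long as every entry of the result is smaller in magnitude than half the product of the three moduli. Mapping the reconstructed value to the symmetric range is what allows negative entries to be recovered.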