[Paper] Collapse or Preserve: Data-Dependent Temporal Aggregation for Spiking Neural Network Acceleration
Spike sparsity is widely believed to enable efficient spiking neural network (SNN) inference on GPU hardware. We demonstrate this is an illusion: five distinct ...