All Stories

  1. Cambricon-R: A Fully Fused Accelerator for Real-Time Learning of Neural Scene Representation
  2. RM-STC: Row-Merge Dataflow Inspired GPU Sparse Tensor Core for Energy-Efficient Sparse Acceleration
  3. Caffeine: Towards Uniformed Representation and Acceleration for Deep Convolutional Neural Networks
  4. OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization