2026
Y.-C. Chen, C. P. Lee, Ze-Wei Liou, N. Verma
(2026).
Massive Spikes in LLMs are Bias Vectors: Mechanistic Uncovering and Spike-Free Quantization.
arXiv ‘26.
2025
Ze-Wei Liou, D.-Y. Hong
(2025).
Optimizing Compute Core Assignment for Dynamic Batch Inference in AI Inference Accelerator.
ACM SAC ‘25.
2023