Paper-Conference

Optimizing Compute Core Assignment for Dynamic Batch Inference in AI Inference Accelerator

Modern AI inference accelerators offer high-performance and power-efficient computations for machine learning models. Most accelerators employ static inference to enhance …

avatar
Ze-Wei Liou

Design of Analog-AI Hardware Accelerators for Transformer-based Language Models

Analog Non-Volatile Memory-based accelerators offer high-throughput and energy-efficient Multiply-Accumulate operations for the large Fully-Connected layers that dominate …

g.-w.-burr