Skip to main content

Neural Processing Unit

Neural processing unit (NPU) is a specialized hardware accelerator for executing artificial neural network workloads, used in data centers, edge systems, and devices to increase performance per watt and throughput for AI inference compared with general-purpose compute architectures.