2

[MICRO 2020 (Best Paper)] A Hardware–Software Blueprint for Flexible Deep Learning Specialization

This article describes the Versatile Tensor Accelerator (VTA), a programmable DL architecture designed to be extensible in the face of evolving workloads. VTA achieves “flexible specialization” via a parameterizable architecture, two-level …