Emerging real-time multi-model ML (RTMM) workloads such as AR/VR and dro...
Real-time multi-model multi-task (MMMT) workloads, a new form of deep le...
Low-rank tensor compression has been proposed as a promising approach to...
Vision transformers (ViTs) have attracted much attention for their super...
Neural network accelerator is a key enabler for the on-device AI inferen...
Neural Architecture Search (NAS) has demonstrated its power on various A...
Recent advances in deep neural networks (DNNs) have made DNNs the backbo...
Machine learning (ML), especially deep learning is made possible by the ...
Federated learning enables resource-constrained edge compute devices, su...
Deep Neural Networks are becoming increasingly popular in always-on IoT ...
Efficient and compact neural network models are essential for enabling t...
Hardware acceleration of Deep Neural Networks (DNNs) aims to tame their ...
Keyword spotting (KWS) is a critical component for enabling speech based...
Deep convolutional neural network (CNN) inference requires significant a...