Enabling Efficient Computer Architectural and System Support for Next-Generation Deep Learning Applications
中文字永2021在线观看将于2022年3月12日（周六）举行主题为“Enabling Efficient Computer Architectural and System Support for Next-Generation Deep Learning Applications”的学术报告会。敬请光临！
报告题目：Enabling Efficient Computer Architectural and System Support for Next-Generation Deep Learning Applications
报 告 人：长江学者 李涛 教授
报告时间：2022年3月12 日周六 10:30
In recent years, the artificial intelligence (AI) techniques, represented by deep neural networks (DNN), have emerged as indispensable tools in many fields. Traditionally, due to its huge compute power and scalability, the cloud data center is often the best option for training and evaluating AI applications. With the increasing computing power and energy efficiency of mobile devices, there is a growing interest in performing AI applications on mobile platforms. As a result, we believe the next-generation AI applications are pervasive across all platforms, ranging from central cloud data center to edge-side wearable and mobile devices.
However, we observe several gaps that challenge the pervasive AI applications. First, the large size of such newly developed AI networks poses both throughput and energy challenges to the underlying processing hardware, which hinders ubiquitous deployment for many promising AI applications. Second, the traditional statically trained AI model in cloud data center could not efficiently handle the dynamic data in the real in-situ environments, which leads to low inference accuracy. Lastly, the training of AI models still involves extensive human efforts to collect and label the large-scale dataset, which becomes impractical in big data era where raw data is largely un-labeled and uncategorized.
In this talk, I will present architecture and system support which enables next generation AI applications to become high efficient and intelligent. I will first introduce Pervasive AI, a user satisfaction-aware deep learning inference framework, to provide the best user satisfaction when migrating AI-based applications from Cloud to all kinds of platforms. Next, I will describe In-situ AI, a novel-computing paradigm tailored to in-situ AI applications. Furthermore, to tackle the big data challenge and achieve real intelligent (support autonomous learning), I will introduce Unsupervised AI, an unsupervised GAN-based deep learning accelerator.