Yang Bai
Ph.D., Department of Computer Science and Engineering, The Chinese University of Hong Kong
GitHub | Curriculum Vitae | ybai62868 at gmail dot com
I'm a Ph.D. at the Department of Computer Science and Engineering, The Chinese University of Hong
Kong, working with Prof. Bei Yu. My research interests
lie in applying compilers to generate high-performance tensor programs and spatial
accelerators, including:
- Machine learning algorithms for automating system design
- Compilers for optimizing deep learning accelerator design
- Domain-specific languages for software/hardware co-design
Education
[Aug 2020 - July 2024] The Chinese University of Hong Kong
Ph.D., major in Computer Science.
[Sep 2017 - Jun 2020] Chinese Academy of Sciences
M.Eng., major in Computer Science.
[Sep 2013 - Jun 2017] Xidian University
B.Eng., major in Telecommunication Engineering.
Experience
[Jun 2019 - Oct 2019] Computer Systems Lab, Cornell University
Worked with Prof. Zhiru Zhang.
Multi-Paradigm Programming Infra for Software-Defined Heterogeneous Computing (HeteroCL).
Projects
OpenCL Backend Development for HeteroCL
[code]
I developed the Xilinx and Intel OpenCL backends for the TVM-inspired HeteroCL and implemented critical compiler backend optimizations: loop unrolling, loop pipelining, and partitioning for the Xilinx OpenCL backend, and arbitrary-precision integers for the Intel OpenCL backend. I implemented the whole pipeline from the Python-based domain-specific language to the FPGA-targeted compilation flow.
AWS-F1 Tutorial for KNN-DigitRec
[tutorial]
I wrote tutorials for running the KNN-DigitRec example on AWS-F1 using HeteroCL. I designed a new target backend for AWS development that combines HeteroCL Vivado HLS C++ code with a Rosetta-based host file to automatically generate host and wrapper files for design automation.
Publications
Design Automation with Efficient Compilation on Hardware Accelerators
Yang Bai
Ph.D. Thesis
ChatPattern: Layout Pattern Customization via Natural Language
Zixiao Wang, Yunheng Shen, Xufeng Yao, Wenqian Zhao, Yang Bai, Farzan Farnia, Bei Yu
ACM/IEEE Design Automation Conference (DAC) 2024
AdaOPC 2.0: Enhanced Adaptive Mask Optimization Framework for Via Layers
Wenqian Zhao, Xufeng Yao, Shuo Yin, Yang Bai, Ziyang Yu, Yuzhe Ma, Bei Yu, Martin D.F. Wong
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD)
ATFormer: A Learned Performance Model with Transfer Learning Across Devices for Deep Learning Tensor Programs
Yang Bai, Wenqian Zhao, Shuo Yin, Zixiao Wang, Bei Yu
Conference on Empirical Methods in Natural Language Processing (EMNLP-main Long Paper) 2023
SoC-Tuner: An Importance-guided Exploration Framework for DNN-targeting SoC Design
Shixin Chen, Su Zheng, Chen Bai, Wenqian Zhao, Shuo Yin, Yang Bai, Bei Yu
IEEE/ACM Asia and South Pacific Design Automation Conference (ASP-DAC) 2024
GTCO: Graph and Tensor Co-Design for Transformer-based Image Recognition on Tensor Cores
Yang Bai, Xufeng Yao, Qi Sun, Wenqian Zhao, Shixin Chen, Zixiao Wang, Bei Yu
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD)
ALCOP: Automatic Load-Compute Pipelining in Deep Learning Compiler for AI-GPUs
Guyue Huang, Yang Bai, Liu Liu, Yuke Wang, Bei Yu, Yufei Ding, Yuan Xie
Conference on Machine Learning and Systems (MLSys) 2023
DiffPattern: Layout Pattern Generation via Discrete Diffusion
Zixiao Wang, Yunheng Shen, Wenqian Zhao, Yang Bai, Guojin Chen, Farzan Farnia, Bei Yu
ACM/IEEE Design Automation Conference (DAC) 2023
A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU
Wenqian Zhao, Yang Bai, Qi Sun, Wenbo Li, Haisheng Zheng, Nianjuan Jiang, Jiangbo Lu, Bei Yu, Martin D.F. Wong
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD)
AutoGraph: Optimizing DNN Computation Graph for Parallel GPU Kernel Execution
Yuxuan Zhao, Qi Sun, Zhuolun He, Yang Bai, Bei Yu
AAAI Conference on Artificial Intelligence (AAAI) 2023
PCL: Proxy-based Contrastive Learning for Domain Generalization
Xufeng Yao, Yang Bai, Xinyun Zhang, Yuechen Zhang, Qi Sun, Ran Chen, Ruiyu Li, Bei Yu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022
GTuner: Tuning DNN Computations on GPU via Graph Attention Network
Qi Sun, Xinyun Zhang, Hao Geng, Yuxuan Zhao, Yang Bai, Haisheng Zheng, Bei Yu
ACM/IEEE Design Automation Conference (DAC) 2022
Fast and Efficient DNN Deployment via Deep Gaussian Transfer Learning
Qi Sun, Chen Bai, Tinghuan Chen, Hao Geng, Xinyun Zhang, Yang Bai, Bei Yu
IEEE International Conference on Computer Vision (ICCV) 2021
A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU
Wenqian Zhao, Qi Sun, Yang Bai, Haisheng Zheng, Wenbo Li, Bei Yu, Martin D.F. Wong
IEEE International Conference on Computer-Aided Design (ICCAD) 2021
AutoGTCO: Graph and Tensor Co-Optimize for Image Recognition with Transformers on GPU
Yang Bai, Xufeng Yao, Qi Sun, Bei Yu
IEEE International Conference on Computer-Aided Design (ICCAD) 2021
ACPNet: Anchor-Center Based Person Network for Human Pose Estimation and Instance Segmentation
Yang Bai, Weiqiang Wang
IEEE International Conference on Multimedia and Expo (ICME) 2019
Professional Activities
Conference Review:
Journal Review: