Ph.D. Candidate
Department of Computer Science and Engineering
The Chinese University of Hong Kong
GitHub
Curriculum Vitae
ybai62868 at gmail dot com

I'm a Ph.D. candidate in the Department of Computer Science and Engineering at The Chinese University of Hong Kong, working with Prof. Bei Yu. My research interests lie in using compilers to generate high-performance tensor programs or spatial accelerators:

  • Machine learning algorithms for automating system design
  • Compilers for optimizing deep learning accelerator design
  • Domain-specific languages for software/hardware co-design

Feel free to drop me an email if we share similar interests; I'm open to collaborations.

Education

[Aug 2020 - Now] The Chinese University of Hong Kong

Ph.D., Major in Computer Science.

[Sep 2017 - Jun 2020] Chinese Academy of Sciences

M.Eng., Major in Computer Science.

[Sep 2013 - Jun 2017] Xidian University

B.Eng., Major in Telecommunication Engineering.

Experience

[Jun 2019 - Oct 2019] Computer Systems Lab, Cornell University

Worked with Prof. Zhiru Zhang.
Multi-Paradigm Programming Infra for Software-Defined Heterogeneous Computing (HeteroCL).

Projects

OpenCL Backend Development for HeteroCL [code]

I developed the Xilinx and Intel OpenCL backends for the TVM-inspired HeteroCL. I implemented key compiler backend optimizations, e.g., loop unrolling, loop pipelining, and partitioning for the Xilinx OpenCL backend, and arbitrary-precision integers for the Intel OpenCL backend. I also implemented the whole pipeline from the Python-based domain-specific language to the FPGA-targeted compilation flow.

AWS-F1 Tutorial for KNN-DigitRec [tutorial]

I wrote tutorials for running the KNN-DigitRec example on AWS-F1 using HeteroCL. I designed a new target backend for AWS development that combines the HeteroCL Vivado HLS C++ code with a Rosetta-based host file to automatically generate the host and wrapper files for design automation.

Publications

ChatPattern: Layout Pattern Customization via Natural Language
Zixiao Wang, Yunheng Shen, Xufeng Yao, Wenqian Zhao, Yang Bai, Farzan Farnia, Bei Yu
ACM/IEEE Design Automation Conference (DAC) 2024

AdaOPC 2.0: Enhanced Adaptive Mask Optimization Framework for Via Layers
Wenqian Zhao, Xufeng Yao, Shuo Yin, Yang Bai, Ziyang Yu, Yuzhe Ma, Bei Yu, Martin D.F. Wong
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD)

ATFormer: A Learned Performance Model with Transfer Learning Across Devices for Deep Learning Tensor Programs
Yang Bai, Wenqian Zhao, Shuo Yin, Zixiao Wang, Bei Yu
Conference on Empirical Methods in Natural Language Processing (EMNLP-main Long Paper) 2023

SoC-Tuner: An Importance-guided Exploration Framework for DNN-targeting SoC Design
Shixin Chen, Su Zheng, Chen Bai, Wenqian Zhao, Shuo Yin, Yang Bai, Bei Yu
IEEE/ACM Asia and South Pacific Design Automation Conference (ASP-DAC) 2024

GTCO: Graph and Tensor Co-Design for Transformer-based Image Recognition on Tensor Cores
Yang Bai, Xufeng Yao, Qi Sun, Wenqian Zhao, Shixin Chen, Zixiao Wang, Bei Yu
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD)

ALCOP: Automatic Load-Compute Pipelining in Deep Learning Compiler for AI-GPUs
Guyue Huang, Yang Bai, Liu Liu, Yuke Wang, Bei Yu, Yufei Ding, Yuan Xie
Conference on Machine Learning and Systems (MLSys) 2023

DiffPattern: Layout Pattern Generation via Discrete Diffusion
Zixiao Wang, Yunheng Shen, Wenqian Zhao, Yang Bai, Guojin Chen, Farzan Farnia, Bei Yu
ACM/IEEE Design Automation Conference (DAC) 2023

A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU
Wenqian Zhao, Yang Bai, Qi Sun, Wenbo Li, Haisheng Zheng, Nianjuan Jiang, Jiangbo Lu, Bei Yu, Martin D.F. Wong
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD)

AutoGraph: Optimizing DNN Computation Graph for Parallel GPU Kernel Execution
Yuxuan Zhao, Qi Sun, Zhuolun He, Yang Bai, Bei Yu
AAAI Conference on Artificial Intelligence (AAAI) 2023

PCL: Proxy-based Contrastive Learning for Domain Generalization
Xufeng Yao, Yang Bai, Xinyun Zhang, Yuechen Zhang, Qi Sun, Ran Chen, Ruiyu Li, Bei Yu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022

GTuner: Tuning DNN Computations on GPU via Graph Attention Network
Qi Sun, Xinyun Zhang, Hao Geng, Yuxuan Zhao, Yang Bai, Haisheng Zheng, Bei Yu
ACM/IEEE Design Automation Conference (DAC) 2022

Fast and Efficient DNN Deployment via Deep Gaussian Transfer Learning
Qi Sun, Chen Bai, Tinghuan Chen, Hao Geng, Xinyun Zhang, Yang Bai, Bei Yu
IEEE International Conference on Computer Vision (ICCV) 2021

A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU
Wenqian Zhao, Qi Sun, Yang Bai, Haisheng Zheng, Wenbo Li, Bei Yu, Martin D.F. Wong
IEEE International Conference on Computer-Aided Design (ICCAD) 2021

AutoGTCO: Graph and Tensor Co-Optimize for Image Recognition with Transformers on GPU
Yang Bai, Xufeng Yao, Qi Sun, Bei Yu
IEEE International Conference on Computer-Aided Design (ICCAD) 2021