Zong-Ci Lu (Serge) [email protected] Experience AMD,2022/05 - present Develop GEMM kernel for AMD GPU in GCN assembly Study on different stratedies on GPU kernel fusion such as GEMM + GEMM and GEMM + Softmax + GEMM Appier,2022//05 Developed API for AIQUA service Skymizer,2020//03 TensorFlow integration with DLA (Deep Learning Accelerator) Sped up float to int8 quantization by using x86 SIMD instructions(10%~50% improvement depends on batch size) Developed customized neural network visualization tool to help developers to debug graph partitioning result Amended forward shape
C++
Python
Employed
Full-time / Interested in working remotely
6-10 years
National Tsing Hua University
・
Mathematics
The Most Lightweight and Effective Recruiting Plan
Search resumes and take the initiative to contact job applicants for higher recruiting efficiency. The Choice of Hundreds of Companies.