吳啓玄 (Alex Wu)

  Hsinchu City, Taiwan 300


About me

I am a graduated student in National Yang Ming Chiao Tung University,Taiwan. My thesis is "Model-Agnostic Training with Sample Selection for Fair and Robust Learning". 

I am familiar with  deep learning related technologies, including trustworthy AI and computer vision.

In addition, I am also interested and passionate in information technology, including database, cloud, software design and development.

        

Education


Sep 2021 - Jul 2023

National Yang Ming Chiao Tung University (NYCU)

Master's Degree, Institute Of Information Management



Sep 2017 - Jun 2021

National Taipei University (NTPU)

Bachelor's Degree, Statistics

Sep 2014 - Jun 2017

National Hsinchu Senior High School

Professional Skills

Subjects


  • Deep Learning
    • Trustworthy AI
    • Computer Vision
    • Optimization
  • Data Mining
  • Machine Learning
  • Statistical Modeling
  • Algorithm
  • Cloud Computing
  • Database


Skills


  • Python, C, C++
  • PyTorch, Tensorflow, Keras
  • LINUX
  • GIT, Bitbucket, Jira
  • Airflow, MLflow
  • AWS, Azure
  • Google Cloud Platform
    • BigQuery, GCS
    • Composer, Pub/Sub
    • Function,  Scheduler

Language


  • Chinese
  • English (TOEIC 720)

AI/ML Related Course Record


Deep Learning 

CM, NYCU (交大電信所)

Spring 2022, GRADE: A+

https://github.com/alexwu0911/Deep-Learning-2022-Spring

  • Implement FNN gradient descent using only python numpy package.
  • Establish CNN and ResNet to classify images by PyTorch.
  • Establish Transformer to classify news documents by PyTorch, Top 2 accuracy in class .
  • Implement VAE and GAN to create anime face by pyTorch.
  • Build DQN and Actor-Critc RL model on Lunarlander using PyTorch.

Machine Learning

EE, NYCU (交大電子所) & MTK

Spring 2022, GRADE: A

https://github.com/alexwu0911/Machine-Learning-2022-Spring

  • Train an image classification model on CIFAR-100  with limited size, 100MB.
    • Build ResNet-50 and use SAM Optimizer (2021, Google). 
    • Achieve almost 83% accuracy.
  • Final project is building road image segmentaion model on mediaTek chip. 
    • Using U-Net and post-training quantization to compress tflite model.
    • Achieve 0.735 mIOU, Top 3 in class.

Data Mining

IIM, NYCU (交大資管所)

Fall 2021, GRADE: A 

https://github.com/alexwu0911/Data-Mining-2021-Fall

  • Using Keras, scikit-learn build ml model such as  random forest, xgboost, LSTM... to do sentiment analysis, time series prediction etc.
  • Construct Hadoop & Pyspark to compute matrix factorization distributedly.

Work Experience



AI Engineer Intern

MediaTek Inc. (AIDE/AI Team)

Jan 2023 - Jun 2023
Hsinchu City, Taiwan

  • Wafer parametric map anomaly auto recognition (Self-Supervised learning).
  • Solve wafer map missing value issue by per-pixel mean, global mean method.
  • Build wafer map anomaly pattern classifier by PyTorch Lightning and Hydra.

Data Engineer Intern

MediaTek Inc. (AIDE/DE Team)

Jun 2022 - Dec 2022
Hsinchu City, Taiwan

  • Supporting AI project in data pipeline and ETL.
  • Build statistics and completeness KPI of test item and die in CP、FT、SLT three test stage.
  • Using Python Airflow and Big Query、 Google Cloud Storage to establish auto data schedulers.
  • Construct loosely-coupled structure by pub/sub service on GCP.





AI/ML Related Competition


2022 TSMC x Microsoft Careerhack 

TSMC & Microsoft

Jan 2022

Work: Energy-Efficient Dashboard

The IOT device connects to the home appliance, transmits the collected electricity consumption data to the azure IoT center, and uses python to capture the data in the blob storage to organize the data and present it on the smart dashboard, providing home appliance users with a clearer understanding of home appliances power consumption details and electricity bill trial calculation, so as to achieve smart power consumption and energy saving.


 

The 6th ETC Traffic Data Competition

MOTC, ROC

Aug 2020

Work:  CATS - Clog AlerT System

After using a variety of models to build, the best prediction model for the backlog mileage is obtained, and the influence of different variables on the backlog mileage is explored. By exploring the importance of variables and building models, the possible back-blocking impacts after future national highway accidents can be dealt with more quantitatively and systematically.

2020 NTPU Stats Research Project Competition

NTPU Stats

Jun 2020

Work: P2P Default risk forecast

Using Logistic regression and random forest to construct a default prediction model of P2P online lending platform, the data is obtained from Lending Club, from 2016~2019.





Certifications


TOEIC

720

Apr 2018

 

Big Data Analyst and Machine Learning Engineer 

(Industry Professional Assessment System, IPAS)

Nov 2021, Dec 2022