Email: [email protected]
Tel: 0934039069
Python
C++
TensorFlow
Pandas
NumPy
Matplotlib
Data Processing
Web Crawler
Machine Learning
Practices of Deep Learning
Cloud Computing and Services
Practice of Social Media Analytics
Neural Networks
Information Retrieval
Algorithm Design and Application
- Familiar with RESTful APIs; connected internal company services to one another (Gerrit, Jenkins, Jira)
- CI/CD integration: automated the code-review workflow and automated verification of code security
- Familiar with WAF builds; improved the software build system
- Familiar with Git version control
- Implemented the Hyperedge Aware Greedy (HAG) algorithm
- Improved the performance of the HAG algorithm
Hang-Yang Wu, Yi-Ling Chen, "Graph Sparsification with Generative Adversarial Network", IEEE International Conference on Data Mining (ICDM 2020)
After completing these courses, I applied for the internship program at Academia Sinica. During my time there, my job was to improve the performance of the algorithm.
We built a deep learning model to predict weather profiles, mainly tides, waves, and wind direction, targeted at people who love surfing.
Graph sparsification aims to reduce the number of edges of a network while maintaining its accuracy for given tasks. In this study, we propose a novel method that sparsifies networks for community detection tasks. Our method captures relationships that are not shown in the original graph but are relatively important, and creates artificial edges to reflect these relationships, further increasing the effectiveness of the community detection task.
This is the seed diffusion algorithm I implemented during my internship at Academia Sinica. It can be applied to hypergraphs. In addition, I implemented three pruning techniques to speed up the execution time, and used multithreading to parallelize the algorithm.
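The project's actual seed diffusion algorithm and its three pruning techniques are not specified here; as a generic sketch of the idea, below is greedy seed selection over a hypergraph (maximizing hyperedge coverage) with CELF-style lazy evaluation, a standard pruning for this kind of objective. The objective and all names are illustrative assumptions, not the project's implementation.

```python
import heapq

def greedy_seeds(hyperedges, k):
    """Greedily pick k seed nodes that maximize hyperedge coverage.

    Pruning: CELF-style lazy evaluation. A node's marginal gain is only
    re-evaluated when it reaches the top of the priority queue, because
    gains can only shrink as more seeds are selected (submodularity).
    """
    nodes = {v for e in hyperedges for v in e}
    covered = set()  # indices of hyperedges already covered

    def gain(v):
        return sum(1 for i, e in enumerate(hyperedges)
                   if i not in covered and v in e)

    # Max-heap of (-gain, node, round-when-gain-was-computed).
    heap = [(-gain(v), v, 0) for v in nodes]
    heapq.heapify(heap)
    seeds = []
    for rnd in range(1, k + 1):
        while True:
            neg, v, stamp = heapq.heappop(heap)
            if stamp == rnd:                 # gain is up to date: select it
                seeds.append(v)
                covered |= {i for i, e in enumerate(hyperedges) if v in e}
                break
            heapq.heappush(heap, (-gain(v), v, rnd))  # stale: recompute
    return seeds
```

A parallel variant could recompute the stale gains of several heap entries in a thread pool, since each evaluation is independent.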
The order of words affects the overall semantics. Using deep learning, the first and last words serve as input and the middle word as output. If each word is represented as a one-hot vector, data-sparsity problems may occur; to avoid this, the word embedding technique is used to obtain a compressed vector representation of the text.
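The contrast between one-hot vectors and embeddings can be sketched in a few lines of NumPy; the tiny vocabulary, dimensions, and averaging of the two context words are illustrative assumptions, not the project's actual model.

```python
import numpy as np

rng = np.random.default_rng(0)
vocab = ["the", "cat", "sat", "on", "mat"]
V, D = len(vocab), 3                 # vocabulary size vs. embedding dimension
idx = {w: i for i, w in enumerate(vocab)}

# One-hot: each word is a length-V vector with a single 1 (sparse, grows with V).
one_hot = np.eye(V)

# Embedding: a trainable V x D matrix; a row lookup replaces the one-hot vector.
E = rng.normal(size=(V, D))

def context_vector(first, last):
    """Average the embeddings of the first and last words; this compressed
    representation would feed the model that predicts the middle word."""
    return (E[idx[first]] + E[idx[last]]) / 2

v = context_vector("the", "sat")
print(v.shape)  # (3,)  -- D dimensions instead of V one-hot dimensions
```

In a real model `E` would be learned end-to-end rather than drawn at random.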
This is a 2015 ECML PKDD competition in which I used deep learning approaches to predict taxi driving time. Although the competition had already closed when I joined it on Kaggle, I still submitted my result and scored 0.60 (mean squared error); the first-place score was 0.53.
From a probabilistic perspective, the Expectation-Maximization algorithm is used to calculate, for each document, the probability of generating each topic, and this is used to match documents against the query. Through this project I gained deeper knowledge of information retrieval.
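This document-topic-word setup fitted by EM is the shape of PLSA; a minimal sketch on a document-term count matrix is below. The factorization into P(z|d) and P(w|z), the iteration count, and the initialization are standard-textbook assumptions, not the project's exact formulation.

```python
import numpy as np

def plsa(counts, n_topics, n_iter=50, seed=0):
    """Minimal PLSA fitted with EM.

    counts : (n_docs, n_words) array of term counts n(d, w)
    Returns P(z|d) with shape (n_docs, n_topics)
        and P(w|z) with shape (n_topics, n_words).
    """
    rng = np.random.default_rng(seed)
    n_docs, n_words = counts.shape
    p_z_d = rng.random((n_docs, n_topics))
    p_z_d /= p_z_d.sum(1, keepdims=True)
    p_w_z = rng.random((n_topics, n_words))
    p_w_z /= p_w_z.sum(1, keepdims=True)
    for _ in range(n_iter):
        # E-step: P(z|d,w) proportional to P(z|d) * P(w|z)
        post = p_z_d[:, :, None] * p_w_z[None, :, :]       # (d, z, w)
        post /= post.sum(1, keepdims=True) + 1e-12
        weighted = counts[:, None, :] * post               # n(d,w) * P(z|d,w)
        # M-step: re-estimate both distributions from the posteriors
        p_z_d = weighted.sum(2)
        p_z_d /= p_z_d.sum(1, keepdims=True)
        p_w_z = weighted.sum(0)
        p_w_z /= p_w_z.sum(1, keepdims=True)
    return p_z_d, p_w_z
```

For retrieval, a query can then be scored against each document through the shared topic distributions.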
The classic document retrieval approach converts the documents and the query into vectors, then calculates the cosine similarity between the query and each document. Although this is just a basic approach and not an advanced one, it helped me understand how a search engine works.
MNIST handwriting recognition is a very basic machine learning example, so as a machine learning beginner I also started with it.
To become familiar with Python and machine learning, I worked through some practice projects from the internet. This one is a water level prediction built with the scikit-learn toolkit.