找工作
搜尋職缺
探索不同產業和地區的所有工作機會

搜尋公司
根據公司名稱尋找理想工作

主題專區
探索依特定主題或產業分類的工作機會
下載 CakeResume App
求職工具
履歷
使用我們的免費履歷工具，獲取理想職缺

作品集
分享你的作品集展現你的成功專案
履歷
使用我們的免費履歷工具，獲取理想職缺

履歷工具
免費製作、下載履歷

履歷模板
提供大量專業模板立即使用

履歷範例
從他人履歷獲取製作靈感

職業指南
各產業、職能的履歷教學與範例

履歷協助
從我們的招募團隊獲取關於履歷的專業建議
作品集
分享你的作品集展現你的成功專案

作品集工具
製作一份展現個人專業的作品集

作品集展示區
瀏覽他人的真實作品集，尋找靈感並進行人脈拓展
資源
資源
從豐富內容了解職業發展、求職策略等更多資訊
查看全部文章
求職指南
履歷
求職信
作品集＆個人品牌
面試技巧
求職新知
產業＆職位介紹
職涯發展
職涯規劃
職涯工具模板
職場人際溝通
職場管理學
人物／企業專訪
人物／企業專訪
雇主人資
人資營運
人資招募
CakeResume 特輯
團隊與企業文化
最新消息
活動分享
白皮書
2023 CakeResume 雇主品牌白皮書
2024 CakeResume MA 儲備幹部招募白皮書
2024 CakeResume 主動式徵才白皮書
精選文章
面試技巧
【自介範例】吸引人的面試自我介紹怎麼說？4 技巧完美活用自我介紹
閱讀更多
《科技職涯》Podcast
專門邀請在科技、數位等不同領域的工作者來分享他們的職涯趣事。
Apple Podcasts
Google Podcasts
Spotify
《職涯探險》Podcast
透過分享跨域思維與職涯選擇，啟發年輕人才實踐職涯目標和理想生活
Apple Podcasts
Google Podcasts
Spotify
徵才
人才搜尋引擎
搜尋履歷

職缺刊登
免費開始

獵才顧問
人才媒合服務

名義雇主（EoR）服務
在台灣建立企業團隊

雇主品牌推廣
建立和推廣您的雇主品牌
價格方案
職缺刊登價格方案

人才搜尋引擎價格方案

履歷製作價格方案
建立你的人脈
我的人脈
管理人脈及你的聯繫對象

CakeResume Meet
透過認識並連結其他使用者，擴大你的職涯人脈

社群
透過討論、活動參與與其他用戶交流
下載 CakeResume App

透過認識並連結其他使用者，擴大你的職涯人脈

CakeResume 找人才

進階搜尋

正在積極求職中

目前會考慮了解新的機會

目前沒有興趣尋找新的機會

Taiwan

台灣

Taipei City, Taiwan

台北市, 台灣

New Taipei City, Taiwan

Taipei, Taiwan

United States

新北市, 台灣

Taichung City, Taiwan

Indonesia

台中市, 台灣

Jakarta, Indonesia

Tainan City, Taiwan

United Kingdom

Hsinchu City, Taiwan

India

Kaohsiung City, Taiwan

Taiwan Province, Taiwan

Taoyuan City, Taiwan

Al Aḩmadī, Kuwait

軟體

經營、管理、商務

政府機關

設計

生物、醫藥

客戶服務

教育

工程研發

金融

物流、貿易

其他

建設

餐飲服務 / 食品相關

製造

行銷

文字編輯、新聞採訪、藝術演藝

業務

科技

工業

銀行 / 保險 / 金融

生醫 / 醫療

顧問 / 審計

教育 / 培訓 / 招聘

廣告 / 行銷 / 代理

農林漁牧業

健康 / 社會 / 環境

移動 / 運輸

建築設計

公司服務

文化 / 媒體 / 娛樂

設計 / 藝術

分銷

食品和飲料

飯店 / 旅遊 / 休閒

公共行政

服務

小於 1 年

1 到 2 年

2 到 4 年

4 到 6 年

6 到 10 年

10 到 15 年

15 年以上

AI 智慧配對

National Taiwan University

國立台灣大學

國立臺灣大學

National Yang Ming Chiao Tung University

國立陽明交通大學

National Chengchi University

國立政治大學

National Cheng Kung University

國立成功大學

National Tsing Hua University

國立清華大學

National Central University

National Taiwan University of Science and Technology

國立中央大學

國立台灣科技大學

國立臺灣科技大學

Feng Chia University

National Dong Hwa University

National Sun Yat-sen University

National Taipei University of Technology

National Taiwan Normal University

Yuan Ze University

元智大學

國立中山大學

國立台北科技大學

國立東華大學

國立臺北科技大學

國立臺灣師範大學

逢甲大學

Chung Yuan Christian University

Taiwan

台灣

Taipei City, Taiwan

台北市, 台灣

United States

New Taipei City, Taiwan

新北市, 台灣

Taichung City, Taiwan

台中市, 台灣

Japan

Singapore

日本

Hsinchu City, Taiwan

Indonesia

新竹市, 台灣

Australia

Taoyuan City, Taiwan

United Kingdom

桃園市, 台灣

Great Britain

全職

兼職

實習生

Python

Machine Learning

SQL

Docker

Data Analysis

Excel

AWS

Deep Learning

Linux

PowerPoint

無

有

1～5 人

5～10 人

10～15 人

15 人以上

一個月內

兩個月內

三個月內

半年內

一年內

超過一年

AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist

Data scientist

Data Analyst、Data Engineer、Data Scientist、Customer Experience Analyst

Data Scientist, Data Analyst, Machine Learning Engineer

Data Analyst/Data Scientist

Software engineer

Algorithm Engineer/ Data Scientist/ Sr. Project Management

Data Analyst 數據分析師 / Data Scientist 資料科學家

Data Scientist, Data Analyst, Machine Learning Engineer, Supply Chain Manager, Data Science Manager,

後端工程師

Bachelor of Business Administration (BBA)

Bachelor of Engineering (BEng)

Bachelor of Science (BS)

Bachelor’s Degree

Master of Business Administration (MBA)

Master of Science (MS)

Master’s Degree

Doctor of Philosophy (PhD)

Non-Degree Program (e.g. Coursera certificate)

Other

高中職

大學

碩士

博士

2023

2021

2020

2019

2018

2017

2016

2015

2014

2011

在職中

Off

全選

TSMC

Academia Sinica

Freelancer

Google

國立成功大學

緯創資通股份有限公司

ASUS

CM Visual Technology Corporation/微采視像科技股份有限公司

Coretronic Intelligent Cloud Service

Innolux Corporation/群創光電股份有限公司

對遠端工作有興趣

暫不考慮遠端工作

我只想遠端工作

全職接案者

兼職接案者

不提供接案服務

Chinese - 母語或雙語

English - 進階

English - 中階

English - 專業

English - 母語或雙語

Japanese - 初階

French - 母語或雙語

German - 初階

Chinese - 進階

Japanese - 中階

English

Chinese

Indonesian

Vietnamese

4 到 6 年

6 到 10 年

10 到 15 年

15 年以上

隱藏已讀結果
展開所有工作經驗

僅開放給付費企業

曾任

Career transition @Career Break

・

2024 ~ 2024

NLP Engineer / Data Scientist / Machine Learning Engineer

一個月內

Python

SQL

NLP

全職 / 對遠端工作有興趣

4 到 6 年

National Chengchi University

・

資訊科學系

升級以查看

僅開放給付費企業

曾任

Data Engineer @Rooit Inc. (XO App)

・

2023 ~ 2023

AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist

一個月內

Python

Data Analysis

Data Science

全職 / 對遠端工作有興趣

中國醫藥大學(China Medical University)

・

臨床醫學研究所

升級以查看

陳奕妤

曾任

Senior Data Analyst @趨勢科技

・

2022 ~ 現在

Data Scientist, Data Analyst, Machine Learning Engineer

一個月內

Cathy Chen Sr. Data Analyst Senior data analyst with over 6 years experience in ETL, data visualization, exploratory data analysis, machine learning, deep learning, customized online dashboard using SQL , R , Python and data analytics tools. Data Scientist, Data Analyst Taipei, Taiwan [email protected] Experience Sr. Data Analyst • TrendMicro NovNow Work with cross-functional teams(UI/UX designer, Front-end, Back-end, Marketing, PM, Sales) to provide related data, design metrics, report and dashboard. Cross app data tracking and user journey analysis. VisionOne customers engagement score - the metrics can help fields to

python

SQL

全職 / 對遠端工作有興趣

4 到 6 年

輔仁大學 Fu Jen Catholic University

・

統計資訊學系

陳勤霖

曾任

博士後研究員 @洛桑大學神經發育疾病實驗室

・

2023 ~ 2023

Data Scientist, Data Analyst, Machine Learning Engineer

一個月內

學腦科學實驗室 1. 神經電生理訊號分析、神經細胞追蹤分析，與藥理試驗。 2. 研究論文撰寫與國際研討會的舉辦。技能 Data Science Data Analysis, Image Analysis, Machine Learning, Deep Learning, Statistical Analysis, Data visualization Programming Python, PyTorch, NumPy, Pandas, Matplotlib, Scikit-Learn, Git, PostgreSQL, Docker Biotechnology Neuroscience, Genetics, Imaging, Scientific Writing Soft skill Project Management, Probelm Solving, Team Player, Proactive Communication 語言 English — 專業 Chinese — 母語或

Data Science

Data Analysis

Machine Learning

全職 / 對遠端工作有興趣

4 到 6 年

洛桑聯邦理工學院(EPFL)

・

神經科學

梁賦康（Foo-Hong, Leong）

Product Manager @東元電機股份有限公司 (TECO Electric & Machinery Co. Ltd.)

・

2023 ~ 2023

Data Scientist, Data Analyst, Machine Learning Engineer

一個月內

started to learn Python in 2018 at TEDU and my first project was the Stock Trend Prediction by CNN. I kept using Python to implement web crawling, OOP, and Pandas in my job, intend to let my work become more automated. I used those techniques to automate the data-gathering problem, which shorten the existing progress duration. I'm very passionate about Data Scientist and Machine Learning. Work Experience Product Manager • 東元電機股份有限公司 (TECO Electric & Machinery Co. Ltd.) JanuaryOctoberProduct Analytics 2. Market Trend Analytics 3

Python

Power BI

Data Analytics

全職 / 對遠端工作有興趣

6 到 10 年

國立成功大學 National Cheng Kung University

・

Mechanical Engineering

李孟霖

資深資料工程師 @緯創資通股份有限公司

・

2020 ~ 現在

Data Analyst、Data Engineer、Data Scientist、Customer Experience Analyst、Solution Architect、Cloud Architect

一個月內

作經歷緯創資通股份有限公司，2020 年 7 月年 3 月「HR Digital Transformation Team Leader」構想大型數位轉型專案，尋求資源並架構數位轉型藍圖（構想Data Center、人才運營平台等數轉專案） Azure HR Domain 負責人；Power Platform HR Domain 負責人；one of Wistron Microsoft Copilot Top 300 users 具Power BI講師及實習生帶領經驗「HR Data Center

python

PowerBI

Power Platform

全職 / 對遠端工作有興趣

4 到 6 年

元智大學 Yuan Ze University

・

工業工程與管理學所

僅開放給付費企業

曾任

Data Analyst @趨勢科技 TrendMicro

・

2021 ~ 2024

Data Analyst、Data Engineer、Data Scientist、Customer Experience Analyst

一個月內

PL/SQL

Python

全職 / 對遠端工作有興趣

6 到 10 年

天主教輔仁大學 FU JEN CATHOLIC UNIVERSITY

・

金融所

升級以查看

陶俊良

資料分析師 Data Analyst @Portto 門戶科技| Blocto

・

2022 ~ 2024

Data Analyst、Data Engineer、Data Scientist、Customer Experience Analyst

一個月內

Portto 門戶科技| Blocto • 九月三月 2024 Main Responsibilities: Establishing Data Pipeline Exploring new product features and competitor analysis on Dune Dashboard on the EVM User tagging for the Growth team (including Discord bot for monitoring Project details: Data Pipeline Regularly integrating client-side and BE data with external APIs and data collected by bots on Bigquery Establishing a systematic coding data table combined with Slack bot command manual and automatic data replenishment Daily data monitoring with Slack bot Planning client-side (app, sdk js) Amplitude event tracking to maximize data collection Using existing data to

python

MySQL

全職 / 對遠端工作有興趣

4 到 6 年

臺灣大學

・

流行病學與預防醫學所生物統計組

Vel Tien-Yun Wu

Data Engineer @Groundhog Technologies Inc.

・

2021 ~ 2024

Data Analyst、Data Engineer、Data Scientist、Customer Experience Analyst

一個月內

Vel Tien-Yun Wu I bring 5 years of hands-on experience in data engineering and software development, with a focus on building scalable data processing systems utilizing Hadoop, Spark, Kafka and Docker. My expertise in developing efficient ETL pipelines has been fundamental in optimizing data workflows for various data warehouses, enhancing data integrity and availability. My track record includes managing high-volume data pipelines, automating scheduling processes to improve operational efficiency, and deploying monitoring solutions that have reduced Mean-Time-To-Repair (MTTR) by 40%. I have a strong foundation in SQL, especially PostgreSQL, which enables

Git

Python

Scala

全職 / 對遠端工作有興趣

4 到 6 年

University of Illinois at Urbana-Champaign, School of Information Sciences

・

Information Management

Evan Wu

Back End Devel0per @英仕國際

・

2020 ~ 現在

Data Analyst 數據分析師 / Data Scientist 資料科學家

一個月內

Evan Wu Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat. Ut wisi enim ad minim veniam, quis nostrud. Taiwan 工作經歷 Back End Devel0per • 英仕國際三月Present Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat. Java Software Developer • iiNumbers, Inc. / 木刻思股份有限公司五月九月 2020 Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed

JAVA

Golang

SQL

全職 / 對遠端工作有興趣

10 到 15 年

National Chung Hsing University

・

Computer Science and Engineering

最輕量、快速的招募方案，數百家企業的選擇

搜尋履歷，主動聯繫求職者，提升招募效率。

瀏覽所有搜尋結果
每日可無限次數開啟陌生對話
搜尋僅開放付費企業檢視的履歷
檢視使用者信箱 & 電話

立即升級

7 天內退款保證，可隨時取消

1 2 3 4 5 6 7 8 9

搜尋技巧

嘗試搜尋最精準的關鍵字組合

資深後端 php laravel

如果結果不夠多，再逐一刪除較不重要的關鍵字

將須完全符合的字詞放在雙引號中

"社群行銷"

在不想搜尋到的字詞前面加上減號，如果想濾掉中文字，需搭配雙引號使用 (-"人資")

UI designer -UX

免費方案僅能搜尋公開履歷。

升級至進階方案，即可瀏覽所有搜尋結果（包含數萬筆覽僅在 CakeResume 平台上公開的履歷）。

立即升級

職場能力評價定義

專業技能

該領域中具備哪些專業能力（例如熟悉 SEO 操作，且會使用相關工具）。

問題解決能力

能洞察、分析問題，並擬定方案有效解決問題。

變通能力

遇到突發事件能冷靜應對，並隨時調整專案、客戶、技術的相對優先序。

溝通能力

有效傳達個人想法，且願意傾聽他人意見並給予反饋。

時間管理能力

了解工作項目的優先順序，有效運用時間，準時完成工作內容。

團隊合作能力

具有向心力與團隊責任感，願意傾聽他人意見並主動溝通協調。

領導力

專注於團隊發展，有效引領團隊採取行動，達成共同目標。

兩個月內

linsam

Sr. Data Engineer

17LIVE

・

2021 ~ 現在

Taipei, 台灣

專業背景

目前狀態

就職中

求職階段

目前會考慮了解新的機會

專業

數據工程師, Python 開發人員, 系統架構

產業

資訊服務

工作年資

4 到 6 年

管理經歷

無

技能

Python

MySQL

Linode

API Development

Linux

RabbitMQ

Celery

Nginx

Flask(Python)

Django(Python)

Git

docker swarm

Docker

docker-compose

Data Mining

Machine Learning

Traefik

Redis

ELK(ElasticSearch)

ELK

Prometheus

Grafana

Airflow

dolphindb

SQL

FastAPI

GKE

K8S

Real-Time Systems

GCP

語言能力

English

・

中階

求職偏好

希望獲得的職位

Data Solution Architect, Sr. Data Engineer, Data Engineer Manager

預期工作模式

全職

期望的工作地點

Taipei, 台灣, Taiwan

遠端工作意願

對遠端工作有興趣

接案服務

是，我利用業餘時間接案

學歷

學校

NDHU

主修科系

統計

列印

linsam

data engineer、backend engineer

• 0972724528 • 台灣 • [email protected]

5~6 years experience with data engineer and soft engineer. (Distributed Queue System, Database, Web Crawling, RESTful API, ETL, Docker, CICD, GCP, K8S, Airflow ...etc.)

1~2 years experience with data science. (data analysis, machine learning and deep learning)

Work Experience

17 Live - Senior Data Engineer (IC5), May. 2021 - now

• Refactor ETL, create a airflow project by Cloud Composer to transfer ETL tools from digdag to airflow and transfer ETL develop method from shell script to python.

• Maintenance BigQuery more than 100 tables.

• Create pipelines from mysql and mongo to bigquery.

• Create a good development culture, including the introduction of CICD, dev-stage-uat-master, release news, unit tests and test coverage.

• Using Airflow unified scheduler job, like cloud function scheduler, BQ scheduler, crontab, and ML model by R or Python ...etc.

• Reduce Data Team 25% cost.

• Create Data Team's first real-time ETL system via GKE, Pub/Sub and Memorystore for sending push notifications to users.

• Create Data Team's first API via GKE for ML model, include achieve graceful shutdown, and run stress test via ApacheBench, and setup auto-scaling by hpa. 95% latency is under 200ms and RPS is over 200.

• Create a Tagging System for tracking groups of users.

• Create a BigQuery Resource Monitor to monitor users BQ slot and query count usage.

• Create document culture by confluence.

• The finalists of Break the Norm awards on 2021-Q3 and 2021-Q4.

• Assist in interview more than 10 new data engineer.

• Mentor junior data engineers to be more effective individual contributors.

• Apply the data team's models to the company's APP. (automatically send push notifications and in-app messages)

• Automatically update recommend streamer list via data team's models to the company's APP.

SinoPac Holdings - Software Engineer(Python), Nov. 2019 - May. 2021

• Develop python Api (shioaji) for stock/option/future place orde and account.

• Develop C# Api (shioaji) for stock/option/future place orde and account, and setup CI/CD with GitHub actions.

• Deploy test system for simulate trading by docker swarm.

• Collecting distributed system Log by elk, grafana and prometheus. 13GB log data/daily.

• Monitor distributed system and alert chatbot.

• Develop a transaction-by-trade and odd lot trading API.

Open Up Summit Speaker ( FinMind ) - 2019-12-01

Tripresso - Data Engineer, Oct. 2018 - Nov. 2019

• Analysis travel data and build a machine learning model. Estimating increase 3% orders (revenue).

• Maintain and develop an ETL distributed queuing system with 20 machines.

• Optimize the ETL system reduced more than 50% execution time.

• Develop new product crawler let product volume increase 1.5%.

• Making analysis BI charts provide for other departments.

Mandatory Military Service，Oct. 2017 - Oct. 2018

NDHU - RA, Mar. 2016 - Aug. 2017

Analysing G7 financial data. Model validation and parameter estimation by regression models ( SUR, MLE, Bootstrapping ). And comparing single equation estimators and confidence interval with system equation.

NDHU - TA, Sep. 2015 - Jul. 2017

Calculus, Linear Algebra, Statistics.

Projects

FinMind Open data Api

Open source financial data, more than 50 dataset, provide Api.

More than 2,000 people registered.

2,000 stars on github.

Automatic update daily by docker swarm, distributed queue system rabbitmq and celery ( 10 cloud machines ).

Total more than 1 billion data, 10 million streaming data per day.

Architecture diagram.

Bosch Production Line Performance - Kaggle Post-competition analysis, top 6% rank.

Highly imbalance data, ratio is 1000 : 1, 10 GB dataset size. And the data is 50% missing value. More than 4000 variables, but I build models by only 50 features.

Rossmann Store Sales - Kaggle

Post-competition analysis, top 10% rank.

Time series problem. Building models predict sales after 48 days.

Grupo Bimbo Inventory Demand - Kaggle

Post-competition analysis, top 8% rank.

Time series problem, eighty millions data size. Building models predict inventory demand after 2 weeks.

Instacart Market Basket Analysis - Kaggle

Real competition, top 25% rank.

Predicting which products will an consumer purchase again.

Verification code to text

Create python package of Taiwan Train Verification Code to text.

The model is made by keras-CNN.

Skills

Distributed Queue System

1. Rabbitmq & Celery & Flower.

2. 8 nodes ( Cloud ) distributed queue system for web crawling.

3. Deploy by Docker and GKE.

4. Graceful Shutdown.

Database

1. MySQL ( RDBMS ).

2. Redis ( NoSQL ).

3. Dolphindb ( TSDB ).

GCP

1. Pub/Sub.

2. GKE ( K8S ).

3. GCE.

4. BQ.

5. Composer.

6. MemoryStore.

CI/CD

1. Create automated tests and automated deploy for the FinMind team.

2. Using gitlab runner.

3. CD for auto publish python package.

4. CD for auto update and deploy new version service.

Log Collect & Monitor

1. Distributed system log collect by elk.

2. Prometheus and Grafana. Monitor user usage, request latency, request count

3. Monitor by telegram bot and slackbot.

4. Monitor vm and container by Netdata and cadvisor.

data pipeline

1. Design data pipeline for crawler, backend and analysis by airflow.

2. Design more 200 ETL by airflow.

3. Build airflow by composer

4. Build a real-time pipeline for sending push notifications to users

Machine Learning

xgboost, random forest, svm. statistics - ols, lasso.

Web Crawling

1. Python - request, BeautifulSoup, lxml, selenium.

2. Auto recognition captcha code by CNN model.

Data Mining

Python - numpy, pandas, sklearn.

R - parallel, dplyr, data.table, mice.

WEB

1. https://finmindtrade.com/

2. nginx

3. frontend - vue

4. backend - python

5. traefik.

API

1. FastAPI.

2. Websocket.

3. Loading Balance.

4. Async.

5. Graceful Shutdown.

Stress Test

1. ApacheBench.

2. Upper bound of FinMind api is 8000/minute request.

Education

National Dong Hwa University, Master of Science, Sep. 2017.

Major : Mathematics and Statistics.

Tamkang University. Bachelor of Science, Sep. 2015.

Major : Mathematics

Languages

R, Python. Basic in English and proficient in Chinese.

履歷

個人檔案

列印

linsam

data engineer、backend engineer

• 0972724528 • 台灣 • [email protected]

5~6 years experience with data engineer and soft engineer. (Distributed Queue System, Database, Web Crawling, RESTful API, ETL, Docker, CICD, GCP, K8S, Airflow ...etc.)

1~2 years experience with data science. (data analysis, machine learning and deep learning)

Work Experience

17 Live - Senior Data Engineer (IC5), May. 2021 - now

• Refactor ETL, create a airflow project by Cloud Composer to transfer ETL tools from digdag to airflow and transfer ETL develop method from shell script to python.

• Maintenance BigQuery more than 100 tables.

• Create pipelines from mysql and mongo to bigquery.

• Create a good development culture, including the introduction of CICD, dev-stage-uat-master, release news, unit tests and test coverage.

• Using Airflow unified scheduler job, like cloud function scheduler, BQ scheduler, crontab, and ML model by R or Python ...etc.

• Reduce Data Team 25% cost.

• Create Data Team's first real-time ETL system via GKE, Pub/Sub and Memorystore for sending push notifications to users.

• Create a Tagging System for tracking groups of users.

• Create a BigQuery Resource Monitor to monitor users BQ slot and query count usage.

• Create document culture by confluence.

• The finalists of Break the Norm awards on 2021-Q3 and 2021-Q4.

• Assist in interview more than 10 new data engineer.

• Mentor junior data engineers to be more effective individual contributors.

• Apply the data team's models to the company's APP. (automatically send push notifications and in-app messages)

• Automatically update recommend streamer list via data team's models to the company's APP.

SinoPac Holdings - Software Engineer(Python), Nov. 2019 - May. 2021

• Develop python Api (shioaji) for stock/option/future place orde and account.

• Develop C# Api (shioaji) for stock/option/future place orde and account, and setup CI/CD with GitHub actions.

• Deploy test system for simulate trading by docker swarm.

• Collecting distributed system Log by elk, grafana and prometheus. 13GB log data/daily.

• Monitor distributed system and alert chatbot.

• Develop a transaction-by-trade and odd lot trading API.

Open Up Summit Speaker ( FinMind ) - 2019-12-01

Tripresso - Data Engineer, Oct. 2018 - Nov. 2019

• Analysis travel data and build a machine learning model. Estimating increase 3% orders (revenue).

• Maintain and develop an ETL distributed queuing system with 20 machines.

• Optimize the ETL system reduced more than 50% execution time.

• Develop new product crawler let product volume increase 1.5%.

• Making analysis BI charts provide for other departments.

Mandatory Military Service，Oct. 2017 - Oct. 2018

NDHU - RA, Mar. 2016 - Aug. 2017

NDHU - TA, Sep. 2015 - Jul. 2017

Calculus, Linear Algebra, Statistics.

Projects

FinMind Open data Api

Open source financial data, more than 50 dataset, provide Api.

More than 2,000 people registered.

2,000 stars on github.

Automatic update daily by docker swarm, distributed queue system rabbitmq and celery ( 10 cloud machines ).

Total more than 1 billion data, 10 million streaming data per day.

Architecture diagram.

Bosch Production Line Performance - Kaggle Post-competition analysis, top 6% rank.

Highly imbalance data, ratio is 1000 : 1, 10 GB dataset size. And the data is 50% missing value. More than 4000 variables, but I build models by only 50 features.

Rossmann Store Sales - Kaggle

Post-competition analysis, top 10% rank.

Time series problem. Building models predict sales after 48 days.

Grupo Bimbo Inventory Demand - Kaggle

Post-competition analysis, top 8% rank.

Time series problem, eighty millions data size. Building models predict inventory demand after 2 weeks.

Instacart Market Basket Analysis - Kaggle

Real competition, top 25% rank.

Predicting which products will an consumer purchase again.

Verification code to text

Create python package of Taiwan Train Verification Code to text.

The model is made by keras-CNN.

Skills

Distributed Queue System

1. Rabbitmq & Celery & Flower.

2. 8 nodes ( Cloud ) distributed queue system for web crawling.

3. Deploy by Docker and GKE.

4. Graceful Shutdown.

Database

1. MySQL ( RDBMS ).

2. Redis ( NoSQL ).

3. Dolphindb ( TSDB ).

GCP

1. Pub/Sub.

2. GKE ( K8S ).

3. GCE.

4. BQ.

5. Composer.

6. MemoryStore.

CI/CD

1. Create automated tests and automated deploy for the FinMind team.

2. Using gitlab runner.

3. CD for auto publish python package.

4. CD for auto update and deploy new version service.

Log Collect & Monitor

1. Distributed system log collect by elk.

2. Prometheus and Grafana. Monitor user usage, request latency, request count

3. Monitor by telegram bot and slackbot.

4. Monitor vm and container by Netdata and cadvisor.

data pipeline

1. Design data pipeline for crawler, backend and analysis by airflow.

2. Design more 200 ETL by airflow.

3. Build airflow by composer

4. Build a real-time pipeline for sending push notifications to users

Machine Learning

xgboost, random forest, svm. statistics - ols, lasso.

Web Crawling

1. Python - request, BeautifulSoup, lxml, selenium.

2. Auto recognition captcha code by CNN model.

Data Mining

Python - numpy, pandas, sklearn.

R - parallel, dplyr, data.table, mice.

WEB

1. https://finmindtrade.com/

2. nginx

3. frontend - vue

4. backend - python

5. traefik.

API

1. FastAPI.

2. Websocket.

3. Loading Balance.

4. Async.

5. Graceful Shutdown.

Stress Test

1. ApacheBench.

2. Upper bound of FinMind api is 8000/minute request.

Education

National Dong Hwa University, Master of Science, Sep. 2017.

Major : Mathematics and Statistics.

Tamkang University. Bachelor of Science, Sep. 2015.

Major : Mathematics

Languages

R, Python. Basic in English and proficient in Chinese.