CakeResume Talent Search

Formerly
Career transition @Career Break
2024 ~ 2024
NLP Engineer / Data Scientist / Machine Learning Engineer
Within one month
Python
SQL
NLP
Unemployed
Actively looking for a job
Full-time / Interested in remote work
4 to 6 years
National Chengchi University
Department of Computer Science
Formerly
Data Engineer @Rooit Inc. (XO App)
2023 ~ 2023
AI Engineer, Machine Learning Engineer, Deep Learning Engineer, Data Scientist
Within one month
Python
Data Analysis
Data Science
Unemployed
Actively looking for a job
Full-time / Interested in remote work
6 to 10 years
China Medical University
Graduate Institute of Clinical Medicine
陳奕妤
Formerly
Senior Data Analyst @趨勢科技 (Trend Micro)
2022 ~ Present
Data Scientist, Data Analyst, Machine Learning Engineer
Within one month
Cathy Chen, Sr. Data Analyst. Senior data analyst with over 6 years of experience in ETL, data visualization, exploratory data analysis, machine learning, deep learning, and customized online dashboards using SQL, R, Python, and data analytics tools. Data Scientist, Data Analyst. Taipei, Taiwan. [email protected] Experience: Sr. Data Analyst • TrendMicro, Nov ~ Now. Work with cross-functional teams (UI/UX designers, front-end, back-end, marketing, PM, sales) to provide related data and design metrics, reports, and dashboards. Cross-app data tracking and user journey analysis. VisionOne customer engagement score - the metrics can help fields to
Python
R
SQL
Unemployed
Actively looking for a job
Full-time / Interested in remote work
4 to 6 years
Fu Jen Catholic University
Department of Statistics and Information Science
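陳勤霖
Formerly
Postdoctoral Researcher @University of Lausanne, Neurodevelopmental Disorders Laboratory
2023 ~ 2023
Data Scientist, Data Analyst, Machine Learning Engineer
Within one month
Brain Science Laboratory 1. Neural electrophysiological signal analysis, neuronal cell tracking analysis, and pharmacological testing. 2. Writing research papers and organizing international conferences. Skills: Data Science: Data Analysis, Image Analysis, Machine Learning, Deep Learning, Statistical Analysis, Data Visualization. Programming: Python, PyTorch, NumPy, Pandas, Matplotlib, Scikit-Learn, Git, PostgreSQL, Docker. Biotechnology: Neuroscience, Genetics, Imaging, Scientific Writing. Soft skills: Project Management, Problem Solving, Team Player, Proactive Communication. Languages: English: Professional; Chinese: Native or
Data Science
Data Analysis
Machine Learning
Unemployed
Actively looking for a job
Full-time / Interested in remote work
4 to 6 years
École Polytechnique Fédérale de Lausanne (EPFL)
Neuroscience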
梁賦康 (Foo-Hong, Leong)
Product Manager @東元電機股份有限公司 (TECO Electric & Machinery Co. Ltd.)
2023 ~ 2023
Data Scientist, Data Analyst, Machine Learning Engineer
Within one month
started to learn Python in 2018 at TEDU, and my first project was Stock Trend Prediction by CNN. I kept using Python for web crawling, OOP, and Pandas in my job to make my work more automated. I used those techniques to automate data gathering, which shortened the existing process duration. I'm very passionate about Data Science and Machine Learning. Work Experience: Product Manager • 東元電機股份有限公司 (TECO Electric & Machinery Co. Ltd.), January ~ October. Product Analytics 2. Market Trend Analytics 3
Python
Power BI
Data Analytics
Employed
Actively looking for a job
Full-time / Interested in remote work
6 to 10 years
National Cheng Kung University
Mechanical Engineering
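李孟霖
Senior Data Engineer @緯創資通股份有限公司 (Wistron Corporation)
2020 ~ Present
Data Analyst, Data Engineer, Data Scientist, Customer Experience Analyst, Solution Architect, Cloud Architect
Within one month
Work Experience: 緯創資通股份有限公司 (Wistron Corporation), July 2020 ~ March. "HR Digital Transformation Team Leader": conceived large-scale digital transformation projects, sought resources, and architected the digital transformation blueprint (conceived projects such as the Data Center and the talent operations platform). Azure HR Domain owner; Power Platform HR Domain owner; one of Wistron's Microsoft Copilot Top 300 users. Experience as a Power BI instructor and in leading interns. "HR Data Center
Python
PowerBI
Power Platform
Employed
Actively looking for a job
Full-time / Interested in remote work
4 to 6 years
Yuan Ze University
Graduate Institute of Industrial Engineering and Management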
Formerly
Data Analyst @趨勢科技 (Trend Micro)
2021 ~ 2024
Data Analyst, Data Engineer, Data Scientist, Customer Experience Analyst
Within one month
R
PL/SQL
Python
Unemployed
Actively looking for a job
Full-time / Interested in remote work
6 to 10 years
Fu Jen Catholic University
Graduate Institute of Finance
陶俊良
Data Analyst @Portto 門戶科技 | Blocto
2022 ~ 2024
Data Analyst, Data Engineer, Data Scientist, Customer Experience Analyst
Within one month
Portto 門戶科技 | Blocto • September ~ March 2024. Main responsibilities: establishing the data pipeline; exploring new product features and competitor analysis on Dune dashboards on the EVM; user tagging for the Growth team (including a Discord bot for monitoring). Project details: Data pipeline: regularly integrating client-side and BE data with external APIs and data collected by bots on BigQuery; establishing a systematic coding data table combined with Slack bot command manual and automatic data replenishment; daily data monitoring with a Slack bot; planning client-side (app, SDK JS) Amplitude event tracking to maximize data collection. Using existing data to
Python
R
MySQL
Employed
Actively looking for a job
Full-time / Interested in remote work
4 to 6 years
National Taiwan University
Institute of Epidemiology and Preventive Medicine, Biostatistics Division
Vel Tien-Yun Wu
Data Engineer @Groundhog Technologies Inc.
2021 ~ 2024
Data Analyst, Data Engineer, Data Scientist, Customer Experience Analyst
Within one month
Vel Tien-Yun Wu I bring 5 years of hands-on experience in data engineering and software development, with a focus on building scalable data processing systems utilizing Hadoop, Spark, Kafka and Docker. My expertise in developing efficient ETL pipelines has been fundamental in optimizing data workflows for various data warehouses, enhancing data integrity and availability. My track record includes managing high-volume data pipelines, automating scheduling processes to improve operational efficiency, and deploying monitoring solutions that have reduced Mean-Time-To-Repair (MTTR) by 40%. I have a strong foundation in SQL, especially PostgreSQL, which enables
Git
Python
Scala
Employed
Actively looking for a job
Full-time / Interested in remote work
4 to 6 years
University of Illinois at Urbana-Champaign, School of Information Sciences
Information Management
Evan Wu
Back End Developer @英仕國際
2020 ~ Present
Data Analyst / Data Scientist
Within one month
Evan Wu Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat. Ut wisi enim ad minim veniam, quis nostrud. Taiwan. Work Experience: Back End Developer • 英仕國際, March ~ Present. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed diam nonummy nibh euismod tincidunt ut laoreet dolore magna aliquam erat volutpat. Java Software Developer • iiNumbers, Inc. / 木刻思股份有限公司, May ~ September 2020. Lorem ipsum dolor sit amet, consectetuer adipiscing elit, sed
Java
Golang
SQL
Employed
Actively looking for a job
Full-time / Interested in remote work
10 to 15 years
National Chung Hsing University
Computer Science and Engineering

Search tips
1. Search a precise keyword combination
senior backend php
If there are not enough search results, remove the less important keywords.
2. Use quotes to search for an exact phrase
"business development"
3. Use the minus sign to exclude results containing certain words
UI designer -UX
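
The three query forms above (plain keywords, quoted phrases, and minus-prefixed exclusions) can be illustrated with a small sketch. This is not CakeResume's implementation: the `ParsedQuery` class, the `parse_query` and `matches` functions, and the matching rules (every keyword required, case-insensitive, whole-word matching) are all assumptions made for illustration only.

```python
import re
from dataclasses import dataclass, field


@dataclass
class ParsedQuery:
    # Hypothetical container; the field names are illustrative assumptions.
    keywords: list = field(default_factory=list)  # plain terms
    phrases: list = field(default_factory=list)   # exact phrases from "quotes"
    excluded: list = field(default_factory=list)  # terms prefixed with "-"


def parse_query(query: str) -> ParsedQuery:
    """Split a query into keywords, quoted phrases, and excluded terms."""
    parsed = ParsedQuery()
    # Either a double-quoted phrase or a run of non-space characters.
    for phrase, token in re.findall(r'"([^"]+)"|(\S+)', query):
        if phrase:
            parsed.phrases.append(phrase)
        elif token.startswith("-") and len(token) > 1:
            parsed.excluded.append(token[1:])
        else:
            parsed.keywords.append(token)
    return parsed


def matches(parsed: ParsedQuery, text: str) -> bool:
    """Assumed semantics: all keywords and phrases present, no excluded word."""
    lowered = text.lower()
    words = set(re.findall(r"\w+", lowered))
    return (
        all(k.lower() in words for k in parsed.keywords)
        and all(p.lower() in lowered for p in parsed.phrases)
        and not any(e.lower() in words for e in parsed.excluded)
    )


if __name__ == "__main__":
    q = parse_query("UI designer -UX")
    print(q)
    print(matches(q, "UI designer with branding experience"))
    print(matches(q, "UI/UX designer"))
```

Under these assumed rules, removing a keyword from the query can only widen the result set, which is consistent with the advice in tip 1.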

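Workplace competency rating definitions

Professional skills
The professional abilities held in the field (for example, familiarity with SEO practice and the ability to use related tools).
Problem solving
Able to identify and analyze problems and devise plans that resolve them effectively.
Adaptability
Able to respond calmly to unexpected events and adjust the relative priority of projects, clients, and technologies at any time.
Communication
Conveys personal ideas effectively, and is willing to listen to others' opinions and give feedback.
Time management
Understands the priority of work items, uses time effectively, and completes work on time.
Teamwork
Has a sense of cohesion and team responsibility, and is willing to listen to others' opinions and proactively communicate and coordinate.
Leadership
Focuses on team development and effectively leads the team to take action and achieve common goals.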
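Within two months
Sr. Data Engineer
17LIVE
2021 ~ Present
Taipei, Taiwan
Professional background
Current status
Employed
Job search status
Currently open to learning about new opportunities
Profession
Data Engineer, Python Developer, System Architecture
Industry
Information services
Work experience
4 to 6 years
Management experience
Skills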
Python
MySQL
Linode
API Development
Linux
RabbitMQ
Celery
Nginx
Flask(Python)
Django(Python)
Git
docker swarm
Docker
docker-compose
Data Mining
Machine Learning
Traefik
Redis
ELK(ElasticSearch)
ELK
Prometheus
Grafana
Airflow
dolphindb
SQL
FastAPI
GKE
K8S
Real-Time Systems
GCP
Languages
English
Intermediate
Job preferences
Desired positions
Data Solution Architect, Sr. Data Engineer, Data Engineer Manager
Expected work type
Full-time
Desired work location
Taipei, Taiwan
Remote work preference
Interested in remote work
Freelance services
Yes, I take on freelance work in my spare time
Education
School
NDHU
Major
Statistics

linsam

Data Engineer, Backend Engineer

• 0972724528 • Taiwan • [email protected]

5~6 years of experience in data engineering and software engineering (distributed queue systems, databases, web crawling, RESTful APIs, ETL, Docker, CI/CD, GCP, K8S, Airflow, etc.).

1~2 years of experience in data science (data analysis, machine learning, and deep learning).

Work Experience


17 Live - Senior Data Engineer (IC5), May 2021 - now

• Refactor ETL: create an Airflow project on Cloud Composer to migrate ETL tooling from digdag to Airflow and the ETL development method from shell scripts to Python.
• Maintain more than 100 BigQuery tables.
• Create pipelines from MySQL and MongoDB to BigQuery.
• Build a good development culture, including the introduction of CI/CD, dev-stage-uat-master, release notes, unit tests, and test coverage.
• Unify scheduled jobs under Airflow, including Cloud Functions schedulers, BQ schedulers, crontab, and ML models in R or Python.
• Reduce Data Team costs by 25%.
• Create the Data Team's first real-time ETL system on GKE, Pub/Sub, and Memorystore for sending push notifications to users.
• Create the Data Team's first API on GKE for an ML model, including graceful shutdown, stress testing with ApacheBench, and auto-scaling with HPA. 95th-percentile latency is under 200 ms and RPS is over 200.
• Create a tagging system for tracking groups of users.
• Create a BigQuery resource monitor to track users' BQ slot and query count usage.
• Create a documentation culture with Confluence.
• Finalist for the Break the Norm awards in 2021-Q3 and 2021-Q4.
• Assist in interviewing more than 10 new data engineers.
• Mentor junior data engineers to be more effective individual contributors.
• Apply the data team's models to the company's app (automatically send push notifications and in-app messages).
• Automatically update the recommended streamer list in the company's app via the data team's models.

SinoPac Holdings -  Software Engineer(Python), Nov. 2019 - May. 2021

• Develop the Python API (shioaji) for stock/option/futures order placement and accounts.

• Develop the C# API (shioaji) for stock/option/futures order placement and accounts, and set up CI/CD with GitHub Actions.

• Deploy a test system for simulated trading with Docker Swarm.

• Collect distributed system logs with ELK, Grafana, and Prometheus: 13 GB of log data daily.

• Monitor the distributed system and send alerts through a chatbot.

• Develop a transaction-by-trade and odd lot trading API.

Open Up Summit Speaker ( FinMind ) - 2019-12-01

Tripresso - Data Engineer, Oct. 2018 - Nov. 2019 

• Analyze travel data and build a machine learning model, with an estimated 3% increase in orders (revenue).

• Maintain and develop a distributed ETL queuing system with 20 machines.

• Optimize the ETL system, reducing execution time by more than 50%.

• Develop a new product crawler that increased product volume by 1.5%.

• Build BI analysis charts for other departments.

Mandatory Military Service, Oct. 2017 - Oct. 2018

NDHU - RA, Mar. 2016 - Aug. 2017

Analyzing G7 financial data. Model validation and parameter estimation with regression models (SUR, MLE, bootstrapping), and comparison of single-equation estimators and confidence intervals with system-equation estimators.

NDHU - TA, Sep. 2015 - Jul. 2017

Calculus, Linear Algebra, Statistics.

Projects


FinMind Open Data API


Open-source financial data with more than 50 datasets, served through an API.

More than 2,000 people registered.

2,000 stars on GitHub.

Updated automatically every day with Docker Swarm and a distributed queue system built on RabbitMQ and Celery (10 cloud machines).

More than 1 billion records in total, with 10 million streaming records per day.

Architecture diagram.



Bosch Production Line Performance - Kaggle

Post-competition analysis, top 6% rank.

Highly imbalanced data with a 1000 : 1 ratio and a 10 GB dataset size; 50% of the values are missing. More than 4,000 variables, but I built models with only 50 features.


Rossmann Store Sales - Kaggle 

Post-competition analysis, top 10% rank.

Time series problem. Building models to predict sales 48 days ahead.


Grupo Bimbo Inventory Demand - Kaggle

Post-competition analysis, top 8% rank. 

Time series problem with eighty million records. Building models to predict inventory demand 2 weeks ahead.


Instacart Market Basket Analysis - Kaggle

Real competition, top 25% rank. 

Predicting which products a consumer will purchase again.



 Verification code to text

Create a Python package that converts Taiwan Train verification codes to text.

The model is a CNN built with Keras.

Skills


Distributed Queue System

1. RabbitMQ, Celery, and Flower.

2. 8-node (cloud) distributed queue system for web crawling.

3. Deployed with Docker and GKE.

4. Graceful Shutdown.


Database

1. MySQL ( RDBMS ). 

2. Redis ( NoSQL ). 

3. DolphinDB ( TSDB ).


GCP

1. Pub/Sub.
2. GKE ( K8S ).
3. GCE.
4. BQ.
5. Composer.
6. MemoryStore.

CI/CD

1. Create automated tests and automated deployment for the FinMind team.

2. Use GitLab Runner.

3. CD to automatically publish the Python package.

4. CD to automatically update and deploy new service versions.


Log Collection & Monitoring

1. Distributed system log collection with ELK.

2. Prometheus and Grafana: monitor user usage, request latency, and request count.

3. Monitoring alerts via Telegram bot and Slack bot.

4. Monitor VMs and containers with Netdata and cAdvisor.



Data Pipeline

1. Design data pipelines for crawlers, backend, and analysis with Airflow.
2. Design more than 200 ETL jobs with Airflow.
3. Build Airflow on Cloud Composer.
4. Build a real-time pipeline for sending push notifications to users.

Machine Learning

XGBoost, random forest, SVM. Statistics: OLS, lasso.


Web Crawling

1. Python - requests, BeautifulSoup, lxml, selenium.

2. Automatic captcha recognition with a CNN model.


Data Mining

Python - numpy, pandas, sklearn. 

R - parallel, dplyr, data.table, mice.


WEB

1. https://finmindtrade.com/ 

2. nginx

3. frontend - vue 

4. backend - python 

5. traefik.


API

1. FastAPI.
2. Websocket.
3. Load Balancing.
4. Async.
5. Graceful Shutdown.

Stress Test 

1. ApacheBench.
2. The upper bound of the FinMind API is 8,000 requests per minute.


Education

National Dong Hwa University, Master of Science, Sep. 2017.

Major : Mathematics and Statistics.

Tamkang University, Bachelor of Science, Sep. 2015.

Major : Mathematics

Languages


R, Python. Basic English and proficient Chinese.
