CakeResume Talent Search

Formerly
Data Engineer @Rooit Inc. (XO App)
2023 ~ 2023
AI Engineer, Machine Learning Engineer, Deep Learning Engineer, Data Scientist
Within one month
Python
Data Analysis
Data Science
Unemployed
Actively looking for a job
Full-time / Interested in remote work
6 to 10 years
China Medical University
Graduate Institute of Clinical Medicine
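
Data Engineer @TSMC 台積電
2022 ~ Present
Data Analyst, Algorithm Engineer, Software Engineer, Software Project Management
Within one month
Backend Development
NLP
Python
Employed
Actively looking for a job
Full-time / Interested in remote work
4 to 6 years
National Central University
Graduate Institute of Network Learning Technology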

Vel Tien-Yun Wu
Data Engineer @Groundhog Technologies Inc.
2021 ~ 2024
Data Analyst, Data Engineer, Data Scientist, Customer Experience Analyst
Within one month
…my work within multidisciplinary teams, ensuring clear and effective communication. Seeking a role as a Data Engineer or Data Analyst, I am eager to apply my technical expertise and analytical skills to contribute to meaningful projects and collaborate with a dynamic team. New Taipei City, Taiwan. Work Experience: Data Engineer • Groundhog Technologies Inc., July - Present. - Built and maintained data pipelines (through which several hundred million rows of data flow daily) using Scala Spark/Hadoop - Managed cron jobs and performed regular data recovery using Apache Airflow - Performed regular extract, transform, load (ETL) operations through Hive and HDFS
Git
Python
Scala
Employed
Actively looking for a job
Full-time / Interested in remote work
4 to 6 years
University of Illinois at Urbana-Champaign, School of Information Sciences
Information Management
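
李孟霖
Senior Data Engineer @Wistron Corporation (緯創資通)
2020 ~ Present
Data Analyst, Data Engineer, Data Scientist, Customer Experience Analyst, Solution Architect, Cloud Architect
Within one month
李孟霖. Digital transformation: optimizing administrative workflows and helping companies quickly grasp the value of their data. ● Latest experience: Wistron Corporation, HR Digital Transformation, Senior Data Analyst ● Experience: Corporate Synergy Development Center, Data Analyst Consultant ● Certificates: Wistron Data Engineer certificate; PJ法 L1 certificate. Years of experience: 4 years 10 months…
python
PowerBI
Power Platform
Employed
Actively looking for a job
Full-time / Interested in remote work
4 to 6 years
Yuan Ze University
Graduate Institute of Industrial Engineering and Management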
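
林冠安
Formerly
Data Analyst @Trend Micro (趨勢科技)
2021 ~ 2024
Data Analyst, Data Engineer, Data Scientist, Customer Experience Analyst
Within one month
…execution results. 3. Process and report development: discussed reporting requirements with PMs or Sales Operations, implemented the calculation logic in stored procedures, and created the related views to support Tableau report development. Served as the communication channel between users and data engineers, consolidating requirements and then developing reports or data that met users' needs. Drawing on past data-validation experience, implemented data quality (DQ) logic for each metric in stored procedures, to avoid…
R
PL/SQL
Python
Unemployed
Actively looking for a job
Full-time / Interested in remote work
6 to 10 years
Fu Jen Catholic University
Graduate Institute of Finance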

陶俊良
Data Analyst @Portto 門戶科技 | Blocto
2022 ~ 2024
Data Analyst, Data Engineer, Data Scientist, Customer Experience Analyst
Within one month
Portto 門戶科技 | Blocto • September - March 2024. Main responsibilities: establishing the data pipeline; exploring new product features and competitor analysis with Dune dashboards on the EVM; user tagging for the Growth team (including a Discord bot for monitoring). Project details: Data pipeline: regularly integrating client-side and BE data with external APIs and bot-collected data on BigQuery; establishing a systematic coding data table combined with Slack bot commands for manual and automatic data replenishment; daily data monitoring with a Slack bot; planning client-side (app, SDK JS) Amplitude event tracking to maximize data collection. Using existing data to…
python
R
MySQL
Employed
Actively looking for a job
Full-time / Interested in remote work
4 to 6 years
National Taiwan University
Institute of Epidemiology and Preventive Medicine, Biostatistics Division

Justin Liu
Manager @GOMAJI 夠麻吉
2017 ~ Present
Project Lead / Tech Lead / Team Lead / Technical Manager
Within one month
Justin Liu, Manager, GOMAJI 夠麻吉. A technical leader with extensive IT management experience, skilled in driving technological innovation, optimizing development processes, and leading cross-functional teams to achieve business objectives. In my previous roles, I successfully led a team of 10, including web developers, app developers, DevOps, data engineers, and API developers, to accomplish several key projects. I have a profound understanding of and practical experience in technical architecture, cloud architecture solutions (GCP and AWS), CI/CD and Docker, and data analytics, and am committed to enhancing team efficiency and product quality. Has extensive…
Team Lead
Management Team
Cloud Architecture
Employed
Actively looking for a job
Full-time / Interested in remote work
10 to 15 years
Shih Hsin University
Management Information Systems, General

Vu Nguyen Ngoc Quang
Formerly
Mobile App Developer @Apple Inc.
2014 ~ Present
Lead Infrastructure Engineer
Within two months
…for performance and scalability. Developed databases that supported web applications and websites. Developed system interaction and sequence diagrams. Big Data Engineer • Freelancer, June - July 2023. Built machine learning models using TensorFlow and Scikit-Learn libraries for predictive analysis of customer behavior. Designed and implemented a scalable data warehouse architecture using Apache Cassandra, PostgresDB, and Redis. Optimized database performance by tuning queries in SQL Server, Oracle, and PostgreSQL databases. Implemented efficient data processing algorithms on large datasets with Apache Spark, MapReduce, and Pandas Python. Created dashboards in Tableau Desktop Professional Edition to visualize complex…
Machine learning
Virtualization Technologies
Pandas Python
Unemployed
Actively looking for a job
Full-time / Interested in remote work
6 to 10 years

Corporate Strategy Project Director @17LIVE Inc.
2023 ~ Present
Business Strategist
Within one month
Excel
Project Management
Employed
Actively looking for a job
Full-time / Interested in remote work
6 to 10 years
National Chengchi University
Communication, General

moh yanni fikri
Formerly
Electrical Maintenance @PT. Pabrik Kertas Tjiwi Kimia Tbk.
2021 ~ 2023
Engineer
Within one month
…in preparation for when a tool has a problem or for an automation project. Education: Politeknik Perkapalan Negeri Surabaya, Automation Engineering, GPA. Skills: Preventive Maintenance (preparing weekly and monthly summaries and exception reports); AC/DC Drive; Wiring Diagram (AutoCAD, EPLAN P8 Electrical); PLC and HMI Programming; SCADA; Power Inverter; Project and People Management; Electrical Troubleshooting; Computerized Maintenance Management Systems (CMMS); Data Analysis and Visualization; Machine Learning. Certifications: Data Science and Machine Learning - Purwadhika Digital Technology School (Purwadhika); System 800xA with AC 800M Hardware Maintenance and Troubleshooting - ABB; Instrument Inspector level 2 - Inspector Training; Instrument Inspector level 1 - Inspector Training; PLC Intermediate Engineer - PPNS; Industrial Automation System Design - BNSP
Data Science
Python
Machine Learning
Unemployed
Actively looking for a job
Full-time / Not considering remote work for now
4 to 6 years
Politeknik Perkapalan Negeri Surabaya
Automation Engineering


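Within two months
Sr. Data Engineer
17LIVE
2021 ~ Present
Taipei, Taiwan
Professional Background
Current status
Employed
Job search stage
Open to learning about new opportunities
Profession
Data Engineer, Python Developer, System Architecture
Industry
Information Services
Work experience
4 to 6 years
Management experience
Skills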
Python
MySQL
Linode
API Development
Linux
RabbitMQ
Celery
Nginx
Flask(Python)
Django(Python)
Git
docker swarm
Docker
docker-compose
Data Mining
Machine Learning
Traefik
Redis
ELK(ElasticSearch)
ELK
Prometheus
Grafana
Airflow
dolphindb
SQL
FastAPI
GKE
K8S
Real-Time Systems
GCP
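Languages
English
Intermediate
Job Preferences
Desired positions
Data Solution Architect, Sr. Data Engineer, Data Engineer Manager
Expected work mode
Full-time
Desired work location
Taipei, Taiwan
Remote work
Interested in remote work
Freelance services
Yes, I freelance in my spare time
Education
School
NDHU
Major
Statistics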

linsam

Data Engineer, Backend Engineer

 • 0972724528 •  台灣  •  [email protected]

5-6 years of experience in data engineering and software engineering (distributed queue systems, databases, web crawling, RESTful APIs, ETL, Docker, CI/CD, GCP, K8s, Airflow, etc.).

1-2 years of experience in data science (data analysis, machine learning, and deep learning).

Work Experience


17LIVE - Senior Data Engineer (IC5), May 2021 - Present

• Refactored ETL: created an Airflow project on Cloud Composer, migrating the ETL tooling from Digdag to Airflow and the ETL development method from shell scripts to Python.
• Maintained more than 100 BigQuery tables.
• Built pipelines from MySQL and MongoDB to BigQuery.
• Fostered a good development culture, including the introduction of CI/CD, a dev-stage-uat-master workflow, release notes, unit tests, and test coverage.
• Unified scheduled jobs under Airflow, such as Cloud Function schedulers, BQ schedulers, crontab jobs, and ML models written in R or Python.
• Reduced the Data Team's costs by 25%.
• Built the Data Team's first real-time ETL system on GKE, Pub/Sub, and Memorystore to send push notifications to users.
• Built the Data Team's first API on GKE to serve ML models, including graceful shutdown, stress testing with ApacheBench, and auto-scaling with HPA. The 95th-percentile latency is under 200 ms and throughput is over 200 RPS.
• Built a tagging system for tracking groups of users.
• Built a BigQuery resource monitor to track users' BQ slot and query-count usage.
• Established a documentation culture using Confluence.
• Finalist for the Break the Norm awards in 2021 Q3 and 2021 Q4.
• Assisted in interviewing more than 10 data engineer candidates.
• Mentored junior data engineers to become more effective individual contributors.
• Applied the data team's models to the company's app (automatically sending push notifications and in-app messages).
• Automatically updated the recommended streamer list in the company's app using the data team's models.

SinoPac Holdings - Software Engineer (Python), Nov. 2019 - May 2021

• Developed the Python API (shioaji) for stock/option/futures order placement and accounts.

• Developed the C# API (shioaji) for stock/option/futures order placement and accounts, and set up CI/CD with GitHub Actions.

• Deployed a test system for simulated trading using Docker Swarm.

• Collected distributed-system logs with ELK, Grafana, and Prometheus (13 GB of log data per day).

• Monitored the distributed system with chatbot alerts.

• Developed an API for transaction-by-trade and odd-lot trading.

Open Up Summit Speaker (FinMind) - 2019-12-01

Tripresso - Data Engineer, Oct. 2018 - Nov. 2019 

• Analyzed travel data and built a machine learning model, with an estimated 3% increase in orders (revenue).

• Maintained and developed a distributed ETL queuing system running on 20 machines.

• Optimized the ETL system, reducing execution time by more than 50%.

• Developed a new product crawler that increased product volume by 1.5%.

• Produced BI analysis charts for other departments.

Mandatory Military Service, Oct. 2017 - Oct. 2018

NDHU - RA, Mar. 2016 - Aug. 2017

Analyzed G7 financial data. Performed model validation and parameter estimation with regression models (SUR, MLE, bootstrapping), and compared single-equation estimators and confidence intervals with system-equation estimates.

NDHU - TA, Sep. 2015 - Jul. 2017

Calculus, Linear Algebra, Statistics.

Projects


FinMind Open Data API


Open-source financial data with more than 50 datasets, provided through an API.

More than 2,000 registered users.

2,000 stars on GitHub.

Updated automatically every day using Docker Swarm and a distributed queue system built on RabbitMQ and Celery (10 cloud machines).

More than 1 billion records in total, with 10 million streaming records per day.

Architecture diagram.



Bosch Production Line Performance - Kaggle

Post-competition analysis, top 6% rank.

Highly imbalanced data with a 1000:1 class ratio and a 10 GB dataset, in which 50% of the values are missing. There are more than 4,000 variables, but I built the models with only 50 features.


Rossmann Store Sales - Kaggle 

Post-competition analysis, top 10% rank.

Time series problem. Built models to predict sales for the following 48 days.


Grupo Bimbo Inventory Demand - Kaggle

Post-competition analysis, top 8% rank. 

Time series problem with 80 million records. Built models to predict inventory demand two weeks ahead.


Instacart Market Basket Analysis - Kaggle

Real competition, top 25% rank. 

Predicted which products a consumer will purchase again.



Verification Code to Text

Created a Python package that converts Taiwan train verification codes (CAPTCHAs) to text.

The model is a CNN built with Keras.

Skills


Distributed Queue System

1. RabbitMQ, Celery, and Flower.

2. An 8-node cloud-based distributed queue system for web crawling.

3. Deployed with Docker and GKE.

4. Graceful shutdown.


Database

1. MySQL (RDBMS).

2. Redis (NoSQL).

3. DolphinDB (TSDB).


GCP

1. Pub/Sub.
2. GKE (K8s).
3. GCE.
4. BQ.
5. Composer.
6. Memorystore.

CI/CD

1. Created automated tests and automated deployment for the FinMind team.

2. Using GitLab Runner.

3. CD to automatically publish the Python package.

4. CD to automatically update and deploy new versions of the service.


Log Collection & Monitoring

1. Distributed-system log collection with ELK.

2. Prometheus and Grafana: monitoring user usage, request latency, and request count.

3. Alerting via Telegram bot and Slack bot.

4. VM and container monitoring with Netdata and cAdvisor.



Data Pipeline

1. Designed data pipelines for the crawler, backend, and analysis with Airflow.
2. Designed more than 200 ETL jobs with Airflow.
3. Built Airflow on Cloud Composer.
4. Built a real-time pipeline for sending push notifications to users.

Machine Learning

XGBoost, random forest, SVM. Statistics: OLS, lasso.


Web Crawling

1. Python - requests, BeautifulSoup, lxml, selenium.

2. Automatic CAPTCHA recognition with a CNN model.


Data Mining

Python - numpy, pandas, sklearn. 

R - parallel, dplyr, data.table, mice.


WEB

1. https://finmindtrade.com/ 

2. nginx

3. frontend - vue 

4. backend - python 

5. traefik.


API

1. FastAPI.
2. Websocket.
3. Load Balancing.
4. Async.
5. Graceful Shutdown.

Stress Test 

1. ApacheBench.
2. The upper bound of the FinMind API is 8,000 requests per minute.


Education

National Dong Hwa University, Master of Science, Sep. 2017.

Major: Mathematics and Statistics.

Tamkang University, Bachelor of Science, Sep. 2015.

Major: Mathematics.

Languages


R, Python. Basic English; proficient in Chinese.
