CakeResume Talent Search

Advanced filters
On
4-6 years
6-10 years
10-15 years
More than 15 years
Taipei, Taiwan
Avatar of the user.
Avatar of the user.
Data Science @Alfred Labs.
2017 ~ Present
Within one month
Python
R
SQL
4-6 years
University of Taipei
Computer Science
Avatar of 白紋愷.
Avatar of 白紋愷.
Engineer @Trend Micro 趨勢科技
2021 ~ Present
Software Engineer
Within two months
CD: Using GitHub Action to trigger product build and deploying package when the package is output successfully. BlockChain Security Corp, Programmer, May 2020 ~ Oct 2021 Developed Web App Penetration Testing Platform: Developed a website on Internet Information Services (IIS) using .NET Framework. Built databases to store user data, penetrating data, and penetrating syntaxes on a SQL Server. Integrated APIs of third-party penetration testing tools, e.g. ZAP and SQL Map. Solved the load balance and the upper limit issues of ZAP scanning using multi-threading. Maintained Endpoint Detection and Response (EDR
C/C++
SQL
Git
Employed
Ready to interview
Full-time / Interested in working remotely
4-6 years
National Chengchi University
Computer Science
Avatar of Bryan Lin.
Avatar of Bryan Lin.
Past
Principal Software Engineer @Optoma
2022 ~ 2023
Senior Software Engineer
Within two months
. 5. Host study group to encourage self-learning and knowledge sharing. 6. Product: XDR (includes Workbench, Observed Attack Techniques, and Managed XDR). - Visualization with SVG(d3) and canvas (pixi) to draw a graph for attack root cause analysis. - T able Virtualization to display big data with non-fixed height and a smooth UI. - UI sliding query to mitigate big data query timeout issues. - Microfrontend implementation with qiankun. - Transfer legacy javascript to typescript. - Replace legacy Redux with Context API - Create FIPS-compliant nodejs docker image for deployment on Azure Government Cloud
React.js
AWS
Azure
Unemployed
Open to opportunities
Full-time / Remote Only
More than 15 years
Feng-Chia University
Computer Science
Avatar of the user.
Avatar of the user.
資深程式設計師 @緯創軟體股份有限公司
2022 ~ Present
程式設計師
Within one month
JavaScript
JavaScript / ES6 / jQuery
HTML/CSS
Full-time / Interested in working remotely
6-10 years
台北市立大學
資訊科學
Avatar of 黃偉傑.
Avatar of 黃偉傑.
Full Stack Engineer @Boxful 香港商便利存有限公司
2021 ~ Present
Front-End / Back-End / Full Stack Web Developer
Within one month
and Cloud SQL, optimizing system scalability and availability. Skilled in Docker and Jenkins environment setup, CI, and Git version control, improving the development workflow and team collaboration. Successfully refactored the company's main order service architecture and upgraded to PHP 7.4 and Laravel 6, resolving compatibility issues with packages and syntax and improving system performance Redesigned the order data table structure to improve normalization and reduce complexity, enhancing data management and processing efficiency. Experienced in integrating payment APIs such as AFTEE, TAPPAY, and newebpay, enabling seamless and secure payment processing for custo...
PHP
MySQL
JavaScript
Employed
Full-time / Interested in working remotely
4-6 years
National Yunlin University of Science and Technology
Computer Science and Information Engineering
Avatar of the user.
Avatar of the user.
Customer Engineer @iKala 愛卡拉互動媒體股份有限公司
2022 ~ Present
Within two months
nlp machine learning
+python
AI & Machine Learning
Employed
Full-time
4-6 years
國立中央大學
認知與神經科學所
Avatar of Ivan Lo.
Avatar of Ivan Lo.
Senior Assistant Manager @HTC Vive
2018 ~ Present
Manager
More than one year
to customer's. # Wrapper System - app encryption/DRM mechanism, including PC-VR engine unity/unreal and Mobile-VR android . Principal Engineer • HTC 十月五月 2017 HTC VR Viveport BE Engineer. Backend Service 1. Content Management System (CMS) - viveport store app content data manager system. 2. Gamification - stats and achievement system. 3. Payment - shopping cart. 4. Beta Testing - private beta testing system for developer. Architecture Design 1. Overall system architecture: content management, content review flow. 2. Client-Server communication flow and provide
People Management
Problem Solving
Learning Skills
Employed
Full-time / Interested in working remotely
10-15 years
National Taiwan University
Master of Science (M.S). Computer Science

The Most Lightweight and Effective Recruiting Plan

Search resumes and take the initiative to contact job applicants for higher recruiting efficiency. The Choice of Hundreds of Companies.

  • Browse all search results
  • Unlimited access to start new conversations
  • Resumes accessible for only paid companies
  • View users’ email address & phone numbers
Search Tips
1
Search a precise keyword combination
senior backend php
If the number of the search result is not enough, you can remove the less important keywords
2
Use quotes to search for an exact phrase
"business development"
3
Use the minus sign to eliminate results containing certain words
UI designer -UX
Only public resumes are available with the free plan.
Upgrade to an advanced plan to view all search results including tens of thousands of resumes exclusive on CakeResume.

Definition of Reputation Credits

Technical Skills
Specialized knowledge and expertise within the profession (e.g. familiar with SEO and use of related tools).
Problem-Solving
Ability to identify, analyze, and prepare solutions to problems.
Adaptability
Ability to navigate unexpected situations; and keep up with shifting priorities, projects, clients, and technology.
Communication
Ability to convey information effectively and is willing to give and receive feedback.
Time Management
Ability to prioritize tasks based on importance; and have them completed within the assigned timeline.
Teamwork
Ability to work cooperatively, communicate effectively, and anticipate each other's demands, resulting in coordinated collective action.
Leadership
Ability to coach, guide, and inspire a team to achieve a shared goal or outcome effectively.
Within one month
Sr. Data Engineer
17LIVE
2021 ~ Present
Taipei, 台灣
Professional Background
Current status
Employed
Job Search Progress
Open to opportunities
Professions
Data Engineer, Python Developer, System Architecture
Fields of Employment
Information Services
Work experience
4-6 years
Management
None
Skills
Python
MySQL
Linode
API Development
Linux
RabbitMQ
Celery
Nginx
Flask(Python)
Django(Python)
Git
docker swarm
Docker
docker-compose
Data Mining
Machine Learning
Traefik
Redis
ELK(ElasticSearch)
ELK
Prometheus
Grafana
Airflow
dolphindb
SQL
FastAPI
GKE
K8S
Real-Time Systems
GCP
Languages
English
Intermediate
Job search preferences
Positions
Data Solution Architect, Sr. Data Engineer, Data Engineer Manager
Job types
Full-time
Locations
Taipei, 台灣, Taiwan
Remote
Interested in working remotely
Freelance
Yes, I freelance in my spare time
Educations
School
NDHU
Major
統計
Print
Profile 02 00@2x 71843ef6a0df47d6255a9c0436c409dcd5cd81f6514c51a6b2a93339d82bbff6

linsam

data engineer、backend engineer

 • 0972724528 •  台灣  •  [email protected]

5~6 years experience with data engineer and soft engineer. (Distributed Queue System, Database, Web Crawling, RESTful API, ETL, Docker, CICD, GCP, K8S, Airflow ...etc.)

1~2 years experience with data science. (data analysis, machine learning and deep learning)

Work Experience


17 Live -  Senior Data Engineer (IC5), May. 2021 - now

Refactor ETL, create a airflow project by Cloud Composer to transfer ETL tools from digdag to airflow and transfer ETL develop method from shell script to python. 
• Maintenance BigQuery more than 100 tables. 
• Create pipelines from mysql and mongo to bigquery. 
• Create a good development culture, including the introduction of CICD, dev-stage-uat-master, release news, unit tests and test coverage. 
• Using Airflow unified scheduler job, like cloud function scheduler, BQ scheduler, crontab, and ML model by R or Python ...etc.
Reduce Data Team 25% cost.
• Create Data Team's first real-time ETL system via GKE, Pub/Sub and Memorystore for sending push notifications to users.
• Create Data Team's first API via GKE for ML model, include achieve graceful shutdown, and run stress test via ApacheBench, and setup auto-scaling by hpa. 95% latency is under 200ms and RPS is over 200.
• Create a Tagging System for tracking groups of users. 
• Create a BigQuery Resource Monitor to monitor users BQ slot and query count usage. 
• Create document culture by confluence.
The finalists of Break the Norm awards on 2021-Q3 and 2021-Q4. 
• Assist in interview more than 10 new data engineer. 
• Mentor junior data engineers to be more effective individual contributors.
• Apply the data team's models to the company's APP. (automatically send push notifications and in-app messages
• Automatically update recommend streamer list via data team's models to the company's APP.

SinoPac Holdings -  Software Engineer(Python), Nov. 2019 - May. 2021

• Develop python Api (shioaji) for stock/option/future place orde and account. 

• Develop C# Api (shioaji) for stock/option/future place orde and account, and setup CI/CD with GitHub actions.

• Deploy test system for simulate trading by docker swarm.

• Collecting distributed system Log by elk, grafana and prometheus. 13GB log data/daily.

• Monitor distributed system and alert chatbot.

• Develop a transaction-by-trade and odd lot trading API.

Open Up Summit Speaker ( FinMind ) - 2019-12-01

Tripresso - Data Engineer, Oct. 2018 - Nov. 2019 

• Analysis travel data and build a machine learning model. Estimating increase 3% orders (revenue). 

• Maintain and develop an ETL distributed queuing system with 20 machines

• Optimize the ETL system reduced more than 50% execution time. 

• Develop new product crawler let product volume increase 1.5%. 

• Making analysis BI charts provide for other departments.

Mandatory Military Service,Oct. 2017 - Oct. 2018

NDHU - RA, Mar. 2016 - Aug. 2017

Analysing G7 financial data. Model validation and parameter estimation by regression models ( SUR, MLE, Bootstrapping ). And comparing single equation estimators and confidence interval with system equation.

NDHU - TA, Sep. 2015 - Jul. 2017

Calculus, Linear Algebra, Statistics.

Projects


FinMind Open data Api


Open source financial data, more than 50 dataset, provide Api. 

More than 2,000 people registered.

2,000 stars on github.

Automatic update daily by docker swarm, distributed queue system rabbitmq and celery ( 10 cloud machines ). 

Total more than 1 billion data, 10 million streaming data per day.

Architecture diagram.



Bosch Production Line Performance - Kaggle Post-competition analysis, top 6% rank.

Highly imbalance data, ratio is 1000 : 1, 10 GB dataset size. And the data is 50% missing value. More than 4000 variables, but I build models by only 50 features.


Rossmann Store Sales - Kaggle 

Post-competition analysis, top 10% rank.

Time series problem. Building models predict sales after 48 days.


Grupo Bimbo Inventory Demand - Kaggle

Post-competition analysis, top 8% rank. 

Time series problem, eighty millions data size. Building models predict inventory demand after 2 weeks.


Instacart Market Basket Analysis - Kaggle

Real competition, top 25% rank. 

Predicting which products will an consumer purchase again.



 Verification code to text

Create python package of Taiwan Train Verification Code to text.

The model is made by keras-CNN.

Skills


Distributed Queue System

1. Rabbitmq & Celery & Flower. 

2. 8 nodes ( Cloud ) distributed queue system for web crawling. 

3. Deploy by Docker and GKE.

4. Graceful Shutdown.


Database

1. MySQL ( RDBMS ). 

2. Redis ( NoSQL ). 

3. Dolphindb ( TSDB ).


GCP

1. Pub/Sub.
2. GKE ( K8S ).
3. GCE.
4. BQ.
5. Composer.
6. MemoryStore.

CI/CD

1. Create automated tests and automated deploy for the FinMind team. 

2. Using gitlab runner. 

3. CD for auto publish python package. 

4. CD for auto update and deploy new version service.


Log Collect & Monitor

1. Distributed system log collect by elk.  

2. Prometheus and Grafana. Monitor user usage, request latency, request count 

3. Monitor by telegram bot and slackbot.

4. Monitor vm and container by Netdata and cadvisor.



data pipeline

1. Design data pipeline for crawler, backend and analysis by airflow.
2. Design more 200 ETL by airflow.
3. Build airflow by composer
4. Build a real-time pipeline for sending push notifications to users

Machine Learning

xgboost, random forest, svm. statistics - ols, lasso.


Web Crawling

1. Python - request, BeautifulSoup, lxml, selenium. 

2. Auto recognition captcha code by CNN model.


Data Mining

Python - numpy, pandas, sklearn. 

R - parallel, dplyr, data.table, mice.


WEB

1. https://finmindtrade.com/ 

2. nginx

3. frontend - vue 

4. backend - python 

5. traefik.


API

1. FastAPI.
2. Websocket.
3. Loading Balance.
4. Async.
5. Graceful Shutdown.

Stress Test 

1. ApacheBench.
2. Upper bound of FinMind api is 8000/minute request.


Education

National Dong Hwa University, Master of Science,  Sep. 2017.

Major : Mathematics and Statistics.

Tamkang University. Bachelor of Science, Sep. 2015.

Major : Mathematics

Languages


R, Python. Basic in English and proficient in Chinese.

Resume
Profile
Profile 02 00@2x 71843ef6a0df47d6255a9c0436c409dcd5cd81f6514c51a6b2a93339d82bbff6

linsam

data engineer、backend engineer

 • 0972724528 •  台灣  •  [email protected]

5~6 years experience with data engineer and soft engineer. (Distributed Queue System, Database, Web Crawling, RESTful API, ETL, Docker, CICD, GCP, K8S, Airflow ...etc.)

1~2 years experience with data science. (data analysis, machine learning and deep learning)

Work Experience


17 Live -  Senior Data Engineer (IC5), May. 2021 - now

Refactor ETL, create a airflow project by Cloud Composer to transfer ETL tools from digdag to airflow and transfer ETL develop method from shell script to python. 
• Maintenance BigQuery more than 100 tables. 
• Create pipelines from mysql and mongo to bigquery. 
• Create a good development culture, including the introduction of CICD, dev-stage-uat-master, release news, unit tests and test coverage. 
• Using Airflow unified scheduler job, like cloud function scheduler, BQ scheduler, crontab, and ML model by R or Python ...etc.
Reduce Data Team 25% cost.
• Create Data Team's first real-time ETL system via GKE, Pub/Sub and Memorystore for sending push notifications to users.
• Create Data Team's first API via GKE for ML model, include achieve graceful shutdown, and run stress test via ApacheBench, and setup auto-scaling by hpa. 95% latency is under 200ms and RPS is over 200.
• Create a Tagging System for tracking groups of users. 
• Create a BigQuery Resource Monitor to monitor users BQ slot and query count usage. 
• Create document culture by confluence.
The finalists of Break the Norm awards on 2021-Q3 and 2021-Q4. 
• Assist in interview more than 10 new data engineer. 
• Mentor junior data engineers to be more effective individual contributors.
• Apply the data team's models to the company's APP. (automatically send push notifications and in-app messages
• Automatically update recommend streamer list via data team's models to the company's APP.

SinoPac Holdings -  Software Engineer(Python), Nov. 2019 - May. 2021

• Develop python Api (shioaji) for stock/option/future place orde and account. 

• Develop C# Api (shioaji) for stock/option/future place orde and account, and setup CI/CD with GitHub actions.

• Deploy test system for simulate trading by docker swarm.

• Collecting distributed system Log by elk, grafana and prometheus. 13GB log data/daily.

• Monitor distributed system and alert chatbot.

• Develop a transaction-by-trade and odd lot trading API.

Open Up Summit Speaker ( FinMind ) - 2019-12-01

Tripresso - Data Engineer, Oct. 2018 - Nov. 2019 

• Analysis travel data and build a machine learning model. Estimating increase 3% orders (revenue). 

• Maintain and develop an ETL distributed queuing system with 20 machines

• Optimize the ETL system reduced more than 50% execution time. 

• Develop new product crawler let product volume increase 1.5%. 

• Making analysis BI charts provide for other departments.

Mandatory Military Service,Oct. 2017 - Oct. 2018

NDHU - RA, Mar. 2016 - Aug. 2017

Analysing G7 financial data. Model validation and parameter estimation by regression models ( SUR, MLE, Bootstrapping ). And comparing single equation estimators and confidence interval with system equation.

NDHU - TA, Sep. 2015 - Jul. 2017

Calculus, Linear Algebra, Statistics.

Projects


FinMind Open data Api


Open source financial data, more than 50 dataset, provide Api. 

More than 2,000 people registered.

2,000 stars on github.

Automatic update daily by docker swarm, distributed queue system rabbitmq and celery ( 10 cloud machines ). 

Total more than 1 billion data, 10 million streaming data per day.

Architecture diagram.



Bosch Production Line Performance - Kaggle Post-competition analysis, top 6% rank.

Highly imbalance data, ratio is 1000 : 1, 10 GB dataset size. And the data is 50% missing value. More than 4000 variables, but I build models by only 50 features.


Rossmann Store Sales - Kaggle 

Post-competition analysis, top 10% rank.

Time series problem. Building models predict sales after 48 days.


Grupo Bimbo Inventory Demand - Kaggle

Post-competition analysis, top 8% rank. 

Time series problem, eighty millions data size. Building models predict inventory demand after 2 weeks.


Instacart Market Basket Analysis - Kaggle

Real competition, top 25% rank. 

Predicting which products will an consumer purchase again.



 Verification code to text

Create python package of Taiwan Train Verification Code to text.

The model is made by keras-CNN.

Skills


Distributed Queue System

1. Rabbitmq & Celery & Flower. 

2. 8 nodes ( Cloud ) distributed queue system for web crawling. 

3. Deploy by Docker and GKE.

4. Graceful Shutdown.


Database

1. MySQL ( RDBMS ). 

2. Redis ( NoSQL ). 

3. Dolphindb ( TSDB ).


GCP

1. Pub/Sub.
2. GKE ( K8S ).
3. GCE.
4. BQ.
5. Composer.
6. MemoryStore.

CI/CD

1. Create automated tests and automated deploy for the FinMind team. 

2. Using gitlab runner. 

3. CD for auto publish python package. 

4. CD for auto update and deploy new version service.


Log Collect & Monitor

1. Distributed system log collect by elk.  

2. Prometheus and Grafana. Monitor user usage, request latency, request count 

3. Monitor by telegram bot and slackbot.

4. Monitor vm and container by Netdata and cadvisor.



data pipeline

1. Design data pipeline for crawler, backend and analysis by airflow.
2. Design more 200 ETL by airflow.
3. Build airflow by composer
4. Build a real-time pipeline for sending push notifications to users

Machine Learning

xgboost, random forest, svm. statistics - ols, lasso.


Web Crawling

1. Python - request, BeautifulSoup, lxml, selenium. 

2. Auto recognition captcha code by CNN model.


Data Mining

Python - numpy, pandas, sklearn. 

R - parallel, dplyr, data.table, mice.


WEB

1. https://finmindtrade.com/ 

2. nginx

3. frontend - vue 

4. backend - python 

5. traefik.


API

1. FastAPI.
2. Websocket.
3. Loading Balance.
4. Async.
5. Graceful Shutdown.

Stress Test 

1. ApacheBench.
2. Upper bound of FinMind api is 8000/minute request.


Education

National Dong Hwa University, Master of Science,  Sep. 2017.

Major : Mathematics and Statistics.

Tamkang University. Bachelor of Science, Sep. 2015.

Major : Mathematics

Languages


R, Python. Basic in English and proficient in Chinese.