陳慶全

Senior Data Engineer

* Data engineer and data scientist with over six years of experience.
* Proven success in processing large volumes of data (6 TB per day) with Spark in Scala and MPI in R and Python.
* Proven success in developing a machine learning model with Spark in Scala on 30 billion records for IoT device recognition.

Microsoft

National Cheng Kung University

New Taipei City, Taiwan

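Professional Background

Current status
Employed
Profession
Data Engineer ・ Data Scientist ・ Big Data Developer
Industry
Information Services ・ Big Data ・ Artificial Intelligence / Machine Learning
Work experience
4 to 6 years (4 to 6 years of relevant work experience)
Management experience
I have experience managing 1 to 5 people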
Skills
R
Python
C++
MATLAB
Shell Script
Machine Learning
Deep Learning
Data Analysis
Data Mining
Data Science
Data Cleaning
Apache Hive
Apache Spark
Hadoop ecosystem
Oracle
MySQL
SQL
PowerPoint
Statistics
AWS
Docker
Bash
Scala
Azure
Languages
Chinese ・ Native or bilingual
English ・ Advanced
Japanese ・ Advanced
Highest education
Master's

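Job Search Preferences

Desired work arrangement
Full-time ・ Interested in remote work
Desired positions
Data Scientist, Data Engineer, Data Analyst
Desired work locations
Taipei, Taiwan ・ Japan ・ USA ・ Canada ・ UK ・ Netherlands ・ Germany ・ Switzerland
Freelance services
Not available for freelance work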

Work Experience

Senior Data Engineer

Microsoft ・ Full-time

January 2021 - Present

New Taipei City, Taiwan

** Reliability Data System – Data Engineer
• Process 1 billion records per day from data centers to provide data views for reliability engineers.
• Lead 2 interns to complete data pipelines that visualize data for reliability engineers.

** Quality Management System – Data Engineer
• Increased the correctness rate of server components by 120% by leading data collection projects to align with the data in internal databases.
• Reduced the runtime of data pipelines by 80% by replacing Hive with Spark. At the same time, reduced cost by 60% by migrating from a Hadoop cluster to a serverless Spark cluster.
• Lead 9 Indian contractors to complete a service migration to meet Microsoft compliance.

Senior Data Scientist

Trend Micro Inc. ・ Full-time

January 2019 - January 2021 ・ 2 years 1 month

Taipei City, Taiwan

** Home Network Security – Data Engineer
• Reduced the time to produce reports from 1 billion daily security events by 90%. This helps marketing and sales people in Japan, Singapore and Australia find opportunities to improve business.
• Visualized the relationships between security events for threat experts with word2vec and t-SNE.

** Network Behavior Analysis Project – Data Scientist
• Developed a machine learning model to recognize IoT devices based on 30 billion netflow records via Spark in Scala and Python.
• Reached a 90% accuracy rate in identifying periodic network behaviors of IoT devices with a statistical model.

Senior Data Engineer and Data Scientist

TSMC ・ Full-time

July 2016 - January 2019 ・ 2 years 7 months

Taichung City, Taiwan

** Yield Improvement Project – Data Engineer and Data Scientist
• Processed large volumes of data (6 TB per day) to maintain a data warehouse for machine learning projects.
• Reduced the out-of-control rate by 30% via a statistical model.
• Reduced the scrapping rate by 80% with custom-built anomaly detection algorithms.
• Reduced the time needed to find key factors of yield rates by 80% via data visualization and statistics.

** Big Data Solutions – Data Engineer
• Ingested 6 TB of data per day by building an on-premise big data solution with Scala, Spark and Hive.
• Reduced the implementation time of machine learning algorithms by 95% via R, MPI, Hive and Spark.

** Weekly Productivity Improvement Program – Leader
• Developed R packages to avoid reinventing the wheel and increase productivity.
• Taught data scientists and data engineers to write clean and performant code.
• Organized study groups to share knowledge of machine learning and statistics with colleagues.

Full-time Research Assistant

Academia Sinica ・ Full-time

September 2015 - June 2016 ・ 10 months

Taipei City, Taiwan

** Main role
• Decreased data processing time by 80%, using R and MongoDB to process millions of records per day.
• Achieved a 40% lower RMSE than other methods when imputing missing values with a custom-built machine learning approach.

Education

National Cheng Kung University

Master’s Degree ・ Statistics

2012 - 2014 ・ 4/4 GPA

Description

== Achievements ==
• Completed a master’s thesis entitled “A Classification Approach Based on Density Ratio Estimation with Subspace Projection.” Advisor: Ray-Bing Chen.
• Earned a grade of 95% in my statistical methods, generalized linear models, and statistical data mining classes, and 92% in my linear models class. I am thus confident in building models and drawing inferences from them.
• Completed an advanced probability theory class designed for Ph.D. students.

National Cheng Kung University

Bachelor’s Degree ・ Economics and Statistics

2008 - 2012 ・ 3.5/4 GPA

Description

Through advance planning and hard work, I earned 175 credits across 2 majors within 4 years.