CakeResume Talent Search

Advanced filters
On
4-6 years
6-10 years
10-15 years
More than 15 years
National Taiwan University
Avatar of 陳昭儒.
Avatar of 陳昭儒.
Past
Data Engineer @BUBBLEYE | We're hiring!
2021 ~ 2022
Software Enginer
Within one month
size:GB Number of rows: 6,268,519,176 Qudowe Project Lead & Software Engineer Product of Pixnet Travel Hackathon 2019, a trip planner based on Instagram's data Work Experience Vpon, Data Engineer Aug 2018 ~ Oct 2020 Implement Akka-http(Scala) server endpoints for Vpon Data Platform Product Create new ETL pipelines using GCP Spark(Apache Spark) and GCP Dataflow(Apache Beam) to batch input/output hundreds of files Migrate existing ETL pipelines from AWS(Hive SQL) to GCP(BigQuery SQL) using python Migrate datawarehouse from AWS(Hive) to GCP(BigQuery) Setup Prometheus on GKE (Google Kubernetes Engine) to
Python
ETL
Web Scraping
Unemployed
Ready to interview
Full-time / Interested in working remotely
4-6 years
National Taiwan University
電機工程學系
Avatar of Chin-Hung (Wilson) Liu.
Avatar of Chin-Hung (Wilson) Liu.
Principal Engineer, Data Engineering @KKCompany
2023 ~ Present
Backend Engineer, Data Engineer, MLOps Engineer
Within one month
Chin-Hung (Wilson) Liu I am a lead architect responsible for designing and implementing a large-scale data pipeline for Lomotif, Paktor x 17LIVE, utilizing GCP/AWS/Python/Scala, in collaboration with data science and machine learning teams in Singapore and TW HQ, as well as with the Hadoop ecosystem (HDFS/HBase/Kafka) at JSpectrum in Hong Kong and Sydney. With over 15 years of experience in designing and developing Java/Scala/Python-based applications for daily operations, I bring: ● At least 8 years of experience in data analysis, pipeline design
Big Data
Data Engineering
ETL
Employed
Open to opportunities
Full-time / Interested in working remotely
10-15 years
National Taiwan University
EMBA Programs, Business Administration, Accounting, Finance and International Business.
Avatar of 陳婉玲.
Avatar of 陳婉玲.
Analyst @Business Next Media Corp.
2017 ~ 2019
Data Analyst、Data Engineer、Data Scientist、Customer Experience Analyst
Within one month
business process automation using data techniques on loan, credit analysis and procurement process. • Design solutions for data leakage prevention and data security. • Implement end-to-end generative AI solution integrating with GraphDB, Chatbot & Avatar. Morrison Express Corp., Data Engineer, Jan 2021 ~ Jul 2022 • Orchestrated data pipeline, ETL process and API to connect diverse data systems and facilitate data migration. • Full accountability for overseeing data architecture and services spanning 70 offices across five continents. • Integrated AWS cloud services, optimizing applications with S3, Lambda, SQS, SNS, and ECS. ASUSTeK Computer Inc., Digital Analyst, Sep
Python
SQL/MySQL
Databases
Employed
Full-time / Interested in working remotely
4-6 years
National Taiwan University
Psychology
Avatar of Recca Chao.
Avatar of Recca Chao.
高級工程師 @關網資訊股份有限公司
2022 ~ 2023
Senior PHP engineer
Within one month
開立發票桌面程式 天下雜誌, 後端工程師, Mar 2020 ~ Jun 2020 協助天下學習產品開發 研究 airflow 功能,管理數據分析流程 交接以 Laravel 製作 ETL 專案 協助建立工程文件 協助維護的產品 天下學習 數據分析系統 104 人力銀行, 後端工程師, Jul 2019 ~ Jan 2020 使用 Laravel 框架建立產
PHP development
MySQL database design
Laravel Framework
Employed
Not open to opportunities
Full-time / Not interested in working remotely
6-10 years
National Taiwan University
資訊工程
Avatar of the user.
Avatar of the user.
Senior Data Architect @Agoda
2023 ~ Present
Data Engineer
Within one month
Python
PySpark
MySQL
Employed
Full-time / Interested in working remotely
4-6 years
National Taiwan University
Graduate Institute of Applied Mechanics
Avatar of 林奕勳.
Active
Avatar of 林奕勳.
Active
人工智慧工程師 @玉山銀行智能金融處 (Intelligent Finance Dept., E-SUN Bank )
2020 ~ Present
Machine Learning Researcher / Engineer
Within one month
maintenance and optimization cost of AI projects with integration of customizable data validation, metadata registration, and time/memory-profiling tools. Ensure flexibility and extensibility in pandas-like packages (such as CuDF, Polars, PySpark, etc) for future parallelizability, targeting 1000X speedup. Responsibility: Coordinate the design and development of ETL framework components. Give instruction to ETL research intern and provide code review and design guidance for the project participants. Accomplishment: Have wrapped the framework into a workable python package and successfully imported it into two production projects, with ETL business logic automatically rendered on the Airflow UI
Research
Music Information Retrieval
Natural Language Processing
Employed
Full-time / Interested in working remotely
4-6 years
National Taiwan University
Graduate Institute of Communication Engineering
Avatar of the user.
Avatar of the user.
Senior Staff Backend Engineer / Backend Team Lead @WeMo Scooter
2019 ~ 2022
工程師
Within two months
JavaScript
Node.js
RESTful API
Not open to opportunities
Full-time / Interested in working remotely
6-10 years
National Taiwan University
Robotics
Avatar of the user.
Avatar of the user.
CTO @KryptoGO Co., Ltd.
2021 ~ Present
AI Software Engineer,Deep learning Engineer
Within one year
Algorithm
ASP.NET
Machine Learning
Employed
Not open to opportunities
Full-time / Interested in working remotely
6-10 years
National Taiwan University
Computer Science and Information Engineering
Avatar of 陳峻奇.
Avatar of 陳峻奇.
Jr. Programmer @德義資訊股份有限公司
2013 ~ 2015
Developer Team Leader, Architect, FullStack Developer
More than one year
Git / SVN Android APP Arduino Development Container Technology and Kubernetes Operating Experience Experience / 經歷 Senior Java Developer at Delta Electronics / 台達電子工業股份有限公司 (2018/03~present) 於台達IABG內進行Holmes Project之ETL相關專案開發、管理。 Senior Java Developer at Delta Electronics / 台達電子工業股份有限公司 (2017/03~2018/03) 於台達研究院內進行Block Control、IoT、Big Data Analytics Platform
Word
PowerPoint
Excel
Employed
Full-time / Interested in working remotely
6-10 years
National Taiwan University
Bachelor of Bio-Industrial Mechatronics Engineering

The Most Lightweight and Effective Recruiting Plan

Search resumes and take the initiative to contact job applicants for higher recruiting efficiency. The Choice of Hundreds of Companies.

  • Browse all search results
  • Unlimited access to start new conversations
  • Resumes accessible for only paid companies
  • View users’ email address & phone numbers
Search Tips
1
Search a precise keyword combination
senior backend php
If the number of the search result is not enough, you can remove the less important keywords
2
Use quotes to search for an exact phrase
"business development"
3
Use the minus sign to eliminate results containing certain words
UI designer -UX
Only public resumes are available with the free plan.
Upgrade to an advanced plan to view all search results including tens of thousands of resumes exclusive on CakeResume.

Definition of Reputation Credits

Technical Skills
Specialized knowledge and expertise within the profession (e.g. familiar with SEO and use of related tools).
Problem-Solving
Ability to identify, analyze, and prepare solutions to problems.
Adaptability
Ability to navigate unexpected situations; and keep up with shifting priorities, projects, clients, and technology.
Communication
Ability to convey information effectively and is willing to give and receive feedback.
Time Management
Ability to prioritize tasks based on importance; and have them completed within the assigned timeline.
Teamwork
Ability to work cooperatively, communicate effectively, and anticipate each other's demands, resulting in coordinated collective action.
Leadership
Ability to coach, guide, and inspire a team to achieve a shared goal or outcome effectively.
Within one month
AI Engineer
Logo of 玉山銀行智能金融處 (Intelligent Finance Dept., E-SUN Bank ).
玉山銀行智能金融處 (Intelligent Finance Dept., E-SUN Bank )
2020 ~ Present
Taipei, Taiwan
Professional Background
Current status
Employed
Job Search Progress
Professions
Data Engineer, Machine Learning Engineer
Fields of Employment
Software, Banking
Work experience
1-2 years work experience (4-6 years relevant)
Management
I've had experience in managing 1-5 people
Skills
Research
Music Information Retrieval
Natural Language Processing
Information Retrieval
Machine Learning
python programming
C++ Language
Java
Tensorflow (Keras)
Numpy Data Manipulation
sklearn
Linux
Verilog
Arduino
Github
Airflow
Design Patterns
Docker
Python Programming
PyTorch
Pytorch Lightning
PostgreSQL
Languages
English
Intermediate
Job search preferences
Positions
Machine Learning Researcher / Engineer
Job types
Full-time
Locations
台灣台北市, 台灣新竹市
Remote
Interested in working remotely
Freelance
Yes, I freelance in my spare time
Educations
School
National Taiwan University
Major
Graduate Institute of Communication Engineering
Print
Zx6amgvjmkxenvsveonw

Jeffrey Lin (林奕勳)

[email protected] TW, Taipei

CAREER SUMMARY 


  • 6+ years of deep learning research and development experience, with a strong ability in prototyping, evaluation, and deployment of models, in both academical and industrial environment.
  • 2+ years of financial AI development experience, specialized in the design of python API, SDK, and framework for optimizing the deployment flow of data science, ETL, and machine learning pipelines. 
  • 1+ year of participation in the technology management assistant bootcamp, with real-experience on business user support, gaining first-hand insight on financial and corporative AI usage via cross-department collaboration. 
  • Having excellent code-quality standard, efficiency-oriented thinking, and perseverance in face of challenging problems; Continue absorbing “technical-nutrients” from the development of open-source AI community.

PROJECT

Next-Generation ETL Framework for Data Science Team

2022/7-2022/12

  • Goal: 
    1. Design a lightweight ETL python framework enabling data scientists to seamlessly deploy their pandas/SQL processes to airflow in a maintainable format.
    2. Reduce the maintenance and optimization cost of AI projects with integration of customizable data validation, metadata registration, and time/memory-profiling tools. 
    3. Ensure flexibility and extensibility in pandas-like packages (such as CuDF, Polars, PySpark, etc) for future parallelizability, targeting 1000X speedup.
  • Responsibility: Coordinate the design and development of ETL framework components. Give instruction to ETL research intern and provide code review and design guidance for the project participants.
  • Accomplishment: Have wrapped the framework into a workable python package and successfully imported it into two production projects, with ETL business logic automatically rendered on the Airflow UI in an easily traceable way.

Enhance and Maintain House Price and Mortgage Loan API

2022/2-2022/7

  • Responsibility: API Service maintenance, code refactoring, and model performance enhancement for the house-price/mortgage loan automatic appraisal system. 
  • Accomplishment: 
    1. Encrypted API re-routing: 
      • Re-route the internal mortgage API service as an encrypted API service for external client. Understand and implement the RSA/AES double encryption protocol under spec-ambiguous situation by investigating client-side PHP code. 
    2. Code refactor and migration: 
      • Reduce the cognitive complexity of service code from 1000-line-single-file situation by thoroughly understand the business logic and refactoring it into chain-of-command design pattern. 
      •  Design common module with logging mechanism to reduce future maintenance cost. 
      •  Draw and carry out the code migration plan without compromising the 24/7-criteria. 
    3. Re-design of house appraisal model for interpretability:  
      • Coordinate with research institution and business unit to build a business interpretable deep learning model. 
      • Re-design the loss function with regularization term to meet the legal requirements. 
      • Reducing the model inference time by adopting Google’s state-of-the-art nearest neighbor algorithm for referential house searching.  

Scalable ETL Pipeline for Credit Loan Marketing Data

2021/8-2022/1

  • Responsibility: 
    1. Orchestrate the cross-department coordination of the development process.
    2. Design a low-code python-SQL interface for the data-analyst of business unit and provide development and CICD guidance.
    3. Ensure business-logic-extensibility in considerations of future marketing campaign. 
    4. Ensure timely delivery of 7 million customer campaign data under 1day computation criteria.
  • Accomplishment: 
    1. Have helped the data-analyst of business unit successfully incorporate new campaign business logic into the production pipeline.
    2. A comprehensive tutorial to the development interface and CICD toolkits, enabling smooth new feature extension with zero-guidance.
    3. Significantly speedup the pipeline from 5d to 8h via map-reduce parallelization with multiple Airflow worker nodes. 

PyTorch development framework for Industry-Academy Cooperation

2021/4-2022/8

  • Goal: Develop Pytorch model development framework to reduce the academy-to-company code migration cost.
  • Features:
    1. Support training, validation, testing, weight/performance inspection, and  check-pointing of models with different architecture.
    2. Fixed experimentation flow for all models with formatted running script and folder structure, to enhance code readability and reproducibility.
    3. Support lazy evaluation and visualization of data pre-processing, avoid re-generation of data in experimentation scenario.
    4. Supporting parallel model hyper-parameter tuning via Ray (a general python parallelization framework).
  • Tools: Pytorch-lightning, Ray, TensorBoard, pyflow-viz

2D Indoor Positioning System through Android App

2014/1-2015/1 

Instructor: Professor Ren-Song Tsay
  • Survey on the state-of-the-art indoor positioning technologies and write a project proposal. Investigate the relationship between distance and the amplitude of the Bluetooth signal through experiment. Develop an android app that fuses the signals of Bluetooth and Geo sensor for position estimation. 
  • Score: A+ 

RESEARCH

Develop techniques applicable to the music industry and design experiments for proof-of-concept

2015/9-2020/1  

Instructor: Professor Homer H. Chen  

  • Developed a context-based tag propagation method to reduce the training noise and successfully improved a deep music auto-tagging model (SampleCNN) by 25.8% in MAP
  • Enhance the robustness of the tag propagation method and successfully extended the improvement to two other neural networks (CRNN and 2D-CNN) with the increase of improvement reaches 6.5%, 4.9%, and 5.5% (for SampleCNN, CRNN, and 2D-CNN, respectively)
  • Responsibility: Build up the experiment acceleration (via GPU) environment on a Linux-based server; Design the model evaluation and training strategy with TensorFlow. 
  • Conduct error analysis to understand the problem in the training data using Matplotlib and Seaborn. Tune the hyper-parameters during model development. Accelerate the evaluation and data visualization process using ray, a parallel processing package. 

Research proposal in music for lab funding from the Ministry of Science and Technology (MOST)

2015/9-2017/10 

EDUCATION 

M.S. in Graduate Institute of Communication Engineering, National Taiwan University 

2015/9 ~ 2020/1 

  • Multimedia Signal Processing (A) 
  • Special Topics on Internet of Things (A): Develop an IOT product and demonstrate its functionality. 
  • Computational Methods and Tools for Data Science (A): Familiarize myself with basic EDA (exploratory data analysis) techniques including PCA and t-SNE.

B.S. in Electrical Engineering Department, National Tsing Hua University

2011/9 ~ 2015/6

  • Programming Classes: Logic Design (A+) / Logic Design Lab (A+) / Embedding System Lab (A+)  / Computer Programming Design (A+)
  • Math Classes: Calculus I&II (A+/A+) Linear Algebra (A+) / Numerical Analysis (A) 

SKILLS


    • Programming Skills: Design Patterns、ETL、Image Building、Machine Learning、Parallel Acceleration、Python SDK Design、Refactoring 
    • Development Environment: Linux, Docker, Oracle-Cloud, Visual Studio Code 
    • Computer Languages: Python、C++、SQL、JAVA 
    • Tools for Machine Learning and Data Science: Pytorch、Pytorch-Lightning、CuDF (GPU-accelerating dataframe)、Pandas、Networkx、LightGBM、DeepGraphLibrary 
    • Tools for Model Serving, Deployment, and Data Pipeline: Airflow、FastAPI、 k8S、PostgresDB、RedisGraph、Neo4j、Ray For Collaboration: Github、Azure DevOps 

Resume
Profile
Zx6amgvjmkxenvsveonw

Jeffrey Lin (林奕勳)

[email protected] TW, Taipei

CAREER SUMMARY 


  • 6+ years of deep learning research and development experience, with a strong ability in prototyping, evaluation, and deployment of models, in both academical and industrial environment.
  • 2+ years of financial AI development experience, specialized in the design of python API, SDK, and framework for optimizing the deployment flow of data science, ETL, and machine learning pipelines. 
  • 1+ year of participation in the technology management assistant bootcamp, with real-experience on business user support, gaining first-hand insight on financial and corporative AI usage via cross-department collaboration. 
  • Having excellent code-quality standard, efficiency-oriented thinking, and perseverance in face of challenging problems; Continue absorbing “technical-nutrients” from the development of open-source AI community.

PROJECT

Next-Generation ETL Framework for Data Science Team

2022/7-2022/12

  • Goal: 
    1. Design a lightweight ETL python framework enabling data scientists to seamlessly deploy their pandas/SQL processes to airflow in a maintainable format.
    2. Reduce the maintenance and optimization cost of AI projects with integration of customizable data validation, metadata registration, and time/memory-profiling tools. 
    3. Ensure flexibility and extensibility in pandas-like packages (such as CuDF, Polars, PySpark, etc) for future parallelizability, targeting 1000X speedup.
  • Responsibility: Coordinate the design and development of ETL framework components. Give instruction to ETL research intern and provide code review and design guidance for the project participants.
  • Accomplishment: Have wrapped the framework into a workable python package and successfully imported it into two production projects, with ETL business logic automatically rendered on the Airflow UI in an easily traceable way.

Enhance and Maintain House Price and Mortgage Loan API

2022/2-2022/7

  • Responsibility: API Service maintenance, code refactoring, and model performance enhancement for the house-price/mortgage loan automatic appraisal system. 
  • Accomplishment: 
    1. Encrypted API re-routing: 
      • Re-route the internal mortgage API service as an encrypted API service for external client. Understand and implement the RSA/AES double encryption protocol under spec-ambiguous situation by investigating client-side PHP code. 
    2. Code refactor and migration: 
      • Reduce the cognitive complexity of service code from 1000-line-single-file situation by thoroughly understand the business logic and refactoring it into chain-of-command design pattern. 
      •  Design common module with logging mechanism to reduce future maintenance cost. 
      •  Draw and carry out the code migration plan without compromising the 24/7-criteria. 
    3. Re-design of house appraisal model for interpretability:  
      • Coordinate with research institution and business unit to build a business interpretable deep learning model. 
      • Re-design the loss function with regularization term to meet the legal requirements. 
      • Reducing the model inference time by adopting Google’s state-of-the-art nearest neighbor algorithm for referential house searching.  

Scalable ETL Pipeline for Credit Loan Marketing Data

2021/8-2022/1

  • Responsibility: 
    1. Orchestrate the cross-department coordination of the development process.
    2. Design a low-code python-SQL interface for the data-analyst of business unit and provide development and CICD guidance.
    3. Ensure business-logic-extensibility in considerations of future marketing campaign. 
    4. Ensure timely delivery of 7 million customer campaign data under 1day computation criteria.
  • Accomplishment: 
    1. Have helped the data-analyst of business unit successfully incorporate new campaign business logic into the production pipeline.
    2. A comprehensive tutorial to the development interface and CICD toolkits, enabling smooth new feature extension with zero-guidance.
    3. Significantly speedup the pipeline from 5d to 8h via map-reduce parallelization with multiple Airflow worker nodes. 

PyTorch development framework for Industry-Academy Cooperation

2021/4-2022/8

  • Goal: Develop Pytorch model development framework to reduce the academy-to-company code migration cost.
  • Features:
    1. Support training, validation, testing, weight/performance inspection, and  check-pointing of models with different architecture.
    2. Fixed experimentation flow for all models with formatted running script and folder structure, to enhance code readability and reproducibility.
    3. Support lazy evaluation and visualization of data pre-processing, avoid re-generation of data in experimentation scenario.
    4. Supporting parallel model hyper-parameter tuning via Ray (a general python parallelization framework).
  • Tools: Pytorch-lightning, Ray, TensorBoard, pyflow-viz

2D Indoor Positioning System through Android App

2014/1-2015/1 

Instructor: Professor Ren-Song Tsay
  • Survey on the state-of-the-art indoor positioning technologies and write a project proposal. Investigate the relationship between distance and the amplitude of the Bluetooth signal through experiment. Develop an android app that fuses the signals of Bluetooth and Geo sensor for position estimation. 
  • Score: A+ 

RESEARCH

Develop techniques applicable to the music industry and design experiments for proof-of-concept

2015/9-2020/1  

Instructor: Professor Homer H. Chen  

  • Developed a context-based tag propagation method to reduce the training noise and successfully improved a deep music auto-tagging model (SampleCNN) by 25.8% in MAP
  • Enhance the robustness of the tag propagation method and successfully extended the improvement to two other neural networks (CRNN and 2D-CNN) with the increase of improvement reaches 6.5%, 4.9%, and 5.5% (for SampleCNN, CRNN, and 2D-CNN, respectively)
  • Responsibility: Build up the experiment acceleration (via GPU) environment on a Linux-based server; Design the model evaluation and training strategy with TensorFlow. 
  • Conduct error analysis to understand the problem in the training data using Matplotlib and Seaborn. Tune the hyper-parameters during model development. Accelerate the evaluation and data visualization process using ray, a parallel processing package. 

Research proposal in music for lab funding from the Ministry of Science and Technology (MOST)

2015/9-2017/10 

EDUCATION 

M.S. in Graduate Institute of Communication Engineering, National Taiwan University 

2015/9 ~ 2020/1 

  • Multimedia Signal Processing (A) 
  • Special Topics on Internet of Things (A): Develop an IOT product and demonstrate its functionality. 
  • Computational Methods and Tools for Data Science (A): Familiarize myself with basic EDA (exploratory data analysis) techniques including PCA and t-SNE.

B.S. in Electrical Engineering Department, National Tsing Hua University

2011/9 ~ 2015/6

  • Programming Classes: Logic Design (A+) / Logic Design Lab (A+) / Embedding System Lab (A+)  / Computer Programming Design (A+)
  • Math Classes: Calculus I&II (A+/A+) Linear Algebra (A+) / Numerical Analysis (A) 

SKILLS


    • Programming Skills: Design Patterns、ETL、Image Building、Machine Learning、Parallel Acceleration、Python SDK Design、Refactoring 
    • Development Environment: Linux, Docker, Oracle-Cloud, Visual Studio Code 
    • Computer Languages: Python、C++、SQL、JAVA 
    • Tools for Machine Learning and Data Science: Pytorch、Pytorch-Lightning、CuDF (GPU-accelerating dataframe)、Pandas、Networkx、LightGBM、DeepGraphLibrary 
    • Tools for Model Serving, Deployment, and Data Pipeline: Airflow、FastAPI、 k8S、PostgresDB、RedisGraph、Neo4j、Ray For Collaboration: Github、Azure DevOps