CakeResume Talent Search

Avatar of Vel Tien-Yun Wu.
Data Engineer @Groundhog Technologies Inc.
2021 ~ 2024
Data Analyst, Data Engineer, Data Scientist, Customer Experience Analyst
Within one month
and collaborate with a dynamic team. New Taipei City, Taiwan Work Experience Data Engineer • Groundhog Technologies Inc. July - Present - Built and maintained data pipelines (through which several hundred million rows of data flow daily) using Scala Spark/Hadoop - Managed cron jobs and performed regular data recovery using Apache Airflow - Performed regular extract, transform, load (ETL) operations through Hive and HDFS command-line interfaces - Utilized streaming technologies such as Kafka to store data into various pools and lakes Graduate Hourly Employee • University of Illinois at Urbana-Champaign September - December 2020 Designed and developed student assessments/assignments for
Git
Python
Scala
Employed
Ready to interview
Full-time / Interested in working remotely
4-6 years
University of Illinois at Urbana-Champaign, School of Information Sciences
Information Management
Avatar of 鄒適文.
Active
Past
Lead Data Scientist / Senior Data Scientist @Vinnovation Network 維諾森資訊科技
2022 ~ 2023
Data Scientist, Data Science Engineer, Machine Learning Engineer
Within one month
import traditional CSV files. Utilized Databricks for large-scale data processing, leveraging its Spark capabilities to efficiently transform and aggregate incoming data streams. With the combined power of Databricks and AWS Lambda, ensured unparalleled data consistency, quality, and preparedness for sophisticated analytics and reporting. Utilized Databricks and Airflow to run extensive data profiling tasks, analyzing data patterns and identifying potential quality issues before they reached the Databricks Delta Lake. Established robust guardrails using the combined might of AWS Lambda, Apache Airflow and Databricks, ensuring that data stored in the Databricks Delta Lake consistently met the highest
python
tensorflow
keras
Unemployed
Ready to interview
Full-time / Interested in working remotely
4-6 years
National Taiwan University
Graduate Institute of Atmospheric Sciences
Avatar of the user.
Lead Fullstack Engineer @Proto Research Inc
2022 ~ Present
Front-End / Back-End / Full Stack Web Developer
Within one month
Python
React.js
Next.js
Employed
Open to opportunities
Full-time / Remote Only
6-10 years
National Taiwan University
Mechanical Engineering
Avatar of the user.
Principal Engineer @Coretronic Corporation, 中強光電
2020 ~ 2022
Machine Learning Engineer
Within one month
Python
Linux
C++
Employed
Full-time / Interested in working remotely
6-10 years
National Taiwan University
Economics
Avatar of Red Cho.
Team Lead/Senior Software Engineer @MEXC
2022 ~ Present
Software Engineer
Within one month
stable API under high QPS with load testing / concurrent testing. Software Engineer • Gogoro/GoShare • 2019/11 ~ 2022/01 - Implemented Tarjan's strongly connected components algorithm and a topological sort algorithm in the task management system. - Developed a customized synchronizer system by combining core concepts from Apache Airflow and Apache Kafka Connect. - Experienced in performance, load, stress, and unit testing. - Experienced in resolving a Java native memory leak in a JIT C2 level 4 compiler issue. Software Engineer • Lang Live • 2019/03 ~ 2019/10 - Dealt with 10 million+
PHP
Python
Docker
Employed
Full-time / Interested in working remotely
4-6 years
Avatar of the user.
Data Scientist/ Engineer @Intomarkets
2022 ~ Present
Data Scientist
More than one year
Machine Learning
Statistical Analysis
Big Data
Full-time / Interested in working remotely
6-10 years
University of Paderborn
Master of Science in Computer
Avatar of 邱政亞.
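Full-Stack Engineer @崴浤科技
2015 ~ Present
Data Engineer
Within two months
data scraping and logic computation, and finally presenting the data to the front end as web pages. The whole system is self-hosted on a Linode cloud server running Ubuntu LTS, with nginx, MySQL, phpMyAdmin, Python, and Apache Airflow installed. Apache Airflow pitfalls encountered: 1. Every task run crashed the Scheduler and MySQL; the MySQL log suggested that the host was running out of memory, so the Linode host was moved from RAM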
Python
MySQL
Airflow
Employed
Full-time / Interested in working remotely
6-10 years
National Taichung University of Science and Technology
Information Management
Avatar of 蔡智鈞.
Principal Engineer @HTC
2023 ~ Present
Software Engineer / Backend Engineer
Within one month
Air quality real-time monitoring map 1. Utilized the ASP.NET MVC framework to build a low-cost PM2.5 sensing platform for the Environmental Protection Administration, Executive Yuan, R.O.C. 2. Integrated the Google Maps API to display GIS base data, and leveraged Apache ECharts to present historical data. 3. Developed a Messenger bot to provide convenient data querying services. Education AppWorks School Blockchain Program National Chi Nan University, Department of Information Management National Chi Nan University
node.js / express.js
Python
Kafka
Employed
Open to opportunities
Full-time / Interested in working remotely
4-6 years
AppWorks School
Blockchain Program
Avatar of Chin-Hung (Wilson) Liu.
Principal Engineer, Data Engineering @KKCompany
2023 ~ Present
Backend Engineer, Data Engineer, MLOps Engineer
Within one month
operation, moderating model training results, and designing SLIs/SLOs for EKS clusters. More responsibilities/details below. Optimize the music (UMG) pipeline's queries and memory usage for Elasticsearch and PostgreSQL, cutting execution time by about 90%, from 10+ hours to 40 minutes. Migrate services from Apache Spark and AWS Data Lake Formation to an AWS MWAA and EKS Airflow environment. Design and deliver a distributed system for Ray Serve with the AI team. Design and implement a modern machine learning pipeline for recommendation and moderation. Design SLAs and implement an alert log reporting system (history logs
Big Data
Data Engineering
ETL
Employed
Open to opportunities
Full-time / Interested in working remotely
10-15 years
National Taiwan University
EMBA Programs, Business Administration, Accounting, Finance and International Business.
Avatar of 張致瑋.
BI/DATA Engineer
Within one month
Chih-Wei Chang Data Engineer | BI | Web3 • Aiming to forge a progressive career in business intelligence, employing my analytical and technical prowess in data engineering to effectively further business objectives. BI/data engineering experience across different industries; substantial experience designing and executing solutions for business problems involving large-scale data warehousing. Design and implement data warehouse solutions for different project stages. Familiar with designing, implementing, and maintaining ETL processes. Abilities Programming Python SQL Dataflow Cloud Pubsub ETL MSBI Hive Airflow Tableau Docker Trinity DW Platforms AWS GCP People/Personal Project
SQL
my-sql
ETL
Full-time / Interested in working remotely
6-10 years

The Most Lightweight and Effective Recruiting Plan

Search resumes and take the initiative to contact job applicants for higher recruiting efficiency. The Choice of Hundreds of Companies.

  • Browse all search results
  • Unlimited access to start new conversations
  • Resumes accessible for only paid companies
  • View users’ email address & phone numbers
Search Tips
1. Search a precise keyword combination, e.g. senior backend php. If there are not enough search results, you can remove the less important keywords.
2. Use quotes to search for an exact phrase, e.g. "business development".
3. Use the minus sign to exclude results containing certain words, e.g. UI designer -UX.
Only public resumes are available with the free plan.
Upgrade to an advanced plan to view all search results including tens of thousands of resumes exclusive on CakeResume.

Definition of Reputation Credits

Technical Skills
Specialized knowledge and expertise within the profession (e.g. familiar with SEO and use of related tools).
Problem-Solving
Ability to identify, analyze, and prepare solutions to problems.
Adaptability
Ability to navigate unexpected situations; and keep up with shifting priorities, projects, clients, and technology.
Communication
Ability to convey information effectively and willingness to give and receive feedback.
Time Management
Ability to prioritize tasks based on importance; and have them completed within the assigned timeline.
Teamwork
Ability to work cooperatively, communicate effectively, and anticipate each other's demands, resulting in coordinated collective action.
Leadership
Ability to coach, guide, and inspire a team to achieve a shared goal or outcome effectively.
Within one month
Data Science Engineer
Vinnovation Network 維諾森資訊科技
2022 ~ 2023
Taipei City, Taiwan
Professional Background
Current status
Unemployed
Job Search Progress
Ready to interview
Professions
Data Engineer, Data Scientist, Machine Learning Engineer
Fields of Employment
Artificial Intelligence / Machine Learning
Work experience
4-6 years
Management
I've had experience in managing 5-10 people
Skills
python
tensorflow
keras
machine learning
Deep Learning
pytorch
Fortran
Vim
Git
Statistics
Linear Algebra
PyTorch
Medical Image Processing
LLMs A.I.
GPT
Languages
Chinese
Native or Bilingual
English
Fluent
Job search preferences
Positions
Data Scientist, Data Science Engineer, Machine Learning Engineer
Job types
Full-time
Locations
Taipei City, Taiwan; Hsinchu City, Taiwan
Remote
Interested in working remotely
Freelance
Yes, I freelance in my spare time
Educations
School
National Taiwan University
Major
Graduate Institute of Atmospheric Sciences

Shih-Wen Tsou

- More than 5 years of experience in data analysis, machine learning, and deep learning; familiar with modeling and image processing.

  Taipei City, Taiwan      

WORK EXPERIENCE

Lead Data Scientist / Full Stack Data Scientist, Vinnovation Network, Taipei, Taiwan

Data Engineering / Data Analysis

  • Spearheaded the development of a fully automated data integration pipeline that aggregated diverse data sets into an S3 Data Lake.
  • Successfully integrated a range of data sources, including real-time data feeds from AWS Redshift and DocumentDB, as well as batch processes to import traditional CSV files.
  • Utilized Databricks for large-scale data processing, leveraging its Spark capabilities to efficiently transform and aggregate incoming data streams.
  • With the combined power of Databricks and AWS Lambda, ensured unparalleled data consistency, quality, and preparedness for sophisticated analytics and reporting.
  • Utilized Databricks and Airflow to run extensive data profiling tasks, analyzing data patterns and identifying potential quality issues before they reached the Databricks Delta Lake.
  • Established robust guardrails using the combined might of AWS Lambda, Apache Airflow and Databricks, ensuring that data stored in the Databricks Delta Lake consistently met the highest quality benchmarks.

MLOps / Machine Learning / Data Science

  • Utilized Databricks to build a LightGCN-based recommendation system, fine-tuning for precise content delivery. Monitored model versions with MLflow, ensuring continuous integration.
  • Seamlessly merged our recommendation system with Databricks Delta Lake, maintaining a high-quality data influx and elevating system performance.
  • Developed a comprehensive MLOps service on Databricks, spanning from preprocessing to deployment. Leveraged automation tools to swiftly adapt models to new data.
  • Developed an API service used as an internal tool within the company, leveraging OpenAI Whisper for automatic speech recognition and ChatGPT for intelligent language translation, resulting in approximately an 80% reduction in manpower and time costs.
  • Fine-tuned a Llama-based Large Language Model (LLM) and, by integrating Langchain and Pinecone, optimized the search experience on our website.

Apr. 2022 - Jul. 2023

Machine Learning Researcher, Lab. for Cloud Dynamics and Modeling, National Taiwan University, Taipei, Taiwan    

  • Developed a model using U-Net to detect ship tracks in satellite images, resulting in an 80% time savings.

  • Cooperated with multiple domain experts, including in Atmospheric Science and Environmental Science, to solve problems with machine learning techniques.

  • Developed a model to classify typhoon tracks with a 96.5% accuracy rate, compared with about 80% for the traditional method.

  • Configured and managed a GPU-equipped workstation for the lab members to execute High Performance Computing (HPC) tasks.

Sep. 2021 - Mar. 2022

Data Scientist, Vizuro, Taipei, Taiwan

  • Developed an end-to-end pipeline to detect Breast Cancer in 3D Breast MRI images, encompassing data storage, data pre-processing, and detection model building.

  • Leveraged the power of deep learning algorithms, fusing them with insights gathered from medical research, to refine and augment the performance of our breast cancer diagnosis model.

  • Deployed the deep-learning breast cancer detection model, integrated with the hospital PACS.

  • Developed a model to segment 3D breast MR images and deployed it to ImageJ to expedite annotation and reduce labeling time.

May 2019 - Sep. 2021

EDUCATION

Sep. 2016 - Jan. 2019

National Taiwan University

Master of Science in Atmospheric Sciences

Sep. 2011 - Jun. 2016

National Taiwan University

Bachelor of Science in Atmospheric Sciences

SKILLS

Programming: Python, R, C/C++, Go, MATLAB, Fortran

Machine Learning:

  • Traditional: Random Forest, XGBoost, K-means, DBSCAN, PCA, t-SNE
  • Deep Learning: CNNs, RNNs, transformers, GPT, Fast R-CNN series, U-Net, DCGANs, Whisper, Explainable AI techniques

Databases: PostgreSQL, MySQL, MongoDB

Data Engineering: Databricks, PySpark, Airflow, Airbyte

Cloud (AWS): Lambda, S3, EC2, Personalize, VPC

Side Projects


  • Lightning prediction with deep learning, with the model explained using physical methods.

  • Generating Manhattan buildings with a Deep Convolutional GAN from OpenStreetMap building models.

  • Predicting short-term stock market price trends with machine learning.

  • Building an investment portfolio machine with a rebalancing strategy from scratch.
