CakeResume Talent Search

Advanced filters
On
4-6 years
6-10 years
10-15 years
More than 15 years
Avatar of 鄒適文.
Active
Avatar of 鄒適文.
Active
Past
Lead Data Scientist / Senior Data Scientist @Vinnovation Network 維諾森資訊科技
2022 ~ 2023
資料科學家、資料科學工程師、機器學習工程師
Within one month
Shih-Wen Tsou - With more than 5 years of experience in Data Analysis, Machine Learning and Deep Learning, familiar with Modeling, Data Analysis, Image Processing, Machine Learning, and Deep Learning. Taipei City, Taiwan WORK EXPERIENCE Lead Data Scientist / Full Stack Data Scientist, Vinnovation Network, Taipei, Taiwan Data Engineering / Data Analysis Spearheaded the development of a fully automated data integration pipeline that aggregated diverse data sets into a S3 Data Lake. Successfully integrated a range of data sources, including real-time data feeds from AWS Redshift and DocumentDB, as well as batch processes to import traditional CSV
python
tensorflow
keras
Unemployed
Ready to interview
Full-time / Interested in working remotely
4-6 years
台灣大學
大氣科學所
Avatar of the user.
Avatar of the user.
Data Science @Alfred Labs.
2017 ~ Present
Within two months
Python
R
SQL
4-6 years
University of Taipei
Computer Science
Avatar of Cheung Ka Chun Gordon.
Avatar of Cheung Ka Chun Gordon.
Cloud Engineer @Eclass Limited
2021 ~ Present
More than one year
Gordon (Ka Chun)Cheung Flat A, 9/F, Ho Tat Building, 160, Prince Edward Road West, Hong Kong Contact No. :| Email: [email protected] EDUCATION & QUALIFICATIONS City University of Hong Kong, BEng (Hons) in Information Engineering Sept 2012 – July 2015 Modules Include: Computer Programming, Linear Algebra & Multi-Variable Calculus, Discrete Mathematics for Computing, Logic Circuit Design, Microcomputer Systems, Basic Electronic Circuit, Object-Oriented Programming and Design, Data Structures and Algorithms, Operating Systems, Database Systems, Principles of Communications, Data Communications and Networking, Signals and Systems, Applied Queueing Systems, Information Product Design, Internet Technology, Engineers in Society, Engine
AWS
CCNA
Shell Scripting
Employed
Full-time / Interested in working remotely
6-10 years
City University of Hong Kong
Information Engineering
Avatar of the user.
Avatar of the user.
CONSULTANT ON ENERGY TOPICS @OSEMOSYs and CLEWS Project - UNDESA (USA) and KTH University (SW) 2014 - 2017
2014 ~ 2017
SPECIALIST
More than one year
photoshop
STATA
SPSS Statistics
Full-time / Interested in working remotely
4-6 years
NATIONAL TSING HUA UNIVERSITY
Master in Business Administration and Technology

The Most Lightweight and Effective Recruiting Plan

Search resumes and take the initiative to contact job applicants for higher recruiting efficiency. The Choice of Hundreds of Companies.

  • Browse all search results
  • Unlimited access to start new conversations
  • Resumes accessible for only paid companies
  • View users’ email address & phone numbers
Search Tips
1
Search a precise keyword combination
senior backend php
If the number of the search result is not enough, you can remove the less important keywords
2
Use quotes to search for an exact phrase
"business development"
3
Use the minus sign to eliminate results containing certain words
UI designer -UX
Only public resumes are available with the free plan.
Upgrade to an advanced plan to view all search results including tens of thousands of resumes exclusive on CakeResume.

Definition of Reputation Credits

Technical Skills
Specialized knowledge and expertise within the profession (e.g. familiar with SEO and use of related tools).
Problem-Solving
Ability to identify, analyze, and prepare solutions to problems.
Adaptability
Ability to navigate unexpected situations; and keep up with shifting priorities, projects, clients, and technology.
Communication
Ability to convey information effectively and is willing to give and receive feedback.
Time Management
Ability to prioritize tasks based on importance; and have them completed within the assigned timeline.
Teamwork
Ability to work cooperatively, communicate effectively, and anticipate each other's demands, resulting in coordinated collective action.
Leadership
Ability to coach, guide, and inspire a team to achieve a shared goal or outcome effectively.
Within one month
資料科學工程師
Vinnovation Network 維諾森資訊科技
2022 ~ 2023
Taipei City, Taiwan
Professional Background
Current status
Unemployed
Job Search Progress
Ready to interview
Professions
Data Engineer, Data Scientist, Machine Learning Engineer
Fields of Employment
Artificial Intelligence / Machine Learning
Work experience
4-6 years
Management
I've had experience in managing 5-10 people
Skills
python
tensorflow
keras
machine learning
Deep Learning
pytorch
Fortran
Vim
Git
Statistics
Linear Algebra
PyTorch
Medical Image Processing
LLMs A.I.
GPT
Languages
Chinese
Native or Bilingual
English
Fluent
Job search preferences
Positions
資料科學家、資料科學工程師、機器學習工程師
Job types
Full-time
Locations
台灣台北市, 台灣新竹市
Remote
Interested in working remotely
Freelance
Yes, I freelance in my spare time
Educations
School
台灣大學
Major
大氣科學所
Print

Shih-Wen Tsou

- With more than 5 years of experience in Data Analysis, Machine Learning and Deep Learning, familiar with Modeling, Data Analysis, Image Processing, Machine Learning, and Deep Learning.

  Taipei City, Taiwan      

WORK EXPERIENCE

Lead Data Scientist / Full Stack Data Scientist, Vinnovation Network, Taipei, Taiwan

Data Engineering / Data Analysis

  • Spearheaded the development of a fully automated data integration pipeline that aggregated diverse data sets into a S3 Data Lake.
  • Successfully integrated a range of data sources, including real-time data feeds from AWS Redshift and DocumentDB, as well as batch processes to import traditional CSV files.
  • Utilized Databricks for large-scale data processing, leveraging its Spark capabilities to efficiently transform and aggregate incoming data streams.
  • With the combined power of Databricks and AWS Lambda, ensured unparalleled data consistency, quality, and preparedness for sophisticated analytics and reporting.
  • Utilized Databricks and Airflow to run extensive data profiling tasks, analyzing data patterns and identifying potential quality issues before they reached the Databricks Delta Lake.
  • Established robust guardrails using the combined might of AWS Lambda, Apache Airflow and Databricks, ensuring that data stored in the DataBricks Delta Lake consistently met the highest quality benchmarks.

MLOps / Machine Learning / Data Science

  • Utilized Databricks to build a LightGCN-based recommendation system, fine-tuning for precise content delivery. Monitored model versions with MLflow, ensuring continuous integration.
  • Seamlessly merged our recommendation system with Databricks Delta Lake, maintaining a high-quality data influx and elevating system performance.
  • Developed a comprehensive MLOps service on Databricks, spanning from preprocessing to deployment. Leveraged automation tools to swiftly adapt models to new data.
  • Developed an API service used as an internal tool within the company, leveraging OpenAI Whisper for automatic speech recognition and ChatGPT for intelligent language translation, resulting in approximately an 80% reduction in manpower and time costs.
  • Fine-tuned a Llama-based Large Language Model (LLM) and, by integrating Langchain and Pinecone, optimized the search experience on our website.

Apr. 2022 - Jul. 2023

Machine Learning Researcher, Lab. for Cloud Dynamics and Modeling, National Taiwan University, Taipei, Taiwan    

  • Developed a model using U-net to detect ship tracks in satellite images, resulting in an 80% times savings.

  • Cooperated with multiple domain experts, including Atmospheric Science and Environmental Science, to solve problems with machine learning techniques.

  • Developed a model to classify Typhoon tracks with 96.5% accuracy rate, where the traditional method is about 80%.

  • Configured and managed a GPU-enforced workstation for the lab members to execute High Performance Computing (HPC) tasks.

Sep. 2021 - Mar. 2022

Data Scientist, Vizuro, Taipei, Taiwan

  • Developed an end-to-end pipeline to detect Breast Cancer in 3D Breast MRI images, encompassing data storage, data pre-processing, and detection model building.

  • Leveraged the power of deep learning algorithms, fusing them with insights gathered from medical research, to refine and augment the performance of our breast cancer diagnosis model.

  • Deployed the Deep-learning Breast Cancer Detection model integrated to the hospital PACS system.

  • Developed a model to segment 3D breast MR images and deployed it to ImageJ to expedite annotation and reduce labeling time.

May. 2019 - Sep. 2021

EDUCATION

Sep. 2016 - Jan. 2019

National Taiwan University

Master of Science in Atmospheric Sciences

Sep. 2011 - Jun. 2016

National Taiwan University

Bachelor of Science in Atmospheric Sciences

SKILL

Programming: Python, R, C/C++, GO, Matlab, Fortran

Machine Learning:

  • Traditional: Random Forest, XGBoost, K-means, DBSCAN, PCA, t-SNE
  • Deep Learning: CNNs, RNNs, transformers, GPT, Fast R-CNN series, U-Net, DCGANs, Whisper, Explainable AI techniques

Databases: PostgreSQL, MySQL, MongoDB

Data Engineering: Databricks, PySpark, Airflow, Airbyte

Cloud (AWS): Lambda, S3, EC2, Personalize, VPC

Side Projects


  • Lightning Prediction with Deep Learning and explain the model with physical methods.

  • Learning to generate the Manhattan building with Deep Convolutional GAN from OpenStreeMap building model.

  • Predicting short-term stock market price trends with Machine Learning.

  • Build an Investment Portfolio machine with a Rebalancing Strategy from scratch.

Resume
Profile

Shih-Wen Tsou

- With more than 5 years of experience in Data Analysis, Machine Learning and Deep Learning, familiar with Modeling, Data Analysis, Image Processing, Machine Learning, and Deep Learning.

  Taipei City, Taiwan      

WORK EXPERIENCE

Lead Data Scientist / Full Stack Data Scientist, Vinnovation Network, Taipei, Taiwan

Data Engineering / Data Analysis

  • Spearheaded the development of a fully automated data integration pipeline that aggregated diverse data sets into a S3 Data Lake.
  • Successfully integrated a range of data sources, including real-time data feeds from AWS Redshift and DocumentDB, as well as batch processes to import traditional CSV files.
  • Utilized Databricks for large-scale data processing, leveraging its Spark capabilities to efficiently transform and aggregate incoming data streams.
  • With the combined power of Databricks and AWS Lambda, ensured unparalleled data consistency, quality, and preparedness for sophisticated analytics and reporting.
  • Utilized Databricks and Airflow to run extensive data profiling tasks, analyzing data patterns and identifying potential quality issues before they reached the Databricks Delta Lake.
  • Established robust guardrails using the combined might of AWS Lambda, Apache Airflow and Databricks, ensuring that data stored in the DataBricks Delta Lake consistently met the highest quality benchmarks.

MLOps / Machine Learning / Data Science

  • Utilized Databricks to build a LightGCN-based recommendation system, fine-tuning for precise content delivery. Monitored model versions with MLflow, ensuring continuous integration.
  • Seamlessly merged our recommendation system with Databricks Delta Lake, maintaining a high-quality data influx and elevating system performance.
  • Developed a comprehensive MLOps service on Databricks, spanning from preprocessing to deployment. Leveraged automation tools to swiftly adapt models to new data.
  • Developed an API service used as an internal tool within the company, leveraging OpenAI Whisper for automatic speech recognition and ChatGPT for intelligent language translation, resulting in approximately an 80% reduction in manpower and time costs.
  • Fine-tuned a Llama-based Large Language Model (LLM) and, by integrating Langchain and Pinecone, optimized the search experience on our website.

Apr. 2022 - Jul. 2023

Machine Learning Researcher, Lab. for Cloud Dynamics and Modeling, National Taiwan University, Taipei, Taiwan    

  • Developed a model using U-net to detect ship tracks in satellite images, resulting in an 80% times savings.

  • Cooperated with multiple domain experts, including Atmospheric Science and Environmental Science, to solve problems with machine learning techniques.

  • Developed a model to classify Typhoon tracks with 96.5% accuracy rate, where the traditional method is about 80%.

  • Configured and managed a GPU-enforced workstation for the lab members to execute High Performance Computing (HPC) tasks.

Sep. 2021 - Mar. 2022

Data Scientist, Vizuro, Taipei, Taiwan

  • Developed an end-to-end pipeline to detect Breast Cancer in 3D Breast MRI images, encompassing data storage, data pre-processing, and detection model building.

  • Leveraged the power of deep learning algorithms, fusing them with insights gathered from medical research, to refine and augment the performance of our breast cancer diagnosis model.

  • Deployed the Deep-learning Breast Cancer Detection model integrated to the hospital PACS system.

  • Developed a model to segment 3D breast MR images and deployed it to ImageJ to expedite annotation and reduce labeling time.

May. 2019 - Sep. 2021

EDUCATION

Sep. 2016 - Jan. 2019

National Taiwan University

Master of Science in Atmospheric Sciences

Sep. 2011 - Jun. 2016

National Taiwan University

Bachelor of Science in Atmospheric Sciences

SKILL

Programming: Python, R, C/C++, GO, Matlab, Fortran

Machine Learning:

  • Traditional: Random Forest, XGBoost, K-means, DBSCAN, PCA, t-SNE
  • Deep Learning: CNNs, RNNs, transformers, GPT, Fast R-CNN series, U-Net, DCGANs, Whisper, Explainable AI techniques

Databases: PostgreSQL, MySQL, MongoDB

Data Engineering: Databricks, PySpark, Airflow, Airbyte

Cloud (AWS): Lambda, S3, EC2, Personalize, VPC

Side Projects


  • Lightning Prediction with Deep Learning and explain the model with physical methods.

  • Learning to generate the Manhattan building with Deep Convolutional GAN from OpenStreeMap building model.

  • Predicting short-term stock market price trends with Machine Learning.

  • Build an Investment Portfolio machine with a Rebalancing Strategy from scratch.