CakeResume Talent Search

Advanced filters
On
4-6 tahun
6-10 tahun
10-15 tahun
Lebih dari 15 tahun
United States
Avatar of Vel Tien-Yun Wu.
Avatar of Vel Tien-Yun Wu.
Data Engineer @Groundhog Technologies Inc.
2021 ~ 2024
Data Analyst、Data Engineer、Data Scientist、Customer Experience Analyst
Dalam satu bulan
Vel Tien-Yun Wu I bring 5 years of hands-on experience in data engineering and software development, with a focus on building scalable data processing systems utilizing Hadoop, Spark, Kafka and Docker. My expertise in developing efficient ETL pipelines has been fundamental in optimizing data workflows for various data warehouses, enhancing data integrity and availability. My track record includes managing high-volume data pipelines, automating scheduling processes to improve operational efficiency, and deploying monitoring solutions that have reduced Mean-Time-To-Repair (MTTR) by 40%. I have a strong foundation in SQL, especially PostgreSQL, which enables
Git
Python
Scala
Sudah bekerja
Siap untuk wawancara
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
University of Illinois at Urbana-Champaign, School of Information Sciences
Information Management
Avatar of Alexander Kahoun.
Avatar of Alexander Kahoun.
Staff Software Engineer @Sibi
2018 ~ Sekarang
Staff Engineer/Manager
Dalam enam bulan
internal cloud-based secure file management system. - Mentored client engineers on how to maintain and enhance systems - Helped client interview and hire permanent Lead Architect and managed transition Staff Software Engineer/Data Engineer • DeMark Analytics AprilNovemberLead Data Pipeline Engineer - Implemented and maintained lambda architecture for data processing - Leveraged Hadoop and Spark to true up data in CassandraDB - Consumed terrabytes of data per hour through TCP stream and combined together with Storm and loaded into CassandraDB and RabbitMQ for quick consumption - Helped DevOps implement IaC for data pipeline with Chef - Led migration from Scala to Clojure Senior Consultant III
Mentoring
Learning
Leader
Sudah bekerja
Full-time / Hanya bekerja jarak jauh
Lebih dari 15 tahun
Temple University
Information Science & Technology
Avatar of the user.
Avatar of the user.
創辦人 @酷喬伊科技有限公司
2020 ~ Sekarang
Python developer
Dalam satu bulan
PyTorch
Python
PostgreSQL
Sudah bekerja
Tidak terbuka untuk peluang
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
Fu Jen Catholic University
Major, Optical Physics, Minor, Finance and international business

Paket Perekrutan Paling Mudah dan Efektif, Pilihan Ratusan Perusahaan

Cari lebih dari 800 ribu CV dan ambil aksi menghubungi pelamar kerja untuk rekrutmen yang lebih efektif. Pilihan ratusan perusahaan.

  • Lihat semua hasil pencarian
  • Tanpa batas harian untuk memulai pesan baru
  • CV dapat diakses oleh perusahaan berbayar
  • Lihat email pengguna & nomor telepon
Tips pencarian
1
Search a precise keyword combination
senior backend php
If the number of the search result is not enough, you can remove the less important keywords
2
Use quotes to search for an exact phrase
"business development"
3
Use the minus sign to eliminate results containing certain words
UI designer -UX
Hanya CV publik yang tersedia dengan paket gratis.
Upgrade ke paket lanjutan untuk melihat semua hasil pencarian, termasuk 10.000 lebih CV eksklusif di Cake Resume.

Definition of Reputation Credits

Technical Skills
Specialized knowledge and expertise within the profession (e.g. familiar with SEO and use of related tools).
Problem-Solving
Ability to identify, analyze, and prepare solutions to problems.
Adaptability
Ability to navigate unexpected situations; and keep up with shifting priorities, projects, clients, and technology.
Communication
Ability to convey information effectively and is willing to give and receive feedback.
Time Management
Ability to prioritize tasks based on importance; and have them completed within the assigned timeline.
Teamwork
Ability to work cooperatively, communicate effectively, and anticipate each other's demands, resulting in coordinated collective action.
Leadership
Ability to coach, guide, and inspire a team to achieve a shared goal or outcome effectively.
Dalam dua bulan
Senior Data Engineer
Logo of Microsoft.
Microsoft
2021 ~ Sekarang
New Taipei City, 台灣
Latar Belakang Profesional
Status sekarang
Sudah bekerja
Tahap pencarian kerja
Profesi
Data Engineer, Data Scientist, Big Data Engineer
Bidang Pekerjaan
Layanan Informasi, Big Data, Intelegensi Artifisial/Pemelajaran Mesin
Pengalaman Kerja
4-6 tahun
Management
Saya berpengalaman mengelola 1-5 orang
Keterampilan
R
Python
C++
Matlab
Shell Script
machine learning
Deep Learning
Data Analysis
Data Mining
Data Science
Data Cleaning
apache hive
Apache Spark
hadoop ecosystem
Oracle
MySQL
SQL
PowerPoint
Statistics
AWS
Docker
Bash
Scala
Azure
Bahasa
Chinese
Bahasa ibu atau Bilingual
English
Fasih
Japanese
Fasih
Preferensi Pencarian Pekerjaan
Jabatan
資料科學家、資料工程師、資料分析師
Tipe Pekerjaan
Full-time
Lokasi
Taipei, 台灣, Japan, USA, Canada, UK, Netherlands, Germany, Switzerland
Bekerja jarak jauh
Tertarik bekerja jarak jauh
Freelance
Tidak
Pendidikan
Institusi Pendidikan
National Cheng Kung University,
Jurusan
Statistics
Cetak

Ching-Chuan Chen 陳慶全

資料科學家、資料工程師、資料分析師  •  City, TW  •  [email protected]

Data engineer and data scientist with over four half years of experience. Proven success in processing big volume of data (6TB per day) in Spark in Scala and MPI in R and Python, developing a machine learning model with Spark in Scala on 30 billions of records for IoT device recognition and developing algorithms to classify unlabeled network behaviors of customers to protect their devices from compromising. Skilled in programming, machine learning, cross-functional communication skills and creative problem solving. Objective is to become an expert in data science field to make people’s life better with statistics and machine learning.

技能


Advanced

 ♦ R / Python / SQL 

 ♦ Statistics 

 ♦ Machine Learning 

 ♦ Spark / Parallel Computing (MPI)


Intermediate

 ♦ Hive / HDFS / MongoDB

 ♦ Scala / C++ / Shell Script

 ♦ Docker

 ♦ Data Visualization / Tableau


Basic

 ♦ AWS / Azure

 ♦ Rust

 ♦ Java

工作經歷

Microsoft, Data Engineer II, Jan 2021 ~ 現在

** Quality Management System – Data Engineer
• Collect URD from quality engineers to develop systems for tracking quality issues.
• Design QMS for quality engineers to trace PRR, VLRR and perform FA of Azure hardware like server, CPU, memory, SSD, HDD and motherboard.

Company@2x

Trend Micro Inc., Senior Data Scientist, Jan 2019 ~ Jan 2021

** Home Network Security – Data Engineer
• Reduced 90% time of reports from 1B security events every day. This helps marketing and sales people in Japan, Singapore and Australlia to find opportunities to improve business.
• Visualized the relationship between security events for thread experts with word2vec and t-SNE.

** Network Behavior Analysis Project – Data Scientist
• Developed a machine learning model to recognize IoT devices based on 30 billion records of netflows via Spark in Scala and Python.
• Reached a 90% accuracy rate in identifying periodic network behaviors of IoT devices with a statistical model.

Company@2x

TSMC, Senior Data Engineer an Data Scientist, Jul 2016 ~ Jan 2019

** Yield Improvement Project – Data Engineer and Data Scientist
• Processed the big volume of data (6TB per day) to maintain a data warehouse for machine learning projects.
• Reduced the out-of-control rate by 30% via a statistical model.
• Reduced scrapping rate by 80% with homemade anomaly detection algorithms.
• Reduced 80% time to find key factors of yield rates via data visualization and statistics

** Big Data Solutions – Data Engineer
• Digest 6TB data per day by building an on-premise big data solution via Scala, Spark and Hive.
• Reduced 95% implementation time of machine learning algorithms via R, MPI, Hive and Spark.

** Weekly Productivity Improvement Program – Leader
• Developed R packages to reduce reinventing the wheels and increase productivity.
• Taught writing clean and performant codes to data scientists and data engineers.
• Organized study groups to share knowledge of machine learning and statistics with colleagues.

Company@2x

Academia Sinica, Full-time Research Assistant, Sep 2015 ~ Jun 2016

** Main role
• Decreased data processing time by 80% via R and MongoDB to process millions of records of data per day.
• Got a 40% lowered RMSE in imputing missing values with home-made machine learning than other methods.

Company@2x

學歷

National Cheng Kung University,, 碩士學位, Statistics, 2012 ~ 2014

== Achievements ==
• Completed a master’s thesis entitled “A Classification Approach Based on Density Ratio Estimation with Subspace Projection.” Advisor: Ray-Bing Chen.
• Earned a grade of 95% in my statistical methods, generalized linear models, and statistical data mining classes, and 92% in my linear models class. I am thus confident with building models and inferences from models.
• Completed an advanced probability theory class designed for Ph. D. students.

National Cheng Kung University, 學士學位, Economics and Statistics, 2008 ~ 2012

With an advanced plan and hard work, I earned 175 credits for 2 majors within 4 years.

CV
Profil

Ching-Chuan Chen 陳慶全

資料科學家、資料工程師、資料分析師  •  City, TW  •  [email protected]

Data engineer and data scientist with over four half years of experience. Proven success in processing big volume of data (6TB per day) in Spark in Scala and MPI in R and Python, developing a machine learning model with Spark in Scala on 30 billions of records for IoT device recognition and developing algorithms to classify unlabeled network behaviors of customers to protect their devices from compromising. Skilled in programming, machine learning, cross-functional communication skills and creative problem solving. Objective is to become an expert in data science field to make people’s life better with statistics and machine learning.

技能


Advanced

 ♦ R / Python / SQL 

 ♦ Statistics 

 ♦ Machine Learning 

 ♦ Spark / Parallel Computing (MPI)


Intermediate

 ♦ Hive / HDFS / MongoDB

 ♦ Scala / C++ / Shell Script

 ♦ Docker

 ♦ Data Visualization / Tableau


Basic

 ♦ AWS / Azure

 ♦ Rust

 ♦ Java

工作經歷

Microsoft, Data Engineer II, Jan 2021 ~ 現在

** Quality Management System – Data Engineer
• Collect URD from quality engineers to develop systems for tracking quality issues.
• Design QMS for quality engineers to trace PRR, VLRR and perform FA of Azure hardware like server, CPU, memory, SSD, HDD and motherboard.

Company@2x

Trend Micro Inc., Senior Data Scientist, Jan 2019 ~ Jan 2021

** Home Network Security – Data Engineer
• Reduced 90% time of reports from 1B security events every day. This helps marketing and sales people in Japan, Singapore and Australlia to find opportunities to improve business.
• Visualized the relationship between security events for thread experts with word2vec and t-SNE.

** Network Behavior Analysis Project – Data Scientist
• Developed a machine learning model to recognize IoT devices based on 30 billion records of netflows via Spark in Scala and Python.
• Reached a 90% accuracy rate in identifying periodic network behaviors of IoT devices with a statistical model.

Company@2x

TSMC, Senior Data Engineer an Data Scientist, Jul 2016 ~ Jan 2019

** Yield Improvement Project – Data Engineer and Data Scientist
• Processed the big volume of data (6TB per day) to maintain a data warehouse for machine learning projects.
• Reduced the out-of-control rate by 30% via a statistical model.
• Reduced scrapping rate by 80% with homemade anomaly detection algorithms.
• Reduced 80% time to find key factors of yield rates via data visualization and statistics

** Big Data Solutions – Data Engineer
• Digest 6TB data per day by building an on-premise big data solution via Scala, Spark and Hive.
• Reduced 95% implementation time of machine learning algorithms via R, MPI, Hive and Spark.

** Weekly Productivity Improvement Program – Leader
• Developed R packages to reduce reinventing the wheels and increase productivity.
• Taught writing clean and performant codes to data scientists and data engineers.
• Organized study groups to share knowledge of machine learning and statistics with colleagues.

Company@2x

Academia Sinica, Full-time Research Assistant, Sep 2015 ~ Jun 2016

** Main role
• Decreased data processing time by 80% via R and MongoDB to process millions of records of data per day.
• Got a 40% lowered RMSE in imputing missing values with home-made machine learning than other methods.

Company@2x

學歷

National Cheng Kung University,, 碩士學位, Statistics, 2012 ~ 2014

== Achievements ==
• Completed a master’s thesis entitled “A Classification Approach Based on Density Ratio Estimation with Subspace Projection.” Advisor: Ray-Bing Chen.
• Earned a grade of 95% in my statistical methods, generalized linear models, and statistical data mining classes, and 92% in my linear models class. I am thus confident with building models and inferences from models.
• Completed an advanced probability theory class designed for Ph. D. students.

National Cheng Kung University, 學士學位, Economics and Statistics, 2008 ~ 2012

With an advanced plan and hard work, I earned 175 credits for 2 majors within 4 years.