CakeResume Talent Search

Advanced filters
On
4-6 years
6-10 years
10-15 years
More than 15 years
Avatar of Yen-Ting Liu.
Offline
Avatar of Yen-Ting Liu.
Offline
Data Engineer @Tesla
2023 ~ 2023
Data engineer / Data anyayst
Within two months
data that included geo-location data from BigQuery and deployed it on the GCP environment. The API saved 80% of the time on fetching data (Cloud Run, IAM, BigQuery) 十月七月 2021 Data engineer • 富盈數據 Maintained distributed system and database • Constructed and managed the Hadoop ecosystem with Ambari. Built ETL pipeline to query multi-source database which processing more than three terabytes (TB) provided 90% of the analysis needs (Hive, HBase, Python, ELK, MySQL) • Established data collection and analysis workflow, saving Data scientists’ 30% of the time to analyze and build machine learning
python
Linux
R
Employed
Ready to interview
Full-time / Interested in working remotely
4-6 years
University of Texas at Dallas
Information Technology and Management
Avatar of Chin-Hung (Wilson) Liu.
Avatar of Chin-Hung (Wilson) Liu.
Principal Engineer, Data Engineering @KKCompany
2023 ~ Present
Backend Engineer, Data Engineer, MLOps Engineer
Within one month
Chin-Hung (Wilson) Liu I am a lead architect responsible for designing and implementing a large-scale data pipeline for Lomotif, Paktor x 17LIVE, utilizing GCP/AWS/Python/Scala, in collaboration with data science and machine learning teams in Singapore and TW HQ, as well as with the Hadoop ecosystem (HDFS/HBase/Kafka) at JSpectrum in Hong Kong and Sydney. With over 15 years of experience in designing and developing Java/Scala/Python-based applications for daily operations, I bring: ● At least 8 years of experience in data analysis, pipeline design
Big Data
Data Engineering
ETL
Employed
Open to opportunities
Full-time / Interested in working remotely
10-15 years
National Taiwan University
EMBA Programs, Business Administration, Accounting, Finance and International Business.
Avatar of 陳柄宏.
Avatar of 陳柄宏.
Staff Cloud Architect Enginner @域動行銷股份有限公司
2023 ~ Present
雲端工程師,雲端架構師,數據架構師
Within one month
及進步的團隊。 [email protected], Taiwan Education 輔仁大學圖書資訊學系,Skills Python 程式寫作、實作爬蟲及資料清理作業。 Database SQL : PostgreSQL, MySQL, MSSQL NoSQL: MongoDB, Redis, DynamoDB, Hadoop Docker 容器化技術 Data Lakehouse Databricks Azure 持有Microsoft Certified: Azure Solutions Architect Expert 及 Data enginner 證照 AWS 持有 AWS Certified Solutions Architect – Professional 證照 工作經歷 Staff Cloud Architect Enginner , 域動行銷股份有限
git
hadoop ecosystem
MongoDB
Employed
Not open to opportunities
Full-time / Interested in working remotely
4-6 years
輔仁大學
圖書資訊
Avatar of the user.
Avatar of the user.
Team Lead / Sr. Data Engineer @新加坡商競舞電競娛樂有限公司 Garena Online Private Ltd
2021 ~ Present
資料工程師
Within one month
Hadoop
Spark
SQL
Employed
Full-time / Interested in working remotely
6-10 years
Avatar of Carter Lin.
Avatar of Carter Lin.
Senior Data Engineer @美光科技
2021 ~ Present
Software Engineer / Backend Engineer / DevOps Engineer
Within six months
CD pipelines from scratch which follow GitOps flow and deploying service to GKE cluster using Helm . Familiar with GCP service , IAM, GCS, Big Query, Cloud Function, Pub/Sub, Cloud Scheduler Data Engineer Micron OctOct 2021 Taichung, Taiwan Developed and maintained ETL processes using Python to transfer data into Hadoop Ecosystem, including HBase and Hive, for efficient data storage and retrieval. Proficient in SQL for data manipulation and query optimization. Collaborated with cross-functional teams to design and implement data pipelines, ensuring data integrity and accuracy. Streamlined data processing workflows, resulting in significant time and resource
Python
Google cloud platform
Helm
Employed
Full-time / Remote Only
4-6 years
National Chiao Tung University
資訊管理學系
Avatar of Aiden Wu.
Avatar of Aiden Wu.
Senior Data Engineer @Garena
2021 ~ Present
Data engineer
Within one year
Aiden Wu Data Engineer / Machine Learning Engineer Taipei, Taiwan • Enthusiastic software developer: focus on distributed systems, especially Hadoop ecosystem • Experience in data engineering: develop batch and real-time data pipelines with an average of TBs per month via Spark and Airflow • Experience in machine learning: develop machine learning (ML) and deep learning (DL) models while providing services on RESTful API https://www.slideshare.net/ssuserf88631/presentations 工作經歷 Senior Data Engineer • Garena 八月Present • Build and manage self-distributed systems (e.g., Hadoop, Spark, and Kafka Cluster) • Design
Python
Spark
Machine Learning
Employed
Full-time / Interested in working remotely
4-6 years
National Cheng Kung University
Department of Electrical Engineering
Avatar of 陳慶全.
Avatar of 陳慶全.
Senior Data Engineer @Microsoft
2021 ~ Present
資料科學家、資料工程師、資料分析師
Within one month
Ching-Chuan Chen 陳慶全 資料科學家、資料工程師、資料分析師 • City, TW • [email protected] Data engineer and data scientist with over four half years of experience. Proven success in processing big volume of data (6TB per day) in Spark in Scala and MPI in R and Python, developing a machine learning model with Spark in Scala on 30 billions of records for IoT device recognition and developing algorithms to classify unlabeled network behaviors of customers to protect their devices from compromising. Skilled in programming
R
Python
C++
Employed
Full-time / Interested in working remotely
4-6 years
National Cheng Kung University,
Statistics
Avatar of the user.
Within one year
Python
Bigdata
Docker
Employed
Full-time / Interested in working remotely
10-15 years
Avatar of the user.
Avatar of the user.
Jr. Programmer @德義資訊股份有限公司
2013 ~ 2015
Developer Team Leader, Architect, FullStack Developer
More than one year
Word
PowerPoint
Excel
Employed
Full-time / Interested in working remotely
6-10 years
National Taiwan University
Bachelor of Bio-Industrial Mechatronics Engineering
Avatar of Mallikarjunareddy Guruguntla.
Big data developer
More than one year
ZOOKEEPER. Summary Excellent understanding /knowledge on HADOOP(Gen-1 and Gen-2) and various components such as HDFS, Job Tracker, Task Tracker, Name Node, Data Node, Resource Manager (YARN), Node Manager and Aplication Master. Expert in understanding the data and designing/implementing the enterprise platforms like Hadoop data lake and huge Data warehouses. Have over 2 years of experience as Hadoop Architect with very good exposure on Hadoop Technologies like HDFS, YARN, MapReduce, Sqoop, Flume, HBase, Hive, Presto, Oozie and Spark. Good understanding of NoSQL databases and hands on working experience in writing applications
hadoop ecosystem
Python
Scala
Full-time / Interested in working remotely
6-10 years
JNTUH
Computer science

The Most Lightweight and Effective Recruiting Plan

Search resumes and take the initiative to contact job applicants for higher recruiting efficiency. The Choice of Hundreds of Companies.

  • Browse all search results
  • Unlimited access to start new conversations
  • Resumes accessible for only paid companies
  • View users’ email address & phone numbers
Search Tips
1
Search a precise keyword combination
senior backend php
If the number of the search result is not enough, you can remove the less important keywords
2
Use quotes to search for an exact phrase
"business development"
3
Use the minus sign to eliminate results containing certain words
UI designer -UX
Only public resumes are available with the free plan.
Upgrade to an advanced plan to view all search results including tens of thousands of resumes exclusive on CakeResume.

Definition of Reputation Credits

Technical Skills
Specialized knowledge and expertise within the profession (e.g. familiar with SEO and use of related tools).
Problem-Solving
Ability to identify, analyze, and prepare solutions to problems.
Adaptability
Ability to navigate unexpected situations; and keep up with shifting priorities, projects, clients, and technology.
Communication
Ability to convey information effectively and is willing to give and receive feedback.
Time Management
Ability to prioritize tasks based on importance; and have them completed within the assigned timeline.
Teamwork
Ability to work cooperatively, communicate effectively, and anticipate each other's demands, resulting in coordinated collective action.
Leadership
Ability to coach, guide, and inspire a team to achieve a shared goal or outcome effectively.
Within three months
Data Engineer
Logo of Tesla.
Tesla
2023 ~ 2023
Santa Clara, 加利福尼亞美國
Professional Background
Current status
Employed
Job Search Progress
Ready to interview
Professions
Data Engineer, Big Data Engineer, Back-end Engineer
Fields of Employment
Software
Work experience
4-6 years
Management
None
Skills
python
Linux
R
Docker
GCP Compute Engine
Hadoop
Spark
Airflow
Celery
Redis
Google Analytics
HBase
MongoDB
MySQL
GIT/CICD
Languages
English
Fluent
Job search preferences
Positions
Data engineer / Data anyayst
Job types
Full-time
Locations
Taipei, 台灣, California, 美國
Remote
Interested in working remotely
Freelance
Yes, I freelance in my spare time
Educations
School
University of Texas at Dallas
Major
Information Technology and Management
Print

Yen-Ting Liu

我具有5年python資料分析,熟悉以Docker搭配nginx, redis部屬api及系統於GCP上。熟悉Airflow程式及報表自動化分析流程,並有Hadoop,Elasticsearch群集管理實務、pyspark數據ETL經驗。我喜歡學習新技術,並追求以更高效率進行資料處理流程。

  Santa Clara, CA, USA          [email protected]

工作經歷

Data Engineer  •  Tesla

• Implemented Airflow ETL pipelines using Docker and integrated various databases including SQL Server, MSSQL, Vertica,
MongoDB, enhancing reliability, accessibility of the pipelines, and improving data processing efficiency by 10x
• Designed a data alert system using Python that monitors hundreds of ETL processes, and provides updates on the latest data status
every minute, and delivers status alerts to communication services or email, resolving stale data and ETL failure issues.
• Developed Python tools such as encrypting and decrypting sensitive data, and real-time operational reporting systems using Kafka
and MongoDB, enabling stakeholders to access up-to-date information for performance monitoring and reporting

一月 2023 - 五月 2023

Data engineer  •  Vpon

Designed and developed ETL pipelines
• Implemented and maintained ETL Pipelines which integrated with Jenkins on cloud services (GCP and AWS), provided by analysts
with reliable data, and saved 50% effort (Groovy, Dataproc, Spark, BigQuery, EMR, Hive, Jenkins)
• Designed ETL pipeline framework and implemented it to cloud service and held user training sessions for engineers and analysts. This project was reduced by 40% maintenance efforts and increased by 60% deployment efficiency by leveraging Airflow and
Kubernetes (Gitlab, Airflow, Python, K8s)
Developed and deployed API to retrieve data in the cloud environment
• Developed an API to retrieve data that included geo-location data from BigQuery and deployed it on the GCP environment.
The API saved 80% of the time on fetching data (Cloud Run, IAM, BigQuery)

十月 2019 - 七月 2021

Data engineer  •  富盈數據

Maintained distributed system and database
• Constructed and managed the Hadoop ecosystem with Ambari. Built ETL pipeline to query multi-source database which
processing more than three terabytes (TB) provided 90% of the analysis needs (Hive, HBase, Python, ELK, MySQL)
• Established data collection and analysis workflow, saving Data scientists’ 30% of the time to analyze and build machine
learning models with collected data (Elasticsearch, PySpark, Airflow) Constructed backend system and API
• Researched webpage user preference and behavior, and modified advertising performance evaluation system to enable
precision marketing, increasing the accuracy by 300% for advertising targeting
• Constructed articles to classify API and embedded a machine learning model (linear regression, random forest, XGBoost) to
categorize the articles. The tool has been implemented as the product and processed 90% of the articles every day (Flask)
• Upgraded an advertising API and deployed it on cloud service (GCP). Increased the total monthly revenue by 33% after
implementing the new API (Python, Docker, Nginx, Celery, Redis, Load-balance system, MySQL, HBase)

二月 2019 - 九月 2019

Research Assistant  •  Academia Sinica 中央研究院

Construct data pipeline and data analysis
• Developed bioinformatics pipeline, which saved 80% effort for non-technical scientists to analyze and visualize genome
sequencing. Published the research paper and the software in the Frontiers journal as the first author (Python, R, Linux)

十月 2017 - 四月 2018

學歷

2021 - 2023

University of Texas at Dallas

Information Technology and Management

2014 - 2016

台灣大學

生物材料

資格認證


AWS Certified Cloud Practitioner

AWS Training and Certification

十一月 2025 到期

Resume
Profile

Yen-Ting Liu

我具有5年python資料分析,熟悉以Docker搭配nginx, redis部屬api及系統於GCP上。熟悉Airflow程式及報表自動化分析流程,並有Hadoop,Elasticsearch群集管理實務、pyspark數據ETL經驗。我喜歡學習新技術,並追求以更高效率進行資料處理流程。

  Santa Clara, CA, USA          [email protected]

工作經歷

Data Engineer  •  Tesla

• Implemented Airflow ETL pipelines using Docker and integrated various databases including SQL Server, MSSQL, Vertica,
MongoDB, enhancing reliability, accessibility of the pipelines, and improving data processing efficiency by 10x
• Designed a data alert system using Python that monitors hundreds of ETL processes, and provides updates on the latest data status
every minute, and delivers status alerts to communication services or email, resolving stale data and ETL failure issues.
• Developed Python tools such as encrypting and decrypting sensitive data, and real-time operational reporting systems using Kafka
and MongoDB, enabling stakeholders to access up-to-date information for performance monitoring and reporting

一月 2023 - 五月 2023

Data engineer  •  Vpon

Designed and developed ETL pipelines
• Implemented and maintained ETL Pipelines which integrated with Jenkins on cloud services (GCP and AWS), provided by analysts
with reliable data, and saved 50% effort (Groovy, Dataproc, Spark, BigQuery, EMR, Hive, Jenkins)
• Designed ETL pipeline framework and implemented it to cloud service and held user training sessions for engineers and analysts. This project was reduced by 40% maintenance efforts and increased by 60% deployment efficiency by leveraging Airflow and
Kubernetes (Gitlab, Airflow, Python, K8s)
Developed and deployed API to retrieve data in the cloud environment
• Developed an API to retrieve data that included geo-location data from BigQuery and deployed it on the GCP environment.
The API saved 80% of the time on fetching data (Cloud Run, IAM, BigQuery)

十月 2019 - 七月 2021

Data engineer  •  富盈數據

Maintained distributed system and database
• Constructed and managed the Hadoop ecosystem with Ambari. Built ETL pipeline to query multi-source database which
processing more than three terabytes (TB) provided 90% of the analysis needs (Hive, HBase, Python, ELK, MySQL)
• Established data collection and analysis workflow, saving Data scientists’ 30% of the time to analyze and build machine
learning models with collected data (Elasticsearch, PySpark, Airflow) Constructed backend system and API
• Researched webpage user preference and behavior, and modified advertising performance evaluation system to enable
precision marketing, increasing the accuracy by 300% for advertising targeting
• Constructed articles to classify API and embedded a machine learning model (linear regression, random forest, XGBoost) to
categorize the articles. The tool has been implemented as the product and processed 90% of the articles every day (Flask)
• Upgraded an advertising API and deployed it on cloud service (GCP). Increased the total monthly revenue by 33% after
implementing the new API (Python, Docker, Nginx, Celery, Redis, Load-balance system, MySQL, HBase)

二月 2019 - 九月 2019

Research Assistant  •  Academia Sinica 中央研究院

Construct data pipeline and data analysis
• Developed bioinformatics pipeline, which saved 80% effort for non-technical scientists to analyze and visualize genome
sequencing. Published the research paper and the software in the Frontiers journal as the first author (Python, R, Linux)

十月 2017 - 四月 2018

學歷

2021 - 2023

University of Texas at Dallas

Information Technology and Management

2014 - 2016

台灣大學

生物材料

資格認證


AWS Certified Cloud Practitioner

AWS Training and Certification

十一月 2025 到期