Jobs
Job Search
Explore all available job openings across industries and locations.
Company Search
Find your dream jobs categorized by company names.
Themed Jobs
Discover job opportunities organized by specific themes or industries.
Download our App
Tools
Resume
Create your job-winning resume using our free resume builder.
Portfolio
Showcase your skills and projects with a professional portfolio.
Resume
Create your job-winning resume using our free resume builder.
Resume Builder
Make a resume for free.
Resume Templates
Access our extensive library of professional & ready-to-use templates.
Resume Examples
Get inspired by real resume examples to create your own.
Occupation Guide
Access resume writing guides tailored for different professions.
Resume Help
Get expert advice on all things resume from our team of recruitment specialists.
Portfolio
Showcase your skills and projects with a professional portfolio.
Portfolio Maker
Create a professional portfolio to highlight your skills and projects.
Portfolio Gallery
Browse through our collection of real portfolios for inspiration and networking.
Resources
Articles
Read insightful articles on career development, job search strategies, and more.
View All Articles
Job Search Guide
Resume & CV
Cover Letter
Portfolio
Interview Skills
Job Search Tips
Industry & Job Overview
Career Guidance
Career Planning
Career Tools
Career Development
Personal Branding
Success Stories
Success Stories
Business Excellence
People Operations
Recruitment & HR
About CakeResume
People & Culture
News & Updates
Events
Featured Reads
Resume & CV
What to Write in an Email When Sending a Resume [+ Examples & Tips]
Read More
Hire
Talent Search
Find Resumes.
Job Posting
Start for Free.
Recruitment Service
Acquire Talent.
Employer of Record (EOR)
Empower Your Business in Taiwan.
Employer Branding
Build and promote your employer brand.
Pricing
Job Posting Plans
Talent Search Plans
Resume Builder Plans
Build your Network
My Network
Access your personal network connections and manage your contacts.
CakeResume Meet
Expand your professional network by meeting and connecting with other users.
Community
Engage with other users through discussions, forums, and networking events.
Download our App

My Network

Access your personal network connections and manage your contacts.

CakeResume Meet

Expand your professional network by meeting and connecting with other users.

Community

Engage with other users through discussions, forums, and networking events.

CakeResume Talent Search

Advanced filters

Ready to interview

Open to opportunities

Not open to opportunities

Taiwan

台灣

New Taipei City, Taiwan

新北市, 台灣

India

Bengaluru, India

California, United States

Israel

Santa Clara County, California, United States

Taichung City, Taiwan

Taipei, Taiwan

Taoyuan City, Taiwan

Taoyuan, Taiwan

United States

台中市, 台灣

Tech

Industry

Less than 1 year

1-2 years

2-4 years

4-6 years

6-10 years

10-15 years

More than 15 years

AI Smart Matching

Fu Jen Catholic University

National Taiwan University

國立台灣大學

國立臺灣大學

輔仁大學

National Cheng Kung University

National Chung Hsing University

National Tsing Hua University

National Yang Ming Chiao Tung University

Sir M Visvesvaraya Institute of Technology

University of Texas at Dallas

ಸರ್ ಎಂ ವಿಶ್ವೇಶ್ವರಯ್ಯ ಟೆಕ್ನಾಲಜಿ ಇನ್ಸ್ಟಿಟ್ಯೂಟ್

國立中興大學

國立成功大學

國立清華大學

國立陽明交通大學

Taiwan

台灣

India

Bengaluru, India

Israel

New Taipei City, Taiwan

Taichung City, Taiwan

Taipei City, Taiwan

台中市, 台灣

台北市, 台灣

新北市, 台灣

Full-time

Python

AWS

MySQL

Scala

Hadoop

Java

SQL

Spark

hadoop ecosystem

Azure

Yes

1-5 people

15+ people

Within one month

Within two months

Within six months

Within one year

More than one year

雲端工程師，雲端架構師，數據架構師

Backend Engineer, Data Engineer, MLOps Engineer

Backend developer/Full-stack developer

Data Engineer

Data Scientist

Data engineer / Data anyayst

Developer Team Leader, Architect, FullStack Developer

Senior Dev or Tech Lead

Senior Software Engineer

Software Engineer / Backend Engineer / DevOps Engineer

Bachelor of Arts (BA)

Bachelor of Engineering (BEng)

Master of Business Administration (MBA)

Master of Science (MS)

Master’s Degree

Bachelor

Master

2023

2018

2016

2014

2011

2007

Current company

Off

Select all

TSMC

信義房屋股份有限公司

域動行銷股份有限公司

安富財經科技

無限方舟科技有限公司

藍科數位科技

銓鍇國際股份有限公司

17 Live

17LIVE

247.ai

Interested in working remotely

Remote Only

Part-time freelancer

Non-freelancer

English - Fluent

Chinese - Native or Bilingual

English - Intermediate

English - Professional

Chinese - Professional

Japanese - Fluent

English

Chinese

4-6 years

6-10 years

10-15 years

More than 15 years

Exclude read results
Show all experiences

Yen-Ting Liu

Data Engineer @Tesla

・

2023 ~ 2023

Data engineer / Data anyayst

Within two months

data that included geo-location data from BigQuery and deployed it on the GCP environment. The API saved 80% of the time on fetching data (Cloud Run, IAM, BigQuery) 十月七月 2021 Data engineer • 富盈數據 Maintained distributed system and database • Constructed and managed the Hadoop ecosystem with Ambari. Built ETL pipeline to query multi-source database which processing more than three terabytes (TB) provided 90% of the analysis needs (Hive, HBase, Python, ELK, MySQL) • Established data collection and analysis workflow, saving Data scientists’ 30% of the time to analyze and build machine learning

python

Linux

Full-time / Interested in working remotely

4-6 years

University of Texas at Dallas

・

Information Technology and Management

Offline

Chin-Hung (Wilson) Liu

Principal Engineer, Data Engineering @KKCompany

・

2023 ~ Present

Backend Engineer, Data Engineer, MLOps Engineer

Within one month

Chin-Hung (Wilson) Liu I am a lead architect responsible for designing and implementing a large-scale data pipeline for Lomotif, Paktor x 17LIVE, utilizing GCP/AWS/Python/Scala, in collaboration with data science and machine learning teams in Singapore and TW HQ, as well as with the Hadoop ecosystem (HDFS/HBase/Kafka) at JSpectrum in Hong Kong and Sydney. With over 15 years of experience in designing and developing Java/Scala/Python-based applications for daily operations, I bring: ● At least 8 years of experience in data analysis, pipeline design

Big Data

Data Engineering

ETL

Full-time / Interested in working remotely

10-15 years

National Taiwan University

・

EMBA Programs, Business Administration, Accounting, Finance and International Business.

陳柄宏

Staff Cloud Architect Enginner @域動行銷股份有限公司

・

2023 ~ Present

雲端工程師，雲端架構師，數據架構師

Within one month

及進步的團隊。 [email protected], Taiwan Education 輔仁大學圖書資訊學系,Skills Python 程式寫作、實作爬蟲及資料清理作業。 Database SQL : PostgreSQL, MySQL, MSSQL NoSQL: MongoDB, Redis, DynamoDB, Hadoop Docker 容器化技術 Data Lakehouse Databricks Azure 持有Microsoft Certified: Azure Solutions Architect Expert 及 Data enginner 證照 AWS 持有 AWS Certified Solutions Architect – Professional 證照工作經歷 Staff Cloud Architect Enginner , 域動行銷股份有限

git

hadoop ecosystem

MongoDB

Full-time / Interested in working remotely

輔仁大學

・

圖書資訊

Available for paid companies

Team Lead / Sr. Data Engineer @新加坡商競舞電競娛樂有限公司 Garena Online Private Ltd

・

2021 ~ Present

資料工程師

Within one month

Hadoop

Spark

SQL

Full-time / Interested in working remotely

6-10 years

Upgrade to View

Carter Lin

Senior Data Engineer @美光科技

・

2021 ~ Present

Software Engineer / Backend Engineer / DevOps Engineer

Within six months

CD pipelines from scratch which follow GitOps flow and deploying service to GKE cluster using Helm . Familiar with GCP service , IAM, GCS, Big Query, Cloud Function, Pub/Sub, Cloud Scheduler Data Engineer Micron OctOct 2021 Taichung, Taiwan Developed and maintained ETL processes using Python to transfer data into Hadoop Ecosystem, including HBase and Hive, for efficient data storage and retrieval. Proficient in SQL for data manipulation and query optimization. Collaborated with cross-functional teams to design and implement data pipelines, ensuring data integrity and accuracy. Streamlined data processing workflows, resulting in significant time and resource

Python

Google cloud platform

Helm

Full-time / Remote Only

4-6 years

National Chiao Tung University

・

資訊管理學系

Aiden Wu

Senior Data Engineer @Garena

・

2021 ~ Present

Data engineer

Within one year

Aiden Wu Data Engineer / Machine Learning Engineer Taipei, Taiwan • Enthusiastic software developer: focus on distributed systems, especially Hadoop ecosystem • Experience in data engineering: develop batch and real-time data pipelines with an average of TBs per month via Spark and Airflow • Experience in machine learning: develop machine learning (ML) and deep learning (DL) models while providing services on RESTful API https://www.slideshare.net/ssuserf88631/presentations 工作經歷 Senior Data Engineer • Garena 八月Present • Build and manage self-distributed systems (e.g., Hadoop, Spark, and Kafka Cluster) • Design

Python

Spark

Machine Learning

Full-time / Interested in working remotely

4-6 years

National Cheng Kung University

・

Department of Electrical Engineering

陳慶全

Senior Data Engineer @Microsoft

・

2021 ~ Present

資料科學家、資料工程師、資料分析師

Within one month

Ching-Chuan Chen 陳慶全資料科學家、資料工程師、資料分析師 • City, TW • [email protected] Data engineer and data scientist with over four half years of experience. Proven success in processing big volume of data (6TB per day) in Spark in Scala and MPI in R and Python, developing a machine learning model with Spark in Scala on 30 billions of records for IoT device recognition and developing algorithms to classify unlabeled network behaviors of customers to protect their devices from compromising. Skilled in programming

Python

C++

Full-time / Interested in working remotely

4-6 years

National Cheng Kung University,

・

Statistics

Available for paid companies

Within one year

Python

Bigdata

Docker

Full-time / Interested in working remotely

10-15 years

Upgrade to View

Available for paid companies

Jr. Programmer @德義資訊股份有限公司

・

2013 ~ 2015

Developer Team Leader, Architect, FullStack Developer

More than one year

Word

PowerPoint

Excel

Full-time / Interested in working remotely

6-10 years

National Taiwan University

・

Bachelor of Bio-Industrial Mechatronics Engineering

Upgrade to View

Mallikarjunareddy Guruguntla

Big data developer

More than one year

ZOOKEEPER. Summary Excellent understanding /knowledge on HADOOP(Gen-1 and Gen-2) and various components such as HDFS, Job Tracker, Task Tracker, Name Node, Data Node, Resource Manager (YARN), Node Manager and Aplication Master. Expert in understanding the data and designing/implementing the enterprise platforms like Hadoop data lake and huge Data warehouses. Have over 2 years of experience as Hadoop Architect with very good exposure on Hadoop Technologies like HDFS, YARN, MapReduce, Sqoop, Flume, HBase, Hive, Presto, Oozie and Spark. Good understanding of NoSQL databases and hands on working experience in writing applications

hadoop ecosystem

Python

Scala

Full-time / Interested in working remotely

6-10 years

JNTUH

・

Computer science

The Most Lightweight and Effective Recruiting Plan

Search resumes and take the initiative to contact job applicants for higher recruiting efficiency. The Choice of Hundreds of Companies.

Browse all search results
Unlimited access to start new conversations
Resumes accessible for only paid companies
View users’ email address & phone numbers

Upgrade Now

7-day money-back guarantee, cancel anytime

1 2

Search Tips

Search a precise keyword combination

senior backend php

If the number of the search result is not enough, you can remove the less important keywords

Use quotes to search for an exact phrase

"business development"

Use the minus sign to eliminate results containing certain words

UI designer -UX

Only public resumes are available with the free plan.

Upgrade to an advanced plan to view all search results including tens of thousands of resumes exclusive on CakeResume.

Upgrade Now

Definition of Reputation Credits

Technical Skills

Specialized knowledge and expertise within the profession (e.g. familiar with SEO and use of related tools).

Problem-Solving

Ability to identify, analyze, and prepare solutions to problems.

Adaptability

Ability to navigate unexpected situations; and keep up with shifting priorities, projects, clients, and technology.

Communication

Ability to convey information effectively and is willing to give and receive feedback.

Time Management

Ability to prioritize tasks based on importance; and have them completed within the assigned timeline.

Teamwork

Ability to work cooperatively, communicate effectively, and anticipate each other's demands, resulting in coordinated collective action.

Leadership

Ability to coach, guide, and inspire a team to achieve a shared goal or outcome effectively.

Within one month

Chin-Hung (Wilson) Liu

Senior Data Engineer at Paktor x M17 Entertainment Group | AWS x GCP x Azure Big Data Specialist | Data Architect

KKCompany

・

2023 ~ Present

Taiwan

Professional Background

Current status

Employed

Job Search Progress

Open to opportunities

Professions

Data Engineer, Back-end Engineer

Fields of Employment

Software

Work experience

10-15 years

Management

I've had experience in managing 1-5 people

Skills

Big Data

Data Engineering

ETL

AWS

GCP

Python

BigQuery

Data Warehouse

Data Pipeline

Java

Azure

SQL

Spark

kafka

spark streaming

Scala

Redshift

HBase

SQL Server

AWS S3

AWS Lambda

MongoDB

Hadoop

Hadoop Distributed File System

AWS SQS

Azure Storage

MySQL

PostgreSQL

Postman for API

Snowflake

Languages

English

・

Professional

Chinese

・

Native or Bilingual

Job search preferences

Positions

Backend Engineer, Data Engineer, MLOps Engineer

Job types

Full-time

Locations

Taiwan, 台灣, Singapore, Hong Kong

Remote

Interested in working remotely

Freelance

Educations

School

National Taiwan University

Major

EMBA Programs, Business Administration, Accounting, Finance and International Business.

Chin-Hung (Wilson) Liu

I am a lead architect responsible for designing and implementing a large-scale data pipeline for Lomotif, Paktor x 17LIVE, utilizing GCP/AWS/Python/Scala, in collaboration with data science and machine learning teams in Singapore and TW HQ, as well as with the Hadoop ecosystem (HDFS/HBase/Kafka) at JSpectrum in Hong Kong and Sydney.

With over 15 years of experience in designing and developing Java/Scala/Python-based applications for daily operations, I bring:

● At least 8 years of experience in data analysis, pipeline design and development, and tool building as a team member.

● In-depth knowledge of the Spark and Hadoop ecosystems, including Hadoop, HDFS, HBase, and more.

● Strong skills in designing and developing Big Data services on AWS and GCP.

● Extensive expertise in developing generic distributed systems, streaming processing, machine learning pipelines, and continuously improving ML models.

Senior Data Engineer at Paktor x 17LIVE| AWS Big Data Specialist | Data Architect
Singapore / Hong Kong / Taiwan

[email protected]

https://www.linkedin.com/in/chin-hung-wilson-liu-29392957

Nanxing Rd., Xizhi Dist., New Taipei City, Taiwan (R.O.C.)

Experience

Senior Data Engineer (DataOps / AI) / Lomotif Private Limited / Singapore

Jul. 2021 - Present.

Description and Responsibilities: Lomotif is a leading short video social platform in South America and India that holds PBs of videos in buckets and serves millions of users. DataOps and AI team take part in many challenging projects e.g. Ncanto, XROAD services, Ray Serve, and scalable model serving frameworks for support the recommendation and moderation pipeline, also integrated Universal Music Group music (UMG) and full catalog feed with 7digital. DataOps team handling 10TB+ data for day-to-day operation, moderating model training results, and designing SLIs/SLOs for EKS Clusters. More responsibilities/details as below.

Optimize music (UMG) pipeline with queries and memories for Elasticsearch and PostgreSQL, the pipeline saving 90% execution time from 10+ hours to 40 mins.
Migrate service from apache spark, AWS Data Lake Formation to AWS MWAA, EKS airflow environment.
Design, and deliver distributed system for Ray Serve with AI team.
Design, and implement a modern machine learning pipeline for a recommendation, and moderation pipe.
Design SLA and implement alert log reporting system (history logs) for moderation pipeline, histories logs handling application, server levels information for further investigation.
Supporting other departments to gather data in the appropriate platforms.

Tech Stacks :

Streaming, Snowpipe/Kinesis/Firehose
Monitoring, CloudWatch/Grafana
Orchestration, AWS MWAA / Airflow
Kubernetes, EKS
Message, SQS/SNS
MLflow, Ray Serve/EMR/Lambda
Storage, Snowflake / RDS (PostgreSQL) / ElastiCache (Redis) / Elastic search
Bucket, AWS S3

Reports to : VP of Data Engineering

Senior Data Engineer / Handshakes by DC Frontiers / Singapore

Oct. 2020 - May. 2021.

Description and Responsibilities: The main responsibility of the engineering team is launching ScoutAsia by Nikkei and The Financial Times Nikkei content to SGX TitanOTC's platform. Titan Users will be able to access Nikkei news articles from across 11 categories, including equities, stocks, indices, foreign exchange, and iron ore. DPP (Data team) is processing hundreds of GB articles/market/financial/relationships and organization for day-to-day operation on Azure and on-premise environments. More responsibilities/details as below.

Identifying, digging bottlenecks, and problem-solving especially optimizing the performance of SQL Server, NoSQL (Azure Cosmos), resource units, and message queues, reducing/saving almost 50-75% of resources.
Identifying and solving the problems between machine learning/backend/frontend/DDP side and giving the advance logical/physical design of a system. Displayed technical expertise in optimizing the databases and improving the data pipeline to achieve the objective.
Bring in industry standards to data management to deliver data at the end objective.
Building, and recruiting the new data engineering staff for the next-generation, enterprise data pipeline.

Tech Stacks :

Storage, Azure Cosmos DB/Gremlin/SQL Server/MYSQL/Redis
Storage (Bucket), Azure Blob/AWS S3
Streaming/Batch/transform, Spark/Scala (90% codebase coverage)
Message, Azure service bus, queue storage
Search, Elastic search
Algorithm, graph/concordance

Reports to : CTO

Senior Data Engineer / 17LIVE Inc. / Taiwan, Taipei.

Feb. 2020 - Jul. 2020

Description and Responsibilities: The big challenge of 17 Media data teams is facing fast-growing data volume (processing 5-10x TB level daily), complex cooperation with stakeholders, the cost optimization of the pipeline, and refactoring big latency systems .etc. As a senior data member, I’m making a data dictionary and trying to explain/design how the whole pipeline works with each component, especially how to solve those bottlenecks. More responsibilities/details as below.

Leading, and architect large-scale data pipeline for supporting scientists and shareholders.
Optimize, ensure quality and play a tough role in data lake projects/data pipes. infrastructure.
Define, and design stage, dimension, production, and fact tables for data warehouse (BigQuery).
Coordinate with client / QA / backend team for QC lists / MongoDB change stream workers.
Architect workflows with those components, Dataflow, Cloud Functions, and GCS.
Recruiting (Jr./Sr.) data engineering members, setting goals, and sprint management.

Tech Stacks :

Storage, GCS/BigQuery/Firebase/MongoDB/MYSQL
Realtime process and Message system, DataFlow (Apache Beam) / BigQuery Streaming / MongoDB Change Stream / Fluentd / Firebase / Pub/Sub
ETL/ELT workflow, Digdag / Embulk
Data warehouse, Visualization, BigQuery / Superset / Chartio / Data Studio
Continuous deployment, docker, CricleCI

Reports to : Data Head

Data Engineer / Paktor Pte. Ltd. / Singapore

Sep. 2015 - Dec. 2019.

Description and Responsibilities : This is another 0 to 1 story. As an early data member, we need to figure out the data driven policy, strategies, engineering requirements from the company. In Paktor, data / backend sides are 100% on AWS, therefore the whole data ingestion, automation and data warehouse etc. are relying on those components. We are processing 50-100x GB realtime / batch jobs and the other data sources (RDBMS, APIs) for ETL/ELT on S3, Redshift, the data platform helps our marketing / HQ scientists team getting data into insights and making good decisions. More responsibilities / details as below.

Supports Big Data and batch, real-time analytical solutions leveraging transformational technologies.
Optimize data pipeline on AWS using Kinesis-Firehose/Lambda/Kinesis Analytics/Data Pipeline, and optimize, resizing Redshift clusters and related scripts.
Translates complex analytics requirements into detailed architecture, design, and high performing software such as machine-learning, CI/CD of recommendation pipeline.
Collaborate with client / backend side developers to formulate innovative solutions to experiment and implement related algorithms.

Tech Stacks :

Storage, S3/Redshift/Aurora - Realtime process and Message system, Kinesis Firehose / SNS
Data warehouse, Visualization, Redshift / Klipfolio / Metabase
ETL/ELT workflow, Lambda / SNS / Batch / Python
Recommendation, ML, DynamoDB / EMR / Spark / Sagemaker
Metadata management, Athena (presto) / Glue / Redshift Spectrum
Continuous deployment, Elasticbeanstalk / Cloudformation
Operations, PagerDuty / Zapier / Cloud Watch

Reports to : CTO, Data Head

System Analyst (Data Backend Engineer) / JSpectrum Software Limited / Hong Kong

Jan. 2014 - Aug 2015.

Description and Responsibilities : JSPectrum is a leading passive location-based service company in Hong Kong which holds many interesting products such as NetProbe, NetWhere, NetAd etc. In Optus (The main project in Sydney), the main responsibility of system analyst is designing / implementing data ingestion (real-time processing) / load and management data with major components of the Hadoop ecosystem. We meet the challenge to process 15,000 TPS, 60,000 inserts per second and 300 GB daily storages, therefore we are trying to optimize those components with Kafka consumers, HDFS storages and re-designing keys / columns of HBase to fulfill the requirement and deployed NetAd, whole in-house solutions on Optus. More responsibilities / details as below.

Design, implement and optimize Hadoop ecosystems, MLP, real-time processing on Optus in house servers with our main product NetAd, NetWhere. We are focusing on HBase schema, HDFS, balancing Kafka consumers and more issues on data ingestion.
Collaborate with shareholders and LBS team members for further requirements with HeapMap.

Tech Stacks :

Storage, HDFS / HBase
Realtime process and Message system, Kafka streaming, Log systems
Data warehouse, Visualization, HBase / NetWhere (Dashboard)
Hadoop ecosystem, Hadoop / HDFS / Zookeeper / Spark / Hive
ETL/ELT workflow, Spark / Hive / Scala / Java

Reports to : CTO

Senior Software Engineer / Toro Development Ltd. / Taiwan, Taipei.

Oct. 2012 - Dec. 2013.

Description and Responsibilities : TORO is a technology business that provides a mobile platform and its associated systems, services and rules to help Brands (with initial focus on Sports Teams, Smart Cities and Streaming apps) become super-apps to generate additional revenue with minimum effort. Responsibilities as below.

Design, implement and test back-office modules for NFC wallet platform, Trusted Service Managers (TSM) and distributed NFC services to end users / stakeholders.
Implement RESTful services and deliver endpoints for wallet managers and collaborating with frontend, backend teams for further business requirements.

Tech Stacks: MYSQL / Spring / Hibernate / XML / Apache Camel / Java / POJO .etc.

Reports to : Head of Server Solutions

Software Engineer / Digital River / Taiwan, Taipei.

Oct. 2011 - Sep. 2012.

Description and Responsibilities : Digital river proactive partners, providing API-based Payments & Risk, Order Management and Commerce services to leading enterprise brands. The big challenge to DR is integrating with the current module and working well with a huge code base (over 2+ millions lines), the strict process including analysis requirements, design, implement, test and code review. More responsibilities as below.

Design, implement custom bundle project, bundle customized by shoppers to pick products of groups and get special discounts, the main stakeholders /users from Logitech, Microsoft.
Analysis, collect business requirements, identify use cases and collaborate with business analysts and deliver related diagrams, documents.

Tech Stacks: Oracle / Tomcat / Spring / Struts / JDO / XML / JUnit / Java / J2EE .etc.

Reports to : Technical Development Manager

Technical Supervisor / Stark Technology Inc. / Taiwan, Taipei.

Oct. 2008 - Sep. 2011.

Description and Responsibilities : Stark Technology (STI) is the largest domestic system integrator in Taiwan. We plan and deliver complete ICT solutions for a wide spectrum of industries through representing and reselling the world's leading products. This is made possible by using the most advanced technology, and providing the best professional services. More responsibilities / projects as below.

Lead, coach JR. programmers for the development process of enterprise modules, and design Fatwire CMS components as Template/Page/Cache .etc.
Design, analyze DMDB systems, and implement functions to meet the requirements of queries / storage. Optimize performance for online servers and GC tuning.

Tech Stacks : Oracle / Sybase / Tomcat / Weblogic / Spring / Struts / Hibernate / Fatwire / Java / J2EE .etc.

Reports to : Technical Manager

Relevant Skills and Qualifications

Big Data Tech Stacks

AWS Services, EC2/S3/Lambda/EMR/CloudWatch/SNS/SQS/Elastic Beanstalk
AWS Big Data Solutions, Kinesis/Firehose/Athena/Redshift/Dynamodb
GCP Big Data Solutions, BigQuery/PubSub/Dataflow/Cloud Functions
Hadoop ecosystem, Hadoop/HDFS/Zookeeper/Hbase/Hive
Spark Streaming/Apache Kafka
CI/CD: Jenkins/Cloud Formation/GitLab/Grafana

Specific Skills

Solid, well-designed real-time streaming/batch processing, ETL systems.
Monitors and conducts data-pipeline / machine learning pipeline development requests through lifecycle management and ensures that the technical solution meets.
Diagnosing and troubleshooting Redshift and specific clusters management.
Development of micro-services and endpoints based on enterprise integration patterns. Knowledge over garbage collection (JVM) tuning technologies for various servers.
Developed multi-threading processing consuming work and managed transactions.

Certifications and Training

Sun Certified Web Component Developer Java 2 Platform, Enterprise Edition.
Sun Certified Programmer for the Java 2 Platform.
Red Hat Enterprise Directory Services and Authentication Attended.
Project Management Professional (PMP)® Attended.
AWS Certified Solutions Architect Attended.
Big Data on AWS Attended.
Azure Data Engineer AssociateAttended.

Education

National Taiwan University, 2010 – 2011

EMBA Programs, Business Administration, Accounting, Finance and International Business.

Chinese Culture University Master of Information Management, 2002 – 2005

Computer Science, Data Mining, Expert Systems and Knowledge Base as major concentration.

Chinese Culture University, Bachelor Degree of Science in Journalism, 1998 - 2002

Resume

Profile

Chin-Hung (Wilson) Liu

With over 15 years of experience in designing and developing Java/Scala/Python-based applications for daily operations, I bring:

● At least 8 years of experience in data analysis, pipeline design and development, and tool building as a team member.

● In-depth knowledge of the Spark and Hadoop ecosystems, including Hadoop, HDFS, HBase, and more.

● Strong skills in designing and developing Big Data services on AWS and GCP.

● Extensive expertise in developing generic distributed systems, streaming processing, machine learning pipelines, and continuously improving ML models.

Senior Data Engineer at Paktor x 17LIVE| AWS Big Data Specialist | Data Architect
Singapore / Hong Kong / Taiwan

[email protected]

https://www.linkedin.com/in/chin-hung-wilson-liu-29392957

Nanxing Rd., Xizhi Dist., New Taipei City, Taiwan (R.O.C.)

Experience

Senior Data Engineer (DataOps / AI) / Lomotif Private Limited / Singapore

Jul. 2021 - Present.

Optimize music (UMG) pipeline with queries and memories for Elasticsearch and PostgreSQL, the pipeline saving 90% execution time from 10+ hours to 40 mins.
Migrate service from apache spark, AWS Data Lake Formation to AWS MWAA, EKS airflow environment.
Design, and deliver distributed system for Ray Serve with AI team.
Design, and implement a modern machine learning pipeline for a recommendation, and moderation pipe.
Design SLA and implement alert log reporting system (history logs) for moderation pipeline, histories logs handling application, server levels information for further investigation.
Supporting other departments to gather data in the appropriate platforms.

Tech Stacks :

Streaming, Snowpipe/Kinesis/Firehose
Monitoring, CloudWatch/Grafana
Orchestration, AWS MWAA / Airflow
Kubernetes, EKS
Message, SQS/SNS
MLflow, Ray Serve/EMR/Lambda
Storage, Snowflake / RDS (PostgreSQL) / ElastiCache (Redis) / Elastic search
Bucket, AWS S3

Reports to : VP of Data Engineering

Senior Data Engineer / Handshakes by DC Frontiers / Singapore

Oct. 2020 - May. 2021.

Identifying, digging bottlenecks, and problem-solving especially optimizing the performance of SQL Server, NoSQL (Azure Cosmos), resource units, and message queues, reducing/saving almost 50-75% of resources.
Identifying and solving the problems between machine learning/backend/frontend/DDP side and giving the advance logical/physical design of a system. Displayed technical expertise in optimizing the databases and improving the data pipeline to achieve the objective.
Bring in industry standards to data management to deliver data at the end objective.
Building, and recruiting the new data engineering staff for the next-generation, enterprise data pipeline.

Tech Stacks :

Storage, Azure Cosmos DB/Gremlin/SQL Server/MYSQL/Redis
Storage (Bucket), Azure Blob/AWS S3
Streaming/Batch/transform, Spark/Scala (90% codebase coverage)
Message, Azure service bus, queue storage
Search, Elastic search
Algorithm, graph/concordance

Reports to : CTO

Senior Data Engineer / 17LIVE Inc. / Taiwan, Taipei.

Feb. 2020 - Jul. 2020

Leading, and architect large-scale data pipeline for supporting scientists and shareholders.
Optimize, ensure quality and play a tough role in data lake projects/data pipes. infrastructure.
Define, and design stage, dimension, production, and fact tables for data warehouse (BigQuery).
Coordinate with client / QA / backend team for QC lists / MongoDB change stream workers.
Architect workflows with those components, Dataflow, Cloud Functions, and GCS.
Recruiting (Jr./Sr.) data engineering members, setting goals, and sprint management.

Tech Stacks :

Storage, GCS/BigQuery/Firebase/MongoDB/MYSQL
Realtime process and Message system, DataFlow (Apache Beam) / BigQuery Streaming / MongoDB Change Stream / Fluentd / Firebase / Pub/Sub
ETL/ELT workflow, Digdag / Embulk
Data warehouse, Visualization, BigQuery / Superset / Chartio / Data Studio
Continuous deployment, docker, CricleCI

Reports to : Data Head

Data Engineer / Paktor Pte. Ltd. / Singapore

Sep. 2015 - Dec. 2019.

Supports Big Data and batch, real-time analytical solutions leveraging transformational technologies.
Optimize data pipeline on AWS using Kinesis-Firehose/Lambda/Kinesis Analytics/Data Pipeline, and optimize, resizing Redshift clusters and related scripts.
Translates complex analytics requirements into detailed architecture, design, and high performing software such as machine-learning, CI/CD of recommendation pipeline.
Collaborate with client / backend side developers to formulate innovative solutions to experiment and implement related algorithms.

Tech Stacks :

Storage, S3/Redshift/Aurora - Realtime process and Message system, Kinesis Firehose / SNS
Data warehouse, Visualization, Redshift / Klipfolio / Metabase
ETL/ELT workflow, Lambda / SNS / Batch / Python
Recommendation, ML, DynamoDB / EMR / Spark / Sagemaker
Metadata management, Athena (presto) / Glue / Redshift Spectrum
Continuous deployment, Elasticbeanstalk / Cloudformation
Operations, PagerDuty / Zapier / Cloud Watch

Reports to : CTO, Data Head

System Analyst (Data Backend Engineer) / JSpectrum Software Limited / Hong Kong

Jan. 2014 - Aug 2015.

Design, implement and optimize Hadoop ecosystems, MLP, real-time processing on Optus in house servers with our main product NetAd, NetWhere. We are focusing on HBase schema, HDFS, balancing Kafka consumers and more issues on data ingestion.
Collaborate with shareholders and LBS team members for further requirements with HeapMap.

Tech Stacks :

Storage, HDFS / HBase
Realtime process and Message system, Kafka streaming, Log systems
Data warehouse, Visualization, HBase / NetWhere (Dashboard)
Hadoop ecosystem, Hadoop / HDFS / Zookeeper / Spark / Hive
ETL/ELT workflow, Spark / Hive / Scala / Java

Reports to : CTO

Senior Software Engineer / Toro Development Ltd. / Taiwan, Taipei.

Oct. 2012 - Dec. 2013.

Design, implement and test back-office modules for NFC wallet platform, Trusted Service Managers (TSM) and distributed NFC services to end users / stakeholders.
Implement RESTful services and deliver endpoints for wallet managers and collaborating with frontend, backend teams for further business requirements.

Tech Stacks: MYSQL / Spring / Hibernate / XML / Apache Camel / Java / POJO .etc.

Reports to : Head of Server Solutions

Software Engineer / Digital River / Taiwan, Taipei.

Oct. 2011 - Sep. 2012.

Design, implement custom bundle project, bundle customized by shoppers to pick products of groups and get special discounts, the main stakeholders /users from Logitech, Microsoft.
Analysis, collect business requirements, identify use cases and collaborate with business analysts and deliver related diagrams, documents.

Tech Stacks: Oracle / Tomcat / Spring / Struts / JDO / XML / JUnit / Java / J2EE .etc.

Reports to : Technical Development Manager

Technical Supervisor / Stark Technology Inc. / Taiwan, Taipei.

Oct. 2008 - Sep. 2011.

Lead, coach JR. programmers for the development process of enterprise modules, and design Fatwire CMS components as Template/Page/Cache .etc.
Design, analyze DMDB systems, and implement functions to meet the requirements of queries / storage. Optimize performance for online servers and GC tuning.

Tech Stacks : Oracle / Sybase / Tomcat / Weblogic / Spring / Struts / Hibernate / Fatwire / Java / J2EE .etc.

Reports to : Technical Manager

Relevant Skills and Qualifications

Big Data Tech Stacks

AWS Services, EC2/S3/Lambda/EMR/CloudWatch/SNS/SQS/Elastic Beanstalk
AWS Big Data Solutions, Kinesis/Firehose/Athena/Redshift/Dynamodb
GCP Big Data Solutions, BigQuery/PubSub/Dataflow/Cloud Functions
Hadoop ecosystem, Hadoop/HDFS/Zookeeper/Hbase/Hive
Spark Streaming/Apache Kafka
CI/CD: Jenkins/Cloud Formation/GitLab/Grafana

Specific Skills

Solid, well-designed real-time streaming/batch processing, ETL systems.
Monitors and conducts data-pipeline / machine learning pipeline development requests through lifecycle management and ensures that the technical solution meets.
Diagnosing and troubleshooting Redshift and specific clusters management.
Development of micro-services and endpoints based on enterprise integration patterns. Knowledge over garbage collection (JVM) tuning technologies for various servers.
Developed multi-threading processing consuming work and managed transactions.

Certifications and Training

Sun Certified Web Component Developer Java 2 Platform, Enterprise Edition.
Sun Certified Programmer for the Java 2 Platform.
Red Hat Enterprise Directory Services and Authentication Attended.
Project Management Professional (PMP)® Attended.
AWS Certified Solutions Architect Attended.
Big Data on AWS Attended.
Azure Data Engineer AssociateAttended.

Education

National Taiwan University, 2010 – 2011

EMBA Programs, Business Administration, Accounting, Finance and International Business.

Chinese Culture University Master of Information Management, 2002 – 2005

Computer Science, Data Mining, Expert Systems and Knowledge Base as major concentration.

Chinese Culture University, Bachelor Degree of Science in Journalism, 1998 - 2002