Charles Hsu

9 years experience in SRE/DevOps, 4 years experience in Python and DevOps and 2 year experience in Go Language.

  Taipei, Taiwan      

工作經歷

SRE Lead  •  街口金融科技股份有限公司

Build SRE Team from 0 to 1
- Mentor Backend SRE Tech from 0 to 1
- Allocate project resource
- Set Goal of each team of member

Infrastructure as code
- Implement Terraform, manage AWS, GCP, vSphere
- Implement Ansible, manage service, service configuration, Linux permission, Database permission

Cloud Architecture
- Design AWS Architecture, include IAM, VPC, EKS, RDS, Elasticache, Route53, S3, Cloudfront, ACM ...
- Design GCP Architecture, include IAM, Shared VPC, GKE, GCS, GAR ...

Refine Release Flow
- Write Gitlab migration tools, migrate Gitlab from version 10.x to 16.x
- Build requirment CI/CD component service
- Integrate Gitlab-Runner to GKE
- Integrate Jenkins Agent to GKE and Docker Agent
- Design Release flow for multiple environment and programming lanuage
- Integrate Vault to Jenkins Pipeline and Service Deployment
- Migrate old project to new architecture and deploy flow

Automation
- Write Automation Utils using Golang Cobra, include DNS Management, Monitoring Management, Ansible Operation, Configuration Management, Kibana Dashbaord, Vault Resource Management, Backup/Restore Utils (Crypt Backup using AES256+GCM and Vault transit) ...

Security
- Refine VPN architecture
- Implement Vault
- Design and build Dynamic Database Account/Password Platform , write API base on Golang Gin, integrate Vault, Consul, Superset, Frontend Team's audit system

Monitoring
- Design monitoring architecture
- Prometheus, Alertmanager and Consul as metrics monitoring
- Elastalert2 as log monitoring
- Gatus as Service Status Monitoring
- Opsgenie as Alert Center

Hardware
- IDC Firewall HA

十二月 2021 - 七月 2023

SRE  •  Smartclouds

- Implement Ansible
- Implement Terraform
// Define Platform Infrastructure
- Build Infrastructure on GCP ( Using GKE, HTTP Load Balancing, Compute Instance Group, Pub/Sub, GCS, GCR, Cloud SQL, Memory Cache(Redis) )
- Cloudflare as CDN, DDoS. WAF
- Gitlab as version control
- Jenkins as CI/CD service
- Consul as application environment config
- EFK as log system
- Implement AWS Workspace ( For Production Security )
- Build Pre-Production Environment on AWS
- Reference diagram on GCP: https://drive.google.com/file/d/1pz8zvorbL0BzBxEECXbhzpw5yC2X6wYj/view?usp=sharing
- Reference diagram on AWS: https://drive.google.com/file/d/13WwZtI7XTYWM4dr4lPQg19DaVG0SmB7R/view?usp=sharing

一月 2021 - 十一月 2021

Lead DevOps Engineer  •  Nexiosoft

// Infrastructure
- Improve Ansible (For better Infrastructure as code)
- Implement Terraform
- Refine platform structure

// CI/CD
- Upgrade Gitlab version (Write Custom Python Tools from 8.x > 13.x)
- Implement Jenkins declarative pipeline
- Implement monolithic repository concept

// Kubernetes
- Implement Kubernetes on bare metal
- Migrate application to Kubernetes
- Improve CD flow for Kubernetes
- Refine monitor system for Kubernetes and other service
- Implement log system for Kubernetes
- Implement Kustomize and HELM
- Guide Backend how to write application suitable for Kubernetes

// Design and implement an system for multi-tenant
- Write RESTful API for Backend (Base on Golang Gin, MongoDB, ETCd, Consul)
- Implement CoreDNS and ETCd (For Internal DNS, Backend application)
- Implement Consul cluster and consul-template (For configuration, Monitor, CI/CD)
- Reference diagram : https://drive.google.com/file/d/1i9b8PEXBM0HIexR6DgfzniQAeh5MeUQv/view?usp=sharing

// Automation
- Slackbot for automation (Base on Golang, retired)

十二月 2019 - 十二月 2020

SRE  •  HandsUP

- Help 17media Disaster Recovery Project (Mongo Cluster, Backend Application)
- Improve Mongo Cluster Structure (For 17media, Ongoing)
- Improve Elasticsearch Cluster Structure (For 17media User search and HandsUP Log , Ongoing)
- Implement Kustomize(For improve CD flow, Ongoing)
- Improve Ansible Structure
- Migrate Gitlab runner to Github and CircleCI, Improve HandsUP CI/CD performance

六月 2019 - 十一月 2019

Technical Assistant Manager  •  Astrocorp

- Implement Configuration Management Tools (Ansible)
- Implement CI/CD Flow (Ansible + Gitlab + Jenkins)
- Improve Elasticsearch, Logstash, Kafka data flow
- Implement Prometheus and Grafana for monitor
- Implement Cloud Service Concept
- Guide team member for better technology

九月 2018 - 四月 2019

DevOps  •  CommonWealth

- Implement Ansible
- Implement Jenkins
- Increase system structure for better performance
- Implement Docker for local develop environment
- Write automation tools for local develop environment, ansible, GCP
- Increase Zabbix monitor System
- Implement Prometheus for monitor
- Implement Log System ( Elasticsearch, Kibana, Logstash, Filebeat )
- Planing Data Flow for CommonWealth group (Ongoing case)
- Kubernetes for future deploy method (Ongoing case)
- Mentor of system admin team and develop team (Ongoing case)
- Implement Bitrix24 for our Project management (Ongoing case)
- Lead IT Team Migrate service from HiCloud to GCP (Ongoing case)

十二月 2017 - 六月 2018

DevOps  •  RAPAQ

- Implement Ansible for deploy service package, configuration and operation
- Implement Gitlab and Jenkins for CI/CD
- Build our service on GCP, with Compute Engine, Cloud Storage, App Engine, HTTP(S) load balancing, Data Flow (Stackdriver Logging And BigQuery), Cloud SQL
- Automation tool develop, base on Python and Go, for Ansible, Gitlab, Jenkins configuration, operation, deploy
- Web Test, base on Python selenium, for E2E test our Web Site
- Zabbix, for monitor service status, performance, and alert
- Convert PHP RESTful API to Go, ongoing case, for better performance, micro service, and improve CI/CD flow

二月 2017 - 十一月 2017

Sr. Linux Administrator  •  Garena

- Migration and improving Garena Taiwan core service, for better performance, operation, monitor
- AWS management, for game service deploy
- Implement Saltstack, for configuration management, deploy, operation
- Implement Logstash, Redis, Elasticsearch, Kibana (ELK) for programmer debug and analyze log
- Implement Gitlab and Jenkins for CI/CD
- Build Icinga, Zabbix, for monitor service status , alert, performance tuning
- Write Automation tool to management Saltstack, network security test, API for system management, base on Python Flask

五月 2013 - 七月 2016

學歷

2004 - 2008

Shih Chien University

Department of Information Technology and Management

Tech Skill


Golang, Python, Docker, Kubernetes, AWS, GCP