Avatar of Alex Lee.
Alex Lee
IT infrastructure
ProfileResume
Posts
18Connections
Print
Avatar of the user.

Alex Lee

IT infrastructure
I have a half year of Site Reliability Engineering (SRE) experience and MIS-Engineering experience for about 2 years, and now work as an IT Infrastructure Engineer, with responsibilities including Kubernetes(GKE, EKS) maintenance and deployment, CI/CD(ArgoCD, Gitlab), and tasks related to Infrastructure as Code (Terraform). In addition, I am also responsible for developing Telegram chatbot project(Python-telegram-bot). I love communicating with others and giving them some comments and feedbacks and solving problems together. I am willing to learn new things, maintain a good relationship with others, make the company run stable.
暫時不提供
Logo of the organization.
東海大學 Tunghai University
台中市, 台灣

Professional Background

  • Current status
    Employed
    Open to opportunities
  • Profession
    DevOps / System Admin
    System Architecture
    System, Network Administrator
  • Fields
    Computer Networking
    Big Data
    Software
  • Work experience
    4-6 years (4-6 years relevant)
  • Management
    None
  • Skills
    Python
    Linux
    Grafana
    Prometheus
    AWS
    GCP
    Kubernetes
    Telegram bot
    DevOps / CI / CD
    IaC: Terraform
    Docker
    ArgoCD
    Rancher
  • Languages
    English
    Intermediate
  • Highest level of education
    Bachelor

Job search preferences

  • Desired job type
    Full-time
    Interested in working remotely
  • Desired positions
    IT Infrastructure
  • Desired work locations
    Taichung City, Taiwan
    Taipei City, Taiwan
    Tainan City, Taiwan
    Taichung City, Taiwan
  • Freelance
    Non-freelancer

Work Experience

IT Infrastructure

暫時不提供
Full-time
May 2022 - Present
Taichung City, Taiwan
.Achievements 1. Establish EKS or GKE, configure the VPC, Subnet, NAT Route for the cluster, as well as the required node pools, vertical scaling of node pools, and node taints settings. 2. Set up Rancher to manage all clusters, use Helm to deploy a Monitoring System, and integrate alerts generated by the Monitoring System into a single Alertmanager to send alerts to internal alert groups within the company. 3. Based on the cluster resource data collected by the Monitoring System, determine the resource range required for individual services and configure the horizontal scaling quantity. 4 .Deploy Traefik using Helm, and add Middleware as needed to allow services within the cluster to be accessible through a single public endpoint, reducing the number of Public IPs generated by load balancers and lowering costs. 5. Create multiple Git Runners using Docker, or deploy a high-availability Git Runner using Helm, to deploy a high-availability pipeline. 6. Use Gitlab to write CI (Continuous Integration) scripts according to different environments and requirements, and upload the packaged Images to an internal Harbor. 7. Write the necessary configuration files for CD (Continuous Deployment), including Deployment, ConfigMap, Service, etc., according to requirements, then use Git Runner or ArgoCD to establish the CD process. 8. Use GCP Monitoring to send alerts to self-hosted Webhooks, and format them into readable content for alert groups after organizing, such as CloudSql resource usage. 9. Implement strict security standards for AWS (ACL, SG) and GCP (Firewall) to prevent security vulnerabilities. 10. Configure AWS IAM with appropriate permissions to allow processes requiring resource operations to execute, such as Terraform, aws cli in Shell Script . 11. Use Terraform to deploy infrastructure on AWS, such as EC2, GCE, VPC, etc. 12. Set up Active Directory as the internal account management system for the company, configure appropriate computer permissions through GPO, and achieve Single Sign-On for all internal services through Active Directory. 13. Develop a chatbot using Python-telegram-bot, allowing customers to perform web operations using Telegram, such as account linking, deposits, and withdrawals. .Skills AWS : EKS, VPC, EC2, LightSail, Route53, IAM, S3 GCP : GKE, VPC, GCE, Monitoring CI/CD : ArgoCD, GitlabCI CD, Git Runner IAC : Terraform Account : Active Directory, Authentik, KMS Server Container : Docker (Compose), Harbor, Dockge Programming : Python3, Shell Script
Logo of the organization.

Mis Engineering

Dec 2019 - May 2022
2 yrs 6 mos
.Achievements 1 .Established Prometheus + Grafana + TelegramBot setup to immediately trigger alerts upon machine failures. 2. Automated Azure resource deployment using PowerShell, such as batch creation of VMs and binding necessary resources like network cards and public IPs. 3. Enabled web-based self-operation for one-click switching of node public IPs to bypass complex manual Azure procedures during cloud node failures or public IP issues. 4. Maintained CDN, promptly initiating appropriate defenses without disrupting client usage during node failures or attacks through monitoring. 5. Utilized Aliyun API to automate the addition of necessary domain name resolutions without logging into Aliyun when new DNS resolutions are needed. 6. Installing ESXi and Setting up vCenter to Control an Internal ESXi Cluster and Configure Cluster Networking. .Skills Cloud: Azure Resources Management Monitoring: Prometheus + Grafana + Alertmanager + ChatApp CDN: WAF Settings, Node cluster management Scripts: PowerShell scripts for Azure control and Python scripts for Aliyun API control Fortigate: VPN, Firewall, Virtual IP, Route Vmware : ESXI + VCENTER SSL: Purchased SSL certificates correctly bound at every layer of infrastructure, including CDN or Netscaler ADC.
Logo of the organization.

Site Reliability Engineering

Jun 2019 - Nov 2019
6 mos
.Achievements 1 .Accelerated the speed of new site development by automating scripts. 2. Identified and optimized SQL commands causing errors in backend database writes. .Skills Nginx Optimization PHP Framework Symfony Doctrine ORM

Education

Logo of the organization.
Bachelor’s Degree
Computer Science Information Engineering
2014 - 2017