visualization and machine learning to automate issue analysis, saving 90% of human effort Projects Voda Scheduler FebPresent https://github.com/heyfey/vodascheduler GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster Golang, Python | MLOps | Kubernetes, Docker, Helm, Kubeflow, RabbitMQ, MongoDB, Prometheus, Keras, Tensorflow, Pytorch, Horovod, REST API Architected and built a GPU scheduler using microservices architecture on top of Kubernetes and several open-source projects Sped up overall training time by 2.38x and increased cluster utilization by 1.4x with State-of-the-art scheduling algorithms with resource elasticity
Full-time / Quan tâm đến làm việc từ xa