Key Responsibilities
- Operate and maintain production Kubernetes clusters in high-availability configurations
- Develop and maintain Helm charts for service deployment and lifecycle management
- Design and implement automation tools for CI/CD pipelines using Jenkins, GitHub, and Docker
- Troubleshoot and resolve failures in containerized environments and control planes
- Collaborate with cross-functional teams to design scalable, resilient infrastructure solutions
- Write and maintain automated tests for infrastructure components and services
Requirements
- Experience operating production Kubernetes clusters and cloud-based services
- Proficiency in Python or Go for tooling and automation development
- Strong understanding of container lifecycle, Docker, and Helm chart management
- Familiarity with CI/CD and build tools such as Jenkins, GitHub Actions, and Gradle
- Knowledge of cloud-provider IaaS ecosystems (VPC, storage, IAM)