Key Responsibilities
- Design and maintain scalable AI infrastructure platforms
- Optimize cloud resources for AI workloads and cost efficiency
- Implement CI/CD pipelines for ML model deployment
- Manage Kubernetes clusters and containerized environments
- Monitor system performance and troubleshoot infrastructure issues
- Collaborate with engineering teams to improve deployment workflows
Requirements
- 3+ years of experience in infrastructure engineering or DevOps
- Proficiency in Kubernetes, Docker, and cloud platforms
- Experience with Terraform and infrastructure-as-code
- Strong understanding of networking and distributed systems
- Familiarity with AI/ML workloads and optimization techniques