About the Role
Tomorrow.io's engineering department focuses on building life-changing software and products at scale, from infrastructure handling massive amounts of data to outstanding customer-centric user experiences in B2B, B2C, and B2D products that impact billions of lives worldwide.
We are looking for a DevOps Engineer to power the reliability, security, and efficiency of the world's most impactful weather platform. You'll build self-service platforms that give developers and weather scientists true independence, weave AI into how we operate, and work side-by-side with R&D to push performance and scale further. You'll evolve our cloud infrastructure to match the pace of the business, hold the line on cost, and stay close to production through on-call. The people who thrive here bring a product mindset, take ownership without waiting to be asked, and leave the people and systems around them better than they found them.
Responsibilities
- Develop and adopt AI-powered tools to make Development and Operations processes more efficient
- Collaborate with developers and weather scientists to optimize service performance, reliability, scale, security, and cost
- Evolve and maintain adaptive cloud infrastructure to support our business strategy and enable smooth growth at scale
- Build self-service platforms for scientists and developers to work independently
- Introduce and integrate MLOps practices for GPU-based model deployment on Kubernetes
- Maintain Production availability by participating in DevOps on-call shifts
Requirements
- At least 4 years of experience as a DevOps/SRE Engineer in a Linux environment
- Experienced with AWS, GCP, or Azure and IaC, such as Terraform or Crossplane
- Experience with CI/CD tools and deployment methodologies in Kubernetes
- Strong sense of ownership and accountability for service reliability
- Comfort with AI-powered development tools and willingness to experiment with new technologies and methods
- Experience implementing and customizing monitoring systems (Datadog, Prometheus, ELK Stack)
- Experience working in an agile environment with high-velocity teams
- Proficiency with scripting languages like Python, Node.js, and Go
- Adaptable problem-solving mindset - thriving in changing environments and requirements
If you have a passion for perfecting developer processes and understanding how things work behind the scenes, and you strive for continuous improvement as a way of life with automation as the way to achieve it - this is the place for you. You’ll join our team with significant growth potential, contribute to evolving DevOps culture in a growing startup, and collaborate with some of the sharpest minds in the industry using cutting-edge technology and AI to make a meaningful global impact.