As a Capacity Systems Software Engineer at openai, you will design and develop software systems to manage and optimize the capacity of our AI infrastructure. This role requires expertise in building scalable and efficient systems that can handle large amounts of data and computational workloads.
Key Responsibilities:
- Design and develop software systems to manage capacity and optimize resource utilization.
- Collaborate with cross-functional teams to identify capacity planning needs and develop solutions.
- Develop and maintain tools and scripts to monitor and analyze system performance.
- Work with engineering teams to integrate capacity systems with other infrastructure components.
- Contribute to the development of capacity planning and forecasting models.
Requirements:
- 5+ years of experience in software engineering, with a focus on capacity planning and optimization.
- Strong expertise in programming languages such as Python and Node.js.
- Experience with machine learning and data analysis techniques.
- Familiarity with cloud-based infrastructure and services, such as AWS.
- Excellent problem-solving and communication skills.