We are seeking a highly skilled Research Infrastructure Engineer to join our team at OpenAI. As a key member of our infrastructure team, you will be responsible for designing, building, and maintaining large-scale research infrastructure for training AI systems. You will work closely with our research scientists and engineers to develop and deploy cutting-edge AI technologies.
Key Responsibilities:
- Design and implement scalable and efficient infrastructure for large-scale AI training
- Collaborate with research scientists and engineers to develop and deploy AI technologies
- Develop and maintain high-performance computing clusters and data storage systems
- Ensure the reliability, scalability, and security of our research infrastructure
- Stay up-to-date with the latest advancements in AI and infrastructure technologies
Requirements:
- 5+ years of experience in designing and building large-scale research infrastructure
- Strong understanding of cloud computing platforms (AWS) and containerization (Docker)
- Experience with high-performance computing clusters and data storage systems
- Strong programming skills in Python and Node.js
- Experience with machine learning and AI technologies