As a Data Scientist, Infrastructure at openai, you will be responsible for designing and implementing scalable data infrastructure to support the company's machine learning models. You will work closely with cross-functional teams to develop and maintain data pipelines, ensuring high-quality data is available for model training and deployment. This role requires a strong understanding of cloud-based technologies, data engineering principles, and machine learning concepts.
Key Responsibilities
- Design and implement scalable data infrastructure to support machine learning models
- Develop and maintain data pipelines to ensure high-quality data availability
- Collaborate with cross-functional teams to integrate data infrastructure with machine learning workflows
- Optimize data infrastructure for performance, scalability, and reliability
- Stay up-to-date with emerging trends and technologies in data engineering and machine learning
Requirements
- 5+ years of experience in data science, data engineering, or a related field
- Strong understanding of cloud-based technologies, such as AWS
- Proficiency in programming languages, such as Python
- Experience with machine learning concepts and techniques
- Excellent communication and collaboration skills