As a Data Engineer on the People Innovation Labs team at openai, you will design and implement scalable data systems to support the development of innovative people-related products and features. You will work closely with cross-functional teams to gather requirements, design data pipelines, and implement data solutions using a variety of technologies.
Key Responsibilities:
- Design and implement data pipelines to collect, process, and store data from various sources.
- Develop and maintain data models, data warehouses, and data lakes to support business intelligence and analytics.
- Collaborate with data scientists and engineers to develop and deploy machine learning models.
- Ensure data quality, security, and compliance with openai's data governance policies.
- Develop and maintain automated testing and deployment scripts to ensure data systems are reliable and scalable.
Requirements:
- 5+ years of experience in data engineering, with a focus on designing and implementing scalable data systems.
- Proficiency in Python, Node.js, and AWS, with experience in data warehousing, data lakes, and data pipelines.
- Strong understanding of machine learning concepts and experience with model deployment.
- Excellent communication and collaboration skills, with the ability to work with cross-functional teams.
- Bachelor's degree in Computer Science, Engineering, or related field.