Key Responsibilities
- Design and implement pre-training strategies for large-scale AI models
- Optimize training pipelines for efficiency and scalability
- Collaborate with cross-functional teams to integrate models into production systems
- Research novel techniques in neural network architectures and training methodologies
- Evaluate model performance and iterate on improvements
- Document research findings and contribute to technical reports
Requirements
- Master's or PhD in Computer Science, AI, or related field
- 3+ years of experience in machine learning research or model training
- Strong proficiency in Python and deep learning frameworks (PyTorch/TensorFlow)
- Experience with large-scale distributed training systems
- Publications or contributions to open-source AI projects are a plus