As a Software Engineer on our GPU Infrastructure team, you will design and develop high-performance GPU infrastructure for large-scale machine learning workloads. You will work closely with our engineering teams to develop and deploy scalable, efficient, and reliable systems that enable our customers to train and deploy AI models at scale.
Key Responsibilities:
- Design and develop high-performance GPU infrastructure for large-scale machine learning workloads.
- Collaborate with cross-functional teams to develop and deploy scalable, efficient, and reliable systems.
- Develop and maintain software components that integrate with our GPU infrastructure.
- Work with our engineering teams to identify and prioritize technical requirements.
- Develop and maintain technical documentation and knowledge base.
Requirements:
- 5+ years of experience in software development, with a focus on high-performance computing and GPU programming.
- Strong understanding of machine learning concepts and algorithms.
- Experience with Python, Node.js, and AWS.
- Strong problem-solving skills and ability to work in a fast-paced environment.
- Excellent communication and collaboration skills.