We are seeking a Distributed LLM Inference Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing and implementing scalable distributed systems for large language model inference. You will work closely with our research and product teams to develop and deploy cutting-edge AI models and infrastructure.

Key Responsibilities:

Design and implement scalable distributed systems for large language model inference
Collaborate with research and product teams to develop and deploy AI models and infrastructure
Develop and maintain high-performance, low-latency software components
Work with cross-functional teams to identify and prioritize engineering initiatives
Contribute to the development of our company's technical vision and strategy

Requirements:

5+ years of experience in software engineering, with a focus on distributed systems and large-scale data processing
Strong understanding of machine learning concepts and large language models
Experience with Python, Node.js, and AWS
Excellent problem-solving skills and ability to work in a fast-paced environment
Strong communication and collaboration skills

Distributed LLM Inference Engineer

View Assessment Process

Think you'll be a good fit?