logo

anyscale

Distributed LLM Inference Engineer

Department
Research
Job Type / Location
remote
Experience Required
3+ years
Posted On

We are seeking a Distributed LLM Inference Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing and implementing scalable distributed systems for large language model inference. You will work closely with our research and product teams to develop and deploy cutting-edge AI models and infrastructure.

Key Responsibilities:

  • Design and implement scalable distributed systems for large language model inference
  • Collaborate with research and product teams to develop and deploy AI models and infrastructure
  • Develop and maintain high-performance, low-latency software components
  • Work with cross-functional teams to identify and prioritize engineering initiatives
  • Contribute to the development of our company's technical vision and strategy

Requirements:

  • 5+ years of experience in software engineering, with a focus on distributed systems and large-scale data processing
  • Strong understanding of machine learning concepts and large language models
  • Experience with Python, Node.js, and AWS
  • Excellent problem-solving skills and ability to work in a fast-paced environment
  • Strong communication and collaboration skills

View Assessment Process

Think you'll be a good fit?