Key Responsibilities
- Design, develop, and deploy AI software components including foundation model training, large language model inference, and similarity search systems
- Implement state-of-the-art LLM optimization techniques to improve scalability, cost efficiency, and latency of production AI systems
- Collaborate with cross-functional teams to integrate AI capabilities into scalable, high-performance infrastructure
- Develop and maintain guardrails, model evaluation frameworks, and observability systems for responsible AI deployment
- Contribute to the technical vision and long-term roadmap for foundational AI systems
- Leverage open-source and SaaS technologies such as Hugging Face, Nemo Guardrails, and vector databases
Requirements
- Bachelor's degree in Computer Science, AI, or related field plus 8+ years of AI/ML experience, or Master's degree plus 6+ years
- 8+ years of Python programming experience with deep expertise in machine learning systems
- Strong foundation in mathematics and engineering principles for AI system optimization
- Passion for staying current with AI research and applying novel techniques in production
- Ability to lead technical vision and drive innovation in large-scale AI infrastructure