Key Responsibilities
- Design, develop, and deploy AI software components including foundation model training, large language model inference, and similarity search systems
- Implement state-of-the-art LLM optimization techniques to improve scalability, cost efficiency, and latency of production AI systems
- Collaborate with cross-functional teams to integrate AI capabilities into scalable, high-performance infrastructure
- Establish model evaluation frameworks, guardrails, and governance processes for responsible AI deployment
- Contribute to the technical vision and long-term roadmap for foundational AI systems
- Leverage open-source and SaaS technologies such as Hugging Face, Nemo Guardrails, and vector databases
Requirements
- Bachelor's degree in Computer Science, AI, or related field plus 4+ years of AI/ML development experience (or Master's degree plus 2+ years)
- Deep expertise in machine learning algorithms, model fine-tuning, and optimization techniques
- Proficiency in PyTorch, AWS infrastructure, and vector database systems
- Strong foundation in mathematics and engineering with a track record of identifying optimization opportunities
- Ability to stay current with AI research and apply novel techniques in production environments