Key Responsibilities
- Design, develop, and deploy large language model (LLM) applications and agent workflows
- Optimize model performance for scalability, latency, and cost efficiency
- Collaborate with cross-functional teams to integrate AI solutions into production systems
- Implement robust evaluation frameworks to measure model accuracy and reliability
- Stay current with advancements in LLM architectures and agent-based systems
- Document technical specifications and maintain code quality standards
Requirements
- 3+ years of experience in machine learning or AI engineering
- Proficiency in Python and a deep learning framework such as PyTorch or TensorFlow
- Hands-on experience with LLM fine-tuning and agent workflow design
- Strong understanding of NLP concepts and model optimization techniques
- Experience with a major cloud platform (AWS, GCP, or Azure) and MLOps tools