Key Responsibilities
- Design, develop, and deploy generative AI models on AWS infrastructure
- Optimize large language models for performance, cost, and scalability
- Collaborate with cross-functional teams to integrate AI solutions into production systems
- Implement robust monitoring and evaluation frameworks for AI model performance
- Stay updated with the latest advancements in generative AI and cloud technologies
- Ensure compliance with data privacy and security standards
Requirements
- 3+ years of experience in machine learning or AI engineering
- Proficiency in Python and AWS cloud services
- Experience with generative AI models and frameworks (e.g., Hugging Face, LangChain)
- Strong understanding of NLP techniques and model optimization
- Familiarity with containerization and deployment pipelines