Key Responsibilities
- Develop and fine-tune large language models (LLMs) for specific applications
- Implement retrieval-augmented generation (RAG) and prompt engineering techniques
- Optimize model inference for low-latency and high-throughput scenarios
- Collaborate with researchers to translate academic advances into production systems
- Design evaluation frameworks for generative model performance
- Ensure models adhere to ethical guidelines and safety constraints
Requirements
- 3+ years of experience in AI/ML with a focus on generative models
- Proficiency in PyTorch and transformer-based architectures
- Strong background in NLP and model optimization techniques
- Experience fine-tuning and deploying LLMs
- Familiarity with evaluation metrics for generative tasks