About the Role
As an AI/LLM Engineer, you will design, develop, and deploy scalable AI solutions using modern large language model (LLM) frameworks. You’ll work closely with cross-functional teams to integrate AI capabilities into products and platforms while ensuring performance and reliability.
What You’ll Do
- Design and develop AI/LLM-based applications and services
- Build and optimize pipelines using LLM frameworks and orchestration tools
- Implement and enhance Retrieval-Augmented Generation (RAG) systems
- Integrate AI features into existing platforms and workflows
- Collaborate with product, engineering, and business teams to define requirements
- Write clean, efficient, and production-ready code
- Test, evaluate, and monitor AI systems in production
Skills Required
- Hands-on experience with LLMs (OpenAI models, open-source models, etc.)
- Experience with orchestration frameworks (LangChain, LangGraph, etc.)
- Strong understanding of NLP concepts and ML/DL fundamentals
- Experience with model fine-tuning and evaluation techniques
- Proficiency in Python and AI/ML frameworks (PyTorch / TensorFlow)
- Understanding of RAG architectures and vector databases
Good to Have
- Experience with Docker, Kubernetes, or containerized deployments
- Exposure to microservices architecture and system design principles
- Familiarity with CI/CD pipelines and monitoring tools
- Experience working in cloud environments (AWS, Azure, GCP)