Job Summary
We are seeking a skilled Generative AI / LLM Engineer with hands-on experience building and deploying applications powered by Large Language Models (LLMs). The ideal candidate has strong expertise in prompt engineering, model integration, and developing scalable GenAI solutions.
Key Responsibilities
- Design and develop applications using Generative AI and LLMs (GPT, LLaMA, Claude, etc.)
- Implement prompt engineering strategies for optimal model performance
- Build and integrate LLM-based solutions (chatbots, copilots, automation tools)
- Work with vector databases and embeddings for semantic search and RAG pipelines
- Fine-tune and optimize models for specific business use cases
- Collaborate with cross-functional teams (product, data, engineering)
- Ensure scalability, performance, and security of AI applications
- Stay current with developments in AI/ML and GenAI technologies
Required Skills
- Strong experience in Python
- Hands-on experience with LLM providers and platforms (OpenAI, Hugging Face, Anthropic, etc.)
- Experience with prompt engineering and Retrieval-Augmented Generation (RAG)
- Familiarity with frameworks such as LangChain or LlamaIndex
- Experience with Vector Databases (Pinecone, FAISS, Weaviate, etc.)
- Knowledge of REST APIs and microservices architecture
- Understanding of ML fundamentals and NLP concepts
Preferred Skills
- Experience fine-tuning LLMs with parameter-efficient techniques (PEFT) such as LoRA
- Knowledge of MLOps tools and deployment (Docker, Kubernetes)
- Experience with cloud platforms (AWS, Azure, GCP)
- Exposure to frontend integration (React, Streamlit, etc.)
- Familiarity with data pipelines and big data tools