About the Role
We are seeking a talented Computer Vision Engineer with strong academic credentials and research experience to join our AI/ML team. This role focuses on developing and implementing state-of-the-art computer vision solutions for specialized detection and analysis systems. You will work on challenging problems at the intersection of deep learning, forensic AI, and multi-modal analysis.
Key Responsibilities
Research & Development
- Design and implement novel computer vision architectures for complex detection and classification tasks
- Explore and evaluate cutting-edge research papers and integrate promising techniques into production systems
- Develop custom loss functions, training strategies, and optimization techniques for specialized applications
- Conduct rigorous experiments with comprehensive documentation and ablation studies
- Contribute to technical documentation, research reports, and potential publications
Model Development
- Build end-to-end deep learning pipelines for image, video, and multi-modal analysis
- Design and train custom neural network architectures combining CNNs, transformers, and hybrid approaches
- Implement advanced techniques including attention mechanisms, metric learning, and feature fusion
- Optimize models for production deployment with focus on accuracy-latency trade-offs
- Develop robust evaluation frameworks with domain-specific metrics
Technical Collaboration
- Work closely with the AI/ML team to solve complex technical challenges
- Provide insights on architectural decisions and experimental design
- Share knowledge through technical discussions and code reviews
- Stay current with latest research and identify relevant advances for team adoption
Production Integration
- Deploy models into production environments with monitoring and continuous improvement
- Implement data preprocessing pipelines and augmentation strategies
- Optimize inference performance through quantization, pruning, and efficient architectures
- Build APIs and services for model deployment using FastAPI or similar frameworks
Required Qualifications
Education
- Currently pursuing or completed Master's/PhD in Computer Vision, AI/ML, Computer Science, or related field from premier institutions (IIT, IIIT, NIT, or equivalent)
- Strong academic record with focus on computer vision and deep learning coursework
- Active research profile with publications in top-tier conferences (CVPR, ICCV, ECCV, NeurIPS, ICML) or journals
- Thesis/research work demonstrating deep technical expertise in computer vision
Experience
- 2+ years of hands-on experience in computer vision and deep learning research/development
- Proven track record of implementing research papers and novel architectures from scratch
- Experience with real-world computer vision projects beyond academic coursework
Technical Expertise
Deep Learning & Computer Vision
- Expert-level proficiency in PyTorch (preferred) or TensorFlow
- Strong understanding of CNN architectures (ResNet, EfficientNet, DenseNet, etc.)
- Experience with Vision Transformers (ViT, Swin, DINO, etc.)
- Knowledge of attention mechanisms and self-attention for vision tasks
- Understanding of metric learning, contrastive learning, and embedding-based methods
- Experience with multi-modal learning and cross-modal fusion techniques
Computer Vision Fundamentals
- Deep understanding of image processing, filtering, and transformations
- Experience with object detection (YOLO, Faster R-CNN, DETR) and segmentation
- Knowledge of video analysis techniques and temporal modeling
- Familiarity with feature extraction and representation learning
- Understanding of data augmentation strategies and regularization techniques
Research & Implementation
- Ability to read, critically analyze, and implement research papers independently
- Experience with experimental design, hypothesis testing, and ablation studies
- Proficiency in experiment tracking tools (Weights & Biases, MLflow, TensorBoard)
- Strong mathematical foundation in linear algebra, optimization, and probability
Software Engineering
- Proficient in Python with clean, modular coding practices
- Experience with OpenCV, torchvision, PIL/Pillow, and other CV libraries
- Knowledge of version control (Git) and collaborative development workflows
- Familiarity with Docker and containerization
- Experience with large-scale dataset handling and efficient data loading
Preferred Qualifications
- Publications in top-tier CV/ML conferences or journals (CVPR, ICCV, ECCV, NeurIPS, ICML, AAAI, etc.)
- Experience with forensic analysis, anomaly detection, or media authenticity verification
- Knowledge of generative models (GANs, VAEs, Diffusion Models)
- Understanding of adversarial robustness and model security
- Experience with 3D vision, depth estimation, or multi-view geometry
- Familiarity with signal processing for audio/visual analysis
- Background in image quality assessment or artifact detection
- Experience with few-shot learning, open-set recognition, or domain adaptation
- Knowledge of model compression and efficient architectures
- Contributions to open-source computer vision projects
- Experience with cloud platforms (AWS, GCP, Azure) for ML workloads
What We're Looking For
- Research Mindset: Strong analytical thinking with ability to formulate and test hypotheses rigorously
- Technical Excellence: Deep understanding of computer vision theory and modern deep learning
- Implementation Skills: Ability to quickly prototype ideas and translate research into working code
- Problem Solver: Creative approach to solving novel and ambiguous technical challenges
- Self-Motivated: Takes initiative in exploring new techniques and driving projects forward
- Collaborative: Excellent communication skills with ability to explain complex concepts clearly
- Detail-Oriented: Commitment to thorough experimentation, validation, and documentation
- Continuous Learner: Passion for staying current with rapidly evolving CV/ML research
What We Offer
- Opportunity to work on cutting-edge computer vision research with real-world applications
- Collaborative environment with focus on innovation and technical growth
- Exposure to production ML systems and end-to-end project ownership
- Flexibility to pursue research interests aligned with project goals
- Competitive compensation package