Key Responsibilities
- Design, train, and optimize neural network models for high-performance computing applications
- Develop and implement algorithms for neural network training and inference acceleration
- Collaborate with hardware and software teams to integrate ML models into production systems
- Optimize model architectures for latency, power efficiency, and memory footprint
- Research and implement state-of-the-art techniques in neural network compression and acceleration
- Benchmark and validate model performance against industry standards
Requirements
- 5+ years of experience in machine learning with a focus on neural networks
- Strong expertise in deep learning frameworks (TensorFlow, PyTorch)
- Experience with neural network optimization techniques (quantization, pruning, distillation)
- Proficiency in Python, C++, and GPU programming (CUDA/OpenCL)
- Background in computer vision or high-performance computing is a plus