Key Responsibilities

Design, train, and optimize neural network models for high-performance computing applications
Develop and implement algorithms for neural network training and inference acceleration
Collaborate with hardware and software teams to integrate ML models into production systems
Optimize model architectures for latency, power efficiency, and memory footprint
Research and implement state-of-the-art techniques in neural network compression and acceleration
Benchmark and validate model performance against industry standards

Requirements

5+ years of experience in machine learning with a focus on neural networks
Strong expertise in deep learning frameworks (TensorFlow, PyTorch)
Experience with neural network optimization techniques (quantization, pruning, distillation)
Proficiency in Python, C++, and GPU programming (CUDA/OpenCL)
Background in computer vision or high-performance computing is a plus

Neural Networks Engineer - Expedera

View Assessment Process