Technical Skill Requirements
- PhD or MSEE preferred.
- 5+ years of experience working on optimized library development for GPU and/or SIMD and VLIW processor architectures.
- Experience in partitioning problems for execution on a multicore processor system with latency, bandwidth and memory utilization tradeoffs.
- Proficient in C/C++, Python and various ML frameworks like TensorFlow, PyTorch etc.
- Must have strong competency in understanding and analyzing complex signal processing and computer vision algorithms.
- Ability to translate a compute algorithm into fixed point hardware implementation.
- Excellent written and verbal communication skills in English.
- Experience creating internal and/or customer facing detailed documentation.