About the Role

We are seeking a highly skilled Reinforcement Learning (RL) Engineer to develop, implement, and optimize RL algorithms for real-world and simulation-based applications. The ideal candidate has strong foundations in machine learning, deep learning, control systems, and hands-on experience deploying RL models in production or embedded systems.

Responsibilities

Design, implement, and optimize RL algorithms such as PPO, SAC, TD3, DQN, A3C, TRPO, etc.
Develop custom reward functions, policy architectures, and learning workflows.
Conduct research on state-of-the-art RL techniques and integrate into product or research pipelines.
Build or work with simulation environments such as PyBullet, Mujoco, IsaacGym, CARLA, Gazebo, or custom environments.
Integrate RL agents with environment APIs, physics engines, and sensor models.
Deploy RL models on real systems (e.g., robots, embedded hardware, autonomous platforms).
Optimize RL policies for latency, robustness, and real-world constraints.
Work with control engineers to integrate RL with classical controllers (PID, MPC, etc.)
Run large-scale experiments, hyper parameter tuning, and ablation studies.
Analyse model performance, failure cases, and implement improvements.
Work closely with robotics, perception, simulation, and software engineering teams.
Document algorithms, experiments, and results for internal and external stakeholders.

Skills Required

Strong expertise in Python, with experience in ML frameworks like PyTorch or TensorFlow.
Deep understanding of:
- Markov Decision Processes (MDP)
- Policy & value-based RL
- Deep learning architectures (CNN, RNN, Transformers)
- Control theory fundamentals
Experience with RL libraries (stable-baselines3, RLlib, CleanRL, etc.).
Experience with simulation tools or robotics middleware (ROS/ROS2, Gazebo).

Added Advantage

Experience in robotics, mechatronic, or embedded systems.
Experience with C++ for performance-critical applications.
Knowledge of GPU acceleration, CUDA, or distributed training.
Experience bringing RL models from simulation to real-world (Sim2Real).
Experience with cloud platforms (AWS/GCP/Azure).

Experience

2–4 years of hands-on RL experience (academic or industry).
Published RL research papers (optional but preferred).

Qualifications

Bachelor’s/Master’s/PhD in Computer Science, Robotics, AI, Machine Learning, or related field

Reinforcement Learning (RL) Engineer