Research Engineer in Reinforcement Learning

InstaDeep - Paris (75)


InstaDeep Ltd is an EMEA leader in decision-making AI products for the Enterprise, with headquarters in London, and offices in Paris, Tunis, Lagos, Dubai and Cape Town. With expertise in both machine intelligence research and practical business deployments, the Company provides a competitive advantage to its partners in an AI-first world. Leveraging its extensive know-how in GPU-accelerated computing, deep learning and reinforcement learning, InstaDeep has built products and solutions that tackle the most complex challenges across a range of industries. The firm’s hands-on approach to research, combined with a broad spectrum of clients, ensures an exciting and rewarding environment to work and thrive in. InstaDeep has also developed collaborations with global leaders in the Artificial intelligence ecosystem, such as Google DeepMind, Nvidia and Intel.


InstaDeep is looking for a new Research Engineer in Reinforcement Learning to join our expanding research team in Paris. The team’s research focuses on better understanding how to apply efficient reinforcement learning to solve real industrial problems at scale. Notably, the team studies in depth compositional approaches based on neural program synthesis, model based reinforcement learning and the improvement of evolutionary approaches sample efficiency thanks to policy gradients. Research efforts are towards developing novel algorithms and improving the state-of-the-art in these areas.

In the Research team, the focus of a Research Engineer is to participate in the research effort by assisting Research Scientists. Their goal is to develop and implement novel ideas while building effective, modular and sustainable software solutions.

The core tasks Research Engineers are responsible for include: developing prototype applications, providing software design and programming support to research projects, as well as implementing and maintaining software libraries.

In this role at InstaDeep you will report to the research team lead in Paris. Given the current state of the pandemic, all work will be fully remote. However, relocation may be required once the situation improves.


  • Implement novel algorithms and research ideas as directed by research scientists and team leads in accordance with the team’s research agenda and goals. This will primarily be in the area of reinforcement learning and evolutionary algorithms, but could include related algorithms/ideas spanning larger fields such as machine learning and deep learning.
  • Contribute to the design, project planning and implementation of a core research library and environment test suite for reinforcement learning.
  • Design and implement algorithms in such a way to best leverage modern hardware and distributed computing systems (CPUs, GPUs, TPUs, Cloud, etc.).
  • Report and present experimental results and research findings clearly and efficiently, both internally and externally, verbally and in writing.
  • Contribute to the team’s publication efforts, which could include the development of model diagrams, producing high quality plots of experimental results, assisting in writing up experimental details and results, e.g. explaining model architectures, hyperparameter configurations/tuning and training procedures.
  • When required, bridge the gap between the research and product teams by integrating new fundamental research into applied projects. This could include collaborating with the Engineering team to design and run experiments, including designing and evaluating new algorithms as well as implementing known algorithms at scale across distributed computing infrastructure and assist in deploying models in production.

Person Specification


  • M.S./Ph.D. degree in Computer Science, Operational Research, Reinforcement learning or related field.
  • Basic knowledge in reinforcement learning.
  • Experience in developing and debugging in C/C++, Python or similar languages.
  • Experience using deep learning frameworks such as PyTorch, Tensorflow and/or Jax.
  • Experience with distributed systems, HPC, compilers, and/or CUDA programming.
  • Research and software engineer experience demonstrated via an internship, contributions to open source, work experience or coding competitions.
  • Proven ability to contribute to research communities and/or efforts, including publishing scientific papers at conferences (JMLR, ICLR, NeurIPS, ICML, GECCO, etc.).
  • Work permit for France.


  • Working in small, diverse teams where you can make an impact.
  • Varied challenges across industries .
  • Cooperation across European and African offices
  • Annual offsite events.
Attention - In the recruitment process, legitimate companies never withdraw fees from candidates. If there are companies that attract interview fees, tests, ticket reservations, etc. it is better to avoid it because there are indications of fraud. If you see something suspicious please contact us: [email protected]