Job Detail

Research Engineer in Reinforcement Learning

Data Science and AI Full–time
ID: #10823
Posted: 2026-03-18
Salary

Description

About Applied AI:

The Applied AI team at InstaDeep creates optimization solutions for large scale real-world industrial problems. Each solution is built to be ready for full production use by our clients and it is the Applied AI team that owns all stages of this lifecycle: Within Applied AI, Research Engineering builds the AI optimization systems, MLOps sets up the training and inference infrastructure and the Software teams build graphical user interfaces and API servers. These teams work together to provide the end-to-end capabilities required to create scalable and high-performing optimization tools for the end user, all done 100% within InstaDeep. Our work spreads across many industries - from Energy to Logistics to Production Optimization, there are plenty of interesting topics!

What does a working day look like:
• Research Engineers at InstaDeep work closely with our clients to build a deep understanding of the use case and the constraints the operational experts face in their daily operations. One of the core responsibilities of a Research Engineer is then to translate this industrial knowledge into optimization problem statements.
• On a daily basis, you will be working with your InstaDeep colleagues to brainstorm, prototype new research ideas and to iterate and improve on existing implementations by running large-scale experiments and deploying high-quality engineering. You will also have access to InstaDeep’s powerful compute resources (as of 2024 among the top 20 H100 GPU clusters), enabling you to efficiently train models and accelerate innovation.
• There are also opportunities to engage in pre-sales activities and to support our Business Development team with your acquired unique combination of domain knowledge and AI expertise.

Requirements:
• Professional experience in AI Research, Applied AI/ML or Mathematical Optimization
• Proven experience in software development in Python, ideally in projects with codebases of production-grade quality, multiple contributors and version control
• Candidates must have the right to work in the UK. Visa sponsorship is not available for this role.
• Attendance in the office for 3 days a week mandatory, we do not offer remote roles.

Nice to have:
• Professional experience in Reinforcement Learning and / or dealing with combinatorial optimization problems
• Hands-on experience in high-performance computing environments (Kubernetes, Ray etc.)
• Domain expertise in the industries of Supply Chain, Manufacturing, Aviation, Energy

Benefits:
• Time to dedicate to personal development, as well as InstaDeep-provided education opportunities
• Long-term incentive stock plans
• Private Health insurance
• Monthly Gym allowance

About us:

InstaDeep, founded in 2014, is a pioneering AI company at the forefront of innovation. With strategic offices in major cities worldwide, including London, Paris, Berlin, Tunis, Kigali, Cape Town, New York, and San Francisco, InstaDeep collaborates with giants like Google DeepMind and prestigious educational institutions like MIT, Stanford, Oxford, UCL, and Imperial College London. We are a Google Cloud Partner and a select NVIDIA Elite Service Delivery Partner. We have been listed among notable players in AI, fast-growing companies, and Europe's 1000 fastest-growing companies in 2022 by Statista and the Financial Times. Our recent acquisition by BioNTech has further solidified our commitment to leading the industry.

Join us to be a part of the AI revolution!

Our commitment to our people:

We empower individuals to celebrate their uniqueness here at InstaDeep. Our team comes from all walks of life, and we’re proud to continue encouraging and supporting applicants from underrepresented groups across the globe. Our commitment to creating an authentic environment comes from our ability to learn and grow from our diversity, and how better to experience this than by joining our team? We operate on a hybrid work model with guidance to work at the office 3 days per week to encourage close collaboration and innovation. We are continuing to review the situation with the well-being of InstaDeepers at the forefront of our minds.

Right to work:

Please note that you will require the legal right to work without visa sponsorship in the location you are applying for. We do not sponsor work visas.

Hard Skills 3
Skill Source Confidence
Python llm_hard 100%
Reinforcement Learning llm_hard 100%
Kubernetes llm_hard 80%
Soft Skills 9
Skill Source Confidence
Problem-Solving llm_soft 100%
Teamwork llm_soft 100%
Collaboration llm_soft 100%
Continuous Learning llm_soft 80%
Professional Development llm_soft 80%
Cross-Functional Communication llm_soft 80%
Skill Development llm_soft 80%
Critical Thinking llm_soft 80%
Analytical Thinking llm_soft 80%
Apply Options
Publisher Direct Link
LinkedIn No Apply
Talents By StudySmarter No Apply
BeBee GB No Apply
HiringCafe No Apply
Trabajo.org No Apply
Devfound No Apply
Jobilize No Apply
LinkedIn No Apply
API Logs for this Job
Query Country Status Response ms Created
Research Engineer in Reinforcement Learning extracted 7113 2026-03-22 02:42
Research Engineer in Reinforcement Learning classified 439 2026-03-21 21:05
junior ML engineer in United Kingdom gb duplicate 22049 2026-03-21 17:00
junior ML engineer in United Kingdom gb processed 22049 2026-03-21 17:00
Raw JSON
{
  "job_id": "1Lg4lUIkdMinHsdcAAAAAA==",
  "job_city": null,
  "job_state": null,
  "job_title": "Research Engineer in Reinforcement Learning",
  "job_salary": null,
  "job_country": "GB",
  "job_benefits": null,
  "job_latitude": 55.378051,
  "job_location": "United Kingdom",
  "job_onet_soc": "15111100",
  "apply_options": [
    {
      "is_direct": false,
      "publisher": "LinkedIn",
      "apply_link": "https://uk.linkedin.com/jobs/view/research-engineer-in-reinforcement-learning-at-instadeep-4386865410?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": false,
      "publisher": "Talents By StudySmarter",
      "apply_link": "https://talents.studysmarter.co.uk/companies/huawei-technologies-research-development-uk-ltd/ml-research-scientist-reinforcement-learning-llms-18906361/?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": false,
      "publisher": "BeBee GB",
      "apply_link": "https://gb.bebee.com/job/cc4f54ce7a0164335d675d6fc13fb3bf?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": false,
      "publisher": "HiringCafe",
      "apply_link": "https://hiring.cafe/viewjob/nstku9n0dpbsw1x0?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": false,
      "publisher": "Trabajo.org",
      "apply_link": "https://gb.trabajo.org/job-5002-b869ee00c9db2c6844f9c00169b156bb?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": false,
      "publisher": "Devfound",
      "apply_link": "https://devfound-web-production.up.railway.app/ai-research-scientist-open-endedness-reinforcement-learning-iconic-interactive/120064?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": false,
      "publisher": "Jobilize",
      "apply_link": "https://www.jobilize.com/amp/job/gb-london-machine-learning-research-scientist-valence-labs-hiring-pp0exyd?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": null,
      "publisher": "LinkedIn",
      "apply_link": "https://uk.linkedin.com/jobs/view/research-engineer-in-reinforcement-learning-at-instadeep-4386865410"
    }
  ],
  "employer_logo": "https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcRgzUwIhu6mV7iYp0jGMI4uanr7dCIh6buPM1WR&s=0",
  "employer_name": "InstaDeep",
  "job_is_remote": false,
  "job_longitude": -3.4359729999999997,
  "job_posted_at": "3 days ago",
  "job_publisher": "LinkedIn",
  "job_apply_link": "https://uk.linkedin.com/jobs/view/research-engineer-in-reinforcement-learning-at-instadeep-4386865410?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic",
  "job_highlights": {},
  "job_max_salary": null,
  "job_min_salary": null,
  "job_description": "About Applied AI:\n\nThe Applied AI team at InstaDeep creates optimization solutions for large scale real-world industrial problems. Each solution is built to be ready for full production use by our clients and it is the Applied AI team that owns all stages of this lifecycle: Within Applied AI, Research Engineering builds the AI optimization systems, MLOps sets up the training and inference infrastructure and the Software teams build graphical user interfaces and API servers. These teams work together to provide the end-to-end capabilities required to create scalable and high-performing optimization tools for the end user, all done 100% within InstaDeep. Our work spreads across many industries - from Energy to Logistics to Production Optimization, there are plenty of interesting topics!\n\nWhat does a working day look like:\n• Research Engineers at InstaDeep work closely with our clients to build a deep understanding of the use case and the constraints the operational experts face in their daily operations. One of the core responsibilities of a Research Engineer is then to translate this industrial knowledge into optimization problem statements.\n• On a daily basis, you will be working with your InstaDeep colleagues to brainstorm, prototype new research ideas and to iterate and improve on existing implementations by running large-scale experiments and deploying high-quality engineering. You will also have access to InstaDeep’s powerful compute resources (as of 2024 among the top 20 H100 GPU clusters), enabling you to efficiently train models and accelerate innovation.\n• There are also opportunities to engage in pre-sales activities and to support our Business Development team with your acquired unique combination of domain knowledge and AI expertise.\n\nRequirements:\n• Professional experience in AI Research, Applied AI/ML or Mathematical Optimization\n• Proven experience in software development in Python, ideally in projects with codebases of production-grade quality, multiple contributors and version control\n• Candidates must have the right to work in the UK. Visa sponsorship is not available for this role.\n• Attendance in the office for 3 days a week mandatory, we do not offer remote roles.\n\nNice to have:\n• Professional experience in Reinforcement Learning and / or dealing with combinatorial optimization problems\n• Hands-on experience in high-performance computing environments (Kubernetes, Ray etc.)\n• Domain expertise in the industries of Supply Chain, Manufacturing, Aviation, Energy\n\nBenefits:\n• Time to dedicate to personal development, as well as InstaDeep-provided education opportunities\n• Long-term incentive stock plans\n• Private Health insurance\n• Monthly Gym allowance\n\nAbout us:\n\nInstaDeep, founded in 2014, is a pioneering AI company at the forefront of innovation. With strategic offices in major cities worldwide, including London, Paris, Berlin, Tunis, Kigali, Cape Town, New York, and San Francisco, InstaDeep collaborates with giants like Google DeepMind and prestigious educational institutions like MIT, Stanford, Oxford, UCL, and Imperial College London. We are a Google Cloud Partner and a select NVIDIA Elite Service Delivery Partner. We have been listed among notable players in AI, fast-growing companies, and Europe's 1000 fastest-growing companies in 2022 by Statista and the Financial Times. Our recent acquisition by BioNTech has further solidified our commitment to leading the industry.\n\nJoin us to be a part of the AI revolution!\n\nOur commitment to our people:\n\nWe empower individuals to celebrate their uniqueness here at InstaDeep. Our team comes from all walks of life, and we’re proud to continue encouraging and supporting applicants from underrepresented groups across the globe. Our commitment to creating an authentic environment comes from our ability to learn and grow from our diversity, and how better to experience this than by joining our team? We operate on a hybrid work model with guidance to work at the office 3 days per week to encourage close collaboration and innovation. We are continuing to review the situation with the well-being of InstaDeepers at the forefront of our minds.\n\nRight to work:\n\nPlease note that you will require the legal right to work without visa sponsorship in the location you are applying for. We do not sponsor work visas.",
  "job_google_link": "https://www.google.com/search?q=jobs&gl=gb&hl=en&udm=8#vhid=vt%3D20/docid%3D1Lg4lUIkdMinHsdcAAAAAA%3D%3D&vssid=jobs-detail-viewer",
  "employer_website": "https://instadeep.com",
  "job_onet_job_zone": "5",
  "job_salary_period": null,
  "job_apply_is_direct": false,
  "job_employment_type": "Full–time",
  "job_employment_types": [
    "FULLTIME"
  ],
  "job_posted_at_timestamp": 1773792000,
  "job_posted_at_datetime_utc": "2026-03-18T00:00:00.000Z"
}