HTGAJ Admin — Research Engineer in Reinforcement Learning

Job Detail

Research Engineer in Reinforcement Learning

InstaDeep · GB

Data Science and AI · Full-time

ID: #10823

Posted: 2026-03-18

Apply Link

Salary

Not specified

Description

About Applied AI:

The Applied AI team at InstaDeep creates optimization solutions for large-scale, real-world industrial problems. Each solution is built to be ready for full production use by our clients, and the Applied AI team owns all stages of this lifecycle. Within Applied AI, Research Engineering builds the AI optimization systems, MLOps sets up the training and inference infrastructure, and the Software teams build graphical user interfaces and API servers. These teams work together to provide the end-to-end capabilities required to create scalable and high-performing optimization tools for the end user, all done 100% within InstaDeep. Our work spans many industries, from Energy to Logistics to Production Optimization, so there are plenty of interesting topics!

What does a working day look like:
• Research Engineers at InstaDeep work closely with our clients to build a deep understanding of the use case and the constraints the operational experts face in their daily operations. One of the core responsibilities of a Research Engineer is then to translate this industrial knowledge into optimization problem statements.
• On a daily basis, you will work with your InstaDeep colleagues to brainstorm and prototype new research ideas, and to iterate and improve on existing implementations by running large-scale experiments and delivering high-quality engineering. You will also have access to InstaDeep’s powerful compute resources (as of 2024 among the top 20 H100 GPU clusters), enabling you to efficiently train models and accelerate innovation.
• There are also opportunities to engage in pre-sales activities and to support our Business Development team with your unique combination of domain knowledge and AI expertise.

Requirements:
• Professional experience in AI Research, Applied AI/ML or Mathematical Optimization
• Proven experience in software development in Python, ideally in projects with codebases of production-grade quality, multiple contributors and version control
• Candidates must have the right to work in the UK. Visa sponsorship is not available for this role.
• Attendance in the office 3 days a week is mandatory; we do not offer remote roles.

Nice to have:
• Professional experience in Reinforcement Learning and/or dealing with combinatorial optimization problems
• Hands-on experience in high-performance computing environments (Kubernetes, Ray, etc.)
• Domain expertise in the industries of Supply Chain, Manufacturing, Aviation, Energy

Benefits:
• Time to dedicate to personal development, as well as InstaDeep-provided education opportunities
• Long-term incentive stock plans
• Private Health insurance
• Monthly Gym allowance

About us:

InstaDeep, founded in 2014, is a pioneering AI company at the forefront of innovation. With strategic offices in major cities worldwide, including London, Paris, Berlin, Tunis, Kigali, Cape Town, New York, and San Francisco, InstaDeep collaborates with giants like Google DeepMind and prestigious educational institutions like MIT, Stanford, Oxford, UCL, and Imperial College London. We are a Google Cloud Partner and a select NVIDIA Elite Service Delivery Partner. We have been listed among notable players in AI, fast-growing companies, and Europe's 1000 fastest-growing companies in 2022 by Statista and the Financial Times. Our recent acquisition by BioNTech has further solidified our commitment to leading the industry.

Join us to be a part of the AI revolution!

Our commitment to our people:

We empower individuals to celebrate their uniqueness here at InstaDeep. Our team comes from all walks of life, and we’re proud to continue encouraging and supporting applicants from underrepresented groups across the globe. Our commitment to creating an authentic environment comes from our ability to learn and grow from our diversity, and how better to experience this than by joining our team? We operate on a hybrid work model with guidance to work at the office 3 days per week to encourage close collaboration and innovation. We are continuing to review the situation with the well-being of InstaDeepers at the forefront of our minds.

Right to work:

Please note that you will require the legal right to work without visa sponsorship in the location you are applying for. We do not sponsor work visas.

Hard Skills 3

Skill	Source	Confidence
Python	llm_hard	100%
Reinforcement Learning	llm_hard	100%
Kubernetes	llm_hard	80%

Soft Skills 9

Skill	Source	Confidence
Problem-Solving	llm_soft	100%
Teamwork	llm_soft	100%
Collaboration	llm_soft	100%
Continuous Learning	llm_soft	80%
Professional Development	llm_soft	80%
Cross-Functional Communication	llm_soft	80%
Skill Development	llm_soft	80%
Critical Thinking	llm_soft	80%
Analytical Thinking	llm_soft	80%

Apply Options

Publisher	Direct	Link
LinkedIn	No	Apply
Talents By StudySmarter	No	Apply
BeBee GB	No	Apply
HiringCafe	No	Apply
Trabajo.org	No	Apply
Devfound	No	Apply
Jobilize	No	Apply
LinkedIn	No	Apply

API Logs for this Job

Query	Country	Status	Response ms	Created
Research Engineer in Reinforcement Learning		extracted	7113	2026-03-22 02:42
Research Engineer in Reinforcement Learning		classified	439	2026-03-21 21:05
junior ML engineer in United Kingdom	gb	duplicate	22049	2026-03-21 17:00
junior ML engineer in United Kingdom	gb	processed	22049	2026-03-21 17:00

Raw JSON

{
  "job_id": "1Lg4lUIkdMinHsdcAAAAAA==",
  "job_city": null,
  "job_state": null,
  "job_title": "Research Engineer in Reinforcement Learning",
  "job_salary": null,
  "job_country": "GB",
  "job_benefits": null,
  "job_latitude": 55.378051,
  "job_location": "United Kingdom",
  "job_onet_soc": "15111100",
  "apply_options": [
    {
      "is_direct": false,
      "publisher": "LinkedIn",
      "apply_link": "https://uk.linkedin.com/jobs/view/research-engineer-in-reinforcement-learning-at-instadeep-4386865410?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": false,
      "publisher": "Talents By StudySmarter",
      "apply_link": "https://talents.studysmarter.co.uk/companies/huawei-technologies-research-development-uk-ltd/ml-research-scientist-reinforcement-learning-llms-18906361/?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": false,
      "publisher": "BeBee GB",
      "apply_link": "https://gb.bebee.com/job/cc4f54ce7a0164335d675d6fc13fb3bf?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": false,
      "publisher": "HiringCafe",
      "apply_link": "https://hiring.cafe/viewjob/nstku9n0dpbsw1x0?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": false,
      "publisher": "Trabajo.org",
      "apply_link": "https://gb.trabajo.org/job-5002-b869ee00c9db2c6844f9c00169b156bb?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": false,
      "publisher": "Devfound",
      "apply_link": "https://devfound-web-production.up.railway.app/ai-research-scientist-open-endedness-reinforcement-learning-iconic-interactive/120064?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": false,
      "publisher": "Jobilize",
      "apply_link": "https://www.jobilize.com/amp/job/gb-london-machine-learning-research-scientist-valence-labs-hiring-pp0exyd?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": null,
      "publisher": "LinkedIn",
      "apply_link": "https://uk.linkedin.com/jobs/view/research-engineer-in-reinforcement-learning-at-instadeep-4386865410"
    }
  ],
  "employer_logo": "https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcRgzUwIhu6mV7iYp0jGMI4uanr7dCIh6buPM1WR&s=0",
  "employer_name": "InstaDeep",
  "job_is_remote": false,
  "job_longitude": -3.4359729999999997,
  "job_posted_at": "3 days ago",
  "job_publisher": "LinkedIn",
  "job_apply_link": "https://uk.linkedin.com/jobs/view/research-engineer-in-reinforcement-learning-at-instadeep-4386865410?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic",
  "job_highlights": {},
  "job_max_salary": null,
  "job_min_salary": null,
  "job_description": "About Applied AI:\n\nThe Applied AI team at InstaDeep creates optimization solutions for large scale real-world industrial problems. Each solution is built to be ready for full production use by our clients and it is the Applied AI team that owns all stages of this lifecycle: Within Applied AI, Research Engineering builds the AI optimization systems, MLOps sets up the training and inference infrastructure and the Software teams build graphical user interfaces and API servers. These teams work together to provide the end-to-end capabilities required to create scalable and high-performing optimization tools for the end user, all done 100% within InstaDeep. Our work spreads across many industries - from Energy to Logistics to Production Optimization, there are plenty of interesting topics!\n\nWhat does a working day look like:\n• Research Engineers at InstaDeep work closely with our clients to build a deep understanding of the use case and the constraints the operational experts face in their daily operations. One of the core responsibilities of a Research Engineer is then to translate this industrial knowledge into optimization problem statements.\n• On a daily basis, you will be working with your InstaDeep colleagues to brainstorm, prototype new research ideas and to iterate and improve on existing implementations by running large-scale experiments and deploying high-quality engineering. 
You will also have access to InstaDeep’s powerful compute resources (as of 2024 among the top 20 H100 GPU clusters), enabling you to efficiently train models and accelerate innovation.\n• There are also opportunities to engage in pre-sales activities and to support our Business Development team with your acquired unique combination of domain knowledge and AI expertise.\n\nRequirements:\n• Professional experience in AI Research, Applied AI/ML or Mathematical Optimization\n• Proven experience in software development in Python, ideally in projects with codebases of production-grade quality, multiple contributors and version control\n• Candidates must have the right to work in the UK. Visa sponsorship is not available for this role.\n• Attendance in the office for 3 days a week mandatory, we do not offer remote roles.\n\nNice to have:\n• Professional experience in Reinforcement Learning and / or dealing with combinatorial optimization problems\n• Hands-on experience in high-performance computing environments (Kubernetes, Ray etc.)\n• Domain expertise in the industries of Supply Chain, Manufacturing, Aviation, Energy\n\nBenefits:\n• Time to dedicate to personal development, as well as InstaDeep-provided education opportunities\n• Long-term incentive stock plans\n• Private Health insurance\n• Monthly Gym allowance\n\nAbout us:\n\nInstaDeep, founded in 2014, is a pioneering AI company at the forefront of innovation. With strategic offices in major cities worldwide, including London, Paris, Berlin, Tunis, Kigali, Cape Town, New York, and San Francisco, InstaDeep collaborates with giants like Google DeepMind and prestigious educational institutions like MIT, Stanford, Oxford, UCL, and Imperial College London. We are a Google Cloud Partner and a select NVIDIA Elite Service Delivery Partner. We have been listed among notable players in AI, fast-growing companies, and Europe's 1000 fastest-growing companies in 2022 by Statista and the Financial Times. 
Our recent acquisition by BioNTech has further solidified our commitment to leading the industry.\n\nJoin us to be a part of the AI revolution!\n\nOur commitment to our people:\n\nWe empower individuals to celebrate their uniqueness here at InstaDeep. Our team comes from all walks of life, and we’re proud to continue encouraging and supporting applicants from underrepresented groups across the globe. Our commitment to creating an authentic environment comes from our ability to learn and grow from our diversity, and how better to experience this than by joining our team? We operate on a hybrid work model with guidance to work at the office 3 days per week to encourage close collaboration and innovation. We are continuing to review the situation with the well-being of InstaDeepers at the forefront of our minds.\n\nRight to work:\n\nPlease note that you will require the legal right to work without visa sponsorship in the location you are applying for. We do not sponsor work visas.",
  "job_google_link": "https://www.google.com/search?q=jobs&gl=gb&hl=en&udm=8#vhid=vt%3D20/docid%3D1Lg4lUIkdMinHsdcAAAAAA%3D%3D&vssid=jobs-detail-viewer",
  "employer_website": "https://instadeep.com",
  "job_onet_job_zone": "5",
  "job_salary_period": null,
  "job_apply_is_direct": false,
  "job_employment_type": "Full–time",
  "job_employment_types": [
    "FULLTIME"
  ],
  "job_posted_at_timestamp": 1773792000,
  "job_posted_at_datetime_utc": "2026-03-18T00:00:00.000Z"
}