HTGAJ Admin — Reinforcement Learning (RL) Engineer, Manipulation

Job Detail

Reinforcement Learning (RL) Engineer, Manipulation

Humanoid · GB

Data Science and AI Full–time

ID: #10564

Posted: 2026-03-12

Apply Link

Salary

—

Description

Humanoid is the first AI and robotics company in the UK, creating the world’s most advanced, reliable, commercially scalable, and safe humanoid robots. Our first humanoid robot HMND 01 is a next-gen labour automation unit, providing highly efficient services across various use cases, starting with industrial applications. Our Mission At Humanoid we strive to create the world’s leading, commercially scalable, safe, and advanced humanoid robots that seamlessly integrate into daily life and amplify human capacity. What You’ll Do • Train language-vision conditioned manipulation policies via reinforcement learning (RL) in simulation and in the real world. • Construct challenging and diverse suites of manipulation tasks in simulation. • Partner with teleoperations to collect trajectories in simulation for behavior cloning. • Partner with testing and operations to establish real-world RL training pipelines. • Experiment with various ways of bringing policies trained in simulation to the real world.. We’re Looking For • 3+ years building deep‑learning systems (industry or research) with shipped models or published artifacts to show for it. • Hands‑on with at least one of: LLMs, VLMs, or image/video generative models — architecture, training, and inference. • Experience solving real problems using reinforcement learning with deep neural networks in any domain. • Strong Python + PyTorch/JAX; you can profile, debug numerics, and write maintainable research code. • You are self-driven, pro-active, communicate efficiently, document experiments clearly and communicate trade‑offs crisply. Nice to have • Experience with simulators for robotics (Isaac Sim, MuJoCo etc.) • Experience in RL for robotics. • Experience building infrastructure for large-scale RL (e.g. using ray). • Publications at ICLR/ICML/NeurIPS or equivalent open‑source contributions. • Familiarity with OpenVLA, Physical Intelligence (π) models, or similar open VLA frameworks. What We Offer • Competitive salary plus participation in our Stock Option Plan • Paid vacation with adjustments based on your location to comply with local labor laws, and additional paid sick leave days • Travel opportunities to our Vancouver and Boston offices • Office perks: free breakfasts, lunches, snacks, and regular team events • Freedom to influence the product and own key initiatives • Collaboration with top‑tier engineers, researchers, and product experts in AI and robotics • Startup culture prioritising speed, transparency, and minimal bureaucracy How to Apply Does this role sound like the perfect fit for you? Fill in the form and include links or files that showcase the best of what you’ve built and achieved.

Hard Skills 4

Skill	Source	Confidence
Python	llm_hard	100%
Reinforcement Learning	llm_hard	100%
Large Language Models (LLMs)	llm_hard	100%
PyTorch	llm_hard	100%

Soft Skills 5

Skill	Source	Confidence
Written Communication	llm_soft	100%
Documentation	llm_soft	100%
Self-Motivation	llm_soft	100%
Initiative	llm_soft	100%
Collaboration	llm_soft	80%

Apply Options

Publisher	Direct	Link
LinkedIn	No	Apply
Talents By StudySmarter	No	Apply
LinkedIn	No	Apply

API Logs for this Job

Query	Country	Status	Response ms	Created
Reinforcement Learning (RL) Engineer, Manipulation		extracted	3763	2026-03-22 02:28
Reinforcement Learning (RL) Engineer, Manipulation		classified	443	2026-03-21 21:01
junior deep learning engineer in United Kingdom	gb	duplicate	13733	2026-03-21 17:11
junior AI engineer in United Kingdom	gb	duplicate	21364	2026-03-21 17:04
junior ML engineer in United Kingdom	gb	duplicate	22049	2026-03-21 17:00
junior machine learning engineer in United Kingdom	gb	processed	9050	2026-03-21 16:57

Raw JSON

{
  "job_id": "u3JJ7cgXl_Df4NLHAAAAAA==",
  "job_city": null,
  "job_state": null,
  "job_title": "Reinforcement Learning (RL) Engineer, Manipulation",
  "job_salary": null,
  "job_country": "GB",
  "job_benefits": null,
  "job_latitude": 55.378051,
  "job_location": "United Kingdom",
  "job_onet_soc": "15111100",
  "apply_options": [
    {
      "is_direct": false,
      "publisher": "LinkedIn",
      "apply_link": "https://uk.linkedin.com/jobs/view/reinforcement-learning-rl-engineer-manipulation-at-humanoid-4318766298?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": false,
      "publisher": "Talents By StudySmarter",
      "apply_link": "https://talents.studysmarter.co.uk/companies/humanoid/reinforcement-learning-rl-engineer-manipulation-18240673/?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": null,
      "publisher": "LinkedIn",
      "apply_link": "https://uk.linkedin.com/jobs/view/reinforcement-learning-rl-engineer-manipulation-at-humanoid-4318766298"
    }
  ],
  "employer_logo": "https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcSRnXqRrJM1wOdD0llVuhRw_B9vfNreBQ_GprLe&s=0",
  "employer_name": "Humanoid",
  "job_is_remote": false,
  "job_longitude": -3.4359729999999997,
  "job_posted_at": "9 days ago",
  "job_publisher": "LinkedIn",
  "job_apply_link": "https://uk.linkedin.com/jobs/view/reinforcement-learning-rl-engineer-manipulation-at-humanoid-4318766298?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic",
  "job_highlights": {},
  "job_max_salary": null,
  "job_min_salary": null,
  "job_description": "Humanoid is the first AI and robotics company in the UK, creating the world’s most advanced, reliable, commercially scalable, and safe humanoid robots. Our first humanoid robot HMND 01 is a next-gen labour automation unit, providing highly efficient services across various use cases, starting with industrial applications.\n\nOur Mission\n\nAt Humanoid we strive to create the world’s leading, commercially scalable, safe, and advanced humanoid robots that seamlessly integrate into daily life and amplify human capacity.\n\nWhat You’ll Do\n• Train language-vision conditioned manipulation policies via reinforcement learning (RL) in simulation and in the real world.\n• Construct challenging and diverse suites of manipulation tasks in simulation.\n• Partner with teleoperations to collect trajectories in simulation for behavior cloning.\n• Partner with testing and operations to establish real-world RL training pipelines.\n• Experiment with various ways of bringing policies trained in simulation to the real world..\n\nWe’re Looking For\n• 3+ years building deep‑learning systems (industry or research) with shipped models or published artifacts to show for it.\n• Hands‑on with at least one of: LLMs, VLMs, or image/video generative models — architecture, training, and inference.\n• Experience solving real problems using reinforcement learning with deep neural networks in any domain.\n• Strong Python + PyTorch/JAX; you can profile, debug numerics, and write maintainable research code.\n• You are self-driven, pro-active, communicate efficiently, document experiments clearly and communicate trade‑offs crisply.\n\nNice to have\n• Experience with simulators for robotics (Isaac Sim, MuJoCo etc.)\n• Experience in RL for robotics.\n• Experience building infrastructure for large-scale RL (e.g. using ray).\n• Publications at ICLR/ICML/NeurIPS or equivalent open‑source contributions.\n• Familiarity with OpenVLA, Physical Intelligence (π) models, or similar open VLA frameworks.\n\nWhat We Offer\n• Competitive salary plus participation in our Stock Option Plan\n• Paid vacation with adjustments based on your location to comply with local labor laws, and additional paid sick leave days\n• Travel opportunities to our Vancouver and Boston offices\n• Office perks: free breakfasts, lunches, snacks, and regular team events\n• Freedom to influence the product and own key initiatives\n• Collaboration with top‑tier engineers, researchers, and product experts in AI and robotics\n• Startup culture prioritising speed, transparency, and minimal bureaucracy\n\nHow to Apply\n\nDoes this role sound like the perfect fit for you?\n\nFill in the form and include links or files that showcase the best of what you’ve built and achieved.",
  "job_google_link": "https://www.google.com/search?q=jobs&gl=gb&hl=en&udm=8#vhid=vt%3D20/docid%3Du3JJ7cgXl_Df4NLHAAAAAA%3D%3D&vssid=jobs-detail-viewer",
  "employer_website": "https://thehumanoid.ai",
  "job_onet_job_zone": "5",
  "job_salary_period": null,
  "job_apply_is_direct": false,
  "job_employment_type": "Full–time",
  "job_employment_types": [
    "FULLTIME"
  ],
  "job_posted_at_timestamp": 1773273600,
  "job_posted_at_datetime_utc": "2026-03-12T00:00:00.000Z"
}