HTGAJ Admin — Senior Data Specialist

Job Detail

Senior Data Specialist

Alex Staff Agency · Manchester, GB

Others · Full-time

ID: #12328

Posted: 2026-02-27

Salary

—

Description

We need someone who understands data deeply and uses Python to wrangle it: not a platform engineer, not a pure pipeline builder, but a data specialist who's comfortable with research, investigation, and the unglamorous work of making messy energy market data actually usable.

You'll spend significant time on tasks like mapping BM units to power plants and fuel types, reconciling legacy data formats with current ones, ensuring consistency between different Elexon message types, and cleaning time-series data (outliers, gaps, overlaps). Some of this requires genuine investigation: cross-referencing sources, making judgment calls, documenting edge cases. There's no API that solves these problems for you.

Python (pandas, NumPy, standard libraries) is your primary tool for minimising manual effort, but you should be comfortable that some detective work is unavoidable. If you find satisfaction in truly understanding a dataset's structure and quirks, rather than just piping data through and hoping for the best, this role is for you.

Data Mapping and Research
• Map BM units from Elexon to their corresponding power plants, substations, and fuel types, combining API data, public registers, and manual research
• Map substations to ETYS zones and grid supply points
• Build and maintain reference/master datasets that link identifiers across disparate sources (Elexon, National Grid ESO, TEC register, etc.)
• Document mappings, assumptions, and known limitations clearly for downstream users

Data Reconciliation and Consistency
• Reconcile legacy data formats with current formats (e.g., historical operational data stored in different schemas or granularities)
• Ensure consistency between different Elexon message types: understand the market data structure well enough to know why BOALF, BOD, and DISBSAD might not perfectly align and how to handle it
• Investigate discrepancies between data sources and determine authoritative values

Data Cleaning and Quality
• Clean time-series data: detect outliers (price spikes, meter errors), fill gaps appropriately, resolve overlapping or duplicate timestamps
• Develop reusable Python-based cleaning routines that can be applied across datasets
• Understand why data quality issues occur (settlement reruns, late submissions, format changes) rather than just patching them

Pipeline Development (Supporting the Above)
• Write and maintain Python data grabbers for energy market APIs
• Build dbt models to transform raw data into clean, analysis-ready datasets
• Orchestrate workflows via GitHub Actions
• Design PostgreSQL schemas that reflect your understanding of the domain

Must Have
• Strong Python skills for data work: you're fluent with pandas, comfortable writing clean, testable code, and can build reusable data processing logic. This is not an Excel role.
• Solid SQL skills: complex queries, window functions, CTEs in PostgreSQL
• Experience with messy, real-world data: you've done reconciliation, cleaning, or mapping work before and understand it's not always automatable
• Methodical and detail-oriented: you notice inconsistencies and want to understand root causes
• Good documentation habits: you know that undocumented mappings and assumptions are technical debt
• Self-directed: you can own ambiguous problems, do your own research, and communicate findings clearly

Nice to Have
• Experience with energy, utilities, or market data (any geography)
• Familiarity with UK energy markets, Elexon data, or grid operations
• dbt experience for transformation pipelines
• Exposure to time-series data challenges (irregular timestamps, gaps, restatements)

Highly Desirable: Agentic AI Coding Experience

We value candidates who can build software using agentic AI coding systems. This is fundamentally different from using code completion tools or chat-based assistants.

What we're NOT looking for:
• GitHub Copilot (code completion/autocomplete)
• ChatGPT or similar chat interfaces for generating isolated code snippets
• Any tool that only provides single-turn question/answer interactions

What we ARE looking for: Hands-on experience with agentic coding systems such as Claude Code, Codex (OpenAI's agentic coding tool), OpenCode, or Cursor.

Ideal candidates will demonstrate:
• Breadth of…

Hard Skills 0

No hard skills extracted

Soft Skills 0

No soft skills extracted

Apply Options

Publisher	Direct	Link
Learn4Good	No	Apply
Learn4Good	No	Apply

API Logs for this Job

Query	Country	Status	Response ms	Created
Senior Data Specialist		fallback	449	2026-03-21 21:17
graduate data scientist in Manchester	gb	processed	10738	2026-03-21 17:20

Raw JSON

{
  "job_id": "ISIOXKYChaHEP-rfAAAAAA==",
  "job_city": "Manchester",
  "job_state": null,
  "job_title": "Senior Data Specialist",
  "job_salary": null,
  "job_country": "GB",
  "job_benefits": null,
  "job_latitude": 53.480759299999995,
  "job_location": "Manchester",
  "job_onet_soc": "43911100",
  "apply_options": [
    {
      "is_direct": false,
      "publisher": "Learn4Good",
      "apply_link": "https://www.learn4good.com/jobs/manchester/uk/info_technology/4900720056/e/?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": null,
      "publisher": "Learn4Good",
      "apply_link": "https://www.learn4good.com/jobs/manchester/uk/info_technology/4900720056/e/"
    }
  ],
  "employer_logo": null,
  "employer_name": "Alex Staff Agency",
  "job_is_remote": false,
  "job_longitude": -2.2426304999999997,
  "job_posted_at": "22 days ago",
  "job_publisher": "Learn4Good",
  "job_apply_link": "https://www.learn4good.com/jobs/manchester/uk/info_technology/4900720056/e/?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic",
  "job_highlights": {},
  "job_max_salary": null,
  "job_min_salary": null,
  "job_description": "We need someone who understands data deeply and uses Python to wrangle it — not a platform engineer, not a pure pipeline builder, but a data specialist who's comfortable with research, investigation, and the unglamorous work of making messy energy market data actually usable.\n\nYou'll spend significant time on tasks like: mapping BM units to power plants and fuel types, reconciling legacy data formats with current ones, ensuring consistency between different Elexon message types, and cleaning time-series data (outliers, gaps, overlaps). Some of this requires genuine investigation — cross-referencing sources, making judgment calls, documenting edge cases. There's no API that solves these problems for you.\n\nPython is your primary tool (Pandas, Numpy, standard libraries) to minimise manual effort, but you should be comfortable that some detective work is unavoidable. If you find satisfaction in truly understanding a dataset's structure and quirks — rather than just piping data through and hoping for the best — this role is for you.\nData Mapping and Research\n• Map BM units from Elexon to their corresponding power plants, substations, and fuel types — combining API data, public registers, and manual research\n• Map substations to ETYS zones and grid supply points\n• Build and maintain reference/master datasets that link identifiers across disparate sources (Elexon, National Grid ESO, TEC register, etc.)\n• Document mappings, assumptions, and known limitations clearly for downstream users\nData Reconciliation and Consistency\n• Reconcile legacy data formats with current formats (e.g., historical operational data stored in different schemas or granularities)\n• Ensure consistency between different Elexon message types — understand the market data structure well enough to know why BOALF, BOD, and DISBSAD might not perfectly align and how to handle it\n• Investigate discrepancies between data sources and determine authoritative values\nData 
Cleaning and Quality\n• Clean time-series data: detect outliers (price spikes, meter errors), fill gaps appropriately, resolve overlapping or duplicate timestamps\n• Develop reusable Python-based cleaning routines that can be applied across datasets\n• Understand why data quality issues occur (settlement reruns, late submissions, format changes) not just patch them\nPipeline Development (Supporting the Above)\n• Write and maintain Python data grabbers for energy market APIs\n• Build dbt models to transform raw data into clean, analysis‑ready datasets\n• Orchestrate workflows via Git Hub Actions\n• Design Postgre\n\nSQL schemas that reflect your understanding of the domain\nMust Have\n• Strong Python skills for data work — you're fluent with pandas, comfortable writing clean, testable code, and can build reusable data processing logic. This is not an Excel role.\n• Solid SQL skills — complex queries, window functions, CTEs in PostgreSQL\n• Experience with messy, real‑world data — you've done reconciliation, cleaning, or mapping work before and understand it's not always automatable\n• Methodical and detail‑oriented — you notice inconsistencies and want to understand root causes\n• Good documentation habits — you know that undocumented mappings and assumptions are technical debt\n• Self‑directed — you can own ambiguous problems, do your own research, and communicate findings clearly\nNice to Have\n• Experience with energy, utilities, or market data (any geography)\n• Familiarity with UK energy markets, Elexon data, or grid operations\n• dbt experience for transformation pipelines\n• Exposure to time‑series data challenges (irregular timestamps, gaps, restatements)\nHighly Desirable — Agentic AI Coding Experience\n\nWe value candidates who can build software using agentic AI coding systems. 
This is fundamentally different from using code completion tools or chat‑based assistants.\n\nWhat we’re NOT looking for:\n• Git Hub Copilot (code completion/autocomplete)\n• ChatGPT or similar chat interfaces for generating isolated code snippets\n• Any tool that only provides single‑turn question/answer interactions\n\nWhat we ARE looking for: Hands‑on experience with agentic coding systems such as Claude Code, Codex (OpenAI’s agentic coding tool), Open Code, or Cursor.\n\nIdeal candidates will demonstrate:\n• Breadth of…",
  "job_google_link": "https://www.google.com/search?q=jobs&gl=gb&hl=en&udm=8#vhid=vt%3D20/docid%3DISIOXKYChaHEP-rfAAAAAA%3D%3D&vssid=jobs-detail-viewer",
  "employer_website": "https://alexstaff.agency",
  "job_onet_job_zone": "4",
  "job_salary_period": null,
  "job_apply_is_direct": false,
  "job_employment_type": "Full–time",
  "job_employment_types": [
    "FULLTIME"
  ],
  "job_posted_at_timestamp": 1772150400,
  "job_posted_at_datetime_utc": "2026-02-27T00:00:00.000Z"
}