top of page

ML Data Engineer in the UK, We’re able to offer Skilled Worker visa sponsorship in the UK for qualified candidates

You will develop our own data-pipeline infrastructure, design and manage high-throughput ingestion and preprocessing on Kubernetes, and collaborate closely with ML engineers to deliver datasets that advance model quality.

ML Data Engineer in the UK, We’re able to offer Skilled Worker visa sponsorship in the UK for qualified candidates | visajobshq.com

ML Data Engineer in the UK, We’re able to offer Skilled Worker visa sponsorship in the UK for qualified candidates | visajobshq.com

Key Responsibilities


  • Develop and maintain data-ingestion pipelines to source and prepare large-scale image (and occasional text/HTML) datasets from open, publicly accessible, and permitted sources.

  • Own the end-to-end flow: raw data → quality/beauty/relevance filtering → dedup/validation → ready-to-train artifacts.Operate and improve our Kubernetes-based data-pipeline framework (distributed jobs, retries, monitoring, automation).

  • Work with S3-style object storage: efficient layouts, lifecycle, throughput, and cost awareness.

  • Add tooling around pipelines (progress/health visualization, metrics, alerts) for observability and faster iteration.

  • Collaborate closely with ML engineers to align datasets with training needs and accelerate experimentation.

Requirements

Must-have

  • Strong Python fundamentals; you write clean, maintainable, production-ready code.

  • Solid hands-on Kubernetes experience (containers, jobs, batch/distributed processing).

  • Proven track record with unstructured data, especially images (loading, filtering, transforming at scale).

  • Experience developing data-ingestion or parsing tools for publicly accessible sources, including handling real-world reliability and failure cases gracefully.

  • Comfort with S3/object storage and moving lots of data efficiently and safely.

  • Pragmatic, detail-oriented, ownership mindset; you enjoy making systems reliable and fast.

Nice-to-have

  • Familiarity with ML workflows (PyTorch) and downstream training considerations.

  • Experience with image quality scoring, captioning, or image-to-text pipelines.

  • DAG/workflow visualizations or pipeline UX tooling.

  • DevOps fluency: Docker, CI/CD, infra automation.

What We Offer

  • ​​Competitive salary and equity.

  • We’re able to offer Skilled Worker visa sponsorship in the UK for qualified candidates.

  • Real impact on model quality: your pipelines directly power training runs and product improvements.

  • Ownership with support: autonomy to design and improve systems, alongside experienced ML peers.

  • Modern stack: Python, Kubernetes, S3, internal pipeline framework built for scale.

  • Growth: a fast-moving environment where shipping well-engineered systems is the norm.



Join our mailing list

Thanks for submitting!

  • White Facebook Icon
  • White Twitter Icon
  • White Instagram Icon

© 2025 by Visa Jobs Hq 

Visa Jobs Hq is a registered company with number 15952171.

bottom of page