Synthesia Logo

Synthesia

Senior Machine Learning Platform Engineer

Reposted 11 Days Ago
Be an Early Applicant
In-Office
32 Locations
Mid level
In-Office
32 Locations
Mid level
The Machine Learning Platform Engineer will manage cloud infrastructure, support AI researchers, and deploy models in production, focusing on MLOps and CI/CD practices.
The summary above was generated by AI
Welcome to the video first world

From your everyday PowerPoint presentations to Hollywood movies, AI will transform the way we create and consume content. Today, people want to watch and listen, not read — both at home and at work. If you’re reading this and nodding, check out our brand video.

Despite the clear preference for video, communication and knowledge sharing in the business environment are still dominated by text, largely because high-quality video production remains complex and challenging to scale—until now….

Meet Synthesia

We're on a mission to make video easy for everyone. Born in an AI lab, our AI video communications platform simplifies the entire video production process, making it easy for everyone, regardless of skill level, to create, collaborate, and share high-quality videos. Whether it's for delivering essential training to employees and customers or marketing products and services, Synthesia enables large organizations to communicate and share knowledge through video quickly and efficiently. We’re trusted by leading brands such as Heineken, Zoom, Xerox, McDonald’s and more. Read stories from happy customers and what 1,200+ people say on G2.

In February 2024, G2 named us as the fastest growing company in the world. Today, we're at a $2.1bn valuation and we recently raised our Series D. This brings our total funding to over $330M from top-tier investors, including Accel, Nvidia, Kleiner Perkins, Google and top founders and operators including Stripe, Datadog, Miro, Webflow, and Facebook.

About the role

We’re looking for an experienced Machine Learning Platform Engineer to join our MLOps team at Synthesia. MLOps a group that enables our AI researchers and engineers to build, train, serve, and deploy state-of-the-art generative models at scale.

You’ll own critical infrastructure across both research and production, helping bridge our DevOps and MLOps domains. You can expect to work across cloud infrastructure, CI/CD pipelines, observability, and tooling, with autonomy to identify and fix bottlenecks in a fast-moving AI company.

This is a hands-on senior IC role (roughly level 5 scope). You’ll be joining a growing team that’s shifting from enablement to direct execution, and you’ll help shape how we scale our infrastructure over the next year.

What you'll do 
  • Manage and evolve our AWS (and some GCP) cloud environments, balancing reliability, cost, and velocity.

  • Maintain and scale Kubernetes (EKS) clusters — managing workloads, deployments, and monitoring at production scale.

  • Own and improve our CI/CD systems (GitHub Actions on our self-hosted AWS runners).

  • Define and implement Infrastructure as Code using Terraform and Terragrunt.

  • Strengthen observability via Datadog and enable teams to understand their systems in production.

  • Collaborate with AI researchers to deploy and monitor ML models — no prior ML experience required.

  • Drive FinOps practices: vendor management, cost allocation, and financial feedback loops.

  • Contribute to internal tooling, automation, and reporting platforms that improve developer experience.

You’ll thrive in this role if you have: 
  • Deep hands-on DevOps / SRE / Platform experience in a SaaS or high-traffic product environment.

  • Strong Kubernetes experience - spinning up and managing clusters, not just consuming them.

  • Proven AWS and or GCP expertise. 

  • Proficiency with Terraform / Terragrunt, Linux, and Python scripting.

  • Strong understanding of CI/CD design patterns.

  • Experience with Datadog or similar observability tooling.

  • Comfortable operating autonomously in ambiguous environments.

  • A pragmatic mindset - focusing on scalable, maintainable solutions over theoretical perfection.

  • A bias toward execution and written communication, especially in remote contexts.

Bonus points for:

  • Familiarity with Temporal.io, or workflow orchestration frameworks.

  • Light frontend or tooling development experience (React, Node.js).

  • Previous work supporting AI research or data-intensive environments

Our culture

At Synthesia we’re passionate about building, not talking, planning or politicising. We strive to hire the smartest, kindest and most unrelenting people and let them do their best work without distractions. Our work principles serve as our charter for how we make decisions, give feedback and structure our work to empower everyone to go as fast as possible. You can find out more about these principles here.

The hiring process:
  1. 30min call with a technical recruiter
  2. 45min call with engineering lead for MLOps to discuss your past projects
  3. Take-home assignment - does not have a deadline and it is syntax agnostic
  4. 60min technical discussion
  5. 30min call with leadership 

Other important info:

  • This is a remote role from an EU country, UK or Switzerland or hybrid from one of our London, Munich, Copenhagen, or Zurich hubs. 
  • This is full-time employment only - no contractors possible - usually through OysterHR or a local entity.
  • We only sponsor visas if you are in the UK or some EU countries already. 

Top Skills

AWS
Datadog
Github Actions
Kubernetes
Node.js
Python
Temporal.Io
Terraform

Similar Jobs

24 Minutes Ago
Hybrid
28 Locations
Senior level
Senior level
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
The Staff Engineer will architect and optimize Pfizer's AI infrastructure, build MLOps platforms, and enhance developer experiences while ensuring compliance with healthcare regulations.
Top Skills: AWSAzureCi/CdDockerGCPGrafanaHelmKserveKubeflowKubernetesMlflowPagerdutyPrometheusPythonSeldon CoreTensorflow ServingTerraformTorchserve
25 Minutes Ago
Hybrid
Chortiatis, GRC
Expert/Leader
Expert/Leader
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Lead the design and maintenance of scalable data infrastructure and pipelines, ensuring optimal performance and implementing DevOps principles across the data lifecycle.
Top Skills: Apache AirflowSparkAWSAzureBigQueryDatabricksGCPJavaKafkaPrefectPythonRedshiftScalaSnowflakeSQL
18 Hours Ago
Hybrid
28 Locations
Entry level
Entry level
Fintech • Machine Learning • Software • Financial Services
Join the HackaTUM challenge for a chance to win prizes. Team registration required; no CV needed. Insights sessions at specified times.

What you need to know about the Belfast Tech Scene

If asked to name the birthplace of the RMS Titanic, you might not say Belfast. Similarly, if asked to name Europe's leading destination for foreign direct investment in new software development, Belfast might not come to mind. Yet, both are true. The city has emerged as a tech powerhouse, recently ranked among the best in the U.K. for tech careers — especially for software developers. It also leads the U.K. with the highest percentage of software development jobs advertised.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account