Fortis Games

Lead SRE

Reposted 6 Days Ago
Remote
Hiring Remotely in United Kingdom
Senior level
Lead design and implementation of systems for game reliability and performance. Optimize deployments, mentor team members, and ensure cloud security.
The summary above was generated by AI

Who we are
At Fortis Games we aspire to make great games that bring people together while redefining how game companies work. We believe in building a sense of belonging through our games, their communities, and how we operate and treat each other. Through our game communities, we will create powerful connections and lasting memories. We will foster a culture of diversity, equity and belonging where together our diverse skills, experiences and backgrounds impact the games we make.
We are an early but mighty organization with a leadership team of game industry veterans. There are many opportunities for you to have a big impact on the products we'll be making as well as the overall direction of the company. If you're passionate about tackling difficult problems with direct and thoughtful communication and a team-first mentality, we may be the right place for you.

About the role

As a Lead Site Reliability Engineer, you’ll lead the design, implementation, and optimization of systems that ensure our games scale seamlessly and deliver an exceptional player experience. This role is central to driving our production readiness initiative, working closely with development teams to embed reliability, observability, and cost-conscious practices into our product lifecycle. You’ll leverage monitoring tools to make recommendations and implement changes using infrastructure-as-code tools such as Terraform and Terragrunt.

What you’ll achieve

  • Implement security measures for cloud-based solutions, focusing on identity and access management (IAM), data privacy, ECS, and Kubernetes.
  • Lead production readiness reviews to understand the state of the system and suggest improvements.
  • Design and run incident response tabletop exercises with our development teams.
  • Implement tooling to optimize systems for predictable deployments, better observability, and reduced MTTR.
  • Conduct cost optimization exercises with development teams to help build a cost-aware culture.
  • Mentor junior team members, guiding them in best practices and contributing to their professional growth.

What you’ll need to be successful

  • Proven experience as a Site Reliability Engineer, DevOps Engineer, or similar role.
  • Strong proficiency with automation tools and scripting languages (e.g., Python, JavaScript, Shell, PowerShell).
  • Hands-on experience with containerization and orchestration technologies (e.g., Docker, AWS ECS, Kubernetes).
  • Familiarity with Infrastructure as Code tools (e.g., Terraform, Ansible).
  • Hands-on experience managing Kubernetes workloads (EKS preferred). Familiarity with ECS is a plus.
  • Excellent problem-solving skills and the ability to work collaboratively with other teams.
  • Proven expertise in designing systems that prioritize customer outcomes, leveraging fast feedback loops and iterative design.
  • Extensive experience with logging, application performance, and monitoring tools (e.g., Datadog, Prometheus, Grafana).

Why join us
There are many reasons to join us, but here are a few:

  • We strongly believe we are changing how game studios operate, and at the core of what we do is making great games that create a connected community.
  • We're not just about making Games Where You Belong. We're also about building communities where our people belong. That's why Fortis is a thriving environment that celebrates diversity, embraces inclusivity, and fosters growth.
  • Build and grow with a seasoned team of accomplished talent who have left an impactful mark in their disciplines, both in and out of gaming.

Fortis is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, gender expression, national origin, protected veteran status, or any other basis protected by applicable law, and will not be discriminated against on the basis of disability.

Top Skills

Ansible
AWS ECS
Datadog
Docker
Grafana
JavaScript
Kubernetes
PowerShell
Prometheus
Python
Shell
Terraform
