Writer

Site reliability engineer

Reposted 16 Days Ago
In-Office or Remote
2 Locations
Senior level
This role involves designing and maintaining cloud infrastructure, automating provisioning, and enhancing system reliability through monitoring, collaboration, and mentorship.
The summary above was generated by AI

📐 About this role 

We are looking for a foundational member of the cloud infrastructure team at WRITER. In this role, you will help develop and implement our site reliability engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of WRITER’s critical systems, taking a proactive approach so that our high-ROI products reach our customers seamlessly.


🦸🏻‍♀️ Your responsibilities:

  • Lead the design, implementation, and maintenance of WRITER, Inc.’s cloud infrastructure to ensure high availability and performance

  • Design and implement scalable cloud automation to support seamless deployment for our largest enterprise customers

  • Automate infrastructure provisioning and management using Terraform & Python

  • Collaborate with development teams to optimize cloud resources and enhance system reliability

  • Develop and maintain monitoring and alerting systems to proactively identify and resolve issues affecting the reliability of our writing solutions

  • Conduct post-mortem analyses of system failures to identify root causes and implement preventive measures

  • Optimize and scale our cloud infrastructure to support growing user demand and ensure cost efficiency

  • Ensure the security and compliance of our systems, adhering to industry standards and regulations

  • Provide mentorship and technical guidance to junior engineers, fostering a culture of reliability and continuous improvement

  • Stay current with emerging technologies and industry trends to continuously improve our site reliability practices

⭐ Is this you? 

  • Proven expertise in Site Reliability Engineering with a minimum of 7 years of hands-on experience

  • Deep understanding of system architecture and infrastructure design to ensure high availability and performance

  • Bachelor’s degree in Computer Science, Engineering, or a related technical field

  • Strong proficiency in programming languages such as Python, Java, or Go for automation and monitoring

  • Experience with cloud platforms like AWS, Azure, or GCP, and their respective services for scalable and resilient systems

  • Expertise in containerization and orchestration technologies (e.g., Docker, Kubernetes)

  • Knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) to maintain system health and performance

  • Ability to lead and mentor junior engineers in best practices for reliability and system optimization

  • Excellent communication skills to collaborate effectively with cross-functional teams and stakeholders

  • Proactive approach to identifying and mitigating potential system failures and performance bottlenecks

Preferred skills & experience:
  • Software engineering expertise

  • Terraform

  • Python

  • Kubernetes

  • Scala

  • AWS/GCP

🍩 Benefits & perks (UK full-time employees):

  • Generous PTO, plus company holidays

  • Comprehensive medical and dental insurance

  • Paid parental leave for all parents (12 weeks)

  • Fertility and family planning support

  • Early-detection cancer testing through Galleri

  • Competitive pension scheme and company contribution

  • Annual work-life stipends for:

    • Home office setup, cell phone, internet

    • Wellness stipend for gym, massage/chiropractor, personal training, etc.

    • Learning and development stipend

  • Company-wide off-sites and team off-sites

  • Competitive compensation and company stock options


Top Skills

AWS
Azure
Docker
ELK Stack
GCP
Go
Grafana
Java
Kubernetes
Prometheus
Python
Terraform

What you need to know about the Belfast Tech Scene

If asked to name the birthplace of the RMS Titanic, you might not say Belfast. Similarly, if asked to name Europe's leading destination for foreign direct investment in new software development, Belfast might not come to mind. Yet, both are true. The city has emerged as a tech powerhouse, recently ranked among the best in the U.K. for tech careers — especially for software developers. It also leads the U.K. with the highest percentage of software development jobs advertised.
