Valstro

Site Reliability Engineer (SRE)

Posted 2 Days Ago
In-Office or Remote
Hiring Remotely in London, Greater London, England
Mid level
The Site Reliability Engineer will ensure system reliability and performance of a cloud-based trading platform, automate tasks, and improve integration and operations.
The summary above was generated by AI

Valstro is looking for a Site Reliability Engineer (SRE) to join our team! This person will help ensure the reliability, availability, and performance of our cloud-native trading platform. The role entails building and maintaining infrastructure, automating processes, and working closely with the Development and Platform teams to ensure seamless integration and deployment of the service.

The successful candidate will serve as an essential link between the wider organization, executive leadership, and external vendors. Their responsibilities will include ensuring system reliability, building and maintaining monitoring solutions for both production and UAT systems, automating operational tasks, responding to incidents, and continuously improving systems and processes.

This is a remote position that will report to the Site Reliability Lead.

What will you be doing?

· Act as a key intermediary between engineering, executive leadership, and external vendors.

· Ensure the reliability, availability, and performance of our cloud-based trading solutions.

· Develop and maintain monitoring solutions to track system performance and reliability.

· Automate operational tasks to improve efficiency and reduce manual intervention.

· Collaborate with development teams to ensure seamless integration and deployment.

· Respond to incidents and troubleshoot issues to minimize downtime.

· Continuously improve systems and processes to enhance reliability and performance.

· Participate in on-call rotations to provide 24/7 support for critical systems.


Requirements

· 3+ years of experience supporting production-level systems.

· Strong experience in site reliability engineering, systems engineering, or a related field.

· Proficiency in cloud-based infrastructure (e.g., AWS, Azure, or Google Cloud).

· Experience with monitoring and logging tools (e.g., ELK, LGTM, Prometheus, Datadog).

· Expertise in automation and scripting (e.g., Golang, Python, Bash, Terraform).

· Knowledge of containerization and orchestration (e.g., Docker, Kubernetes).

· Ability to effectively communicate and liaise between stakeholders, including internal teams, executive management and external vendors.

· Strong troubleshooting and problem-solving skills.

· Experience in establishing and enhancing reliability engineering practices and processes.

· Capable of operating effectively in a dynamic organizational environment with high delivery and quality expectations.

Fintech experience is a bonus.

Technical

· A recent bachelor's degree in Computer Science, Software Engineering, or a related field.

· Knowledge of SRE practices.

· Knowledge of observability and tooling, particularly the Grafana stack.


Benefits

Valstro offers an excellent benefits package, including pension or 401(k) plans, unlimited PTO, and highly competitive compensation. Our leadership team brings a wealth of experience and deep industry knowledge, and despite being a young company, we believe we have carefully dialed in our product-market fit.


