Luupli

Site Reliability Engineer

Reposted 6 Days Ago
Remote
Hiring Remotely in United Kingdom
Mid level
The Site Reliability Engineer will design, build, and maintain AWS cloud infrastructure, ensure performance and reliability, automate tasks, and participate in incident management.
The summary above was generated by AI

About Luupli

Luupli is a social media app that has equity, diversity, and equality at its heart. We believe that social media can be a force for good, and we are committed to creating a platform that maximizes the value that creators and businesses can gain from it, while making a positive impact on society and the planet. Luupli began internal testing in June 2024 and is preparing for commercial beta testing from December 2024, with the hope of launching fully in summer 2025.

Job Title: Site Reliability Platform Engineer


About Luupli:

Our team is made up of passionate and dedicated individuals who are committed to making Luupli a success.



Role Description:

We are seeking a talented and experienced Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure and services, primarily hosted on AWS. If you have a passion for problem-solving, a deep understanding of AWS services, hands-on experience with Terraform, and proficiency in scripting with Python or Bash, we invite you to apply for this exciting opportunity.


Role and Responsibilities:


1. Infrastructure Design and Automation:

- Collaborate with software engineering and operations teams to design, build, and maintain cloud-based infrastructure using AWS and Terraform.

- Implement and enhance infrastructure-as-code (IaC) practices using Terraform to ensure reproducibility and scalability of infrastructure components.


2. Monitoring and Incident Management:

- Develop and maintain monitoring solutions to proactively identify performance bottlenecks, system outages, and other potential issues.

- Participate in incident response and root cause analysis efforts to drive continuous improvement and prevent future incidents.


3. Reliability and Performance Optimization:

- Optimise system performance, reliability, and cost efficiency through continuous monitoring, performance tuning, and capacity planning.

- Identify opportunities to automate manual processes and improve system resilience.


4. Scripting and Automation:

- Utilise Python or Bash scripting to create and maintain automation tools for various operational tasks and deployments.

- Implement and improve continuous integration and continuous deployment (CI/CD) pipelines.


5. Security and Compliance:

- Collaborate with security teams to implement best practices for securing cloud infrastructure and services.

- Ensure compliance with relevant industry standards and regulations.


6. Deployment and Release Management:

- Support CI/CD pipelines for application deployments and updates.

- Contribute to the design and implementation of deployment strategies that promote zero-downtime releases.


7. Documentation and Knowledge Sharing:

- Maintain clear and up-to-date documentation for infrastructure configurations, processes, and incident resolution procedures.

- Participate in knowledge sharing with team members to enhance overall expertise and skill sets.


Requirements:


1. Education and Experience:

- Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience).

- Proven experience as a Site Reliability Engineer or similar role.


2. Technical Skills:

- Extensive experience with Amazon Web Services (AWS) and its core services (EC2, S3, RDS, IAM, etc.).

- Strong proficiency in infrastructure-as-code (IaC) tools, with a focus on Terraform.

- Proficient in scripting with Python or Bash for automation and operational tasks.

- Solid understanding of networking principles and protocols.

- Knowledge of CI/CD pipelines and related tools.


3. Problem-Solving and Analytical Abilities:

- Ability to diagnose and resolve complex technical issues in a fast-paced environment.

- Analytical mindset to proactively identify potential system weaknesses and performance bottlenecks.


4. Collaboration and Communication:

- Strong teamwork and collaboration skills to work effectively with cross-functional teams.

- Excellent verbal and written communication skills.



Compensation

This is an equity-only position, offering a unique opportunity to gain a stake in a rapidly growing company and contribute directly to its success.


Top Skills

AWS
Bash
Python
Terraform
