As a Cloud Platform Engineer, you'll design and manage Kubernetes clusters across AWS, Azure, and GCP, optimize performance, automate infrastructure through IaC with Terraform, and implement GitOps with ArgoCD. You'll also ensure security, establish monitoring systems, develop automation tools, and collaborate with development teams to enhance application deployment.
We're seeking a versatile Cloud Platform Engineer passionate about building and maintaining a highly reliable, scalable, and cloud-native infrastructure. You'll be vital in bridging the gap between development, operations, and SRE, ensuring our applications run smoothly on Kubernetes across multiple cloud platforms. Your deep understanding of Kubernetes, cloud technologies, and automation will be instrumental in empowering our teams to deliver high-quality software quickly and reliably.
What will you do?
- Design, deploy, and operate Kubernetes clusters across AWS, Azure, and GCP. Optimize cluster performance, ensure high availability, and implement robust security practices.
- Build and maintain cloud-native infrastructure components (load balancers, networking, storage, etc.) to support applications running on Kubernetes. Leverage Infrastructure as Code (IaC) with Terraform to automate and manage infrastructure provisioning and configuration.
- Embrace GitOps principles using ArgoCD to automate deployments and configuration changes and ensure consistency between the desired and actual system state.
- Establish comprehensive monitoring, logging, and alerting systems to gain insights into platform health and performance. Troubleshoot incidents swiftly and apply SRE principles to improve reliability and resilience.
- Develop automation scripts and tools (Python, Go, or other languages) to streamline workflows, eliminate manual tasks, and reduce operational overhead.
- Partner closely with development teams to understand their needs, provide guidance on platform best practices, and enable smooth integration and deployment of their applications.
- Implement and maintain stringent security measures for Kubernetes and cloud environments, ensuring compliance with industry standards and data protection regulations.
- Analyze resource usage and implement optimization strategies to maximize performance while controlling cloud costs.
- Participate in an on-call rotation, troubleshooting and resolving production issues promptly.
What makes you a match?
- 3+ years of experience working with Kubernetes in production environments. Deep understanding of cluster operations, networking, storage, and security within Kubernetes.
- Strong knowledge of AWS, Azure, and GCP, including core services, networking concepts, and security best practices.
- Proven experience implementing GitOps workflows with ArgoCD and managing infrastructure using Terraform.
- Fluency in at least one programming language (Python, Go, Java) for automation, scripting, and tool development.
- Familiarity with SRE practices like SLOs (Service Level Objectives), error budgeting, and blameless postmortems.
- Excellent analytical and troubleshooting skills to identify and resolve issues in complex cloud environments.
- Ability to communicate effectively with development, operations, and security teams to drive cross-functional initiatives.
- Ability to work from 8.30 PM to 5.30 AM IST to provide coverage for US time zones.
Top Skills
Go
Java
Kubernetes
Python
Similar Jobs
As a Support Engineer in the Global BI Development Team at PUMA, you will support and troubleshoot REST APIs and Azure applications, work closely with developers, create operational documentation, and assist external vendors. You will monitor API performance and propose improvements based on feedback.
Be an Early Applicant
As a Senior Engineer on the ML Platform team at CrowdStrike, you will build and maintain scalable ML pipelines, contribute to the development of the ML Experimentation Platform, and collaborate with Data Scientists and Engineers. You’ll facilitate the adoption of modern data and ML solutions, modularize complex ML code, and ensure adherence to software development best practices.
Be an Early Applicant
As a Salesforce QA Lead Engineer, you will build strategies for testing, develop and maintain automation scripts, and collaborate with various teams to align testing with product goals. Your role includes writing robust automation code, performing unit and regression tests, and keeping up with new automation technologies.
What you need to know about the Belfast Tech Scene
If asked to name the birthplace of the RMS Titanic, you might not say Belfast. Similarly, if asked to name Europe's leading destination for foreign direct investment in new software development, Belfast might not come to mind. Yet, both are true. The city has emerged as a tech powerhouse, recently ranked among the best in the U.K. for tech careers — especially for software developers. It also leads the U.K. with the highest percentage of software development jobs advertised.