Veza Technologies

Sr. Site Reliability Engineer (SRE)

Posted Yesterday
Remote
28 Locations
Senior level
The role involves ensuring infrastructure reliability, incident response, operational improvements, customer support, and technical documentation. Leadership in SRE and cloud automation are key aspects.
The summary above was generated by AI
We are seeking a highly motivated Site Reliability Engineer (SRE) with a strong operational focus to join our growing team. In this role, you will be vital to ensuring the smooth operation and performance of our critical infrastructure and services. You'll work cross-functionally to create alignment and deliver results alongside builders who have helped shape the success of companies such as Google, Okta, AWS, and Snowflake.
We are looking for someone who has experience leading small teams and brings a technical leadership mindset as we grow the team. We are building the next-generation data security platform for the multi-cloud era. Will you join us?

You will:
  • Deploy software for Cloud Prem and SaaS customers.
  • Respond to and diagnose system incidents in a timely and efficient manner, minimizing downtime and impact on users.
  • Collaborate with other engineers to establish root causes and implement effective resolutions.
  • Continuously improve incident response processes and documentation for future occurrences.
  • Proactively monitor and maintain the health and performance of our infrastructure and services.
  • Perform routine administrative tasks such as system configuration, user management, and data backups.
  • Identify and implement operational improvements to ensure ongoing system reliability and efficiency.
  • Develop and implement scripts and automated solutions to streamline operational tasks and reduce manual workload.
  • Participate in the on-call rotation to address critical incidents outside of regular business hours.
  • Ensure effective handoff between on-call engineers and document post-incident information for future reference.
  • Document processes for support, and create, maintain, and execute runbooks for identified situations.
  • Provide tier 2/3 technical support to customers experiencing platform issues or requiring advanced troubleshooting.
  • Work directly with customer technical teams to resolve complex deployment, configuration, and integration challenges.
  • Conduct technical onboarding sessions and provide guidance on best practices for customer implementations.
  • Collaborate with customer success teams to ensure smooth customer experiences and rapid issue resolution.
  • Create and maintain customer-facing technical documentation, troubleshooting guides, and knowledge base articles.
  • Escalate customer feedback and feature requests to product and engineering teams.
  • Participate in customer calls and technical discussions to provide expert-level platform guidance.
  • Track and analyze customer support metrics to identify trends and areas for improvement.
You have:
  • Education:
    • BS degree in Computer Science or related field
  • Experience:
    • 3+ years of experience in Site Reliability Engineering
    • 2+ years of experience working with cloud platforms and cloud automation tools, especially in AWS
    • Strong experience with Kubernetes, Linux, AWS networking (VPC), and Terraform
    • Experience with the GitOps model for deployment
    • Familiarity with distributed version control
  • Other:
    • Experience with monitoring and alerting tools (e.g., Prometheus, Grafana)
    • Bazel and Helm experience a plus
    • Understanding of software configuration best practices
    • Ability to wear multiple hats in a fast-paced environment
    • Hands-on, “can do” attitude and a bias for action
    • Low ego and high intellectual curiosity
    • Comfortable working across time zones to support a global customer base
    • Excellent communication skills, with the ability to explain technical concepts to both technical and non-technical audiences
    • Strong customer service orientation with patience and empathy when working with frustrated customers

Our Culture 

We’re driven to build a strong company culture and are looking for individuals with solid alignment with the following:

  • Ownership Mindset
  • Act with Integrity
  • Guardians of our Customers
  • Opinionated Humility
  • Build Trust, Earn Trust

At Veza, your base pay is one part of your total compensation package. For this position, the reasonably expected pay range can be discussed with your recruiter for the level at which this job has been scoped. Your base pay will depend on several factors, including your experience, qualifications, education, location, and skills. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for equity and a competitive benefits package.

Veza is proud to be an equal opportunity employer. We are committed to equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, or other applicable legally protected characteristics. We also consider qualified applicants according to applicable federal, state, and local laws. If a candidate with a disability requires an accommodation during the recruitment process, please email [email protected]

About Veza

Veza is the identity security company. Identity and security teams use Veza to secure identity access across SaaS apps, on-prem apps, data systems, and cloud infrastructure. Veza solves the blind spots of traditional identity tools with its unique ability to ingest and organize permissions metadata in the Veza Authorization Graph. Global enterprises like Blackstone, Wynn Resorts, and Expedia trust Veza to visualize access permissions, monitor permissions activity, automate access reviews, and remediate privilege violations. Founded in 2020, Veza is headquartered in Redwood City, California, and is funded by Accel, Bain Capital, Ballistic Ventures, GV, Norwest Venture Partners, and True Ventures. Visit us at veza.com and follow us on LinkedIn, Twitter, and YouTube.

Top Skills

AWS
Bazel
Cloud
GitOps
Grafana
Helm
Kubernetes
Linux
Prometheus
Terraform