
Lead Site Reliability Engineer – Cloud

We are seeking an experienced and technically proficient Lead Site Reliability Engineer to join a growing team focused on delivering reliable, scalable, and secure cloud-based services. This is an excellent opportunity to play a pivotal role in one of the organisation’s key technology transformation programmes.

About the Role
As a Lead Site Reliability Engineer, you will contribute to the design, development, and operation of cloud infrastructure and applications on Google Cloud Platform. You will work collaboratively with engineering and infrastructure teams to implement site reliability engineering (SRE) principles, focusing on system reliability, observability, automation, and operational excellence.

This role follows a hybrid working model, requiring attendance at the Bristol office for at least two days per week or 40% of the working time.

Key Responsibilities

  • Promote and embed SRE best practices within engineering teams and microservices environments

  • Partner with infrastructure and DevOps engineers to improve system resilience and performance

  • Troubleshoot complex incidents and implement long-term solutions through code and automation

  • Develop and improve automation pipelines to reduce manual operations and enhance system efficiency

  • Contribute to multiple strategic digital initiatives and collaborate across engineering domains

Essential Skills and Experience

  • Background in software engineering or telemetry, with a current focus on SRE

  • Extensive experience with public cloud platforms, particularly Google Cloud (or AWS/Azure)

  • Proven ability to manage Kubernetes clusters in production environments

  • Competence in scripting and development using languages such as Python, Java, Go, Bash, or PowerShell

  • Strong understanding of service-level objectives (SLOs), indicators (SLIs), and monitoring practices

  • Hands-on experience with infrastructure as code (e.g., Terraform) and CI/CD tools (e.g., Jenkins, Azure DevOps)

Desirable Knowledge

  • Familiarity with observability and performance tools such as Dynatrace, Stackdriver, Cloud Monitoring, or similar

  • Exposure to cost monitoring, logging frameworks, and cloud consumption analytics

Personal Attributes

  • Ability to mentor and support engineers in adopting SRE methodologies

  • Logical and structured problem-solving approach

  • Excellent collaboration and communication skills within cross-functional teams

  • Strong awareness of the software development lifecycle and agile delivery practices

What We Offer

  • Competitive pension contribution of up to 15%

  • Annual performance-based bonus

  • Company share schemes, including free shares

  • 30 days of annual leave plus bank holidays

  • A broad selection of benefits tailored to lifestyle, wellbeing, and personal circumstances

  • Inclusive policies, including enhanced parental leave and workplace flexibility

We are committed to creating an inclusive working environment that supports diversity in all forms. We welcome applications from all backgrounds and offer reasonable adjustments throughout the recruitment process.

Robert Walters Operations Limited is an employment business and employment agency and welcomes applications from all candidates.

Contract Type: Permanent

Specialism: Technology & Digital

Focus: DevOps & Cloud

Industry: Banking

Salary: £90,000 - £110,000 per annum + Bonus

Workplace Type: Hybrid

Experience Level: Senior Management

Location: Bristol

Job Reference: C39Q2J-DF27925D

Date posted: 7 July 2025

Consultant: Charlie Douds