Leave us your email address and we'll send you all the new jobs according to your preferences.

Senior Site Reliability Engineer

Posted 3 days 4 hours ago by Lloyds Banking Group

Permanent
Full Time
Other
Gloucestershire, Bristol, United Kingdom, BS153
Job Description

JOB TITLE: Senior Site Reliability Engineer

SALARY: £70,929 - £78,810

LOCATION(S): Bristol

HOURS: Full-time

WORKING PATTERN: Our work style is hybrid, which involves spending at least two days per week, or 40% of our time, at our Bristol office.

About this opportunity

Our SRE (Site Reliability Engineering) team within the Analytics & AI platform are looking for an experienced and passionate Engineer with strong hands-on development experience.

As a Senior SRE you'll be leading a team of engineers - working on a suite of automation and gen AI products. You will run and maintain a set of applications and services on a combination of Private and Public Clouds that will enable the business to realise the next generation of services that form the Bank's vision for 2025 and beyond!

What you'll do

  • Play a role in supporting a multi-application, multi-platform team
  • Investigate, fix and remediate service issues to improve the performance and the reliability of our diverse application estate
  • Contribute to and/or lead post-incident reviews and drive problem resolution to prevent recurrence
  • Identify and eliminate toil through automation with a focus on continuous improvement
  • Create pro-active monitoring and observability solutions to help us see issues before our customers do
  • Define and measure Service Level Objectives and Service Level Indicators

What you'll need

  • Strong practitioner in SRE principles (SLI, SLO & SLA) using Observability, Logging, Monitoring & Alerting
  • Experience of Infrastructure as Code and CI/CD pipelines using tools such as Terraform, Jenkins and Harness
  • Can independently design, implement, manage and optimise cloud deployments, proficient in using orchestrations tools such as Jenkins, GitHub, and Harness
  • Confident programming and troubleshooting issues in at least one of the following languages: Java, Groovy, Python, Ruby, JavaScript, PowerShell or Bash/Shell.
  • Experience of developing for, or administrating Kubernetes
  • Knowledge of GCP and Azure cloud platforms
  • Experience in identifying toil and building automated solutions to remove it
  • A strong understanding of Cloud security and networking
  • Experience in Cloud monitoring tools, preferably Dynatrace
Email this Job