Senior Site Reliability Engineer

Date: 27-Dec-2020

Location: UXBRIDGE, GB

Company: Telefónica S.A.

giffgaff is the commercial brand of Telefónica UK Limited, a leading digital communications company owned by Telefónica S.A. We like to do things a little differently here at giffgaff.

We may be a small company, but we like to think big and make some radical waves in telco land. At the heart of it, we believe in simplicity. A better way to do mobile. We'd rather you stay with us because you want to, not because there's a nasty contract forcing you to. It's why we work our socks off every day to keep you, and guess what? It works. We're uSwitch Network of the Year 2019.

About Us

Software engineering is at the heart of what we do here at giffgaff. Our agile engineering teams build and support a wide variety of applications and services. These combine to create our unique user experience on the giffgaff website, enable a whole range of awesome features via modern APIs and, additionally, enable our internal teams to work in the most productive and efficient ways.

The quality and maintainability of our software are very important to us. We are looking for engineers with real-world experience of building and running contemporary software, especially across the whole development lifecycle. You will help us to design and implement tightly focussed APIs, user interfaces, services that internalise our core business logic, or internal tools and, together with your team, you will support them in production. We want you to share your opinions on how we are doing things, and help us get better!

About The Role

We are looking for Site Reliability Engineers with passion and energy, a strong desire to learn and improve, and a commitment to testing and excellence.

You’ll have to make tough decisions, implement best practice and find compromises when situations are not ideal. Oh, and you’ll need to explain that to people. Not just the technical ones. Because we work in product teams where not everyone is a techie. It’s fun, you get closer to the business and know more about why we do what we do, not just how.

We’re looking for passionate engineers who are comfortable with using a variety of frameworks, languages & tools and are happy to pick up new skills when the need arises.

We’d expect you to have a bachelor’s degree in computer science, programming, mathematics, or a related field, or be able to show that you have equivalent experience.

Key Skills for the role

You'll have hands-on experience with:

  • Cloud services platforms: the most commonly used AWS services
  • Container orchestration platforms: Kubernetes/EKS, OpenShift
  • Helm, Kubernetes Operators or similar package managers for Kubernetes
  • Strong background in Linux/Unix
  • Strong experience coding in one or more of the following programming languages to manage systems: Go, Python, Node.js
  • Source code management: Git
  • Implementation of monitoring solutions / Metric-driven alerting, logging and tracing (Prometheus, EFK, Alertmanager, Jaeger/Zipkin)
  • Troubleshooting in complex environments using the monitoring tools
  • Establishing and measuring SLIs and SLOs with engineering teams
  • Participating in periodic 24x7 on-call duties
  • Building and managing systems, infrastructure and applications through automation (Terraform, Ansible)
  • CI/CD tools: Jenkins, Nexus, Artifactory
  • Deploying, supporting and monitoring new and existing services, platforms, and application stacks
  • Capacity and performance management of environments
  • Extra bounties for experience implementing and managing Istio in cloud environments
  • (Managed Istio in hybrid environments? We love you!)

Good to have experience with:

  • Coding in one or more of the following programming languages: Java, PHP, ReactJS
  • Experience operating a variety of data stores, ideally in the cloud. We primarily use Oracle, Postgres, and Elasticsearch
  • Experience operating applications across multiple environments, cloud and in-house

You'll also bring in these traits:

  • Strong opinions about evolving SRE practices
  • Strong inclination to a test-first development approach including DevOps automation code
  • Contributions to open-source DevOps and SRE tools, however small
  • Participation in public discussions related to SRE tools and practices
  • Passion for identifying processes that can be automated and improved, i.e. you cannot stand manual operations

The Nuts & Bolts

  • Love delivering a wow to our members
  • Ability to communicate clearly at all levels of our organisation
  • Curiosity & Creativity
  • Excellent attention to detail
  • An interest in technology and mobile, and the ability to work under pressure and manage workloads effectively: organised, self-motivated and accountable for your own workload
  • A strong team player - Fun to be with and work with

Finally... This is a chance to work for one of the most sought-after UK companies, highly regarded for its community model. In return for your outstanding efforts, you’ll be rewarded with a competitive salary and excellent benefits. We believe that hard work should be supported and recognised. This position plays an important role across the business, allowing you to work cross-functionally, take on more responsibility and gain experience, which will greatly benefit you in the future.