Job Details
SRE - SC Eligible or cleared
- Negotiable
- UK, London City
- Full-time
• 40% client site attendance is non-negotiable.
Profile 1: Site Reliability Engineer Lead Role
Key Responsibilities:
• Lead the SRE team and evangelise the SRE approach across product groups in FCA.
• Provide SRE thought leadership from the Cognizant delivery team.
• Review and guide the team in the day-to-day operation and maintenance of observability tools.
• Support engineering teams directly with delivery of their observability backlog on demand.
• Work with product teams to create patterns, blueprints, and automations for monitoring & alerting.
• Measure & capture performance and capacity metrics.
• Work closely with Product Groups to discuss, understand, and shape the observability needs of existing and new services in alignment with the Event Management standard.
• Build business-focused dashboards based on stakeholder requirements.
• Drive and quality-assure Observability Plans issued by projects.
• Contribute to and quality-assure testing plans and quality results.
Key Skills and Experience:
• Strong experience in a primary role as a Site Reliability Engineer
• Strong experience in DevOps tools (GitHub, GitHub Actions, Workflow, CodeQL, Jenkins, Nexus, CloudFormation/Terraform, etc.)
• Strong experience in monitoring tools (Datadog preferred)
• Strong knowledge of AWS services: EC2, ELB, ECS, S3, Config, CloudTrail, EFS, Lambda, VPC
• Strong knowledge and experience of Python/shell scripting
Nice to have:
• Knowledge of Docker or container-based systems
• Knowledge of Chef
• AWS certification
