AWS -Site Reliability Engineer

March 1, 2026
£100000 - £100000
Urgent

Job Description

Excellent opportunity for AWS Site Reliability Engineers to be part of our Cloud Infrastructure & Security services practice. Cognizant Infrastructure Services – Provides IT infrastructure & Cloud services for clients across industry verticals, including both Consulting/Professional and Managed Services, across Enterprise Computing, Cloud services, Security Services, DevOps, Data Centres, End User Computing, Service Desk, Network Services and Environment Management Services.

Key Responsibilities

  • Design, code, test, and deliver software to automate manual operational work
  • Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
  • Collaborate with development team throughout the life cycle to help develop software for reliability and scale, ensuring minimal refactoring or changes
  • Identify application patterns and analytics in support of better service level objectives
  • Design self-healing and resiliency patterns
  • Design automated software and product upgrades, change management, and release management solutions
  • Collaborate with senior technical leads and mentor junior engineers.
  • Design, deploy and manage AWS environments with a focus on automation, scalability and security.
  • Build and maintain Infrastructure as Code(IaC) using tools such as Terraform.
  • Monitor and optimize system performance, availability, and security, applying observability best practices.

Key Skills and Experience

  • Must have Strong Hands-on exposure in AWS, Terraform, Python/Bash, CI/CD
  • Experience with Infrastructure as code(IaC) and CI/CD(Bitbucket, Jenkins,spinnaker).
  • Strong knowledge of containerization and orchestration, including Docker and Kubernetes.
  • Strong scripting skills in Python or Bash for automation.
  • Proven experience deploying and managing and deep understanding of AWS cloud infrastructure in secure environments.
  • Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks, VPC, subnets and security groups.)
  • Excellent troubleshooting, problem solving and debugging skills.
  • Bachelor’s degree or equivalent experience in software engineering discipline
  • Nice to have skills – Basic knowledge of AI technologies and prompt engineering to leverage generative AI for enhancing productivity and automating tasks

Location