Job Description
Excellent opportunity for AWS Site Reliability Engineers to be part of our Cloud Infrastructure & Security services practice. Cognizant Infrastructure Services – Provides IT infrastructure & Cloud services for clients across industry verticals, including both Consulting/Professional and Managed Services, across Enterprise Computing, Cloud services, Security Services, DevOps, Data Centres, End User Computing, Service Desk, Network Services and Environment Management Services.
Key Responsibilities
- Design, code, test, and deliver software to automate manual operational work
- Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
- Collaborate with development team throughout the life cycle to help develop software for reliability and scale, ensuring minimal refactoring or changes
- Identify application patterns and analytics in support of better service level objectives
- Design self-healing and resiliency patterns
- Design automated software and product upgrades, change management, and release management solutions
- Collaborate with senior technical leads and mentor junior engineers.
- Design, deploy and manage AWS environments with a focus on automation, scalability and security.
- Build and maintain Infrastructure as Code(IaC) using tools such as Terraform.
- Monitor and optimize system performance, availability, and security, applying observability best practices.
Key Skills and Experience
- Must have Strong Hands-on exposure in AWS, Terraform, Python/Bash, CI/CD
- Experience with Infrastructure as code(IaC) and CI/CD(Bitbucket, Jenkins,spinnaker).
- Strong knowledge of containerization and orchestration, including Docker and Kubernetes.
- Strong scripting skills in Python or Bash for automation.
- Proven experience deploying and managing and deep understanding of AWS cloud infrastructure in secure environments.
- Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks, VPC, subnets and security groups.)
- Excellent troubleshooting, problem solving and debugging skills.
- Bachelor’s degree or equivalent experience in software engineering discipline
- Nice to have skills – Basic knowledge of AI technologies and prompt engineering to leverage generative AI for enhancing productivity and automating tasks