CGIBangalore, KA

AWS Cloud DevOps / Site Reliability Engineer (SRE)

Description

Job Title: AWS Cloud DevOps / Site Reliability Engineer (SRE) Position: LA Experience: 10+ yrs Category: IT Infrastructure Main location: Bangalore, ODC (Close bay)

  • All 5 days work from office Shift : 3:30 PM
  • 12:30 PM IST Joining period : 0-30 days Position ID: J0126-0882 Employment Type: Full Time

Qualification: Bachelor's degree in Computer Science or related field or higher with minimum 10 years of relevant experience.

Job Summary We are looking for a skilled and proactive AWS Cloud DevOps / Site Reliability Engineer (SRE) to become a key member of our team. This position requires a blend of software engineering and cloud operations skills to design, build, and maintain scalable, secure, and reliable cloud infrastructure leveraging AWS services. The ideal candidate possesses extensive experience in DevOps best practices, cloud infrastructure automation, CI/CD pipelines, monitoring, and incident response. Key Responsibilities • Establish and maintain efficient and reliable Azure DevOps CI/CD pipelines to enable seamless integration and delivery processes across environments. • Manage source code repositories using version control tools, implementing branching strategies and overseeing release management. • Implement and manage infrastructure using Infrastructure as Code (IaC) tools such as Terraform and Terragrunt. • Integrate IaC workflows into CI/CD pipelines for automated and consistent deployments. • Oversee scalable, secure, and highly available AWS infrastructure, utilizing services such as Lambda, CloudWatch, EC2, S3, RDS, DynamoDB, API Gateway, and VPC. • Maintain reusable Terragrunt and Terraform modules and templates to support standardized infrastructure patterns. • Develop and operate monitoring and alerting systems to ensure high availability and resilience, leveraging automation, alerting, and auto-healing strategies. • Identify and address performance bottlenecks, optimize system resources, and implement effective scaling solutions to accommodate growth. • Troubleshoot infrastructure and application issues, conduct root cause analyses, and lead incident response efforts. • Implement security best practices, conduct vulnerability assessments, manage IAM roles and policies, and respond promptly to security incidents. • Continuously evaluate existing systems, tools, and processes to identify and execute improvements that enhance efficiency, reliability, and scalability. • Create and maintain comprehensive documentation for infrastructure, processes, and best practices. • Collaborate closely with developers to support deployment, performance, and reliability requirements for services. • Conduct incident response activities and root cause analyses for infrastructure issues, optimizing cloud infrastructure for both performance and cost. • Develop and maintain documentation and runbooks for operational procedures and troubleshooting. • Participate in an on-call rotation to provide SRE support. Required Qualifications • Practical experience in a DevOps or SRE position. • Hands-on expertise with monitoring and logging tools, including CloudWatch, Splunk, and Dynatrace. • Strong proficiency with AWS services such as S3, RDS, Lambda, API Gateway, VPC, IAM, EventBridge, and serverless functions. • Experience with Infrastructure as Code, preferably Terraform. • Proficiency in scripting languages, including Python, TypeScript, and Boto3. • Comprehensive understanding of CI/CD tools and processes. • Experience with observability and monitoring tools. • Solid understanding of networking, security, and cloud cost management. Soft Skills • Excellent verbal and written communication skills. • Ability to work independently as well as collaboratively within a team environment. • Strong problem-solving and debugging abilities. • Enthusiasm for automation and enhancing system reliability.

Behavioural Competencies : Proven experience of delivering process efficiencies and improvements Clear and fluent English (both verbal and written) Ability to build and maintain efficient working relationships with remote teams Demonstrate ability to take ownership of and accountability for relevant products and services Ability to plan, prioritize and complete your own work, whilst remaining a team player Willingness to engage with and work in other technologies

CGI is an equal opportunity employer. In addition, CGI is committed to providing accommodations for people with disabilities in accordance with provincial legislation. Please let us know if you require a reasonable accommodation due to a disability during any aspect of the recruitment process and we will work with you to address your needs. Life at CGI: It is rooted in ownership, teamwork, respect and belonging. Here, you’ll reach your full potential because… You are invited to be an owner from day 1 as we work together to bring our Dream to life. That’s why we call ourselves CGI Partners rather than employees. We benefit from our collective success and actively shape our company’s strategy and direction Your work creates value. You’ll develop innovative solutions and build relationships with teammates and clients while accessing global capabilities to scale your ideas, embrace new opportunities, and benefit from expansive industry and technology expertise You’ll shape your career by joining a company built to grow and last. You’ll be supported by leaders who care about your health and well-being and provide you with opportunities to deepen your skills and broaden your horizons Come join our team, one of the largest IT and business consulting services firms in the world.

Skills:

DevOps, Terraform

Skills

AzureSplunkDevOpsIamCI/CDTypeScriptPythonTerraformDynamodbSecurityAWS