Bangalore, Karnataka, India
16 hours ago
Senior Site Reliability Engineer

The Senior Site Reliability Engineer plays a crucial role within a small team, ensuring our critical services are secure, reliable, cost-effective, performant, and operationally excellent. This position demands a versatile professional who can contribute across development, system operations, resiliency testing, security hardening, and performance engineering. The ideal candidate is comfortable tackling new engineering challenges, conceptualising solutions, and implementing designs collaboratively. This role is pivotal in guiding our organisation towards modern application and infrastructure management practices while fostering the team's growth and skills development.

Key responsibilities include:

Addressing the most complex problems impacting the team's products Developing innovative tools and processes to solve high-level challenges Advocating for and modelling best practices, particularly for junior team members Building trust and relationships with product development teams Collaborating with development teams to diagnose and resolve systems issues Mentoring junior team members in their SRE journey Demonstrating deep knowledge across the team's product portfolio Ensuring consistent process implementation across multiple applications

Minimum Qualifications:

Minimum of 3-6 years of relevant infrastructure development and software support experience Experience architecting cloud-based solutions on AWS Proficiency in managing cloud infrastructure on AWS Strong familiarity with Linux operating systems Proficiency in scripting languages like Ruby, Python, or Bash Experience with Terraform

Nice to Have:

Experience with pipeline processes and implementations (e.g., Jenkins and Groovy) Solid understanding of SDLC and Agile methodologies Familiarity with cloud computing concepts, particularly AWS Broad understanding of diverse infrastructure platforms and concepts Versatility in troubleshooting various hosting technologies (web servers, Java platforms, OS, networks, virtualisation, databases) Knowledge of general networking concepts (CDN, WAF, DNS, PKI) Understanding of security policies and implementation Familiarity with backup and disaster recovery concepts Experience in a production environment supporting mission-critical applications Knowledge of standard production practices, including change management

This role requires a proactive and adaptable individual who can thrive in a dynamic environment and drive innovation, service reliability, and performance excellence.

Confirm your E-mail: Send Email