Senior Site Reliability Engineer II
๐ Job Overview
- Job Title: Senior Site Reliability Engineer II
- Company: Remitly
- Location: Mumbai, Mahฤrฤshtra, India
- Job Type: On-site
- Category: DevOps & Site Reliability Engineering
- Date Posted: June 25, 2025
๐ Role Summary
- DevOps/Site Reliability Engineer (SRE) role with a strong operational mindset and broad systems knowledge.
- Automate, optimize, and improve systems through smart scripting, tool integration, and an eye for observability and cost efficiency.
- Work in a mixed environment (Linux and Windows) with a diverse stack including containers, GitHub workflows, and infrastructure-as-code.
๐ป Primary Responsibilities
- Develop and manage GitHub Actions/Workflows to automate routine operations (e.g., password rotation, environment stop/start).
- Optimize CI/CD pipelines to streamline deployments and system updates.
- Implement and manage alerting, anomaly detection, and metrics collection.
- Build and maintain dashboards and reports using CloudWatch and other tools to monitor health and usage.
- Support and enhance container-based environments (Docker or equivalent).
- Maintain infrastructure through tools like Terraform and Ansible (if applicable).
- Assist in migration and support of Windows-based services to Linux, including light scripting in PowerShell and .NET (F# or C#).
- Identify and implement opportunities for cloud cost reduction, particularly through usage monitoring.
๐ Skills & Qualifications
Education: Bachelor's degree in Computer Science, Engineering, or a related field. Relevant experience may be considered in lieu of a degree.
Experience: 5-10 years of experience in DevOps or Site Reliability Engineering.
Required Skills:
- Experience with DevOps or SRE skills
- Good experience working on AWS cloud or any cloud services
- Familiarity with PowerShell and Windows Server (legacy support)
- Hands-on experience with GitHub and GitHub Workflows
- Hands-on experience with infrastructure as code โ Terraform and/or Ansible
- Familiarity working on Bash or equivalent scripting languages. Linux and containers (Docker or similar)
- Strong understanding of observability tools and practices (metrics, alerting, dashboards)
- Ability to perform light coding or debugging in .NET (F# or C#)
Preferred Skills:
- Experience with Kubernetes or similar container orchestration platforms
- Familiarity with infrastructure as code (IaC) best practices and security principles
- Knowledge of CI/CD best practices and pipelines
- Experience with monitoring and logging tools (e.g., Prometheus, ELK Stack)
- Familiarity with cloud cost management tools and practices
๐ Web Portfolio & Project Requirements
Portfolio Essentials:
- Demonstrate your experience with GitHub Actions/Workflows, CI/CD pipelines, and infrastructure as code.
- Showcase your scripting skills with examples of automation, optimization, and problem-solving.
- Highlight your observability and monitoring skills with examples of alerting, metrics collection, and dashboard creation.
Technical Documentation:
- Detailed documentation of your GitHub Actions/Workflows, CI/CD pipelines, and infrastructure as code.
- Explain your approach to cloud cost reduction and optimization.
- Include any relevant case studies or success stories demonstrating your impact on system reliability and performance.
๐ต Compensation & Benefits
Salary Range: INR 2,500,000 - 3,500,000 per annum (region-appropriate for Mumbai, Mahฤrฤshtra, India, based on experience level and industry standards)
Benefits:
- Competitive health, dental, and vision insurance
- Retirement savings plan with company match
- Generous time off and leave policies
- Employee assistance program
- Professional development opportunities and tuition reimbursement
- Employee discounts and perks
Working Hours: 40 hours per week, with flexible working hours and remote work options available for some roles.
๐ฏ Team & Company Context
๐ข Company Culture
Industry: Technology, focusing on risk assessment, identity authentication, and data management solutions.
Company Size: Medium-sized team within the larger LexisNexis Risk Solutions organization.
Founded: 1979 (LexisNexis), with the Risk Solutions division established in 2000.
Team Structure:
- Collaborative and cross-functional teams working on various software products and services.
- Diverse skill sets within the team, including software engineers, SREs, data scientists, and product managers.
Development Methodology:
- Agile development methodologies with regular sprint planning, code reviews, and testing practices.
- Infrastructure as code (IaC) principles for automated deployment and configuration management.
- Continuous integration and continuous deployment (CI/CD) pipelines for streamlined software delivery.
Company Website: LexisNexis Risk Solutions
๐ Career & Growth Analysis
Web Technology Career Level: Senior Site Reliability Engineer II, with significant experience in DevOps or SRE roles and a strong operational mindset.
Reporting Structure: Reports directly to the Site Reliability Engineering Manager, with a dotted line to the Director of Engineering for technical guidance and mentorship.
Technical Impact: Responsible for the stability, performance, and cost-efficiency of critical systems and services, ensuring minimal downtime and optimal resource utilization.
Growth Opportunities:
- Technical leadership opportunities within the Site Reliability Engineering team and across the broader organization.
- Mentoring and knowledge-sharing opportunities with junior team members and other engineers.
- Emerging technology adoption and innovation, driving improvements in system reliability and performance.
๐ Work Environment
Office Type: On-site, with a modern and collaborative office space in Mumbai, India.
Office Location(s): Mumbai, with additional offices in Chennai and Gurgaon for regional support.
Workspace Context:
- Dedicated workspace with multiple monitors and ergonomic furniture.
- Access to relevant tools and software for efficient work and learning.
- Collaborative workspaces for team meetings, brainstorming sessions, and knowledge-sharing events.
Work Schedule: Standard business hours with flexible working hours and remote work options available for some roles.
๐ Technology Stack & Web Infrastructure
Frontend Technologies: Not applicable for this role.
Backend & Server Technologies:
- AWS cloud services (EC2, RDS, Lambda, etc.)
- Linux and Windows Server operating systems
- Containers (Docker, Kubernetes)
- GitHub for version control and collaboration
- Infrastructure as code (IaC) tools (Terraform, Ansible)
- Monitoring and logging tools (CloudWatch, Prometheus, ELK Stack)
- CI/CD pipelines (GitHub Actions, Jenkins)
- Server management tools (Ansible, Puppet)
Development & DevOps Tools:
- Version control with Git and GitHub
- CI/CD pipelines for automated testing and deployment
- Infrastructure as code (IaC) tools for automated deployment and configuration management
- Monitoring and logging tools for system health and performance tracking
- Container orchestration platforms (Kubernetes, Docker Swarm)
Database Technologies: Not applicable for this role.
๐ฅ Team Culture & Values
Web Development Values:
- Collaboration and knowledge-sharing to drive continuous learning and improvement.
- Customer-focused approach to ensure high-quality software and services.
- Innovation and problem-solving to tackle complex challenges and drive business value.
- Quality and excellence in all aspects of work, from coding standards to customer interactions.
Collaboration Style:
- Cross-functional collaboration between software engineers, SREs, product managers, and other teams.
- Regular team meetings and stand-ups to discuss progress, obstacles, and solutions.
- Code reviews and pair programming for knowledge-sharing and quality assurance.
- Mentoring and coaching opportunities for professional growth and development.
โก Challenges & Growth Opportunities
Technical Challenges:
- Legacy system migration and modernization to improve performance and scalability.
- Cost optimization through resource utilization, automation, and infrastructure as code.
- Emerging technology adoption and integration to drive innovation and competitive advantage.
- Complex system troubleshooting and root cause analysis to minimize downtime and impact.
Learning & Development Opportunities:
- Emerging technology workshops and training sessions to stay current with industry trends and best practices.
- Conferences and meetups attendance to network with peers and gain insights into new tools and techniques.
- Mentoring and coaching relationships with senior team members and industry experts.
- On-the-job training and project-based learning to develop new skills and deepen existing ones.
๐ก Interview Preparation
Technical Questions:
- System design and architecture questions focusing on scalability, performance, and fault tolerance.
- Troubleshooting and root cause analysis scenarios to evaluate problem-solving skills and technical depth.
- Scripting and automation challenges to assess coding proficiency and automation mindset.
- Observability and monitoring questions to evaluate understanding of metrics, alerting, and dashboard creation.
Company & Culture Questions:
- Company-specific challenges and how you would approach them in this role.
- Team dynamics and how you would contribute to a collaborative and productive work environment.
- Customer focus and how you would ensure high-quality software and services that meet user needs and expectations.
- Innovation and problem-solving approaches to drive business value and competitive advantage.
Portfolio Presentation Strategy:
- Demonstrate your experience with GitHub Actions/Workflows, CI/CD pipelines, and infrastructure as code.
- Showcase your scripting skills with examples of automation, optimization, and problem-solving.
- Highlight your observability and monitoring skills with examples of alerting, metrics collection, and dashboard creation.
- Explain your approach to cloud cost reduction and optimization, with relevant case studies or success stories.
๐ Application Steps
To apply for this Senior Site Reliability Engineer II position:
- Update your resume to highlight your relevant experience with DevOps, SRE, and cloud technologies.
- Prepare your portfolio to showcase your GitHub Actions/Workflows, CI/CD pipelines, and infrastructure as code projects.
- Research the company and team to understand their culture, values, and technical challenges.
- Prepare for technical interviews by brushing up on your system design, scripting, and automation skills.
- Practice common interview questions and develop your responses to demonstrate your problem-solving skills and technical depth.
โ ๏ธ Important Notice: This enhanced job description includes AI-generated insights and web technology industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.
Application Requirements
Experience in DevOps or Site Reliability Engineering is required, along with familiarity with AWS and scripting languages. Candidates should have hands-on experience with GitHub, infrastructure as code tools, and observability practices.