Sr SRE Engineer

Zensar
Full_timeIndia

📍 Job Overview

  • Job Title: Sr SRE Engineer
  • Company: Zensar
  • Location: Hyderabad, Telangana, India
  • Job Type: On-site, Full-time
  • Category: DevOps, Site Reliability Engineering
  • Date Posted: June 20, 2025
  • Experience Level: 5-10 years

🚀 Role Summary

  • Key Responsibilities: Design, implement, and maintain highly available and scalable systems, collaborate with cross-functional teams, monitor systems, automate tasks, and improve system performance.
  • Key Skills: Linux/Unix systems, scripting languages, configuration management, cloud platforms, networking principles, containerization technologies, monitoring tools, Citrix technologies, virtualization technologies, problem-solving, and communication skills.

📝 Enhancement Note: This role requires a strong background in Linux/Unix systems and scripting languages, as well as experience with configuration management tools and cloud platforms. Proficiency in containerization technologies and monitoring tools is also essential for success in this role.

💻 Primary Responsibilities

  • System Design & Implementation: Design and implement highly available and scalable systems to ensure the reliability and performance of the company's website or application.
  • Cross-Functional Collaboration: Collaborate with cross-functional teams to define and establish service level objectives (SLOs) and service level agreements (SLAs) for critical systems.
  • System Monitoring: Monitor systems and applications, proactively identifying and resolving performance bottlenecks or availability issues.
  • Alerts & Dashboards: Develop and maintain monitoring tools, alerts, and dashboards to provide visibility into system health and performance.
  • Incident Analysis: Conduct post-incident analyses to identify root causes and implement preventive measures to avoid future incidents.
  • Automation: Automate repetitive tasks and processes to improve efficiency and reduce manual intervention.
  • Documentation: Create and maintain documentation for system architecture, configuration, and troubleshooting procedures.
  • Capacity Planning: Perform capacity planning and resource allocation to ensure optimal system performance and scalability.
  • Deployment Collaboration: Collaborate with development teams to implement and deploy new features and enhancements, ensuring they meet reliability and performance standards.
  • Industry Best Practices: Stay up to date with industry best practices, new technologies, and emerging trends in site reliability engineering.

📝 Enhancement Note: This role requires a strong focus on system design, monitoring, and automation to ensure optimal system performance and reliability. Candidates should have experience with various monitoring tools and be able to analyze and resolve complex technical issues.

🎓 Skills & Qualifications

Education: Bachelor's degree in computer science, engineering, or a related field.

Experience: Proven experience as a Site Reliability Engineer or a similar role.

Required Skills:

  • Strong knowledge of Linux/Unix systems and command line tools.
  • Proficiency in scripting languages such as Python, Shell, or Perl.
  • Experience with configuration management tools like Ansible, Puppet, or Chef.
  • Familiarity with cloud platforms like AWS, Azure, or Google Cloud.
  • Understanding of networking principles and protocols (TCP/IP, HTTP, DNS, etc.).
  • Knowledge of containerization technologies (Docker, Kubernetes) and orchestration tools.
  • Expertise in monitoring and logging tools such as Prometheus, Grafana, ELK stack, or Splunk.
  • Experience with Citrix technologies such as XenApp, XenDesktop, and NetScaler.
  • Support the administration and engineering of the Citrix environment.
  • Work with Citrix Provisioning Server, SQL Database, and Citrix License Server.
  • Experienced knowledge of virtualization technologies such as VMware or Hyper-V.
  • Strong problem-solving and troubleshooting skills, with the ability to analyze and resolve complex technical issues.
  • Excellent communication and collaboration skills to work effectively with cross-functional teams.
  • Strong attention to detail and ability to work in a fast-paced, dynamic environment.
  • Terraform basic syntax and GitLab CI/CD configuration, pipelines, jobs.
  • Cloud resources provisioning and configuration through CLI/API.
  • Understanding of how to do basic queries in logs tools for general questions.
  • Operating system (Linux) configuration, package management, startup and troubleshooting.
  • Block and object storage configuration.
  • Networking VPCs, proxies, and CDNs.

Preferred Skills:

  • Certification in relevant technologies or frameworks (e.g., AWS Certified DevOps Engineer, Certified Kubernetes Administrator).
  • Familiarity with continuous integration/continuous deployment (CI/CD) pipelines.
  • Experience with source control systems such as Git or SVN.
  • Knowledge of security best practices and experience implementing security measures in a production environment.
  • Ability to work independently and handle multiple projects and priorities simultaneously.
  • Strong analytical and problem-solving skills, with a focus on continuous improvement and automation.

📝 Enhancement Note: This role requires a strong technical background with a focus on Linux/Unix systems, scripting languages, and configuration management tools. Candidates should also have experience with cloud platforms and monitoring tools to succeed in this role.

📊 Web Portfolio & Project Requirements

Portfolio Essentials:

  • Demonstrate strong knowledge of Linux/Unix systems and scripting languages through relevant projects.
  • Showcase experience with configuration management tools and cloud platforms through case studies or project documentation.
  • Highlight proficiency in containerization technologies and monitoring tools through live demos or project walkthroughs.
  • Display expertise in Citrix technologies and virtualization through relevant project examples.

Technical Documentation:

  • Provide detailed documentation for system architecture, configuration, and troubleshooting procedures for past projects.
  • Include examples of capacity planning and resource allocation strategies used in previous roles.
  • Showcase experience with Terraform, GitLab CI/CD, and cloud resources provisioning through project case studies.

📝 Enhancement Note: This role requires a strong focus on system design, monitoring, and automation. Candidates should provide detailed documentation and case studies demonstrating their experience with various monitoring tools and their ability to analyze and resolve complex technical issues.

💵 Compensation & Benefits

Salary Range: INR 1,200,000 - 1,800,000 per annum (Based on experience and market standards for Site Reliability Engineers in Hyderabad, India)

Benefits:

  • Competitive salary package.
  • Performance-based bonuses and incentives.
  • Health, dental, and vision insurance.
  • Retirement savings plans.
  • Employee stock purchase plan.
  • Professional development opportunities and training programs.
  • Flexible work arrangements and remote work options.

Working Hours: 40 hours per week, with flexible hours and overtime as needed to ensure system availability and performance.

📝 Enhancement Note: The salary range for this role is based on market standards for Site Reliability Engineers in Hyderabad, India. Benefits may vary based on the company's policies and the candidate's eligibility.

🎯 Team & Company Context

🏢 Company Culture

Industry: Information Technology and Services.

Company Size: Medium (250-1,000 employees).

Founded: 1991.

Team Structure:

  • The DevOps team consists of Site Reliability Engineers, DevOps Engineers, and System Administrators.
  • The team follows an Agile/Scrum development methodology, with regular sprint planning and stand-up meetings.
  • The team works closely with cross-functional teams, including development, quality assurance, and project management, to ensure the reliability and performance of the company's applications.

Development Methodology:

  • The team uses version control systems like Git for collaborative development and code reviews.
  • They implement continuous integration and continuous deployment (CI/CD) pipelines using tools like Jenkins or GitLab CI/CD.
  • The team uses infrastructure as code (IaC) tools like Terraform or CloudFormation for automated deployment and configuration management.

Company Website: Zensar Technologies

📝 Enhancement Note: Zensar is a medium-sized IT services company with a strong focus on Agile/Scrum development methodologies. The DevOps team works closely with cross-functional teams to ensure the reliability and performance of the company's applications.

📈 Career & Growth Analysis

Web Technology Career Level: Senior Site Reliability Engineer (SRE) - Responsible for designing, implementing, and maintaining highly available and scalable systems, as well as monitoring and improving system performance.

Reporting Structure: Reports directly to the Head of DevOps or a similar role, with cross-functional collaboration with development, quality assurance, and project management teams.

Technical Impact: Has a significant impact on the reliability, performance, and scalability of the company's applications, as well as the overall user experience.

Growth Opportunities:

  • Technical Leadership: With experience and strong performance, candidates can progress to become a Technical Lead or Manager, responsible for leading teams and driving technical strategy.
  • Architecture & Design: Candidates can specialize in architecture and design, focusing on designing and implementing complex, highly available, and scalable systems.
  • Emerging Technologies: Candidates can explore emerging technologies and trends in site reliability engineering, such as AI/ML-driven operations, chaos engineering, or multi-cloud environments.

📝 Enhancement Note: This role offers significant growth opportunities for candidates with strong technical skills and a focus on continuous learning and improvement. Candidates can progress to technical leadership roles or specialize in architecture and design, depending on their interests and strengths.

🌐 Work Environment

Office Type: Modern, collaborative office space with dedicated workstations, meeting rooms, and breakout areas.

Office Location(s): Hyderabad, Telangana, India.

Workspace Context:

  • The office provides a collaborative work environment with dedicated workstations, multiple monitors, and testing devices available for web development and server administration tasks.
  • The team works closely together, with regular stand-up meetings and code reviews to ensure knowledge sharing and continuous learning.
  • The office is located in a convenient area with easy access to public transportation and amenities.

Work Schedule: Flexible work hours with core hours between 10:00 AM and 6:00 PM, with the option to work remotely or from home as needed.

📝 Enhancement Note: The office provides a collaborative work environment with dedicated workstations and multiple monitors, as well as easy access to public transportation and amenities. The flexible work schedule allows for a healthy work-life balance.

📄 Application & Technical Interview Process

Interview Process:

  1. Online Assessment: A technical assessment focusing on Linux/Unix systems, scripting languages, and cloud platforms.
  2. Technical Deep Dive: A detailed discussion of the candidate's experience with configuration management tools, containerization technologies, and monitoring tools.
  3. Behavioral & Cultural Fit: An assessment of the candidate's problem-solving skills, communication skills, and cultural fit within the team.
  4. Final Interview: A meeting with the hiring manager or a senior team member to discuss the candidate's fit for the role and the team.

Portfolio Review Tips:

  • Highlight relevant projects demonstrating experience with Linux/Unix systems, scripting languages, and cloud platforms.
  • Include case studies or project documentation showcasing experience with configuration management tools, containerization technologies, and monitoring tools.
  • Provide examples of system architecture, capacity planning, and resource allocation strategies used in previous roles.

Technical Challenge Preparation:

  • Brush up on Linux/Unix systems, scripting languages, and cloud platforms.
  • Review configuration management tools, containerization technologies, and monitoring tools.
  • Practice problem-solving and troubleshooting exercises to improve analytical and problem-solving skills.

ATS Keywords: Linux, Unix, Scripting, Cloud, Configuration Management, Containerization, Monitoring, Citrix, Virtualization, Problem-Solving, Communication, Terraform, GitLab CI/CD, Cloud Resources Provisioning, Operating System Configuration, Storage Configuration, Networking.

📝 Enhancement Note: The interview process for this role focuses on the candidate's technical skills and problem-solving abilities, as well as their cultural fit within the team. Candidates should be prepared to discuss their experience with various technologies and provide examples of their work.

🛠 Technology Stack & Web Infrastructure

Frontend Technologies: Not applicable for this role.

Backend & Server Technologies:

  • Linux/Unix systems (Ubuntu, CentOS, RedHat).
  • Scripting languages (Python, Shell, Perl).
  • Configuration management tools (Ansible, Puppet, Chef).
  • Cloud platforms (AWS, Azure, Google Cloud).
  • Containerization technologies (Docker, Kubernetes).
  • Monitoring tools (Prometheus, Grafana, ELK stack, Splunk).
  • Citrix technologies (XenApp, XenDesktop, NetScaler).
  • Virtualization technologies (VMware, Hyper-V).
  • Infrastructure as code (IaC) tools (Terraform, CloudFormation).
  • Version control systems (Git, SVN).
  • Continuous integration/continuous deployment (CI/CD) tools (Jenkins, GitLab CI/CD).

Development & DevOps Tools:

  • Version control systems (Git, SVN).
  • Continuous integration/continuous deployment (CI/CD) tools (Jenkins, GitLab CI/CD).
  • Infrastructure as code (IaC) tools (Terraform, CloudFormation).
  • Monitoring tools (Prometheus, Grafana, ELK stack, Splunk).
  • Logging tools (ELK stack, Splunk).
  • Configuration management tools (Ansible, Puppet, Chef).
  • Containerization technologies (Docker, Kubernetes).
  • Cloud platforms (AWS, Azure, Google Cloud).
  • Virtualization technologies (VMware, Hyper-V).

📝 Enhancement Note: This role requires a strong background in Linux/Unix systems, scripting languages, and cloud platforms. Candidates should have experience with configuration management tools, containerization technologies, and monitoring tools to succeed in this role.

👥 Team Culture & Values

Web Development Values:

  • Reliability: Ensure the reliability and performance of the company's applications through system design, implementation, and monitoring.
  • Scalability: Design and implement scalable systems that can handle increased load and traffic.
  • Automation: Automate repetitive tasks and processes to improve efficiency and reduce manual intervention.
  • Continuous Improvement: Continuously monitor and improve system performance, with a focus on pushing capabilities forward and innovating for continual improvement.

Collaboration Style:

  • Cross-Functional Collaboration: Work closely with development, quality assurance, and project management teams to ensure the reliability and performance of the company's applications.
  • Code Reviews: Conduct regular code reviews to ensure knowledge sharing and continuous learning.
  • Mentoring & Knowledge Sharing: Provide mentoring and knowledge sharing opportunities to help team members grow and develop their skills.

📝 Enhancement Note: The team values reliability, scalability, automation, and continuous improvement, with a focus on designing and implementing highly available and scalable systems. The team works closely with cross-functional teams to ensure the reliability and performance of the company's applications.

⚡ Challenges & Growth Opportunities

Technical Challenges:

  • System Design & Scalability: Design and implement highly available and scalable systems that can handle increased load and traffic.
  • Monitoring & Alerting: Develop and maintain monitoring tools, alerts, and dashboards to provide visibility into system health and performance.
  • Incident Management: Conduct post-incident analyses to identify root causes and implement preventive measures to avoid future incidents.
  • Automation: Automate repetitive tasks and processes to improve efficiency and reduce manual intervention.
  • Emerging Technologies: Stay up to date with industry best practices, new technologies, and emerging trends in site reliability engineering.

Learning & Development Opportunities:

  • Technical Skill Development: Develop technical skills in emerging technologies, such as AI/ML-driven operations, chaos engineering, or multi-cloud environments.
  • Leadership Development: Develop leadership skills through mentoring, coaching, and team management opportunities.
  • Architecture & Design: Specialize in architecture and design, focusing on designing and implementing complex, highly available, and scalable systems.
  • Conferences & Training: Attend industry conferences, webinars, and training programs to stay up to date with the latest trends and best practices in site reliability engineering.

📝 Enhancement Note: This role offers significant technical challenges and growth opportunities for candidates with strong technical skills and a focus on continuous learning and improvement. Candidates can develop their skills in emerging technologies, leadership, architecture and design, or through industry conferences and training programs.

💡 Interview Preparation

Technical Questions:

  • Linux/Unix Systems: Describe your experience with Linux/Unix systems and command line tools. Provide examples of how you have used them to solve complex technical issues.
  • Scripting Languages: Explain your proficiency in scripting languages such as Python, Shell, or Perl. Provide examples of how you have used them to automate tasks and improve efficiency.
  • Configuration Management: Discuss your experience with configuration management tools like Ansible, Puppet, or Chef. Describe how you have used them to manage and automate system configuration.
  • Cloud Platforms: Explain your familiarity with cloud platforms like AWS, Azure, or Google Cloud. Describe how you have used them to provision and manage cloud resources.
  • Monitoring Tools: Describe your expertise in monitoring and logging tools such as Prometheus, Grafana, ELK stack, or Splunk. Provide examples of how you have used them to monitor system health and performance.

Company & Culture Questions:

  • Team Dynamics: Describe your experience working in a collaborative, cross-functional team environment. Explain how you have contributed to team success and knowledge sharing.
  • Agile Methodologies: Discuss your experience with Agile/Scrum development methodologies. Explain how you have used them to improve system reliability and performance.
  • Incident Management: Describe your approach to incident management and resolution. Provide examples of how you have conducted post-incident analyses and implemented preventive measures.

Portfolio Presentation Strategy:

  • Project Walkthrough: Provide a detailed walkthrough of relevant projects, highlighting your experience with Linux/Unix systems, scripting languages, and cloud platforms.
  • Case Studies: Include case studies or project documentation showcasing your experience with configuration management tools, containerization technologies, and monitoring tools.
  • Architecture & Design: Showcase your expertise in system architecture and design, with examples of how you have implemented highly available and scalable systems.

📝 Enhancement Note: The interview process for this role focuses on the candidate's technical skills and problem-solving abilities, as well as their cultural fit within the team. Candidates should be prepared to discuss their experience with various technologies and provide examples of their work.

📌 Application Steps

To apply for this Sr SRE Engineer position:

  1. Customize Your Resume: Tailor your resume to highlight your experience with Linux/Unix systems, scripting languages, and cloud platforms. Include relevant keywords and examples of your work.
  2. Prepare Your Portfolio: Curate a portfolio showcasing your experience with configuration management tools, containerization technologies, and monitoring tools. Include case studies or project documentation to demonstrate your skills and accomplishments.
  3. Practice Technical Exercises: Brush up on your technical skills and practice problem-solving and troubleshooting exercises to improve your analytical and problem-solving abilities.
  4. Research the Company: Learn about Zensar's company culture, industry, and technology stack. Prepare questions to ask during the interview process to demonstrate your interest and engagement.
  5. Apply: Submit your application through the application link provided in the job listing. Follow the instructions carefully and provide all the required information.

⚠️ Important Notice: This enhanced job description includes AI-generated insights and web development/server administration industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.


Application Requirements

Candidates should have a bachelor's degree in a related field and proven experience as a Site Reliability Engineer or similar role. Strong knowledge of software development methodologies and DevOps principles is essential.