Infrastructure Engineer- SRE- Senior Analyst

Citi
Full_timeβ€’Chennai, India

πŸ“ Job Overview

  • Job Title: Infrastructure Engineer - SRE - Senior Analyst
  • Company: Citi
  • Location: Chennai, Tamil Nādu, India
  • Job Type: On-site
  • Category: DevOps, Infrastructure
  • Date Posted: 2025-07-06
  • Experience Level: 5-10 years

πŸš€ Role Summary

  • Key Responsibilities: Provide technical leadership for GCG applications, drive non-production stability, and enable SRE transformation.
  • Key Skills: Service Reliability Engineering, ITIL, Cloud Technology, Automation Technologies, Distributed Consumer Applications, Problem Management, Risk Management, Change Management, CI/CD Pipeline, Analytical Skills, Communication Skills, Technical Leadership, Database Management, Programming, Monitoring, Operational Readiness.

πŸ“ Enhancement Note: This role requires a blend of traditional ITIL-based production management and service reliability engineering, with a focus on driving stability and improving system service levels through automation and AI technologies.

πŸ’» Primary Responsibilities

  • Technical Leadership: Provide expertise and lead the technical direction for GCG applications, driving non-production stability and enabling SRE transformation.
  • Collaboration: Partner with app Dev community, CTI partners, TPM, and other stakeholders to identify value chain, conduct POCs, and plug gap areas.
  • Service Readiness: Execute robust service readiness and facilitate standard toolset adoption for all services in the domain.
  • Incident Management: Provide expertise support to L1 technical recovery center on P1/P2 incidents, create PMRs, and drive service improvements.
  • Risk Management: Identify and classify risks in the non-production estate, create service improvement plans, and drive them to closure.
  • Operational Readiness: Create operational readiness documents for major initiatives and provide handover to the production team in a seamless manner.
  • Monitoring: Overall accountability for monitoring and its usage by stakeholders, working with the monitoring team for setup and overall accountability.
  • Representation: Represent DevOps team in various digital forums and facilitate generation of reports and presentations.
  • Technical Proficiency: Be proficient in various technologies such as OSE, Kubernetes, APIGEE, Platform services, DataPower, Google Cloud, AWS, CI/CD pipeline, ITIL, and service management.

πŸ“ Enhancement Note: This role requires a strong understanding of various technologies and the ability to provide expert support and guidance to other teams.

πŸŽ“ Skills & Qualifications

Education:

  • Bachelor’s/University degree or equivalent experience
  • Certification in Site Reliability Engineer, Sales Force, or Cloud-Based Certification like AWS

Experience:

  • 6-10 years of development or production support experience with North America Consumer applications
  • Experience or familiarity with cloud technology is a plus
  • Solid ITIL foundation understanding
  • Engineering background in system admin, development, DevOps, or equivalent field, preferably with experience in distributed consumer applications
  • Experience or familiarity with automation technologies, advanced analytics, and predictive modeling
  • Experience with databases such as Oracle, MSSQL, MongoDB, Teradata, DB2
  • Experience in programming in one of the following languages: UNIX shell scripting, Java, etc.
  • Competent with cloud concepts such as API, web services, and microservices
  • Strong analytical, algorithmic, and critical thinking skills

Required Skills:

  • Fluent English
  • Strong analytical skills, strong critical thinking skills, and ability to logically break down tasks into smaller manageable parts
  • Solid understanding of systems and application design
  • Systematic problem-solving approach
  • Effective communication skills and sense of ownership and drive
  • Adaptable and can work with large complex and multi-team-owned services
  • Extremely organized, detailed-oriented, and thorough in every aspect
  • Ability to balance multiple tasks and projects effectively while adapting to new variables
  • Utilizing creative and innovative thinking but also adhering to a powerful sense of ownership, customer service, and integrity demonstrated through clear communication
  • Drive, self-motivated, and eager to learn

πŸ“Š Web Portfolio & Project Requirements

Portfolio Essentials:

  • Demonstrate expertise in various technologies such as OSE, Kubernetes, APIGEE, Platform services, DataPower, Google Cloud, AWS, CI/CD pipeline, ITIL, and service management.
  • Showcase problem-solving skills and ability to drive service improvements through incident management and risk management.
  • Highlight technical leadership skills and ability to collaborate with various teams to drive non-production stability and enable SRE transformation.
  • Display strong analytical and critical thinking skills through real-world examples and case studies.

Technical Documentation:

  • Provide detailed documentation of incident management processes, risk management strategies, and service improvement plans.
  • Showcase code quality, commenting, and documentation standards for programming languages such as UNIX shell scripting and Java.
  • Demonstrate version control, deployment processes, and server configuration management skills.
  • Highlight testing methodologies, performance metrics, and optimization techniques for various technologies and applications.

πŸ“ Enhancement Note: This role requires a strong portfolio that showcases technical expertise, problem-solving skills, and ability to drive service improvements through incident management and risk management.

πŸ’΅ Compensation & Benefits

Salary Range: INR 1,200,000 - 1,800,000 per annum (region-appropriate web development and server administration industry standard based on experience level, location, and regional cost of living)

Benefits:

  • Competitive salary and performance-based bonuses
  • Comprehensive health, dental, and vision care plans
  • Retirement savings plans with company match
  • Generous time-off and leave policies
  • Employee discounts and perks
  • Opportunities for professional development and career growth

Working Hours: Full-time position with standard working hours and flexible arrangements for deployment windows, maintenance, and project deadlines.

πŸ“ Enhancement Note: The salary range is estimated based on regional web development and server administration industry standards, regional cost of living, and experience level. Actual salary may vary based on company-specific factors and individual qualifications.

🎯 Team & Company Context

Company Culture:

  • Industry: Financial Services
  • Company Size: Large (200,000+ employees)
  • Founded: 1812
  • Team Structure: Large, global teams with cross-functional collaboration between developers, designers, and stakeholders.
  • Development Methodology: Agile/Scrum methodologies, code review, testing, and quality assurance practices, deployment strategies, CI/CD pipelines, and server management.
  • Company Website: Citi

πŸ“ Enhancement Note: Citi's large, global teams require strong collaboration and communication skills to drive service improvements and enable SRE transformation.

Career & Growth Analysis:

  • Web Technology Career Level: Senior Analyst - Technical Leadership
  • Reporting Structure: Reports directly to the SRE Manager, with cross-functional collaboration with app Dev community, CTI partners, TPM, and other stakeholders.
  • Technical Impact: Drives non-production stability, enables SRE transformation, and provides expert support and guidance to other teams.
  • Growth Opportunities:
    • Technical leadership and architecture decision-making
    • Mentoring and knowledge sharing with junior team members
    • Career progression to Senior Manager or Principal Engineer roles

πŸ“ Enhancement Note: This role offers significant growth opportunities for technical leadership and architecture decision-making, with potential career progression to Senior Manager or Principal Engineer roles.

🌐 Work Environment

Office Type: Large, global offices with multiple locations and remote work arrangements for some teams. Office Location(s): Chennai, Tamil Nādu, India (primary location for this role) Workspace Context:

  • Large, collaborative workspaces with multiple monitors and testing devices available
  • Cross-functional collaboration with developers, designers, and stakeholders
  • Knowledge sharing, technical mentoring, and continuous learning opportunities

Work Schedule: Full-time position with standard working hours and flexible arrangements for deployment windows, maintenance, and project deadlines.

πŸ“ Enhancement Note: Citi's large, global offices require strong collaboration and communication skills to drive service improvements and enable SRE transformation.

πŸ“„ Application & Technical Interview Process

Interview Process:

  1. Technical Preparation: Brush up on technical skills related to Service Reliability Engineering, ITIL, cloud technologies, automation, and incident management.
  2. Problem-Solving: Prepare for problem-solving scenarios related to incident management, risk management, and service improvement.
  3. Collaboration: Practice explaining technical concepts and collaborating with team members to drive service improvements.
  4. Technical Deep Dive: Prepare for deep dives into specific technologies such as OSE, Kubernetes, APIGEE, Platform services, DataPower, Google Cloud, AWS, CI/CD pipeline, ITIL, and service management.

Portfolio Review Tips:

  • Highlight real-world examples of incident management, risk management, and service improvement.
  • Showcase technical leadership skills and ability to collaborate with various teams.
  • Demonstrate strong analytical and critical thinking skills through case studies and problem-solving scenarios.

Technical Challenge Preparation:

  • Brush up on technical skills related to Service Reliability Engineering, ITIL, cloud technologies, automation, and incident management.
  • Practice problem-solving scenarios related to incident management, risk management, and service improvement.
  • Prepare for technical deep dives into specific technologies and applications.

ATS Keywords: Service Reliability Engineering, ITIL, Cloud Technology, Automation Technologies, Distributed Consumer Applications, Problem Management, Risk Management, Change Management, CI/CD Pipeline, Analytical Skills, Communication Skills, Technical Leadership, Database Management, Programming, Monitoring, Operational Readiness, Incident Management, Risk Management, Service Improvement, Portfolio, Technical Leadership, Architecture Decision-Making, Mentoring, Knowledge Sharing, Career Progression, Technical Deep Dive, Problem-Solving, Collaboration, Technical Skills, Web Portfolio, Case Studies, Real-World Examples, ATS Keywords, Web Development, Server Administration, Infrastructure, DevOps, Cloud Technology, ITIL, Service Management, Incident Management, Risk Management, Service Improvement, Technical Leadership, Architecture Decision-Making, Mentoring, Knowledge Sharing, Career Progression, Technical Skills, Web Portfolio, Case Studies, Real-World Examples.

πŸ“ Enhancement Note: This role requires a strong technical interview process that focuses on problem-solving, collaboration, and technical deep dives into specific technologies and applications.

πŸ›  Technology Stack & Web Infrastructure

Frontend Technologies: Not applicable for this role. Backend & Server Technologies:

  • OSE, Kubernetes, APIGEE, Platform services, DataPower, Google Cloud, AWS
  • Experience with databases such as Oracle, MSSQL, MongoDB, Teradata, DB2
  • Programming languages such as UNIX shell scripting, Java, etc.

Development & DevOps Tools:

  • CI/CD pipeline, ITIL, and service management
  • Monitoring tools for server monitoring and performance tracking

πŸ“ Enhancement Note: This role requires expertise in various backend and server technologies, with a focus on cloud technologies, automation, and incident management.

πŸ‘₯ Team Culture & Values

Web Development Values:

  • User-centric design and customer experience focus
  • Performance optimization and accessibility standards
  • Code quality approach and collaborative development practices
  • Innovation expectations and emerging technology adoption

Collaboration Style:

  • Cross-functional integration between developers, designers, and stakeholders
  • Code review culture and peer programming practices
  • Knowledge sharing, technical mentoring, and continuous learning

πŸ“ Enhancement Note: Citi's large, global teams require strong collaboration and communication skills to drive service improvements and enable SRE transformation.

⚑ Challenges & Growth Opportunities

Technical Challenges:

  • Driving non-production stability and enabling SRE transformation in a large, global organization
  • Collaborating with various teams to improve service levels and implement service automation
  • Providing expert support and guidance to other teams while balancing multiple tasks and projects

Learning & Development Opportunities:

  • Technical leadership and architecture decision-making
  • Mentoring and knowledge sharing with junior team members
  • Career progression to Senior Manager or Principal Engineer roles

πŸ“ Enhancement Note: This role offers significant growth opportunities for technical leadership and architecture decision-making, with potential career progression to Senior Manager or Principal Engineer roles.

πŸ’‘ Interview Preparation

Technical Questions:

  • Service Reliability Engineering, ITIL, cloud technologies, automation, and incident management
  • Problem-solving scenarios related to incident management, risk management, and service improvement
  • Technical deep dives into specific technologies and applications

Company & Culture Questions:

  • Citi's large, global teams and cross-functional collaboration
  • Technical leadership and architecture decision-making
  • Mentoring and knowledge sharing with junior team members
  • Career progression to Senior Manager or Principal Engineer roles

Portfolio Presentation Strategy:

  • Highlight real-world examples of incident management, risk management, and service improvement.
  • Showcase technical leadership skills and ability to collaborate with various teams.
  • Demonstrate strong analytical and critical thinking skills through case studies and problem-solving scenarios.

πŸ“ Enhancement Note: This role requires a strong interview preparation process that focuses on problem-solving, collaboration, and technical deep dives into specific technologies and applications.

πŸ“Œ Application Steps

To apply for this Infrastructure Engineer - SRE - Senior Analyst position:

  1. Submit your application through the application link provided.
  2. Customize your web portfolio with live demos and responsive examples, highlighting real-world examples of incident management, risk management, and service improvement.
  3. Optimize your resume for web technology roles, emphasizing project highlights and technical skills relevant to this role.
  4. Prepare for the technical interview process, focusing on problem-solving, collaboration, and technical deep dives into specific technologies and applications.
  5. Research Citi's company culture, web development teams, and the role's specific requirements to demonstrate a strong understanding of the company and the position.

⚠️ Important Notice: This enhanced job description includes AI-generated insights and web development/server administration industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.

Application Requirements

Candidates should have 6-10 years of experience in development or production support, particularly with North America Consumer applications. A solid understanding of ITIL and experience with cloud technologies and automation is also required.