Infra Tech Lead Analyst - VP
📍 Job Overview
- Job Title: Infra Tech Lead Analyst - VP
- Company: Citi
- Location: Irving, Texas, United States
- Job Type: On-site
- Category: Infrastructure
- Date Posted: June 24, 2025
- Experience Level: 5-10 years
- Remote Status: On-site
🚀 Role Summary
- Lead the monitoring, troubleshooting, and maintenance of AWS/GCP environments to ensure high availability and performance.
- Drive incident management, root cause analysis, and collaborate with engineering teams to resolve issues.
- Provide technical leadership, automate repetitive tasks, and improve operational processes.
- Ensure ongoing compliance with regulatory requirements and implement security best practices.
📝 Enhancement Note: This role requires a strong background in cloud operations and infrastructure delivery, with a focus on incident management and process improvement. The ideal candidate will have experience working in a large, complex, and global environment, preferably in the financial services industry.
💻 Primary Responsibilities
-
Environment Monitoring & Management:
- Monitor AWS/GCP infrastructure and services to ensure availability, performance, and reliability.
- Implement and maintain monitoring, logging, and alerting tools (e.g., CloudWatch, Stackdriver, Prometheus).
-
Incident Management & Resolution:
- Perform incident management, including triage, impact assessment, and coordination with engineering teams to resolve issues.
- Participate in on-call rotation for high severity/major incidents support coverage.
- Provide Root Cause Analysis (RCA) post-restoration of service.
-
Process Improvement & Automation:
- Design testing approaches, complex processes, and assist with the automation of repetitive tasks.
- Identify and automate repetitive operational tasks to reduce toil.
- Create, maintain, and enhance operational runbooks, SPOs, and knowledge base articles.
-
Collaboration & Stakeholder Management:
- Collaborate with product, engineering, security, and other stakeholders towards value-adding outcomes.
- Act as a subject matter expert (SME) to senior stakeholders and/or other team members.
- Provide technical/strategic direction to team members and contribute to team growth and development.
🎓 Skills & Qualifications
Education: Bachelor's/University degree or equivalent experience in a relevant field (e.g., Computer Science, Information Technology, or a related discipline).
Experience: At least 6+ years of experience in roles centered around infrastructure delivery, with a proven track record of operational process change and improvement. Experience in cloud operations/support and site reliability is essential.
Required Skills:
- Hands-on experience with AWS and/or GCP.
- Proficiency with Infrastructure as Code (IaC) tools like Terraform, CloudFormation, and working knowledge of scripting (bash, Python, or similar).
- Strong understanding of networking, DNS, IAM, load balancing, and cloud-native services.
- Ability to develop projects required for design of metrics, analytical tools, benchmarking activities, and best practices.
- Ability to work with virtual/in-person teams and work under pressure/deadlines.
- Effective written and verbal communication skills, with the ability to communicate technical concepts well to non-technical audiences.
Preferred Skills:
- Experience in a financial services or large, complex, and/or global environment.
- Familiarity with Agile methodologies and continuous integration/continuous deployment (CI/CD) pipelines.
📊 Web Portfolio & Project Requirements
-
Portfolio Essentials:
- Demonstrate experience in cloud infrastructure management, incident resolution, and process improvement.
- Showcase your ability to monitor, troubleshoot, and maintain AWS/GCP environments.
- Highlight your experience with incident management, root cause analysis, and collaboration with engineering teams.
-
Technical Documentation:
- Provide examples of operational runbooks, SPOs, and knowledge base articles you have created or maintained.
- Demonstrate your ability to design testing approaches, complex processes, and automate repetitive tasks.
💵 Compensation & Benefits
Salary Range: $125,760 - $188,640 per year (based on the primary location full-time salary range provided by the company).
Benefits:
- Medical, dental, and vision coverage.
- 401(k) plan.
- Life, accident, and disability insurance.
- Wellness programs.
- Paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays.
Working Hours: Full-time, typically 40 hours per week.
🎯 Team & Company Context
Company Culture:
- Industry: Financial Services.
- Company Size: Large, global organization.
- Founded: 1812 (headquartered in New York, NY, USA).
- Team Structure: The Infra Tech Lead Analyst role is part of the Infrastructure team, which works closely with product, engineering, security, and other stakeholders to deliver and maintain high-quality cloud infrastructure services.
- Development Methodology: Agile methodologies are used for software development and project management.
Company Website: https://www.citi.com/
📝 Enhancement Note: Citi is a large, global financial services company with a strong focus on technology and innovation. The company values diversity, inclusion, and collaboration, and offers a wide range of benefits and growth opportunities for its employees.
📈 Career & Growth Analysis
Web Technology Career Level: This role is a senior-level position, focusing on technical leadership, process improvement, and incident management within the cloud infrastructure domain.
Reporting Structure: The Infra Tech Lead Analyst reports directly to the Head of Infrastructure or a similar role, depending on the organization's structure.
Technical Impact: This role has a significant impact on the performance, availability, and security of the company's cloud infrastructure. The Infra Tech Lead Analyst is responsible for ensuring that the infrastructure meets the needs of the business and complies with regulatory requirements.
Growth Opportunities:
- Technical Growth: Develop expertise in cloud infrastructure management, incident resolution, and process improvement. Stay up-to-date with emerging technologies and best practices in cloud operations and site reliability.
- Leadership Growth: Gain experience in managing teams, mentoring team members, and driving process improvement initiatives. Develop strong communication and stakeholder management skills to collaborate effectively with various teams and stakeholders.
- Career Progression: Progress to more senior roles, such as Head of Infrastructure, Director of Cloud Operations, or other senior leadership positions within the technology organization.
📝 Enhancement Note: Citi offers a wide range of growth opportunities for employees, including technical training, mentorship programs, and career development resources. The company values internal promotions and provides opportunities for employees to grow and advance within the organization.
🌐 Work Environment
Office Type: On-site, with a large, global office network.
Office Location(s): Irving, Texas, United States (with additional offices worldwide).
Workspace Context:
- Collaborative Environment: Work closely with various teams, including product, engineering, security, and other infrastructure teams, to deliver and maintain high-quality cloud infrastructure services.
- Development Tools & Resources: Access to industry-standard tools, technologies, and resources for cloud infrastructure management, incident resolution, and process improvement.
- Team Interaction: Collaborate with team members, stakeholders, and other teams to ensure the smooth operation of the cloud infrastructure and resolve any incidents or issues that arise.
Work Schedule: Full-time, typically 40 hours per week, with on-call rotation for high severity/major incidents support coverage.
📝 Enhancement Note: Citi offers a flexible work environment, with opportunities for remote work and hybrid arrangements, depending on the role and business needs. The company values work-life balance and provides resources and support to help employees manage their personal and professional responsibilities.
📄 Application & Technical Interview Process
Interview Process:
- Phone Screen: A brief phone call to discuss your experience, qualifications, and fit for the role.
- Technical Deep Dive: A comprehensive technical interview focusing on cloud infrastructure management, incident resolution, and process improvement. Be prepared to discuss your experience with AWS/GCP, Infrastructure as Code tools, and scripting.
- Behavioral & Cultural Fit: An interview focused on your problem-solving skills, communication style, and cultural fit within the team and organization.
- Final Decision: A final interview with the hiring manager or a panel of stakeholders to discuss your qualifications, fit, and career aspirations.
Portfolio Review Tips:
- Highlight your experience in cloud infrastructure management, incident resolution, and process improvement.
- Showcase your ability to monitor, troubleshoot, and maintain AWS/GCP environments.
- Demonstrate your experience with incident management, root cause analysis, and collaboration with engineering teams.
- Provide examples of operational runbooks, SPOs, and knowledge base articles you have created or maintained.
- Be prepared to discuss your approach to process improvement, automation, and stakeholder management.
Technical Challenge Preparation:
- Brush up on your knowledge of AWS/GCP, Infrastructure as Code tools, and scripting.
- Familiarize yourself with cloud-native services, networking, DNS, IAM, and load balancing concepts.
- Prepare for behavioral and situational interview questions that focus on problem-solving, communication, and teamwork.
ATS Keywords: AWS, GCP, Cloud Operations, Site Reliability, Infrastructure as Code, IaC, Terraform, CloudFormation, Scripting, Bash, Python, Networking, DNS, IAM, Load Balancing, Cloud Native Services, Incident Management, Root Cause Analysis, Process Improvement, Agile, CI/CD, On-Call Rotation, Technical Leadership, Stakeholder Management, Financial Services, Global Organization, Team Collaboration, Career Growth, Cloud Infrastructure.
📝 Enhancement Note: Citi uses Applicant Tracking Systems (ATS) to manage job applications and resumes. Including relevant keywords in your resume and application materials can help ensure that your qualifications are properly matched with the role and increase your chances of being selected for an interview.
🛠 Technology Stack & Web Infrastructure
Cloud Platforms:
- Amazon Web Services (AWS)
- Google Cloud Platform (GCP)
Infrastructure as Code (IaC) Tools:
- Terraform
- CloudFormation
Scripting Languages:
- Bash
- Python
Monitoring & Logging Tools:
- CloudWatch (AWS)
- Stackdriver (GCP)
- Prometheus
Incident Management & Collaboration Tools:
- PagerDuty
- OpsGenie
- Slack
- Microsoft Teams
Version Control Systems:
- Git
- GitHub
- Bitbucket
📝 Enhancement Note: Citi uses a wide range of tools and technologies to manage its cloud infrastructure and ensure high availability, performance, and security. The specific tools and technologies used may vary depending on the role and business needs.
👥 Team Culture & Values
Citi's Core Values:
- Client Centricity: Putting clients at the center of everything we do.
- Responsibility: Taking personal accountability for our actions and their outcomes.
- Integrity: Upholding the highest ethical standards in all that we do.
- Collaboration: Working together to deliver exceptional results.
- Inclusion: Valuing diversity and fostering an inclusive work environment.
- Respect: Treating colleagues, clients, and partners with dignity and respect.
Citi's Approach to Technology:
- Innovation: Embracing new technologies and approaches to drive business value.
- Agility: Adapting quickly to changing market conditions and customer needs.
- Quality: Delivering high-quality products and services that meet or exceed customer expectations.
- Security: Prioritizing the security of our systems, data, and customers.
Citi's Approach to Teamwork:
- Collaboration: Working together to achieve common goals and deliver exceptional results.
- Communication: Communicating effectively and transparently with colleagues, clients, and partners.
- Support: Providing assistance and resources to help colleagues succeed.
- Recognition: Acknowledging and celebrating the achievements of colleagues and teams.
📝 Enhancement Note: Citi's culture is built on a foundation of integrity, respect, and collaboration. The company values diversity, inclusion, and innovation, and offers a supportive and engaging work environment for employees.
⚡ Challenges & Growth Opportunities
Technical Challenges:
- Cloud Infrastructure Management: Stay up-to-date with the latest AWS/GCP features, services, and best practices. Develop expertise in managing complex, large-scale cloud environments.
- Incident Resolution: Hone your incident management and troubleshooting skills to resolve complex issues quickly and effectively. Develop a deep understanding of AWS/GCP services and their interactions.
- Process Improvement: Continuously identify and implement process improvements to enhance efficiency, reduce toil, and improve the overall quality of cloud infrastructure services.
- Stakeholder Management: Develop strong communication and stakeholder management skills to collaborate effectively with various teams and stakeholders. Build relationships and influence others to drive process improvement and achieve common goals.
Learning & Development Opportunities:
- Technical Skill Development: Expand your knowledge of AWS/GCP, Infrastructure as Code tools, and scripting. Stay up-to-date with emerging technologies and best practices in cloud operations and site reliability.
- Leadership Development: Develop your leadership skills through mentoring, coaching, and training programs. Gain experience in managing teams, driving process improvement initiatives, and making strategic decisions.
- Career Progression: Progress to more senior roles, such as Head of Infrastructure, Director of Cloud Operations, or other senior leadership positions within the technology organization. Expand your knowledge of the broader business and develop a strategic perspective on technology and its role in driving business value.
📝 Enhancement Note: Citi offers a wide range of learning and development opportunities to help employees grow and advance in their careers. The company provides technical training, mentorship programs, and career development resources to support employees' professional growth.
💡 Interview Preparation
Technical Questions:
- Cloud Infrastructure Management:
- Describe your experience with AWS/GCP, Infrastructure as Code tools, and scripting.
- How have you managed complex, large-scale cloud environments in the past?
- Can you walk me through a complex incident you've resolved, and how you approached it?
- Incident Resolution:
- How do you approach troubleshooting and incident resolution in a cloud environment?
- Can you describe a time when you had to resolve a complex issue under pressure?
- How do you ensure that incidents are properly documented and lessons are learned for the future?
- Process Improvement:
- How do you identify and implement process improvements in a cloud operations environment?
- Can you provide an example of a process improvement you've implemented, and the results it achieved?
- How do you measure the success of process improvements, and how do you ensure they are sustained over time?
Company & Culture Questions:
- Citi's Core Values: How do you exemplify Citi's core values in your work, and how have you seen them demonstrated by others?
- Teamwork & Collaboration: Can you describe a time when you worked collaboratively with others to achieve a common goal? What was the outcome, and what did you learn from the experience?
- Adaptability & Agility: How have you adapted to changes in technology, processes, or business priorities in the past? Can you provide an example of a time when you had to pivot quickly to meet new challenges or opportunities?
Portfolio Presentation Strategy:
- Cloud Infrastructure Management: Highlight your experience in managing AWS/GCP environments, incident resolution, and process improvement. Showcase your ability to monitor, troubleshoot, and maintain cloud infrastructure services.
- Incident Resolution: Demonstrate your problem-solving skills, communication style, and ability to work effectively under pressure. Provide examples of complex incidents you've resolved and the steps you took to ensure a successful outcome.
- Process Improvement: Showcase your ability to identify and implement process improvements, and the positive impact they've had on cloud infrastructure services and team efficiency. Provide data-driven examples of the results achieved and the benefits realized.
📝 Enhancement Note: Citi's interview process is designed to assess your technical skills, problem-solving abilities, and cultural fit within the team and organization. By preparing thoroughly and demonstrating your expertise in cloud infrastructure management, incident resolution, and process improvement, you can increase your chances of success in the interview process.
📌 Application Steps
To apply for the Infra Tech Lead Analyst - VP position at Citi:
- Submit Your Application: Click on the "Apply" button on the job listing page and follow the prompts to submit your resume and other required documents.
- Tailor Your Resume: Highlight your relevant experience in cloud infrastructure management, incident resolution, and process improvement. Include specific examples of your achievements and the impact you've made in previous roles. Use relevant keywords from the job description to optimize your resume for the Applicant Tracking System (ATS).
- Prepare for Phone Screen: Brush up on your knowledge of AWS/GCP, Infrastructure as Code tools, and scripting. Familiarize yourself with Citi's core values and approach to technology. Be ready to discuss your experience, qualifications, and fit for the role.
- Research Citi: Learn about Citi's history, culture, and business. Understand the company's approach to technology, innovation, and customer centricity. Prepare questions to ask during the interview process to demonstrate your interest in the role and the company.
⚠️ Important Notice: This enhanced job description includes AI-generated insights and web technology industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.
Application Requirements
Candidates should have at least 6 years of experience in infrastructure delivery with a focus on cloud operations and site reliability. Proficiency in AWS/GCP, Infrastructure as Code tools, and strong analytical skills are essential.