Senior Big Data Platform Engineer – C12 – AVP - Pune

Citi
Full_timePune, India

📍 Job Overview

  • Job Title: Senior Big Data Platform Engineer – C12 – AVP - Pune
  • Company: Citi
  • Location: Pune, Maharashtra, India
  • Job Type: Full-time
  • Category: Data Engineer
  • Date Posted: August 1, 2025
  • Experience Level: 5-10 years
  • Remote Status: On-site

🚀 Role Summary

  • Lead the installation, configuration, and maintenance of Cloudera clusters, ensuring high availability and security.
  • Collaborate with application teams to manage and review Hadoop cluster connectivity, security, and performance.
  • Develop and automate processes for maintenance and monitoring of the environment using Python, Java, and Shell Script.
  • Plan and execute major platform software and operating system upgrades and maintenance across physical environments.
  • Create and maintain detailed, up-to-date technical documentation.

📝 Enhancement Note: This role requires a strong background in big data technologies and cloud services, with a focus on Hadoop, Hive, Spark, and Kafka. Familiarity with AWS is also essential for success in this position.

💻 Primary Responsibilities

  • Cluster Management: Install, configure, and maintain Cloudera clusters, ensuring high availability and security.
  • Security Measures: Implement security measures for all aspects of the cluster, including SSL, disk encryption, and role-based access via Cloudera Sentry.
  • Performance Tuning: Review performance stats and query execution/explain plans; recommend changes for tuning Hive/Impala queries.
  • Collaboration: Collaborate with application teams to install operating system and Hadoop updates, patches, version upgrades when required.
  • Documentation: Create and maintain detailed, up-to-date technical documentation.

📝 Enhancement Note: This role requires a deep understanding of big data technologies and the ability to troubleshoot problems quickly. Strong communication skills are also essential for collaborating with application teams and stakeholders.

🎓 Skills & Qualifications

Education: Bachelor’s degree/University degree

Experience: 5+ years of experience with big data tools, 3+ years with Python, Java, and Shell Script, 2+ years with cloud services platform (preferably AWS)

Required Skills:

  • Proficient in big data tools: Hadoop, Hive, HBase, Spark, Kafka, Impala, Solr, Phoenix
  • Strong programming skills in Python, Java, and Shell Script
  • Experience with cloud services platform (preferably AWS)
  • Ability to troubleshoot problems and quickly resolve issues
  • Cluster maintenance and creation/removal of nodes
  • Performance tuning of Hadoop clusters
  • Screen Hadoop cluster job performances and capacity planning
  • Monitor Hadoop cluster connectivity and security
  • Manage and review Hadoop log files
  • File system management and monitoring
  • HDFS support and maintenance

Preferred Skills:

  • Familiarity with Agile methodologies
  • Experience with CI/CD pipelines
  • Knowledge of infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation)
  • Experience with containerization (e.g., Docker, Kubernetes)

📝 Enhancement Note: While not explicitly stated, familiarity with Agile methodologies, CI/CD pipelines, and infrastructure as code tools would be beneficial for success in this role. Additionally, experience with containerization could be an asset for managing and deploying big data applications.

📊 Web Portfolio & Project Requirements

Portfolio Essentials:

  • Demonstrate experience with big data tools, highlighting projects that showcase your ability to install, configure, and maintain Cloudera clusters.
  • Include examples of performance tuning and optimization, as well as security measures implemented for big data clusters.
  • Showcase your programming skills in Python, Java, and Shell Script with relevant projects or code snippets.

Technical Documentation:

  • Provide examples of technical documentation created for big data projects, demonstrating your ability to create clear, concise, and well-organized documentation.
  • Include any process documentation or guides related to big data cluster maintenance and management.

📝 Enhancement Note: While a portfolio is not explicitly mentioned in the job listing, demonstrating your technical skills and experience through relevant projects and documentation can significantly strengthen your application.

💵 Compensation & Benefits

Salary Range: INR 2,000,000 - 3,000,000 per annum (Estimated based on industry standards for senior data engineers in Pune, India)

Benefits:

  • Competitive health, dental, and vision insurance
  • Retirement savings plan with company match
  • Generous time-off policies, including vacation, sick leave, and holidays
  • Employee discounts on various products and services
  • Professional development opportunities, including training and certifications

Working Hours: Full-time, typically 40 hours per week, with flexibility for project deadlines and maintenance windows

📝 Enhancement Note: The salary range provided is an estimate based on industry standards for senior data engineers in Pune, India. Actual compensation may vary based on factors such as experience, skills, and market conditions.

🎯 Team & Company Context

🏢 Company Culture

Industry: Financial Services

Company Size: Large (Over 250,000 employees)

Founded: 1812

Team Structure:

  • The data engineering team is part of the broader Technology division, working closely with application development teams and other technology functions.
  • The team is structured with a mix of senior, mid-level, and junior engineers, providing ample opportunities for mentorship and knowledge sharing.

Development Methodology:

  • Agile methodologies are used for project management, with a focus on iterative development and continuous improvement.
  • Code reviews, testing, and quality assurance practices are in place to ensure high-quality deliverables.
  • Deployment strategies, CI/CD pipelines, and server management are essential aspects of the role.

Company Website: Citi

📝 Enhancement Note: Citi is a large, global financial services company with a strong focus on technology and innovation. The company values collaboration, continuous learning, and a culture of excellence.

📈 Career & Growth Analysis

Data Engineering Career Level: Senior Data Engineer

Reporting Structure: Reports directly to the Data Engineering Manager, with a dotted line to the relevant application development teams.

Technical Impact: Responsible for ensuring the availability, performance, and security of big data platforms, directly impacting the success of data-driven applications and services.

Growth Opportunities:

  • Technical Growth: Expand your expertise in big data technologies, cloud services, and emerging data engineering trends.
  • Leadership Development: Gain experience managing and mentoring junior team members, with opportunities to take on more significant leadership roles as you grow within the organization.
  • Architecture Decisions: Contribute to strategic decisions related to big data architecture, helping to shape the future of the company's data ecosystem.

📝 Enhancement Note: This role offers significant growth potential for experienced data engineers looking to advance their careers in a large, global organization. Opportunities for technical specialization, leadership development, and strategic decision-making are all available to the right candidate.

🌐 Work Environment

Office Type: Modern, collaborative office space with dedicated workstations, meeting rooms, and breakout areas.

Office Location(s): Pune, Maharashtra, India

Workspace Context:

  • Collaboration: The office is designed to foster collaboration and communication, with open-plan workspaces and dedicated team areas.
  • Workstation: Each employee has access to a dedicated workstation with multiple monitors and testing devices.
  • Flexibility: The work environment offers flexibility for remote work, with a hybrid work arrangement available for eligible roles.

Work Schedule: Full-time, with flexible hours for deployment windows, maintenance, and project deadlines.

📝 Enhancement Note: Citi's Pune office provides a modern, collaborative work environment that supports the needs of data engineers and other technology professionals. The hybrid work arrangement offers flexibility for employees, allowing them to balance their work and personal lives effectively.

📄 Application & Technical Interview Process

Interview Process:

  1. Phone/Video Screen: A brief conversation to assess your communication skills and overall fit for the role.
  2. Technical Assessment: A hands-on assessment focused on big data technologies, cloud services, and programming skills in Python, Java, and Shell Script.
  3. On-site/Video Interview: A comprehensive interview with the hiring manager and other team members, focusing on your technical skills, experience, and cultural fit.
  4. Final Decision: A decision is made based on the results of the interview process, and a job offer is extended to the successful candidate.

Portfolio Review Tips:

  • Highlight your experience with big data technologies, focusing on projects that demonstrate your ability to install, configure, and maintain Cloudera clusters.
  • Include examples of performance tuning, security measures, and technical documentation created for big data projects.
  • Showcase your programming skills in Python, Java, and Shell Script with relevant projects or code snippets.

Technical Challenge Preparation:

  • Brush up on your big data technologies, cloud services, and programming skills in Python, Java, and Shell Script.
  • Practice common data engineering interview questions, focusing on system design, performance optimization, and troubleshooting.
  • Familiarize yourself with Citi's data engineering team and the broader technology division, understanding their goals, challenges, and priorities.

📝 Enhancement Note: The interview process for this role is designed to assess your technical skills, experience, and cultural fit. By preparing thoroughly and demonstrating your expertise in big data technologies and cloud services, you can increase your chances of success in the interview process.

🛠 Technology Stack & Web Infrastructure

Big Data Technologies:

  • Hadoop
  • Hive
  • HBase
  • Spark
  • Kafka
  • Impala
  • Solr
  • Phoenix

Cloud Services:

  • AWS (Preferred)

Programming Languages:

  • Python
  • Java
  • Shell Script

Infrastructure Tools:

  • Cloudera Manager
  • Ansible
  • Terraform (Preferred)
  • CloudFormation (Preferred)

📝 Enhancement Note: Familiarity with the specified big data technologies, cloud services, and programming languages is essential for success in this role. Additionally, experience with infrastructure as code tools like Terraform and CloudFormation can be beneficial for managing and deploying big data applications.

👥 Team Culture & Values

Data Engineering Values:

  • Expertise: Demonstrate a deep understanding of big data technologies and a commitment to continuous learning and improvement.
  • Collaboration: Work effectively with cross-functional teams, including application development, data science, and business stakeholders.
  • Innovation: Embrace new technologies and approaches to drive improvements in data processing, storage, and analysis.
  • Quality: Ensure high-quality deliverables through rigorous testing, code reviews, and quality assurance practices.

Collaboration Style:

  • Cross-functional Integration: Work closely with application development, data science, and business teams to understand requirements, define data models, and ensure data quality.
  • Code Review Culture: Participate in code reviews to ensure high-quality deliverables and knowledge sharing among team members.
  • Knowledge Sharing: Contribute to a culture of learning and growth by sharing your expertise with junior team members and other colleagues.

📝 Enhancement Note: Citi's data engineering team values expertise, collaboration, innovation, and quality. By embracing these values and working effectively with cross-functional teams, you can make a significant impact on the success of data-driven applications and services.

⚡ Challenges & Growth Opportunities

Technical Challenges:

  • Scalability: Design and implement big data architectures that can scale to meet the growing demands of the business.
  • Performance Optimization: Identify and address performance bottlenecks in big data processing, storage, and analysis.
  • Security & Compliance: Ensure the security and compliance of big data platforms, protecting sensitive data and adhering to regulatory requirements.
  • Emerging Technologies: Stay up-to-date with the latest big data technologies and trends, integrating new tools and approaches into existing data ecosystems.

Learning & Development Opportunities:

  • Technical Skill Development: Expand your expertise in big data technologies, cloud services, and emerging data engineering trends through training, certifications, and hands-on projects.
  • Leadership Development: Gain experience managing and mentoring junior team members, developing your leadership skills and preparing for more significant roles within the organization.
  • Architecture Decisions: Contribute to strategic decisions related to big data architecture, helping to shape the future of the company's data ecosystem.

📝 Enhancement Note: This role presents numerous technical challenges and growth opportunities for experienced data engineers looking to advance their careers in a large, global organization. By embracing these challenges and pursuing continuous learning and development, you can make a significant impact on the success of data-driven applications and services.

💡 Interview Preparation

Technical Questions:

  • Big Data Technologies: Demonstrate your expertise in big data technologies, including Hadoop, Hive, Spark, Kafka, Impala, Solr, and Phoenix.
  • Cloud Services: Showcase your experience with cloud services, with a preference for AWS.
  • Programming Skills: Highlight your proficiency in Python, Java, and Shell Script, providing examples of relevant projects or code snippets.
  • System Design: Discuss your approach to designing and implementing big data architectures, focusing on scalability, performance, and security.
  • Performance Tuning: Explain your process for identifying and addressing performance bottlenecks in big data processing, storage, and analysis.

Company & Culture Questions:

  • Data Engineering Team: Research Citi's data engineering team, understanding their goals, challenges, and priorities.
  • Company Culture: Familiarize yourself with Citi's company culture, values, and mission, demonstrating your alignment with the organization's objectives.
  • Data-Driven Decision Making: Explain your approach to using data to drive business decisions, highlighting your experience with data analysis, visualization, and reporting.

Portfolio Presentation Strategy:

  • Big Data Projects: Highlight your experience with big data technologies, focusing on projects that demonstrate your ability to install, configure, and maintain Cloudera clusters.
  • Performance Tuning: Include examples of performance tuning and optimization, as well as security measures implemented for big data clusters.
  • Technical Documentation: Showcase your ability to create clear, concise, and well-organized technical documentation for big data projects.

📝 Enhancement Note: By preparing thoroughly and demonstrating your expertise in big data technologies, cloud services, and programming skills, you can increase your chances of success in the interview process. Additionally, researching Citi's data engineering team and company culture can help you tailor your responses and showcase your alignment with the organization's objectives.

📌 Application Steps

To apply for this Senior Big Data Platform Engineer – C12 – AVP - Pune position:

  1. Customize Your Resume: Tailor your resume to highlight your experience with big data technologies, cloud services, and programming skills in Python, Java, and Shell Script.
  2. Prepare Your Portfolio: Showcase your experience with big data technologies, focusing on projects that demonstrate your ability to install, configure, and maintain Cloudera clusters. Include examples of performance tuning, security measures, and technical documentation created for big data projects.
  3. Practice Interview Questions: Brush up on your big data technologies, cloud services, and programming skills in Python, Java, and Shell Script. Practice common data engineering interview questions, focusing on system design, performance optimization, and troubleshooting.
  4. Research Citi: Familiarize yourself with Citi's data engineering team, company culture, values, and mission. Understand their goals, challenges, and priorities to demonstrate your alignment with the organization's objectives.

⚠️ Important Notice: This enhanced job description includes AI-generated insights and data engineering industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.

Application Requirements

Candidates should have over 5 years of experience with big data tools and at least 3 years of experience in Python, Java, and Shell scripting. Familiarity with cloud services, particularly AWS, is also required.