Senior Cloud Operations Engineer - AWS
📍 Job Overview
- Job Title: Senior Cloud Operations Engineer - AWS
- Company: Endava
- Location: Cluj-Napoca, Cluj, Romania
- Job Type: Full-time
- Category: DevOps, Infrastructure
- Date Posted: 2025-06-19
- Experience Level: 5-10 years
- Remote Status: On-site/Hybrid
🚀 Role Summary
- Manage and maintain AWS cloud infrastructure for business-critical services, ensuring high availability, performance, and reliability.
- Collaborate with development and application teams to streamline deployments and operational processes.
- Build and manage AWS cloud environments using Infrastructure as Code (Terraform) and implement automation for operational tasks.
📝 Enhancement Note: This role requires a strong background in AWS and Terraform, with a focus on operational excellence and ITIL-based service management.
💻 Primary Responsibilities
- Incident Management: Rapid detection, investigation, and resolution of infrastructure incidents to minimize business impact.
- Request Fulfillment: Handle and fulfill infrastructure service requests within agreed service levels.
- Problem Management: Identify root causes of recurring issues and implement permanent fixes to prevent recurrence.
- Infrastructure Management: Ensure infrastructure availability for business-critical services in a 24/7 support model, including participation in on-call rotations.
- Automation and IaC: Build and manage AWS cloud environments using Infrastructure as Code (Terraform) and automate operational tasks.
- Collaboration: Work with development and application teams to optimize deployments and operational processes.
- Documentation: Maintain detailed documentation of infrastructure configurations, operating procedures, and troubleshooting guides.
- Compliance: Ensure compliance with security, governance, and operational standards across all environments.
- Capacity Planning: Participate in capacity planning, cost optimization, and performance tuning activities.
- Change Management: Support change management processes, ensuring safe and auditable infrastructure changes.
🎓 Skills & Qualifications
Education: Bachelor's degree in Computer Science, Engineering, or a related field. Relevant certifications (e.g., AWS Certified Solutions Architect, AWS Certified DevOps Engineer) are a plus.
Experience: Proven experience (5-10 years) managing AWS cloud infrastructure and strong hands-on experience with Infrastructure as Code using Terraform. Experience supporting and operating production infrastructure in a 24/7 environment is essential.
Required Skills:
- Strong technical acumen in AWS and Terraform
- Solid grasp of ITIL-based service management
- Experience in incident, request, and problem management
- Strong scripting skills in at least one language (e.g., Bash, Python, PowerShell)
- Familiarity with networking, security groups, load balancers, IAM policies, and monitoring in cloud environments
- Strong communication and collaboration skills with the ability to interface effectively with technical and non-technical stakeholders
Preferred Skills:
- Experience with observability tools such as Splunk, CloudWatch, Datadog, or similar
- AWS certifications
📊 Portfolio & Project Requirements
Portfolio Essentials:
- Demonstrate experience in managing AWS cloud infrastructure and Infrastructure as Code using Terraform.
- Showcase successful incident management, request fulfillment, and problem management cases.
- Highlight automation and scripting projects that improved system consistency and reliability.
Technical Documentation:
- Provide detailed documentation of infrastructure configurations, operating procedures, and troubleshooting guides.
- Include examples of capacity planning, cost optimization, and performance tuning activities.
- Demonstrate understanding of security, governance, and operational standards in cloud environments.
💵 Compensation & Benefits
Salary Range: The estimated salary range for this role in Cluj-Napoca, Romania, is approximately 35,000 - 50,000 RON per year (based on market research and experience level). The estimate includes gross salary and any additional benefits.
Benefits:
- Competitive salary package
- Share plan
- Company performance bonuses
- Value-based recognition awards
- Referral bonus
- Career coaching
- Global career opportunities
- Non-linear career paths
- Internal development programmes for management and technical leadership
- Complex projects, rotations, internal tech communities, training, certifications, coaching, online learning platform subscriptions, pass-it-on sessions, workshops, and conferences
- Global internal wellbeing programme
- Access to wellbeing apps
- Inclusion and diversity programmes
Working Hours: Full-time, with a commitment to 24/7 support coverage and participation in on-call rotations.
🎯 Team & Company Context
Company Culture
- Industry: Endava is a global technology company focused on delivering innovative solutions for various industries, including finance, telecom, media, and retail.
- Company Size: Endava has over 10,000 employees across multiple locations worldwide, providing ample opportunities for collaboration and growth.
- Founded: 2000
- Team Structure: The cloud operations team consists of experienced engineers responsible for managing and maintaining AWS cloud infrastructure. The team works closely with development and application teams to ensure seamless integration and optimal performance.
- Development Methodology: Endava follows Agile methodologies, with a focus on continuous improvement and collaboration.
Career & Growth Analysis
- Career Level: This role is suitable for experienced cloud operations engineers with a strong background in AWS and Terraform who want to advance their careers in a dynamic and growing organization.
- Reporting Structure: The senior cloud operations engineer will report directly to the cloud operations manager and work closely with other teams, including development, application, and security.
- Technical Impact: The engineer will play a crucial role in maintaining the stability, performance, and availability of cloud infrastructure, ensuring that critical business functions run smoothly.
Growth Opportunities:
- Career Progression: Endava offers non-linear career paths and internal development programs, providing opportunities for technical and management growth.
- Technical Skill Development: The company encourages continuous learning and offers access to various training, certifications, and online learning platforms.
- Technical Leadership: With experience and proven performance, engineers can progress to technical leadership roles, driving architecture decisions and mentoring junior team members.
Work Environment
- Office Type: Endava's Cluj-Napoca office is a modern, collaborative workspace designed to foster innovation and teamwork.
- Office Location(s): Cluj-Napoca, Romania
- Workspace Context: The workspace is equipped with state-of-the-art technology and multiple monitors to support cloud operations engineers in their daily tasks. The office also offers ample opportunities for cross-functional collaboration with other teams, such as development, application, and security.
- Work Schedule: Full-time, with a commitment to 24/7 support coverage and participation in on-call rotations. The work schedule is flexible, with a focus on results and performance.
📄 Application & Technical Interview Process
Interview Process:
- Phone/Video Screen: A brief conversation to assess communication skills, cultural fit, and initial technical understanding (30-45 minutes).
- Technical Deep Dive: A detailed discussion of AWS and Terraform, focusing on infrastructure management, automation, and problem-solving (60-90 minutes).
- Behavioral and Cultural Fit: An in-depth conversation to evaluate problem-solving skills, adaptability, and cultural fit within the Endava team (45-60 minutes).
- Final Decision: A review of the candidate's overall performance and fit for the role.
Portfolio Review Tips:
- Highlight successful incident management, request fulfillment, and problem management cases.
- Showcase automation and scripting projects that improved system consistency and reliability.
- Include detailed documentation of infrastructure configurations, operating procedures, and troubleshooting guides.
Technical Challenge Preparation:
- Brush up on AWS services, including EC2, RDS, ELB, and IAM.
- Familiarize yourself with Terraform, including provisioning, state management, and remote state storage.
- Practice incident management, problem-solving, and automation scenarios using AWS and Terraform.
ATS Keywords:
- Programming Languages: Bash, Python, PowerShell
- Infrastructure as Code: Terraform
- Cloud Platform: AWS (EC2, RDS, ELB, IAM, etc.)
- Databases: AWS RDS, DynamoDB, Redshift
- Tools: AWS CloudWatch, AWS CloudFormation, AWS Systems Manager, Splunk, Datadog
- Methodologies: ITIL, Agile, DevOps
- Soft Skills: Communication, Collaboration, Problem-Solving, Adaptability
- Industry Terms: Cloud Operations, Infrastructure as Code, Incident Management, Problem Management, Request Fulfillment
🛠 Technology Stack & Infrastructure
Cloud Platform: AWS (Amazon Web Services)
Infrastructure as Code: Terraform
Monitoring Tools: AWS CloudWatch, Splunk, Datadog
Configuration Management: AWS Systems Manager, Ansible (optional)
Automation Tools: AWS Lambda, AWS Step Functions, Jenkins, GitLab CI/CD
Version Control: Git
Containerization: Docker (optional)
Orchestration: Kubernetes (optional)
👥 Team Culture & Values
Cloud Operations Values:
- Reliability: Ensuring high availability, performance, and reliability of cloud infrastructure.
- Automation: Streamlining operational tasks through automation and Infrastructure as Code.
- Collaboration: Working closely with development and application teams to optimize deployments and operational processes.
- Continuous Improvement: Regularly reviewing and enhancing infrastructure and operational processes.
Collaboration Style:
- Cross-Functional Integration: Collaborating with development, application, and security teams to ensure seamless integration and optimal performance.
- Code Review Culture: Encouraging knowledge sharing and continuous learning through code reviews and pair programming.
- Knowledge Sharing: Facilitating technical mentoring and workshops to foster a culture of continuous learning and improvement.
⚡ Challenges & Growth Opportunities
Technical Challenges:
- Incident Management: Developing and refining incident management processes to minimize business impact and ensure rapid resolution.
- Automation: Identifying and automating operational tasks to improve system consistency and reliability.
- Scalability: Designing and implementing scalable infrastructure to support business growth and high availability.
- Cost Optimization: Regularly reviewing and optimizing cloud infrastructure to minimize costs and maximize efficiency.
Learning & Development Opportunities:
- AWS Certifications: Endava encourages employees to pursue relevant AWS certifications to enhance their technical skills and career prospects.
- Training and Workshops: The company offers various training opportunities, including online learning platforms, conferences, and internal tech communities.
- Mentorship: Endava provides mentorship programs to help employees develop their technical and leadership skills.
💡 Interview Preparation
Technical Questions:
- AWS Services: Be prepared to discuss AWS services, including EC2, RDS, ELB, and IAM, and their use cases.
- Terraform: Demonstrate a deep understanding of Terraform, including provisioning, state management, and remote state storage.
- Incident Management: Prepare for scenario-based questions on incident management, problem-solving, and automation using AWS and Terraform.
Company & Culture Questions:
- Endava's Mission: Be prepared to discuss Endava's mission, values, and how they align with your personal and professional goals.
- Team Dynamics: Demonstrate an understanding of Endava's team culture and how you would contribute to a collaborative and innovative environment.
- Adaptability: Prepare for questions on your ability to adapt to new technologies, tools, and processes in a dynamic and growing organization.
Portfolio Presentation Strategy:
- Live Demos: Prepare live demos of your AWS and Terraform projects, highlighting successful incident management, request fulfillment, and problem management cases.
- Documentation: Include detailed documentation of infrastructure configurations, operating procedures, and troubleshooting guides to support your portfolio presentation.
- Business Impact: Tailor your portfolio presentation to show how your cloud infrastructure work affected the business and its end users.
📌 Application Steps
To apply for this Senior Cloud Operations Engineer - AWS position at Endava:
- Submit your application through the application link provided.
- Customize your resume and portfolio to highlight your relevant experience in AWS and Terraform, with a focus on incident management, automation, and problem-solving.
- Prepare for the phone/video screen, technical deep dive, and behavioral and cultural fit interviews by brushing up on your AWS and Terraform skills, and researching Endava's mission, values, and team culture.
- Review the job description and company information to ensure a strong understanding of the role and Endava's expectations.
📝 Enhancement Note: This enhanced job description includes AI-generated insights and industry-standard assumptions. All details should be verified directly with Endava before making application decisions.
Application Requirements
Candidates must have proven experience managing AWS cloud infrastructure and strong hands-on experience with Infrastructure as Code using Terraform. Familiarity with ITIL-based service management and strong scripting skills are also essential.