Senior Cloud Operations Engineer - AWS

Endava
Full-time, Cluj-Napoca, Romania

📍 Job Overview

  • Job Title: Senior Cloud Operations Engineer - AWS
  • Company: Endava
  • Location: Cluj-Napoca, Cluj, Romania
  • Job Type: Full-time
  • Category: DevOps Engineer
  • Date Posted: June 19, 2025
  • Experience Level: 5-10 years
  • Remote Status: On-site/Hybrid

🚀 Role Summary

  • Cloud Infrastructure Management: Oversee day-to-day operations, ensuring reliability, performance, and availability of cloud environments.
  • Incident & Problem Management: Implement ITIL-based incident, request, and problem management to minimize business impact and prevent recurrence.
  • Automation & Infrastructure as Code: Build, manage, and automate AWS cloud environments using Terraform to reduce manual intervention and improve system consistency.
  • Collaboration & Communication: Work with development and application teams to streamline deployments and operational processes, and effectively communicate with technical and non-technical stakeholders.

📝 Enhancement Note: This role requires a strong technical background in AWS and Terraform, with a focus on operational excellence and incident management. Candidates should be comfortable working in a 24/7 support model and have a solid understanding of ITIL-based service management.

💻 Primary Responsibilities

  • Cloud Infrastructure Management: Monitor, maintain, and improve system availability, performance, and reliability across AWS environments.
  • Incident & Problem Management: Detect, investigate, and resolve infrastructure incidents, and identify root causes of recurring issues to implement permanent fixes.
  • Automation & Infrastructure as Code: Build and manage AWS cloud environments using Infrastructure as Code (Terraform), and implement automation for operational tasks.
  • Collaboration & Communication: Work with development and application teams to streamline deployments and operational processes, and effectively communicate with technical and non-technical stakeholders.
  • Documentation & Knowledge Sharing: Maintain detailed documentation of infrastructure configurations, operating procedures, and troubleshooting guides, and share knowledge with the team.
  • Compliance & Security: Ensure infrastructure compliance with security, governance, and operational standards across all environments.
  • Capacity Planning & Cost Optimization: Participate in capacity planning, cost optimization, and performance tuning activities to improve system efficiency.
  • Change Management: Support change management processes to ensure safe and auditable infrastructure changes.

🎓 Skills & Qualifications

Education: Bachelor's degree in Computer Science, Engineering, or a related field. Relevant experience may be considered in lieu of a degree.

Experience: Proven experience managing AWS cloud infrastructure (5-10 years), with strong hands-on experience in Terraform and ITIL-based service management.

Required Skills:

  • AWS cloud infrastructure management
  • Terraform (Infrastructure as Code)
  • ITIL-based service management (incident, request, and problem management)
  • Scripting skills (e.g., Bash, Python, PowerShell)
  • Familiarity with networking, security groups, load balancers, IAM policies, and monitoring in cloud environments
  • Strong communication and collaboration skills

Preferred Skills:

  • Experience with observability tools (e.g., Splunk, CloudWatch, Datadog)
  • Familiarity with CI/CD pipelines, configuration management tools, and automation frameworks
  • AWS certifications

📊 Portfolio & Project Requirements

Portfolio Essentials:

  • Demonstrate experience managing AWS cloud infrastructure with examples of Terraform configurations and ITIL-based service management implementations.
  • Showcase incident management and problem-solving skills with case studies of infrastructure incidents and their resolution.
  • Highlight automation and scripting skills with examples of automated operational tasks and scripts used to improve system consistency and reliability.

Technical Documentation:

  • Provide detailed documentation of infrastructure configurations, operating procedures, and troubleshooting guides.
  • Include examples of capacity planning, cost optimization, and performance tuning activities.
  • Demonstrate understanding of security, governance, and operational standards with relevant documentation and compliance examples.

💵 Compensation & Benefits

Salary Range: The estimated salary range for this role in Cluj-Napoca, Romania is between 35,000 RON and 45,000 RON per year, based on market research and industry standards for senior cloud operations engineers with 5-10 years of experience.

Benefits:

  • Competitive salary package
  • Share plan
  • Company performance bonuses
  • Value-based recognition awards
  • Referral bonus
  • Career coaching
  • Global career opportunities
  • Non-linear career paths
  • Internal development programmes for management and technical leadership
  • Complex projects, rotations, internal tech communities, training, certifications, coaching, online learning platform subscriptions, pass-it-on sessions, workshops, conferences
  • Hybrid work and flexible working hours
  • Employee assistance programme
  • Global internal wellbeing programme, access to wellbeing apps
  • Global internal tech communities, hobby clubs and interest groups, inclusion and diversity programmes, events and celebrations

Working Hours: Full-time position with a commitment to 24/7 support coverage, including participation in on-call rotations.

🎯 Team & Company Context

🏢 Company Culture

Industry: Endava is a global technology company focused on delivering innovative solutions for various industries, including finance, retail, and healthcare.

Company Size: Endava has over 7,000 employees across multiple locations worldwide, providing ample opportunities for collaboration and growth within the organization.

Founded: Endava was founded in 2000 and has since grown into a global leader in technology and engineering services.

Team Structure:

  • The cloud operations team consists of experienced engineers responsible for managing, maintaining, and improving cloud infrastructure across various clients and projects.
  • The team follows an Agile/Scrum development methodology, with regular sprint planning and collaboration sessions.
  • Cross-functional collaboration with development, application, and other teams is essential for streamlining deployments and operational processes.

Development Methodology:

  • Endava follows an Agile/Scrum development methodology, with regular sprint planning, daily stand-ups, and sprint retrospectives.
  • Code reviews, testing, and quality assurance practices are integral to the development process.
  • Deployment strategies, CI/CD pipelines, and server management are handled by the DevOps team, with a focus on automation and infrastructure as code.

Company Website: Endava

📝 Enhancement Note: Endava's global presence and diverse client base offer unique opportunities for cloud operations engineers to work on complex projects and gain exposure to various industries. The company's commitment to career development and learning opportunities makes it an attractive choice for experienced professionals seeking to advance their careers in cloud operations.

📈 Career & Growth Analysis

Career Level: This is a senior-level position requiring a high degree of technical expertise and experience in cloud operations, AWS, and Terraform. The engineer will be responsible for managing and maintaining critical infrastructure, making strategic decisions, and mentoring junior team members.

Reporting Structure: The senior cloud operations engineer will report directly to the head of cloud operations and work closely with the development, application, and other teams to ensure smooth deployment and operational processes.

Technical Impact: The engineer will have a significant impact on the performance, reliability, and security of cloud infrastructure, directly contributing to the success of Endava's clients and projects.

Growth Opportunities:

  • Technical Leadership: With experience and proven performance, the engineer may progress to a technical lead or architect role, focusing on driving innovation and best practices within the cloud operations team.
  • Management: Demonstrating strong leadership and mentoring skills, the engineer may transition into a management role, overseeing the day-to-day operations of the cloud operations team and driving strategic initiatives.
  • Specialization: The engineer may choose to specialize in a specific area of cloud operations, such as security, governance, or cost optimization, becoming a subject matter expert and driving best practices within the organization.

📝 Enhancement Note: Endava's commitment to career development and learning opportunities provides ample growth prospects for experienced cloud operations engineers. With a focus on technical leadership, management, and specialization, the company offers a clear path for professionals seeking to advance their careers in cloud operations.

🌐 Work Environment

Office Type: Endava's Cluj-Napoca office is a modern, collaborative workspace designed to foster innovation and creativity. The office features open-plan workspaces, meeting rooms, and breakout areas, promoting interaction and collaboration among team members.

Office Location(s): Endava's Cluj-Napoca office is conveniently located in the city center, with easy access to public transportation and nearby amenities.

Workspace Context:

  • Collaborative Workspace: The open-plan office layout encourages teamwork and knowledge sharing among cloud operations engineers and other team members.
  • Workstation Setup: Each engineer has access to a dedicated workstation with multiple monitors, testing devices, and development tools tailored to their specific needs.
  • Cross-Functional Interaction: The office layout facilitates interaction with other teams, such as development, design, and project management, fostering a collaborative work environment.

Work Schedule: Full-time position with a commitment to 24/7 support coverage, including participation in on-call rotations. The work schedule is flexible, with a focus on delivering results and maintaining work-life balance.

📝 Enhancement Note: Endava's modern, collaborative workspace and flexible work schedule create an ideal environment for cloud operations engineers to thrive and grow professionally. The focus on teamwork, knowledge sharing, and work-life balance makes Endava an attractive choice for experienced engineers seeking a supportive and engaging work environment.

📄 Application & Technical Interview Process

Interview Process:

  1. Technical Phone Screen: A 30-minute phone or video call to assess the candidate's technical proficiency in AWS, Terraform, and ITIL-based service management. The interviewer may ask questions about incident management, automation, and scripting skills.
  2. On-site Technical Assessment: A 2-3 hour on-site assessment, consisting of a technical deep dive into the candidate's experience with AWS, Terraform, and incident management. The assessment may include live coding exercises, system design discussions, and problem-solving scenarios.
  3. Behavioral & Cultural Fit Interview: A 1-hour interview focused on understanding the candidate's problem-solving approach, communication skills, and cultural fit within the Endava team. The interviewer may ask questions about the candidate's experience with ITIL-based service management, automation, and incident management.
  4. Final Evaluation: A 30-minute meeting with the hiring manager to discuss the candidate's technical and cultural fit, and make a final hiring decision.

Portfolio Review Tips:

  • Highlight experience managing AWS cloud infrastructure, with a focus on Terraform configurations and ITIL-based service management implementations.
  • Include case studies of incident management and problem-solving skills, demonstrating the ability to detect, investigate, and resolve infrastructure incidents.
  • Showcase automation and scripting skills with examples of automated operational tasks and scripts used to improve system consistency and reliability.

Technical Challenge Preparation:

  • Brush up on AWS services, Terraform configurations, and ITIL-based service management principles.
  • Practice incident management and problem-solving scenarios, focusing on detecting, investigating, and resolving infrastructure incidents.
  • Familiarize yourself with Endava's company culture, values, and mission to demonstrate a strong cultural fit during the interview process.

ATS Keywords: [AWS, Terraform, ITIL, Incident Management, Request Fulfillment, Problem Management, Scripting, Automation, CI/CD, Infrastructure as Code, Cloud Operations, Senior Engineer, Technical Lead, Mentoring, Career Development, Cloud Infrastructure, Hybrid Work, Flexible Work Hours, Global Technology Company, Agile/Scrum Methodology, Technical Interview, Behavioral Interview, Final Evaluation]

📝 Enhancement Note: Endava's technical interview process is designed to evaluate the candidate's technical proficiency in AWS, Terraform, and ITIL-based service management, as well as their problem-solving approach, communication skills, and cultural fit within the organization. By preparing for each stage of the interview process and demonstrating a strong understanding of Endava's company culture and values, candidates can increase their chances of success in securing the senior cloud operations engineer role.

🛠 Technology Stack & Infrastructure

Cloud Infrastructure:

  • AWS: Endava's cloud infrastructure is primarily hosted on AWS, utilizing a range of services such as EC2, RDS, and S3 for compute, database, and storage needs.
  • Terraform: Terraform is used for Infrastructure as Code (IaC) to manage and provision AWS resources, ensuring consistency, version control, and automated deployment.

Networking & Security:

  • Networking: Endava's cloud infrastructure is designed with high availability and fault tolerance in mind, utilizing AWS services such as VPC, subnets, route tables, and network ACLs to manage network traffic and security.
  • Security: Endava implements robust security measures to protect its cloud infrastructure, including IAM policies, security groups, and encryption at rest and in transit.

Monitoring & Logging:

  • Monitoring: Endava uses AWS CloudWatch and third-party tools like Datadog or New Relic to monitor the performance, availability, and health of its cloud infrastructure.
  • Logging: AWS CloudTrail and CloudWatch Logs are used to collect and store logs from AWS resources, enabling auditing, troubleshooting, and compliance.

CI/CD & Automation:

  • CI/CD: Endava employs CI/CD pipelines to automate the build, test, and deployment process, ensuring consistent and reliable software delivery.
  • Automation: Endava uses tools like Ansible, Puppet, or Chef to automate operational tasks, infrastructure provisioning, and configuration management.

📝 Enhancement Note: Endava's technology stack is designed to provide a scalable, secure, and highly available cloud infrastructure for its clients and projects. By leveraging AWS services, Terraform, and other best-of-breed tools, Endava ensures that its cloud operations engineers have access to the latest technologies and best practices in cloud infrastructure management.

👥 Team Culture & Values

Cloud Operations Values:

  • Reliability: Endava's cloud operations team is committed to ensuring the availability, performance, and reliability of cloud infrastructure, with a focus on minimizing downtime.
  • Proactivity: The team proactively monitors, maintains, and improves cloud infrastructure, identifying and addressing potential issues before they impact business operations.
  • Collaboration: Endava's cloud operations team works closely with development, application, and other teams to streamline deployments, operational processes, and incident management.
  • Continuous Learning: The team is committed to staying up to date with the latest AWS services, Terraform best practices, and ITIL-based service management principles, continuously improving its skills and knowledge.

Collaboration Style:

  • Cross-Functional Integration: Endava's cloud operations team works closely with development, design, and project management teams to ensure smooth deployment and operational processes.
  • Code Review Culture: The team encourages peer-to-peer code reviews and knowledge sharing, fostering a collaborative work environment and driving best practices in cloud infrastructure management.
  • Mentoring & Knowledge Sharing: Endava's cloud operations team actively mentors junior engineers, providing guidance and support to help them develop their skills and advance their careers in cloud operations.

📝 Enhancement Note: Endava's cloud operations team is committed to delivering reliable, secure, and highly available cloud infrastructure for its clients and projects. By fostering a culture of collaboration, continuous learning, and knowledge sharing, Endava ensures that its cloud operations engineers have the skills, tools, and support they need to succeed in their roles and advance their careers in cloud operations.

⚡ Challenges & Growth Opportunities

Technical Challenges:

  • Incident Management: Endava's cloud operations engineers may face challenging incident management scenarios, requiring quick thinking, problem-solving, and effective communication to minimize business impact and ensure rapid resolution.
  • Automation & Scripting: The team may encounter complex automation and scripting challenges, requiring creative solutions and best practices in infrastructure as code and operational task automation.
  • Performance Optimization: Endava's cloud infrastructure may face performance optimization challenges, requiring engineers to identify bottlenecks, optimize resource utilization, and implement best practices in cloud architecture and design.
  • Emerging Technologies: As AWS and other cloud providers introduce new services and features, Endava's cloud operations engineers must stay up to date with the latest technologies and best practices, continuously expanding their skill set and knowledge base.

Learning & Development Opportunities:

  • Technical Skill Development: Endava offers opportunities for cloud operations engineers to develop their technical skills through training, certifications, and hands-on experience with the latest AWS services and Terraform best practices.
  • Conference Attendance & Community Involvement: Endava encourages its engineers to attend industry conferences, join online communities, and participate in local meetups to expand their knowledge and network within the cloud operations industry.
  • Mentorship & Leadership Development: Endava provides opportunities for cloud operations engineers to mentor junior engineers, develop their leadership skills, and advance their careers in technical or management roles.

📝 Enhancement Note: Endava's cloud operations engineers face a range of technical challenges, requiring quick thinking, problem-solving, and effective communication to ensure the reliability, performance, and security of cloud infrastructure. By providing opportunities for continuous learning, skill development, and mentorship, Endava ensures that its cloud operations engineers have the tools and support they need to succeed in their roles and advance their careers in cloud operations.

💡 Interview Preparation

Technical Questions:

  • AWS & Terraform: Be prepared to discuss your experience with AWS services, Terraform configurations, and Infrastructure as Code (IaC) best practices. The interviewer may ask about your familiarity with AWS services like EC2, RDS, and S3, as well as your experience with Terraform modules, providers, and state management.
  • ITIL-based Service Management: Demonstrate your understanding of ITIL-based service management principles, incident management, request fulfillment, and problem management. The interviewer may ask about your experience with ITIL processes, service level agreements (SLAs), and operational level agreements (OLAs).
  • Scripting & Automation: Showcase your scripting skills and experience with automation tools like Ansible, Puppet, or Chef. The interviewer may ask about your familiarity with shell scripting, Python, or PowerShell, as well as your experience with CI/CD pipelines and infrastructure as code.
  • Problem-solving & System Design: Prepare for problem-solving scenarios and system design discussions, demonstrating your ability to think critically, identify root causes, and implement effective solutions. The interviewer may ask about your experience with incident management, capacity planning, and performance tuning.

Company & Culture Questions:

  • Endava's Mission & Values: Familiarize yourself with Endava's mission, values, and company culture, and be prepared to discuss how your personal values align with the company's. The interviewer may ask about your commitment to collaboration, continuous learning, and customer focus.
  • Agile/Scrum Methodology: Demonstrate your understanding of Agile/Scrum development methodologies, including sprint planning, daily stand-ups, and sprint retrospectives. The interviewer may ask about your experience with Agile teams, backlog management, and user story estimation.
  • Customer Focus: Prepare to discuss your experience working with customers, understanding their needs, and delivering solutions that meet their business objectives. The interviewer may ask about your ability to build strong relationships, communicate effectively, and drive customer satisfaction.

Portfolio Presentation Strategy:

  • Cloud Infrastructure Management: Highlight your experience managing AWS cloud infrastructure, with a focus on Terraform configurations, incident management, and automation.
  • System Design & Architecture: Showcase your ability to design and implement scalable, secure, and highly available cloud infrastructure, with a focus on best practices in cloud architecture and design.
  • Problem-solving & Incident Management: Demonstrate your ability to detect, investigate, and resolve infrastructure incidents, with a focus on minimizing business impact and ensuring rapid resolution.

📌 Application Steps

To apply for this senior cloud operations engineer position at Endava:

  1. Update Your Resume: Tailor your resume to highlight your experience with AWS, Terraform, and ITIL-based service management, with a focus on incident management, automation, and scripting skills.
  2. Prepare Your Portfolio: Showcase your experience managing AWS cloud infrastructure, with a focus on Terraform configurations, incident management, and automation. Include case studies of challenging incident management scenarios and your approach to rapid resolution.
  3. Research Endava: Familiarize yourself with Endava's company culture, values, and mission, and be prepared to discuss how your personal values align with the company's.
  4. Practice Technical Interview Questions: Brush up on your AWS, Terraform, and ITIL-based service management knowledge, and practice answering technical interview questions to build confidence and improve your performance.
  5. Prepare for Behavioral Interview Questions: Reflect on your experience working with customers, understanding their needs, and delivering solutions that meet their business objectives. Prepare to discuss your ability to build strong relationships, communicate effectively, and drive customer satisfaction.

⚠️ Important Notice: This enhanced job description includes AI-generated insights and industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.

Application Requirements

Candidates must have proven experience managing AWS cloud infrastructure and strong hands-on experience with Infrastructure as Code using Terraform. Familiarity with ITIL-based service management and scripting skills in languages like Bash or Python are also essential.