Senior Cloud Operations Engineer - AWS

Endava
Full-time
Cluj-Napoca, Romania

📍 Job Overview

  • Job Title: Senior Cloud Operations Engineer - AWS
  • Company: Endava
  • Location: Cluj-Napoca, Cluj, Romania
  • Job Type: Full-time, Hybrid
  • Category: DevOps, Infrastructure
  • Date Posted: June 19, 2025
  • Experience Level: 5-10 years
  • Remote Status: On-site/Hybrid

🚀 Role Summary

  • Cloud Infrastructure Management: Proactively monitor, maintain, and improve the availability, performance, and reliability of AWS cloud infrastructure.
  • ITIL-based Service Management: Own and execute the full lifecycle of operational support, including incident, request, and problem management.
  • Infrastructure as Code (IaC): Manage AWS cloud environments using Terraform and automate operational tasks to reduce manual intervention.
  • Collaboration: Work closely with development and application teams to streamline deployments and operational processes.
  • Compliance & Documentation: Ensure compliance with security, governance, and operational standards, and maintain detailed documentation.

📝 Enhancement Note: This role requires a strong focus on operational excellence, automation, and collaboration with development teams to ensure efficient and reliable cloud infrastructure.

💻 Primary Responsibilities

  • Cloud Infrastructure Management:

    • Monitor, maintain, and improve system availability, performance, and reliability.
    • Ensure infrastructure availability for business-critical services in a 24/7 support model.
    • Participate in on-call rotations and support change management processes.
  • ITIL-based Service Management:

    • Rapidly detect, investigate, and resolve infrastructure incidents to minimize business impact.
    • Handle and fulfill infrastructure service requests within agreed service levels.
    • Identify root causes of recurring issues and implement permanent fixes to prevent recurrence.
  • Infrastructure as Code (IaC) & Automation:

    • Build and manage AWS cloud environments using Terraform.
    • Implement automation for operational tasks to reduce manual intervention and improve system consistency and reliability.
  • Collaboration & Documentation:

    • Collaborate with development and application teams to streamline deployments and operational processes.
    • Maintain detailed documentation of infrastructure configurations, operating procedures, and troubleshooting guides.

🎓 Skills & Qualifications

Education: Relevant degree in Computer Science, Engineering, or a related field. Relevant experience may be considered in lieu of a degree.

Experience: 5-10 years of proven experience managing AWS cloud infrastructure, plus strong hands-on experience with Infrastructure as Code (IaC) using Terraform.

Required Skills:

  • Strong technical acumen in AWS and Terraform.
  • Working knowledge and experience in incident, request, and problem management (ITIL).
  • Experience supporting and operating production infrastructure in a 24/7 environment.
  • Strong scripting skills in at least one language (e.g., Bash, Python, PowerShell).
  • Familiarity with networking, security groups, load balancers, IAM policies, and monitoring in cloud environments.
  • Strong communication and collaboration skills with the ability to interface effectively with technical and non-technical stakeholders.

Preferred Skills:

  • Familiarity with observability tools such as Splunk, CloudWatch, Datadog, or similar.
  • AWS certifications and ITIL certifications.

📊 Portfolio & Project Requirements

Portfolio Essentials:

  • Demonstrate experience managing AWS cloud infrastructure with live examples and case studies.
  • Showcase automation scripts and Terraform configurations for infrastructure as code.
  • Highlight problem-solving skills and incident management experiences with real-life examples.

Technical Documentation:

  • Provide documentation of infrastructure configurations, operating procedures, and troubleshooting guides.
  • Include examples of process improvement initiatives and automation scripts.

📝 Enhancement Note: A strong portfolio should showcase the candidate's ability to manage AWS cloud infrastructure, automate operational tasks, and effectively document processes and procedures.

💵 Compensation & Benefits

Salary Range: The estimated salary range for this role in Cluj-Napoca, Romania is approximately 25,000 - 35,000 RON per month (based on market research and industry standards for senior cloud operations engineers with 5-10 years of experience).

Benefits:

  • Competitive salary package, share plan, company performance bonuses, value-based recognition awards, referral bonus.
  • Career coaching, global career opportunities, non-linear career paths, internal development programs for management and technical leadership.
  • Complex projects, rotations, internal tech communities, training, certifications, coaching, online learning platforms subscriptions, pass-it-on sessions, workshops, conferences.
  • Hybrid work and flexible working hours, employee assistance program.
  • Global internal wellbeing program, access to wellbeing apps.

Working Hours: Full-time, 40 hours per week, with flexible working hours and participation in on-call rotations.

📝 Enhancement Note: The estimated salary range is based on market research and industry standards for senior cloud operations engineers in Cluj-Napoca, Romania. Benefits include a comprehensive package focused on career development, work-life balance, and employee wellbeing.

🎯 Team & Company Context

🏢 Company Culture

Industry: Endava is a global technology company focused on delivering innovative solutions for various industries, including finance, retail, and healthcare.

Company Size: Endava has over 10,000 employees globally, providing ample opportunities for collaboration and career growth.

Founded: Endava was founded in 2000 and has since grown into a global leader in technology and engineering services.

Team Structure:

  • The cloud operations team consists of experienced engineers responsible for managing and maintaining AWS cloud infrastructure.
  • The team follows a matrix structure, collaborating with development, application, and other technical teams to ensure efficient and reliable cloud services.
  • The team works in an Agile environment, focusing on continuous improvement and automation.

Development Methodology:

  • The team follows ITIL-based service management for incident, request, and problem management.
  • They use Infrastructure as Code (IaC) with Terraform for managing AWS cloud environments and automating operational tasks.
  • Endava emphasizes collaboration, knowledge sharing, and continuous learning, with regular training and certification opportunities.

Company Website: Endava

📝 Enhancement Note: Endava's culture emphasizes collaboration, knowledge sharing, and continuous learning, providing a supportive environment for cloud operations engineers to grow and develop their skills.

📈 Career & Growth Analysis

Cloud Operations Engineer Career Level: This role is at the senior level, focusing on managing and maintaining AWS cloud infrastructure, executing ITIL-based service management, and driving operational excellence.

Reporting Structure: The senior cloud operations engineer reports to the cloud operations manager and works closely with development, application, and other technical teams.

Technical Impact: The role has a significant impact on the company's cloud infrastructure, ensuring high availability, performance, and reliability for business-critical services. The engineer's work directly influences user experience and business outcomes.

Growth Opportunities:

  • Technical Growth: Deepen expertise in AWS and other cloud technologies, pursue specialized certifications, and contribute to the development of internal tools and frameworks.
  • Leadership Growth: Develop management and leadership skills by mentoring junior engineers, driving process improvements, and contributing to strategic decision-making.
  • Career Progression: Explore opportunities for career progression within the cloud operations team or other technical roles within Endava.

📝 Enhancement Note: Endava offers numerous growth opportunities for senior cloud operations engineers, focusing on technical skill development, leadership, and career progression within the company.

🌐 Work Environment

Office Type: Endava's Cluj-Napoca office is a modern, collaborative workspace designed to foster innovation and teamwork.

Office Location(s): The main office is located in Cluj-Napoca, with additional offices in Iasi, Romania, and other global locations.

Workspace Context:

  • The office provides ample space for collaboration, with dedicated areas for team meetings and workshops.
  • Engineers have access to multiple monitors, testing devices, and other tools necessary for their roles.
  • Endava encourages a culture of knowledge sharing and continuous learning, with regular tech talks, workshops, and hackathons.

Work Schedule: Full-time, 40 hours per week, with flexible working hours and participation in on-call rotations for 24/7 support.

📝 Enhancement Note: Endava's work environment encourages collaboration, knowledge sharing, and continuous learning, providing a supportive and engaging space for cloud operations engineers to thrive.

📄 Application & Technical Interview Process

Interview Process:

  1. Online Assessment: A technical assessment focusing on AWS cloud infrastructure management, Terraform, and ITIL-based service management.
  2. Technical Deep Dive: A detailed discussion of the candidate's experience with AWS, Terraform, and incident management, as well as their approach to automation and process improvement.
  3. Behavioral & Cultural Fit: An assessment of the candidate's communication skills, problem-solving abilities, and cultural fit within Endava's teams.
  4. Final Decision: A review of the candidate's performance throughout the interview process and a final decision on hiring.

Portfolio Review Tips:

  • Highlight experience managing AWS cloud infrastructure and demonstrate a strong understanding of Terraform and ITIL-based service management.
  • Showcase automation scripts and Terraform configurations, emphasizing your ability to reduce manual intervention and improve system consistency and reliability.
  • Include real-life examples of incident management, problem-solving, and process improvement initiatives.

Technical Challenge Preparation:

  • Brush up on AWS cloud infrastructure management, Terraform, and ITIL-based service management.
  • Practice incident management scenarios and problem-solving exercises to demonstrate strong decision-making and communication skills.
  • Familiarize yourself with Endava's company culture and values to ensure a good fit with the team.

ATS Keywords: [AWS, Terraform, ITIL, Incident Management, Problem Management, Automation, Scripting, Cloud Infrastructure, Monitoring, Networking, Security, Load Balancing, IAM, CloudWatch, Datadog, Splunk, Agile, Collaboration, Knowledge Sharing, Continuous Learning]

📝 Enhancement Note: Endava's interview process focuses on assessing the candidate's technical expertise in AWS cloud infrastructure management, Terraform, and ITIL-based service management, as well as their problem-solving skills, communication, and cultural fit within the team.

🛠 Technology Stack & Infrastructure

Cloud Infrastructure:

  • AWS: Endava's primary cloud provider, with a focus on managing and maintaining AWS cloud infrastructure.
  • Terraform: Used for Infrastructure as Code (IaC) to manage AWS cloud environments and automate operational tasks.

Monitoring & Logging:

  • CloudWatch: AWS's native monitoring and logging service for cloud infrastructure.
  • Datadog or Splunk: Endava may use third-party monitoring and logging tools for enhanced visibility and analytics.

Configuration Management:

  • Ansible or Puppet: Endava may use configuration management tools to automate the deployment and management of cloud infrastructure.

CI/CD Pipelines:

  • Jenkins or GitLab CI/CD: Endava may use CI/CD pipelines for automated testing, deployment, and infrastructure management.

📝 Enhancement Note: Endava's technology stack focuses on AWS cloud infrastructure management, with a strong emphasis on Infrastructure as Code (IaC) using Terraform and automation for operational tasks.

👥 Team Culture & Values

Cloud Operations Values:

  • Reliability: Endava emphasizes the importance of ensuring high availability, performance, and reliability for business-critical services.
  • Automation: The team values automation as a means of reducing manual intervention, improving system consistency, and driving operational excellence.
  • Collaboration: Endava fosters a culture of collaboration, knowledge sharing, and continuous learning within its cloud operations teams.
  • Continuous Improvement: The team is committed to process improvement and operational excellence through regular review and optimization of cloud infrastructure and service management processes.

Collaboration Style:

  • Cross-functional Collaboration: Cloud operations teams work closely with development, application, and other technical teams to ensure efficient and reliable cloud services.
  • Peer Review & Knowledge Sharing: Endava encourages peer review and knowledge sharing, with regular tech talks, workshops, and hackathons.
  • Mentoring & Leadership Development: Endava provides mentoring and leadership development opportunities for cloud operations engineers to grow and develop their skills.

📝 Enhancement Note: Endava's cloud operations team values reliability, automation, collaboration, and continuous improvement, fostering a culture of knowledge sharing, continuous learning, and teamwork.

⚡ Challenges & Growth Opportunities

Technical Challenges:

  • Cloud Infrastructure Management: Manage and maintain AWS cloud infrastructure, ensuring high availability, performance, and reliability for business-critical services.
  • Incident & Problem Management: Rapidly detect, investigate, and resolve infrastructure incidents, and identify root causes of recurring issues to prevent recurrence.
  • Automation & Process Improvement: Implement automation for operational tasks and drive process improvement initiatives to reduce manual intervention and improve system consistency and reliability.
  • Emerging Technologies: Stay up to date with emerging cloud technologies and evaluate their integration into Endava's cloud infrastructure and service management processes.

Learning & Development Opportunities:

  • AWS Training & Certification: Endava offers regular training and certification opportunities for AWS cloud infrastructure management.
  • Terraform Training & Certification: Endava provides training and certification opportunities for Infrastructure as Code (IaC) using Terraform.
  • ITIL Training & Certification: Endava offers training and certification opportunities for ITIL-based service management.
  • Leadership Development: Endava provides mentoring and leadership development opportunities for cloud operations engineers to grow and develop their skills.

📝 Enhancement Note: Endava offers numerous technical challenges and growth opportunities for cloud operations engineers, focusing on cloud infrastructure management, incident and problem management, automation, and continuous learning.

💡 Interview Preparation

Technical Questions:

  • AWS Cloud Infrastructure Management: Describe your experience managing AWS cloud infrastructure and how you ensure high availability, performance, and reliability for business-critical services.
  • Terraform & Infrastructure as Code (IaC): Explain your approach to managing AWS cloud environments using Terraform and automating operational tasks.
  • Incident & Problem Management: Walk through a real-life incident or problem management scenario, demonstrating your ability to rapidly detect, investigate, and resolve infrastructure incidents and identify root causes of recurring issues.
  • Automation & Process Improvement: Discuss your approach to automating operational tasks and driving process improvement initiatives to reduce manual intervention and improve system consistency and reliability.

Company & Culture Questions:

  • Endava's Cloud Operations Team: Describe your understanding of Endava's cloud operations team structure, values, and collaboration style.
  • AWS & Endava's Technology Stack: Explain your familiarity with Endava's technology stack, focusing on AWS cloud infrastructure management, Terraform, and other relevant tools.
  • Endava's Company Culture: Discuss your understanding of Endava's company culture, values, and commitment to knowledge sharing, continuous learning, and teamwork.

Portfolio Presentation Strategy:

  • Cloud Infrastructure Management: Highlight your experience managing AWS cloud infrastructure, demonstrating a strong understanding of Terraform and ITIL-based service management.
  • Automation & Process Improvement: Showcase automation scripts and Terraform configurations, emphasizing your ability to reduce manual intervention and improve system consistency and reliability.
  • Incident & Problem Management: Include real-life examples of incident management, problem-solving, and process improvement initiatives to demonstrate your strong decision-making and communication skills.

📌 Application Steps

To apply for this Senior Cloud Operations Engineer - AWS position at Endava:

  1. Customize Your Application: Tailor your resume and cover letter to highlight your experience with AWS cloud infrastructure management, Terraform, and ITIL-based service management.
  2. Prepare for Technical Assessment: Brush up on your technical skills in AWS, Terraform, and ITIL-based service management, and practice incident management scenarios and problem-solving exercises.
  3. Research Endava: Familiarize yourself with Endava's company culture, values, and technology stack to ensure a good fit with the team.
  4. Prepare for Behavioral & Cultural Fit Assessment: Reflect on your communication skills, problem-solving abilities, and cultural fit within Endava's teams.

⚠️ Important Notice: This enhanced job description includes AI-generated insights and assumptions based on industry standards for infrastructure and operations roles. All details should be verified directly with the hiring organization before making application decisions.


Application Requirements

Candidates must have proven experience managing AWS cloud infrastructure and strong hands-on experience with Terraform. Familiarity with ITIL-based service management and scripting skills in at least one language is essential.