Senior Cloud Operations Engineer - AWS
π Job Overview
- Job Title: Senior Cloud Operations Engineer - AWS
- Company: Endava
- Location: Bucharest, BucureΕti, Romania & Cluj-Napoca, Cluj, Romania
- Job Type: Full-time, Hybrid
- Category: DevOps Engineer, System Administrator, Web Infrastructure
- Date Posted: June 19, 2025
- Experience Level: 5-10 years
- Remote Status: On-site/Hybrid
π Role Summary
-
Key Responsibilities: Manage and maintain AWS cloud infrastructure, ensuring high availability, performance, and reliability. Collaborate with development teams to streamline deployments and operational processes. Build and manage cloud environments using Infrastructure as Code (Terraform). Implement automation for operational tasks and ensure compliance with security, governance, and operational standards.
-
Key Technologies: AWS, Terraform, Infrastructure as Code, ITIL, Incident Management, Request Fulfillment, Problem Management, Scripting, CI/CD, Configuration Management, Networking, Security, Monitoring, Observability Tools
π» Primary Responsibilities
-
Cloud Infrastructure Management: Proactively monitor, maintain, and improve system availability, performance, and reliability. Ensure infrastructure availability for business-critical services in a 24/7 support model.
-
Incident, Request, and Problem Management: Handle incidents, service requests, and identify root causes of recurring issues following ITIL standards. Implement permanent fixes to prevent reoccurrence.
-
Infrastructure as Code (IaC) and Automation: Build and manage AWS cloud environments using Terraform. Implement automation for operational tasks to reduce manual intervention and improve system consistency and reliability.
-
Collaboration and Documentation: Collaborate with development and application teams to streamline deployments and operational processes. Maintain detailed documentation of infrastructure configurations, operating procedures, and troubleshooting guides.
-
Capacity Planning, Cost Optimization, and Performance Tuning: Participate in capacity planning, cost optimization, and performance tuning activities. Ensure compliance with security, governance, and operational standards across all environments.
-
Change Management: Support change management processes, ensuring safe and auditable infrastructure changes.
π Enhancement Note: This role requires a strong focus on operational excellence, with a commitment to 24/7 support coverage. Candidates should be comfortable working in fast-paced environments and have a solid grasp of ITIL-based service management.
π Skills & Qualifications
Education: Bachelor's degree in Computer Science, IT, or a related field. Relevant certifications such as AWS and ITIL are a plus.
Experience: Proven experience managing AWS cloud infrastructure (5-10 years). Strong hands-on experience with IaC, Terraform, and working knowledge of incident, request, and problem management.
Required Skills:
- Proven experience managing AWS cloud infrastructure
- Strong hands-on experience with IaC, Terraform
- Working knowledge and experience in incident, request, and problem management
- Experience supporting and operating production infrastructure in a 24/7 environment
- Strong scripting skills in at least one language (e.g., Bash, Python, PowerShell)
- Familiarity with CI/CD pipelines, configuration management tools, and automation frameworks
- Understanding of networking, security groups, load balancers, IAM policies, and monitoring in cloud environments
- Strong communication and collaboration skills with the ability to interface effectively with technical and non-technical stakeholders
Preferred Skills:
- Exposure to observability tools such as Splunk, CloudWatch, Datadog, or similar
- AWS certifications and ITIL certifications
π Enhancement Note: Candidates should have a strong technical acumen in AWS and Terraform, with a passion for automation and a solid grasp of ITIL-based service management. Experience supporting and operating production infrastructure in a 24/7 environment is essential.
π Web Portfolio & Project Requirements
Portfolio Essentials:
- Demonstrate experience managing AWS cloud infrastructure with case studies showcasing incident resolution, request fulfillment, and problem management.
- Highlight automation projects using Terraform and other tools to improve system consistency and reliability.
- Showcase your understanding of networking, security, and monitoring in cloud environments with relevant examples.
Technical Documentation:
- Provide documentation of infrastructure configurations, operating procedures, and troubleshooting guides for your AWS cloud projects.
- Include examples of capacity planning, cost optimization, and performance tuning activities.
- Demonstrate your ability to maintain detailed and up-to-date documentation following ITIL standards.
π Enhancement Note: Candidates should be prepared to discuss their portfolio projects in the context of AWS cloud infrastructure management, incident resolution, and automation. Emphasize your ability to maintain detailed documentation and follow ITIL standards.
π΅ Compensation & Benefits
Salary Range: The salary range for this role in Romania is approximately 35,000 - 50,000 RON per year (gross), depending on experience and qualifications. This estimate is based on market research and regional salary standards for senior cloud operations engineers with AWS experience.
Benefits:
- Competitive salary package
- Share plan
- Company performance bonuses
- Value-based recognition awards
- Referral bonus
- Career coaching
- Global career opportunities
- Non-linear career paths
- Internal development programmes
- Training
- Certifications
- Coaching
- Online learning platforms subscriptions
- Work-life balance (hybrid work and flexible working hours)
- Employee assistance programme
- Global internal wellbeing programme
- Access to wellbeing apps
- Global internal tech communities
- Hobby clubs and interest groups
- Inclusion and diversity programmes
- Events and celebrations
Working Hours: Full-time, 40 hours per week, with a commitment to 24/7 support coverage and participation in on-call rotations.
π Enhancement Note: The salary range provided is an estimate based on market research and regional salary standards. Candidates should research salary data for senior cloud operations engineers with AWS experience in Romania to validate the estimate.
π― Team & Company Context
Company Culture:
- Industry: Endava is a global technology company focused on delivering innovative solutions for clients in various industries, including financial services, insurance, media, and technology.
- Company Size: Endava has over 10,000 employees worldwide, providing a large and diverse team environment.
- Founded: Endava was founded in 2000, with a strong focus on technology and innovation.
Team Structure:
- The cloud operations team consists of experienced engineers responsible for managing and maintaining AWS cloud infrastructure.
- The team follows a matrix structure, collaborating with development, application, and other technical teams to ensure high availability, performance, and reliability.
- The team works closely with the company's global network of offices, providing 24/7 support coverage.
Development Methodology:
- Endava follows Agile methodologies, with a focus on iterative development and continuous improvement.
- The company emphasizes collaboration, knowledge sharing, and a culture of learning and development.
- Endava encourages employees to stay up-to-date with emerging technologies and industry best practices.
Company Website: Endava
π Enhancement Note: Endava's culture emphasizes collaboration, knowledge sharing, and a focus on technology and innovation. Candidates should be comfortable working in a large, global team environment and be open to learning and adapting to new technologies and methodologies.
π Career & Growth Analysis
Web Technology Career Level: Senior Cloud Operations Engineer - AWS
- Role Scope: Responsible for managing and maintaining AWS cloud infrastructure, ensuring high availability, performance, and reliability. Collaborate with development teams to streamline deployments and operational processes. Build and manage cloud environments using Infrastructure as Code (Terraform). Implement automation for operational tasks and ensure compliance with security, governance, and operational standards.
- Reporting Structure: Reports directly to the Cloud Operations Manager or a similar role within the organization.
- Technical Impact: Directly impacts the performance, availability, and reliability of AWS cloud infrastructure, ensuring business-critical services remain operational and performant.
Growth Opportunities:
- Technical Growth: Deepen expertise in AWS cloud infrastructure management, automation, and ITIL-based service management. Explore emerging technologies and tools to improve system performance, reliability, and security.
- Leadership Development: Develop leadership skills through mentoring junior team members, driving team initiatives, and contributing to the company's knowledge-sharing culture.
- Architecture and Design: Expand your role to include infrastructure architecture and design, contributing to the development of scalable, secure, and high-performing cloud environments.
π Enhancement Note: This role offers significant opportunities for technical growth and leadership development. Candidates should be eager to learn, adapt, and contribute to the company's knowledge-sharing culture.
π Work Environment
Office Type: Endava's offices are modern, collaborative workspaces designed to foster innovation and knowledge sharing. The company encourages a hybrid work arrangement, balancing on-site collaboration with remote work flexibility.
Office Location(s): Bucharest, Romania (Headquarters) & Cluj-Napoca, Romania
Workspace Context:
- Collaborative Environment: Endava's offices feature open-plan workspaces, meeting rooms, and breakout areas designed to encourage collaboration and communication among team members.
- Development Tools: Endava provides access to industry-standard development tools, multiple monitors, and testing devices to support the work of cloud operations engineers.
- Cross-Functional Collaboration: Cloud operations engineers at Endava work closely with development, application, and other technical teams to ensure high availability, performance, and reliability of AWS cloud infrastructure.
Work Schedule: Full-time, 40 hours per week, with a commitment to 24/7 support coverage and participation in on-call rotations. The company offers flexible working hours and a hybrid work arrangement to balance work-life needs.
π Enhancement Note: Endava's work environment encourages collaboration, knowledge sharing, and a focus on technology and innovation. Candidates should be comfortable working in a modern, open-plan office and be open to learning and adapting to new technologies and methodologies.
π Application & Technical Interview Process
Interview Process:
- Technical Assessment: A hands-on technical assessment focused on AWS cloud infrastructure management, incident resolution, and automation. Candidates will be expected to demonstrate their ability to manage AWS cloud environments, troubleshoot incidents, and implement automation solutions using Terraform and other tools.
- Behavioral Interview: A structured behavioral interview focused on problem-solving, communication, and collaboration skills. Candidates will be asked to discuss their approach to incident management, request fulfillment, and problem management following ITIL standards.
- Team Fit Interview: A team fit interview focused on cultural alignment, knowledge sharing, and collaboration. Candidates will meet with members of the cloud operations team to discuss their approach to work, learning, and development.
- Final Evaluation: A final evaluation based on the candidate's performance in the technical assessment, behavioral interview, and team fit interview. Candidates may be asked to provide additional information or clarify their responses from previous interviews.
Portfolio Review Tips:
- Highlight your experience managing AWS cloud infrastructure with case studies showcasing incident resolution, request fulfillment, and problem management.
- Demonstrate your ability to automate operational tasks using Terraform and other tools to improve system consistency and reliability.
- Showcase your understanding of networking, security, and monitoring in cloud environments with relevant examples.
- Emphasize your ability to maintain detailed and up-to-date documentation following ITIL standards.
Technical Challenge Preparation:
- Brush up on your AWS cloud infrastructure management skills, focusing on incident resolution, request fulfillment, and problem management.
- Familiarize yourself with Terraform and other automation tools used for managing AWS cloud environments.
- Prepare for behavioral interview questions focused on problem-solving, communication, and collaboration skills. Practice discussing your approach to incident management, request fulfillment, and problem management following ITIL standards.
ATS Keywords: AWS, Terraform, Infrastructure as Code, ITIL, Incident Management, Request Fulfillment, Problem Management, Scripting, CI/CD, Configuration Management, Networking, Security, Monitoring, Observability Tools, Cloud Operations, Senior Cloud Operations Engineer, AWS Certified, ITIL Certified
π Enhancement Note: Endava's interview process focuses on technical competency, problem-solving skills, and cultural fit. Candidates should be prepared to discuss their approach to AWS cloud infrastructure management, incident resolution, and automation in detail.
π Technology Stack & Web Infrastructure
Frontend Technologies: Not applicable for this role.
Backend & Server Technologies:
- AWS: Endava's cloud operations engineers manage and maintain AWS cloud infrastructure, ensuring high availability, performance, and reliability.
- Terraform: Endava uses Terraform for Infrastructure as Code (IaC) to build and manage AWS cloud environments. Candidates should have strong hands-on experience with Terraform.
Development & DevOps Tools:
- ITIL: Endava follows ITIL-based service management for incident, request, and problem management. Candidates should have working knowledge and experience in ITIL-based service management.
- Splunk, CloudWatch, Datadog: Endava uses observability tools such as Splunk, CloudWatch, and Datadog for monitoring AWS cloud infrastructure. Candidates should have exposure to at least one of these tools.
π Enhancement Note: Endava's technology stack focuses on AWS cloud infrastructure management, with a strong emphasis on Infrastructure as Code (IaC) using Terraform. Candidates should have proven experience with AWS and Terraform, with a solid grasp of ITIL-based service management.
π₯ Team Culture & Values
Web Development Values:
- Innovation: Endava encourages a culture of innovation and continuous learning, with a focus on staying up-to-date with emerging technologies and industry best practices.
- Collaboration: Endava emphasizes collaboration and knowledge sharing, with a focus on working together to deliver high-quality solutions for clients.
- Quality: Endava is committed to delivering high-quality solutions that meet or exceed client expectations.
- Responsibility: Endava expects its employees to take responsibility for their work and contribute to the company's success.
Collaboration Style:
- Cross-Functional Integration: Endava's cloud operations engineers work closely with development, application, and other technical teams to ensure high availability, performance, and reliability of AWS cloud infrastructure.
- Code Review Culture: Endava encourages a culture of code review and peer programming to ensure high-quality solutions and knowledge sharing.
- Knowledge Sharing: Endava fosters a culture of knowledge sharing, with a focus on learning from and teaching others.
π Enhancement Note: Endava's culture emphasizes innovation, collaboration, and continuous learning. Candidates should be comfortable working in a collaborative, knowledge-sharing environment and be open to learning and adapting to new technologies and methodologies.
β‘ Challenges & Growth Opportunities
Technical Challenges:
- Incident Resolution: Manage and resolve incidents affecting AWS cloud infrastructure, ensuring minimal business impact and rapid recovery.
- Automation: Implement automation solutions using Terraform and other tools to improve system consistency, reliability, and efficiency.
- Performance Optimization: Identify and address performance bottlenecks in AWS cloud infrastructure, ensuring optimal resource utilization and cost efficiency.
- Scalability: Design and implement scalable AWS cloud infrastructure solutions that can adapt to changing business demands and requirements.
Learning & Development Opportunities:
- Technical Skill Development: Deepen your expertise in AWS cloud infrastructure management, automation, and ITIL-based service management. Explore emerging technologies and tools to improve system performance, reliability, and security.
- Leadership Development: Develop leadership skills through mentoring junior team members, driving team initiatives, and contributing to the company's knowledge-sharing culture.
- Architecture and Design: Expand your role to include infrastructure architecture and design, contributing to the development of scalable, secure, and high-performing cloud environments.
π Enhancement Note: Endava offers significant opportunities for technical growth and leadership development. Candidates should be eager to learn, adapt, and contribute to the company's knowledge-sharing culture.
π‘ Interview Preparation
Technical Questions:
- AWS Cloud Infrastructure Management: Describe your experience managing AWS cloud infrastructure, focusing on incident resolution, request fulfillment, and problem management. Provide examples of challenges you've faced and how you've overcome them.
- Terraform: Explain your experience with Terraform and how you've used it to automate operational tasks and improve system consistency and reliability. Provide examples of Terraform projects you've worked on.
- ITIL-Based Service Management: Discuss your working knowledge and experience in ITIL-based service management, focusing on incident, request, and problem management. Provide examples of how you've applied ITIL principles in your previous roles.
Company & Culture Questions:
- Endava's Culture: Describe what you understand about Endava's culture and how you think you would fit in. Discuss your approach to collaboration, knowledge sharing, and continuous learning.
- AWS Cloud Infrastructure Challenges: Describe how you would approach managing and optimizing AWS cloud infrastructure in a large, global organization like Endava. Discuss your strategies for incident resolution, automation, and performance optimization.
- Team Fit: Explain how you would contribute to Endava's team culture, focusing on collaboration, knowledge sharing, and driving team initiatives. Discuss your approach to mentoring junior team members and contributing to the company's knowledge-sharing culture.
Portfolio Presentation Strategy:
- Incident Resolution Case Studies: Highlight your experience managing and resolving incidents affecting AWS cloud infrastructure. Provide detailed examples of challenges you've faced and how you've overcome them.
- Terraform Projects: Showcase your Terraform projects, focusing on automation solutions that improve system consistency, reliability, and efficiency. Explain your approach to Terraform configuration and best practices.
- ITIL-Based Service Management: Demonstrate your understanding of ITIL-based service management, focusing on incident, request, and problem management. Provide examples of how you've applied ITIL principles in your previous roles.
π Enhancement Note: Endava's interview process focuses on technical competency, problem-solving skills, and cultural fit. Candidates should be prepared to discuss their approach to AWS cloud infrastructure management, incident resolution, and automation in detail.
π Application Steps
To apply for this Senior Cloud Operations Engineer - AWS position at Endava, follow these steps:
- Update Your Portfolio: Highlight your experience managing AWS cloud infrastructure, focusing on incident resolution, request fulfillment, and problem management. Include Terraform projects that demonstrate your automation skills and understanding of ITIL-based service management.
- Tailor Your Resume: Emphasize your relevant experience and skills in AWS cloud infrastructure management, Terraform, and ITIL-based service management. Include specific examples of your achievements and the impact you've made in previous roles.
- Prepare for Technical Assessment: Brush up on your AWS cloud infrastructure management skills, focusing on incident resolution, request fulfillment, and problem management. Familiarize yourself with Terraform and other automation tools used for managing AWS cloud environments.
- Research Endava: Learn about Endava's culture, values, and approach to technology and innovation. Prepare for behavioral interview questions focused on problem-solving, communication, and collaboration skills. Practice discussing your approach to incident management, request fulfillment, and problem management following ITIL standards.
- Apply: Submit your application through the application link provided, including your resume and portfolio.
β οΈ Important Notice: This enhanced job description includes AI-generated insights and web development/server administration industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.
Application Requirements
Candidates must have proven experience with AWS and Terraform, along with strong scripting skills. Familiarity with ITIL-based service management and experience in a 24/7 operational environment are essential.